Skip to content

Pull requests: NVIDIA-NeMo/Curator

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: infer fanout stages from process annotations community-request
#2157 opened Jul 2, 2026 by nightcityblade Contributor Loading…
2 of 3 tasks
fix: validate pyarrow columns in stage input checks community-request
#2156 opened Jul 2, 2026 by nightcityblade Contributor Loading…
2 of 3 tasks
Support running minhash on DocumentBatches
#2155 opened Jul 1, 2026 by ayushdg Contributor Loading…
3 tasks
[codex] Document SLURM job arrays and retries
#2154 opened Jul 1, 2026 by lbliii Contributor Loading…
[codex] Document memory-efficient semantic dedup fitting
#2149 opened Jul 1, 2026 by lbliii Contributor Loading…
[codex] Rewrite InferenceServer docs for Ray Serve and Dynamo
#2147 opened Jul 1, 2026 by lbliii Contributor Loading…
[codex] publish 26.06 release notes and migration checklist
#2143 opened Jun 30, 2026 by lbliii Contributor Loading…
[codex] Document stage worker sizing and backend overrides
#2142 opened Jun 30, 2026 by lbliii Contributor Loading…
[codex] Add Nemotron-CLIMB data-curation recipe
#2138 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: add Nemotron OCR pipeline guide
#2137 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: add long-form audio cutting guide
#2136 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: add audio tagging pipeline guide
#2135 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: document native pipeline resumability
#2134 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: add video caption evaluation guide
#2133 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: complete translation configuration reference
#2132 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: expand Nemotron-Parse tuning guidance
#2131 opened Jun 29, 2026 by lbliii Contributor Loading…
[codex] docs: clarify deduplication input discovery
#2130 opened Jun 29, 2026 by lbliii Contributor Loading… 25.11
Add Lance annotation writer stage
#2113 opened Jun 24, 2026 by VibhuJawa Contributor Draft
Add Lance writer stage
#2112 opened Jun 24, 2026 by VibhuJawa Contributor Loading…
Add Lance reader stage
#2111 opened Jun 24, 2026 by VibhuJawa Contributor Loading…
docs(fern): add local library autodocs without Fern auth
#2102 opened Jun 22, 2026 by lbliii Contributor Loading…
2 of 3 tasks
[WIP] Feat/cc lancedb pipeline
#2101 opened Jun 22, 2026 by VibhuJawa Contributor Draft
3 tasks
docs: fix tutorial doc links and add data curation challenges
#2098 opened Jun 22, 2026 by lbliii Contributor Loading…
2 tasks done
2
2
docs: expand tutorials Quick Start with docs and Core Concepts links
#2097 opened Jun 22, 2026 by lbliii Contributor Loading…
2 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.