-
Notifications
You must be signed in to change notification settings - Fork 298
Pull requests: datajuicer/data-juicer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] feat: add S3 download and upload mappers for distributed processing
#839
opened Dec 4, 2025 by
kyo-tom
Loading…
Refine and merge for PR568
dj:tools
issues/PRs about specific tools
#836
opened Dec 2, 2025 by
HYLcool
Loading…
New Ops: add pipeline op type and support ray vllm engine
#835
opened Dec 2, 2025 by
Cathy0908
Loading…
New Optical Flow OP & Allow to save the computed optical flows
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
#824
opened Nov 19, 2025 by
HYLcool
Loading…
Add Operator-Level Parallel Data Processing with Ray Actors
dj:dist
issues/PRs about distributed data processing
dj:efficiency
regarding to efficiency issues and enhancements
enhancement
New feature or request
#761
opened Aug 19, 2025 by
Cccccc0630
Loading…
[NewOp] Add generate_challenging_qa_mapper based on MindGYM principles
#703
opened Jun 14, 2025 by
Bat-Reality
Loading…
[WIP] Optimization framework
dj:core
issues/PRs about the core functions of Data-Juicer
dj:efficiency
regarding to efficiency issues and enhancements
#702
opened Jun 13, 2025 by
cyruszhang
Loading…
[NewOp] Add domain_diversity_selector based on DaaR principles
#699
opened Jun 12, 2025 by
lingzhq
Loading…
Add humanvbench operators
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
good first issue
Good for newcomers
#553
opened Jan 17, 2025 by
SYSUzhouting
Loading…
ProTip!
Adding no:label will show everything without a label.