Skip to content

Commit d1241f1

Browse files
committed
Merge branch 'release/1.2.8'
2 parents 38b6c5d + 87d86c3 commit d1241f1

File tree

5 files changed

+1208
-1
lines changed

5 files changed

+1208
-1
lines changed

merge_trim/README.md

Lines changed: 52 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,52 @@
1+
---
2+
description: Specifications for merging and max length trimming of FASTQ file.
3+
---
4+
5+
## Merge and Trim FASTQ sub-workflow specification - merge_trim_fastq.cwl
6+
7+
### Tools used:
8+
9+
- [merge_fastq](https://cmo-ci.gitbook.io/command-line-tools-cwl/merge_fastq_0.1.7/README.md)
10+
- [fastp](https://cmo-ci.gitbook.io/command-line-tools-cwl/fastp_0.24.0/README.md)
11+
12+
### Usage
13+
14+
```bash
15+
16+
usage: ./merge_trim_fastq.cwl [-h] --merge_fastq_fastq1 MERGE_FASTQ_FASTQ1 --merge_fastq_fastq2 MERGE_FASTQ_FASTQ2 [--fastp_dont_eval_duplication] [--fastp_disable_trim_poly_g] [--fastp_disable_quality_filtering]
17+
[--fastp_disable_length_filtering] [--fastp_disable_adapter_trimming] [--fastp_unpaired2_output_file_name FASTP_UNPAIRED2_OUTPUT_FILE_NAME]
18+
[--fastp_unpaired1_output_file_name FASTP_UNPAIRED1_OUTPUT_FILE_NAME] [--fastp_read2_output_file_name FASTP_READ2_OUTPUT_FILE_NAME] [--fastp_read1_output_file_name FASTP_READ1_OUTPUT_FILE_NAME]
19+
[--fastp_max_len_read2 FASTP_MAX_LEN_READ2] [--fastp_max_len_read1 FASTP_MAX_LEN_READ1]
20+
[job_order]
21+
22+
merge_trim_fastq
23+
24+
positional arguments:
25+
job_order Job input json file
26+
27+
options:
28+
-h, --help show this help message and exit
29+
--merge_fastq_fastq1 MERGE_FASTQ_FASTQ1
30+
Read 1 FASTQ Files that needs to merged, make sure to maintain the array order with Read 2 FASTQ files
31+
--merge_fastq_fastq2 MERGE_FASTQ_FASTQ2
32+
Read 2 FASTQ Files that needs to merged, make sure to maintain the array order with Read 1 FASTQ files
33+
--fastp_dont_eval_duplication
34+
Disable Evaluation of Duplication (default:True)
35+
--fastp_disable_trim_poly_g
36+
Disable Trim PolyG (default:True)
37+
--fastp_disable_quality_filtering
38+
Disable Quality Filtering (default:True)
39+
--fastp_disable_length_filtering
40+
Disable Length Trimming (default:True)
41+
--fastp_disable_adapter_trimming
42+
Disable Adapter Trimming (default:True)
43+
--fastp_unpaired2_output_file_name FASTP_UNPAIRED2_OUTPUT_FILE_NAME
44+
--fastp_unpaired1_output_file_name FASTP_UNPAIRED1_OUTPUT_FILE_NAME
45+
--fastp_read2_output_file_name FASTP_READ2_OUTPUT_FILE_NAME
46+
--fastp_read1_output_file_name FASTP_READ1_OUTPUT_FILE_NAME
47+
--fastp_max_len_read2 FASTP_MAX_LEN_READ2
48+
Max Lenght for Read 2 (default:95)
49+
--fastp_max_len_read1 FASTP_MAX_LEN_READ1
50+
Max Legth for READ 1 (default:95)
51+
52+
```

merge_trim/example_input.yaml

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,23 @@
1+
fastp_disable_trim_poly_g: null
2+
fastp_disable_adapter_trimming: null
3+
fastp_disable_length_filtering: null
4+
fastp_disable_quality_filtering: null
5+
fastp_dont_eval_duplication: null
6+
fastp_max_len_read1: null
7+
fastp_max_len_read2: null
8+
fastp_read1_output_file_name: null
9+
fastp_read2_output_file_name: null
10+
fastp_unpaired1_output_file_name: null
11+
fastp_unpaired2_output_file_name: null
12+
merge_fastq_fastq1:
13+
- class: File
14+
path: >-
15+
/path/to/fastq/L001.R1.fastq.gz
16+
- class: File
17+
path: >-
18+
/path/to/fastq/L002.R1.fastq.gz
19+
merge_fastq_fastq2:
20+
- class: File
21+
path: /path/to/fastq/L001.R2.fastq.gz
22+
- class: File
23+
path: /path/to/fastq/L002.R2.fastq.gz

merge_trim/merge_trim_fastq.cwl

Lines changed: 198 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,198 @@
1+
class: Workflow
2+
cwlVersion: v1.0
3+
id: merge_trim_fastq
4+
label: merge_trim_fastq
5+
$namespaces:
6+
s: 'https://schema.org/'
7+
sbg: 'https://www.sevenbridges.com/'
8+
inputs:
9+
- id: merge_fastq_fastq1
10+
type:
11+
type: array
12+
items: File
13+
inputBinding:
14+
prefix: '--fastq1'
15+
doc: >-
16+
Read 1 FASTQ Files that needs to merged, make sure to maintain the array
17+
order with Read 2 FASTQ files
18+
'sbg:x': 0
19+
'sbg:y': 321
20+
- id: merge_fastq_fastq2
21+
type:
22+
type: array
23+
items: File
24+
inputBinding:
25+
prefix: '--fastq2'
26+
doc: >-
27+
Read 2 FASTQ Files that needs to merged, make sure to maintain the array
28+
order with Read 1 FASTQ files
29+
'sbg:x': 0
30+
'sbg:y': 214
31+
- id: fastp_dont_eval_duplication
32+
type: boolean?
33+
doc: 'Disable Evaluation of Duplication (default:True)'
34+
'sbg:x': 327.28125
35+
'sbg:y': 214
36+
- id: fastp_disable_trim_poly_g
37+
type: boolean?
38+
doc: 'Disable Trim PolyG (default:True)'
39+
'sbg:x': 327.28125
40+
'sbg:y': 321
41+
- id: fastp_disable_quality_filtering
42+
type: boolean?
43+
doc: 'Disable Quality Filtering (default:True)'
44+
'sbg:x': 327.28125
45+
'sbg:y': 428
46+
- id: fastp_disable_length_filtering
47+
type: boolean?
48+
doc: 'Disable Length Trimming (default:True)'
49+
'sbg:x': 327.28125
50+
'sbg:y': 535
51+
- id: fastp_disable_adapter_trimming
52+
type: boolean?
53+
doc: 'Disable Adapter Trimming (default:True)'
54+
'sbg:x': 327.28125
55+
'sbg:y': 642
56+
- id: fastp_unpaired2_output_file_name
57+
type: string?
58+
'sbg:x': 0
59+
'sbg:y': 428
60+
- id: fastp_unpaired1_output_file_name
61+
type: string?
62+
'sbg:x': 0
63+
'sbg:y': 535
64+
- id: fastp_read2_output_file_name
65+
type: string?
66+
'sbg:x': 0
67+
'sbg:y': 642
68+
- id: fastp_read1_output_file_name
69+
type: string?
70+
'sbg:x': 0
71+
'sbg:y': 749
72+
- id: fastp_max_len_read2
73+
type: int?
74+
doc: 'Max Lenght for Read 2 (default:95)'
75+
'sbg:x': 0
76+
'sbg:y': 0
77+
- id: fastp_max_len_read1
78+
type: int?
79+
doc: 'Max Legth for READ 1 (default:95)'
80+
'sbg:x': 0
81+
'sbg:y': 107
82+
outputs:
83+
- id: fastp_unpaired2_output
84+
outputSource:
85+
- fastp_0_24_0/fastp_unpaired2_output
86+
type: File?
87+
'sbg:x': 1058.288818359375
88+
'sbg:y': 107
89+
- id: fastp_unpaired1_output
90+
outputSource:
91+
- fastp_0_24_0/fastp_unpaired1_output
92+
type: File?
93+
'sbg:x': 1058.288818359375
94+
'sbg:y': 214
95+
- id: fastp_read2_output
96+
outputSource:
97+
- fastp_0_24_0/fastp_read2_output
98+
type: File?
99+
'sbg:x': 1058.288818359375
100+
'sbg:y': 321
101+
- id: fastp_read1_output
102+
outputSource:
103+
- fastp_0_24_0/fastp_read1_output
104+
type: File
105+
'sbg:x': 1058.288818359375
106+
'sbg:y': 428
107+
- id: fastp_json_output
108+
outputSource:
109+
- fastp_0_24_0/fastp_json_output
110+
type: File
111+
'sbg:x': 1058.288818359375
112+
'sbg:y': 535
113+
- id: fastp_html_output
114+
outputSource:
115+
- fastp_0_24_0/fastp_html_output
116+
type: File
117+
'sbg:x': 1058.288818359375
118+
'sbg:y': 642
119+
steps:
120+
- id: merge_fastq_0_1_7
121+
in:
122+
- id: fastq1
123+
source:
124+
- merge_fastq_fastq1
125+
- id: fastq2
126+
source:
127+
- merge_fastq_fastq2
128+
out:
129+
- id: merge_fastq_1
130+
- id: merge_fastq_2
131+
run: ../command_line_tools/merge_fastq_0.1.7/merge_fastq_0.1.7.cwl
132+
'sbg:x': 327.28125
133+
'sbg:y': 100
134+
- id: fastp_0_24_0
135+
in:
136+
- id: read1_input
137+
source: merge_fastq_0_1_7/merge_fastq_1
138+
- id: read1_output_path
139+
default: trimmed_R1.fastq.gz
140+
source: fastp_read1_output_file_name
141+
- id: read2_input
142+
source: merge_fastq_0_1_7/merge_fastq_2
143+
- id: read2_output_path
144+
default: trimmed_R2.fastq.gz
145+
source: fastp_read2_output_file_name
146+
- id: unpaired1_path
147+
default: unpaired_R1.fastq.gz
148+
source: fastp_unpaired1_output_file_name
149+
- id: unpaired2_path
150+
default: unpaired_R2.fastq.gz
151+
source: fastp_unpaired2_output_file_name
152+
- id: max_len_read1
153+
default: 95
154+
source: fastp_max_len_read1
155+
- id: max_len_read2
156+
default: 95
157+
source: fastp_max_len_read2
158+
- id: disable_quality_filtering
159+
default: true
160+
source: fastp_disable_quality_filtering
161+
- id: disable_trim_poly_g
162+
default: true
163+
source: fastp_disable_trim_poly_g
164+
- id: disable_adapter_trimming
165+
default: true
166+
source: fastp_disable_adapter_trimming
167+
- id: dont_eval_duplication
168+
default: true
169+
source: fastp_dont_eval_duplication
170+
- id: disable_length_filtering
171+
default: true
172+
source: fastp_disable_length_filtering
173+
out:
174+
- id: fastp_json_output
175+
- id: fastp_html_output
176+
- id: fastp_read1_output
177+
- id: fastp_read2_output
178+
- id: fastp_unpaired1_output
179+
- id: fastp_unpaired2_output
180+
run: ../command_line_tools/fastp_0.24.0/fastp_0.24.0.cwl
181+
label: fastp_0.24.0
182+
'sbg:x': 583.671875
183+
'sbg:y': 290.5
184+
requirements: []
185+
$schemas:
186+
- 'http://schema.org/version/latest/schemaorg-current-http.rdf'
187+
's:author':
188+
- class: 's:Person'
189+
's:email': 'mailto:shahr2@mskcc.org'
190+
's:name': Ronak Shah
191+
's:citation': ''
192+
's:codeRepository': 'https://github.com/msk-access/cwl_subworkflows/merge_trim'
193+
's:contributor':
194+
- class: 's:Person'
195+
's:email': 'mailto:shahr2@mskcc.org'
196+
's:name': Ronak Shah
197+
's:dateCreated': '2025-03-24'
198+
's:license': 'https://spdx.org/licenses/Apache-2.0'

0 commit comments

Comments
 (0)