Input data for Use Case 3-3
As for the previous Use Cases, the first step is to collect all input data in an history that we will name Input data for Use Case 3-3.
- Create a new history
- Rename this history
Input data for Use Case 3-1 - Using the tool
Extract reads in FASTQ/A format from NCBI SRA, we are going to upload 63 paired end datasets. As explained in the previous chapter, theExtract reads in FASTQ/A format from NCBI SRAdirectly merges paired-end sequence data in a single file. As for the previousUse Case 3-2, collecting these 63 datasets is tedious. But the good news is that we are going to use repeatedly the sameExtract reads in FASTQ/A format from NCBI SRAtool and we only need in addition to change 63 times the datatype of these datasets fromfastqtofastqsanger.
So, here are the SSR ids to enter in the Extract reads in FASTQ/A format from NCBI SRA tool form (leaving fastq as an output format):
For Ebola virus samples:
SRR1613381
SRR1613377
SRR1613382
SRR1613378
SRR1613383
SRR1613379
SRR1613384
SRR1613380
When you have finishid with the 8 datasets, make sure to change their datatype from fastqto fastqsanger, and create a dataset collection (as explained in the previous chapter) of these 8 datasets that you will name Ebola virus.
For Lassa virus samples:
SRR1595772
SRR1595696
SRR1595665
SRR1595500
SRR1594619
SRR1595943
SRR1595673
SRR1595797
SRR1595763
SRR1595558
SRR1594664
SRR1595909
SRR1594651
SRR1595835
SRR1594698
SRR1613388
SRR1613389
SRR1613390
SRR1613391
SRR1613392
SRR1613393
SRR1613394
SRR1613395
SRR1613396
SRR1613397
SRR1613398
SRR1613399
SRR1595853
SRR1606288
SRR1613412
SRR1613403
SRR1606277
SRR1613386
SRR1613387
SRR1606267
SRR1614275
SRR1610580
SRR1595846
SRR1594606
SRR1606236
SRR1594723
SRR1594671
SRR1613414
SRR1613400
SRR1613401
SRR1613404
SRR1613402
SRR1613405
SRR1613407
SRR1613408
SRR1613409
SRR1613410
SRR1613406
SRR1613411
SRR1613413
When you have finishid with the 55 datasets, make sure to change their datatype from fastqto fastqsanger, and create a dataset collection (as explained in the previous chapter) of these 8 datasets that you will name Lassa virus.
- Copy the
vir1 nucleotide BLAST databasefrom theReferenceshistory to the current historyInput data for Use Case 3-3.
History for Use Case 3-3 / Ebola virus
- Stay in the history
Input data for Use Case 3-3 - pick the workflow
Metavisitor: Workflow for Use Case 3-3in the workflows menu. It is possible that the workflow manager complains about settings for the Trinity tool that is used in this workflow. This is a minor issue if happens: just edit the workflow, click on the toolTrinityand specify the number of processors accordingly to your computing infrastructure. Save the workflow and select therunoption. - Before Step 1, you have to specify some parameters at run time. For Ebola virus, the field
target_virushas to be filled withEbolaand the fieldreference_virushas to be filled withNC_002549.1(as a guide for reconstruction of the Ebola virus genome). - For Step 1, select
Ebola virus. - For Step 2, select the
nucleotide vir1 blast database(this should also be already selected) - As usual, check the box
Send results to a new history, edit the name of the new history toUse Case 3-3 Ebola virus, andExecutethe workflow ! Note, that for complex workflows with dataset collections in input, the actual warning that the workflow is started make take time to show up; you can even have a "504 Gateway Time-out" warning. This is not a serious issue: just go in yourUser->Saved historymenu, you will see yourUse Case 3-3 Ebola virushistory running and you will be able to access it.
The workflow for Use Case 3-3 may take a long time. Be patient.
History for Use Case 3-3 / Lassa virus, segment L
- Stay in the history
Input data for Use Case 3-3 - pick the workflow
Metavisitor: Workflow for Use Case 3-3in the workflows menu. It is possible that the workflow manager complains about settings for the Trinity tool that is used in this workflow. This is a minor issue if happens: just edit the workflow, click on the toolTrinityand specify the number of processors accordingly to your computing infrastructure. Save the workflow and select therunoption. - Before Step 1, you have to specify some parameters at run time. For Lassa virus, the field
target_virushas to be filled withLassaand the fieldreference_virushas to be filled withNC_004297.1(as a guide for reconstruction of the segment L of the Lassa virus genome). - For Step 1, select
Lassa virus. - For Step 2, select the
nucleotide vir1 blast database(this should also be already selected) - As usual, check the box
Send results to a new history, edit the name of the new history toUse Case 3-3 Lassa virus segment L, andExecutethe workflow ! Note, that for complex workflows with dataset collections in input, the actual warning that the workflow is started make take time to show up; you can even have a "504 Gateway Time-out" warning. This is not a serious issue: just go in yourUser->Saved historymenu, you will see yourUse Case 3-3 Lassa virus segment Lhistory running and you will be able to access it.
The workflow for Use Case 3-3 may take a long time. Be patient.
History for Use Case 3-3 / Lassa virus, segment S
- Stay in the history
Input data for Use Case 3-3 - pick the workflow
Metavisitor: Workflow for Use Case 3-3in the workflows menu. It is possible that the workflow manager complains about settings for the Trinity tool that is used in this workflow. This is a minor issue if happens: just edit the workflow, click on the toolTrinityand specify the number of processors accordingly to your computing infrastructure. Save the workflow and select therunoption. - Before Step 1, you have to specify some parameters at run time. For Lassa virus, the field
target_virushas to be filled withLassaand the fieldreference_virushas to be filled withNC_004296.1(as a guide for reconstruction of the segment S of the Lassa virus genome). - For Step 1, select
Lassa virus(this should be already selected). - For Step 2, select the
nucleotide vir1 blast database(this should also be already selected) - As usual, check the box
Send results to a new history, edit the name of the new history toUse Case 3-3 Lassa virus segment S, andExecutethe workflow ! Note, that for complex workflows with dataset collections in input, the actual warning that the workflow is started make take time to show up; you can even have a "504 Gateway Time-out" warning. This is not a serious issue: just go in yourUser->Saved historymenu, you will see yourUse Case 3-3 Lassa virus segment Shistory running and you will be able to access it.
The workflow for Use Case 3-3 may take a long time. Be patient.