.. |PipeCraft2_logo| image:: _static/PipeCraft2_icon_v2.png
:width: 50
:alt: Alternative text
:target: https://github.com/pipecraft2/user_guide
.. |main_interface| image:: _static/main_interface.png
:width: 2000
:alt: Alternative text
.. |select_pipeline_or_quicktools| image:: _static/select_pipeline_or_quicktools.png
:width: 1000
:alt: Alternative text
.. |select_wd| image:: _static/select_wd.png
:width: 1000
:alt: Alternative text
.. meta::
:description lang=en:
PipeCraft2 manual. User guide for PipeCraft2
.. raw:: html
.. role:: red
.. _quickstart:
============================
QuickStart |PipeCraft2_logo|
============================
Required data formats
=====================
There are few specifc requirements for the input data for PipeCraft.
- **Paired-end data**
* :red:`must contain **R1** and **R2** strings in the paired-end files`,
* :red:`sample names maynot contain 'R1/R2' strings`
+ **OK file names**: ``my_sample_01_R1_L001.fastq`` and ``my_sample_01_R2_L001.fastq``
+ **NOT-OK** file names: ``my_R1sample_01_R1_L001.fastq`` and ``my_R1sample_01_R2_L001.fastq`` or ``my_sample_01_1.fastq`` and ``my_sample_01_2.fastq``
- **index/barcodes file** for demultiplexing:
* :ref:`see formatting requirements here `
- specific directory structure of NextITS pipeline
* :ref:`see NextITS page here `
____________________________________________________
How to START
============
1. To ``START`` any analyses, you must specify the working directory (WORKDIR) by pressing the ``SELECT WORKDIR`` button. E.g., if working with **fastq** files,
then be sure that the working directory contains **only relevant fastq files** because the selected process will be
applied to all fastq files in the working directory!
.. note::
When using Windows OS, the selection window might not display the files while browsing through the directories.
After selecting a working directory, PipeCraft needs you to specify if
* if the data is paired-end or single-end
* and the extension of the data (fastq or fasta)
| ``paired-end data`` --> such as data from Illumina or MGI-Tech platforms (R1 and R2 files). :red:`Be sure to have **R1** and **R2** strings in the paired-end files (not simply _1 and _2; and sample names maynot contain R1/R2 strings)`
| ``single-end data`` --> such as data from PacBio, or assembled paired-end data (single file per library or per sample)
|select_wd|
2. ``SELECT PIPELINE`` or press ``Quick Tools`` button
to select relevant :ref:`step `;
edit settings if needed and **start
running the analyses** by pressing the ``START`` button.
|select_pipeline_or_quicktools|
.. note::
**When running 'step-by-step analyses with Quick Tools'**: when one workflow is finished, then press ``SELECT WORKDIR`` to specify inputs for the next process to ensure the correct workflow piping.
.. warning::
The **outputs will be overwritten** if running the same
analysis step **multiple times in the same working directory**.
If needed, edit the default output directory name to prevent that.
Each process creates a separate output directory with the processed files.
The **README** file in the output directory states some of the details about the finished process.
____________________________________________________
:ref:`Ready-to-run pre-defined pipelines here `
:ref:`QuickTools page here `
____________________________________________________
.. _qualitycheck:
Quality scores and basic statistics screening of the data
==========================================================
.. |multiQC_main| image:: _static/multiQC_main.png
:width: 1000
:alt: Alternative text
.. |multiQC_1-3| image:: _static/multiQC_1-3.png
:width: 550
:alt: Alternative text
.. |multiQC_view_report| image:: _static/multiQC_view_report.png
:width: 550
:alt: Alternative text
Quality and basic statistics screening of the data can be done via ``QualityCheck`` panel.
QualityCheck panel implements `FastQC `_ and `MultiQC `_ to screen the input **fastq** files.
|multiQC_main|
|
**To start:**
1. **Select folder** (a working directory) which contains only **fastq** (fastq/fq) files that you aim to inspect.
2. Press ``CREATE REPORT`` to start MultiQC
3. "LOADING ..." will be displayed while the report is being generated
|multiQC_1-3|
|multiQC_view_report|
4. Click ``VIEW REPORT``. A html file (multiqc_report.html) will open in your default web browser.
*If the summary does not open, check your working floder for the presence of* **multiqc_report.html** *and try to open with some other web browser.*
*Something went wrong if the file multiqc_report.html* **does not exist** *(may fail when maximum number of fastq files in the folder is extremely large, >10 000).*
5. Check out `"using MultiQC reports" `_ in MultiQC web page.
.. note::
Note that '_fastqc.zip' and '_fastqc.html' are generated for each fastq file in the **'quality_check'** directory. These are summarized in **multiqc_report.html**,
so you may delete all individual '_fastqc.zip' and '_fastqc.html' files if those are of no interest.