.. |PipeCraft2_logo| image:: _static/PipeCraft2_icon_v2.png :width: 50 :target: https://github.com/pipecraft2/user_guide .. |PipeCraft_sticker1sideways| image:: _static/PipeCraft_sticker1sideways.png :width: 350 :class: right :target: https://github.com/pipecraft2/user_guide .. |main_interface| image:: _static/main_interface.png :width: 2000 .. raw:: html .. role:: red .. meta:: :description lang=en: PipeCraft manual. PipeCraft is a Graphical User Interface software for metabarcoding data analyses ============================ PipeCraft2 |PipeCraft2_logo| ============================ **PipeCraft2** is a Graphical User Interface (GUI) software that implements :ref:`various popular tools ` for **metabarcoding** data analyses. Implements various :ref:`ready-to-run (pre-defined) pipelines ` as well as an option to run a variety of :ref:`individual steps ` outside of a full-pipeline. .. _interface: *(click on the image for enlargement)* |main_interface| | Software settings for pipeline processes contain key options for metabarcoding sequence data analyses, but all options of any implemented program may be accessed via :ref:`PipeCraft console (command line) `. | Default settings in the panels represent commonly used options for amplicon sequence data analyses, which may be tailored according to user experience or needs. Custom-designed pipeline settings can be saved, and thus the exact same pipeline may be easily re-run on other sequencing data (and for reproducibility, may be used as a supplement material in the manuscript). PipeCraft enables executing the :ref:`full pipeline ` (user specifies the input, and output will be e.g. OTU/ASV table with taxonomic annotations of the generated features), but supports also :ref:`single-step mode (Quick Tools panel) ` where analyses may be performed in a step-by-step manner *(e.g. perform quality filtering, then examine the output and decide whether to adjust the quality filtering options of to proceed with next step, e.g. with chimera filtering step)*. Glossary ======== List of terms that you may encounter in this user guide. =========================== =================================== **working directory** | the directory (folder) that contains the files for the analyses. | The outputs will be written into this directory **paired-end data** | obtained by sequencing two ends of the same DNA fragment, | which results in read 1 (R1) and read 2 (R2) files per library or per sample. | Note that PipeCraft expects that :red:`read 1 file contains the string R1` | and :red:`read 2 contains R2` | (not e.g. my_sample_L001_1.fastq / my_sample_L001_2.fastq) **single-end data** | only one sequencing file per library or per sample. | Herein, may mean also assembled paired-end data. **demultiplexed data** | sequences are sorted into separate files, representing individual samples **multiplexed data** | file(s) that represent a pool of sequences from different samples **read/sequence** | DNA sequence; herein, reads and sequences are used interchangeably =========================== =================================== ____________________________________________________ Docker images ============== .. |pulling_image| image:: _static/pulling_image.png :width: 280 All the processes are run through `docker `_, where the PipeCraft's GUI simply mediates the information exchange. Therefore, whenever a process is initiated for the **first time**, a relevant Docker image (contains required software for the analyses step) will be pulled from `Docker Hub `_. Initial PipeCraft2 installation does not contain any software for sequence data processing. Example: when running DEMULTIPLEXING for the first time |pulling_image| Thus working **Internet connection** is initially required. Once the Docker images are pulled, PipeCraft2 can work without an Internet connection. :ref:`Docker images ` vary in size, and the speed of the first process is extended by the docker image download time. ____________________________________________________ .. _tools: Currently implemented software ------------------------------ :ref:`See software version on the 'Releases' page ` ======================================================================= ========================================================================================= ============= Software Reference Task ======================================================================= ========================================================================================= ============= `docker `_ https://www.docker.com building, sharing and running applications `DADA2 `_ `Callahan et. al 2016 `_ full pipeline operations `vsearch `_ `Rognes et. al 2016 `_ quality filtering, assemble paired-end reads, chimera filtering, clustering `NextITS `_ `Mikryukov et. al `_ pipeline for fungal full-ITS (PacBio); not available in Mac version of PipeCraft2 `trimmomatic `_ `Bolger et al. 2014 `_ quality filtering `fastp `_ `Chen et al. 2018 `_ quality filtering `seqkit `_ `Shen et al. 2016 `_ multiple sequence manipulation operations `cutadapt `_ `Martin 2011 `_ demultiplexing, cut primers `biopython `_ `Cock et al. 2009 `_ multiple sequence manipulation operations `GNU Parallel `_ `Tangle 2021 `_ executing jobs in parallel `mothur `_ `Schloss et al. 2009 `_ submodule in ITSx to make unique and deunique seqs `ITS Extractor `_ `Bengtsson-Palme et al. 2013 `_ extract ITS regions `fqgrep `_ `Indraniel Das 2011 `_ core for reorient reads `BLAST `_ `Camacho et al. 2009 `_ assign taxonomy RDP classifier `Wang et al. 2007 `_ assign taxonomy ORFfinder `NCBI Tool `_ finding open reading frames of protein coding genes (filtering pseudogenes/off-targets) HMMER `Web site `_ HMM based filtering if the sequences (filtering pseudogenes/off-targets) `FastQC `_ `Andrews 2019 `_ QualityCheck module `MultiQC `_ `Ewels et al. 2016 `_ QualityCheck module `LULU `_ `Frøslev et al. 2017 `_ post-clustering curation `DEICODE `_ `Martino et al. 2019 `_ dissimilarity analysis ======================================================================= ========================================================================================= ============= Let us know if you would like to have a specific software implemeted to PipeCraft (:ref:`contacts `) or create an issue in the `main repository `_. ____________________________________________________ Contents of this user guide --------------------------- .. toctree:: :maxdepth: 1 installation quickstart pre-defined_pipelines quicktools postprocessing example_analyses troubleshoot licence citation releases docker_images contact for_developers |PipeCraft_sticker1sideways|