institute of biotechnology >> brc >> bioinformatics >> internal >> biohpc cloud: user guide

BioHPC Cloud:
: User Guide


BioHPC Cloud Software

There is 677 software titles installed in BioHPC Cloud. The sofware is available on all machines (unless stated otherwise in notes), complete list of programs is below, please click on a title to see details and instructions. Tabular list of software is available here

Please read details and instructions before running any program, it may contain important information on how to properly use the software in BioHPC Cloud.

454 gsAssembler or gsMapper, a5, ABRicate, ABruijn, ABySS, AdapterRemoval, adephylo, Admixtools, Admixture, agrep, albacore, Alder, AlleleSeq, ALLMAPS, ALLPATHS-LG, AMOS, AMPHORA,, analysis, ANGSD, Annovar, antiSMASH, apollo, Arlequin, aspera, assembly-stats, atac-seq-pipeline, athena_meta, ATLAS, Atlas-Link, ATLAS_GapFill, ATSAS, Augustus, AWS command line interface, axe, BactSNP, bam2fastx, bamtools, bamUtil, Basset, BayeScan, Bayescenv, BBmap, BCFtools, bcl2fastq, BCP, Beagle, Beast2, bedops, BEDtools, bfc, bgc, bgen, bigQF, bigWig, bioawk, biobambam, Bioconductor, biom-format, BioPerl, BioPython, Birdsuite, Bismark, blasr, BLAST, blast2go, BLAT, BLUPF90, BMGE, bmtagger, Boost, Bowtie, Bowtie2, BPGA, Bracken, BRAKER, BRAT-NextGen, BreedingSchemeLanguage, breseq, brocc, BSseeker2, BUSCO, BWA, bwa-meth, cactus, canu, CAP3, CarveMe, cBar, CBSU RNAseq, CCTpack, cd-hit, CEGMA, CellRanger, cellranger-atac, centrifuge, CFSAN SNP pipeline, CheckM, chimera, chromosomer, Circlator, Circos, Circuitscape, CLUMPP, Clustal Omega, CLUSTALW, Cluster, cmake, CNVnator, compat, CONCOCT, Conda, copyNumberDiff, cortex_var, CRISPRCasFinder, CRISPResso, CrossMap, CRT, cuda, Cufflinks, cutadapt, dadi, dadi-1.6.3_modif, danpos, dDocent, DeconSeq, deepTools, defusion, delly, DESMAN, destruct, DETONATE, diamond, diploSHIC, discoal, Discovar, Discovar de novo, distruct, DiTASiC, DIYABC, Docker, dREG, dREG.HD, Drop-seq, dropEst, dropSeqPipe, dsk, Dsuite, ea-utils, ecopcr, ecoPrimers, ectyper, EDGE, edirect, eems, EGAD, EIGENSOFT, EMBOSS, entropy, ephem, epic2, ermineJ, ete3, exabayes, exonerate, eXpress, FALCON, FALCON_unzip, Fast-GBS, fasta, FastANI, fastcluster, FastME, FastML, fastp, FastQ Screen, fastq_pair, fastq_species_detector, FastQC, fastsimcoal26, fastStructure, FastTree, FASTX, feh, fineRADstructure, fineSTRUCTURE, FIt-SNE, flash, flash2, flexbar, Flexible Adapter Remover, Flye, FMAP, FragGeneScan, FragGeneScan, freebayes, FunGene Pipeline, GAEMR, Galaxy, GATK, gatk4,, Gblocks, GBRS, gcc, GCTA, GDAL, gdc-client, GEM library, GEMMA, geneid, GeneMark, GeneMarker, Genome STRiP, GenomeMapper, GenomeStudio (Illumina), GenomeThreader, GenomicConsensus, gensim, GEOS, germline, gerp++, GET_PHYLOMARKERS, GffCompare, gffread, giggle, GMAP/GSNAP, GNU Compilers, GNU parallel, go-perl, GoShifter, gradle-4.4, graftM, graphviz, GRiD, Grinder, GROMACS, GSEA, GTDB-Tk, GTFtools, Gubbins, GUPPY, HapCompass, HAPCUT, HAPCUT2, hapflk, HaploMerger, Haplomerger2, HapSeq2, HarvestTools, HiC-Pro, HiCExplorer, HISAT2, HMMER, Homer, HOTSPOT, HTSeq, HUMAnN2, hyperopt, HyPhy, iAssembler, IBDLD, IDBA-UD, IDP-denovo, idr, IgBLAST, IGoR, IGV, IMa2, IMa2p, IMAGE, ImageJ, ImageMagick, Immcantation, impute2, IMSA-A, INDELseek, infernal, Infomap, InStruct, InteMAP, InterProScan, ipyrad, IQ-TREE, iRep, jags, java, jbrowse, jellyfish, JoinMap, julia, jupyter, kallisto, Kent Utilities, keras, khmer, KmerFinder, kraken, kSNP, kWIP, LACHESIS, lammps, LAST, lcMLkin, LDAK, leeHom, Lep-MAP3, lftp, Lighter, LINKS, LocARNA, LocusZoom, lofreq, longranger, LS-GKM, LUCY, LUCY2, LUMPY, lyve-SET, MACE, MACS, MaCS simulator, MACS2, MAFFT, mafTools, Magic-BLAST, magick, MAKER, MAQ, MASH, mashtree, Mashtree, MaSuRCA, Mauve, MaxBin, mccortex, mcl, megahit, MeGAMerge, MEGAN, MELT, MEME Suite, MERLIN, MetaBAT, MetaCRAST, metaCRISPR, MetAMOS, MetaPathways, MetaPhlAn, MetaVelvet, MetaVelvet-SL, MGmapper, Migrate-n, mikado, Minimac3, Minimac4, minimap2, mira, miRDeep2, MISO (misopy), MITObim, MiXCR, MixMapper, MKTest, mlst, MMAP, MMSEQ, MMseqs2, monocle3, mosdepth, mothur, MrBayes, mrsFAST, msld, MSMC, msprime, MSR-CA Genome Assembler, msstats, MSTMap, mugsy, MultiQC, multiz-tba, MUMmer, muscle, MUSIC, muTect, Nanopolish, ncftp, Nemo, Netbeans, NEURON, new_fugue, Nextflow, NextGenMap, nf-core/rnaseq, NGS_data_processing, NGSadmix, ngsDist, ngsF, ngsLD, NgsRelate, ngsTools, NGSUtils, NLR-Parser, Novoalign, NovoalignCS, nvidia-docker, Oases, OBITools, OMA, OrthoFinder, Orthomcl, PacBioTestData, PAGIT, paleomix, PAML, pandas, pandaseq, PanPhlAn, Panseq, Parsnp, PASA, PASTEC, PAUP*, pb-assembly, pbalign, pbh5tools, PBJelly, PBSuite, PCAngsd, PeakRanger, PeakSplitter, PEAR, PEER, PennCNV, PfamScan, PGDSpider, ph5tools, Phage_Finder, PHAST, phenopath, Phobius, PHRAPL, PHYLIP, PhyloCSF, phyloFlash, phylophlan, PhyloPhlAn2, PhyML, Picard, pigz, Pilon, Pindel, piPipes, PIQ, PlasFlow, Platypus, plink, plink2, Plotly, popbam, PopCOGenT, Porechop, portcullis, pplacer, PRANK, prinseq, prodigal, progenomics, progressiveCactus, PROJ, prokka, psutil, pyani, PyCogent, pyfaidx, pyGenomeTracks, PyMC, pyopencl, pypy, pyRAD, Pyro4, PySnpTools, python, PyTorch, PyVCF, QIIME, QIIME2 q2cli, QTCAT, Quake, Qualimap, QuantiSNP2, QUAST, QUMA, R, RACA, racon, RADIS, RadSex, RAPTR-SV, RAxML, Ray, rclone, Rcorrector, RDP Classifier, REAPR, Relate, RelocaTE2, RepeatMasker, RepeatModeler, RFMix, rgdal, RGI, Rgtsvm, ripgrep, rJava, RNAMMER, rnaQUAST, Rnightlights, Roary, Rqtl, Rqtl2, RSEM, RSeQC, RStudio, rtfbs_db, sabre, SaguaroGW, salmon, Sambamba, samblaster, sample, SampleTracker, Samtools, Satsuma, Satsuma2, scikit-learn, Scoary, scythe, seaborn, SecretomeP, selscan, Sentieon, SeqPrep, seqtk, Seurat, sf, sgrep, sgrep sorted_grep, SHAPEIT, shore, SHOREmap, shortBRED, SHRiMP, sickle, SignalP, SimPhy, simuPOP, singularity, sistr_cmd, SKESA, skewer, SLiM, smcpp, SMRT Analysis, SMRT LINK, snakemake, snap, SNAPP, snATAC, SNeP, snippy, snp-sites, SnpEff, SNPgenie, SNPhylo, SNPsplit, SNVPhyl, SOAP2, SOAPdenovo, SOAPdenovo-Trans, SOAPdenovo2, SomaticSniper, sorted_grep, SPAdes, SPALN, SparCC, SPARTA, SRA Toolkit, srst2, stacks, Stacks 2, stairway-plot, stampy, STAR, Starcode, statmodels, STITCH, STPGA, StrainPhlAn, strawberry, Strelka, stringMLST, StringTie, STRUCTURE, Structure_threader, supernova, SURPI, sutta, SVDetect, svtools, SWAMP, SweepFinder, sweepsims, tabix, Tandem Repeats Finder (TRF), TargetP, TASSEL 3, TASSEL 4, TASSEL 5, tcoffee, TensorFlow, TEToolkit, tfTarget, TMHMM, tmux, TopHat, Torch, traitRate, Trans-Proteomic Pipeline (TPP), TransComb, TransDecoder, TRANSIT, transrate, TRAP, treeCl, treemix, Trim Galore!, trimmomatic, Trinity, Trinotate, tRNAscan-SE, UCSC Kent utilities, UMAP, UMI-tools, Unicycler, UniRep, unrar, usearch, Variant Effect Predictor, VarScan, VCF-kit, vcf2diploid, vcfCooker, vcflib, vcftools, vdjtools, Velvet, vep, VESPA, vg, ViennaRNA, VIP, viral-ngs, virmap, VirSorter, VirusDetect, VirusFinder 2, VizBin, vmatch, vsearch, vt, WASP, wgs-assembler (Celera), Wise2 (Genewise), Xander_assembler, yaha

Details for MetaPathways (hide)

About:MetaPathways v2.5 is a meta'omic analysis pipeline for the annotation and analysis for environmental sequence information.
Added:9/21/2016 3:40:39 PM

MetaPathways has to be run in graphical environemnt. i.e., you either need to be connected throu VNC, or your ssh clinet has to have X11 ports forwarded and an X-windows manager has to be running on the machine you are connectiing from (e.g.,  your laptop). Please refer to a description of graphical access options in our User Guide: remote access instructions.

Before running MetaPathways you need to set up program environment by running


Create the directory pgdbs (needed by pathway-tools) in /workdir and make it wolrd-writeable:

mkdir /workdir/pgdbs

chmod a+rwx /workdir/pgdbs

Prepare the BLAST/LAST database directory in /workdir:

rsync -av /programs/MetaPathways/MetaPathways_DBs /workdir

In /workdir, create a directory for the run, for example (replace myID with your own userID):

mkdir -p /workdir/myID/my_run

Copy run configuration file to this new directory:

cd /workdir/myID/my_run

cp /programs/MetaPathways/metapathways2-2.5.1/config/*.txt .

If needed, edit the file template_param.txt and set your preferred databases and run parameters (the file template_config.txt does not have to be edited if the database and pgdbs directories have been created as described above).

Place your input fasta files in /workdir/myID/my_run.

Create the output directory for the run, e.g.:

mkdir -p /workdir/myID/my_run/out

Launch the program's GUI interface by typing

/programs/MetaPathways/bin/MetaPathways2 &

which will make a window pop up. NOTE: The GUI interface, although it needs to stay open, does not work correctly and will not be used to initiate the actual analysis. While the GUI can still be used to view the results, the actual run has to be launched in command-line mode, similar to the following (see manual for details):


source /programs/MetaPathways/metapathways2-2.5.1/MetaPathwaysrc

python /programs/MetaPathways/metapathways2-2.5.1/ -i /workdir/myID/my_run/ -o /workdir/myID/my_run/out/ -c ./template_config.txt -p ./template_param.txt -v -r overlay -s my_input >& run.log

The first of the commands above sets the environment for the run, while the second command launches Metapathways. In the example above, it is assumed that the input file names start with my_input, .i.e.., my_input.fasta. For convenience, the two commands can be placed in a shell script (say,, which can then be easily modified for other runs. The screen output from the second command will be saved on disk in the file run.log.

NOTE: In the final stages of the run, pathway-tools program (called from withn Metapathways) will attempt to temporarily display and then close a few graphical windows, which is the reason why the program needs to be run in graphical environment even though the GUI interface is not being used.

After the run completes, the results, created in directory structure under /workdir/myID/my_run/out, can be visualized using the Methapathways GUI (which was launched in the beginning and is already running in the background).

NOTE on databases: you can add more databases to /workdir/MetaPathways_DBs. Just deposit the fasta file in the proper subdirectory (functional or taxonomic), add the name of the fasta file to the proper database line in template_params.txt. During the run the new database will (or at least should) be formatted and searched. It can also be formatted manually before MetaPathways run, although the details of the formatting command are not yet given in Metapathways description. 


Notify me if this software is upgraded or changed [You need to be logged in to use this feature]


Website credentials: login  Web Accessibility Help