Connecting Infrastructure, Connecting Research

FASTA

Description

The FASTA programs find regions of local or global (new) similarity between Protein or DNA sequences, either by searching Protein or DNA databases, or by identifying local duplications within a sequence. Other programs provide information on the statistical significance of an alignment. Like BLAST, FASTA can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

Site Version (Latest: v35.4.9) Notes
ngs.rl.ac.uk v34.26.05  

 

License

The FASTA program and documentation may not be sold or incorporated into a commercial product, in whole or in part, without written consent of William R. Pearson and the University of Virginia.

Code in the smith_waterman_sse2.c and smith_waterman_sse2.h files is copyright (c) 2006 by Michael Farrar. This program may not be sold or incorporated into a commercial product, in whole or in part, without written consent of Michael Farrar.

Further details of the licensing conditions can be found from the Copyright file in the FASTA code download.

Users of this application should credit the code authors and the NGS accordingly and where applicable the specific site where the code is installed, when publishing work using NGS resources.

 

Running FASTA on the NGS

The example input files mentioned in the examples below can be found at:

gsiftp://ngs.rl.ac.uk:2811/apps/fasta/examples/humanDNA2.input

 

FASTA consists of a number of programs which are listed below:

  • Serial programmes:
  • fasta34   fasts34  fasty34  ssearch34  
    tfasts34  tfasty34 fastm34  fastx34  
    tfasta34  tfastx34 prss34 

     

  • Parallel programmes:
  • fasta34_t  fasts34_t  fasty34_t ssearch34_t   
    tfasts34_t tfasty34_t fastm34_t fastx34_t 
    tfasta34_t  tfastx34_t
    

     

  • Submission Using the UI/WMS

    To submit an FASTA job via the WMS, login to the UI and create a JDL file as below, replacing your ngs ID on the UI for ngsxxxx, then submit using:

    glite-wms-job-submit -o jobIDs -a fasta.jdl

    Type = "Job";
    JobType = "Normal";
    Executable = "/usr/ngs/FASTA";
    Arguments = "fasta34 -H humanDNA2.input \"$DB_PATH\"/live/fasta_DB/em_rel_htc_hum 5";
    StdOutput = "fasta34-HTC.out";
    StdError = "std.err";
    InputSandbox = {"humanDNA2.input"};
    InputSandboxBaseURI = "gsiftp://ngsui03.ngs.ac.uk:2811/home/ngsxxxx/fasta";
    OutputSandbox = {"std.err","fasta34-HTC.out"};
    OutputSandboxBaseDestURI = "gsiftp://ngsui03.ngs.ac.uk:2811/home/ngsxxxx/fasta";
    Requirements = {
        Member("NGS-UEE-FASTA", other.GlueHostApplicationSoftwareRunTimeEnvironment)
    };
    ShallowRetryCount = -1;
    
  • NGS Web Portal Submission

    After logging in to the NGS portal, select the "FASTA" template under the " Bioinformatics" category and make the amendments appropriate for your specific job. Read the description page of the template for further details.

  • Serial Submission Using Globus

    Eg To submit a serial FASTA job on ngs.rl.ac.uk use the following command, replacing 'xxx' by your unix ID on the NGS site where the code will be run (one line. You should not include the backslashes):

    globusrun -b -r ngs.rl.ac.uk/jobmanager-lsf \ '&(stdout=fasta34-HTC.out)(directory=/home/xxx/fasta) \ (executable=/usr/ngs/FASTA_34_26_5) \ (arguments=fasta34 -H humanDNA2.input \"$DB_PATH\"/live/fasta_DB/em_rel_htc_hum 5)'

    • This runs the default version. To explicitly run v34.26.5 use FASTA_34_26_5 instead of FASTA.
    • The input file is humanDNA2.input should be given as an argument not as stdin.
    • The FIRST argument should be the name of the FASTA program you want to use, that is fasta34 in this example.
    • The library being searched is the HTC library in FASTA format available in the /${DB}/EBI/output/fasta directory on the NGS site running the program. For information on the available EMBL Nucleotide Sequence data files in FASTA format see here

     

  • Parallel Submission Using Globus

    Eg To submit a parallel FASTA job on ngs.rl.ac.uk use the following command, replacing 'xxx' by your unix ID on the NGS site where the code will be run (one line. You should not include the backslashes):

    globusrun -b -r ngs.rl.ac.uk/jobmanager-lsf \ '&(jobtype=single)(count=4)(stderr=fa_pa.err)(stdout=fast34-HTC.out) \ (directory=/home/xxx/fasta)(executable=/usr/ngs/FASTA_34_26_5) \ (arguments=fasta34_t -H humanDNA2.input \"$DB_PATH\"/EBI/output/fasta/HTC 5)'

    • This runs the default version. To explicitly run v34.26.5 use FASTA_34_26_5 instead of FASTA.

     

Further Information

 

 

Applications Support

The NGS cannot offer scientific support for applications. However if you require further information or believe there is something wrong with the installation, please contact the NGS support centre.

Acknowledgements

Please note: When publishing work based on use of the NGS, users should acknowledge both the authors of any programs used (see the individual program web sites, or contact the authors directly) and the NGS directly using the following line:
"The authors would like to acknowledge the use of the UK National Grid Service in carrying out this work"
This line must also accompany any use of the NGS logos.