Difference between revisions of "FASTA"

Revision as of 15:03, 25 February 2011

FASTA is a software package for aligning nucleotide or amino acid sequences. Its primary use is to search databases for sequences that are similar to a given candidate sequence.

Responsible person: User:Joel Hedlund (NSC)

Computational considerations

Work locally

Many of the features in FASTA require access to database flatfiles, and standard practice when running a compute cluster is to copy all necessary files to a node local directory before any work is done with them. This behaviour is highly encouraged on most resources, since multiple simultaneous accesses to the same large files on a shared disk is likely to cause problems for all computations currently running on the resource, and not only for the owner of the badly behaving jobs.

Do not run out of memory

If possible, you should ensure that you have enough RAM to hold the database as well as the results and still have some headroom. This ensures that FASTA will not need to read data from disk unnecessarily, which otherwise would cause significant slowdown. This can be done for example by:

Choose a system with enough RAM
Multiprocessor systems generally have more memory than single processor systems, and the database will also require proportionally less memory, since only one copy is needed in the OS file cache regardless of the number of processors using it.
Partition the search space
For huge databases or very restricted amounts available memory it may be required to split the database into manageable chunks and process them as separate jobs.

Links

Official site

@@ Line 16: / Line 16: @@
 * '''Choose a system with enough RAM''' <br/> Multiprocessor systems generally have more memory than single processor systems, and the database will also require proportionally less memory, since only one copy is needed in the OS file cache regardless of the number of processors using it.
 * '''Partition the search space''' <br/> For huge databases or very restricted amounts available memory it may be required to split the database into manageable chunks and process them as separate jobs.
+== Links ==
+* [http://fasta.bioch.virginia.edu/fasta_www2/fasta_list2.shtml Official site]

Difference between revisions of "FASTA"

Revision as of 15:03, 25 February 2011

Contents

Computational considerations

Work locally

Do not run out of memory

Links

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

People

For Staff

Tools