summaryrefslogtreecommitdiff
path: root/academic/samtools/README
blob: 0203586f9426858ec4cf202d601d53e220da66a1 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
SAM (Sequence Alignment/Map) format is a generic format for storing
large nucleotide sequence alignments. SAM aims to be a format that:

- Is flexible enough to store all the alignment information generated
  by various alignment programs
- Is simple enough to be easily generated by alignment programs or
  converted from existing alignment formats
- Is compact in file size
- Allows most of operations on the alignment to work on a stream
  without loading the whole alignment into memory
- Allows the file to be indexed by genomic position to efficiently
  retrieve all reads aligning to a locus. 

SAM Tools provide various utilities for manipulating alignments in the
SAM format, including sorting, merging, indexing and generating 
alignments in a per-position format.

Publication:
Li H.*, Handsaker B.*, Wysoker A., Fennell T., Ruan J., Homer N.,
Marth G., Abecasis G., Durbin R. and 1000 Genome Project Data
Processing Subgroup (2009) The Sequence alignment/map (SAM) format and
SAMtools. Bioinformatics, 25, 2078-9. [PMID: 19505943]