ajassemdata


Enum AjEAssemSortOrder

Assembly reads sort order Type enumeration.

Enumerated value(s)

Name
ajEAssemSortOrderUnknownUnknown - default in SAM/BAM formats
ajEAssemSortOrderUnsortedUnsorted - unsorted
ajEAssemSortOrderQuerynameQueryname - sorted by query/read names
ajEAssemSortOrderCoordinateCoordinate - major sort key is the the reference/contig name, order defined by the order of


Enum AjEAssemPlatform

Platforms/technologies to produce reads, as enumerated in SAM specv1.4.

Enumerated value(s)

Name
ajEAssemPlatformUnknownUnknown
ajEAssemPlatformCapillaryCapillary
ajEAssemPlatformLS454LS 454
ajEAssemPlatformIlluminaIllumina
ajEAssemPlatformSolidSolid
ajEAssemPlatformHelicosHelicos
ajEAssemPlatformIontorrentIonTorrent
ajEAssemPlatformPacbioPacific Biosciences


Data type AjPAssemRead

Store individual alignments of reads in assemblies.

In some assemblies some reads can align to more than one contig or reference sequence, so current name of the data type is not perfect.

For storing read information as part of an assembly object. Current version of this data type is MIRA/SAM oriented, it needs to be improved to be a proper common data type, some of the current fields are used by MIRA MAF format only while few others are used by the SAM format only.

Alias name(s)

Name
AjSAssemRead
AjOAssemRead

Attributes

NameTypeDescription
NameAjPStrName
SeqAjPStrSequence
SeqQAjPStrSequence quality string
TemplateAjPStrName of the DNA template a sequence comes from
FileAjPStrName of the sequencing file which contains raw data for this read
TechnologyAjPStrSequencing technology
CigarAjPStrCIGAR string
TagsAjPListList of AjPAssemTag objects
AlignmentBlocksAjPListAlignment blocks
RnextajlongReference number of the mate/next fragment
ReferenceajlongReference sequence
PnextajlongPosition of the mate/next fragment
TlenajintObserved template length
FlagajintFlag
MapQajintMap quality
TemplateSizeMinajintMinimum template size
TemplateSizeMaxajintMaximum template size
ClipLeftajintClip left
ClipRightajintClip right
VectorLeftajintClip left due to sequencing vector
VectorRightajintClip right due to sequencing vector
QualLeftajintClip left due to quality
QualRightajintClip right due to quality
x1ajintinterval of the contig (1-based as in SAM and MAF)
y1ajintend of contig interval
x2ajintinterval of the read
y2ajintend of read interval
ReversedAjBooltrue: has been reverse-complemented
DirectioncharDirection of the read with respect to the template
Paddingchar[7]Padding to alignment boundary


Data type AjPAssemContig

Ajax AssemContig object to store contigs in assemblies.

In mapping assemblies consensus sequence refers to the reference sequence.

Attributes

NameTypeDescription
NameAjPStrName
ConsensusAjPStrConsensus/reference sequence
ConsensusQAjPStrQuality string for the consensus sequence
TagsAjPListTags for the consensus sequence
LengthajintLength of the consensus sequence
NreadsajintNumber of reads
AssemblyIDAjPStrAssembly ID
MD5AjPStrMD5 checksum of the consensus sequence in the uppercase, with gaps and spaces removed
SpeciesAjPStrSpecies
URIAjPStrURI of the consensus sequences


Data type AjPAssemin

Ajax Assembly Input object.

Holds the input specification and information needed to read the assembly and possible further entries

Alias name(s)

Name
AjSAssemin
AjOAssemin

Attributes

NameTypeDescription
InputAjPTextinGeneral text input object
BamIdxAjPBamIndexBAM index
BamInputAjBoolBAM file input
cbeginajintContig start position
cendajintContig end position
LoadingAjBoolTrue if data is now loading


Data type AjPAssem

Ajax Assembly object.

Holds the assembly itself, plus associated information.

Alias name(s)

Name
AjSAssem
AjOAssem

Attributes

NameTypeDescription
IdAjPStrId of term
DbAjPStrDatabase name from input
SetdbAjPStrDatabase name from command line
FullAjPStrFull name
QryAjPStrQuery for re-reading
FormatstrAjPStrInput format name
FilenameAjPStrOriginal filename
TextptrAjPStrFull text
BamHeaderAjPSeqBamHeaderBAM header
ContigsAjPTableContigs table, storing contigs by name
ContigsIgnoredAjPTableContigs named as '*' because of missing header '

Attributes

NameTypeDescription
ContigsOrderAjPListOrder of contigs in the assembly
ContigArrayAjPAssemContig*Contigs array for fast access
ReadsAjPListList of reads in the assembly
ReadgroupsAjPTableTable of read-groups in the assembly
recAjPAssemReadCurrent/last read/alignment record read
FposajlongFile position
FormatAjEnumInput format enum
CountajuintNumber of lines read (contigs in case of BAM)
HasdataAjBoolTrue when data has been loaded
SOAjEAssemSortOrderReads sort order in the assembly


Data type AjPAssemload

Ajax assembly loader object.

Inherits an AjPAssem but allows more assembly data to be read from the same input by also inheriting the AjPVarin input object.

Alias name(s)

Name
AjSAssemload
AjOAssemload

Attributes

NameTypeDescription
AssemAjPAssemCurrent variation
AsseminAjPAsseminAssembly input for reading next
CountajuintCount of terms so far
LoadingAjBoolTrue if data is now loading
ReturnedAjBoolif true: Assembly object has been returned to a new owner and is not to be deleted by the destructor
PaddingajuintPadding to alignment boundary


Data type AjPAssemAccess

Ajax assembly access database reading object.

Holds information needed to read an assembly entry from a database. Access methods are defined for each known database type.

Assembly entries are read from the database using the defined database access function, which is usually a static function within ajassemdb.c

This should be a static data object but is needed for the definition of AjPAssemin.

Alias name(s)

Name
AjSAssemAccess
AjOAssemAccess

Attributes

NameTypeDescription
Nameconst char*Access method name used in emboss.default
AccessAjBool functionAccess function
AccessFreeAjBool functionAccess cleanup function
Qlinkconst char*Supported query link operators
Descconst char*Description
AliasAjBoolAlias for another name
EntryAjBoolSupports retrieval of single entries
QueryAjBoolSupports retrieval of selected entries
AllAjBoolSupports retrieval of all entries
ChunkedAjBoolSupports retrieval of entries in chunks
PaddingAjBoolPadding to alignment boundary


Data type AjPAssemTag

Ajax AssemTag object to store MIRA tags for the read sequences as well as the contig consensus sequences.

SAM format also allows a set of predefined tags as well as it reserves lowercase tags for end users.

Attributes

NameTypeDescription
NameAjPStrName
CommentAjPStrTag comment in case of MAF, or tag value in case of SAM
x1ajuintX value
y1ajuintY value
typecharValue type: AcCsSiIfZHB
Paddingchar[7]Padding to alignment boundary


Data type AjPAssemReadalignmentblock

read alignment block

Attributes

NameTypeDescription
readStartintRead start
referenceStartintReference start
lengthintalignment length


Data type AjPAssemReadgroup

read-group object to store information about read groups as described in SAM spec.

Attributes

NameTypeDescription
IDAjPStrRead group identifier
CNAjPStrName of the sequencing center producing the read
DescAjPStrDescription
DateAjPStrDate the run was produced
FlowOrderAjPStrThe array of nucleotide bases that correspond to the nucleotides used for each flow of each record
KeySeqAjPStrThe array of nucleotide bases that correspond to the key sequence of each read
LibraryAjPStrLibrary
ProgramsAjPStrPrograms used for processing the read group
UnitAjPStrPlatform unit (e.g. flowcell-barcode for Illumina)
SampleAjPStrSample name, or pool name when a pool was sequenced
IsizeajintPredicted median insert size
PlatformAjEAssemPlatformInstrument platform