EMBOSS: Project Meeting (Sep 13th 1999) |
Richard's 'arbitrary alphabet' applications are intended for comparisons of feature tables.
Gary has aded "merger" to align ends of sequences at HGMP. This is a global alignment with a very high gap opening penalty.
Some users would like multiple pattern searches, for example with fuzznuc.
Multiple runs of pattern matching could make multiple GFF output files. There was discussion on how multiple GFF files could be reused. The result was a specification that multiple GFF files concatenated together could be read and the sequence name checked against the input sequence. This means reading an entire GFF file into memory for opne or more input sequences. This is also useful for GFF files and FASTA files with multiple sequences. GFF reading does not cope well with this yet but can be easily modified.
Peter is working on code to read sequences from blast1 and blast2 indexed databases. This will read directly from the databases but will include generating optional EMBOSS entryname and accession number index files for both formats.
Val is working on generalized sequence formatting and printing. Peter has had other requests for this kind of application.
Next meeting Monday 20th September, 11:00am, usual place.