|
|
dbxflat |
Having created the EMBOSS indexes for this file, a database can then be defined in the file emboss.defaults as something like:
DB embl [ type: N dbalias: embl (see below) format: embl method: emboss directory: /data/embl file: *.dat indexdirectory: /data/embl/indexes ]The index file 'basename' given to dbxflat must match the DB name in the definition. If not, then a 'dbalias' line must be given which specifies the basename of the indexes.
% dbxflat
Database b+tree indexing for flat file databases
Basename for index files: embl
Resource name: embl
EMBL : EMBL
SWISS : Swiss-Prot, SpTrEMBL, TrEMBLnew
GB : Genbank, DDBJ
REFSEQ : Refseq
Entry format [SWISS]: embl
Wildcard database filename [*.dat]: rod.dat
Database directory [.]: embl
id : ID
acc : Accession number
sv : Sequence Version and GI
des : Description
key : Keywords
org : Taxonomy
Index fields [id,acc]:
General log output file [outfile.dbxflat]:
|
Go to the output files for this example
SET PAGESIZE 2048 SET CACHESIZE 200The above values are recommended for most systems. The PAGESIZE is a multiple of the size of disc pages the operating system buffers. The CACHESIZE is the number of disc pages dbxflat is allowed to cache.
RES embl [ type: Index idlen: 15 acclen: 15 svlen: 20 keylen: 25 deslen: 25 orglen: 25 ]The length definitions are the maximum lengths of 'words' in the field being indexed. Longer words will be truncated to the value set.
Standard (Mandatory) qualifiers:
[-dbname] string Basename for index files (Any string from 2
to 19 characters, matching regular
expression /[A-z][A-z0-9_]+/)
[-dbresource] string Resource name (Any string from 2 to 19
characters, matching regular expression
/[A-z][A-z0-9_]+/)
-idformat menu [SWISS] Entry format (Values: EMBL (EMBL);
SWISS (Swiss-Prot, SpTrEMBL, TrEMBLnew); GB
(Genbank, DDBJ); REFSEQ (Refseq))
-filenames string [*.dat] Wildcard database filename (Any
string is accepted)
-directory directory [.] Database directory
-fields menu [id,acc] Index fields (Values: id (ID); acc
(Accession number); sv (Sequence Version and
GI); des (Description); key (Keywords); org
(Taxonomy))
-outfile outfile [*.dbxflat] General log output file
Additional (Optional) qualifiers: (none)
Advanced (Unprompted) qualifiers:
-release string [0.0] Release number (Any string up to 9
characters)
-date string [00/00/00] Index date (Date string dd/mm/yy)
-exclude string Wildcard filename(s) to exclude (Any string
is accepted)
-indexoutdir outdir [.] Index file output directory
Associated qualifiers:
"-outfile" associated qualifiers
-odirectory string Output directory
General qualifiers:
-auto boolean Turn off prompts
-stdout boolean Write first file to standard output
-filter boolean Read first file from standard input, write
first file to standard output
-options boolean Prompt for standard and additional values
-debug boolean Write debug output to program.dbg
-verbose boolean Report some/full command line options
-help boolean Report command line options. More
information on associated and general
qualifiers can be found with -help -verbose
-warning boolean Report warnings
-error boolean Report errors
-fatal boolean Report fatal errors
-die boolean Report dying program messages
|
| Standard (Mandatory) qualifiers | Allowed values | Default | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| [-dbname] (Parameter 1) |
Basename for index files | Any string from 2 to 19 characters, matching regular expression /[A-z][A-z0-9_]+/ | Required | ||||||||||||
| [-dbresource] (Parameter 2) |
Resource name | Any string from 2 to 19 characters, matching regular expression /[A-z][A-z0-9_]+/ | Required | ||||||||||||
| -idformat | Entry format |
|
SWISS | ||||||||||||
| -filenames | Wildcard database filename | Any string is accepted | *.dat | ||||||||||||
| -directory | Database directory | Directory | . | ||||||||||||
| -fields | Index fields |
|
id,acc | ||||||||||||
| -outfile | General log output file | Output file | <*>.dbxflat | ||||||||||||
| Additional (Optional) qualifiers | Allowed values | Default | |||||||||||||
| (none) | |||||||||||||||
| Advanced (Unprompted) qualifiers | Allowed values | Default | |||||||||||||
| -release | Release number | Any string up to 9 characters | 0.0 | ||||||||||||
| -date | Index date | Date string dd/mm/yy | 00/00/00 | ||||||||||||
| -exclude | Wildcard filename(s) to exclude | Any string is accepted | An empty string is accepted | ||||||||||||
| -indexoutdir | Index file output directory | Output directory | . | ||||||||||||
Processing directory: /homes/user/test/embl/
Processing file: rod.dat entries: 6 (6) time: 0.0s (0.0s)
Buckets: free:1 count:3 maxsize:7
used:-4 maxused:0 maxfree:1
Empty Buckets: free:2 Used:4 MaxUsed:4 MaxFree:2
Cache size:100 total hits:94
Cache hits last page:14 (14.9%)
Cache misses:9 (9.6%)
0: 66 70.2 70.2%
1: 0 0.0 70.2%
2: 3 3.2 73.4%
3: 2 2.1 75.5%
4: 0 0.0 75.5%
5: 0 0.0 75.5%
6: 0 0.0 75.5%
7: 0 0.0 75.5%
8: 0 0.0 75.5%
9: 0 0.0 75.5%
10: 0 0.0 75.5%
11: 0 0.0 75.5%
12: 0 0.0 75.5%
13: 0 0.0 75.5%
14: 0 0.0 75.5%
15: 0 0.0 75.5%
16: 0 0.0 75.5%
17: 0 0.0 75.5%
18: 0 0.0 75.5%
19: 0 0.0 75.5%
20: 0 0.0 75.5%
21: 0 0.0 75.5%
22: 0 0.0 75.5%
23: 0 0.0 75.5%
24: 0 0.0 75.5%
25: 0 0.0 75.5%
26: 0 0.0 75.5%
27: 0 0.0 75.5%
28: 0 0.0 75.5%
29: 0 0.0 75.5%
30: 0 0.0 75.5%
31: 0 0.0 75.5%
32: 0 0.0 75.5%
33: 0 0.0 75.5%
34: 0 0.0 75.5%
35: 0 0.0 75.5%
36: 0 0.0 75.5%
37: 0 0.0 75.5%
38: 0 0.0 75.5%
39: 0 0.0 75.5%
40: 0 0.0 75.5%
41: 0 0.0 75.5%
[Part of this file has been deleted for brevity]
50: 0 0.0 75.5%
51: 0 0.0 75.5%
52: 0 0.0 75.5%
53: 0 0.0 75.5%
54: 0 0.0 75.5%
55: 0 0.0 75.5%
56: 0 0.0 75.5%
57: 0 0.0 75.5%
58: 0 0.0 75.5%
59: 0 0.0 75.5%
60: 0 0.0 75.5%
61: 0 0.0 75.5%
62: 0 0.0 75.5%
63: 0 0.0 75.5%
64: 0 0.0 75.5%
65: 0 0.0 75.5%
66: 0 0.0 75.5%
67: 0 0.0 75.5%
68: 0 0.0 75.5%
69: 0 0.0 75.5%
70: 0 0.0 75.5%
71: 0 0.0 75.5%
72: 0 0.0 75.5%
73: 0 0.0 75.5%
74: 0 0.0 75.5%
75: 0 0.0 75.5%
76: 0 0.0 75.5%
77: 0 0.0 75.5%
78: 0 0.0 75.5%
79: 0 0.0 75.5%
80: 0 0.0 75.5%
81: 0 0.0 75.5%
82: 0 0.0 75.5%
83: 0 0.0 75.5%
84: 0 0.0 75.5%
85: 0 0.0 75.5%
86: 0 0.0 75.5%
87: 0 0.0 75.5%
88: 0 0.0 75.5%
89: 0 0.0 75.5%
90: 0 0.0 75.5%
91: 0 0.0 75.5%
92: 0 0.0 75.5%
93: 0 0.0 75.5%
94: 0 0.0 75.5%
95: 0 0.0 75.5%
96: 0 0.0 75.5%
97: 0 0.0 75.5%
98: 0 0.0 75.5%
99: 14 14.9 90.4%
Total time: 0.0s
|
# Number of files: 1 # Release: 0.0 # Date: 00/00/00 Single filename database rod.dat |
Order 71 Fill 47 Pagesize 2048 Level 0 Cachesize 100 Order2 82 Fill2 99 Count 8 Kwlimit 15 |
Order 71 Fill 47 Pagesize 2048 Level 0 Cachesize 100 Order2 82 Fill2 99 Count 5 Kwlimit 15 |
This file contains non-printing characters and so cannot be displayed here.
This file contains non-printing characters and so cannot be displayed here.
| Program name | Description |
|---|---|
| dbiblast | Index a BLAST database |
| dbifasta | Database indexing for fasta file databases |
| dbiflat | Index a flat file database |
| dbigcg | Index a GCG formatted database |
| dbxfasta | Database b+tree indexing for fasta file databases |
| dbxgcg | Database b+tree indexing for GCG formatted databases |