![]() |
enetnglyc |
Please help by correcting and extending the Wiki pages.
% enetnglyc Reports N-glycosylation sites in human proteins Input (aligned) sequence set: LEUK_RAT.fsa Output file [leuk_rat.enetnglyc]: |
Go to the input files for this example
Go to the output files for this example
Reports N-glycosylation sites in human proteins Version: EMBOSS:6.2.0 Standard (Mandatory) qualifiers: [-sequence] seqset (Aligned) sequence set filename and optional format, or reference (input USA) [-outfile] outfile [*.enetnglyc] Output file name Additional (Optional) qualifiers: (none) Advanced (Unprompted) qualifiers: -plot boolean [N] Produce graphics -asparagines boolean [N] Predict on all asparagines -thresholds boolean [N] Show additional thresholds when -plot is used Associated qualifiers: "-sequence" associated qualifiers -sbegin1 integer Start of each sequence to be used -send1 integer End of each sequence to be used -sreverse1 boolean Reverse (if DNA) -sask1 boolean Ask for begin/end/reverse -snucleotide1 boolean Sequence is nucleotide -sprotein1 boolean Sequence is protein -slower1 boolean Make lower case -supper1 boolean Make upper case -sformat1 string Input sequence format -sdbname1 string Database name -sid1 string Entryname -ufo1 string UFO features -fformat1 string Features format -fopenfile1 string Features file name "-outfile" associated qualifiers -odirectory2 string Output directory General qualifiers: -auto boolean Turn off prompts -stdout boolean Write first file to standard output -filter boolean Read first file from standard input, write first file to standard output -options boolean Prompt for standard and additional values -debug boolean Write debug output to program.dbg -verbose boolean Report some/full command line options -help boolean Report command line options and exit. More information on associated and general qualifiers can be found with -help -verbose -warning boolean Report warnings -error boolean Report errors -fatal boolean Report fatal errors -die boolean Report dying program messages -version boolean Report version number and exit |
Qualifier | Type | Description | Allowed values | Default |
---|---|---|---|---|
Standard (Mandatory) qualifiers | ||||
[-sequence] (Parameter 1) |
seqset | (Aligned) sequence set filename and optional format, or reference (input USA) | Readable set of sequences | Required |
[-outfile] (Parameter 2) |
outfile | Output file name | Output file | <*>.enetnglyc |
Additional (Optional) qualifiers | ||||
(none) | ||||
Advanced (Unprompted) qualifiers | ||||
-plot | boolean | Produce graphics | Boolean value Yes/No | No |
-asparagines | boolean | Predict on all asparagines | Boolean value Yes/No | No |
-thresholds | boolean | Show additional thresholds when -plot is used | Boolean value Yes/No | No |
Associated qualifiers | ||||
"-sequence" associated seqset qualifiers | ||||
-sbegin1 -sbegin_sequence |
integer | Start of each sequence to be used | Any integer value | 0 |
-send1 -send_sequence |
integer | End of each sequence to be used | Any integer value | 0 |
-sreverse1 -sreverse_sequence |
boolean | Reverse (if DNA) | Boolean value Yes/No | N |
-sask1 -sask_sequence |
boolean | Ask for begin/end/reverse | Boolean value Yes/No | N |
-snucleotide1 -snucleotide_sequence |
boolean | Sequence is nucleotide | Boolean value Yes/No | N |
-sprotein1 -sprotein_sequence |
boolean | Sequence is protein | Boolean value Yes/No | N |
-slower1 -slower_sequence |
boolean | Make lower case | Boolean value Yes/No | N |
-supper1 -supper_sequence |
boolean | Make upper case | Boolean value Yes/No | N |
-sformat1 -sformat_sequence |
string | Input sequence format | Any string | |
-sdbname1 -sdbname_sequence |
string | Database name | Any string | |
-sid1 -sid_sequence |
string | Entryname | Any string | |
-ufo1 -ufo_sequence |
string | UFO features | Any string | |
-fformat1 -fformat_sequence |
string | Features format | Any string | |
-fopenfile1 -fopenfile_sequence |
string | Features file name | Any string | |
"-outfile" associated outfile qualifiers | ||||
-odirectory2 -odirectory_outfile |
string | Output directory | Any string | |
General qualifiers | ||||
-auto | boolean | Turn off prompts | Boolean value Yes/No | N |
-stdout | boolean | Write first file to standard output | Boolean value Yes/No | N |
-filter | boolean | Read first file from standard input, write first file to standard output | Boolean value Yes/No | N |
-options | boolean | Prompt for standard and additional values | Boolean value Yes/No | N |
-debug | boolean | Write debug output to program.dbg | Boolean value Yes/No | N |
-verbose | boolean | Report some/full command line options | Boolean value Yes/No | Y |
-help | boolean | Report command line options and exit. More information on associated and general qualifiers can be found with -help -verbose | Boolean value Yes/No | N |
-warning | boolean | Report warnings | Boolean value Yes/No | Y |
-error | boolean | Report errors | Boolean value Yes/No | Y |
-fatal | boolean | Report fatal errors | Boolean value Yes/No | Y |
-die | boolean | Report dying program messages | Boolean value Yes/No | Y |
-version | boolean | Report version number and exit | Boolean value Yes/No | N |
>LEUK_RAT P13838 LEUKOSIALIN PRECURSOR (LEUCOCYTE SIALOGLYCOPROTEIN) (SIALOPHORIN) (CD43) (W3/13 ANTIGEN) (FRAGMENT). - RATTUS NORVEGICUS (RAT). WAQVVSQENLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPVTEDPKESISPWGQTTAP ASSIPLGTPELSSFFFTSAGASGNTPVPELTTSQEVSTEASLVLFPKSSGVASDPPVTIT NPATSSAVASTSLETFKGTSAPPVTVTSSTMTSGPFVATTVSSETSGPPVTMATGSLGPS KETHGLSATIATSSGESSSVAGGTPVFSTKISTTSTPNPITTVPPRPGSSGMLLVSMLIA LTVVLVLVALLLLWRQRQKRRTGALTLSRGGKRNGTVDAWAGPARVPDEEATTASGSGGN KSSGAPETDGSGQRPTLTTFFSRRKSRQGSVALEELKPGTGPNLKGEEEPLVGSEDEAVE TPTSDGPQAKDGAAPQSL |
# Predictions for N-Glycosylation sites in 1 sequence ######################################################################################## Warning: This sequence may not contain a signal peptide!! Proteins without signal peptides are unlikely to be exposed to the N-glycosylation machinery and thus may not be glycosylated (in vivo) even though they contain potential motifs. SignalP-NN euk predictions are as follows: # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? D ? LEUK_RAT 0.112 32 N 0.088 16 N 0.362 8 N 0.119 N 0.104 N SignalP output is explained at http://www.cbs.dtu.dk/services/SignalP/output.html ######################################################################################## Name: LEUK_RAT Length: 378 WAQVVSQENLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPVTEDPKESISPWGQTTAPASSIPLGTPELSSFFFTSAG 80 ASGNTPVPELTTSQEVSTEASLVLFPKSSGVASDPPVTITNPATSSAVASTSLETFKGTSAPPVTVTSSTMTSGPFVATT 160 VSSETSGPPVTMATGSLGPSKETHGLSATIATSSGESSSVAGGTPVFSTKISTTSTPNPITTVPPRPGSSGMLLVSMLIA 240 LTVVLVLVALLLLWRQRQKRRTGALTLSRGGKRNGTVDAWAGPARVPDEEATTASGSGGNKSSGAPETDGSGQRPTLTTF 320 FSRRKSRQGSVALEELKPGTGPNLKGEEEPLVGSEDEAVETPTSDGPQAKDGAAPQSL 400 ................................................................................ 80 ................................................................................ 160 ................................................................................ 240 .................................N.........................N.................... 320 .......................................................... 400 (Threshold=0.5) ---------------------------------------------------------------------- SeqName Position Potential Jury N-Glyc agreement result ---------------------------------------------------------------------- LEUK_RAT 274 NGTV 0.7323 (9/9) ++ LEUK_RAT 300 NKSS 0.5404 (6/9) + ---------------------------------------------------------------------- |
Program name | Description |
---|---|
antigenic | Finds antigenic sites in proteins |
digest | Reports on protein proteolytic enzyme or reagent cleavage sites |
echlorop | Reports presence of chloroplast transit peptides |
eiprscan | Motif detection |
elipop | Prediction of lipoproteins |
emast | Motif detection |
ememe | Multiple EM for Motif Elicitation |
ememetext | Multiple EM for Motif Elicitation. Text file only |
enetoglyc | Reports mucin type GalNAc O-glycosylation sites in mammalian proteins |
enetphos | Reports ser, thr and tyr phosphorylation sites in eukaryotic proteins |
epestfind | Finds PEST motifs as potential proteolytic cleavage sites |
eprop | Reports propeptide cleavage sites in proteins |
esignalp | Reports protein signal cleavage sites |
etmhmm | Reports transmembrane helices |
eyinoyang | Reports O-(beta)-GlcNAc attachment sites |
fuzzpro | Search for patterns in protein sequences |
fuzztran | Search for patterns in protein sequences (translated) |
helixturnhelix | Identify nucleic acid-binding motifs in protein sequences |
oddcomp | Identify proteins with specified sequence word composition |
omeme | Motif detection |
patmatdb | Searches protein sequences with a sequence motif |
patmatmotifs | Scan a protein sequence with motifs from the PROSITE database |
pepcoil | Predicts coiled coil regions in protein sequences |
preg | Regular expression search of protein sequence(s) |
pscan | Scans protein sequence(s) with fingerprints from the PRINTS database |
sigcleave | Reports on signal cleavage sites in a protein sequence |
The original CBS group application must be licensed and installed to use this wrapper.
Although we take every care to ensure that the results of the EMBOSS version are identical to those from the original package, we recommend that you check your inputs give the same results in both versions before publication.
Please report all bugs in the EMBOSS version to the EMBOSS bug team, not to the original author.