Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCACAACCAACTTTCCCTTCTCTAACTCTCAGGTATTTCGCCATTCCCTTAATCACCGATACCCTTTCTTTTGTTTGAGAAAACTTATACAACTTCCTTCGCTTTTCTCTTTTTCCCAACATTTTCTACCCCAGAAATTGGGCAAACACCAAAAGATCATTGAACAATCAATTCAGAACCATTAAGCTGGTATGTTTCTCTCCATTCAAAAACTGGGTTTGTTTCTGCAAAAACTTTGGAATGGATTTTAAACCAAAATCTCAACCCTATCGTGCTTTTGGTTCTAATTAGTCATCCCCATGTTTGAATACTTCTATATTCCTTTGTTTTGATTGGTTATTTATGGTTTGAATTTCTTTAGAATCCTCTTTTTCGAGGATGTTTGGTGATTATAAAGAATGGGTCAATAATCAATCCGTATTTACTTGTTTCCTAATAAGTTTCGTGATGATGCTGGCAGAGATGGAGGTTTCAGCTTCAGGATATGTCCAAAGATCAACTCTCCCCCAAGCACCACTTTGTTCATATTGTCGAAGGGTAAGTGGAAAAGATATTATTTACTCTGCTTTACTCTTCAAAAGAATTTGATCGCATCTGAAATGTGCATTTTCAAGCCATAATGATTGAAGTTCTTGACAAAAGAAAGTTATTTAGTTCCATAGCAAGTTAGAAGTTACCAACTGAAGTTGTTGCTACTACTTGGATTCTACACTTGTTATGGGATTTAACTTAGTTGTCAGTGTAATGATATGCTGTTCTTATATATCAAATTTTTTTCGAGAACTTTACCTTATTCGAAGTTTCAAATCAAACCATATGGGCAATCACACAGTTAAATGAGTTAGTACTTTAGGAAATTTGGTTGAATTTCTGAATTAGTATAGTTTCATCTAAGTGTTAAGATGTCGATTGGTAGGGGCGAAAGCTTGAGTTCGGTTTTGAACCAGAATGGAGTTCCTTCACATTCCATTGTCATATCTAGTAGGAAGATAATGGACATGGATCTAAAACAAAAGAGGCAGGATATCAAGATTGAAAACCCTCGAGCGGTAAGCTAGCTCGTATTTCTTTCCCACTTTTAAACGATGTTCTAAACTCCTCTGCAGTTTAATTTCGTAACTGTCGTTTTTGTGGTTTCTACTTCCAGCTTCTTGTTAGTTTTCATCAGAAAAAAATTGATAGCTCAGAACTTTTTATAGAAATCATTTTCCTAAGTTATTGATTATGTTTTGATTCTAAACTTTGATAGCATGGTTATGTATTATAATGATCATTAGTTTTGGAATCCATTTTTTTAAAAATTAAGTCTATAAATACTACTCACGCATGAATTTCCTTGTTTTGTTATCTACTTTTTTCTCATGTTTTCAAAAACTAAGTCAAGTTTTGAGAACAAAAAAAATTAAGCCCCGTTTGATAACTATTTGTTTTATGTTTTTTTAAAATTAAGCCTAAAAATGCTACTTCTACCAATGGGTTTCTATGTTTTTTTATCTATTTTGTATCTATGTTTTTAAAAAACCAAACTAAGTTTTGAAAACTAAAAAAATAGTTTCAATAACTAGTTTTTGTTTTTGCAATTTGGTTAGAAACTTAAATGGTACCTCTAGAATGATGGAAATTATTGTAGAGAAGATGAAAAAAAAAAATGTGGAAGAACATGCATCATTTTCAAAAACCAAAAACCAAGACCCAAAAACCAAAATGGTTATCAAACGGGGTCTTGGTTTTCAAAAACTTGTTTTTGTTTTTAGAATTTGACTATGAATTCAAATGTTTTGTCAAGAAAGATGAAATCTATGGTAGAGAATTGATGAGAAGATAAGCACAATTTCCTAAAACCAAGATCCAAATGATTATTAAATGGGCAATTCATCAAAAATGTCAAAGTGCATGTGATTTTAAACTAAGATTTTCATTATATGAGTTGTGTTCATAATTTCGAAGCTTGGTGGTGAAAAGATTGGCATTAGTTGAAGAATTGACAAAATCACTTAAGCCATACTTGGCTGCAATGTGACCTATCACAATTGATGTAAAGAAACAAATCACAATCATGGGGTTAAAAGTATCGTCATATTTGATTGTGTTACTGCACTAGGTTGTGGAACTAGCTAGCCCTTGAAATGTTACTGGCTTTGGTCAATCACTTGACTACATATCCATGTCCATATCTTGTTAGTGGCCTGGAACCTTAATAAAGTTTAGCCCCTTTTTCTTTTGGGAGAGACAGGACAACGTAAAAAATGTGAACTGGTATAGGCTGGGAGAGGCAATGTCTATAGGCATTTTTAAAAAAGTTTAAGAGGTTGTTTGAGGTGCTGAGTGTGTTATAATAACAAGGTTATAATAATCTATAGGTTATTATAATATGTGGAATCATATAATATATTTAAAATGCAAAAGTAATATAGTCTAGGGTTATAATAATATGTGTTTGGGGTGCAAAGTATTCTACATGTTATAATAACATAGGTTATAATAACCTATGCCCCAAACAGACCCTAATAGATATCAATTTTGTTAAACTTAAAGTCCTGACATGATTATGAATTGGTGCTTAAATACCCTCCTAAAATATTTCCAGCACCATCTTCAACCTGGTTAAAGCCTTCTTCTAGGGGCTTGGCCAGGTTCTAGTTTGCAGACTTTTGAGTGATTCAGCGTGAGTTGTTGAAGTGATAAGCCTAGTTGAATTTCAGCTAAGTATCCATATGATTCGTTGCATTTCTGAGCTACATGTGAGAGGAATTTTTGTATGTATGGCCACAAGAGTCAAATTCAAAAGTATTGAGTAAATTATCAATTTTTATAGATGTTTTGATGGGCTTCACTCCCAGGATTGTTCATTCTCATAAGTTTAAGGACATTATTATCAGTTCAACATTAAATTACCATATAATCTATGCTTGTTCTTGGCTTCTTTCTTGTGTTCTACAAATCACTGCAGATTAGATATAGATATCCATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTGGTAAATATAAAATGCATTTCACGCTTGCCTGCCCTCGACTATGTCCTGTAAGTTTCTTTTTCTCACTGGATGTAATGTAACTTTATTCTTTCATTCAGAGCTATGTCTAAACTACACAGATCATCTGTACCGACGCATTTGGAAACTTCATTCAAGTTTATGTGGAGAAGTGACACTAAAACAAACTTATCTGCTCTCTCTTTTTACTTTTTTCTTTGTCAGGCTAGAACCAGTTTGATAGTTTTGATTCCTCTTATAGTATTTTGTGCTAGATGCATAATTGGTGCCTCTTATGCTAGAGTCATCGGAACATCGAAGCTTAAAACCGCTAATAAACCAGAGGGAGAACGTCACAAGTTCAGAAGCGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTCTGAGTCATCCATAGATTATACAGTGAGTAATGTAACTTTATGCTCCTAGTTTTACCCCAATCCACCCTTACATGTTCTTATTCTTTACAGTGTTTTCTTGAGCAGGGTGATTTTAGATTTCTTTAACTGGGGCAGAGGTAATCCTTGCAATGAGAAATGGAAGGAAAATTGACTACTAATAAAGTTGTTGAATTGTTGCTTTCATTTCTTTGAGTACCCAACTCCCCAAGCCTTTTCCAGCTGATCATTCATTTTTGTCTATTACTTGTGGAGTTCTCTGGCCTTCCACTTCCCTTTCTTCTGGTTGGCCTCATCTTTTTGTTTCAATCAGTTGATAATACTGCCAAAATGGTGAACTATCTTCACATAACAACATTGGATTTGCACATGGGTGGGGTGAACAATAAACAAATAAGGGAACAAGGCAGAAATATGTTAATGAGACTGGGTTTAACATGGCCAAAAGTACTTCAAAAAATTATAAAAATACATCATATGTAGAAATCATTTATGCTTCATTTCAAAACAATACCCCTAAAAATCATTCTAAACTGAGCTGAACTCGTAGCACATAAGAGAAAAACTTATCCATGAATTCTTGCTCTGCTGTTCTATATTAGAGACAGCGAGAGCTGCGTATTTGCCTGATTCAAGCTTCGAGAGTCTTCATTTGTCTTCTAATCTCCGTTCCTGCCAAAAGTTTTCAATTTGGTAATAGTTTATAGTTATTCTTTTGCAGAGTCCTTCAGAAGAAGAGATCTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGAGTGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTACACAGAGACCTGAATAG
mRNA sequence
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCACAACCAACTTTCCCTTCTCTAACTCTCAGAAATTGGGCAAACACCAAAAGATCATTGAACAATCAATTCAGAACCATTAAGCTGAGATGGAGGTTTCAGCTTCAGGATATGTCCAAAGATCAACTCTCCCCCAAGCACCACTTTGTTCATATTGTCGAAGGGGGCGAAAGCTTGAGTTCGGTTTTGAACCAGAATGGAGTTCCTTCACATTCCATTGTCATATCTAGTAGGAAGATAATGGACATGGATCTAAAACAAAAGAGGCAGGATATCAAGATTGAAAACCCTCGAGCGATTAGATATAGATATCCATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTGGTAAATATAAAATGCATTTCACGCTTGCCTGCCCTCGACTATGTCCTGCTAGAACCAGTTTGATAGTTTTGATTCCTCTTATAGTATTTTGTGCTAGATGCATAATTGGTGCCTCTTATGCTAGAGTCATCGGAACATCGAAGCTTAAAACCGCTAATAAACCAGAGGGAGAACGTCACAAGTTCAGAAGCGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTCTGAGTCATCCATAGATTATACAAGTCCTTCAGAAGAAGAGATCTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGAGTGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTACACAGAGACCTGAATAG
Coding sequence (CDS)
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCACAACCAACTTTCCCTTCTCTAACTCTCAGAAATTGGGCAAACACCAAAAGATCATTGAACAATCAATTCAGAACCATTAAGCTGAGATGGAGGTTTCAGCTTCAGGATATGTCCAAAGATCAACTCTCCCCCAAGCACCACTTTGTTCATATTGTCGAAGGGGGCGAAAGCTTGAGTTCGGTTTTGAACCAGAATGGAGTTCCTTCACATTCCATTGTCATATCTAGTAGGAAGATAATGGACATGGATCTAAAACAAAAGAGGCAGGATATCAAGATTGAAAACCCTCGAGCGATTAGATATAGATATCCATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTGGTAAATATAAAATGCATTTCACGCTTGCCTGCCCTCGACTATGTCCTGCTAGAACCAGTTTGATAGTTTTGATTCCTCTTATAGTATTTTGTGCTAGATGCATAATTGGTGCCTCTTATGCTAGAGTCATCGGAACATCGAAGCTTAAAACCGCTAATAAACCAGAGGGAGAACGTCACAAGTTCAGAAGCGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTCTGAGTCATCCATAGATTATACAAGTCCTTCAGAAGAAGAGATCTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGAGTGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTACACAGAGACCTGAATAG
Protein sequence
MEVKLSQRNRADRFSLLPQPTFPSLTLRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSEEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Homology
BLAST of Tan0022441 vs. NCBI nr
Match:
KAA0039254.1 (uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa] >TYK00440.1 uncharacterized protein E5676_scaffold169G00440 [Cucumis melo var. makuwa])
HSP 1 Score: 391.0 bits (1003), Expect = 8.5e-105
Identity = 206/276 (74.64%), Postives = 229/276 (82.97%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D DL+QK Q+IKI+N RA
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDLEQKGQNIKIQNSRAVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASY 180
IR + L+EKLQS+LNGL YK F LA RL PAR TS IVL+PL++FC RCIIGASY
Sbjct: 121 KIRDAFQLEEKLQSALNGLQIYKKLFALASSRLPPARTTSFIVLVPLVIFCTRCIIGASY 180
Query: 181 ARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVE 240
ARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPS +E+ISVE
Sbjct: 181 ARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSADEQISVE 240
Query: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 276
BLAST of Tan0022441 vs. NCBI nr
Match:
XP_008459633.1 (PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo])
HSP 1 Score: 389.8 bits (1000), Expect = 1.9e-104
Identity = 205/276 (74.28%), Postives = 228/276 (82.61%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D D +QK Q+IKI+NPR
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDPEQKGQNIKIQNPRVVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASY 180
IR + L+EKLQ++LNGL YK F LA RL PAR TS IVL+PL++FC RCIIGASY
Sbjct: 121 KIRDAFQLEEKLQNALNGLQIYKKLFALASSRLPPARTTSFIVLVPLVIFCTRCIIGASY 180
Query: 181 ARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVE 240
ARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVE
Sbjct: 181 ARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSEDEQISVE 240
Query: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 276
BLAST of Tan0022441 vs. NCBI nr
Match:
XP_038890844.1 (uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida])
HSP 1 Score: 386.7 bits (992), Expect = 1.6e-103
Identity = 206/276 (74.64%), Postives = 228/276 (82.61%), Query Frame = 0
Query: 1 MEVKLSQRNRADRF----SLLPQPTFPSLT-LRNWANTKRSLNNQFRTIKLRWRFQLQDM 60
MEVKLSQRNRADRF LLPQPT PSLT RNWANT++SL NQFR I LRWRFQLQD+
Sbjct: 1 MEVKLSQRNRADRFFLFPKLLPQPTLPSLTHHRNWANTRKSLKNQFRAITLRWRFQLQDI 60
Query: 61 SKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPR 120
SK+QLS KHH VHIVEG ESL+ NQNG P+HSI +++++I D DL+QK Q+IKI+NPR
Sbjct: 61 SKNQLSTKHHLVHIVEGSESLTLNPNQNGDPTHSISVANKRIKDTDLEQKGQNIKIQNPR 120
Query: 121 AIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASY 180
AIR Y L+EKLQS+LN L YK F LA L PAR TS IVL+PLIVFCARCIIGASY
Sbjct: 121 AIRDVYQLEEKLQSALNELRNYKKLFALASSHLPPARTTSFIVLVPLIVFCARCIIGASY 180
Query: 181 ARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVE 240
ARV GTS+L+T +K EG+ HKFRSGHWRSALRDIRELDGLD ES ID SPSE E+IS E
Sbjct: 181 ARVFGTSRLETVDKREGKHHKFRSGHWRSALRDIRELDGLDCESPIDSMSPSEDEQISDE 240
Query: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
DLSH YKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 DLSHDYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 276
BLAST of Tan0022441 vs. NCBI nr
Match:
XP_008459632.1 (PREDICTED: uncharacterized protein LOC103498697 isoform X1 [Cucumis melo])
HSP 1 Score: 383.3 bits (983), Expect = 1.8e-102
Identity = 203/284 (71.48%), Postives = 226/284 (79.58%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D D +QK Q+IKI+NPR
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDPEQKGQNIKIQNPRVVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCA 180
IR + L+EKLQ++LNGL YK F LA RL P TS IVL+PL++FC
Sbjct: 121 KIRDAFQLEEKLQNALNGLQIYKKLFALASSRLPPVNFFFSPEARTTSFIVLVPLVIFCT 180
Query: 181 RCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS 240
RCIIGASYARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPS
Sbjct: 181 RCIIGASYARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPS 240
Query: 241 E-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
E E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 284
BLAST of Tan0022441 vs. NCBI nr
Match:
XP_011656102.1 (uncharacterized protein LOC101208955 isoform X2 [Cucumis sativus] >KGN52688.1 hypothetical protein Csa_009213 [Cucumis sativus])
HSP 1 Score: 375.6 bits (963), Expect = 3.7e-100
Identity = 202/277 (72.92%), Postives = 225/277 (81.23%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSK 60
MEVK+ QRNRA RFSLLP T PS L+L NWANTK+S NNQ R I LRWRFQ L D+SK
Sbjct: 1 MEVKVWQRNRAHRFSLLPHSTPPSLILSLSNWANTKKSFNNQLRGIALRWRFQLLADISK 60
Query: 61 DQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENP--- 120
QLS KHHFVHI+EG ESL+S NQNG P HSIV++++KIMD DL+QKRQ+IKI+NP
Sbjct: 61 HQLSTKHHFVHILEGNESLTSTSNQNGDPPHSIVMANKKIMDTDLEQKRQNIKIQNPREV 120
Query: 121 RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGAS 180
R IR + L+EKLQS+LNGL YK F LA PAR TS IVL+PL++FCARCIIGAS
Sbjct: 121 RKIRNAFQLEEKLQSALNGLRIYKKLFALASSHQPPARTTSFIVLVPLVIFCARCIIGAS 180
Query: 181 YARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISV 240
YAR GT KLK +K EGER KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISV
Sbjct: 181 YARAFGTLKLKAIDKQEGERRKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSEDEQISV 240
Query: 241 EDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
E+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 EELSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 277
BLAST of Tan0022441 vs. ExPASy TrEMBL
Match:
A0A5A7T6Z4 (LysM domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G00440 PE=4 SV=1)
HSP 1 Score: 391.0 bits (1003), Expect = 4.1e-105
Identity = 206/276 (74.64%), Postives = 229/276 (82.97%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D DL+QK Q+IKI+N RA
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDLEQKGQNIKIQNSRAVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASY 180
IR + L+EKLQS+LNGL YK F LA RL PAR TS IVL+PL++FC RCIIGASY
Sbjct: 121 KIRDAFQLEEKLQSALNGLQIYKKLFALASSRLPPARTTSFIVLVPLVIFCTRCIIGASY 180
Query: 181 ARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVE 240
ARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPS +E+ISVE
Sbjct: 181 ARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSADEQISVE 240
Query: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 276
BLAST of Tan0022441 vs. ExPASy TrEMBL
Match:
A0A1S3CBV5 (uncharacterized protein LOC103498697 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498697 PE=4 SV=1)
HSP 1 Score: 389.8 bits (1000), Expect = 9.2e-105
Identity = 205/276 (74.28%), Postives = 228/276 (82.61%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D D +QK Q+IKI+NPR
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDPEQKGQNIKIQNPRVVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASY 180
IR + L+EKLQ++LNGL YK F LA RL PAR TS IVL+PL++FC RCIIGASY
Sbjct: 121 KIRDAFQLEEKLQNALNGLQIYKKLFALASSRLPPARTTSFIVLVPLVIFCTRCIIGASY 180
Query: 181 ARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVE 240
ARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVE
Sbjct: 181 ARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSEDEQISVE 240
Query: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 DLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 276
BLAST of Tan0022441 vs. ExPASy TrEMBL
Match:
A0A1S3CB50 (uncharacterized protein LOC103498697 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498697 PE=4 SV=1)
HSP 1 Score: 383.3 bits (983), Expect = 8.6e-103
Identity = 203/284 (71.48%), Postives = 226/284 (79.58%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKD 60
MEVK+ QRNRA RFSLLP PT PSLT LRNWANTK+S NNQ R I LRWRFQL D+SK
Sbjct: 1 MEVKVRQRNRAHRFSLLPHPTLPSLTLSLRNWANTKKSFNNQLRGISLRWRFQLLDVSKH 60
Query: 61 QLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA-- 120
QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+D D +QK Q+IKI+NPR
Sbjct: 61 QLSTKHHFVHILEGNESLASISNKNGDPPNSIVIDNKKIVDTDPEQKGQNIKIQNPRVVS 120
Query: 121 -IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCA 180
IR + L+EKLQ++LNGL YK F LA RL P TS IVL+PL++FC
Sbjct: 121 KIRDAFQLEEKLQNALNGLQIYKKLFALASSRLPPVNFFFSPEARTTSFIVLVPLVIFCT 180
Query: 181 RCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS 240
RCIIGASYARV GT KLK NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPS
Sbjct: 181 RCIIGASYARVFGTLKLKAINKQEGERHKFRSGHWRSALRDIRELDGLDCEAPIDSTSPS 240
Query: 241 E-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
E E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 284
BLAST of Tan0022441 vs. ExPASy TrEMBL
Match:
A0A0A0KSX5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G650530 PE=4 SV=1)
HSP 1 Score: 375.6 bits (963), Expect = 1.8e-100
Identity = 202/277 (72.92%), Postives = 225/277 (81.23%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSK 60
MEVK+ QRNRA RFSLLP T PS L+L NWANTK+S NNQ R I LRWRFQ L D+SK
Sbjct: 1 MEVKVWQRNRAHRFSLLPHSTPPSLILSLSNWANTKKSFNNQLRGIALRWRFQLLADISK 60
Query: 61 DQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENP--- 120
QLS KHHFVHI+EG ESL+S NQNG P HSIV++++KIMD DL+QKRQ+IKI+NP
Sbjct: 61 HQLSTKHHFVHILEGNESLTSTSNQNGDPPHSIVMANKKIMDTDLEQKRQNIKIQNPREV 120
Query: 121 RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGAS 180
R IR + L+EKLQS+LNGL YK F LA PAR TS IVL+PL++FCARCIIGAS
Sbjct: 121 RKIRNAFQLEEKLQSALNGLRIYKKLFALASSHQPPARTTSFIVLVPLVIFCARCIIGAS 180
Query: 181 YARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISV 240
YAR GT KLK +K EGER KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISV
Sbjct: 181 YARAFGTLKLKAIDKQEGERRKFRSGHWRSALRDIRELDGLDCEAPIDSTSPSEDEQISV 240
Query: 241 EDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 270
E+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Sbjct: 241 EELSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE 277
BLAST of Tan0022441 vs. ExPASy TrEMBL
Match:
A0A6J1CKQ6 (uncharacterized protein LOC111012032 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111012032 PE=4 SV=1)
HSP 1 Score: 357.8 bits (917), Expect = 3.9e-95
Identity = 197/272 (72.43%), Postives = 213/272 (78.31%), Query Frame = 0
Query: 1 MEVKLSQRNRADRFS----LLPQPTFPSLTLRNWANTKRSLNNQFRTIKLRWRFQLQDMS 60
ME+K+SQRNRADRFS LLPQPT PS T R WA KRS NQF + LRWRFQLQD+
Sbjct: 1 MELKVSQRNRADRFSLLPKLLPQPTLPSQTHRTWAYAKRSSKNQFGAVALRWRFQLQDIP 60
Query: 61 KDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIKIENPRA 120
+DQ KHHFV IVEGGE+ +S+L QNGV +HSIVI +RKI D DL+ K QD KI NP A
Sbjct: 61 RDQSFTKHHFVRIVEGGETFTSILKQNGVSTHSIVIVNRKIGDTDLEHKGQDSKIRNPLA 120
Query: 121 IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYA 180
IR Y LQEKLQSSLNGL YK F PRL PAR TS IVL+PLIVFCARCIIGASYA
Sbjct: 121 IRDVYQLQEKLQSSLNGLQNYKKLFLHVSPRLPPARTTSFIVLVPLIVFCARCIIGASYA 180
Query: 181 RVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSID---YTSPS-EEEIS 240
RV TSKLKT +K EGE HKFRSGHWRSALRDIRELDGLDSESS D SPS +E+IS
Sbjct: 181 RVSKTSKLKTIDKSEGEHHKFRSGHWRSALRDIRELDGLDSESSTDPSVSNSPSVDEQIS 240
Query: 241 VEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG 264
VEDLSHAYKKLD+DYEKFLSECGLS GYWRG
Sbjct: 241 VEDLSHAYKKLDKDYEKFLSECGLSNCGYWRG 272
BLAST of Tan0022441 vs. TAIR 10
Match:
AT4G09970.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 13 growth stages; Has 15 Blast hits to 15 proteins in 6 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 13; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 60.1 bits (144), Expect = 3.2e-09
Identity = 60/236 (25.42%), Postives = 105/236 (44.49%), Query Frame = 0
Query: 38 NNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVIS 97
+ +F+ R RF +Q MS+++ KH + ESL +L Q GV P S +
Sbjct: 26 SKRFKRASERCRFCVQQMSENEQRTKHELARSAKRSESLRRILKQYGVSVENPEES--KT 85
Query: 98 SRKIMDMDLKQKRQDIK---IENPRAIRYRYPLQ-----EKLQSSLNGLGKYKMHFTLAC 157
S ++ D+D ++K + I++ + L EK ++
Sbjct: 86 SSRLDDLDCEEKHDAVTSSVIDDDSKMNTTEELPDLRRVEKYTETVGTADNLSGQNHQIL 145
Query: 158 PRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSAL 217
P L L L+P++ FC CIIG + + ++ + H S WR+AL
Sbjct: 146 PHL-NTGVLLTSLLPVLGFCIICIIGTLHTII---------SRKTSQGHHHGSERWRTAL 205
Query: 218 RDIRE---LDGLDSES-SIDYTSPSEEEISVEDLSHAYKKLDQDYEKFLSECGLSK 258
D E DG DS S S ++E + ++++ AY +++ +Y++FL ECG+ +
Sbjct: 206 MDWNEPLASDGHDSMSPEYRVASTNQEATATDEMNEAYSRVELEYKRFLLECGVGE 249
BLAST of Tan0022441 vs. TAIR 10
Match:
AT4G09970.2 (unknown protein; Has 13 Blast hits to 13 proteins in 5 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 11; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 53.1 bits (126), Expect = 3.9e-07
Identity = 56/224 (25.00%), Postives = 98/224 (43.75%), Query Frame = 0
Query: 47 RWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVISSRKIMDMDL 106
+W F +Q MS+++ KH + ESL +L Q GV P S +S ++ D+D
Sbjct: 4 KW-FCVQQMSENEQRTKHELARSAKRSESLRRILKQYGVSVENPEES--KTSSRLDDLDC 63
Query: 107 KQKRQDIK---IENPRAIRYRYPLQ-----EKLQSSLNGLGKYKMHFTLACPRLCPARTS 166
++K + I++ + L EK ++ P L
Sbjct: 64 EEKHDAVTSSVIDDDSKMNTTEELPDLRRVEKYTETVGTADNLSGQNHQILPHL-NTGVL 123
Query: 167 LIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGL 226
L L+P++ FC CIIG + + ++ + H S WR+AL D E
Sbjct: 124 LTSLLPVLGFCIICIIGTLHTII---------SRKTSQGHHHGSERWRTALMDWNEPLAS 183
Query: 227 DSESSIDYTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSK 258
D S+ SP E + ++++ AY +++ +Y++FL ECG+ +
Sbjct: 184 DGHDSM---SPEYREATATDEMNEAYSRVELEYKRFLLECGVGE 211
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAA0039254.1 | 8.5e-105 | 74.64 | uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa] >TYK00... | [more] |
XP_008459633.1 | 1.9e-104 | 74.28 | PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo] | [more] |
XP_038890844.1 | 1.6e-103 | 74.64 | uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida] | [more] |
XP_008459632.1 | 1.8e-102 | 71.48 | PREDICTED: uncharacterized protein LOC103498697 isoform X1 [Cucumis melo] | [more] |
XP_011656102.1 | 3.7e-100 | 72.92 | uncharacterized protein LOC101208955 isoform X2 [Cucumis sativus] >KGN52688.1 hy... | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7T6Z4 | 4.1e-105 | 74.64 | LysM domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... | [more] |
A0A1S3CBV5 | 9.2e-105 | 74.28 | uncharacterized protein LOC103498697 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CB50 | 8.6e-103 | 71.48 | uncharacterized protein LOC103498697 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A0A0KSX5 | 1.8e-100 | 72.92 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G650530 PE=4 SV=1 | [more] |
A0A6J1CKQ6 | 3.9e-95 | 72.43 | uncharacterized protein LOC111012032 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT4G09970.1 | 3.2e-09 | 25.42 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 18 plant structures; EXP... | [more] |
AT4G09970.2 | 3.9e-07 | 25.00 | unknown protein; Has 13 Blast hits to 13 proteins in 5 species: Archae - 0; Bact... | [more] |