Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGCCGTCTTGGCGCCCTAATTCATGAACACGGGGAGATTTTGGTTGGCGAAAGAAATCACGATCACTGTCGATGGGCGGCGCTGCTAAATCCCAATACCAATCTTAAACCCTTACGTTTTCTGTTCATCAACTAAACTAAACTCTTCGAAACTGTTCCTTTCCATTTCGTGCTTTTTTTTCTCTCAGTGAAGTCTAGATGAGTAGATCAATCGGAAGAAAGGTTCCAGGATTTTCACTTCTATCCAATGCCAACAAACTAGGCGTTGTGCCTTTTTCTTCTTCTTCTTCCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGCCTTTCCTTCCGGACCCTTTGATTTCACTCCTCCAGTCCCCAGTCAAGAACATCCAAATGCCTCTAAACAAGAACCTATAGATTCTCGTCCTACTCCTGGGCTTGGTCATGGCCGTGGTATACCAACTCCTTCCTCCCCAATTCGTCCATCTTTCTCTTCCTTTTCGCCCTCTGTCAGGCCCTCGTCTGTTGGTCGGGGTAGAGGTGATGCTTCGCCATCGATTCGGTCTCCTCCCGAGCCAGATTCAGAGCCTAAGAAACCTGTGTTTTTCTCGAGGAATAATGCAGGGGACTCTGCTGCAAGTACTTCACTTGGCGGGTTACACAGGGTTTCGGGAGAGAGAAACTTGCCTGATTCTTTGCATTCTGGGTTTTCTGGTGTTGGACGAGGAAAACCCATGAAGCAACCAGTCCCGGAAGATCAACCAAAGCAGGAAAATCGTCATCTTAGACCTAGACAAGAGGGGGATGGCCGTGGAGCTGGCGGGCGTGGAAGGGCTCGTGGCGTTGAACCAAGGATAGGCCGTGGTGAACCATGGAGGAACACTAATAGGATGGCGTCAAGGGGCGGGCCTGATGGTGAAGTTGGTGGTGGTCGAGGAAGTAGCGGTTACCGGGGCAGAGGCGTCAGAGGGCCGTTCAGACGGGGACCAAGGGGGTCATTTAGAACTGGGGAGAGATGGGACAGAAGAAGTGGTCAAGATAAGGAGGATGGATATGCTGCTGGACTTTATTTAGGCAACAATGAAGACGGTGAGAGGCTGGCAAAGAAGGTTGGTCCTGAAATTATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGGTAGAGTGCTGCCTTCGCCATTGGAGGATCGGCTTTTGGACGGGATGGATATCAATTTTATGGTGAACATTTCCTGTGTCTTTTTTCTTTTTTGTCCCCGTTTTTCCTCTCTATGGACATATGGACTAAGGATTAACTCTAAACTGTTTTGACATAATTGAAAAGTTTACTGTTTTCGATGTTTGTGATCGGGTGACGAATTAAATCGTGTGCTTGGTTGAATTATCACAAACCTGATTTTGTTCAGTTCTTTCATGAAAAATGTTGAGATTGGATCAATGCTCGGTTATGACTCTCTTTCTGTATGGCAGATCGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAACCCTGATATTGATGAGAATCCACCAATTTCTCTTCGGGATGCATTTGAGAAGATGAAACCATTTTTAATGGCGTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGTATCTTTTTCTGTTATGACTTCATGTTTTCGTTCCTTTGTTCCTGCCAACGTCTTTTCTACTTGTTAATTTGTGCTTTGGATCTGACTGAAGTATTTATGTGTGTTATTTTTGTGGTTTTAGTGTACTGATGAAAAGATCCACTGTACTCTGTTTACGTTGGTTCTTCCTGGATTTTCCCTATTATTACATATTATATTTATCTAATACAGGTAGAGATGCAAAAGTGTTACAAAGTATATAGACTACTCACATCAATTTCTTTTACTTAGGGTCTGTTTAGTTCGCAATCTGGATTATGGATTCTGTTTAATGTTTTCAGATTCACTAAGGCAGATAACTCTACTTCTGTTTTTGAAAACTATTTTTGGATTCTGTGATCTAAATAGTACAAATTTTGAAAACAACATTTTTATGGTTTCTGGTTATGAACCCCTTTTTTAAAAAAATCAATATATGTATCATGTAATGAATTATATGTTAAATTACATTAAAGTCAAAGTTTAGTCCCTAGGACTTATGATAAAGTCAAAACTTAAAAAGTTATTTTCATATGCAGTCCCTATTGTTTGTTAATTCTCTTAATTAGTTCCTACCTTAGGGATTATTTATTACGAAACTTTTATCAAACCATAGGGGGTATATTCTAATTATAAACCATAAGACCTAAACTTTAACTTTCCAAACTATATGGACCAAAATTTCAAAATTTCTTATAAATTATGTAATATTGGAAAAAAAATTATACAATCTTTTTTAACATAAAAACATGTTCATTAATACATTAATGATAATTTATTATCAACTAATATTTATTGGACAGCTACAACTAGTTTACAATTGTTTAAATCAGAATCCAATTTTTGAGATCTGCTACCAAATACATGTATGAAAACATGAAATACAATTTTATTTTTGTTTAATCCCTTGTTTTCGAAATTTATGTTTTCAGATTCTTTACCAAACAAGCCCTTTATTGCTCTTGTCTTGTTTATTTCACGACAATAGTGTTTAAGCTGACATTTTGGAGAATCACATTCCCAAGTTTTCTTAGGGGTTTTATTGAGAAAATTGAACACTGTACAAACGAAAGGACTAGATGCTAAAAAGAGCCTAAATTAGAGTAGCCCACAAGAACAAAATAAGGGAATTACTGGTAGTGACTGCAGTTGTTAGAATGAGAAAACAACGGGAAGTTACAGAAAAGCTCTATGCAAACACTTTGACTATGAGGGATTATCGATTCTAATATCAATGAAACCTATATATGTTTTATCCCTGAAAAAGTGAACGCAGGGGATTGAGAAACTTTAATCCATGAACCATGTTAGCAGCTTCTTATGGGATTGATTATTTTGCCTAGGTGTTGACATATACATTGAAAAAGGTTCTGTCTAAGAACACCTCAAAGTTTGAAGATGATTTTATGATGAGAGAAGTTATGGATGGGGTCCTTGTTAATGATACTATTGAGAACTATATAACTTGTTAGTGAGAAAGAATCGTTTTCAAGATTGATTTTGAGAAAGACTATGATCGTGTGGATTTGGGTTTCTTGGATAAGGTGGTTTGAATGTACATTTTGGGCTTGGTTCAAGGTGAGATCTTGGATATGGAATTGCATTAGGTTTGTTGACCAATCTATTTCTGGTAAATAGTAAGCCTAGGGGTGGAGTTTTTGCCTCTACTGGGACTGGGATTAGGCAAGAGACATTATGCCTTATTGGAACAGGGTTTCGTTTTGGTTTCCACATGGGCGAAGAGTGGATTCACCATAAATAAATAAAAAAATAATAGGACTTGTATAACCAAAAAAGAGTGAGCAACTCAAGCATCAGGAAGGAGAAAACCCTATCCAAAGGAGGACTATGCGCATCCTTCCAATGTGTAAAATCACGAGGGTAAAATTTCTGGTGGTTGGAACACCACCAAGAAGCTGTACTGTATACATAATTCCAAAATAAAAAGACGTGTAGAAGGGTATTAAAAAAATCCTCTAATTTCTTTCCTTCCAAACGATCCATGAAAAAGTACTATTGGTGCAGCTCCATGTTACCTTGGCTCTAGCCGTACTTTTCAGCCTTACCTGAGAGTCCAGCCACTTCAATACGTAATAACCAATCCTTAATTCCATTTGGAAGACACATTGACAAATGAAATTTCTAGGCAACCTCATCCACCCTCAAAATAATAAGGCAATGGGTGAAAAACTCATTATTTAACAATGGTTGCAGATCAAATTGAAGGAGAAATTACCCACCCAGAGGATTTTCTTTGGACTCTATTTGGCTCCATGTTGATACTTCTATACCAAAGAGTCCGAAAAGAAAACTTAACTTTCTTTGGGATTTTAAACTTCCATATCGTCTATACCATGAATTCAACCTTTTGGTTTTTAGTTTTTTGAGGACATTTTTTCTGGCTTGAGTTGCAATGCACTACTAATTATGAAAATTTCCCCCCTTTTTAGATTTGTATGGTATTCAAGGTCAATTGGATAAAGACTCAACTGGTTGGTTGGTATTAAGTATTATTTTGTTAAGTTGGATAGACGGAGTTTTAAGTTGGTGTCATTCTCACTAATTGCCTCAATCCCCCATTCACCTAATTGGTTTATATATTCAAGAACATTAGGAAGTTCACTTGTACCAAAAGGATTAAATTCCTCATATGATCCATCACACACTCTTGTTTTAATACTATGGACATGCTCCAAGAAAGATGCCCTTGGATCCATCTATATCCATCTTGGTGTTGTCTTTACAAAAAGAAAATGAATCCCAGAACCATAATTTTCAAGACGAAGAAAAATCAGCTGGAGAAGTTCTTGAATTGGCTATTTTCAATGTTCTTTCTTTTTGGTATAAAAATTTACAGCCTTTACAAAGTAATAGTTTGAATTTTTTGTAACAAATTGAAGACATCTTTTGGGCCCCTTTTGTATCTTCTTTGATATTTCATTTAATGAATGAAATTCTTTCTTATCCAAAGAAAAACTAATTGCCTCGGTCTTTTTGGTCTTTGTTCCAATTGTTTTATACAAAGAAAGACTAATGTGCCCCTTTCCTTAGTTGGGTTGCCTTTTCTGAGCTTGGTATTTTGTGTTCCCTTGTAATTTTGCATTTTTTCGAATAAAAATTATTTAAAAAATCACCTTCTAGTAGTTCCAAGGTGGAAAAGATGATAAGAGATATTTTATCATTGGGTGTGGAAGAGGACTGGGCTGGGTGTTCCCATTCAGTTAGGTGTGGGCGGTTATTTTAAGCTTAATTGGGCATTGGGCAAGGTCTGAGGCTGAGCATTTGGACATGAAAATCTTGAGGTCTTTAGTACACCCTTGTTGGCCAAGCAGATGTGGCATTTCCCTTTTAAGTTCAATTTTGTGTCTCATAGGATTATCGTCTGTTTTGTTCCTTTTGCTAAGTTTTTGCAATTTTTCAATTGGGTACACTTGCTTGTTCTGTAGGACGGATCTTAGGTAGGGGATGGTTCATCACGCCTTTGATTCCTTCACTTGTATCATTTGTCTTCTATGAAGTTTGCTTGGTAGCTTCCATTATTCCTTGTATCTGGATTTCCATTAATGTTCTTTTTTCTTGATAAGAAACAATATCCATTGATGTATGAAATTACAAAAGAATGGCACCAGCCCAAGCCAAAGGAGTTACTAGAGACTTCTCCAATTGGTCAAAGGAGATGTAAGGTTGTAAGAAATATAGAAAGATGTACACTTGCACCAAGACAGTAGTAAGATAGATAATATGTTAAAATAAACTGCGAAAAGAGCTTCTTTTTTCATTGAAGAGGCGGTCGTTTCTCTCCTTCCATAAAATCCATAAGAAAGCTCGAATCATATGCATCCACATTATCCTCCTTTTGTCTCTGAAAGGGTGGCGCATGAGTCTGCAAGTGAGGAGGCTCACAAGCTTGTTGGGAAGAGTTGTGGACCCTTAGAGGGAGAGGAATACCTCCTTCCATAATCTTCAAAAACACAATTGATGTAAAGGCGACTCAGTGTCTTGTTTACGGCATGACAAATATACAACAATGTGAGGAGAGAGCCATGTGAGGGAAGGGCCGGCCACTTTATTTTGTCACAAATGTTGGTGACGTTGTGGCTAATTTCCCAAAAGAAAGAATTTTACCTTTATCACTAAATATCATGTGTCTGGTTCTTATTAACAGGTTATGCCTTCTGATGATTGGGAGGTGTTGTATTGCCAGCCATTTTGGAATAAGAGAGTGATATTTGATATGATTAAACTGACTCTAGCTTTGTTAATCTCTGATAAATTTTGTGCTTATTTCTAGATAATTTGCTTCCTTAATAATAAGATGTTGGTCTACAGTCGTTTGATTGTGATTTTATCTAGTGATTAAGATTTTTGTGTTGGAGATGTTATAAATAAACTTATATTTACACCTACTACTTTATTTTGATAATTTTCAAATATTGCTTTCCCTTGGCTTTAGAGTTAGAGAAGGTCACACCTGAAGATTTATACACTTCATCTTTCTTACAAAAGAAAAAAAAACTTGAAAATGTTTTCTTCTTGGCTAGTGCATTAATGGGCGTGGTGAGGTTACTAAATTTAAAGTTTGGCCTAGAAGCAACTTTTTTGAGGTTTCAAAACTCCAGGAATTAGGTATTTTGTTGTTGAGTGAGGTTATGTATGACTGGAAGTCCATACTATTCTCGACATTTAGTTGTGTTTTGCATTGAGCTCTTTGAAACTCAATATGAGTTCGAATTTTATAAGCATCCTCTTGGATTGTCATTCATTGATCTCTCTCTTGGTCTTTACCCTTTTCTTTTCTTGTTTATTTTAATTCCTACTAGAGAGGCTTTCTTATCTCCTCTTGGGTGGGAGGATAACATGAAATATGATACGCTCCTTATAAGATGGTTCTTCTTCTTCTAACTTTTTTCATTTGGCTGTACAAGGAAATCGTGGAAGAAACCATGCAAAGCGTCCCATTGATGAAGGAGATAGTTGACGCTTACAGTGGACCAGATAGAGTAACTGCGAAGGAACAACAAGGGGAGCTGGAAAGAGTTGCAAAGACACTTCCACAAAGTGCACCTAATTCCGTAAAGCAATTCACCAATCGTGCTGTTCTTTCTTTGCAGGTTAGTGAAGCCACCTTAATGACTTATTATTCTGAATTATGTTTTAGAAACGAAAATAGGAAAAAAAATTTTTAAAATAAAATCCAAGAAGCAATTCTAAGCTGGAAACATACGTTACAAATCACTATGGTGTGTTTACTGTGAAGTATATGCCAAACTTGTAATAATCAACATCCATGCATGCCATACTTTACATATTTTATGAGTAACCGTTACAATGAAGCAGAATGTATGACAGATAATAGTACATACAATTTAGTGTTGCTATCAAATATATAGTGCAATTAATTTAGAGTTAATAGTATTGAATGTCTTACAACAATTTATCATTTCTATGTCGAAGCTATTGCAAATAATTAATGTTTGGTAGAATTGTATAATTGCATCCTTATTGGGTATGTATTGCATACCTACAAGTTTCATTGATAGGAATAAATTGTTTGGATGTGGTGATGTAAAGGGAACAAGTAAAATTCATAGATCAATGAAATTGTTTCAATCTTACTTGAACATAATGTATCATATTCTATTTGTTGTACAATGTACAACTGTGTCCATGAGTTCTAATGAAATTGTTTTTCTTATAAAAAAGAAAGAATTGAAAAAGAAAAATGAAAAGAGAACTAAGTATTTGTGCAGAACGAAGACATGCGATGCTATTCTGTCCTCTTTGGTTTGAGTGTTTAGGGGAACTAGGTACTGATTTGTCTCTTATCAATGCGAAGCCCTTTAACTTGCATTGTTACTGAGCAATGGAAAATTCTGAGGAAAGAGCATTGGGTAGGAAGATGAATGACGAGGTTTTGAAACTGGCACAAAACTATGAAATGATGTAGGCTTAAATGCATCAGATACAGTGCAACTAGAATGGAAAAAAAAAAAAAAAGAGAAATTTCATTTGGTTCTGCTTCAATAATCTTGAATTGATATAAATGAACTGTTCCGTATTGACTTGATCTGTTTATGGAGCCAATATCATTTTACACAACTGTTGTGTAAACACTGGGCAGCTCAACTTAACGTTGTTATGAGCTACCTAGATCCTGAGTCAAAATGTGCAAGTTCTAGCCACTTTTTGTTTAGCCTCTGTACACTATAGGTTTAATTGACATTTCCTGTCTATGTTGCATATTAGTGAGTAATACATTAGGCTTTGACATTCAATATGAAGATAATGCTAATTTGTACAATTCTTCTATTTGCAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTTATGGACAAACTTGTTGGGGGGTTCTCCCAGCGATACAAGTAGATTTACTTGTTTCATCAAGGACATCAAGAAATTATAACTTACACTCAACTAAAAAAAAGCTACTTGCTAAATTGGTCTACATCTGTTCTTTAGTCTTGGGATTTAATTTTAGCTCATGAGATTTTCAGAAGTTTTGCACTTGACCTTTCTGTTGTGTCTCAATGATGGTCTTTTGTAATCAATATTGCTGCAATAGCCATTGTTTGAGTACTGCTTCATTTTTTTTTTGGGGAAGCAATAATTGCATTTGTTTTTATCAACATGATGCACTGAGAATAGTTCTTTTGTGTTGTACAAAATCTCTCATTTATTTGCTGCTCTTTCACTCTC
mRNA sequence
TGGCCGTCTTGGCGCCCTAATTCATGAACACGGGGAGATTTTGGTTGGCGAAAGAAATCACGATCACTGTCGATGGGCGGCGCTGCTAAATCCCAATACCAATCTTAAACCCTTACGTTTTCTGTTCATCAACTAAACTAAACTCTTCGAAACTGTTCCTTTCCATTTCGTGCTTTTTTTTCTCTCAGTGAAGTCTAGATGAGTAGATCAATCGGAAGAAAGGTTCCAGGATTTTCACTTCTATCCAATGCCAACAAACTAGGCGTTGTGCCTTTTTCTTCTTCTTCTTCCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGCCTTTCCTTCCGGACCCTTTGATTTCACTCCTCCAGTCCCCAGTCAAGAACATCCAAATGCCTCTAAACAAGAACCTATAGATTCTCGTCCTACTCCTGGGCTTGGTCATGGCCGTGGTATACCAACTCCTTCCTCCCCAATTCGTCCATCTTTCTCTTCCTTTTCGCCCTCTGTCAGGCCCTCGTCTGTTGGTCGGGGTAGAGGTGATGCTTCGCCATCGATTCGGTCTCCTCCCGAGCCAGATTCAGAGCCTAAGAAACCTGTGTTTTTCTCGAGGAATAATGCAGGGGACTCTGCTGCAAGTACTTCACTTGGCGGGTTACACAGGGTTTCGGGAGAGAGAAACTTGCCTGATTCTTTGCATTCTGGGTTTTCTGGTGTTGGACGAGGAAAACCCATGAAGCAACCAGTCCCGGAAGATCAACCAAAGCAGGAAAATCGTCATCTTAGACCTAGACAAGAGGGGGATGGCCGTGGAGCTGGCGGGCGTGGAAGGGCTCGTGGCGTTGAACCAAGGATAGGCCGTGGTGAACCATGGAGGAACACTAATAGGATGGCGTCAAGGGGCGGGCCTGATGGTGAAGTTGGTGGTGGTCGAGGAAGTAGCGGTTACCGGGGCAGAGGCGTCAGAGGGCCGTTCAGACGGGGACCAAGGGGGTCATTTAGAACTGGGGAGAGATGGGACAGAAGAAGTGGTCAAGATAAGGAGGATGGATATGCTGCTGGACTTTATTTAGGCAACAATGAAGACGGTGAGAGGCTGGCAAAGAAGGTTGGTCCTGAAATTATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGGTAGAGTGCTGCCTTCGCCATTGGAGGATCGGCTTTTGGACGGGATGGATATCAATTTTATGATCGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAACCCTGATATTGATGAGAATCCACCAATTTCTCTTCGGGATGCATTTGAGAAGATGAAACCATTTTTAATGGCGTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCGTGGAAGAAACCATGCAAAGCGTCCCATTGATGAAGGAGATAGTTGACGCTTACAGTGGACCAGATAGAGTAACTGCGAAGGAACAACAAGGGGAGCTGGAAAGAGTTGCAAAGACACTTCCACAAAGTGCACCTAATTCCGTAAAGCAATTCACCAATCGTGCTGTTCTTTCTTTGCAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTTATGGACAAACTTGTTGGGGGGTTCTCCCAGCGATACAAGTAGATTTACTTGTTTCATCAAGGACATCAAGAAATTATAACTTACACTCAACTAAAAAAAAGCTACTTGCTAAATTGGTCTACATCTGTTCTTTAGTCTTGGGATTTAATTTTAGCTCATGAGATTTTCAGAAGTTTTGCACTTGACCTTTCTGTTGTGTCTCAATGATGGTCTTTTGTAATCAATATTGCTGCAATAGCCATTGTTTGAGTACTGCTTCATTTTTTTTTTGGGGAAGCAATAATTGCATTTGTTTTTATCAACATGATGCACTGAGAATAGTTCTTTTGTGTTGTACAAAATCTCTCATTTATTTGCTGCTCTTTCACTCTC
Coding sequence (CDS)
ATGAGTAGATCAATCGGAAGAAAGGTTCCAGGATTTTCACTTCTATCCAATGCCAACAAACTAGGCGTTGTGCCTTTTTCTTCTTCTTCTTCCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGCCTTTCCTTCCGGACCCTTTGATTTCACTCCTCCAGTCCCCAGTCAAGAACATCCAAATGCCTCTAAACAAGAACCTATAGATTCTCGTCCTACTCCTGGGCTTGGTCATGGCCGTGGTATACCAACTCCTTCCTCCCCAATTCGTCCATCTTTCTCTTCCTTTTCGCCCTCTGTCAGGCCCTCGTCTGTTGGTCGGGGTAGAGGTGATGCTTCGCCATCGATTCGGTCTCCTCCCGAGCCAGATTCAGAGCCTAAGAAACCTGTGTTTTTCTCGAGGAATAATGCAGGGGACTCTGCTGCAAGTACTTCACTTGGCGGGTTACACAGGGTTTCGGGAGAGAGAAACTTGCCTGATTCTTTGCATTCTGGGTTTTCTGGTGTTGGACGAGGAAAACCCATGAAGCAACCAGTCCCGGAAGATCAACCAAAGCAGGAAAATCGTCATCTTAGACCTAGACAAGAGGGGGATGGCCGTGGAGCTGGCGGGCGTGGAAGGGCTCGTGGCGTTGAACCAAGGATAGGCCGTGGTGAACCATGGAGGAACACTAATAGGATGGCGTCAAGGGGCGGGCCTGATGGTGAAGTTGGTGGTGGTCGAGGAAGTAGCGGTTACCGGGGCAGAGGCGTCAGAGGGCCGTTCAGACGGGGACCAAGGGGGTCATTTAGAACTGGGGAGAGATGGGACAGAAGAAGTGGTCAAGATAAGGAGGATGGATATGCTGCTGGACTTTATTTAGGCAACAATGAAGACGGTGAGAGGCTGGCAAAGAAGGTTGGTCCTGAAATTATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGGTAGAGTGCTGCCTTCGCCATTGGAGGATCGGCTTTTGGACGGGATGGATATCAATTTTATGATCGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAACCCTGATATTGATGAGAATCCACCAATTTCTCTTCGGGATGCATTTGAGAAGATGAAACCATTTTTAATGGCGTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCGTGGAAGAAACCATGCAAAGCGTCCCATTGATGAAGGAGATAGTTGACGCTTACAGTGGACCAGATAGAGTAACTGCGAAGGAACAACAAGGGGAGCTGGAAAGAGTTGCAAAGACACTTCCACAAAGTGCACCTAATTCCGTAAAGCAATTCACCAATCGTGCTGTTCTTTCTTTGCAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTTATGGACAAACTTGTTGGGGGGTTCTCCCAGCGATACAAGTAG
Protein sequence
MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPPEPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK
Homology
BLAST of IVF0018253 vs. ExPASy TrEMBL
Match:
A0A5D3CZK6 (Translation initiation factor IF-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G001880 PE=4 SV=1)
HSP 1 Score: 941.0 bits (2431), Expect = 1.9e-270
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP
Sbjct: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
Query: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP
Sbjct: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
Query: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP
Sbjct: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
Query: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG
Sbjct: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
Query: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV
Sbjct: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
Query: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP
Sbjct: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
Query: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG
Sbjct: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
Query: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 476
ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK
Sbjct: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 475
BLAST of IVF0018253 vs. ExPASy TrEMBL
Match:
A0A1S3BT69 (translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 SV=1)
HSP 1 Score: 941.0 bits (2431), Expect = 1.9e-270
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP
Sbjct: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
Query: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP
Sbjct: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
Query: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP
Sbjct: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
Query: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG
Sbjct: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
Query: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV
Sbjct: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
Query: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP
Sbjct: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
Query: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG
Sbjct: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
Query: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 476
ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK
Sbjct: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 475
BLAST of IVF0018253 vs. ExPASy TrEMBL
Match:
A0A0A0KVG1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G064010 PE=4 SV=1)
HSP 1 Score: 850.9 bits (2197), Expect = 2.6e-243
Identity = 434/478 (90.79%), Postives = 449/478 (93.93%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFS---SSSSGGHGRGRGRGAFPSGPFDFTPPVPSQ 60
MSRSIGRKV GFSLLSNANKLGVVPFS SSSSGGHGRGRGRGAFPSGPFDFTPPVP+Q
Sbjct: 1 MSRSIGRKVTGFSLLSNANKLGVVPFSSSFSSSSGGHGRGRGRGAFPSGPFDFTPPVPNQ 60
Query: 61 EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
EH NASKQEPIDSRPTPGLGHGRG PTPSSP+RPSFSSFSPSVRPSSVGRGRGDASPSIR
Sbjct: 61 EHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
Query: 121 SPPEPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQ 180
SPPEPDSEPKKPVFFS+NNAGDSAASTSLGGLHRVSGERNLP+SLHS FSGVGRGKPMKQ
Sbjct: 121 SPPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLPESLHSEFSGVGRGKPMKQ 180
Query: 181 PVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEV 240
PVPEDQPKQENRHLRPRQEGDG GAG RGR RG EPRIGRGEPWRNTNRM S+ GPDGEV
Sbjct: 181 PVPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGEPWRNTNRMVSKDGPDGEV 240
Query: 241 GGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLA 300
GGGRG+SGYRGRG RGP+RRG RGSFRTGER +RRSG DKEDGYAAGLYLGNNEDGERLA
Sbjct: 241 GGGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGHDKEDGYAAGLYLGNNEDGERLA 300
Query: 301 KKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDE 360
K++G E MN+LVEGFEEMSGRVLPSPL D+ LDGMD NFMIECEPEYLMGDFE+NPDIDE
Sbjct: 301 KRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFMIECEPEYLMGDFENNPDIDE 360
Query: 361 NPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKE 420
NPPI LRDA EKMKPFLMAYENIQSHEEWEEIVEETMQSVPL+KEIVDAY GPDRVTAKE
Sbjct: 361 NPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLLKEIVDAYGGPDRVTAKE 420
Query: 421 QQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 476
QQGELERVAKTLPQSAPNSVKQFTNR VLSLQSNPGWGFDKK Q MDKLV GFS+RYK
Sbjct: 421 QQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478
BLAST of IVF0018253 vs. ExPASy TrEMBL
Match:
A0A6J1IZW9 (uncharacterized protein LOC111479946 OS=Cucurbita maxima OX=3661 GN=LOC111479946 PE=4 SV=1)
HSP 1 Score: 672.9 bits (1735), Expect = 9.6e-190
Identity = 364/485 (75.05%), Postives = 391/485 (80.62%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPF--SSSSSGGHGRGRGRGAFPS--GPFDFTPPVPS 60
MSRSIGRKVPG S LSNANKLG VPF SSSSSGGHGRGRGR P+ GPFDF+ VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSGGHGRGRGRAGSPTHGGPFDFSSRVPG 60
Query: 61 QEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSI 120
QE N SK E +DSR T GLGHGRG P+PSS I PS SSF+PSV+ SS GRGRGD S I
Sbjct: 61 QEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQPI 120
Query: 121 RSPPE------PDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVG 180
RSPPE DSE KKPVFFS++NA DSA S G L R GERNLPDS S SG G
Sbjct: 121 RSPPESRSSNGSDSERKKPVFFSKDNAADSAGSARPGALDRDVGERNLPDSFLSVLSGAG 180
Query: 181 RGKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASR 240
RGKPMKQP+PE QPKQENRHLRPRQE GRG G GR G PRI R E RNT RM SR
Sbjct: 181 RGKPMKQPIPESQPKQENRHLRPRQEAGGRGGSGPGRGSGGGPRISRDESVRNTGRMMSR 240
Query: 241 GGPDGEVGGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNN 300
GGPDGE GGGRG G+RG RG FR RG+FRTGER +R GQD EDGYA+GLYLG+N
Sbjct: 241 GGPDGEDGGGRGRGGFRG---RGRFRGRGRGAFRTGERGERGRGQDMEDGYASGLYLGDN 300
Query: 301 EDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFE 360
DGE+LAK++G E MNQLVEG EEMSGRVLPSPLE+ ++ MD+N+MIECEPEYLMGDFE
Sbjct: 301 ADGEKLAKRIGTEHMNQLVEGXEEMSGRVLPSPLEEGYVEAMDMNYMIECEPEYLMGDFE 360
Query: 361 SNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGP 420
SNPDIDENPPI LRDA EKMKPFLMAYE IQSHEEWEEIVEETMQ VPL+KEIVD+YSGP
Sbjct: 361 SNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSYSGP 420
Query: 421 DRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGF 476
DRVTAK+QQGELERVAKTLPQSAPNSVK+FTNRAVLSLQSNPGWGFDKKCQFMDKLV F
Sbjct: 421 DRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLVREF 480
BLAST of IVF0018253 vs. ExPASy TrEMBL
Match:
A0A6J1FPB3 (uncharacterized protein LOC111447586 OS=Cucurbita moschata OX=3662 GN=LOC111447586 PE=4 SV=1)
HSP 1 Score: 668.3 bits (1723), Expect = 2.4e-188
Identity = 364/486 (74.90%), Postives = 389/486 (80.04%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPF---SSSSSGGHGRGRGRGAFPS--GPFDFTPPVP 60
MSRSIGRKVPG S LSNANKLG VPF SSSSSGGHGRGRGRG P+ GPFDF+ VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGRGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 SQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPS 120
QE N SK E +DSR T GLGHG G P+PSS I PS SSF+PSV+ S GRGRGD S
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGHGKPSPSSSILPSLSSFTPSVKSSFAGRGRGDGSQP 120
Query: 121 IRSPP------EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGV 180
IRSPP E DSE KPVFFS++NAGDSA S G L R GER+LPDS S SG
Sbjct: 121 IRSPPESRSSNESDSERTKPVFFSKDNAGDSAGSARPGALDRDVGERHLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMAS 240
GRGKPMKQPVPE QPKQENRHLRPRQE GRG G GR G PRI R E RNT RM
Sbjct: 181 GRGKPMKQPVPEAQPKQENRHLRPRQEAGGRGGYGPGRGSGGGPRISRDESVRNTGRMMP 240
Query: 241 RGGPDGEVGGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGN 300
RGGPDGE GGGRG G+RGRG G FR RG+FRTGER R GQD EDGYA+GLYLG+
Sbjct: 241 RGGPDGEDGGGRGRGGFRGRG--GRFRGRGRGAFRTGERGQRGRGQDMEDGYASGLYLGD 300
Query: 301 NEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDF 360
N DGE+LAK++G E MNQLVEGFEEMSGRVLPSPLE+ ++ MD N+MIECEPEYLMGDF
Sbjct: 301 NADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYLMGDF 360
Query: 361 ESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSG 420
ESNPDIDENPPI LRDA EKMKPFLMAYE IQSHEEWEEIVEETMQ VPL+KEIVD+YSG
Sbjct: 361 ESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSYSG 420
Query: 421 PDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGG 476
PDRVTAK+QQGELERVAKTLPQSAPNSVK+FTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 PDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLVRE 480
BLAST of IVF0018253 vs. NCBI nr
Match:
XP_008451827.1 (PREDICTED: translation initiation factor IF-2 [Cucumis melo] >KAA0062949.1 translation initiation factor IF-2 [Cucumis melo var. makuwa] >TYK16394.1 translation initiation factor IF-2 [Cucumis melo var. makuwa])
HSP 1 Score: 942 bits (2435), Expect = 0.0
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP
Sbjct: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSSGGHGRGRGRGAFPSGPFDFTPPVPSQEHP 60
Query: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP
Sbjct: 61 NASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIRSPP 120
Query: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP
Sbjct: 121 EPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQPVP 180
Query: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG
Sbjct: 181 EDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEVGGG 240
Query: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV
Sbjct: 241 RGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLAKKV 300
Query: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP
Sbjct: 301 GPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDENPP 360
Query: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG
Sbjct: 361 ISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKEQQG 420
Query: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 475
ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK
Sbjct: 421 ELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 475
BLAST of IVF0018253 vs. NCBI nr
Match:
XP_004147751.1 (uncharacterized protein LOC101215545 [Cucumis sativus] >KGN53518.1 hypothetical protein Csa_014849 [Cucumis sativus])
HSP 1 Score: 852 bits (2201), Expect = 4.24e-310
Identity = 434/478 (90.79%), Postives = 449/478 (93.93%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSS---SSGGHGRGRGRGAFPSGPFDFTPPVPSQ 60
MSRSIGRKV GFSLLSNANKLGVVPFSSS SSGGHGRGRGRGAFPSGPFDFTPPVP+Q
Sbjct: 1 MSRSIGRKVTGFSLLSNANKLGVVPFSSSFSSSSGGHGRGRGRGAFPSGPFDFTPPVPNQ 60
Query: 61 EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
EH NASKQEPIDSRPTPGLGHGRG PTPSSP+RPSFSSFSPSVRPSSVGRGRGDASPSIR
Sbjct: 61 EHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
Query: 121 SPPEPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPMKQ 180
SPPEPDSEPKKPVFFS+NNAGDSAASTSLGGLHRVSGERNLP+SLHS FSGVGRGKPMKQ
Sbjct: 121 SPPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLPESLHSEFSGVGRGKPMKQ 180
Query: 181 PVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDGEV 240
PVPEDQPKQENRHLRPRQEGDG GAG RGR RG EPRIGRGEPWRNTNRM S+ GPDGEV
Sbjct: 181 PVPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGEPWRNTNRMVSKDGPDGEV 240
Query: 241 GGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGERLA 300
GGGRG+SGYRGRG RGP+RRG RGSFRTGER +RRSG DKEDGYAAGLYLGNNEDGERLA
Sbjct: 241 GGGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGHDKEDGYAAGLYLGNNEDGERLA 300
Query: 301 KKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPDIDE 360
K++G E MN+LVEGFEEMSGRVLPSPL D+ LDGMD NFMIECEPEYLMGDFE+NPDIDE
Sbjct: 301 KRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFMIECEPEYLMGDFENNPDIDE 360
Query: 361 NPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVTAKE 420
NPPI LRDA EKMKPFLMAYENIQSHEEWEEIVEETMQSVPL+KEIVDAY GPDRVTAKE
Sbjct: 361 NPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLLKEIVDAYGGPDRVTAKE 420
Query: 421 QQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRYK 475
QQGELERVAKTLPQSAPNSVKQFTNR VLSLQSNPGWGFDKK Q MDKLV GFS+RYK
Sbjct: 421 QQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478
BLAST of IVF0018253 vs. NCBI nr
Match:
XP_038883040.1 (uncharacterized protein LOC120074102 [Benincasa hispida])
HSP 1 Score: 697 bits (1800), Expect = 5.35e-249
Identity = 375/481 (77.96%), Postives = 402/481 (83.58%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSS---GGHGRGRGRGAFPS--GPFDFTPPVP 60
MSRSIGRKVPGFSLL NANKLG VPFSSSSS GGHGRGRGRG PS G DFT PVP
Sbjct: 1 MSRSIGRKVPGFSLLPNANKLGFVPFSSSSSSSFGGHGRGRGRGDIPSHTGSSDFTSPVP 60
Query: 61 SQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPS 120
QE NASKQ+ + SRPTPGLGHGRG P+ SS PSF SFSPSV+PSS GRGR DASPS
Sbjct: 61 GQEDSNASKQDSLHSRPTPGLGHGRGKPSSSSSNLPSFPSFSPSVKPSSAGRGRVDASPS 120
Query: 121 IRSPPEPDSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGRGKPM 180
IR PPEP SE KKPVFFS++NAGDSAAST LG H+ GER LPD+L SGF+GVGRGKPM
Sbjct: 121 IRFPPEPVSELKKPVFFSKDNAGDSAASTRLGTPHKGVGERILPDTLLSGFTGVGRGKPM 180
Query: 181 KQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRGGPDG 240
KQ VPE QPK ENRH+RPRQEG GRGAG GR+RG + R EP RNT RM SRGGPDG
Sbjct: 181 KQQVPEAQPKLENRHVRPRQEGGGRGAGEPGRSRGGGQGMSRDEPGRNTGRMVSRGGPDG 240
Query: 241 EVGGGRGSSGYRGRGV-RGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNNEDGE 300
E GGGRG SG++ RG RG +R RG FRTG+R R QD EDGYAAGLYLG+N DGE
Sbjct: 241 EYGGGRGRSGFQSRGRGRGTYRGRGRGEFRTGDRGGRGRVQDTEDGYAAGLYLGDNADGE 300
Query: 301 RLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFESNPD 360
+LAK++GPE MNQLVEGFEEMSGRVLPSPLE+ LD M N+MIECEPEYLMGDFESNPD
Sbjct: 301 KLAKRIGPEHMNQLVEGFEEMSGRVLPSPLEEEYLDAMHTNYMIECEPEYLMGDFESNPD 360
Query: 361 IDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGPDRVT 420
IDENPPI LRDA EKMKPFLMAYENIQSHEEWEEI+EETMQ VPL+KEIVD YSGPDRVT
Sbjct: 361 IDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRVT 420
Query: 421 AKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGGFSQRY 475
AK+QQGELERVAKTLPQ+APNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLV FSQ+Y
Sbjct: 421 AKQQQGELERVAKTLPQTAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQKY 480
BLAST of IVF0018253 vs. NCBI nr
Match:
XP_023544535.1 (uncharacterized protein LOC111804080 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 675 bits (1742), Expect = 3.97e-240
Identity = 365/486 (75.10%), Postives = 393/486 (80.86%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSS---GGHGRGRGRGAFPS--GPFDFTPPVP 60
MSRSIGRKVPG S LSNANKLG VPFSSSSS GGHG+GRGRG P+ GPFDF+ VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGQGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 SQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPS 120
QE N SK E +DSR T GLGHGRG P+PSS I PS SSF+PSV+ SS GRGRGD S
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQP 120
Query: 121 IRSPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGV 180
IRSPPE DSE KKPVFFS++NAGDSA S G L +GERNLPDS S SG
Sbjct: 121 IRSPPESRSSNGSDSERKKPVFFSKDNAGDSAGSARPGALGGDAGERNLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMAS 240
GRGKPMKQP+PE QPKQENRHLRPRQE GRG G GR G PRI R E RNT RM S
Sbjct: 181 GRGKPMKQPIPESQPKQENRHLRPRQEAGGRGGYGPGRGSGGGPRISRDESVRNTGRMMS 240
Query: 241 RGGPDGEVGGGRGSSGYRGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGN 300
RGGPDGE GGGRG G+RGRG R FR RG+FRTGER R GQD EDGYA+GLYLG+
Sbjct: 241 RGGPDGEDGGGRGRGGFRGRGGR--FRGRGRGAFRTGERGQRGRGQDMEDGYASGLYLGD 300
Query: 301 NEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDF 360
N DGE+LAK++G E MNQLVEGFEEMSGRVLPSPLE+ ++ MD N+MIECEPEYLMGDF
Sbjct: 301 NADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYLMGDF 360
Query: 361 ESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSG 420
ESNPDIDENPPI LRDA EKMKPFLMAYE I+SHEEWEEIVEETMQ VPL+KEIVD+YSG
Sbjct: 361 ESNPDIDENPPIPLRDALEKMKPFLMAYEGIRSHEEWEEIVEETMQRVPLLKEIVDSYSG 420
Query: 421 PDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVGG 475
PDRVTAK+QQGELERVAKTLPQSAPNSVK+FTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 PDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLVRE 480
BLAST of IVF0018253 vs. NCBI nr
Match:
KAG6600462.1 (hypothetical protein SDJN03_05695, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 674 bits (1740), Expect = 8.30e-240
Identity = 368/487 (75.56%), Postives = 393/487 (80.70%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSS---GGHGRGRGRGAFPS--GPFDFTPPVP 60
MSRSIGRKVPG S LSNANKLG VPFSSSSS GGHGRGRGRG P+ GPFDF+ VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGRGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 SQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPS 120
QE N SK E +DSR T GLGHGRG P+PSS I PS SSF+PSV+ SS GRGRGD S
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQP 120
Query: 121 IRSPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGV 180
IRSPPE DSE KKPVFFS++NAGDSA S G L R GER+LPDS S SG
Sbjct: 121 IRSPPESRSSNESDSERKKPVFFSKDNAGDSAGSAPPGALDRDVGERHLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMAS 240
GRGKPMKQPVPE QPKQENRHLRPRQE GRG G GR G PRI R E RNT RM
Sbjct: 181 GRGKPMKQPVPEAQPKQENRHLRPRQEAGGRGGYGPGRGSGGGPRISRDESVRNTGRMMP 240
Query: 241 RGGPDGEVGGGRGSSGYRGRGVRGPFR-RGPRGSFRTGERWDRRSGQDKEDGYAAGLYLG 300
RGGPDGE GGGRG G+RGRG R FR RG G+FRTGER R GQD EDGYA+GLYLG
Sbjct: 241 RGGPDGEDGGGRGRGGFRGRGGR--FRGRGRGGAFRTGERGQRGRGQDMEDGYASGLYLG 300
Query: 301 NNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGD 360
+N DGE+LAK++G E MNQLVEGFEEMSGRVLPSPLE+ ++ MD N+MIECEPEYLMGD
Sbjct: 301 DNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYLMGD 360
Query: 361 FESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYS 420
FESNPDIDENPPI LRDA EKMKPFLMAYE IQSHEEWEEIVEETMQ VPL+KEIVD+YS
Sbjct: 361 FESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSYS 420
Query: 421 GPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVG 475
GPDRVTAK+QQGELERVAKTLPQSAPNSVK+FTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 GPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLVR 480
BLAST of IVF0018253 vs. TAIR 10
Match:
AT1G53645.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 309.7 bits (792), Expect = 4.1e-84
Identity = 226/545 (41.47%), Postives = 293/545 (53.76%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLGVVPF---------SSSSSGGHGRGRGR---GAFPS--- 60
M +IGR+ + + A+ + PF SSS S G GRGRG G FP+
Sbjct: 1 MRSAIGRRFSNPNGFTIASLVKQTPFLTQSTSHFSSSSDSSGRGRGRGSGEDGGFPAAGR 60
Query: 61 GPFDFT--PPVPSQEHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPS 120
G F P VP +E +A G GHGRG P S I P+F+SF S P
Sbjct: 61 GQFGVNREPVVPGREPSSAG-----------GYGHGRGRPIQSDSISPAFTSFVKSDSP- 120
Query: 121 SVGRGRGD----------ASPSIRSPP-------------------EPDSEPKK------ 180
S+GRGRG A P +SPP +P S+P++
Sbjct: 121 SIGRGRGSVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRSQPQQQPNDES 180
Query: 181 ---PVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGF-------SGVGRGKPMKQP 240
PVF D A++S G+ + PD++ + SG GRGKP+ +
Sbjct: 181 QGSPVFVKLQEMQD--ATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVES 240
Query: 241 VPEDQPKQENRHLR--PRQEGDGRGAGGRGRARGV-----EPRIGRGEPWRNTNRMASRG 300
P Q ++NR +R P R + RA V +P++ E R SRG
Sbjct: 241 APIRQ--EDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRG 300
Query: 301 GPDGEVGGGRGSSGY-RGRGVRGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLGNN 360
+G GGRG G RGRG RG RG R G+ W +++ + A ++ G++
Sbjct: 301 EAEGSSVGGRGGRGRGRGRGARG------RGRGRGGDGWRDDKKEEEGEQEAMRIFAGDS 360
Query: 361 EDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGDFE 420
DGE+ A+K+GPE+M L EGFEE+ + LPS D ++D D N MIECEPEY+M DF
Sbjct: 361 ADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFG 420
Query: 421 SNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYSGP 476
SNPDIDE PP+SLR+ EK+KPF++AYE I+ EEWEE + E M PLMKEIVD YSGP
Sbjct: 421 SNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGP 480
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3CZK6 | 1.9e-270 | 100.00 | Translation initiation factor IF-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E56... | [more] |
A0A1S3BT69 | 1.9e-270 | 100.00 | translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 ... | [more] |
A0A0A0KVG1 | 2.6e-243 | 90.79 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G064010 PE=4 SV=1 | [more] |
A0A6J1IZW9 | 9.6e-190 | 75.05 | uncharacterized protein LOC111479946 OS=Cucurbita maxima OX=3661 GN=LOC111479946... | [more] |
A0A6J1FPB3 | 2.4e-188 | 74.90 | uncharacterized protein LOC111447586 OS=Cucurbita moschata OX=3662 GN=LOC1114475... | [more] |
Match Name | E-value | Identity | Description | |
XP_008451827.1 | 0.0 | 100.00 | PREDICTED: translation initiation factor IF-2 [Cucumis melo] >KAA0062949.1 trans... | [more] |
XP_004147751.1 | 4.24e-310 | 90.79 | uncharacterized protein LOC101215545 [Cucumis sativus] >KGN53518.1 hypothetical ... | [more] |
XP_038883040.1 | 5.35e-249 | 77.96 | uncharacterized protein LOC120074102 [Benincasa hispida] | [more] |
XP_023544535.1 | 3.97e-240 | 75.10 | uncharacterized protein LOC111804080 [Cucurbita pepo subsp. pepo] | [more] |
KAG6600462.1 | 8.30e-240 | 75.56 | hypothetical protein SDJN03_05695, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
AT1G53645.1 | 4.1e-84 | 41.47 | hydroxyproline-rich glycoprotein family protein | [more] |