Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTAATTTAATTTGGCGCGAGGGTTTATGCTTGACCCCTCCAGAACCCTCGGTTCAGAGAGAGCGACAATTCCATAGTCAATTTCCTTCCATCGCAAAGCCTCCAGGTTATGGCGTCGCTGATGTTCCCTTTCAGAAAGTTTTCATTGTTCTTCCATTCTCCCAGAGGTATACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCCATATATTCTGCAATTTCGTCTCCAATACTTGAAATTTTCATTGCAGCAGTTTCATGTCAACAATCGTGCGCTGTTCTGTCTCGATCACTTTTTTCTACTCACATTGTTGGTGACGAGCCGATTCTTGTAAGTATTACTTGGTTTTTTCTCGTTTTCGGATTGATTCTTGCCTATTGCATCACATTTCGCATTTCGGAGTTATGGATTTCGATGAATTATTCTCACCAGGTTCGCGATTTCATACATTCCGCATTATATGATCCAAATCATGGATACTTCGCTCAGCGGTCACAGTCCGTGGGAGTTCTGGAACACAGCATTAAGTTCAATCAGCTTGAAGGTATGCTGTAGATATAGTTTGGAGGTTTAGTTTGGAGGTATTCACAAGCATTCAAGCATGAATTATTGGGATTGATATGAGTGCGTTATCATTATGTTGCTATAATAGCTGGCAAGCTGTGTCCTGGAGGCAGGTTTAGGAGTAATCATATTTTCAATCAAATGATTTATCCTTGAAGGACATGGTGTGTGTGTATGTTATTAAACTCCACGGCTATATTTACAAGCAAATGCCCGCACTGAATTTAGTTAACCGCGTGGCTTTTTGTATTTTCTAAGAAGGAAGCTAAAGTGAGCATAGCTCAATGATAATGAGCATTATCCTTAACCAACAAGTTAGAGGTTTGAATCCTCTCACCGCAAATTGTCGTTGAACTAAAAAAAGTATAAACCAACAAGTTAGAGGTTTGAATCCTCTCACCGCAAATTGTCGTTGAACTAAAAAAAAGTATATTCCAAGAGAGAAAGTCTGTACTATAATCGATGAAAGTCCCCTTAAGGTCAAAAGTATTTGAGAATAGGTCTATGGAACTGAATATTCATCGTAAAGAAATGGCAAACAATGGCAGCTAGAGTGAGCTGCTTGCAGTAGCGCAAAGAAAGAAGTCCGAAAACCTCGTGCATGGCACTATCCACTTTTTTTTAATGATTTTTTTTTATCAATTTTTTCCAGAGTTGAAAATGATCAGTAATACCCATGTGAATAATGATGTACTGAGGCTATCTGGCTATGATCTATCTGTTCTACTGATTTTATGTTTTGGAAGTGACCCTTCACCGAGACTTTTCCATTTATAGTGTTTTAATTTATGCTGGCATTGTCCTTTCCCGAGATTTCTCAGATGCGTACTTTTCTACTAAATCTATTATTTTCCAGCAATTGGAAGCTTAGTTCTTCCTGTCTTTTCAATGCTGCTATGTATTACTCTTGTCAATTCAACATGCTGATTTCTTAAGAAGCATAATGTAATTCTGATATCATTGCTGTGATGTTTGCTTCATGTACTGAAGTGTGTGCATGATTTAGGCAGGAAAGCTTATATGAGATACTTGGATAAAATTTACAAGCAGAGCGACATTTCATGGTTTACTCCAGTGGAGCTTTTTAAGGTAATTGATTCCTCTCTCTCTCTCTTTCACACACACACACACACACACACACACGCTACACTGAGTAATGCGTTCAATCAGAAAGTCTTTTTGGTTGTTATGCTTGGGAAGGATTTGTCATATTTGTGTGGAATGTAGAGGCTTGGGGGTATGGAAGACATTGAGGAGTTCTGTAACTTCTGTTGTTCTGTTCTTACCCTCTACGTGGAGTTCCTCGTCTAAATCTTGTTGTGAATATTCTAATTTCTTTTATTTAAATTATTGAAGAGTTTCTATCTTTTTACTCTCTCTCTCTCTCTCTATATTTATCTCACGGACTTTATCCATTTGGCTAACGACTAACTTGGCCACCTGTAGTGCAAGGTTTGAAAATGTATAAAAAAATAACCTTTTTATATCATGTTTTCTGGAGGCAGAGTGGGGATGGTAAGGGGCATAATACACGCGCACTCTGCAAGCTACGGATGTTAATAACTAAAAGATTCCACAAGTGAAATTTGCCCATCAAATTCTATTGTTAGATATTACTGATTGTTCTGGATTCATGATTAAAGCTGTTAAATTCTTCATGTGGCATCATATGTAATATTAGCATTAATCTGGACAAGAAGATCTCTGTGCCAAGGGACTAATGGTTCTGATATTGAGTATTCTCTCCTCAATGTAATCTTTGAGAAATGAGTTAAACTAGCATGAAAAAAATTCATCCATGAATCTTCTTTTTTAATTCCTGCTTACCTTGGTGAGAGTGGAATCAAATTTTCATTTCTCAATAACTATGAAATCTAGCTCAATTGATAGTAACCTCTGGCTACATCTTTCATGTTTGCTTTTGTTTGTCAATTTTGCTCCTTCAGCTTTCATTGCCTTTGATTTTGCAATTTTATTTCCCTGTAGCCTTGGTATGCTCACGGGATTGCTGAAGCAATCATGCGTACTGCCAATCTTTCTATTCCACTAAAAGTAAGATTTCTATCTTTCTTGATTTTAACTATAGATTCATAGCCACTCAGGATCTTTTTGTTGCGTTAAATATTTTTTCCCACTCAGGAGCATTTGCTATCTAACTTTTTCTCATGTTTCCTGCCCGCTAAATCTAAACATTTTTTTTGTGCTGTGCCATTTTGCTTTTGTGGTGGTGGGGGGGGGGTATTTCCTTAACTCTGGCCATAGAAGGATATCTTTCTTGTCTTCGCATCTTCTCTTTATCTTGAGTGCTACATAACTAGTACTTTTGTTCCAGCTGGTTGAGGTGTCCTGTCCCACACAAAAAAGCTCAAAGCTATCAATAGTAGAATATTGAATTGTTGGCAATAAGGAGGTCTCTTTATTTGATAGGAACCATCACTAGTGTTTTTGAGAACTGTATTTCTTTTTTTTTTTTTTTTTAATCTTTAGCAGGTTTCTGAGTATTGTATGAGATGTATGGTTATAAGCCATTACTGCTAACTAGTACCAATTGTGGATTTGAATTTAAGAAAGCAATATTTTTACTAAAGCTATATGGATATTCCCCTGTCAGGTGCCTCAAAACTGTACCATGATTTCTTTCTTTCTTCTTCTTCTTCTTCTCTCTCTTTTTTTTTTTTAATTAATTATTTATTTATTATTTATTTTTTGGGTAAGAAGAAAAAAATGTGTTAATGACAAAGATATATCTCCATCAGGGGAGGGGGCAAGGACGCAAGTCATCTCCTAACCGAGGTCCATAGAGGAAGTTCAAAAGTATCCCTATATTTGTCTTCACAAAGTTTTTCCGTTTCTTCAATCCAAATATTCTGCTGACAACATTGCGGGCATTCAAACAAATTAATTAGTTATAACCTTTTGACATAGGAGCCTTCAAGAAGTTAAACAAGTTATATGTTTTTAATCTCTAATTATGTTATGCCAACTTCATTGTGTGTATGTGTGTCTTTTTTTGTGTCCACAGATATATGAAATTGGTGGTGGATCAGGAACTTGTGCAAAGGGGATAATGGACTACATAATGTTGAATGCACCTACACGAGTTTACAACAATATGACTTATACGTATGTGTTGCTCTCTCTCGAAAACAATATGACTAATATGTGTGTGCTTTTGCGGCACTTTTTCTCAACTATGAAATGAGTAATACCTTCCGATGCCTTAATGAAAGAGTAAGCTGGGATGTCTTAATAGTTTTTAGGCTCAATTACTTTATGGCAGAGACATCTATGGCACGTGCTCTCAATATAGCCAAATGAACTACTTTTAACAATTTTTTTTTGTAAATACAGAGGCGCATTTTTTTAAAGAGGGAATCTTCTAGACTTTTAGAGTTTCCCCCCTAAAAATCCTCCACTATATGAATATGATCATATATTTCTTCATGTGATTTTCTTAAACAACATATTATGTTAACCTAAAAATAAAATTGTTGGAAAATATACATTCCAGTGTTAAACTTTTTTTTCCCCTAAACACATACTATATGACAACCTTAGTAAATATTTTAAGTAGTGAATCAATTAGTAGGATGTTAAAAAGATAAAGCATAAGGAAAAAAGTATTTTTATTTTCTAGTTGCATCTTGAAAATATATGAGATAGAATTTCTTTTAAGGTCTTCTATTCTTGAAATTATTAACAGATTTTCACCGGTACTTGTCAAATAATGTATGTTGTAAATTAAATTATTCTATAAAGTTCAAACAAACTAAATGTATATTATAAGTAGATGGTTTGTTTGTCAACTTATTATTATTAAACCTATACATTGATCATTTTGTGAGAAATTATTACAAGAAGATGGAGAATAAAGTTACCAATCAAAAGTGAAAAGCTTCCAAGTTCATTTTTATCAAATATATTACAAGTATTCACATAATTTTTTGTCTATTGGGGTTCTCCCCTTTCCCCATGTTTCAATTTTTCTTCCTCATATTTCACGTGAAAATTACATATATAACTGTAGCTCAGTGGAAATAAGTCCCTCATTAGCAGAGATACAAAGACAAACTGTTGGAGAGGTTCGCAGTCACCAATCAAAATTCAGAGTAGAGTGTCGTGATGCTGTTGATCTGAGTGGATGGGGTACGTTCTCATACATTTCCCTCAACTATTTCTTTCAAGTGCTGTCAAGGAGTTACATTCTTTGTTCTTGGGCGGTGTGTGTCGTGTGTGTTGAGTGAATGAATAAATGGGTGAAAGTTCTCACCTGGTGTCTAGGCAATTTTTTCTTTTGTGCTGGAGAGACCCCCGAGGTGTGGGGGTAAGAAGGAGAAGATACTCCAAAGTAGATACAAGTGTTCAAACTTTTTTCCTTTTTTATTCTTCCTTGTATGGCAGCAGACAATTTGATATGGATAGTGGGTTGTTACTTTTGTTGCTCCTTTATTTATTTATTTATTTTTATTACATCAACAATTGGGGGTGGGGGATTTGAACCCATGACCAAGATGCCAGTTGAGCTACGCTCTTGTTGGCTTGTTGCTCCTTTATTTAAATATACCCTCTGCGTTTATGATTCTACAATATAGTTGTATCATTAACCATTGTCCTCCTATAATAGGAGACGTGGAGGAGCAGCCTTGTTGGGTAATTATGCTTGAGGTATGTTAAAGATGAACTTGTGAAAAATTGAAATTGCAGATCTTTTGATTGGGCAAGTTTACTTGGATAAACTATTATAGTAAATTGTGAATCAGCAACTCTATGCATCAATAGTCTTGGCTTTTTCCTCAATTTCTTTCTCAGCGAAACATCAAATGATGTATGATTGCATATGCATGAATTGTGTGGAAGATGGGCACAGGGATTGAGATGTTTTCGATGAAATAGTTCAACATAATTTTGTTTCCTCAATTGTTGGACATTGTTTCATTACCCAATCAAACATGAGCCGATCATGTAACAATTTTTCACTGAAGACTTTGTGGAGACGTGATTTTTCATCTTCAAGTGGTTTCTAACCTTCTTTAGATTTTGAGTCAATTTCATGCTTCATTGACCAATTAAATTGTTCATTTCTCACAGGTTCTGGACAATCTTCCTCATGATCTTATTTACTCAGAAAATCAAGTTTCCTCGTGGATGGAAGTATGGGTGGAAAAGCAACTTGATAGGTATGCAACAATTCATAGTTTCAAAAAAAAAAAAAAAAACAACTCATGCCTGCTCACATTTTGTTCTTGCTGCATTTTCCTAAAAGAGGCGATCTCTTATTAGATCGTTGGTTCACTTATTAGTTATTTCATCTTTAATCTTTATGTGATATGGTTGAATACTTAATGAAACAAAGGAGTTTTGTAAACAATTTCAACTTTAATTACTGCTCCATAGTCATTTTCAGCCCACCTGCTGCATATTCTTGTTTCTGAAGCAACATCATTGGGACTCATCCAGAAGTTAGTTATCCTAAACTGATGTTCTTCAGAAATTTTATATATATATATTGAGATCGGGAAGAAAGACTGCTCTCGGTTTTGGCCATGGTTAAGGCCATGGTTAAACATCTTGCTACCCGGCTCTGTTTTTAAAACTGTACAGTCATCAACAGGCATGGAGTATATAAGAAACTCCTCTGTGGTGTGATTTGGTGACAGATGAGAACTAAATATGTGGCTTAAGCTATCCTAGCAAGCTGTTTGGCTTATTTTTACTTGTGAGGATCTTTCGAGTCTTTGCTAATCCTGCTGCCATATGGAAGCAGGGAATCACTCGTGGAATTATACAAACCATTGCAAGACCCTCTCATTAAGCGTTGTGTAGAGATTGTGAATTTCAACGAAAATGACTCTGCCGAAGGCAGTGTATTATCAAAGGCAAAAGGCGTTTGGTCCAAAGCTTTTCCAAAACCTAGAAGATGCTGGCTGCCAACTGGATGCTTGGTGAGAAAACACTTCATTGATCCCTCGTCCCTCCCCCAAGATTTCACTCACTCTAGTATTGTACTGGTCATGCAGAAACTACTTGAGGTGTTGCATCGTGTGCTACCAAAGATGTCATTAATTGCTTCGGACTTTAGTTACTTACCTGATGTGAGGATAGCTGGTGAAAGAGCTCCATTAGTTTCAACCAAGGTATGTTCTTTTGTTTACTCGATGAGGATTGAAAATTGCTGCAGGTATCTTCTGGACGATACGTCTTATGTTTTAAAAATACCAGAACCCCAGTACATAGAAAGATTATAATTTCAATTTTTTACGTTTTAATATGCAGGCAGATGGAAGCAGCTCAGACTATGAAAGCTATTTAGATGCAAAGGTACGTGCAAGCCTGGAGTCCCGAAACTTAAAATTGGATTCAGTTTTTTAACTTTCCTATCATTACAATCTCTTATTGTATTACTAAAAGGGGTTTTGACGGTAGAGACACTCGCTGTTGCCTTTTCGATTTTTAGCAAGCGTTGCCATTTTCACAGTGAAAGGTACTCAACGGTTTTGGTTGATCTGCAGGGTGATGCTGACATCTTTTTTCCCACAGACTTTCTATTATTGGAACAGATTGAACATTATTGTTCTGGATGGCTAAAACTTCATGGAGATAAAACGCCAAAGATAGGGAAAAAGAGGCGAACAATAATTGTAAGTTGCATAATCTTTAATGTGTTATTCGTCAAAACTTCAAGGAACGTTGGCTCTTGAGAAAAATGCCGCTTGTTTCTTGTTGACTTGCTCACTCAAATGATGAGGGATAAGATAAAACAGCCCTCTGATTCTAAACAGATATTGAATTATATAGTTTTTATTTCTGATTTCTTTATAAGTCGAACTAAATTACAGAGTTAAAAAATAAGTATAATTTAAATTTTATGGCGACTATAGCAATTGCATGATGACTTCAATTTCACTGTCAATCACCATCCCCTTTTTTGGTGCCAGCTTGAGACTTCATCGTTCATGGAAGAGTTTGGATTGCCATCGAAGACGAGATTGAAGGATGGTTACAACCCCCTTCTAGATGACTTCAAGAACACAAAATTTTATCTAAGTGTTCCAACACACAACATCAAATAGAGGCAAGGGAAGGAAGCTTCCGATAATTCTGTGCTGGAATTTTTTTTTAACTCTTTCTTTGTTTTCATGTAGTGTATGCATTAGAAATCAGAACATGGAATTTCTTTTGACGGGAAAACAGTATCAGAATAGATGGTTACTAAAGCAGGAGGCTGGTTTGGAAGTTTTCTTCCGTGGACCTATGGACAACGTCGTTGTGGTAATGGTCGTAATGCCAGAGAAATCATTTCCACGAGGCACGGAAGAGAGGTAATAATAATCATCTATTCTGAAACATTGATTTTCGACAGAATGAAAAAGGCGAAGCATTGGGAGCATCCAGTCGTGGAACTTAGAACAGTGCTCTGAGTTAAGTTAATGTTGAAGAGTGAGTAGTATTCTTTTTATTTACTTAAACTGATTATGGGAAATAAAGAGCAATGACTGCACTCACATAGAGTTCCAAATTCACTGTATTTCTTTTATGCAATGACTCGTACTTATTTATAAAGAATGGGGGAAAAGACATTGAGATTCATTCGCCCTTTGATTTTTAA
mRNA sequence
GTAATTTAATTTGGCGCGAGGGTTTATGCTTGACCCCTCCAGAACCCTCGGTTCAGAGAGAGCGACAATTCCATAGTCAATTTCCTTCCATCGCAAAGCCTCCAGGTTATGGCGTCGCTGATGTTCCCTTTCAGAAAGTTTTCATTGTTCTTCCATTCTCCCAGAGCAGTTTCATGTCAACAATCGTGCGCTGTTCTGTCTCGATCACTTTTTTCTACTCACATTGTTGGTGACGAGCCGATTCTTGTTCGCGATTTCATACATTCCGCATTATATGATCCAAATCATGGATACTTCGCTCAGCGGTCACAGTCCGTGGGAGTTCTGGAACACAGCATTAAGTTCAATCAGCTTGAAGGCAGGAAAGCTTATATGAGATACTTGGATAAAATTTACAAGCAGAGCGACATTTCATGGTTTACTCCAGTGGAGCTTTTTAAGCCTTGGTATGCTCACGGGATTGCTGAAGCAATCATGCGTACTGCCAATCTTTCTATTCCACTAAAAATATATGAAATTGGTGGTGGATCAGGAACTTGTGCAAAGGGGATAATGGACTACATAATGTTGAATGCACCTACACGAGTTTACAACAATATGACTTATACCTCAGTGGAAATAAGTCCCTCATTAGCAGAGATACAAAGACAAACTGTTGGAGAGGTTCGCAGTCACCAATCAAAATTCAGAGTAGAGTGTCGTGATGCTGTTGATCTGAGTGGATGGGGAGACGTGGAGGAGCAGCCTTGTTGGGTAATTATGCTTGAGGTTCTGGACAATCTTCCTCATGATCTTATTTACTCAGAAAATCAAGTTTCCTCGTGGATGGAAGTATGGGTGGAAAAGCAACTTGATAGGGAATCACTCGTGGAATTATACAAACCATTGCAAGACCCTCTCATTAAGCGTTGTGTAGAGATTGTGAATTTCAACGAAAATGACTCTGCCGAAGGCAGTGTATTATCAAAGGCAAAAGGCGTTTGGTCCAAAGCTTTTCCAAAACCTAGAAGATGCTGGCTGCCAACTGGATGCTTGAAACTACTTGAGGTGTTGCATCGTGTGCTACCAAAGATGTCATTAATTGCTTCGGACTTTAGTTACTTACCTGATGTGAGGATAGCTGGTGAAAGAGCTCCATTAGTTTCAACCAAGGCAGATGGAAGCAGCTCAGACTATGAAAGCTATTTAGATGCAAAGGGTGATGCTGACATCTTTTTTCCCACAGACTTTCTATTATTGGAACAGATTGAACATTATTGTTCTGGATGGCTAAAACTTCATGGAGATAAAACGCCAAAGATAGGGAAAAAGAGGCGAACAATAATTCTTGAGACTTCATCGTTCATGGAAGAGTTTGGATTGCCATCGAAGACGAGATTGAAGGATGGTTACAACCCCCTTCTAGATGACTTCAAGAACACAAAATTTTATCTAAGTGTTCCAACACACAACATCAAATAGAGGCAAGGGAAGGAAGCTTCCGATAATTCTGTGCTGGAATTTTTTTTTAACTCTTTCTTTGTTTTCATGTAGTGTATGCATTAGAAATCAGAACATGGAATTTCTTTTGACGGGAAAACAGTATCAGAATAGATGGTTACTAAAGCAGGAGGCTGGTTTGGAAGTTTTCTTCCGTGGACCTATGGACAACGTCGTTGTGGTAATGGTCGTAATGCCAGAGAAATCATTTCCACGAGGCACGGAAGAGAGGTAATAATAATCATCTATTCTGAAACATTGATTTTCGACAGAATGAAAAAGGCGAAGCATTGGGAGCATCCAGTCGTGGAACTTAGAACAGTGCTCTGAGTTAAGTTAATGTTGAAGAGTGAGTAGTATTCTTTTTATTTACTTAAACTGATTATGGGAAATAAAGAGCAATGACTGCACTCACATAGAGTTCCAAATTCACTGTATTTCTTTTATGCAATGACTCGTACTTATTTATAAAGAATGGGGGAAAAGACATTGAGATTCATTCGCCCTTTGATTTTTAA
Coding sequence (CDS)
ATGGCGTCGCTGATGTTCCCTTTCAGAAAGTTTTCATTGTTCTTCCATTCTCCCAGAGCAGTTTCATGTCAACAATCGTGCGCTGTTCTGTCTCGATCACTTTTTTCTACTCACATTGTTGGTGACGAGCCGATTCTTGTTCGCGATTTCATACATTCCGCATTATATGATCCAAATCATGGATACTTCGCTCAGCGGTCACAGTCCGTGGGAGTTCTGGAACACAGCATTAAGTTCAATCAGCTTGAAGGCAGGAAAGCTTATATGAGATACTTGGATAAAATTTACAAGCAGAGCGACATTTCATGGTTTACTCCAGTGGAGCTTTTTAAGCCTTGGTATGCTCACGGGATTGCTGAAGCAATCATGCGTACTGCCAATCTTTCTATTCCACTAAAAATATATGAAATTGGTGGTGGATCAGGAACTTGTGCAAAGGGGATAATGGACTACATAATGTTGAATGCACCTACACGAGTTTACAACAATATGACTTATACCTCAGTGGAAATAAGTCCCTCATTAGCAGAGATACAAAGACAAACTGTTGGAGAGGTTCGCAGTCACCAATCAAAATTCAGAGTAGAGTGTCGTGATGCTGTTGATCTGAGTGGATGGGGAGACGTGGAGGAGCAGCCTTGTTGGGTAATTATGCTTGAGGTTCTGGACAATCTTCCTCATGATCTTATTTACTCAGAAAATCAAGTTTCCTCGTGGATGGAAGTATGGGTGGAAAAGCAACTTGATAGGGAATCACTCGTGGAATTATACAAACCATTGCAAGACCCTCTCATTAAGCGTTGTGTAGAGATTGTGAATTTCAACGAAAATGACTCTGCCGAAGGCAGTGTATTATCAAAGGCAAAAGGCGTTTGGTCCAAAGCTTTTCCAAAACCTAGAAGATGCTGGCTGCCAACTGGATGCTTGAAACTACTTGAGGTGTTGCATCGTGTGCTACCAAAGATGTCATTAATTGCTTCGGACTTTAGTTACTTACCTGATGTGAGGATAGCTGGTGAAAGAGCTCCATTAGTTTCAACCAAGGCAGATGGAAGCAGCTCAGACTATGAAAGCTATTTAGATGCAAAGGGTGATGCTGACATCTTTTTTCCCACAGACTTTCTATTATTGGAACAGATTGAACATTATTGTTCTGGATGGCTAAAACTTCATGGAGATAAAACGCCAAAGATAGGGAAAAAGAGGCGAACAATAATTCTTGAGACTTCATCGTTCATGGAAGAGTTTGGATTGCCATCGAAGACGAGATTGAAGGATGGTTACAACCCCCTTCTAGATGACTTCAAGAACACAAAATTTTATCTAAGTGTTCCAACACACAACATCAAATAG
Protein sequence
MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNHGYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAEAIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQRQTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWMEVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPRRCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYLDAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPSKTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Homology
BLAST of Tan0012280 vs. ExPASy Swiss-Prot
Match:
O14138 (Protein arginine methyltransferase NDUFAF7 homolog, mitochondrial OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPAC25A8.03c PE=3 SV=2)
HSP 1 Score: 83.2 bits (204), Expect = 8.3e-15
Identity = 97/434 (22.35%), Postives = 168/434 (38.71%), Query Frame = 0
Query: 2 ASLMFPFRKFS--------LFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHS 61
ASL+F R S L H PR + +L R D + + D+IH
Sbjct: 17 ASLLFGSRVISALAFTNTGLVLHGPRRWYTTDNGFLLHR---------DSKVSLADYIHE 76
Query: 62 ALYDPNHGYFAQR-SQSVGVLEHSIKFNQLEGRKAYMRYLDKIY----KQSDISWFTPVE 121
+ +DP+ GY+++ + S L HS+ + EG K + ++ Q ++ + E
Sbjct: 77 STFDPSKGYYSRLWTGSTNNLSHSVHVLRKEGHKCSKEFDPFLHGIPIPQKALNIY---E 136
Query: 122 LFKPWYAHGIAEAIMRTANLS----IPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNM 181
+ ++ I+ ++ L LKIY+ G+G A I+DY+ N VY
Sbjct: 137 KQRSLFSESISNYLVLQYKLRYFPVFDLKIYDFHSGTGIIALDILDYLYKN-HLEVYGRT 196
Query: 182 TYTSVEISPSLAEIQRQTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDN 241
TY V + A + + VR + ++ + L+ W + PC+V+ L+V+ +
Sbjct: 197 TYNIVLHNSWQASWFKSMLTSVRYAKHGDHIDIYVSDPLT-WNHTDTNPCFVLALQVISS 256
Query: 242 LPHDLIYSENQVSSWMEVWV--EKQLDRESLVELYKPLQDPLIKRCVEIVNFNE----ND 301
HDL N W+ E L+ + ++ + + N +D
Sbjct: 257 FGHDLFRQSNGAMMMERCWLGPEHFLNEFFTLNTHQKVSSLNYHLAFQQARINVQQGFSD 316
Query: 302 SAEGSVLSKAKGVWSKAFPKPRRCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIA 361
S S K V+ F + + PT ++ E L + P SL+ D ++ D +
Sbjct: 317 SRAKRYFSGVKQVFWSFFSTQKLTYYPTKAIRFFERLSKQFPHHSLLLMDVCHV-DKSLP 376
Query: 362 GERAP-LVSTKADGSSSDYES---YLDAKGDADIFFPTDFLLLEQI-------------- 395
G AP ++S + D S+ S ++ FPT L+ I
Sbjct: 377 GINAPSVLSMENDFSTKKMSSNIGHVFQNETVKYVFPTPLYLVSDILQLATHNRSFICSL 435
BLAST of Tan0012280 vs. NCBI nr
Match:
XP_022147060.1 (protein arginine methyltransferase NDUFAF7 homolog, mitochondrial [Momordica charantia])
HSP 1 Score: 871.7 bits (2251), Expect = 2.7e-249
Identity = 420/450 (93.33%), Postives = 432/450 (96.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASLM R FSL FHS R +SCQ+SCAVLS+ LFSTHIVGDEPILVRDFIHSALYDPNH
Sbjct: 1 MASLMSSCRSFSLLFHSFRVISCQRSCAVLSQCLFSTHIVGDEPILVRDFIHSALYDPNH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGVLE SIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVLERSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAP RVY NMTYTSVEISPSLA+IQR
Sbjct: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPPRVYKNMTYTSVEISPSLADIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKFRVECRDAVD SGWGDV+EQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFRVECRDAVDPSGWGDVQEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQL+RESLVELYKPLQDPLIKRC EIVNFNEND AE SVLSKAKG+WSKAFPKPR
Sbjct: 241 EVWVEKQLNRESLVELYKPLQDPLIKRCAEIVNFNENDHAESSVLSKAKGIWSKAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
RCWLPTGCLKLLEVLH VLPKMSLIASDFSYLPDVRI GERAPLVSTKADG+SSDYESYL
Sbjct: 301 RCWLPTGCLKLLEVLHHVLPKMSLIASDFSYLPDVRIPGERAPLVSTKADGTSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDF+LLEQI+HYCSGWLKLHGDKTPK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFVLLEQIDHYCSGWLKLHGDKTPKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 450
BLAST of Tan0012280 vs. NCBI nr
Match:
XP_022980290.1 (uncharacterized protein LOC111479709 isoform X1 [Cucurbita maxima])
HSP 1 Score: 867.5 bits (2240), Expect = 5.2e-248
Identity = 417/450 (92.67%), Postives = 434/450 (96.44%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL+F FRK SLFFH+ R +SCQ+SCAVLSRS FSTHIVGDEPILVRDFIHSALYD NH
Sbjct: 1 MASLVFSFRKLSLFFHTTRVISCQESCAVLSRSFFSTHIVGDEPILVRDFIHSALYDANH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGV+E SIKFNQLEGRKAYM YLDKIYKQSD+SWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVMERSIKFNQLEGRKAYMIYLDKIYKQSDMSWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF VECRDA DLSGWG+V+EQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFSVECRDAADLSGWGEVKEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQL+RESLVELYKPLQDPLIKRCVEIVNFNEND A+ S+LSKAKGVWS+AFPKPR
Sbjct: 241 EVWVEKQLERESLVELYKPLQDPLIKRCVEIVNFNENDQAKSSLLSKAKGVWSRAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
+CWLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA GSSSDYESYL
Sbjct: 301 KCWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKAHGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLHGDK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHGDK-PKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 449
BLAST of Tan0012280 vs. NCBI nr
Match:
XP_038894541.1 (uncharacterized protein LOC120083075 isoform X1 [Benincasa hispida])
HSP 1 Score: 866.7 bits (2238), Expect = 8.8e-248
Identity = 413/450 (91.78%), Postives = 435/450 (96.67%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
+ASL+FP+RKFSLF HS R +SCQ+SCA+LS+ LFSTHIVGD+P+LVRDFIHSALYDPNH
Sbjct: 3 LASLVFPYRKFSLFLHSSRVISCQRSCALLSQLLFSTHIVGDKPVLVRDFIHSALYDPNH 62
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFA RS+SVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVE+FKPWYAHGIAE
Sbjct: 63 GYFAHRSRSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVEIFKPWYAHGIAE 122
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTY SVEISPSLAEIQR
Sbjct: 123 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYISVEISPSLAEIQR 182
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 183 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 242
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKPLQDPLIKRC++IVNF END A+ SVLSKAKG+WSKAFPKPR
Sbjct: 243 EVWVEKQLDRESLVELYKPLQDPLIKRCLDIVNFTENDHAKSSVLSKAKGIWSKAFPKPR 302
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
R WLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA+GSSSDYESYL
Sbjct: 303 RSWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKANGSSSDYESYL 362
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DA GDADIFFPTDF+LLEQI+HYCSGWLKLH DK PK+GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 363 DATGDADIFFPTDFVLLEQIDHYCSGWLKLHEDKKPKMGKKRRTIILDTSSFMEEFGLPS 422
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKD YNPLLDDFKNTKFYLSVPT+NIK
Sbjct: 423 KTRLKDDYNPLLDDFKNTKFYLSVPTYNIK 452
BLAST of Tan0012280 vs. NCBI nr
Match:
KAG7018860.1 (SPAC25A8.03c [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 865.1 bits (2234), Expect = 2.6e-247
Identity = 416/450 (92.44%), Postives = 432/450 (96.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL FRK SLFF + R +SCQ+SCAVLSRS FSTHIVGDEPILVRDFIHSALYD NH
Sbjct: 1 MASLALSFRKLSLFFRTTRVISCQESCAVLSRSFFSTHIVGDEPILVRDFIHSALYDANH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGV+E SIKFNQLEGRKAYMRYLDKIYKQSD SWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVMERSIKFNQLEGRKAYMRYLDKIYKQSDTSWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF VECRDAVDLSGWG+V+E+PCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFSVECRDAVDLSGWGEVKEEPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNEND A+ S+LSKAKG+WS+AFPKPR
Sbjct: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDQAKSSLLSKAKGIWSRAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
+CWLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA GSSSDYESYL
Sbjct: 301 KCWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKAHGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLHGDK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHGDK-PKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 449
BLAST of Tan0012280 vs. NCBI nr
Match:
KAG6582471.1 (hypothetical protein SDJN03_22473, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 865.1 bits (2234), Expect = 2.6e-247
Identity = 416/450 (92.44%), Postives = 432/450 (96.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL FRK SLFF + R +SCQ+SCAVLSRS FSTHIVGDEPILVRDFIHSALYD NH
Sbjct: 1 MASLALSFRKLSLFFRTTRVISCQESCAVLSRSFFSTHIVGDEPILVRDFIHSALYDANH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGV+E SIKFNQLEGRKAYMRYLDKIYKQSD SWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVMERSIKFNQLEGRKAYMRYLDKIYKQSDTSWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF VECRDAVDLSGWG+V+E+PCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFSVECRDAVDLSGWGEVKEEPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNEND A+ S+LSKAKG+WS+AFPKPR
Sbjct: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDQAKSSLLSKAKGIWSRAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
+CWLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA GSSSDYESYL
Sbjct: 301 KCWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKAHGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLHGDK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHGDK-PKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 449
BLAST of Tan0012280 vs. ExPASy TrEMBL
Match:
A0A6J1CZ34 (Protein arginine methyltransferase NDUFAF7 OS=Momordica charantia OX=3673 GN=LOC111016089 PE=3 SV=1)
HSP 1 Score: 871.7 bits (2251), Expect = 1.3e-249
Identity = 420/450 (93.33%), Postives = 432/450 (96.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASLM R FSL FHS R +SCQ+SCAVLS+ LFSTHIVGDEPILVRDFIHSALYDPNH
Sbjct: 1 MASLMSSCRSFSLLFHSFRVISCQRSCAVLSQCLFSTHIVGDEPILVRDFIHSALYDPNH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGVLE SIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVLERSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAP RVY NMTYTSVEISPSLA+IQR
Sbjct: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPPRVYKNMTYTSVEISPSLADIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKFRVECRDAVD SGWGDV+EQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFRVECRDAVDPSGWGDVQEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQL+RESLVELYKPLQDPLIKRC EIVNFNEND AE SVLSKAKG+WSKAFPKPR
Sbjct: 241 EVWVEKQLNRESLVELYKPLQDPLIKRCAEIVNFNENDHAESSVLSKAKGIWSKAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
RCWLPTGCLKLLEVLH VLPKMSLIASDFSYLPDVRI GERAPLVSTKADG+SSDYESYL
Sbjct: 301 RCWLPTGCLKLLEVLHHVLPKMSLIASDFSYLPDVRIPGERAPLVSTKADGTSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDF+LLEQI+HYCSGWLKLHGDKTPK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFVLLEQIDHYCSGWLKLHGDKTPKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 450
BLAST of Tan0012280 vs. ExPASy TrEMBL
Match:
A0A6J1IVV3 (Protein arginine methyltransferase NDUFAF7 OS=Cucurbita maxima OX=3661 GN=LOC111479709 PE=3 SV=1)
HSP 1 Score: 867.5 bits (2240), Expect = 2.5e-248
Identity = 417/450 (92.67%), Postives = 434/450 (96.44%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL+F FRK SLFFH+ R +SCQ+SCAVLSRS FSTHIVGDEPILVRDFIHSALYD NH
Sbjct: 1 MASLVFSFRKLSLFFHTTRVISCQESCAVLSRSFFSTHIVGDEPILVRDFIHSALYDANH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGV+E SIKFNQLEGRKAYM YLDKIYKQSD+SWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVMERSIKFNQLEGRKAYMIYLDKIYKQSDMSWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF VECRDA DLSGWG+V+EQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFSVECRDAADLSGWGEVKEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQL+RESLVELYKPLQDPLIKRCVEIVNFNEND A+ S+LSKAKGVWS+AFPKPR
Sbjct: 241 EVWVEKQLERESLVELYKPLQDPLIKRCVEIVNFNENDQAKSSLLSKAKGVWSRAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
+CWLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA GSSSDYESYL
Sbjct: 301 KCWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKAHGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLHGDK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHGDK-PKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 449
BLAST of Tan0012280 vs. ExPASy TrEMBL
Match:
A0A6J1EAI4 (Protein arginine methyltransferase NDUFAF7 OS=Cucurbita moschata OX=3662 GN=LOC111432199 PE=3 SV=1)
HSP 1 Score: 863.2 bits (2229), Expect = 4.7e-247
Identity = 416/450 (92.44%), Postives = 432/450 (96.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL FRK SLFF + R +SCQ+SCAVLSRS FSTHIVG+EPILVRDFIHSALYD NH
Sbjct: 1 MASLALSFRKLSLFFCTTRVISCQESCAVLSRSFFSTHIVGEEPILVRDFIHSALYDANH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGV+E SIKFNQLEGRKAYMRYLDKIYKQSD SWFTPVELFKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVMERSIKFNQLEGRKAYMRYLDKIYKQSDSSWFTPVELFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF VECRDAVDLSGWG+V+EQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGEVRSHQSKFSVECRDAVDLSGWGEVKEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNEND A+ S+LSKAKG+WS+AFPKPR
Sbjct: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDQAKSSLLSKAKGIWSRAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
+CWLPTGCL LLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKA GSSSDYESYL
Sbjct: 301 KCWLPTGCLNLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKAHGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLHGDK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHGDK-PKTGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 449
BLAST of Tan0012280 vs. ExPASy TrEMBL
Match:
A0A0A0LZS9 (Protein arginine methyltransferase NDUFAF7 OS=Cucumis sativus OX=3659 GN=Csa_1G615180 PE=3 SV=1)
HSP 1 Score: 860.5 bits (2222), Expect = 3.1e-246
Identity = 412/450 (91.56%), Postives = 428/450 (95.11%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL FPFR FSLFFH PR +SCQQS + LS S FSTHIVGD+P+LVRDFIHSALYD NH
Sbjct: 1 MASLAFPFRNFSLFFHCPRVISCQQSWSFLSGSHFSTHIVGDDPVLVRDFIHSALYDQNH 60
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSD+SWFTPVE+FKPWYAHGIAE
Sbjct: 61 GYFAQRSRSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDVSWFTPVEIFKPWYAHGIAE 120
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVY MTYTSVEISPSLAEIQR
Sbjct: 121 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYKTMTYTSVEISPSLAEIQR 180
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVG+VRSHQSKF+VECRDAVDLSGWGD+EEQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 181 QTVGDVRSHQSKFKVECRDAVDLSGWGDMEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKPLQDPLIKRCVEI+NF END + SVLSKAKG+WSKAFPKPR
Sbjct: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIMNFKENDHTKNSVLSKAKGIWSKAFPKPR 300
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
R WLPTGCL LLEVLH VLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL
Sbjct: 301 RSWLPTGCLSLLEVLHHVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKL DK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 361 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLQEDKKPKSGKKRRTIILDTSSFMEEFGLPS 420
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 450
BLAST of Tan0012280 vs. ExPASy TrEMBL
Match:
A0A1S3BNJ4 (Protein arginine methyltransferase NDUFAF7 OS=Cucumis melo OX=3656 GN=LOC103492021 PE=3 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 1.4e-243
Identity = 410/450 (91.11%), Postives = 423/450 (94.00%), Query Frame = 0
Query: 1 MASLMFPFRKFSLFFHSPRAVSCQQSCAVLSRSLFSTHIVGDEPILVRDFIHSALYDPNH 60
MASL R FSLFFH R VSCQQ + LS SLFSTHIVGD+P+LVRDFIHS LYD NH
Sbjct: 29 MASLALLSRNFSLFFHCTRVVSCQQPWSFLSGSLFSTHIVGDDPVLVRDFIHSTLYDQNH 88
Query: 61 GYFAQRSQSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAE 120
GYFAQRS+SVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSD+SWFTPVE+FKPWYAHGIAE
Sbjct: 89 GYFAQRSRSVGVLEHSIKFNQLEGRKAYMRYLDKIYKQSDVSWFTPVEIFKPWYAHGIAE 148
Query: 121 AIMRTANLSIPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQR 180
AIMRTANLS+PLKIYEIGGGSGTCAKGIMDYIMLNAPTRVY NMTYTSVEISPSLAEIQR
Sbjct: 149 AIMRTANLSVPLKIYEIGGGSGTCAKGIMDYIMLNAPTRVYKNMTYTSVEISPSLAEIQR 208
Query: 181 QTVGEVRSHQSKFRVECRDAVDLSGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 240
QTVGEVRSHQSKF+VECRDAVDL GWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM
Sbjct: 209 QTVGEVRSHQSKFKVECRDAVDLRGWGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWM 268
Query: 241 EVWVEKQLDRESLVELYKPLQDPLIKRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPR 300
EVWVEKQLDRESLVELYKP+QDPLIKRCVEIVNF END + SVLSKAKG+WSKAFPKPR
Sbjct: 269 EVWVEKQLDRESLVELYKPIQDPLIKRCVEIVNFKENDHTKNSVLSKAKGIWSKAFPKPR 328
Query: 301 RCWLPTGCLKLLEVLHRVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYL 360
R WLPTGCL LLEVLH VLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSS DYESYL
Sbjct: 329 RSWLPTGCLSLLEVLHHVLPKMSLIASDFSYLPDVRIAGERAPLVSTKADGSSLDYESYL 388
Query: 361 DAKGDADIFFPTDFLLLEQIEHYCSGWLKLHGDKTPKIGKKRRTIILETSSFMEEFGLPS 420
DAKGDADIFFPTDFLLLEQI+HYCSGWLKLH DK PK GKKRRTIIL+TSSFMEEFGLPS
Sbjct: 389 DAKGDADIFFPTDFLLLEQIDHYCSGWLKLHEDKKPKSGKKRRTIILDTSSFMEEFGLPS 448
Query: 421 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 451
KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK
Sbjct: 449 KTRLKDGYNPLLDDFKNTKFYLSVPTHNIK 478
BLAST of Tan0012280 vs. TAIR 10
Match:
AT1G04900.1 (Protein of unknown function (DUF185) )
HSP 1 Score: 681.4 bits (1757), Expect = 4.9e-196
Identity = 323/426 (75.82%), Postives = 370/426 (86.85%), Query Frame = 0
Query: 27 CAVLSRSLFSTH-IVGDEPILVRDFIHSALYDPNHGYFAQRSQSVGVLEHSIKFNQLEGR 86
C R+ FST ++GDEP+LVRDFIH+ALYDP GYF+QRS+SVGVLE SIKFNQLEGR
Sbjct: 23 CLGSLRAFFSTQKLIGDEPVLVRDFIHTALYDPIQGYFSQRSKSVGVLERSIKFNQLEGR 82
Query: 87 KAYMRYLDKIYKQSDISWFTPVELFKPWYAHGIAEAIMRTANLSIPLKIYEIGGGSGTCA 146
KAYM+ L+K+YKQSDISWFTPVELFKPWYAHGIAEAI+RT NLS+PLKIYEIGGGSGTCA
Sbjct: 83 KAYMKLLEKVYKQSDISWFTPVELFKPWYAHGIAEAILRTTNLSVPLKIYEIGGGSGTCA 142
Query: 147 KGIMDYIMLNAPTRVYNNMTYTSVEISPSLAEIQRQTVGEVRSHQSKFRVECRDAVDLSG 206
KG++DYIMLNAP R+Y NM+YTS+EISPSLA+IQ++TV +V SH SKFRVECRDA DL+G
Sbjct: 143 KGVLDYIMLNAPERIYKNMSYTSIEISPSLAKIQKETVAQVGSHLSKFRVECRDASDLAG 202
Query: 207 WGDVEEQPCWVIMLEVLDNLPHDLIYSENQVSSWMEVWVEKQLDRESLVELYKPLQDPLI 266
W +VE+QPCWVIMLEVLDNLPHDL+YS++Q+S WMEV VE + + E+L ELYKPL+DPLI
Sbjct: 203 WKNVEQQPCWVIMLEVLDNLPHDLVYSKSQLSPWMEVLVENKPESEALSELYKPLEDPLI 262
Query: 267 KRCVEIVNFNENDSAEGSVLSKAKGVWSKAFPKPRRCWLPTGCLKLLEVLHRVLPKMSLI 326
KRC+EIV E +SK K +WSK FPKPRR WLPTGCLKLLEVLH LPKMSLI
Sbjct: 263 KRCIEIVEH------EDDPVSKPKEIWSKLFPKPRRSWLPTGCLKLLEVLHAKLPKMSLI 322
Query: 327 ASDFSYLPDVRIAGERAPLVSTKADGSSSDYESYLDAKGDADIFFPTDFLLLEQIEHYCS 386
ASDFS+LPDV++ GERAPLVSTK DG SSDY SYLDAKGDADIFFPTDF LLE+++HYCS
Sbjct: 323 ASDFSFLPDVKVPGERAPLVSTKKDGCSSDYSSYLDAKGDADIFFPTDFWLLERMDHYCS 382
Query: 387 GWLKLHGDKTP-KIGKKRRTIILETSSFMEEFGLPSKTRLKDGYNPLLDDFKNTKFYLSV 446
GW K+ D TP K G+KRRT+ L+TS+FM+EFGLPSKTR KDGYNPLLDDFKNTKFYLSV
Sbjct: 383 GWRKMEKDGTPSKKGRKRRTLTLDTSAFMDEFGLPSKTRTKDGYNPLLDDFKNTKFYLSV 442
Query: 447 PTHNIK 451
PTHN K
Sbjct: 443 PTHNTK 442
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O14138 | 8.3e-15 | 22.35 | Protein arginine methyltransferase NDUFAF7 homolog, mitochondrial OS=Schizosacch... | [more] |
Match Name | E-value | Identity | Description | |
XP_022147060.1 | 2.7e-249 | 93.33 | protein arginine methyltransferase NDUFAF7 homolog, mitochondrial [Momordica cha... | [more] |
XP_022980290.1 | 5.2e-248 | 92.67 | uncharacterized protein LOC111479709 isoform X1 [Cucurbita maxima] | [more] |
XP_038894541.1 | 8.8e-248 | 91.78 | uncharacterized protein LOC120083075 isoform X1 [Benincasa hispida] | [more] |
KAG7018860.1 | 2.6e-247 | 92.44 | SPAC25A8.03c [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
KAG6582471.1 | 2.6e-247 | 92.44 | hypothetical protein SDJN03_22473, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CZ34 | 1.3e-249 | 93.33 | Protein arginine methyltransferase NDUFAF7 OS=Momordica charantia OX=3673 GN=LOC... | [more] |
A0A6J1IVV3 | 2.5e-248 | 92.67 | Protein arginine methyltransferase NDUFAF7 OS=Cucurbita maxima OX=3661 GN=LOC111... | [more] |
A0A6J1EAI4 | 4.7e-247 | 92.44 | Protein arginine methyltransferase NDUFAF7 OS=Cucurbita moschata OX=3662 GN=LOC1... | [more] |
A0A0A0LZS9 | 3.1e-246 | 91.56 | Protein arginine methyltransferase NDUFAF7 OS=Cucumis sativus OX=3659 GN=Csa_1G6... | [more] |
A0A1S3BNJ4 | 1.4e-243 | 91.11 | Protein arginine methyltransferase NDUFAF7 OS=Cucumis melo OX=3656 GN=LOC1034920... | [more] |
Match Name | E-value | Identity | Description | |
AT1G04900.1 | 4.9e-196 | 75.82 | Protein of unknown function (DUF185) | [more] |