Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
AAAAAAAAAAAAAAAAAAAAAGAACAAAAACTCGCGGAAGAGAAAGAAACGAAAGCATTCGACAAATTCTATGCCATTTCTCTACACAAATTCTCCTTCGTTTCAATCCAAATCTCCATTGACACAATGATTCTGAACTTCACTTCACCATGGCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGATGGCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTAGAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCAGGGAGAAGTGGTGGTACTGTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGGTGCGCGTTTCTATGTTTGATTATCTGAAGTTCTATCTGTTTAATAATTTAAACTGATGGTGGAAAATATGGGACTAGGTATGTGAATTGTTGGAATGGTTTTCTTTTAGTTTGCGAGTTGTTAGGAAAGATGGTGGACTTTAGCCAATAGGATACATCGTTGATTTGGTTTCTTTAGCTATTAAGAAATAATGAATTTTCAATGGCCAACCGTAAGATATCTACTAATTTGCTTAATCCATTGCCATTGTAAATAGTTACAGCTTGGGAAGGTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTAAGAAACCATTTTTGTTCAGATGTGTATGAGCATATTCGTGTGCGGACATTTACATAGTTGTGATGAACATCAAACAATGTACGACAGACTTCCTGAATAGCTGTACTATGTTTGAAGGTACCGTTTGGAAAGGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGGTATAGAAGTACGTGGCTACTTTTGTGGGTTGGATTCTATCGGGCAGTGTATCTAGGATAAATGGTTGATGAACATTTCTTCAAGTTCCCTGGGAAAACTAGGTTTAGACTTTAGATATTCAGATAGAGTATGACTATGCTGCATTAGCTGTTTCTACGGCCATACTTACTATATACTAAATCATATTATTGTTGAATATCTATTTATTGCTGATTGTGAAGTATTATCTACATCATTGAACACATGTTTGTGCAATATTCATCGTGATATAGGGTCTTTATGACGTGACACTATAGGGTTGTTATAAAATAAAATGTTTCCTATGTTCATTTTCATGATTTTGGCAATTGTTTGCCTTGTTTTGAGTACAACAATTGCTGAGTGGGAAATTGAATCTCTGTTCTCTAGGGACGGATGTCGTGTCAATACAACTTTGGCAATTGGTTGCCTGCTATATTTCAAATTTCTATTAATTTGTTCTTACTTCAGATTTCTCCCATCCTTTCATGCCACTGTGTTTCCATGACTTTTGATATTGTGCACCTAATCAAGCATTTGTTCAAAGCTCCTGGTTTATTTTTGAACAAAAGCTGATGCTTTATCTCTGAGATATATGATTTGGAGACTCGAATGTTTTATATTCGAGAAATACAAATATTTTATCTGCCAATTCTTTTTTTTTTTTACAAACTACAAGTCAACACCTATAACCGAGCAAGCTTTTACCTTTGACAATCTTTTCTTCTCTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAGTATGAACTGATTTCAGGAAACA
TAGTTAATATTGAGTCAGAAGAAATTAGGTAGGTTCACCTGGATGTTGCTAACTTGATTTATTTAGTTGACTGGTACTTTAATCTCTCGTCTTATGTGTTTTTGATAATCTAGGGTTCAAGTTCTCTTGTCTTTTAAGTGCACGATCACTTTGCAGTTGTGAGCTCAATACATTCTCCCATTAGTACTATACGTTCATTGCCCACTACCCTAAACACTATGAACTTATAGTTCATGCTAATATTAAATTAATGCAATCTTATTTCGTAACTATATATTAATTATAAGACATATTTGCATTAGATGTTGATCCTCGAATATTCACTTTTTCTAAGGATCTCCTCCTGCCGAACGCAGCAAGGATACTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCAGCTGCTGTGCACTCCATTGTGGTATGTATAATGTAACCTCAGTTTGGCTTTGAACTATTTGGTTCTTTCAATTGTTTTTATCATCCAATTCCAAACATCTGCTCATTTGATCTATATCTGTTTGCTCATCAGCGGGATGAGTGATGCATGAATTATTAACGTTTCTAATATTATTGCAAATTTGCAATGATTTCATATCTTCCCCATTTCTCCATTACCTTATTGTGATCTGTTGAGTACTTACTCATCCAATGTATGTTTCATGACTAAATAATTGTGTGGCCTTAAATTATTCAAACGTGCGAAGTTGTGTACTTATCATGGAAACATCATTTTGTTCTCTTTTCCAATTATCATATCTAGCTGGTTGAATATCTAATCTTTCATCTCGCTCATGAAAATTCGGCACAACCTTGTATTTATTTGAATTTAACGATACTTGCCTACACTTTTCTTACTGATTTTCTTTTATGGCTTATTTGTGAGTTTAGTGTAATCTCATATAATGCATAGTTAAGTAGGTCTCCCGTCTCCTTAATATCTTTGAACTACTAACAATAAACCACATAATGTCGGAATTTTTACTAAATGTACATGTAGAAAAATATGCTCTCATCTCATGAATGAGTGTTACTCTCGTCAATCCCCTATTTAGTTTGCCAGCTTCCTGGTTCTCATAAAAGTTTCGAAGTCATTGATTATCCTAAGTTTTTTAGACAATCAACATACACAGTTATGTTTCATTACATTTCCTACTATTCTCTCTGTTCTTGTGCTCCTGTTACTAATTTTTTCCTTCGTCATTCATTTTAGGTACAAACTCCTTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGACGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAATTCATCCTTACTTAACCCATCGATCCTCTTTCCAGTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAACCGGGTACAGATTTCTTAGTCACTCTTTTCTGTATAACTACAAGTCTACACCTATATTTAAACCTTACTTTATTTTATCAATTTATTTTTTAATATTTACAGACTCACATTATTTTTTGCATGAGAATGTTGTCATTGATTACAGGTCTTTATACATGTGTATTTTAAAGTTTTTGCCATGTGGTATCAGTAGATGCATTCAATTATAATTTGAATGTTGTCAACTGTTGAAAATCATTTGAGATGAGTTATAAAAATTTATTCCTGCAACCCACATTTAGCAAAGCTTTATGCTAAATACTTTTTGGAGTGCACTTTATTGAATATTTCAAGATCTTTGAAAATAAAAATACT
TAACTGTTAAACATGAAAATGTCGAGCATGTAACTATAATATTGAAAAGCAGTTTTCAGTGAAAAATTGGAAGGCACTGCAAATGCAACAGTATGCCACTCTTCATTATGTACCCAAATTTCTTGTTTTCCAAAATCAATCATCTCTTCTTAAACAGAAAAAAGCTGACTAGAATTCGCTGTCATTGACAAATCACATCCTTTTGAAACAAAAATTAGGTACCTTGCTCAAATCCCTAATGTATAGTCATAAAATAGTATTTAGTTATTACACCCATATGGTGATCAGATTTTACTGTTTTGATATTATTAACTTTTCTCCTTGAATGCTTGAGTAGAACTAATTCATATGTTTCTTTGATTGTATCAGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCGGGAATTTAAAACTAGCTGCTGAAAAGGAGGTATGCATTCACATAGTCTTGACTCTTGGTCATTGCATCATGAAGTAACTTTAATTTTTAGTTCACATTCTTAACATTGAGAACATATTACATGGTTTTTAACTAAGTTGAATGCATGAGGTTTAACAACTCATCACACTCATACTAATCTTTTATTCCTGAAGAAAAACACTGGAAGTAGTCACCATTATTTTTCTCTAGTCTTCAACCGGGTTTTCTTCATATACTCTTTCACGAGTTATACAAAGATAAGTTGTCATCAAATAACGTTATAGATCTGGTTGGTGAACTTTACTTTAATTTCTTTCTCTCTTTAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGGGAACCTTCTTATCGGTGAATCCTACTAAACACTATGAATGATCCTTTCTTAATTCAATGAACTATTGCCAATGTTCATTTTCAATTGACTTATTAAGGTTGTAAAAGTATTTGGATGTTTAGTTTTTTCTTCTTTCTTGATGTCTTCCAAACTCAGTCTTTATTTTATTTTTGTTGTCTCCAGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGGTATGCCATGGCCTAAGTTATACCATTGTGCTAAGAATTGTACTTCCGGAAAGGTATGGTTACTTAGTGGTATCTTTTTTGACAGATTTCCTCGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGTGTGTGCAATTTCTGCCTTGTCATAAGTTTTAGAATGTAGAGTTATTTACCTCTGTGGCCCATTTTACAGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTAATTTTTTGTCTTCATGCCTTACTATGGATGTTGATGTCTAATTTTTTGTTCACGTTAACTGTTGGAGATCATGACTTTCTCTTACCTCATCCATACTACCATCGGTCTCTGAGTATTTTACTCGTCAACATTTTGAGAAACTTGCTCTGGTTTACTATGGTTCATTCACTATTACTGCCAGGGCTCTCCGTAGCATACCCACTGGACTTATTTTTAGTCACTTCCATCCATACGGTACTACACGAAGCACTGGACTTACCTTTTTGATCTCTCGTTCCTCATGAACTACCGTTTCATGAAGCAGTGTGAGAATACCAATAAACTGCTAAAATATTTTCTTGTTTTCACCTTGAGAATAGGTGAAACTTTGGGTGGAGGTAATGAAAGTTATATTTTGCAAGTTACATGTACGAGGAGATGGTGGAAACATTGACAATTTCTGAGGCGGTTTGAGGGAGGGGGTATTTCGGTTTTTGCGTGCATGAAATGTGGAAATGTATCAGAGGGAGGATGGTGCCGAGAATGTCGGAAAAATTAGTTATGTAGGGAGTTAGGAGAGAGTGAAAATATTATAT
TGAGAAGCTTTTGTTCAGGAGGGGTGTAAGTTCTTGCATTATCCTACTTAGCGACTTTCTGTCGAGTTTTTTCTGATTTCATTCAACATTTACATGATAGAAATCCATTCTAATTCCATCAAGTCAGTCAAGCTGCCATGGGCCTAGCCATAGAGATGGACCATATAGTGGTGCCTTCTACCACCACCACACTTAGAGCCAGCCTTACAAACAGCCGTGGTTATATGTCCCTTCGTCGTATTCAAGTCCTTGACTAGCAACAATGAACTTTTCAAACTCAATTAAGCTGGTATATCGGAACTCAGAAGAATTTCCGTGAGAGTGCACCGAAGGGTCTTCAAGCTAAGGTCTCAATCACCTATATACATTTGAATGAAATTACTACCCATATGTTGCCTTCCAAACGAATTTAGCGCATGATAGTTACTATCAAAGATTGAACCTACTATCAGCTGATGCTGGGTGTGGGTGTGACACAGAGCTATTTTTAAGTCAGATTGCATTGATCTTCAATTACATGAAAAATATATGACTTTTGTGTCATCAAAGTAGAGTTGTCTGCTGAGTGTTTAGTTGTTCACAGAGTTAGTTCGGCTACTTTTTTTAATATATGATTCGCATCGGTATTTTTACTGAACAATAAAATCCTCACTCAACACCCTTAAATTCAACGAAATACATGGCTTTGGTTTCTGTTGTCTTCTTAGTTCCTCATATTTTCATCTGAATTAGAATGGTTTCCTGATCTTTGGAGCCAATCAGCAGACACTATTTCCATCTTTGGCTAATTAATTATCTTTTTCTTCTAACCTTTTATGTCAATGCCAATTCAGAGATGGAAATTACAAGCAGAAGCCAATGATGAAGCCGAACGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAAGGGTGTTCTCGTGTTTTCAGTCCAGGTAACTGTTAGGCCCCAATTCTTCATACATCCATATCTCGATTTCCACATCACCTTGTACACGCACTACCTTCCCAGAATTACTATTTGTCAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCGTAAAGATTCTTACTTCTCAACGAAAATGCCAACCAATGTCTACTTGGTCCATGTACATGAAGTTTTGAAGCAGCATCAGGTCAGTGCTTCGTGATCAGATATCTTATTACTGAAGATCTCATTTGTTAAATTGTATTATTGTTACGAAAAAACATTGAAATCTAAGTTTCATTTGGAACTTCTCCCTCCTGAGTACTGAAATGTGATTTATCACGTAAATATAACACGTTAATGATTCTATGAATTAATCGCCATCTTTGCATTTCA
mRNA sequence
AAAAAAAAAAAAAAAAAAAAAGAACAAAAACTCGCGGAAGAGAAAGAAACGAAAGCATTCGACAAATTCTATGCCATTTCTCTACACAAATTCTCCTTCGTTTCAATCCAAATCTCCATTGACACAATGATTCTGAACTTCACTTCACCATGGCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGATGGCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTAGAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCAGGGAGAAGTGGTGGTACTGTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCTTGGGAAGGTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCGTTTGGAAAGGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAGTATGAACTGATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCAAGGATACTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCAGCTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGACGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAATTCATCCTTACTTAACCCATCGATCCTCTTTCCAGTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAACCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCGGGAATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGGGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTAATTTTTTGTCTTCATGCCTTACTATGGATGTTGATGTCTAATTTTTTGTTCACGTTAACTGTTGGAGATCATGACTTTCTCTTACCTCATCCATACTACCATCGGTCTCTGAGTATTTTACTCGTCAACATTTTGAGAAACTTGCTCTGGTTTACTATGGTTCATTCACTATTACTGCCAGGGCTCTCCGT
AGCATACCCACTGGACTTATTTTTAGTCACTTCCATCCATACGGTACTACACGAAGCACTGGACTTACCTTTTTGATCTCTCGTTCCTCATGAACTACCGTTTCATGAAGCAGTGTGAGAATACCAATAAACTGCTAAAATATTTTCTTGTTTTCACCTTGAGAATAGGTGAAACTTTGGGTGGAGGTAATGAAAGTTATATTTTGCAAGTTACATGTACGAGGAGATGGTGGAAACATTGACAATTTCTGAGGCGGTTTGAGGGAGGGGGTATTTCGGTTTTTGCGTGCATGAAATGTGGAAATGTATCAGAGGGAGGATGGTGCCGAGAATGTCGGAAAAATTAGTTATGTAGGGAGTTAGGAGAGAGTGAAAATATTATATTGAGAAGCTTTTGTTCAGGAGGGGTGTAAGTTCTTGCATTATCCTACTTAGCGACTTTCTGTCGAGTTTTTTCTGATTTCATTCAACATTTACATGATAGAAATCCATTCTAATTCCATCAAGTCAGTCAAGCTGCCATGGGCCTAGCCATAGAGATGGACCATATAGTGGTGCCTTCTACCACCACCACACTTAGAGCCAGCCTTACAAACAGCCGTGGTTATATGTCCCTTCGTCGTATTCAAGTCCTTGACTAGCAACAATGAACTTTTCAAACTCAATTAAGCTGGTATATCGGAACTCAGAAGAATTTCCGTGAGAGTGCACCGAAGGGTCTTCAAGCTAAGGTCTCAATCACCTATATACATTTGAATGAAATTACTACCCATATGTTGCCTTCCAAACGAATTTAGCGCATGATAGTTACTATCAAAGATTGAACCTACTATCAGCTGATGCTGGGTGTGGGTGTGACACAGAGCTATTTTTAAGTCAGATTGCATTGATCTTCAATTACATGAAAAATATATGACTTTTGTGTCATCAAAGTAGAGTTGTCTGCTGAGTGTTTAGTTGTTCACAGAGTTAGTTCGGCTACTTTTTTTAATATATGATTCGCATCGGTATTTTTACTGAACAATAAAATCCTCACTCAACACCCTTAAATTCAACGAAATACATGGCTTTGGTTTCTGTTGTCTTCTTAGTTCCTCATATTTTCATCTGAATTAGAATGGTTTCCTGATCTTTGGAGCCAATCAGCAGACACTATTTCCATCTTTGGCTAATTAATTATCTTTTTCTTCTAACCTTTTATGTCAATGCCAATTCAGAGATGGAAATTACAAGCAGAAGCCAATGATGAAGCCGAACGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAAGGGTGTTCTCGTGTTTTCAGTCCAGGTAACTGTTAGGCCCCAATTCTTCATACATCCATATCTCGATTTCCACATCACCTTGTACACGCACTACCTTCCCAGAATTACTATTTGTCAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCGTAAAGATTCTTACTTCTCAACGAAAATGCCAACCAATGTCTACTTGGTCCATGTACATGAAGTTTTGAAGCAGCATCAGGTCAGTGCTTCGTGATCAGATATCTTATTACTGAAGATCTCATTTGTTAAATTGTATTATTGTTACGAAAAAACATTGAAATCTAAGTTTCATTTGGAACTTCTCCCTCCTGAGTACTGAAATGTGATTTATCACGTAAATATAACACGTTAATGATTCTATGAATTAATCGCCATCTTTGCATTTCA
Coding sequence (CDS)
ATGATTCTGAACTTCACTTCACCATGGCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGATGGCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTAGAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCAGGGAGAAGTGGTGGTACTGTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCTTGGGAAGGTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCGTTTGGAAAGGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAGTATGAACTGATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCAAGGATACTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCAGCTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGACGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAATTCATCCTTACTTAACCCATCGATCCTCTTTCCAGTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAACCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCGGGAATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGGGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTAATTTTTTGTCTTCATGCCTTACTATGGATGTTGATGTCTAATTTTTTGTTCACGTTAACTGTTGGAGATCATGACTTTCTCTTACCTCATCCATACTACCATCGGTCTCTGAGTATTTTACTCGTCAACATTTTGAGAAACTTGCTCTGGTTTACTATGGTTCATTCACTATTACTGCCAGGGCTCTCCGTAGCATACCCACTGGACTTATTTTTAGTCACTTCCATCCATACGGTACTACACGAAGCACTGGACTTACCTTTTTGA
Protein sequence
MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEEACLYHCELIFCLHALLWMLMSNFLFTLTVGDHDFLLPHPYYHRSLSILLVNILRNLLWFTMVHSLLLPGLSVAYPLDLFLVTSIHTVLHEALDLPF
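The protein sequence above appears to be the frame-0 translation of the CDS under the standard genetic code. A minimal sketch of that translation follows. It assumes NCBI translation table 1, and the names `CODON_TABLE` and `translate` are illustrative, not part of any tool used to produce this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# first base varying slowest and third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGATTCTGAACTTCACTTCACCATGGCTC"))  # MILNFTSPWL
```

Passing the full CDS string to `translate` gives a sequence that can be compared with the protein sequence listed above.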
Homology
BLAST of PI0003454 vs. ExPASy TrEMBL
Match:
A0A0A0KZV4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052610 PE=4 SV=1)
HSP 1 Score: 1013.8 bits (2620), Expect = 3.1e-292
Identity = 535/553 (96.75%), Positives = 543/553 (98.19%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP LTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHA FAFTSFSKS+RVR SL
Sbjct: 1 MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 61 SGSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAAS 180
+AALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLI+GLAAS
Sbjct: 121 SAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAAS 180
Query: 181 LRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKY 240
LRVSDALRMIEIICRVGV+PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKY
Sbjct: 181 LRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKY 240
Query: 241 ELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE 300
ELISGNIVNIESEEI DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Sbjct: 241 ELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE 300
Query: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV
Sbjct: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
Query: 361 PAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQ 420
P KENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLL+VAGFASLAAGATLNSLILPQ
Sbjct: 361 PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQ 420
Query: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
FNRLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPS
Sbjct: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
Query: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE
Sbjct: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
Query: 541 QIEQIMVLENLEE 554
QIEQIMVLENLEE
Sbjct: 541 QIEQIMVLENLEE 553
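The Identity figure in an HSP header is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length. The sketch below recomputes that for one row of the alignment above and checks the reported percentages. The function names are hypothetical. Positives cannot be recomputed this way, because they depend on the substitution matrix used for the search, which this page does not state.

```python
def alignment_stats(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A gap character never counts as an identity.
    identical = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return identical, len(query)


def percent(matches: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * matches / length, 2)


# Residues 1-60 of the HSP against A0A0A0KZV4, copied from the alignment above.
query = "MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL"
sbjct = "MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSL"

print(alignment_stats(query, sbjct))  # (56, 60)
print(percent(535, 553))              # 96.75
print(percent(543, 553))              # 98.19
```

Summing the per-row identities over every row of an HSP should reproduce the numerator reported in its header.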
BLAST of PI0003454 vs. ExPASy TrEMBL
Match:
A0A1S3BTU3 (uncharacterized protein LOC103493103 OS=Cucumis melo OX=3656 GN=LOC103493103 PE=4 SV=1)
HSP 1 Score: 1010.7 bits (2612), Expect = 2.6e-291
Identity = 538/555 (96.94%), Positives = 544/555 (98.02%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SL
Sbjct: 1 MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 61 SGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLA 180
AAALKRNNPELALSVFYAMRSTFYQ AWE VNENASIVERWKWSRPDVHVYTLLIQGLA
Sbjct: 121 AAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLA 180
Query: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY
Sbjct: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
Query: 241 KYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
KYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Sbjct: 241 KYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA 300
Query: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLL 360
TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLL
Sbjct: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLL 360
Query: 361 RVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
RVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL
Sbjct: 361 RVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
Query: 421 PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGE 480
PQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGE
Sbjct: 421 PQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGE 480
Query: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV
Sbjct: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
Query: 541 SEQIEQIMVLENLEE 554
SEQIEQIM LENLEE
Sbjct: 541 SEQIEQIMALENLEE 555
BLAST of PI0003454 vs. ExPASy TrEMBL
Match:
A0A5A7TPS5 (Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001910 PE=4 SV=1)
HSP 1 Score: 982.6 bits (2539), Expect = 7.7e-283
Identity = 522/539 (96.85%), Positives = 528/539 (97.96%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SL
Sbjct: 310 MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSL 369
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 370 SGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 429
Query: 121 AAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLA 180
AAALKRNNPELALSVFYAMRSTFYQ AWE VNENASIVERWKWSRPDVHVYTLLIQGLA
Sbjct: 430 AAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLA 489
Query: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY
Sbjct: 490 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 549
Query: 241 KYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
KYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Sbjct: 550 KYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA 609
Query: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLL 360
TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLL
Sbjct: 610 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLL 669
Query: 361 RVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
RVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL
Sbjct: 670 RVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 729
Query: 421 PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGE 480
PQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGE
Sbjct: 730 PQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGE 789
Query: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 538
PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Sbjct: 790 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVR 848
BLAST of PI0003454 vs. ExPASy TrEMBL
Match:
A0A6J1EYW6 (uncharacterized protein LOC111437671 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437671 PE=4 SV=1)
HSP 1 Score: 954.1 bits (2465), Expect = 2.9e-274
Identity = 501/554 (90.43%), Positives = 529/554 (95.49%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRL-PPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRAS 60
MIL+ +SPWLT+TRL PPPKL+EPLAS++NG +V MPLLLCSHALF FTSFSKS RVRAS
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSHALFRFTSFSKSTRVRAS 60
Query: 61 LSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLI 120
L+ S+IDGAAAFENPVSELLDDELI VVSGAKDADE L +I DKSGR+GGTVSV DCRLI
Sbjct: 61 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLI 120
Query: 121 IAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAA 180
IAAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLIQGLAA
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 180
Query: 181 SLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYK 240
SLRVSDALR+IEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+
Sbjct: 181 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQ 240
Query: 241 YELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFAT 300
YELISGNIVNIESEEIS DTPAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKFAT
Sbjct: 241 YELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 300
Query: 301 ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLR 360
ETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLLLR
Sbjct: 301 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 360
Query: 361 VPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILP 420
VPAKE S LL PS LFP+I+LS AGD +SGV+DPSLP+LLLVAGFASLAAGATLNS ILP
Sbjct: 361 VPAKETSFLLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFILP 420
Query: 421 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEP 480
QFNRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEP
Sbjct: 421 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 480
Query: 481 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 540
SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS
Sbjct: 481 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 540
Query: 541 EQIEQIMVLENLEE 554
EQIEQIMVLENLEE
Sbjct: 541 EQIEQIMVLENLEE 554
BLAST of PI0003454 vs. ExPASy TrEMBL
Match:
A0A6J1JDG3 (uncharacterized protein LOC111483407 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483407 PE=4 SV=1)
HSP 1 Score: 950.3 bits (2455), Expect = 4.2e-273
Identity = 497/553 (89.87%), Positives = 528/553 (95.48%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MIL+ +SPWLT+TRLP PKL+EPLAS++NG +V MPLLLCSHA F FTSFS+S RVRASL
Sbjct: 1 MILHLSSPWLTITRLPHPKLIEPLASASNGTSVLMPLLLCSHAFFRFTSFSQSTRVRASL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
+ S+IDGAAAFENPVS+LLDDELI VVSGAKDADE L MI +KSGR+GGTVSV DCRLII
Sbjct: 61 NVSNIDGAAAFENPVSDLLDDELICVVSGAKDADEVLRMIAEKSGRNGGTVSVPDCRLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAAS 180
AAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLIQGLAAS
Sbjct: 121 AAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAAS 180
Query: 181 LRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKY 240
LRVSDALR+IEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+Y
Sbjct: 181 LRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQY 240
Query: 241 ELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE 300
ELISGNIVNIESEEIS DTPAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKFATE
Sbjct: 241 ELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATE 300
Query: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
TADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLL+RV
Sbjct: 301 TADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLIRV 360
Query: 361 PAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQ 420
PAKE S LL PS LFP+I+LS AGDAASGV+DPSLP++LLVAGFASLAAGATLNS ILPQ
Sbjct: 361 PAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRMLLVAGFASLAAGATLNSFILPQ 420
Query: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
FNRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPS
Sbjct: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
Query: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE
Sbjct: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
Query: 541 QIEQIMVLENLEE 554
QIEQIMVLENLEE
Sbjct: 541 QIEQIMVLENLEE 553
BLAST of PI0003454 vs. NCBI nr
Match:
XP_004148995.1 (uncharacterized protein LOC101209802 [Cucumis sativus] >KAE8649210.1 hypothetical protein Csa_014779 [Cucumis sativus])
HSP 1 Score: 1013.8 bits (2620), Expect = 6.5e-292
Identity = 535/553 (96.75%), Positives = 543/553 (98.19%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP LTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHA FAFTSFSKS+RVR SL
Sbjct: 1 MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 61 SGSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAAS 180
+AALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLI+GLAAS
Sbjct: 121 SAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAAS 180
Query: 181 LRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKY 240
LRVSDALRMIEIICRVGV+PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKY
Sbjct: 181 LRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKY 240
Query: 241 ELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE 300
ELISGNIVNIESEEI DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Sbjct: 241 ELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE 300
Query: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV
Sbjct: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
Query: 361 PAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQ 420
P KENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLL+VAGFASLAAGATLNSLILPQ
Sbjct: 361 PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQ 420
Query: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
FNRLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPS
Sbjct: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
Query: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE
Sbjct: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
Query: 541 QIEQIMVLENLEE 554
QIEQIMVLENLEE
Sbjct: 541 QIEQIMVLENLEE 553
BLAST of PI0003454 vs. NCBI nr
Match:
XP_008451955.1 (PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo])
HSP 1 Score: 1010.7 bits (2612), Expect = 5.5e-291
Identity = 538/555 (96.94%), Positives = 544/555 (98.02%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SL
Sbjct: 1 MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 61 SGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLA 180
AAALKRNNPELALSVFYAMRSTFYQ AWE VNENASIVERWKWSRPDVHVYTLLIQGLA
Sbjct: 121 AAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLA 180
Query: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY
Sbjct: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
Query: 241 KYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
KYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Sbjct: 241 KYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA 300
Query: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLL 360
TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLL
Sbjct: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLL 360
Query: 361 RVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
RVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL
Sbjct: 361 RVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
Query: 421 PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGE 480
PQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGE
Sbjct: 421 PQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGE 480
Query: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV
Sbjct: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
Query: 541 SEQIEQIMVLENLEE 554
SEQIEQIM LENLEE
Sbjct: 541 SEQIEQIMALENLEE 555
BLAST of PI0003454 vs. NCBI nr
Match:
KAA0044898.1 (Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa])
HSP 1 Score: 982.6 bits (2539), Expect = 1.6e-282
Identity = 522/539 (96.85%), Positives = 528/539 (97.96%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SL
Sbjct: 310 MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSL 369
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII
Sbjct: 370 SGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 429
Query: 121 AAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLA 180
AAALKRNNPELALSVFYAMRSTFYQ AWE VNENASIVERWKWSRPDVHVYTLLIQGLA
Sbjct: 430 AAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLA 489
Query: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY
Sbjct: 490 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 549
Query: 241 KYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
KYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Sbjct: 550 KYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA 609
Query: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLL 360
TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLL
Sbjct: 610 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLL 669
Query: 361 RVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
RVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL
Sbjct: 670 RVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 729
Query: 421 PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGE 480
PQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGE
Sbjct: 730 PQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGE 789
Query: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 538
PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Sbjct: 790 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVR 848
BLAST of PI0003454 vs. NCBI nr
Match:
XP_038895181.1 (uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida])
HSP 1 Score: 981.9 bits (2537), Expect = 2.7e-282
Identity = 519/553 (93.85%), Positives = 531/553 (96.02%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILN TSPWL +TRLPPPKL EPLAS+TNGATV MPLLLCSHALFAFTSFSKSM+VRASL
Sbjct: 1 MILNLTSPWLNITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMQVRASL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDGAAAFENPVS+LL +ELIR VSGAKDADEAL MI DKSGRSGGTVS SDC LII
Sbjct: 61 SGSDIDGAAAFENPVSDLLHNELIRAVSGAKDADEALRMIADKSGRSGGTVSASDCCLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAAS 180
AAALK NNPELALSVFYAMRSTFYQAWEGVNENAS VERWKWSRPDVHVYTLLIQGLAAS
Sbjct: 121 AAALKCNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAAS 180
Query: 181 LRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKY 240
LRVSDALRMIEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKY
Sbjct: 181 LRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCARCRYKY 240
Query: 241 ELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE 300
ELISGNIVNI+SEEIS DTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
Sbjct: 241 ELISGNIVNIDSEEISMDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE 300
Query: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV 360
TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRV
Sbjct: 301 TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRV 360
Query: 361 PAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQ 420
PAK SSLLNPS LFP+IVLSAAGDAASGV+DPSLPQLLLVAG ASLAAGATLNSLILPQ
Sbjct: 361 PAKGTSSLLNPSTLFPLIVLSAAGDAASGVLDPSLPQLLLVAGLASLAAGATLNSLILPQ 420
Query: 421 FNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
NRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPS
Sbjct: 421 INRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPS 480
Query: 481 YRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
YRARRSRI+KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE
Sbjct: 481 YRARRSRIRKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSE 540
Query: 541 QIEQIMVLENLEE 554
QIEQIM LENLEE
Sbjct: 541 QIEQIMALENLEE 553
BLAST of PI0003454 vs. NCBI nr
Match:
XP_038895173.1 (uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida])
HSP 1 Score: 976.9 bits (2524), Expect = 8.8e-281
Identity = 519/555 (93.51%), Positives = 531/555 (95.68%), Query Frame = 0
Query: 1 MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASL 60
MILN TSPWL +TRLPPPKL EPLAS+TNGATV MPLLLCSHALFAFTSFSKSM+VRASL
Sbjct: 1 MILNLTSPWLNITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMQVRASL 60
Query: 61 SGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGTVSVSDCRLII 120
SGSDIDGAAAFENPVS+LL +ELIR VSGAKDADEAL MI DKSGRSGGTVS SDC LII
Sbjct: 61 SGSDIDGAAAFENPVSDLLHNELIRAVSGAKDADEALRMIADKSGRSGGTVSASDCCLII 120
Query: 121 AAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLA 180
AAALK NNPELALSVFYAMRSTFYQ AWEGVNENAS VERWKWSRPDVHVYTLLIQGLA
Sbjct: 121 AAALKCNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLA 180
Query: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRY 240
ASLRVSDALRMIEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRY
Sbjct: 181 ASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCARCRY 240
Query: 241 KYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
KYELISGNIVNI+SEEIS DTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
Sbjct: 241 KYELISGNIVNIDSEEISMDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA 300
Query: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLL 360
TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLL
Sbjct: 301 TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLL 360
Query: 361 RVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLIL 420
RVPAK SSLLNPS LFP+IVLSAAGDAASGV+DPSLPQLLLVAG ASLAAGATLNSLIL
Sbjct: 361 RVPAKGTSSLLNPSTLFPLIVLSAAGDAASGVLDPSLPQLLLVAGLASLAAGATLNSLIL 420
Query: 421 PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGE 480
PQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGE
Sbjct: 421 PQINRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGE 480
Query: 481 PSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
PSYRARRSRI+KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV
Sbjct: 481 PSYRARRSRIRKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV 540
Query: 541 SEQIEQIMVLENLEE 554
SEQIEQIM LENLEE
Sbjct: 541 SEQIEQIMALENLEE 555
BLAST of PI0003454 vs. TAIR 10
Match:
AT1G64430.1 (Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 629.4 bits (1622), Expect = 3.2e-180
Identity = 321/489 (65.64%), Positives = 393/489 (80.37%), Query Frame = 0
Query: 66 DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAAL 125
D + + S +LDDEL+ VS +DADEAL MI D+ G + GG V + DCR II+AA+
Sbjct: 58 DSVGSAADVSSSILDDELLSSVSAVRDADEALAMISDRFGSNRGGIVELEDCRSIISAAV 117
Query: 126 KRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVS 185
R N +LALS+FY MR++F + S +RW WSRPDV VYT+L+ GLAASLRVS
Sbjct: 118 SRGNVDLALSIFYTMRASF-------DLGGSDNDRWSWSRPDVEVYTMLVNGLAASLRVS 177
Query: 186 DALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELIS 245
D+LR+I ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL S
Sbjct: 178 DSLRIIRDICRVGISPAEEVPFGKIVRCPSCLIAIAVAQPQHGVQIVSCANCRYQYELFS 237
Query: 246 GNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADL 305
G+I +I+SEE+ KD P WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+L
Sbjct: 238 GDITSIDSEELGKDIPLWEKGLRLIQIKKNKITSSVHSIVVQTPSGTARTHRFATETAEL 297
Query: 306 PAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE 365
PA+EGERVTIA+AAPSNV+R+VGP KF K PN Y GE M LT H DGRES+LLR P+K+
Sbjct: 298 PAQEGERVTIASAAPSNVYRQVGPFKFISKAPNFYPGEPMSLTKHKDGRESILLRPPSKD 357
Query: 366 NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRL 425
+L PS L P++ + A GDAASGVIDPSLPQLL VA SLA GAT+NS +LP+ N+L
Sbjct: 358 GDKILQPSFLIPLLAILATGDAASGVIDPSLPQLLSVATVTSLAIGATVNSFVLPKLNQL 417
Query: 426 PQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRAR 485
P+R+VD++ IKQQLLSQY+VLQ RI +LK A EKEVWMLARMCQLENKI AVGEP+YR R
Sbjct: 418 PERTVDVVGIKQQLLSQYDVLQRRIRDLKEAVEKEVWMLARMCQLENKILAVGEPAYRTR 477
Query: 486 RSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQ 545
R+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQ
Sbjct: 478 RTRVKKVRESLENSIKGKIDLIDSYARISSMIEIEVEMDSDVLAAEAVNNTENIAQQIEQ 537
Query: 546 IMVLENLEE 554
IM LENLEE
Sbjct: 538 IMELENLEE 539
BLAST of PI0003454 vs. TAIR 10
Match:
AT1G64430.2 (Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 629.4 bits (1622), Expect = 3.2e-180
Identity = 321/489 (65.64%), Positives = 393/489 (80.37%), Query Frame = 0
Query: 66 DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAAL 125
D + + S +LDDEL+ VS +DADEAL MI D+ G + GG V + DCR II+AA+
Sbjct: 58 DSVGSAADVSSSILDDELLSSVSAVRDADEALAMISDRFGSNRGGIVELEDCRSIISAAV 117
Query: 126 KRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVS 185
R N +LALS+FY MR++F + S +RW WSRPDV VYT+L+ GLAASLRVS
Sbjct: 118 SRGNVDLALSIFYTMRASF-------DLGGSDNDRWSWSRPDVEVYTMLVNGLAASLRVS 177
Query: 186 DALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELIS 245
D+LR+I ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL S
Sbjct: 178 DSLRIIRDICRVGISPAEEVPFGKIVRCPSCLIAIAVAQPQHGVQIVSCANCRYQYELFS 237
Query: 246 GNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADL 305
G+I +I+SEE+ KD P WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+L
Sbjct: 238 GDITSIDSEELGKDIPLWEKGLRLIQIKKNKITSSVHSIVVQTPSGTARTHRFATETAEL 297
Query: 306 PAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE 365
PA+EGERVTIA+AAPSNV+R+VGP KF K PN Y GE M LT H DGRES+LLR P+K+
Sbjct: 298 PAQEGERVTIASAAPSNVYRQVGPFKFISKAPNFYPGEPMSLTKHKDGRESILLRPPSKD 357
Query: 366 NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRL 425
+L PS L P++ + A GDAASGVIDPSLPQLL VA SLA GAT+NS +LP+ N+L
Sbjct: 358 GDKILQPSFLIPLLAILATGDAASGVIDPSLPQLLSVATVTSLAIGATVNSFVLPKLNQL 417
Query: 426 PQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRAR 485
P+R+VD++ IKQQLLSQY+VLQ RI +LK A EKEVWMLARMCQLENKI AVGEP+YR R
Sbjct: 418 PERTVDVVGIKQQLLSQYDVLQRRIRDLKEAVEKEVWMLARMCQLENKILAVGEPAYRTR 477
Query: 486 RSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQ 545
R+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQ
Sbjct: 478 RTRVKKVRESLENSIKGKIDLIDSYARISSMIEIEVEMDSDVLAAEAVNNTENIAQQIEQ 537
Query: 546 IMVLENLEE 554
IM LENLEE
Sbjct: 538 IMELENLEE 539
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A0A0KZV4 | 3.1e-292 | 96.75 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052610 PE=4 SV=1
A0A1S3BTU3 | 2.6e-291 | 96.94 | uncharacterized protein LOC103493103 OS=Cucumis melo OX=3656 GN=LOC103493103 PE=...
A0A5A7TPS5 | 7.7e-283 | 96.85 | Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Cucumis melo var...
A0A6J1EYW6 | 2.9e-274 | 90.43 | uncharacterized protein LOC111437671 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JDG3 | 4.2e-273 | 89.87 | uncharacterized protein LOC111483407 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity (%) | Description
XP_004148995.1 | 6.5e-292 | 96.75 | uncharacterized protein LOC101209802 [Cucumis sativus] >KAE8649210.1 hypothetica...
XP_008451955.1 | 5.5e-291 | 96.94 | PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]
KAA0044898.1 | 1.6e-282 | 96.85 | Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. ...
XP_038895181.1 | 2.7e-282 | 93.85 | uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]
XP_038895173.1 | 8.8e-281 | 93.51 | uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
AT1G64430.1 | 3.2e-180 | 65.64 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G64430.2 | 3.2e-180 | 65.64 | Pentatricopeptide repeat (PPR) superfamily protein
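The identity values in these tables can be cross-checked against the per-HSP statistics lines in the alignments (for example, 519 identical residues over 553 aligned positions for XP_038895181.1). The following is a minimal Python sketch of that check, not part of the database's own tooling; the function names `parse_hsp_stats` and `percent` are illustrative assumptions, and the pattern accepts both the "Positives" spelling and the "Postives" variant that some report renderings emit.

```python
import re

# Pattern for a per-HSP statistics line as shown in the report, e.g.
# "Identity = 519/553 (93.85%), Positives = 531/553 (96.02%), Query Frame = 0"
# "Posi?tives" also tolerates the misspelled "Postives" variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 519/553 (93.85%), Positives = 531/553 (96.02%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Recompute the percentages from the raw counts and compare with the report.
    print(percent(stats["identical"], stats["aligned_length"]),
          stats["reported_identity_pct"])
    print(percent(stats["positives"], stats["aligned_length"]),
          stats["reported_positives_pct"])
```

Recomputing from the raw counts is useful because the summary tables list only the rounded identity percentage, while the alignment length differs between hits (553, 555, and 489 positions in the alignments shown here).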