Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGATGACCCGCCCACATGCAGCCATTGCCCAACCTCGTCATCACGACCTCTCATTCCTCATCCTTTATCCGCCAAAATCCCCACTCTAGAGAGAGAAAAAAAGCACAGAGGAGAGAGAAAGCATTCGACGAATTCTATGCCATTTTCCCTCGTTTCAATCCAAAGCTCCACTGTACAATGATTCTGCACTTGAGTTCACCATGGCTCACTATCACTCGCCTCCCTCCTCCTCCAAAACTCATCGAACCACTCGCCTCTGCAAGCAATGGCACTAGCGTCCTCATGCCTCTCCTTCTATGCTCCAAGTCGACACGAGTTAGAGCTTCTTTAAATGCCAGCAACATCGATGGCGCCGCAGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTGGTGTTGTTTCGGGTGCTAAGGATGCCGATGAAGTGCTGCGGTTGATCGCCGATAAGTCAGGGAGAAGTGGAGGTACTGTGTCTGTTCCGGACTGTCGTTTGATTATTGCGGCTGCACTGAAGCGTAACAATTCGGAGCTTGCTCTGTCCGTGTTCTACGCAATGCGCTCCAGTTTCTATGAAGGTGTGTGCCTGTGACTTGTTCGTTTTTATGTTTGTGTATTCGAAGTTCCATCTGTTTAATAATTAAAGCTAATGCTGCAAAGTATGTGACTAAGTATATGGATTGGTTTCTTTAAATTAGTGAGTAGTTAAGAAAGACGTTTAACTTTGGCCGCTGGGATTCGTCCCTGATTTGGTGTCTTTTACAATGCTTCAATGATTTCTTCATGTCGTTTGACGCTACTTTGTTTCAGTGTCTTCGTCTATACTACAGATCAAGCTGATATCGTGTCCTCGTTACTTTAAGGGCACTAAGCAGGAAGAGTATGCAATTAGGATAATCACTGAATAAGATTGATTACGTGCTTTGATTGAGATTAGGAAGAAGAGGAGCCTAGCAGGGTTCTTAGTGTTGTAGGCATGTATAGTTGCTAAGTTTGGGGCGGCATTCACTGAAATTTCTTTACATTAATTTTCAATGCCCGACGGTAATATATCTGCTAATTTGCTAATCCATTGCCATTGTAAATAGTTACAGCATGGGAGGGTGTTAATGACAATGTTTCCTCTGTTGAGAGATGGAAATGGGCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCTCTTAGGATTATCGAGATTATTTGCCGAGTTGGTGTATCACCTGCTGAGGAGGTGAGAAACTAATCTTTGTTCAGATATGTATGTGTTCTTCATAGGCAGACATTTACGCAGTTGTTTGCTACTGCATTGTGATGAACATAAAAAATGGTAAGACAGGCTTCCTGAATAGCTGTAAAATGTTTGAAGGTCCCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCGGTTGCACAACCCCAGCACGGTATTCAGGTATGGAAGGACACGGCTGCTTTTGTTGATTGCATTCTATCACGGAGTGTCTCTAGAATTAATGTTTGATGAAAATCTCTTCAAGTTCCCTGGGAAAACTAGGATTAGATAGAGCGTGACTATACTGCATTAGCTGTTTCAATGACAATAACTACTAGATTAAATCATATTATAGTCTTTTAGTTTGATTATTAATTATTTTCTACATCATAACAACACATGCTCGTCCAATGTCTGTCATGATATAGGACATTCACGCCGTGAAAAGATAGGATGGATATAAAATAGAACATCTCCTTTGTTTATTTTCATGATTTGGGCAATTATTTGCCTTCTGTATTACAAATTTCTATTAATTTGTTCCTACTTCAGAATTCTCCCTTCCATTCGTGGCACTATGAAGTACTGTAGATTTTGTCTTGTGGCCTTTGCCAGTGGACATACCACAAAGCCACTATAGTTTCCATGACTTTGATGTTGTGCACCTAATTAAGCATATGTTCAGTGCTCCTGGTTTATTTTTGAACAAATGCTGATGCTTTATCTCTGAGATATGTGGTTTAGTAGCTTGAATGTTTTATCCCTGAGAAATGCAAATATTCTTATCTCCCAATTTTTTTTTTTTATGAACTACACGTGTCCCTCTGCAACCTCCTACCTTTGACAATGATTTTTTTTTTTTTCTTTTTTTGCAGATTGTATCCTGTGCAAAGTGCCGCTACCAATATGAACTTATTTCAGGAAATATAGTTAGTATTGAGTCAGAAGAAATTAGGTACGTTTACTTGCATATCACAAACTTTATTTATTTAGTTTGCTAGTACTTTAAGCCCTTTTCTTTTGTTTTGTGCTTTCAACAAGCTTTGGATTCTTATGAACGAAAAAGGCTATAGGTTCAAGTTCTGAAATGTCTTTTAATATCACACTTCCATTGCCCACTACCCTGAACACTATGCACTTATGGTTCAGGATAATATTCAATTATTGCAAACCTTATTACTTAATTGTATATTAATGACAAAACAAATTTGCATTAGATGTTGATCCTGGAATATTCACTTTTTCTAAGGATCTCCTCCTACCGAACGCAGCATGGATACTCCAGCATGGGAAAAAGCACTCCGGTTCTTGAATGTAATGAAGCAAAAACTCCCTGCTGCTGTGCACTCCATTGTGGTATGCATAACTCAGTTTGACTTTGAACTGTCTGGTTCGTTTCCATGCTTTTGTTATCATCTACAAATATTTGCTCATTTGATCTGGTCAGCATCAGCGAGATGAGTAGATGCATGATTATTATATTTCTAATTTTATTCCAAATTTGCAATAATTTCTCCCCCTTTCCACTACCTTTTGTGTCTGTTGAGTAATTCATCCATCGTACGTTTCACTAATAGTTAATTGTGCGGCCTTAAAAGTATTCAACCGCGCAGAGTTGTGTACTTATCATAGAAACGTCATTTTGTTCTCTTCTCCCACTATCATATCTAGGTGGTTGAGTAACTAATTTTTCATCTCTCTCATGAAAACTTGGCACATCATGGTTTATTTGTAGGGTCTATTTTTAAAGCATATACATAGATCAGTAGGTCTCATTAATGTCTTTGAACTACTAAAAATAAACCACGTAAGGTAGGAGTTTTTACTGACTGGTATATAGGAAAATATGCTATTCATCTCAGGAATAAGTGTTACTCCTGTTAGTCACCTAATTAGTTTGCTAGCTTGCTGGTTCTGATGAAAGTTTCTAAGTAATCTGGATATTATTATTTCCCTGCCTTGATTATCCAAAGTTTTTGAGGCAATTAACATATAATTCTCTCTGTTCTCGTGTTCCTGTTACTAATTTTTTCCTTTGTCATTCTCTGTAGGTACAAACTCCTTCTGGAGTAGCACGAACCCAGAAGTTTGCCACTGAAACAGCAGATCTTCCAGCACGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATACAGAGAAGTTGGTCCAATTAAATTTAGTCCAAAGGATCCAAATTTGTACTCTGGTGAGCCTATGTGCTTGACAAATCATTCAGATGGCCGAGAATCACTATTGCTAAGAGTGCCAGCAAAGGAAACCTCATTCTTACTTAAACCGTCAGCCCTCTTTCCACTCATACTTTTATCTGTCGCTGGAGATGCTGCCTCTGGAGTTCTTGACCCCAGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACTTTGAATTCATTTATTTTGCCTCAATTCAATCGGGTACAGATTTCTTTAATCACTATTTTCTGTATTGCTACACCTATGTTTAAACCTCATTTTATATTTCAATATTTACACACGCACATTACTTTTTGCATAAGGTTGTTGTCACTGATGCCATGTCTTTGTACGTGTGTATTTTTAAATTTTGGCTTGTGGTATCGATAGACGCATTAAATTATCATTTGAATGTTGTCAACTGCTGAAAATGACTTGAGATGAGTTTAAAAAAATTTATTCCTGCATCCCACTTTTAGCAAAGCTTCATGCTAAATACTTTTTGGAGTGCACTTTAAATTCTCTGAACATTTTTCAAAATCTTTAAGAATAAGAATGCCTAACAGTTGAACATGAAAATTTCGAGCATGTACGTATAATGCTGAAACACAGTTTCCATTGAGAAATTGAAAAGCACTCCAAACGTAACAGTACTCTGCCTTTTGTACCCAAATTTCTTGTTTTCAAAATTGATCTTCTTTTCTTTAACCAAAAAAAGCCTGTTTTTCTTGATGGAATGCTTAAAAAAGATTCCATTATTTTAGACTTTGTATTGTTTATTTTCCTACTCAATTTTCTTCTTTTAAATATTGCCTAAAAATTTGTACAAATTCCGCTGCCATTAACAAGTTATGCCCTCGTGAACCAAAAATTGGGTACCTTGCTCAAATACCTAATGAATATCTGATTATAAAATAGTATTTAGTTATTACACCCATATAGTGCCCATATTTTTCTTCTCTAATATTATTATTAACTTTTCTCCTCGAATGCTTGATTAGAACTAATTCATATGTTTCTGACTGTCAGCTTCCTCAACGATCAGTTGATATCATTGCCATCAAACAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGCACTCACATCGTCTTGGTCATGGCATCATGAAGTAAATTTAATTATTAATTCATATCTCAACATTAACTACATATTATATGGTTAGCTAAATTGAATACATGAGGTTAACAATTTGTCGCGCTTATACTAATCTTTTATTACTGAAGAAAAACATTGGAAGTTATTACCAGTATTGTTCTCTAATCTGTAGCCTGGGTCTTCACTTACGCACTCATCAGTGATACAAAGATTGGTTATCATCAAATAACTGAAATCATGTTCTTTTCATTTTGTTATTATAGCATTATAGATCTGGTTATTGAAATTCACTTGAATTTCTGTCCTCTCTGTAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAATAAAATTTTTGCCGTTGGAGAACCTTCTTACCGGTGAATCCTACTAAACACTATGACTGGTCCTTTCTTAATTCAATGAACTATTGCCAATGCTCATTTTCAATTGACTTATTAAGGTTGTAAAAATATTTGGAATTTTCATTTTTCCATCTTTCTCAATGTCCTCCAAACTCAATTTTATATTATTGTCTCCAGTGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGGTATGTATAAGACATATCACTGTGTTAAGAATCCTACTCCCGGAAGGGCATGGTTACTAAGTGGTATCTTTTTTGACAGATTTCCTCAATGATTGAGATAGAAGTTGAAATGGAGTCTGATGTTATAGCTGCTGAAGCAGCCAGCAGTGTGGTGCGTTCAGTTTCTGTCTTATCATAAGTTTTAGAAGGAATAGAGTTATTTAACTTTGTGAACCATTTTACAGGAAAGGGTTTCTGAACAGATTGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGGTATGTCTATGAATTAGTTTTTTGTTTGCATGCGTAACTATTTCCCCCCCCCTATTTTTCTATTTTGGCAGAGGGATAGCTTGTCTGTTTACTTAGTTGTGATGTTGAAGTATTGCTTGAAGTCTAATAGACCCCAGTTTAAGTTAACTGTTTATCGTGTCTTTCTCTAACTGCATCCATACTACCGGTAGTATATTACTTGTCAACATTTTGAGAAACTTTCTCTTGTTTACTATATTCACTATTACTGCCAGAGTTTGCCAGGTGGTATACCCACTGGACTTACCTTCAATCACTTCCATCCACACGGTATTACATGGAGTACTGGACTTGCCTTTTCTATCTCCCATTCCTCATAACTGAGTTATATTATATTTAGGGAGTTAGGAGAGAAGGATAATGCTATGTTCAGAAACTTTTGTTCTTGGGAAATGGGAGGGGTGTAAGTTCACGATCCTAATAGTACTTTCTTTTGAGTTTTTTCAGATTTCATTTAACAATAGGTGATAGAATTCCGTTCTTAGATGGACCAGATAGTGGTGCCCTCTGCCACCACCACACTTAGAGCCAGCCTTACAAACCACCGGTTGTATTCCCCTTCATCATAATCAAGCCCTTGGCTAGCAAGGCTGTTTATAAGTCACATTGCATTGATTTTTGATTGCATGAAAATTGACTTTGTGTAACCAAAGTTGAGTGTCTGCAGAGTTTTTAGTTCCTTCACAGAAAACAGTAAGTTTGACTACTTTTTAAATATGTGATTCGCATTGTTTTTTTTGTTTTTGTTTTAATAATGAAATCCTCACTTAATACCCTTAGTTTCAGAACGAATAAATGGTTTTGGTTTCTGTTGTCTTTGTTAGTGAAAGAGGGATTGATTGAATGATTGATGAGAAGATTGAATTATGAAGAGCAAGAATGCTCCATATTTATACAGAGATAACAAAAGATATTCAGAAAATCTACCTAAAAATCTACAGAAAATCTGGTAACAAACGAGTAAACAAATCAAAATAAAATCTAATCAAATCAAAACTAAATCAAATAAAAAACTAGATCTAACTCCTAACGGAGATTTAATTAGATACTAACTGACAGCATTCTAAACAAATACGTGATCTTTAACAGTCTTCTTAGTTCCTCGTGGTTTCTTCTGGACTATAATGGTTTCCTCATTTTTGGAGCCAACCAGCAGACGCTATTTCCATCTTTTGGCTAATTAATTGTTTTTTCTTCTAATCTTTTATGTCAATACCAATGCAGAGATGGAGATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTTTCAACCAATCAATGCCAACGGAAGAGGTTTAAACAGGCTTGTGCTCATGTCTTCAATCCAGGTAACATTTATACCTCTTCTCCACATCGCCTTGTACCTTCACTTCTTACCCAGACTTGCAATTTGTCAAATCAATCTAGGAGTATGGACACCTCCATCACGAGTCCAGATACTTCCCAAACCTTCTTGCTTGTGAAAATGTCAATCAATCTCTACTTGGTGCATGTACATGAAGTTTTGAAGCAGCATCAGGTCAGTGCTTCGTGGTCAGATGTCTCATTACTGAACATCTCATTTGTTAAATTGTATCATTAAGTTTCTTTGGAACTTTTCTCTCCTGAGTGCTGAAATGTTATTTATTTACCACGTAAATATAACACATTATTGCTTCTATTGGATGAATACCCGTCTTTG
mRNA sequence
CGATGACCCGCCCACATGCAGCCATTGCCCAACCTCGTCATCACGACCTCTCATTCCTCATCCTTTATCCGCCAAAATCCCCACTCTAGAGAGAGAAAAAAAGCACAGAGGAGAGAGAAAGCATTCGACGAATTCTATGCCATTTTCCCTCGTTTCAATCCAAAGCTCCACTGTACAATGATTCTGCACTTGAGTTCACCATGGCTCACTATCACTCGCCTCCCTCCTCCTCCAAAACTCATCGAACCACTCGCCTCTGCAAGCAATGGCACTAGCGTCCTCATGCCTCTCCTTCTATGCTCCAAGTCGACACGAGTTAGAGCTTCTTTAAATGCCAGCAACATCGATGGCGCCGCAGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTGGTGTTGTTTCGGGTGCTAAGGATGCCGATGAAGTGCTGCGGTTGATCGCCGATAAGTCAGGGAGAAGTGGAGGTACTGTGTCTGTTCCGGACTGTCGTTTGATTATTGCGGCTGCACTGAAGCGTAACAATTCGGAGCTTGCTCTGTCCGTGTTCTACGCAATGCGCTCCAGTTTCTATGAAGCATGGGAGGGTGTTAATGACAATGTTTCCTCTGTTGAGAGATGGAAATGGGCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCTCTTAGGATTATCGAGATTATTTGCCGAGTTGGTGTATCACCTGCTGAGGAGGTCCCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCGGTTGCACAACCCCAGCACGGTATTCAGTTAGTATTGAGTCAGAAGAAATTAGGATCTCCTCCTACCGAACGCAGCATGGATACTCCAGCATGGGAAAAAGCACTCCGGTTCTTGAATGTAATGAAGCAAAAACTCCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTAGCACGAACCCAGAAGTTTGCCACTGAAACAGCAGATCTTCCAGCACGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATACAGAGAAGTTGGTCCAATTAAATTTAGTCCAAAGGATCCAAATTTGTACTCTGGTGAGCCTATGTGCTTGACAAATCATTCAGATGGCCGAGAATCACTATTGCTAAGAGTGCCAGCAAAGGAAACCTCATTCTTACTTAAACCGTCAGCCCTCTTTCCACTCATACTTTTATCTGTCGCTGGAGATGCTGCCTCTGGAGTTCTTGACCCCAGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACTTTGAATTCATTTATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCCATCAAACAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAATAAAATTTTTGCCGTTGGAGAACCTTCTTACCGTGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCAATGATTGAGATAGAAGTTGAAATGGAGTCTGATGTTATAGCTGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAGATTGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGAGATGGAGATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTTTCAACCAATCAATGCCAACGGAAGAGGTTTAAACAGGCTTGTGCTCATGTCTTCAATCCAGGAGTATGGACACCTCCATCACGAGTCCAGATACTTCCCAAACCTTCTTGCTTGTGAAAATGTCAATCAATCTCTACTTGGTGCATGTACATGAAGTTTTGAAGCAGCATCAGGTCAGTGCTTCGTGGTCAGATGTCTCATTACTGAACATCTCATTTGTTAAATTGTATCATTAAGTTTCTTTGGAACTTTTCTCTCCTGAGTGCTGAAATGTTATTTATTTACCACGTAAATATAACACATTATTGCTTCTATTGGATGAATACCCGTCTTTG
Coding sequence (CDS)
ATGCAGCCATTGCCCAACCTCGTCATCACGACCTCTCATTCCTCATCCTTTATCCGCCAAAATCCCCACTCTAGAGAGAGAAAAAAAGCACAGAGGAGAGAGAAAGCATTCGACGAATTCTATGCCATTTTCCCTCGTTTCAATCCAAAGCTCCACTGTACAATGATTCTGCACTTGAGTTCACCATGGCTCACTATCACTCGCCTCCCTCCTCCTCCAAAACTCATCGAACCACTCGCCTCTGCAAGCAATGGCACTAGCGTCCTCATGCCTCTCCTTCTATGCTCCAAGTCGACACGAGTTAGAGCTTCTTTAAATGCCAGCAACATCGATGGCGCCGCAGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTGGTGTTGTTTCGGGTGCTAAGGATGCCGATGAAGTGCTGCGGTTGATCGCCGATAAGTCAGGGAGAAGTGGAGGTACTGTGTCTGTTCCGGACTGTCGTTTGATTATTGCGGCTGCACTGAAGCGTAACAATTCGGAGCTTGCTCTGTCCGTGTTCTACGCAATGCGCTCCAGTTTCTATGAAGCATGGGAGGGTGTTAATGACAATGTTTCCTCTGTTGAGAGATGGAAATGGGCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCTCTTAGGATTATCGAGATTATTTGCCGAGTTGGTGTATCACCTGCTGAGGAGGTCCCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCGGTTGCACAACCCCAGCACGGTATTCAGTTAGTATTGAGTCAGAAGAAATTAGGATCTCCTCCTACCGAACGCAGCATGGATACTCCAGCATGGGAAAAAGCACTCCGGTTCTTGAATGTAATGAAGCAAAAACTCCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTAGCACGAACCCAGAAGTTTGCCACTGAAACAGCAGATCTTCCAGCACGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATACAGAGAAGTTGGTCCAATTAAATTTAGTCCAAAGGATCCAAATTTGTACTCTGGTGAGCCTATGTGCTTGACAAATCATTCAGATGGCCGAGAATCACTATTGCTAAGAGTGCCAGCAAAGGAAACCTCATTCTTACTTAAACCGTCAGCCCTCTTTCCACTCATACTTTTATCTGTCGCTGGAGATGCTGCCTCTGGAGTTCTTGACCCCAGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACTTTGAATTCATTTATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCCATCAAACAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAATAAAATTTTTGCCGTTGGAGAACCTTCTTACCGTGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCAATGATTGAGATAGAAGTTGAAATGGAGTCTGATGTTATAGCTGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAGATTGAGCAAATCATGGTGCTGGAAAATCTAGAAGAGAGATGGAGATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTTTCAACCAATCAATGCCAACGGAAGAGGTTTAA
Protein sequence
MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLVLSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Homology
BLAST of Cp4.1LG14g02650 vs. NCBI nr
Match:
XP_023552095.1 (uncharacterized protein LOC111809863 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1127 bits (2915), Expect = 0.0
Identity = 600/623 (96.31%), Postives = 603/623 (96.79%), Query Frame = 0
Query: 1 MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS 60
MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS
Sbjct: 1 MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS 60
Query: 61 SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV 120
SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV
Sbjct: 61 SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV 120
Query: 121 SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV 180
SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV
Sbjct: 121 SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV 180
Query: 181 FYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEIICR 240
FYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEIICR
Sbjct: 181 FYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEIICR 240
Query: 241 VGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LSQKKLGSPPTER 300
VGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V +S + E
Sbjct: 241 VGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQYELISGNIVSIESEEI 300
Query: 301 SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIA 360
SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIA
Sbjct: 301 SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIA 360
Query: 361 AAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSALF 420
AAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSALF
Sbjct: 361 AAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSALF 420
Query: 421 PLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIK 480
PLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIK
Sbjct: 421 PLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIK 480
Query: 481 QQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGL 540
QQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGL
Sbjct: 481 QQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGL 540
Query: 541 ENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERW 600
ENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERW
Sbjct: 541 ENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERW 600
Query: 601 RLQAEANDEAERLFNQSMPTEEV 612
RLQAEANDEAERLFNQSMPTEEV
Sbjct: 601 RLQAEANDEAERLFNQSMPTEEV 623
BLAST of Cp4.1LG14g02650 vs. NCBI nr
Match:
XP_023552094.1 (uncharacterized protein LOC111809863 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1122 bits (2902), Expect = 0.0
Identity = 600/625 (96.00%), Postives = 603/625 (96.48%), Query Frame = 0
Query: 1 MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS 60
MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS
Sbjct: 1 MQPLPNLVITTSHSSSFIRQNPHSRERKKAQRREKAFDEFYAIFPRFNPKLHCTMILHLS 60
Query: 61 SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV 120
SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV
Sbjct: 61 SPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAAAFENPV 120
Query: 121 SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV 180
SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV
Sbjct: 121 SELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNSELALSV 180
Query: 181 FYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEII 240
FYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEII
Sbjct: 181 FYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDALRIIEII 240
Query: 241 CRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LSQKKLGSPPT 300
CRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V +S +
Sbjct: 241 CRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQYELISGNIVSIESE 300
Query: 301 ERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVT 360
E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVT
Sbjct: 301 EISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVT 360
Query: 361 IAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSA 420
IAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSA
Sbjct: 361 IAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSFLLKPSA 420
Query: 421 LFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIA 480
LFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIA
Sbjct: 421 LFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIA 480
Query: 481 IKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVRE 540
IKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVRE
Sbjct: 481 IKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVRE 540
Query: 541 GLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE 600
GLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
Sbjct: 541 GLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE 600
Query: 601 RWRLQAEANDEAERLFNQSMPTEEV 612
RWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 601 RWRLQAEANDEAERLFNQSMPTEEV 625
BLAST of Cp4.1LG14g02650 vs. NCBI nr
Match:
XP_022931519.1 (uncharacterized protein LOC111437671 isoform X3 [Cucurbita moschata])
HSP 1 Score: 1008 bits (2607), Expect = 0.0
Identity = 543/571 (95.10%), Postives = 548/571 (95.97%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA 60
Query: 115 AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNS 174
AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLIIAAALKRNNS
Sbjct: 61 AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLIIAAALKRNNS 120
Query: 175 ELALSVFYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 234
ELALSVFYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL
Sbjct: 121 ELALSVFYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 180
Query: 235 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LSQKK 294
RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V +S
Sbjct: 181 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQYELISGNI 240
Query: 295 LGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 354
+ E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR
Sbjct: 241 VNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 300
Query: 355 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF 414
EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF
Sbjct: 301 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF 360
Query: 415 LLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR 474
LLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR
Sbjct: 361 LLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR 420
Query: 475 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 534
SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR
Sbjct: 421 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 480
Query: 535 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 594
IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV
Sbjct: 481 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 540
Query: 595 LENLEERWRLQAEANDEAERLFNQSMPTEEV 612
LENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 LENLEERWRLQAEANDEAERLFNQSMPTEEV 571
BLAST of Cp4.1LG14g02650 vs. NCBI nr
Match:
XP_022931518.1 (uncharacterized protein LOC111437671 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1005 bits (2599), Expect = 0.0
Identity = 543/579 (93.78%), Postives = 548/579 (94.65%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS----------KSTRVRAS 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS KSTRVRAS
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSHALFRFTSFSKSTRVRAS 60
Query: 115 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLI 174
LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLI
Sbjct: 61 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLI 120
Query: 175 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 234
IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 180
Query: 235 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-------- 294
SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V
Sbjct: 181 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQ 240
Query: 295 ---LSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 354
+S + E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT
Sbjct: 241 YELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 300
Query: 355 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 414
ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR
Sbjct: 301 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 360
Query: 415 VPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILP 474
VPAKETSFLLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFILP
Sbjct: 361 VPAKETSFLLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFILP 420
Query: 475 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 534
QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP
Sbjct: 421 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 480
Query: 535 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 594
SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS
Sbjct: 481 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 540
Query: 595 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 612
EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 579
BLAST of Cp4.1LG14g02650 vs. NCBI nr
Match:
XP_022931517.1 (uncharacterized protein LOC111437671 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1000 bits (2586), Expect = 0.0
Identity = 543/581 (93.46%), Postives = 548/581 (94.32%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS----------KSTRVRAS 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS KSTRVRAS
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSHALFRFTSFSKSTRVRAS 60
Query: 115 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLI 174
LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLI
Sbjct: 61 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLI 120
Query: 175 IAAALKRNNSELALSVFYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGL 234
IAAALKRNNSELALSVFYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGL
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGL 180
Query: 235 AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV------ 294
AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V
Sbjct: 181 AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCR 240
Query: 295 -----LSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF 354
+S + E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF
Sbjct: 241 YQYELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF 300
Query: 355 ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL 414
ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL
Sbjct: 301 ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL 360
Query: 415 LRVPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFI 474
LRVPAKETSFLLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFI
Sbjct: 361 LRVPAKETSFLLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFI 420
Query: 475 LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG 534
LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG
Sbjct: 421 LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG 480
Query: 535 EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 594
EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
Sbjct: 481 EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 540
Query: 595 VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 612
VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 581
BLAST of Cp4.1LG14g02650 vs. ExPASy TrEMBL
Match:
A0A6J1EUG6 (uncharacterized protein LOC111437671 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111437671 PE=4 SV=1)
HSP 1 Score: 1008 bits (2607), Expect = 0.0
Identity = 543/571 (95.10%), Postives = 548/571 (95.97%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA 60
Query: 115 AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNS 174
AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLIIAAALKRNNS
Sbjct: 61 AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLIIAAALKRNNS 120
Query: 175 ELALSVFYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 234
ELALSVFYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL
Sbjct: 121 ELALSVFYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 180
Query: 235 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LSQKK 294
RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V +S
Sbjct: 181 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQYELISGNI 240
Query: 295 LGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 354
+ E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR
Sbjct: 241 VNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 300
Query: 355 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF 414
EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF
Sbjct: 301 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF 360
Query: 415 LLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR 474
LLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR
Sbjct: 361 LLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR 420
Query: 475 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 534
SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR
Sbjct: 421 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 480
Query: 535 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 594
IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV
Sbjct: 481 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 540
Query: 595 LENLEERWRLQAEANDEAERLFNQSMPTEEV 612
LENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 LENLEERWRLQAEANDEAERLFNQSMPTEEV 571
BLAST of Cp4.1LG14g02650 vs. ExPASy TrEMBL
Match:
A0A6J1EYW6 (uncharacterized protein LOC111437671 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437671 PE=4 SV=1)
HSP 1 Score: 1005 bits (2599), Expect = 0.0
Identity = 543/579 (93.78%), Postives = 548/579 (94.65%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS----------KSTRVRAS 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS KSTRVRAS
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSHALFRFTSFSKSTRVRAS 60
Query: 115 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLI 174
LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLI
Sbjct: 61 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLI 120
Query: 175 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 234
IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 180
Query: 235 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-------- 294
SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V
Sbjct: 181 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQ 240
Query: 295 ---LSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 354
+S + E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT
Sbjct: 241 YELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 300
Query: 355 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 414
ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR
Sbjct: 301 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 360
Query: 415 VPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILP 474
VPAKETSFLLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFILP
Sbjct: 361 VPAKETSFLLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFILP 420
Query: 475 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 534
QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP
Sbjct: 421 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 480
Query: 535 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 594
SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS
Sbjct: 481 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 540
Query: 595 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 612
EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 579
BLAST of Cp4.1LG14g02650 vs. ExPASy TrEMBL
Match:
A0A6J1ETW5 (uncharacterized protein LOC111437671 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437671 PE=4 SV=1)
HSP 1 Score: 1000 bits (2586), Expect = 0.0
Identity = 543/581 (93.46%), Postives = 548/581 (94.32%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS----------KSTRVRAS 114
MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS KSTRVRAS
Sbjct: 1 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSHALFRFTSFSKSTRVRAS 60
Query: 115 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLI 174
LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGR+GGTVSVPDCRLI
Sbjct: 61 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRNGGTVSVPDCRLI 120
Query: 175 IAAALKRNNSELALSVFYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGL 234
IAAALKRNNSELALSVFYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGL
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGL 180
Query: 235 AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV------ 294
AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V
Sbjct: 181 AASLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCR 240
Query: 295 -----LSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF 354
+S + E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF
Sbjct: 241 YQYELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKF 300
Query: 355 ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL 414
ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL
Sbjct: 301 ATETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL 360
Query: 415 LRVPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFI 474
LRVPAKETSFLLKPSALFPLILLSVAGD +SGVLDPSLPRLLLVAGFASLAAGATLNSFI
Sbjct: 361 LRVPAKETSFLLKPSALFPLILLSVAGDVSSGVLDPSLPRLLLVAGFASLAAGATLNSFI 420
Query: 475 LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG 534
LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG
Sbjct: 421 LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVG 480
Query: 535 EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 594
EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
Sbjct: 481 EPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER 540
Query: 595 VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 612
VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 VSEQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 581
BLAST of Cp4.1LG14g02650 vs. ExPASy TrEMBL
Match:
A0A6J1JB68 (uncharacterized protein LOC111483407 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483407 PE=4 SV=1)
HSP 1 Score: 988 bits (2555), Expect = 0.0
Identity = 535/571 (93.70%), Postives = 544/571 (95.27%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCSKSTRVRASLNASNIDGAA 114
MILHLSSPWLTITRLP P KLIEPLASASNGTSVLMPLLLCS STRVRASLN SNIDGAA
Sbjct: 1 MILHLSSPWLTITRLPHP-KLIEPLASASNGTSVLMPLLLCSHSTRVRASLNVSNIDGAA 60
Query: 115 AFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLIIAAALKRNNS 174
AFENPVS+LLDDELI VVSGAKDADEVLR+IA+KSGR+GGTVSVPDCRLIIAAALKRNNS
Sbjct: 61 AFENPVSDLLDDELICVVSGAKDADEVLRMIAEKSGRNGGTVSVPDCRLIIAAALKRNNS 120
Query: 175 ELALSVFYAMRSSFYE--AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 234
ELALSVFYAMRSSFYE AWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL
Sbjct: 121 ELALSVFYAMRSSFYEVTAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVSDAL 180
Query: 235 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LSQKK 294
RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V +S
Sbjct: 181 RIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQYELISGNI 240
Query: 295 LGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 354
+ E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR
Sbjct: 241 VNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADLPAR 300
Query: 355 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKETSF 414
EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL+RVPAKETSF
Sbjct: 301 EGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLIRVPAKETSF 360
Query: 415 LLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQR 474
LLKPSALFPLILLSVAGDAASGVLDPSLPR+LLVAGFASLAAGATLNSFILPQFNRLPQR
Sbjct: 361 LLKPSALFPLILLSVAGDAASGVLDPSLPRMLLVAGFASLAAGATLNSFILPQFNRLPQR 420
Query: 475 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 534
SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR
Sbjct: 421 SVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSR 480
Query: 535 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 594
IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV
Sbjct: 481 IKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMV 540
Query: 595 LENLEERWRLQAEANDEAERLFNQSMPTEEV 612
LENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 LENLEERWRLQAEANDEAERLFNQSMPTEEV 570
BLAST of Cp4.1LG14g02650 vs. ExPASy TrEMBL
Match:
A0A6J1JDG3 (uncharacterized protein LOC111483407 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483407 PE=4 SV=1)
HSP 1 Score: 986 bits (2549), Expect = 0.0
Identity = 535/579 (92.40%), Postives = 545/579 (94.13%), Query Frame = 0
Query: 55 MILHLSSPWLTITRLPPPPKLIEPLASASNGTSVLMPLLLCS----------KSTRVRAS 114
MILHLSSPWLTITRLP P KLIEPLASASNGTSVLMPLLLCS +STRVRAS
Sbjct: 1 MILHLSSPWLTITRLPHP-KLIEPLASASNGTSVLMPLLLCSHAFFRFTSFSQSTRVRAS 60
Query: 115 LNASNIDGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRSGGTVSVPDCRLI 174
LN SNIDGAAAFENPVS+LLDDELI VVSGAKDADEVLR+IA+KSGR+GGTVSVPDCRLI
Sbjct: 61 LNVSNIDGAAAFENPVSDLLDDELICVVSGAKDADEVLRMIAEKSGRNGGTVSVPDCRLI 120
Query: 175 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 234
IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA
Sbjct: 121 IAAALKRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAA 180
Query: 235 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-------- 294
SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ+V
Sbjct: 181 SLRVSDALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYQ 240
Query: 295 ---LSQKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 354
+S + E SMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT
Sbjct: 241 YELISGNIVNIESEEISMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFAT 300
Query: 355 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLR 414
ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL+R
Sbjct: 301 ETADLPAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLIR 360
Query: 415 VPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILP 474
VPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPR+LLVAGFASLAAGATLNSFILP
Sbjct: 361 VPAKETSFLLKPSALFPLILLSVAGDAASGVLDPSLPRMLLVAGFASLAAGATLNSFILP 420
Query: 475 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 534
QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP
Sbjct: 421 QFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEP 480
Query: 535 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 594
SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS
Sbjct: 481 SYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVS 540
Query: 595 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 612
EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV
Sbjct: 541 EQIEQIMVLENLEERWRLQAEANDEAERLFNQSMPTEEV 578
BLAST of Cp4.1LG14g02650 vs. TAIR 10
Match:
AT1G64430.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 607.8 bits (1566), Expect = 9.4e-174
Identity = 319/506 (63.04%), Postives = 394/506 (77.87%), Query Frame = 0
Query: 111 DGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRS-GGTVSVPDCRLIIAAAL 170
D + + S +LDDEL+ VS +DADE L +I+D+ G + GG V + DCR II+AA+
Sbjct: 58 DSVGSAADVSSSILDDELLSSVSAVRDADEALAMISDRFGSNRGGIVELEDCRSIISAAV 117
Query: 171 KRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVS 230
R N +LALS+FY MR+SF G +DN +RW W+RPDV VYT+L+ GLAASLRVS
Sbjct: 118 SRGNVDLALSIFYTMRASFD---LGGSDN----DRWSWSRPDVEVYTMLVNGLAASLRVS 177
Query: 231 DALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LS 290
D+LRII ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+Q+V S
Sbjct: 178 DSLRIIRDICRVGISPAEEVPFGKIVRCPSCLIAIAVAQPQHGVQIVSCANCRYQYELFS 237
Query: 291 QKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADL 350
E D P WEK LR + + K K+ ++VHSIVVQTPSG ART +FATETA+L
Sbjct: 238 GDITSIDSEELGKDIPLWEKGLRLIQIKKNKITSSVHSIVVQTPSGTARTHRFATETAEL 297
Query: 351 PAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKE 410
PA+EGERVTIA+AAPSNVYR+VGP KF K PN Y GEPM LT H DGRES+LLR P+K+
Sbjct: 298 PAQEGERVTIASAAPSNVYRQVGPFKFISKAPNFYPGEPMSLTKHKDGRESILLRPPSKD 357
Query: 411 TSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRL 470
+L+PS L PL+ + GDAASGV+DPSLP+LL VA SLA GAT+NSF+LP+ N+L
Sbjct: 358 GDKILQPSFLIPLLAILATGDAASGVIDPSLPQLLSVATVTSLAIGATVNSFVLPKLNQL 417
Query: 471 PQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRAR 530
P+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLARMCQLENKI AVGEP+YR R
Sbjct: 418 PERTVDVVGIKQQLLSQYDVLQRRIRDLKEAVEKEVWMLARMCQLENKILAVGEPAYRTR 477
Query: 531 RSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQ 590
R+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQ
Sbjct: 478 RTRVKKVRESLENSIKGKIDLIDSYARISSMIEIEVEMDSDVLAAEAVNNTENIAQQIEQ 537
Query: 591 IMVLENLEERWRLQAEANDEAERLFN 605
IM LENLEE+W++QAEANDEAERL +
Sbjct: 538 IMELENLEEKWKIQAEANDEAERLLS 556
BLAST of Cp4.1LG14g02650 vs. TAIR 10
Match:
AT1G64430.2 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 607.8 bits (1566), Expect = 9.4e-174
Identity = 319/506 (63.04%), Postives = 394/506 (77.87%), Query Frame = 0
Query: 111 DGAAAFENPVSELLDDELIGVVSGAKDADEVLRLIADKSGRS-GGTVSVPDCRLIIAAAL 170
D + + S +LDDEL+ VS +DADE L +I+D+ G + GG V + DCR II+AA+
Sbjct: 58 DSVGSAADVSSSILDDELLSSVSAVRDADEALAMISDRFGSNRGGIVELEDCRSIISAAV 117
Query: 171 KRNNSELALSVFYAMRSSFYEAWEGVNDNVSSVERWKWARPDVHVYTLLIQGLAASLRVS 230
R N +LALS+FY MR+SF G +DN +RW W+RPDV VYT+L+ GLAASLRVS
Sbjct: 118 SRGNVDLALSIFYTMRASFD---LGGSDN----DRWSWSRPDVEVYTMLVNGLAASLRVS 177
Query: 231 DALRIIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQLV-----------LS 290
D+LRII ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+Q+V S
Sbjct: 178 DSLRIIRDICRVGISPAEEVPFGKIVRCPSCLIAIAVAQPQHGVQIVSCANCRYQYELFS 237
Query: 291 QKKLGSPPTERSMDTPAWEKALRFLNVMKQKLPAAVHSIVVQTPSGVARTQKFATETADL 350
E D P WEK LR + + K K+ ++VHSIVVQTPSG ART +FATETA+L
Sbjct: 238 GDITSIDSEELGKDIPLWEKGLRLIQIKKNKITSSVHSIVVQTPSGTARTHRFATETAEL 297
Query: 351 PAREGERVTIAAAAPSNVYREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKE 410
PA+EGERVTIA+AAPSNVYR+VGP KF K PN Y GEPM LT H DGRES+LLR P+K+
Sbjct: 298 PAQEGERVTIASAAPSNVYRQVGPFKFISKAPNFYPGEPMSLTKHKDGRESILLRPPSKD 357
Query: 411 TSFLLKPSALFPLILLSVAGDAASGVLDPSLPRLLLVAGFASLAAGATLNSFILPQFNRL 470
+L+PS L PL+ + GDAASGV+DPSLP+LL VA SLA GAT+NSF+LP+ N+L
Sbjct: 358 GDKILQPSFLIPLLAILATGDAASGVIDPSLPQLLSVATVTSLAIGATVNSFVLPKLNQL 417
Query: 471 PQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRAR 530
P+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLARMCQLENKI AVGEP+YR R
Sbjct: 418 PERTVDVVGIKQQLLSQYDVLQRRIRDLKEAVEKEVWMLARMCQLENKILAVGEPAYRTR 477
Query: 531 RSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQ 590
R+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQ
Sbjct: 478 RTRVKKVRESLENSIKGKIDLIDSYARISSMIEIEVEMDSDVLAAEAVNNTENIAQQIEQ 537
Query: 591 IMVLENLEERWRLQAEANDEAERLFN 605
IM LENLEE+W++QAEANDEAERL +
Sbjct: 538 IMELENLEEKWKIQAEANDEAERLLS 556
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023552095.1 | 0.0 | 96.31 | uncharacterized protein LOC111809863 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_023552094.1 | 0.0 | 96.00 | uncharacterized protein LOC111809863 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022931519.1 | 0.0 | 95.10 | uncharacterized protein LOC111437671 isoform X3 [Cucurbita moschata] | [more] |
XP_022931518.1 | 0.0 | 93.78 | uncharacterized protein LOC111437671 isoform X2 [Cucurbita moschata] | [more] |
XP_022931517.1 | 0.0 | 93.46 | uncharacterized protein LOC111437671 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EUG6 | 0.0 | 95.10 | uncharacterized protein LOC111437671 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EYW6 | 0.0 | 93.78 | uncharacterized protein LOC111437671 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1ETW5 | 0.0 | 93.46 | uncharacterized protein LOC111437671 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JB68 | 0.0 | 93.70 | uncharacterized protein LOC111483407 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1JDG3 | 0.0 | 92.40 | uncharacterized protein LOC111483407 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT1G64430.1 | 9.4e-174 | 63.04 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G64430.2 | 9.4e-174 | 63.04 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |