CmoCh15G006560 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh15G006560
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Photosystem II CP47 reaction center protein
Location: Cmo_Chr15: 3172600 .. 3181912 (+)
RNA-Seq Expression: CmoCh15G006560
Synteny: CmoCh15G006560
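The Location field gives a sequence name, a start, an end, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page), the genomic span of the feature can be computed as shown here. The helper names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Cmo_Chr15: 3172600 .. 3181912 (+)'.

    Returns (seqid, start, end, strand). The layout is taken from the
    Location field on this page; other pages may differ.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr15: 3172600 .. 3181912 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 9313 bp on the plus strand of Cmo_Chr15.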
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTGTCATTAGAATTGGAGCAATCAAAAGCTCAGATTCAATTATTAGAGGGTCATATCAATAATTTGGATCCTGCCGTTAAGACCTTGAAAGAGGTCATATTCATGAAACCAGATTATGTTGATGCTCACTGTGATTTGGCATCCGCTTTGCATGCAATGAGGGAAGATGAGCGAGAAATTGAGGTGTTTCAGAAGGCAATCGACTTGAAGCTTGGCCATGTAGATGCTTTGTATATTTTGGGTGGACTTTACGTGAACTCAAATGGTGAAGGAGCTTTTATAGTTGTTGAAGCCTCAAAGTTTAAGACCATTGGAGAGAAGACTGTTTTGAGACCGGAACTGTCAAATGCTCTCAAAATTAGATCTTTTCAGAAGATTACCCGATTGAATCGCTGTGATGTGAAATTTATTAAGAAGGAAATCATTCAACATGATCTGTCAATGTCTTATTCAGGTGGTGGCATGTCTAAGAAGTCCATAAGCAAGCCTAGCTGGAAGAAATTCTTCGTAGATTATTTTGTTCGAGAGAAAGTGTTAACGCAGGATGTTGAATTGCAGGAAATACATACTAATGAGCAAGTTGCAGACATATTTACTAAAGCACTTGCCAAAGTGAAGTTTGAAGTTTTTCTTCGGAGTTATTGAGAAATAAGCTTGCACTAAGGGAGGCTGTCACAATTTAGTGCAACTTATTTTAGTTTGAATTAGTTATGGAATATATGATATTCGTAATCTTCCAGAATATATTAGTTAGGAAATATATCTTAGATATCGTAATTAGTTGAATTTGAATAGTAGTATATTTTATTTTATTATCAGGATTTAGTTAGTTTATATTTTCTTGGTAGATATTATTTAGTATCTTGTATCCTATTTAAAGGTCGTGAATATCAATCAAGATAGAACATTTTTCGATCCCAATTCTTTTCCTCATTCTTAAATCTGTATTCTTTCGTTGAGTAATAATATTTTCATATCTATGCACACTTTTGAGTGGTAGATATTGCGTCAGGCCGGTTTTGATCCGGCAATTATGCTCGCCGGAAACTTTTCAGAACCCACCCCTGGTATGTACTCTGCTGTCGGAATTGCCATCGGAATTTTGCCGGTGATTGGTCGGGGTTTCAGAGTCCGAGGAACACGATTGAAGATTTGTTCATGTTGGTTTCTTATATATTATTTCTTACTTCCTGGAAGCTTTTATGTAGTTTCCTTCCCGGCTTCTTTTAATTATTATTATTATTATTATCATCATTATTTGACTTTTCTTAATAAAAAAAAACTTTTTCTGTAACTTCCTCGGACATGCGTGTGCCTTCTCCGTTTAGTCTGTGTAGGAAATTCAGTGTGTTTTTATTTTAAAAATATATATTTATATGGTTAATTAGGTGATTCTTAATCCAAAAAATATTCTTATCGTTGGTATATGGACCGAACTACATAATTAAAAATTTTCAATAATTCATTTTATAATTGATATATAGACATATATCATTTATTTATAAATAAATAGTCAATTTCTTTACATTTAAGGGGAAAAGAACCCGACTTGATTTTTCAAAAATTTTAAGTATTAATAAAACTCTCTGAAACTATTTCCATTCAAATTTTATTAATACAAAGATTAAAATTAAGCACCTTAAAATATAGACTATAAATATTCAATGGCAAAGATATGCAGCTGTGGGCCAATCACAGCCGTCAATCTGTAGTTCCGTCTTAAATTTTTTTTCTTTAAAACTTTAATGAAAAATAAAAGCTGAGCTGGAAATATGTGAGAAACAACGCTGAGCTGTACGGGTGTCAGGGTTGGTGAGTCACTCAGCAGTCACACGCTCACTGTGTTCCGTTTGCTTGGACATGAAAAATTATTTAAATTTCCACTATGCCACATGTCACCTTTCTATTGGGCTCGCCTAATATGTGGAAGTTGGTATAAGTCAGCTATAAAAAAT
TGGTTTAGGTAGGGTTTAATTTTTAAAAAATAAAAAATTAAATATAATTGATAAGTTGACTTTTAAAAAGTCAATCCAATTAGCTCGAAAATAGGGTTACAATTCAATTGTACATTTAAAATTATTAGAAAATTTAATAACATTTGATATTTGTAATTGATATTAATGATACCGTGCAACTTATAAATGTTTAATATTTGAGTTTAATGAGATTCTAGTTATGAATAAGTATAATTAAAAAATAAAGATGAGATGGGTTGGACTGTGAACTATATTTGAATTATTTGAGTTATCAGCTCCATTAATTTGAAATTTTGGGTGGGTTGAAAATATTTTCCAAATTCAAGAGCAAGAATATATTTTTACCCTTTAAATAGCTTTCAGGCGCTAAAATTTGGTCTATCGAAAATAATAATAATAATAATAAAGAAAAATAACTATTGTTAATTAGTTTAAAATAACTTGCATTATTGCATAATTAATTCATAGGTAATTGAATTAGAGACAATTTAAGGTTAATTAGTATTAACTACTTTGGTAATTTTCATTATTATTTAAAGTTGAGAATGTTAGAAATAATAATAAACCAAACCGGGTTTTTTTTAATACGTATTTGATTTATTAAATAAAATTCAAAAGAAAACCGGCCATCGGTGCCGGTGGCTAGTTTTTCTCCGGCAGAGCTGTCGGATACTCTGTTTTCAATCAGTTTGAACAATCCCCATATAAACCCCAATCTCCGGCAAATAACAGTCATAGTTTCGCACACACCACGGAACAAGAACAAATCACAGCTTTATTTAATTATATATATGTAATTTGTCAAAAACAAGCCAAAACATAAAATGCTCTGTTTTTCCGACATGAAACAGAGCAAAACCCACCCACCAAAACAAAACGGAAATCAAAATCTCAATACAGTAATGAGGCTGCCATTATTACGTGTCATCGGCGTTGGGTTCCCGGCGGCGAGCGACGGAGGAGGGATTTAAGAAAGTCCCTAATAGAGTAGTACATCATCGTCATCACCAACTTTCTTCTTGCCGGAATTACAGTGGCAACCTTCCGCGGCTTCAAACTTGAGACGGCGGTGGCCGGAAATTTGATCGGCTTGTTACGTGGGGCAGTAGTCGGAGCCATCGGAATCTGGCTGAGTTTCGTTGAGGAAATGAGCTGTAAGGAAGAAAGGTTTGTGCTTATAAAGGTTAGGAGCAAGGTTTCCGGTAAGCTTCGGCAATGGAAGTCCTTTATTTTGACAACTTGTAAAGTCAAATTCGGTACTTCCATGGCAAAGGTATTATTGACAACCTTGTAAAGTCAAAATCTAAACTTCCCCTGAGCTTCCAGAGTTATTATGCAAAAACATATTTCATTTAACATTCACAATTTTCCACAACTCATCCTCCTGATCATAGATAGATCGCCCACGTTCGAATCCATAAGCAGTGACAATTGCCCCATGAGAACTCGCTTTCGCTACGGCTCCGGTGGGTTCTCTTAACCAAGCCACTTGCCTACGAGTTGCCGGCTCATTCTTCAACAAGCACGCGGTCAGAGTCTTGGTCTCCTCCCACTGCTTGGAAGCTTACGGTTTCATGTTCTATTTCACTTCCCGATTGGGGTTTCTTCACCCTTCCCTCACAGTACTGCTTCACTATCGGTCACCCATGAGTATTGGCCTTATAAGGTGGTCCTTGCTGATTCACAAGGGATTCTACGTGCCCATGCTACTTGGGTCAGAGTTGATCCTTTTGTTCTGGGCGGAATAGCCTCTCATCATATTGCAGCAGGTACATTGGGCATATTAGCGGGCCTATTCCATCTTAGTGTTCGTCCGCCCCAACGTCTATACAAAGGATTACGTATGGGAAATATTGAAACCGTCCTTTCGAGTAGTATCGCTGCTGTCTTTTTTGCAGCTTTTGTTGTTGCTGGAACTATGTGGTATGGTTCAGCAACTACCCCGATTGAATTATTTGGTCCCACTCGTTATCAATGGG
ATCAGGAATACTTCCAGCTAGAAATATATCGAAGGGTTAGTGCCCGGCTAGCCGAAAATCAAAGTTTATCAGAAGCTTGGTCTAAAATTCCTGAAAAATTAGCTTTTTATGATTACATCGGGAATAATCCCGCAAAAGGTGGATTATTCAGAGCGGGTTCCATGGACAACGGGGATGGAATAGCTGTTGGGTGGTTAGGACACNAATCCCGCAAAAGGTGGATTATTCAGAGCGGGTTCCATGGACAACGGGGATGGAATAGCTGTTGGGTTTCTTAAGAATGTTGGCTTAATCCACCACACTTAAACCGGGTGAAAGTTATTTTATTTATAATTAAATAAATTATAAACTGAAATAACAATAAAAAATATAAGAATATAATGTATAATTTAATTACGTAATTTTTAATCCAAAAATAGTACAGGGACGCGTAATTTTATTCCCAAAAAAAAAAAAAAAATAAATAAATAAATATATATATTCCTTATGGGTTTTTAAATTTCTTAACATTTTCAGAGAAACATGTAAAATAATAAAATTTAAAAACTAAACAAAAATTAAAGTTATTCAATTTTGAAGATAAAAATAGGAATATAATGTATACATCTTGTGAGCTGGAGATTGACGGCTGTGATTGGTCAGCTACAATCTTTGACCGAATTGTGGCCATTGATTAGAATTATGCTTTTTAATGGAGTTTTCATACTGAATGATGGTGGAAACTAATTACTTATGTTCTAATGGTAAGAATTTGGAGGTGACCTATGGGCATGGAGAATCCGGTTTAAATCGAATATAGTCGAGTCATTTTCGTCGTTACTAACCTTCTAACTCAGACTATCCAATCTAAACTATAAAAATTAGGTCGAGTTGAGTCGGGTTAGATTTTTGTAAAATAAAAAATTTAATATAATTTATAAATGACATTTTTTTTAGTTAGGTCAACTCAATCAACTTGTAATAAGATTACAATTCGATCTTACGTAAGTTTTTATGTAAATATTAAGATTCTGAATATATTTTTTTTATTAAACTAATTAATTGATCATAGTTTACAAAATTATTATAACTGATTATTTAATTATAATACATCTTTTTAAATATATATTTAATGTATTATGAGAACATCTTAGAACAAAAAAAATTATTAAGAGAATGAATTAAATAATATAATAATCTTATTGAAAATAATTTATCACATGAAATTTTGATAACTTATATGAATTAGAGACATCTTGTAATTTTTGTATTTTTTTTATAATAAATTCAAAATTATGAATTTTAGAGTTTATTGTAAATATTATTATTTATATTTAAGGGTAATATTTAATTTCAAATGACATTGAAACTTTAATATTATTTTATAATATTAAAATTTGATTTTTTAGAAATTATGTTAATACATTATATATTATATATATATATATATATTTATGTATTTGTTTGTAAACATTATATTAAAGTTTTAAAAATTAATCCGTACAAAATTAGAGGAGGCTTCTCTTGAAACCAATAGAATGGGCATGTGGGTAAATAGCCCATGAATTTAGATCCGACCCGGAATATTAGTCCGGATCGGGCTCTGAAATTGACCGAGAACCTGCCCGTTTCGTTCCGAATACGAAAAAGCTCTGTATACTTTACCCTGTTCGATCGAGGGTTCGAGGAGACGATGAAGCTCACATGGAAGAACAAGGGCAATAGTAAGAAGCGGTCTTTCACGGCAATTTCAAGCCGTTCAAATCTTCCTTTTGAGGTTCCTGGTGAGGGGGAAGAAGATGGTAGGGTTAATCGCCAAGATAATGAAATAGACCGGAGAGAGGATGGAGCTTCAGAAAGTTCGTCTTCAGACCTCAAGCGTCTCGCGGAATCATTTCGGGATCAAGGCAATACGCTTGCCGAGGTGCGTGTGTTTTCCTTCAACAAATTTGAGCTGAATTTGTTATGGCATTTCCGTTGGAAATTTGTTTAGCTACTACTTCCTGTTCTCAATTGTACTT
AAGAACAGTATCTAATTTTGAATTTTAAATTTGAAATGTTTTATCGGAGAGAATGATAACTTCCTGAGTTGAATTGAAGAGAAAATAATTGCGCTGCAAAAACCTTGCTGGTGTGTGGTCACCGAGTTTTCATTTATGAATCACACTGGTAATATTCGAGAAAACATCTCCCAATCGTATGCGTTGTTTCTTGCAATGCAACAACAGAAAATCTTCACGAACAATTTGTTCGGAACATCATCAGCCCAAAGCTATAGGATACTGTTGAAACAGGGAGGCGGCTATCTAGGGAAGAGAAATTTGTGTATCACTTTGAAAACAATGACTAATTTTGGATCTTGCATTGGTTTGAATTGAAATTTCTTGCATACTTGACATCTTGATCCTCTATCTCGGGAGAAGCCTTTTGATTTGCTTATTACTAACACACTGAAACAATAAAACAAACCTAATAGCAAATGAACCCTTGGTTTACGGTGCAAATTATGATTCAAGTATATGGAACCACACCTCTAAAGGACCATTTGCAGAGGATTTGTTTTGGGAAGAAACCGTTTGCAGCTGTTAAATGAGTAAATTATGGTGTTTATTTTATGTTTGCGGTAACTGGTACTTTTATTTGCCTTCCGTGATTTGAATGGTTTACAATATATGAGATATTGATACAGATGTAAATCATTTATCAAAGAATGAAAAAAGGCACAAATATGGAACATTAGTGACGCACTTTCCTCTAAAAATTAGGTAATGGGCGAGGGAAACACACGTCTTTGGAAAGCTTACTTATTTCCATTTCCATTTCCGTATGCATTTGTATTTGTGTTCATAGTTTAACACATGCAATGATGCAATGGTTTTTATTCAAACGAATTTTCACTTTAAGTAATGGTATTTGCACTTGAACTATTATAAGGAAGAAGCCATTTGTATATTTGAATGCGTTGTATTATACCAGAGGTTTATTAGGAAATCTTTTGTCGAACAAGAATGTTTCCAATTGAACACGTAGCTAGATGATTCAGATCTTAGTGACTATTTGGCATTGAAATATTACTATTTGCTTCGGATCATACCGACTGTTTGGTTCATTTCTATACAAATATGGGCTTGCTCACAAACCTTTGGGTTGGTTTTTACTAGGAGAATCGACTTAAAAATGAAGCCTGTAGGTTATCCTTCTTTTATGTTGCAGTTATAGAAATTGTCCTCATGCATTGACATTTGGATGTTAAATATTTTGATTATACTCCAAGGCCATGTGCTTTTTCTTTTTTCAGAATGGGAAATTACGGGAAGCACTGGGAAAATGGGAGACTGCTCTTACCTTGATGCCTGAAAATGCAGTTCTACATGAACAAAAGTCACAAGTTCTACTGGAGATTGGGGAAGCTTGGGGTGCTTTGAAGGCAGCAACCAGTATGTTGCTTTCAATTACCTTTTACTGTTTCGATTGTCTAAGATTTTTCATATTGGTCGGTAAGCTCTATATTGAAATAACATAGGTGATCCGTGATCGGTCTATAGGGTATATGCATGGTGATATGAACTGGGTTTATTGATTAATGAAGTACAATGTCAAGGGAAGAATTCTAGAGCTTCCCCTCTCTCCCACAAGTTACCTCATAAAATGATCAGATTAGGATACCTCATAAAATGGCATTTTGTTATGATCTGGGTGATAAGCCATACCTTTTGTTTTTCTTTTTTGTTGTTGTAAGGAACTTGCTTTTCCACATTTTTCTTGTTTTATTTATTTTATACGATGGCCTATATTTGAGTATTTCATTTAATTCTATTATTAAAATAACATCAAATCTTCATGCCCTTGAAGTTGAAATATCTATTTCTGAATTCTACATTGCAGGAGCTACTGACTTAGATCCGTCATGGGCTGAGGTACAACTAACTCTTAGACTGTCAGGACAATATTAATATATCCTGCTACACCTTTTACTCTATCTTTGAAAACATCGCGCCTTCTTTTTCGTCTTCGATGGGTT
TTGTCTTGTGGGATTCTATCCATTTAATTGCTTATTTTCTGTTGGGAACTTGAGTCTGGTATTCATTGGCTCCATATAGTTGGATAGAACCATTTAGATCTATTTCTCTCGTAGACCAATGGCACATTGCTCTATTATGGGAAGTTTACACACTTATAGATTCAATGTTTTAATGCCTTTCTTGGATCTCCCATCCAAGATCAGCTTCTTTTGCATTGGGTGTCTTTAGTAAGCAGTAAACAATTTTCTAGCCGACGACCTTTCCTTCTTGAGCTGATTTGCCGAGAACTTTAAACATTTAAAATTTCCTCATTCATCGTGCTGTTGACCTATGTTCAGTAAATGTGAAGAAGCTAGACTCATTTGTTAGTTTCTCAACTGATTTCTTGTCTCTTTTGATTTTGAATACAATCAGGCGTGGATCACTCTTGGTAGAGCACAGTTAAACTTTGGGGAACCTGATAGTGCTATAGAGAGCTTTGACAGAGCATTGGCCATTAAGGTACAGTGATTTTGCCTGGTTCATCATTTTGTATTTATACTAGTATTTACACTGCATTCCTTGTCCTGACACGATCGATCTGTCTAGTATACTTGCACGGTCAGTTAAAATCTAGCCTTCATAATAGTTTCCCAAAGAAACTGTATAAATGGTGACAGTTTCGATTGTTAATATACCAACCATTTTGGATGCTAGGCTCAGTCAAAAGCTCTTGCATCATTTATCGTCCAACCAGCGCCCTCTTCGGCCTGACTGTTCATTACTGAATTTAAAGTTGAGAAACACGATTTATTCCCAAAAGCCCACTTACGGTCAAAGTCCATTAAAGCTTTGCTCTATAAGACATTGATTTGTTTTCTCGATCCCACATATTTTATGGTGAAGTAAAAATTTCTGTTGTGCACAGCCTGATTCTGGGGATGCCCAAGACGATCGCAAAACTGCAACGCATCTAATAAAGCGGCGAAAACAGCTCCATTCAGCTGGGTTGAGCAGTTCTGAAAATCGTTATTTTGTGGGAGATAAGTTGAACAACGTTGACGCTTTATGAAAGTGTAATCTGAAAAGGAATATGATGAAGTTTGTAATTTAGGTTCTTAATGCATAGCTAGAGTTTCCCCCATGGCCAAAAGCAGCGTGTACCTCCTAAATTCATGAAATCACAGATAGTTTACAAGTAGTTACAAAATACTATGTGAATGTATAGTTTTCTTTCAAAATGTCCGTTTTAATAATTGTGTGTATATATATTCAACGACAAGTGAAGCATCGACTTTTAATATGTTATGGTTATTTTATAATTTTTGTTGGA

mRNA sequence

ATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTGTCATTAGAATTGGAGCAATCAAAAGCTCAGATTCAATTATTAGAGGGTCATATCAATAATTTGGATCCTGCCGTTAAGACCTTGAAAGAGGTCATATTCATGAAACCAGATTATGTTGATGCTCACTGTGATTTGGCATCCGCTTTGCATGCAATGAGGGAAGATGAGCGAGAAATTGAGGTGTTTCAGAAGGCAATCGACTTGAAGCTTGGCCATGTAGATGCTTTGTATATTTTGGGTGGACTTTACGTGAACTCAAATGGTGAAGGAGCTTTTATAGTTGTTGAAGCCTCAAAGTTTAAGACCATTGGAGAGAAGACTGTTTTGAGACCGGAACTGTCAAATGCTCTCAAAATTAGATCTTTTCAGAAGATTACCCGATTGAATCGCTTGGCAACCTTCCGCGGCTTCAAACTTGAGACGGCGGTGGCCGGAAATTTGATCGGCTTGTTACGTGGGGCAGTAGTCGGAGCCATCGGAATCTGGCTGAGTTTCGTTGAGGAAATGAGCTGTAAGGAAGAAAGTACTGCTTCACTATCGGTCACCCATGAGTATTGGCCTTATAAGGTGGTCCTTGCTGATTCACAAGGGATTCTACGTGCCCATGCTACTTGGGTCAGAGTTGATCCTTTTGTTCTGGGCGGAATAGCCTCTCATCATATTGCAGCAGGTACATTGGGCATATTAGCGGGCCTATTCCATCTTAGTGTTCGTCCGCCCCAACGTCTATACAAAGGATTACGTATGGGAAATATTGAAACCGTCCTTTCGAGTAGTATCGCTGCTGTCTTTTTTGCAGCTTTTGTTGTTGCTGGAACTATGTGGTATGGTTCAGCAACTACCCCGATTGAATTATTTGGTCCCACTCGTTATCAATGGGATCAGGAATACTTCCAGCTAGAAATATATCGAAGGGTTAGTGCCCGGCTAGCCGAAAATCAAAGTTTATCAGAAGCTTGGTCTAAAATTCCTGAAAAATTAGCTTTTTATGATTACATCGGGAATAATCCCGCAAAAGGTGGATTATTCAGAGCGGGTTCCATGGACAACGGGGATGGAATAGCTGTTGGGTGGTTAGGACACNAATCCCGCAAAAGTCCGGATCGGGCTCTGAAATTGACCGAGAACCTGCCCGTTTCGTTCCGAATACGAAAAAGCTCTGTATACTTTACCCTGTTCGATCGAGGGTTCGAGGAGACGATGAAGCTCACATGGAAGAACAAGGGCAATAGTAAGAAGCGGTCTTTCACGGCAATTTCAAGCCGTTCAAATCTTCCTTTTGAGGTTCCTGGTGAGGGGGAAGAAGATGGTAGGGTTAATCGCCAAGATAATGAAATAGACCGGAGAGAGGATGGAGCTTCAGAAAGTTCGTCTTCAGACCTCAAGCGTCTCGCGGAATCATTTCGGGATCAAGGCAATACGCTTGCCGAGAATGGGAAATTACGGGAAGCACTGGGAAAATGGGAGACTGCTCTTACCTTGATGCCTGAAAATGCAGTTCTACATGAACAAAAGTCACAAGTTCTACTGGAGATTGGGGAAGCTTGGGGTGCTTTGAAGGCAGCAACCAGAGCTACTGACTTAGATCCGTCATGGGCTGAGGCGTGGATCACTCTTGGTAGAGCACAGTTAAACTTTGGGGAACCTGATAGTGCTATAGAGAGCTTTGACAGAGCATTGGCCATTAAGCCTGATTCTGGGGATGCCCAAGACGATCGCAAAACTGCAACGCATCTAATAAAGCGGCGAAAACAGCTCCATTCAGCTGGGTTGAGCAGTTCTGAAAATCGTTATTTTGTGGGAGATAAGTTGAACAACGTTGACGCTTTATGAAAGTGTAATCTGAAAAGGAATATGATGAAGTTTGTAATTTAGGTTCTTAATGCATAGCTAGAGTTTCCCCCATGGCCAAAAGCAGCGTGTACCTCCTAAATTCATGAAATCACAGATAG
TTTACAAGTAGTTACAAAATACTATGTGAATGTATAGTTTTCTTTCAAAATGTCCGTTTTAATAATTGTGTGTATATATATTCAACGACAAGTGAAGCATCGACTTTTAATATGTTATGGTTATTTTATAATTTTTGTTGGA

Coding sequence (CDS)

ATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTGTCATTAGAATTGGAGCAATCAAAAGCTCAGATTCAATTATTAGAGGGTCATATCAATAATTTGGATCCTGCCGTTAAGACCTTGAAAGAGGTCATATTCATGAAACCAGATTATGTTGATGCTCACTGTGATTTGGCATCCGCTTTGCATGCAATGAGGGAAGATGAGCGAGAAATTGAGGTGTTTCAGAAGGCAATCGACTTGAAGCTTGGCCATGTAGATGCTTTGTATATTTTGGGTGGACTTTACGTGAACTCAAATGGTGAAGGAGCTTTTATAGTTGTTGAAGCCTCAAAGTTTAAGACCATTGGAGAGAAGACTGTTTTGAGACCGGAACTGTCAAATGCTCTCAAAATTAGATCTTTTCAGAAGATTACCCGATTGAATCGCTTGGCAACCTTCCGCGGCTTCAAACTTGAGACGGCGGTGGCCGGAAATTTGATCGGCTTGTTACGTGGGGCAGTAGTCGGAGCCATCGGAATCTGGCTGAGTTTCGTTGAGGAAATGAGCTGTAAGGAAGAAAGTACTGCTTCACTATCGGTCACCCATGAGTATTGGCCTTATAAGGTGGTCCTTGCTGATTCACAAGGGATTCTACGTGCCCATGCTACTTGGGTCAGAGTTGATCCTTTTGTTCTGGGCGGAATAGCCTCTCATCATATTGCAGCAGGTACATTGGGCATATTAGCGGGCCTATTCCATCTTAGTGTTCGTCCGCCCCAACGTCTATACAAAGGATTACGTATGGGAAATATTGAAACCGTCCTTTCGAGTAGTATCGCTGCTGTCTTTTTTGCAGCTTTTGTTGTTGCTGGAACTATGTGGTATGGTTCAGCAACTACCCCGATTGAATTATTTGGTCCCACTCGTTATCAATGGGATCAGGAATACTTCCAGCTAGAAATATATCGAAGGGTTAGTGCCCGGCTAGCCGAAAATCAAAGTTTATCAGAAGCTTGGTCTAAAATTCCTGAAAAATTAGCTTTTTATGATTACATCGGGAATAATCCCGCAAAAGGTGGATTATTCAGAGCGGGTTCCATGGACAACGGGGATGGAATAGCTGTTGGGTGGTTAGGACACNAATCCCGCAAAAGTCCGGATCGGGCTCTGAAATTGACCGAGAACCTGCCCGTTTCGTTCCGAATACGAAAAAGCTCTGTATACTTTACCCTGTTCGATCGAGGGTTCGAGGAGACGATGAAGCTCACATGGAAGAACAAGGGCAATAGTAAGAAGCGGTCTTTCACGGCAATTTCAAGCCGTTCAAATCTTCCTTTTGAGGTTCCTGGTGAGGGGGAAGAAGATGGTAGGGTTAATCGCCAAGATAATGAAATAGACCGGAGAGAGGATGGAGCTTCAGAAAGTTCGTCTTCAGACCTCAAGCGTCTCGCGGAATCATTTCGGGATCAAGGCAATACGCTTGCCGAGAATGGGAAATTACGGGAAGCACTGGGAAAATGGGAGACTGCTCTTACCTTGATGCCTGAAAATGCAGTTCTACATGAACAAAAGTCACAAGTTCTACTGGAGATTGGGGAAGCTTGGGGTGCTTTGAAGGCAGCAACCAGAGCTACTGACTTAGATCCGTCATGGGCTGAGGCGTGGATCACTCTTGGTAGAGCACAGTTAAACTTTGGGGAACCTGATAGTGCTATAGAGAGCTTTGACAGAGCATTGGCCATTAAGCCTGATTCTGGGGATGCCCAAGACGATCGCAAAACTGCAACGCATCTAATAAAGCGGCGAAAACAGCTCCATTCAGCTGGGTTGAGCAGTTCTGAAAATCGTTATTTTGTGGGAGATAAGTTGAACAACGTTGACGCTTTATGA

Protein sequence

MEYAESAQVEMNYYSLSLELEQSKAQIQLLEGHINNLDPAVKTLKEVIFMKPDYVDAHCDLASALHAMREDEREIEVFQKAIDLKLGHVDALYILGGLYVNSNGEGAFIVVEASKFKTIGEKTVLRPELSNALKIRSFQKITRLNRLATFRGFKLETAVAGNLIGLLRGAVVGAIGIWLSFVEEMSCKEESTASLSVTHEYWPYKVVLADSQGILRAHATWVRVDPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAFVVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLAFYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSFRIRKSSVYFTLFDRGFEETMKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDAL
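The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code and assuming that any codon containing an ambiguous base (such as the `N` in this CDS) is reported as `X`. The function name `translate` is illustrative only; it is not the tool used to generate this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First ten codons of the CDS listed above.
print(translate("ATGGAGTATGCTGAGTCTGCACAAGTTGAG"))  # MEYAESAQVE
# Fragment of the CDS containing the ambiguous base N.
print(translate("GGACACNAATCC"))                    # GHXS
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("GACGCTTTATGA"))                    # DAL*
```

The three fragments reproduce the start (`MEYAESAQVE`), the `GHXS` stretch, and the end (`DAL`) of the protein sequence shown above, with `*` marking the stop codon.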
Homology
BLAST of CmoCh15G006560 vs. ExPASy Swiss-Prot
Match: A0ZZ61 (Photosystem II CP47 reaction center protein OS=Gossypium barbadense OX=3634 GN=psbB PE=3 SV=1)

HSP 1 Score: 306.2 bits (783), Expect = 8.4e-82
Identity = 152/173 (87.86%), Positives = 154/173 (89.02%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAWSKIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   +  D        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPIFRDKDGRELFVRRMPTFF 363
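The identity and positives percentages in an HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows, using the counts from the hit above (152 and 154 out of 173). The helper names are hypothetical, and `count_identities` assumes two equal-length aligned strings in which `-` marks a gap.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)

def count_identities(query, subject):
    """Count columns where both aligned sequences carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

print(percent(152, 173))  # identity percentage for this HSP
print(percent(154, 173))  # positives percentage for this HSP

# First ten columns of the alignment above: one mismatch (L vs P).
print(count_identities("DPFVLGGIAS", "DPFVPGGIAS"))
```

The two percentages come out as 87.86 and 89.02, matching the header, and the ten-column fragment contains 9 identical positions.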

BLAST of CmoCh15G006560 vs. ExPASy Swiss-Prot
Match: Q2L932 (Photosystem II CP47 reaction center protein OS=Gossypium hirsutum OX=3635 GN=psbB PE=3 SV=1)

HSP 1 Score: 306.2 bits (783), Expect = 8.4e-82
Identity = 152/173 (87.86%), Positives = 154/173 (89.02%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAWSKIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   +  D        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPIFRDKDGRELFVRRMPTFF 363

BLAST of CmoCh15G006560 vs. ExPASy Swiss-Prot
Match: A4QKD1 (Photosystem II CP47 reaction center protein OS=Barbarea verna OX=50458 GN=psbB PE=3 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 1.9e-81
Identity = 151/173 (87.28%), Positives = 155/173 (89.60%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAWSKIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   ++ +        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPVFRNKEGRELFVRRMPTFF 363

BLAST of CmoCh15G006560 vs. ExPASy Swiss-Prot
Match: A4QKV7 (Photosystem II CP47 reaction center protein OS=Crucihimalaya wallichii OX=78192 GN=psbB PE=3 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 1.9e-81
Identity = 151/173 (87.28%), Positives = 155/173 (89.60%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAWSKIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   ++ +        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPVFRNKEGRELFVRRMPTFF 363

BLAST of CmoCh15G006560 vs. ExPASy Swiss-Prot
Match: A4QLD2 (Photosystem II CP47 reaction center protein OS=Lepidium virginicum OX=59292 GN=psbB PE=3 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 1.9e-81
Identity = 151/173 (87.28%), Positives = 155/173 (89.60%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAWSKIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   ++ +        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPVFRNKEGRELFVRRMPTFF 363

BLAST of CmoCh15G006560 vs. ExPASy TrEMBL
Match: A0A6J1FH27 (tetratricopeptide repeat protein 33 OS=Cucurbita moschata OX=3662 GN=LOC111445415 PE=4 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 4.4e-110
Identity = 210/210 (100.00%), Positives = 210/210 (100.00%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 60

Query: 477 LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 536
           LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK
Sbjct: 61  LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 120

Query: 537 AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI 596
           AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI
Sbjct: 121 AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI 180

Query: 597 KRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           KRRKQLHSAGLSSSENRYFVGDKLNNVDAL
Sbjct: 181 KRRKQLHSAGLSSSENRYFVGDKLNNVDAL 210

BLAST of CmoCh15G006560 vs. ExPASy TrEMBL
Match: A0A6J1JXQ4 (tetratricopeptide repeat protein 33 OS=Cucurbita maxima OX=3661 GN=LOC111489781 PE=4 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 4.3e-105
Identity = 206/215 (95.81%), Positives = 206/215 (95.81%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEI-----DRREDGASE 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGE EED RVNRQDNEI     DRREDGASE
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEVEEDSRVNRQDNEIDVKAVDRREDGASE 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
           SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA
Sbjct: 61  SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           A HLIKRRKQLHSAGLSSSENRYFVGDKLNNVD L
Sbjct: 181 AMHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDDL 215

BLAST of CmoCh15G006560 vs. ExPASy TrEMBL
Match: A0A0A0KH06 (TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G376230 PE=4 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 6.2e-88
Identity = 176/213 (82.63%), Positives = 187/213 (87.79%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNE-----IDRREDGASE 476
           MKLTWKNKGNSKKRS T IS+RSNLPFEVPG  EED R N Q  E     +DR ED AS+
Sbjct: 1   MKLTWKNKGNSKKRSLTTISTRSNLPFEVPGVVEEDDRANPQVKEVTAQPVDRSEDEASD 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
            S SDLKRLAESF+ QGNTLAE GK REALGKWETALTLMPENAVLHEQK+QVLLE+GEA
Sbjct: 61  RSLSDLKRLAESFQAQGNTLAEGGKFREALGKWETALTLMPENAVLHEQKAQVLLEVGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGD QDDR+T
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDVQDDRQT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVD 625
           A  LIKRRKQLHSAGLSSS+NRY VG+KL++ D
Sbjct: 181 AMRLIKRRKQLHSAGLSSSKNRYLVGEKLSDAD 213

BLAST of CmoCh15G006560 vs. ExPASy TrEMBL
Match: A0A5D3D571 (Tetratricopeptide repeat protein 33 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold481G00340 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 8.4e-85
Identity = 170/213 (79.81%), Positives = 184/213 (86.38%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNE-----IDRREDGASE 476
           MKLTWKNK NSKKRS   ISSRSNLPFEVPGE  ED   N  D E     +D+ E  AS+
Sbjct: 1   MKLTWKNKANSKKRSLAPISSRSNLPFEVPGEVGEDDAANPLDKEVTAQPVDQSEGEASD 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
            S SDLKRLAESF+ QG+TLAE+GK REALGKWETALTLMPENA+LHEQK+QVLLE+GEA
Sbjct: 61  RSFSDLKRLAESFQAQGDTLAESGKFREALGKWETALTLMPENAILHEQKAQVLLEVGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFD+ALAIKPDSGDAQDDR+T
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDKALAIKPDSGDAQDDRQT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVD 625
           A  LIKRRKQLHS+GLSSS NRY VG+KL+N D
Sbjct: 181 AMRLIKRRKQLHSSGLSSSNNRYLVGEKLSNAD 213

BLAST of CmoCh15G006560 vs. ExPASy TrEMBL
Match: A0A1S3B3U0 (tetratricopeptide repeat protein 33 OS=Cucumis melo OX=3656 GN=LOC103485686 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 8.4e-85
Identity = 170/213 (79.81%), Positives = 184/213 (86.38%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNE-----IDRREDGASE 476
           MKLTWKNK NSKKRS   ISSRSNLPFEVPGE  ED   N  D E     +D+ E  AS+
Sbjct: 1   MKLTWKNKANSKKRSLAPISSRSNLPFEVPGEVGEDDAANPLDKEVTAQPVDQSEGEASD 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
            S SDLKRLAESF+ QG+TLAE+GK REALGKWETALTLMPENA+LHEQK+QVLLE+GEA
Sbjct: 61  RSFSDLKRLAESFQAQGDTLAESGKFREALGKWETALTLMPENAILHEQKAQVLLEVGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFD+ALAIKPDSGDAQDDR+T
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDKALAIKPDSGDAQDDRQT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVD 625
           A  LIKRRKQLHS+GLSSS NRY VG+KL+N D
Sbjct: 181 AMRLIKRRKQLHSSGLSSSNNRYLVGEKLSNAD 213

BLAST of CmoCh15G006560 vs. NCBI nr
Match: KAG6578936.1 (Tetratricopeptide repeat protein 33, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 476.1 bits (1224), Expect = 4.7e-130
Identity = 244/245 (99.59%), Positives = 244/245 (99.59%), Query Frame = 0

Query: 382 SPDRALKLTENLPVSFRIRKSSVYFTLFDRGFEETMKLTWKNKGNSKKRSFTAISSRSNL 441
           SPDRALKLTENLPVSFRIRKSSVYFTLFDRGFEETMKLTWKNKGNSKKRSFTAISSRSNL
Sbjct: 10  SPDRALKLTENLPVSFRIRKSSVYFTLFDRGFEETMKLTWKNKGNSKKRSFTAISSRSNL 69

Query: 442 PFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSDLKRLAESFRDQGNTLAENGKLREAL 501
           PFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSDLKRLAESFRDQGNTLAENGKLREAL
Sbjct: 70  PFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSDLKRLAESFRDQGNTLAENGKLREAL 129

Query: 502 GKWETALTLMPENAVLHEQKSQVLLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLN 561
           GKWETALTLMPENAVLHEQKSQVLLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLN
Sbjct: 130 GKWETALTLMPENAVLHEQKSQVLLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLN 189

Query: 562 FGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLN 621
           FGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLN
Sbjct: 190 FGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLN 249

Query: 622 NVDAL 627
           NVD L
Sbjct: 250 NVDGL 254

BLAST of CmoCh15G006560 vs. NCBI nr
Match: XP_022939537.1 (tetratricopeptide repeat protein 33 [Cucurbita moschata])

HSP 1 Score: 408.7 bits (1049), Expect = 9.2e-110
Identity = 210/210 (100.00%), Positives = 210/210 (100.00%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 60

Query: 477 LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 536
           LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK
Sbjct: 61  LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 120

Query: 537 AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI 596
           AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI
Sbjct: 121 AATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKTATHLI 180

Query: 597 KRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           KRRKQLHSAGLSSSENRYFVGDKLNNVDAL
Sbjct: 181 KRRKQLHSAGLSSSENRYFVGDKLNNVDAL 210

BLAST of CmoCh15G006560 vs. NCBI nr
Match: XP_023551627.1 (tetratricopeptide repeat protein 33 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 395.2 bits (1014), Expect = 1.0e-105
Identity = 207/215 (96.28%), Positives = 207/215 (96.28%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEI-----DRREDGASE 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGE EEDGRVNRQDNEI     DRREDGASE
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEVEEDGRVNRQDNEIDVKAVDRREDGASE 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
           SS SDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA
Sbjct: 61  SSFSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVD L
Sbjct: 181 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDGL 215

BLAST of CmoCh15G006560 vs. NCBI nr
Match: KAG7016456.1 (Tetratricopeptide repeat protein 33, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 392.9 bits (1008), Expect = 5.2e-105
Identity = 208/230 (90.43%), Positives = 208/230 (90.43%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSSD 60

Query: 477 LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 536
           LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK
Sbjct: 61  LKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEAWGALK 120

Query: 537 AATR--------------------ATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRAL 596
           AAT                     ATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRAL
Sbjct: 121 AATSMLLSITFYCFDCLRFFILVGATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRAL 180

Query: 597 AIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           AIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVD L
Sbjct: 181 AIKPDSGDAQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDGL 230

BLAST of CmoCh15G006560 vs. NCBI nr
Match: XP_022993931.1 (tetratricopeptide repeat protein 33 [Cucurbita maxima])

HSP 1 Score: 392.1 bits (1006), Expect = 8.9e-105
Identity = 206/215 (95.81%), Positives = 206/215 (95.81%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEI-----DRREDGASE 476
           MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGE EED RVNRQDNEI     DRREDGASE
Sbjct: 1   MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEVEEDSRVNRQDNEIDVKAVDRREDGASE 60

Query: 477 SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 536
           SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA
Sbjct: 61  SSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQVLLEIGEA 120

Query: 537 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 596
           WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT
Sbjct: 121 WGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGDAQDDRKT 180

Query: 597 ATHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDAL 627
           A HLIKRRKQLHSAGLSSSENRYFVGDKLNNVD L
Sbjct: 181 AMHLIKRRKQLHSAGLSSSENRYFVGDKLNNVDDL 215

BLAST of CmoCh15G006560 vs. TAIR 10
Match: ATCG00680.1 (photosystem II reaction center protein B)

HSP 1 Score: 303.9 bits (777), Expect = 3.0e-82
Identity = 150/173 (86.71%), Positives = 155/173 (89.60%), Query Frame = 0

Query: 225 DPFVLGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 284
           DPFV GGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF
Sbjct: 191 DPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAF 250

Query: 285 VVAGTMWYGSATTPIELFGPTRYQWDQEYFQLEIYRRVSARLAENQSLSEAWSKIPEKLA 344
           VVAGTMWYGSATTPIELFGPTRYQWDQ YFQ EIYRRVSA LAENQSLSEAW+KIPEKLA
Sbjct: 251 VVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWAKIPEKLA 310

Query: 345 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHXSRKSPDRALKLTENLPVSF 398
           FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGH   ++ +        +P  F
Sbjct: 311 FYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPVFRNKEGRELFVRRMPTFF 363

BLAST of CmoCh15G006560 vs. TAIR 10
Match: AT1G77230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 209.5 bits (532), Expect = 7.6e-54
Identity = 118/215 (54.88%), Positives = 150/215 (69.77%), Query Frame = 0

Query: 417 MKLTWKNKGNSKKRSFTAISSRSNLPFEVPGEGEEDGRVNRQDNEIDRREDGASESSSS- 476
           MKLTW NK N KKRS   +S+  +LPFE     E    ++ ++++IDRR+      SS  
Sbjct: 1   MKLTW-NK-NPKKRSRLVLSNFPDLPFEKDDSLESQSHLHFREDDIDRRQTTDQLDSSEV 60

Query: 477 -----------DLKRLAESFRDQGNTLAENGKLREALGKWETALTLMPENAVLHEQKSQV 536
                      + K+LAES R QG+  AE GK +EALGKWE AL L+PE+A+LHEQK+QV
Sbjct: 61  GGNHPRENFDVEAKKLAESIRAQGDKFAEEGKYQEALGKWEAALNLVPEDAILHEQKAQV 120

Query: 537 LLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESFDRALAIKPDSGD 596
           LLE+G+AW ALKAATRAT++DPSWAEAW TLGRAQLNFGEPDSAI SF+ AL I  DS +
Sbjct: 121 LLELGDAWKALKAATRATEIDPSWAEAWTTLGRAQLNFGEPDSAIRSFESALLINADSRE 180

Query: 597 AQDDRKTATHLIKRRKQLHSAGLSSSENRYFVGDK 620
           A+DD K+A  LIK+R+QL ++G  +   R+ V DK
Sbjct: 181 AKDDLKSAKQLIKKREQLQTSGQDTDTTRFVVNDK 213

BLAST of CmoCh15G006560 vs. TAIR 10
Match: AT1G05150.1 (Calcium-binding tetratricopeptide family protein )

HSP 1 Score: 128.6 bits (322), Expect = 1.7e-29
Identity = 77/185 (41.62%), Positives = 94/185 (50.81%), Query Frame = 0

Query: 40  AVKTLKEVIFMKPDYVDAHCDLASALHAMREDEREIEVFQKAIDLKLGHVDALYILGGLY 99
           AVK L+E I++KPDY DAHCDLAS+LH+M EDER IEVFQ+AIDLK GHVDALY LGGLY
Sbjct: 363 AVKALEEAIYLKPDYADAHCDLASSLHSMGEDERAIEVFQRAIDLKPGHVDALYNLGGLY 422

Query: 100 VN---------------------------------------------------------- 147
           ++                                                          
Sbjct: 423 MDLGRFQRASEMYTRVLTVWPNHWRAQLNKAVSLLGAGETEEAKRALKEALKLTNRVELH 482

BLAST of CmoCh15G006560 vs. TAIR 10
Match: AT2G32450.1 (Calcium-binding tetratricopeptide family protein )

HSP 1 Score: 127.9 bits (320), Expect = 2.9e-29
Identity = 77/183 (42.08%), Positives = 94/183 (51.37%), Query Frame = 0

Query: 40  AVKTLKEVIFMKPDYVDAHCDLASALHAMREDEREIEVFQKAIDLKLGHVDALYILGGLY 99
           AVK L+E I++KPDY DAHCDLAS+LHAM EDER IEVFQ+AIDLK GHVDALY LGGLY
Sbjct: 358 AVKALEEAIYLKPDYADAHCDLASSLHAMGEDERAIEVFQRAIDLKPGHVDALYNLGGLY 417

Query: 100 V----------------------------------------------------------- 145
           +                                                           
Sbjct: 418 MDLGRFQRASEMYTRVLAVWPNHWRAQLNKAVSLLGAGETEEAKRALKEALKMTNRVELH 477

BLAST of CmoCh15G006560 vs. TAIR 10
Match: AT3G58620.1 (tetratricopetide-repeat thioredoxin-like 4 )

HSP 1 Score: 46.6 bits (109), Expect = 8.5e-05
Identity = 41/146 (28.08%), Positives = 59/146 (40.41%), Query Frame = 0

Query: 453 GRVNRQDNEIDRREDGASESSSSDLKRLAESFRDQGNTLAENGKLREALGKWETALTLMP 512
           G + R   ++      A+E S S      E  +  GN +   G   EAL  ++ A++L P
Sbjct: 189 GNIIRTGGKVSHATKAAAEMSDS------EEVKKAGNVMYRKGNYAEALALYDRAISLSP 248

Query: 513 ENAVLHEQKSQVLLEIGEAWGALKAATRATDLDPSWAEAWITLGRAQLNFGEPDSAIESF 572
           EN      ++  L   G    A+K    A   DPS+A A   L    L  GE ++A    
Sbjct: 249 ENPAYRSNRAAALAASGRLEEAVKECLEAVRCDPSYARAHQRLASLYLRLGEAENA---- 308

Query: 573 DRALAIK---PDSGDAQDDRKTATHL 596
            R L +    PD  D Q  +    HL
Sbjct: 309 RRHLCVSGQCPDQADLQRLQTLEKHL 324

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0ZZ61 | 8.4e-82 | 87.86 | Photosystem II CP47 reaction center protein OS=Gossypium barbadense OX=3634 GN=p...
Q2L932 | 8.4e-82 | 87.86 | Photosystem II CP47 reaction center protein OS=Gossypium hirsutum OX=3635 GN=psb...
A4QKD1 | 1.9e-81 | 87.28 | Photosystem II CP47 reaction center protein OS=Barbarea verna OX=50458 GN=psbB P...
A4QKV7 | 1.9e-81 | 87.28 | Photosystem II CP47 reaction center protein OS=Crucihimalaya wallichii OX=78192 ...
A4QLD2 | 1.9e-81 | 87.28 | Photosystem II CP47 reaction center protein OS=Lepidium virginicum OX=59292 GN=p...

Match Name | E-value | Identity (%) | Description
A0A6J1FH27 | 4.4e-110 | 100.00 | tetratricopeptide repeat protein 33 OS=Cucurbita moschata OX=3662 GN=LOC11144541...
A0A6J1JXQ4 | 4.3e-105 | 95.81 | tetratricopeptide repeat protein 33 OS=Cucurbita maxima OX=3661 GN=LOC111489781 ...
A0A0A0KH06 | 6.2e-88 | 82.63 | TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G376230 ...
A0A5D3D571 | 8.4e-85 | 79.81 | Tetratricopeptide repeat protein 33 OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3B3U0 | 8.4e-85 | 79.81 | tetratricopeptide repeat protein 33 OS=Cucumis melo OX=3656 GN=LOC103485686 PE=4...

Match Name | E-value | Identity (%) | Description
KAG6578936.1 | 4.7e-130 | 99.59 | Tetratricopeptide repeat protein 33, partial [Cucurbita argyrosperma subsp. soro...
XP_022939537.1 | 9.2e-110 | 100.00 | tetratricopeptide repeat protein 33 [Cucurbita moschata]
XP_023551627.1 | 1.0e-105 | 96.28 | tetratricopeptide repeat protein 33 [Cucurbita pepo subsp. pepo]
KAG7016456.1 | 5.2e-105 | 90.43 | Tetratricopeptide repeat protein 33, partial [Cucurbita argyrosperma subsp. argy...
XP_022993931.1 | 8.9e-105 | 95.81 | tetratricopeptide repeat protein 33 [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
ATCG00680.1 | 3.0e-82 | 86.71 | photosystem II reaction center protein B
AT1G77230.1 | 7.6e-54 | 54.88 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05150.1 | 1.7e-29 | 41.62 | Calcium-binding tetratricopeptide family protein
AT2G32450.1 | 2.9e-29 | 42.08 | Calcium-binding tetratricopeptide family protein
AT3G58620.1 | 8.5e-05 | 28.08 | tetratricopetide-repeat thioredoxin-like 4
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 548..581; e-value: 7.3E-5; score: 32.2
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 480..513; e-value: 0.52; score: 19.4
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 55..88; e-value: 15.0; score: 13.3
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 514..547; e-value: 340.0; score: 1.5
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 480..513; score: 9.1749
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 548..581; score: 12.6854
None | No IPR available | GENE3D | 3.10.680.10 | Photosystem II CP47 reaction center protein | coord: 306..403; e-value: 6.0E-39; score: 135.3
None | No IPR available | PFAM | PF13432 | TPR_16 | coord: 532..581; e-value: 1.1E-5; score: 26.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 439..476
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 444..476
None | No IPR available | PANTHER | PTHR33180:SF14 | PHOTOSYSTEM II CP47 REACTION CENTER PROTEIN | coord: 223..378
None | No IPR available | PROSITE | PS50293 | TPR_REGION | coord: 548..580; score: 9.664794
IPR000932 | Photosystem antenna protein-like | PFAM | PF00421 | PSII | coord: 220..378; e-value: 7.0E-72; score: 242.7
IPR000932 | Photosystem antenna protein-like | PANTHER | PTHR33180 | PHOTOSYSTEM II CP43 REACTION CENTER PROTEIN | coord: 223..378
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 470..612; e-value: 9.9E-26; score: 92.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 4..141; e-value: 1.1E-9; score: 40.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 479..598
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 34..103
IPR036001 | Photosystem antenna protein-like superfamily | SUPERFAMILY | 161077 | Photosystem II antenna protein-like | coord: 220..385
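
The `coord:` values in the Alignment column can be compared programmatically, for instance to check whether two annotated regions overlap. The sketch below is illustrative only: it assumes the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this), and the helper names `parse_coord`, `span_length` and `overlaps` are hypothetical.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: START..END'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Length of a range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]

# Two rows from the table above: the PF00421 (PSII) hit and the
# C-terminal GENE3D 1.25.40.10 (tetratricopeptide repeat domain) hit.
psii = parse_coord("coord: 220..378")
tpr = parse_coord("coord: 470..612")

print(span_length(*psii), span_length(*tpr), overlaps(psii, tpr))
```

Under the inclusive-coordinate assumption, the two regions above occupy separate, non-overlapping stretches of the sequence.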

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh15G006560.1 | CmoCh15G006560.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009772 | photosynthetic electron transport in photosystem II
biological_process | GO:0018298 | protein-chromophore linkage
biological_process | GO:0019684 | photosynthesis, light reaction
biological_process | GO:0009767 | photosynthetic electron transport chain
cellular_component | GO:0009535 | chloroplast thylakoid membrane
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0009523 | photosystem II
cellular_component | GO:0016020 | membrane
cellular_component | GO:0009521 | photosystem
molecular_function | GO:0016168 | chlorophyll binding
molecular_function | GO:0045156 | electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity
molecular_function | GO:0005515 | protein binding