Cla97C08G156150 (gene) Watermelon (97103) v2

Name: Cla97C08G156150
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr08 : 24031622 .. 24033406 (+)
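
The span length follows from the two endpoints of the location. The sketch below is a minimal illustration of that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this). The function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

# Coordinates copied from the Location field of this record.
gene_length = span_length(24031622, 24033406)
print(gene_length)           # 1785
print(gene_length % 3 == 0)  # True: a whole number of codons
```

Under that assumption the gene spans 1785 nt. That is consistent with 594 codons plus a stop codon, and 594 is also the upper bound of the InterPro coordinates reported for the protein.
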
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGTTTTGGTAAAATTGCCGCCATTAATCACCCACACCAACTTCGCCGGAATGTTTGCTTTCTGATGAATTATTCTTCGTCCTGTGCTCTTAGCAGTTTAAATATCATCGAAGAGAGCAATACCCAAAACTGGAATTACCTCGAGTTGCAGTCTCGAATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTGGAGACTTTGAATTCTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTACAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTTTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGACCAACCCCAAATAAAACGACTTTCAACATACTTCTTAATGGACTCATGTCCTCGGGTTATCTTAGAGATGCATATTTCTTTGCAGAAGAGATGACCAAGAGTGGGATAAATCCATCCTTCACATCCTTGTCCAAATTGCTTAAAATTTCAATGAAATCGGGTAGCTTACTAGATTCTATTTGGATATTCAAGTTCATGTTGAAGTTAGACCATTTGCCAACTGAACCCACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCAGGATGTTGGAAGAGGCATACAGCTTTTGTGCTGCACTTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTTTGGCCAGAGTTTTCTAGCTTTGCAGTTGTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTAGATCTTTATCGTTGTTTAGATCAAATGAGAAGTGATGGATTGAAGCCCAATGTCATTACTTATACAGTTATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTGAATTCTTGAAATTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATATGTGCACTTTGCTTTCACGATAAAGCATACGATGTTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACATATACTGCTTTGGCTGGAGGGATGGTGAAAGTAGGAAAGTTAGAAATTGCTTATGAGTTATTGCGTAATGTGATCTCGAGAAACTGTACTGTTGACGTTGTTGTGTACAATATATACTTCCATTGCTTGTGTCAAAATAGTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGAAGGTATTACTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATTATAATCTTGAACATGCACTGAAGCTATTAGTCTGCTTCGAGTGGCCTGAGAGCAGCCCTGATGTGGTTTCATTCAATACAGTTCTCTCTGCAGCATGCAAAACTGGGGATTTAGATCTAATTCACAGGGTCTTGCATTGTATGGAATGTAGAGGTGTTGAGCCAAATGTAATAAGTTTTACTTGTTTAGTACAATATTTGTCTACAATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGGAACGGCCCTGCTCCCTCAAGTGTCACTTTCAATATCCTCCTTGACAAGCTTTGCAGAAGTGGATTTGTTAGCACTGCATACCAGATATTTGAGTGTCTCCAAAAAGCTGGATTATCGCTCGATAGAAGAACTTACAGAATTCTTCTACGTGCCTTATTAAGGAAGCGTGATGTCAACCTGATTGAATGA

mRNA sequence

ATGCGTTTTGGTAAAATTGCCGCCATTAATCACCCACACCAACTTCGCCGGAATGTTTGCTTTCTGATGAATTATTCTTCGTCCTGTGCTCTTAGCAGTTTAAATATCATCGAAGAGAGCAATACCCAAAACTGGAATTACCTCGAGTTGCAGTCTCGAATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTGGAGACTTTGAATTCTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTACAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTTTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGACCAACCCCAAATAAAACGACTTTCAACATACTTCTTAATGGACTCATGTCCTCGGGTTATCTTAGAGATGCATATTTCTTTGCAGAAGAGATGACCAAGAGTGGGATAAATCCATCCTTCACATCCTTGTCCAAATTGCTTAAAATTTCAATGAAATCGGGTAGCTTACTAGATTCTATTTGGATATTCAAGTTCATGTTGAAGTTAGACCATTTGCCAACTGAACCCACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCAGGATGTTGGAAGAGGCATACAGCTTTTGTGCTGCACTTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTTTGGCCAGAGTTTTCTAGCTTTGCAGTTGTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTAGATCTTTATCGTTGTTTAGATCAAATGAGAAGTGATGGATTGAAGCCCAATGTCATTACTTATACAGTTATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTGAATTCTTGAAATTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATATGTGCACTTTGCTTTCACGATAAAGCATACGATGTTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACATATACTGCTTTGGCTGGAGGGATGGTGAAAGTAGGAAAGTTAGAAATTGCTTATGAGTTATTGCGTAATGTGATCTCGAGAAACTGTACTGTTGACGTTGTTGTGTACAATATATACTTCCATTGCTTGTGTCAAAATAGTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGAAGGTATTACTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATTATAATCTTGAACATGCACTGAAGCTATTAGTCTGCTTCGAGTGGCCTGAGAGCAGCCCTGATGTGGTTTCATTCAATACAGTTCTCTCTGCAGCATGCAAAACTGGGGATTTAGATCTAATTCACAGGGTCTTGCATTGTATGGAATGTAGAGGTGTTGAGCCAAATGTAATAAGTTTTACTTGTTTAGTACAATATTTGTCTACAATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGGAACGGCCCTGCTCCCTCAAGTGTCACTTTCAATATCCTCCTTGACAAGCTTTGCAGAAGTGGATTTGTTAGCACTGCATACCAGATATTTGAGTGTCTCCAAAAAGCTGGATTATCGCTCGATAGAAGAACTTACAGAATTCTTCTACGTGCCTTATTAAGGAAGCGTGATGTCAACCTGATTGAATGA

Coding sequence (CDS)

ATGCGTTTTGGTAAAATTGCCGCCATTAATCACCCACACCAACTTCGCCGGAATGTTTGCTTTCTGATGAATTATTCTTCGTCCTGTGCTCTTAGCAGTTTAAATATCATCGAAGAGAGCAATACCCAAAACTGGAATTACCTCGAGTTGCAGTCTCGAATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTGGAGACTTTGAATTCTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTACAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTTTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGACCAACCCCAAATAAAACGACTTTCAACATACTTCTTAATGGACTCATGTCCTCGGGTTATCTTAGAGATGCATATTTCTTTGCAGAAGAGATGACCAAGAGTGGGATAAATCCATCCTTCACATCCTTGTCCAAATTGCTTAAAATTTCAATGAAATCGGGTAGCTTACTAGATTCTATTTGGATATTCAAGTTCATGTTGAAGTTAGACCATTTGCCAACTGAACCCACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCAGGATGTTGGAAGAGGCATACAGCTTTTGTGCTGCACTTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTTTGGCCAGAGTTTTCTAGCTTTGCAGTTGTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTAGATCTTTATCGTTGTTTAGATCAAATGAGAAGTGATGGATTGAAGCCCAATGTCATTACTTATACAGTTATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTGAATTCTTGAAATTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATATGTGCACTTTGCTTTCACGATAAAGCATACGATGTTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACATATACTGCTTTGGCTGGAGGGATGGTGAAAGTAGGAAAGTTAGAAATTGCTTATGAGTTATTGCGTAATGTGATCTCGAGAAACTGTACTGTTGACGTTGTTGTGTACAATATATACTTCCATTGCTTGTGTCAAAATAGTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGAAGGTATTACTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATTATAATCTTGAACATGCACTGAAGCTATTAGTCTGCTTCGAGTGGCCTGAGAGCAGCCCTGATGTGGTTTCATTCAATACAGTTCTCTCTGCAGCATGCAAAACTGGGGATTTAGATCTAATTCACAGGGTCTTGCATTGTATGGAATGTAGAGGTGTTGAGCCAAATGTAATAAGTTTTACTTGTTTAGTACAATATTTGTCTACAATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGGAACGGCCCTGCTCCCTCAAGTGTCACTTTCAATATCCTCCTTGACAAGCTTTGCAGAAGTGGATTTGTTAGCACTGCATACCAGATATTTGAGTGTCTCCAAAAAGCTGGATTATCGCTCGATAGAAGAACTTACAGAATTCTTCTACGTGCCTTATTAAGGAAGCGTGATGTCAACCTGATTGAATGA

Protein sequence

MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAASGDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNILLNGLMSSGYLRDAYFFAEEMTKSGINPSFTSLSKLLKISMKSGSLLDSIWIFKFMLKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQSFLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLYRCLDQMRSDGLKPNVITYTVIIKFLCDDGRIGEAFEFLKFMEGEGCDPDLVTYNIIICALCFHDKAYDVAEILQVIHHRGFSPDAYTYTALAGGMVKVGKLEIAYELLRNVISRNCTVDVVVYNIYFHCLCQNSRSREALSLLKSMKEEGITPTTVSYNTVLRGFCRDYNLEHALKLLVCFEWPESSPDVVSFNTVLSAACKTGDLDLIHRVLHCMECRGVEPNVISFTCLVQYLSTMGRYSECLKLLEYMVWNGPAPSSVTFNILLDKLCRSGFVSTAYQIFECLQKAGLSLDRRTYRILLRALLRKRDVNLIE
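
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code. The helper name `translate` and the two short test fragments (the first and last codons of the CDS above) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCGTTTTGGTAAAATTGCCGCCATTAAT"))  # MRFGKIAAIN (start of the protein)
print(translate("CTGATTGAATGA"))                    # LIE (end of the protein, before TGA)
```

Passing the full CDS string to `translate` should reproduce the complete protein sequence if the record is internally consistent.
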
BLAST of Cla97C08G156150 vs. NCBI nr
Match: XP_023002254.1 (pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 412.9 bits (1060), Expect = 1.8e-111
Identity = 204/283 (72.08%), Positives = 225/283 (79.51%), Query Frame = 0

Query: 1   MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAAS 60
           MRF  I AIN+PH+L R + F +NYS+SCALSS+NII++SNT NW YL+LQSRMQNYAAS
Sbjct: 1   MRFASIPAINYPHRLTRKLRFSVNYSTSCALSSVNIIQDSNTHNWKYLDLQSRMQNYAAS 60

Query: 61  GDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKT 120
           GDL EALETLN M+NVAGKPS+YDYNALFHRYLSSGNV LE LVQVYIGMK FGP+PN+T
Sbjct: 61  GDLPEALETLNFMKNVAGKPSIYDYNALFHRYLSSGNVSLEQLVQVYIGMKNFGPSPNRT 120

Query: 121 TFNILLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           TFNILLNG +S GYLRDAYFFAEEMTKSG                               
Sbjct: 121 TFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNLVDSIWIFKFM 180

Query: 181 XKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQS 240
            +LDHLPTEPT+AMF+CMLCKARMLEEAY FCA L+SK+ NFQAYVFNPVLWALCK G+S
Sbjct: 181 LRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWALCKCGKS 240

Query: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLYRCLD 284
            LALQLFYMMKK G+ HNVCSYTALLYGFGRE LWVDLY  LD
Sbjct: 241 SLALQLFYMMKKNGIAHNVCSYTALLYGFGRECLWVDLYSFLD 283

BLAST of Cla97C08G156150 vs. NCBI nr
Match: XP_022131368.1 (pentatricopeptide repeat-containing protein At1g09900-like [Momordica charantia])

HSP 1 Score: 412.5 bits (1059), Expect = 2.4e-111
Identity = 204/281 (72.60%), Positives = 227/281 (80.78%), Query Frame = 0

Query: 1   MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAAS 60
           MRF  I  IN+P++L R+ CFL+NYS+SCA+SS+NIIEESNT NWNYL+LQSRMQ+ AAS
Sbjct: 1   MRFCTIPTINYPYRLGRSFCFLVNYSTSCAISSVNIIEESNTHNWNYLQLQSRMQDRAAS 60

Query: 61  GDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKT 120
           GDLAEALETLN MR++ GKPSVYDYNALF RYLSS NVLLE LVQVYIGMKRFGP PNKT
Sbjct: 61  GDLAEALETLNFMRSITGKPSVYDYNALFCRYLSSENVLLEQLVQVYIGMKRFGPAPNKT 120

Query: 121 TFNILLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           TFNILLNGL+S G+LRDAYFF EEMTKSG                               
Sbjct: 121 TFNILLNGLLSLGFLRDAYFFVEEMTKSGINPSFTFLSKWLKKSLKSGNLVDSIWIFEFM 180

Query: 181 XKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQS 240
            +LDHLPTEPTLAMF+C+LCK++MLEEA  FCAALLSK+  FQAYVFNP++WALCK G+S
Sbjct: 181 LRLDHLPTEPTLAMFICLLCKSKMLEEASRFCAALLSKNLTFQAYVFNPIIWALCKSGKS 240

Query: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLYRC 282
           FLALQLFYMMKKKGMTHNVCSYTALLYGFGRE LWVDLYRC
Sbjct: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRECLWVDLYRC 281

BLAST of Cla97C08G156150 vs. NCBI nr
Match: XP_023522453.1 (pentatricopeptide repeat-containing protein At1g64583, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 409.1 bits (1050), Expect = 2.6e-110
Identity = 202/283 (71.38%), Positives = 222/283 (78.45%), Query Frame = 0

Query: 1   MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAAS 60
           MRF  I A+N+PH+L R  C  +NYS+SCALSS+NIIE+SNT +W YL+LQSRMQNYAAS
Sbjct: 1   MRFASIPALNYPHRLTRKFCSSVNYSTSCALSSINIIEDSNTHSWKYLDLQSRMQNYAAS 60

Query: 61  GDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKT 120
           GDL EALETLN M+NVAGKPSVYDYNALFHRYLSSGNV +E LVQVYIGMK  GP+PN+T
Sbjct: 61  GDLPEALETLNFMKNVAGKPSVYDYNALFHRYLSSGNVSVEQLVQVYIGMKNLGPSPNRT 120

Query: 121 TFNILLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           TFNILLNG +S GYLRDAYFFAEEMTKSG                               
Sbjct: 121 TFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNLVDSIWIFKFM 180

Query: 181 XKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQS 240
            +LDHLPTEPT+AMF+CMLCKARMLEEAY FCA L+SK+ NFQAYVFNPVLWALCK G S
Sbjct: 181 LRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWALCKCGNS 240

Query: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLYRCLD 284
            LALQLFYMMKK G+ HNVCSYTALLYGFGRE LWVDLY  LD
Sbjct: 241 SLALQLFYMMKKNGIAHNVCSYTALLYGFGRECLWVDLYSFLD 283

BLAST of Cla97C08G156150 vs. NCBI nr
Match: XP_023537059.1 (pentatricopeptide repeat-containing protein At1g09900-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 409.1 bits (1050), Expect = 2.6e-110
Identity = 202/283 (71.38%), Positives = 222/283 (78.45%), Query Frame = 0

Query: 1   MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAAS 60
           MRF  I A+N+PH+L R  C  +NYS+SCALSS+NIIE+SNT +W YL+LQSRMQNYAAS
Sbjct: 1   MRFASIPALNYPHRLTRKFCSSVNYSTSCALSSINIIEDSNTHSWKYLDLQSRMQNYAAS 60

Query: 61  GDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKT 120
           GDL EALETLN M+NVAGKPSVYDYNALFHRYLSSGNV +E LVQVYIGMK  GP+PN+T
Sbjct: 61  GDLPEALETLNFMKNVAGKPSVYDYNALFHRYLSSGNVSVEQLVQVYIGMKNLGPSPNRT 120

Query: 121 TFNILLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           TFNILLNG +S GYLRDAYFFAEEMTKSG                               
Sbjct: 121 TFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNLVDSIWIFKFM 180

Query: 181 XKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQS 240
            +LDHLPTEPT+AMF+CMLCKARMLEEAY FCA L+SK+ NFQAYVFNPVLWALCK G S
Sbjct: 181 LRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWALCKCGNS 240

Query: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLYRCLD 284
            LALQLFYMMKK G+ HNVCSYTALLYGFGRE LWVDLY  LD
Sbjct: 241 SLALQLFYMMKKNGIAHNVCSYTALLYGFGRECLWVDLYSFLD 283

BLAST of Cla97C08G156150 vs. NCBI nr
Match: XP_022951234.1 (putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 399.8 bits (1026), Expect = 1.6e-107
Identity = 201/279 (72.04%), Positives = 219/279 (78.49%), Query Frame = 0

Query: 1   MRFGKIAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAAS 60
           MRF  I AIN+PH+L R +    NYS+SCALSS+NIIE+S+T N  YL+LQSRMQNYAAS
Sbjct: 1   MRFASIPAINYPHRLSRKLRSSANYSTSCALSSVNIIEDSHTNNRKYLDLQSRMQNYAAS 60

Query: 61  GDLAEALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKT 120
           GDL EALETLN M+NVAGKPSVYDYNALFHRYLSSGNV LE LVQVYIGMK FGP+PN+T
Sbjct: 61  GDLPEALETLNFMKNVAGKPSVYDYNALFHRYLSSGNVSLEQLVQVYIGMKNFGPSPNRT 120

Query: 121 TFNILLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           TFNILLNG +S GYLRDAYFFAEEMTKSG                               
Sbjct: 121 TFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNVVDSIWIFKFM 180

Query: 181 XKLDHLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSFNFQAYVFNPVLWALCKFGQS 240
            +LDHLPTEPT+AMF+CMLCKARMLEEAY FCA L+SK+ NFQAYVFNPVLWALCK G S
Sbjct: 181 LRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWALCKCGNS 240

Query: 241 FLALQLFYMMKKKGMTHNVCSYTALLYGFGRERLWVDLY 280
            LALQLFYMMKK G+ HNVCSYTALLYGFGRE LWVDLY
Sbjct: 241 SLALQLFYMMKKNGIPHNVCSYTALLYGFGRECLWVDLY 279

BLAST of Cla97C08G156150 vs. TrEMBL
Match: tr|A0A2N9F0T1|A0A2N9F0T1_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12468 PE=4 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 1.1e-40
Identity = 87/135 (64.44%), Positives = 106/135 (78.52%), Query Frame = 0

Query: 14  QLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAASGDLAEALETLNSM 73
           ++ + V FL+N SSSCA+ + + IEESNT + NY+ELQ RMQNYA SG  ++ALE LNSM
Sbjct: 8   RIGKTVRFLVNLSSSCAVRTADFIEESNTHHCNYVELQRRMQNYATSGHFSKALEALNSM 67

Query: 74  RNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNILLNGLMSSG 133
           RNV GKP+VYDYNAL H Y  S NVLLE LV+VY+GMKRFGP PN +TFN LLNG++S G
Sbjct: 68  RNVHGKPTVYDYNALMHCYFKSRNVLLEVLVEVYLGMKRFGPVPNASTFNTLLNGMLSLG 127

Query: 134 YLRDAYFFAEEMTKS 149
            L+DA+F AEEM  S
Sbjct: 128 NLKDAFFIAEEMCGS 142

BLAST of Cla97C08G156150 vs. TrEMBL
Match: tr|A0A2P4K714|A0A2P4K714_QUESU (Putative pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_25246 PE=4 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 3.2e-40
Identity = 86/143 (60.14%), Positives = 111/143 (77.62%), Query Frame = 0

Query: 6   IAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAASGDLAE 65
           I  + +  ++ + V FL+++SSSCAL ++   EESNT ++NY +LQSRMQNYA SG   +
Sbjct: 62  IPRVGYCCRIGKTVRFLLSFSSSCALRAVEFAEESNTHDFNYAKLQSRMQNYAISGHFRK 121

Query: 66  ALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNIL 125
           ALETLNSMRNV GKP+VYDYNAL + +L S NVLLE LV+VY+GMKRFGP PN +TFN L
Sbjct: 122 ALETLNSMRNVPGKPTVYDYNALMYCHLKSRNVLLEVLVEVYVGMKRFGPAPNASTFNTL 181

Query: 126 LNGLMSSGYLRDAYFFAEEMTKS 149
           LNG++S G L+DA+F A+EM  S
Sbjct: 182 LNGMLSLGNLKDAFFIAKEMCGS 204

BLAST of Cla97C08G156150 vs. TrEMBL
Match: tr|A0A124SBS2|A0A124SBS2_CYNCS (Uncharacterized protein OS=Cynara cardunculus var. scolymus OX=59895 GN=Ccrd_006461 PE=4 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 1.1e-37
Identity = 103/262 (39.31%), Positives = 139/262 (53.05%), Query Frame = 0

Query: 22  LMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAASGDLAEALETLNSMRNVAGKPS 81
           L + SSS  + SL++ EE N    NY EL+++MQN   SG + +ALE  + MRNV+GKP+
Sbjct: 3   LSSSSSSSVVQSLDMAEEDNPHCPNYGELRTKMQNLTRSGSIGKALEIFHLMRNVSGKPT 62

Query: 82  VYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNILLNGLMSSGYLRDAYFF 141
           VYDYN+L + YL S  V L  L  +Y  MKR    PN +TFN  L GL   G  + A   
Sbjct: 63  VYDYNSLINCYLKSNKVGLHDLCGLYFEMKRVELHPNASTFNTFLKGLSLLGESKVAISV 122

Query: 142 AEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLDHLPTEPTLAMFVCMLCK 201
             EM   G                                 L+++PTEP + + +  L +
Sbjct: 123 IVEMCNYGFTPSFSCLSNLLKKCLDSMELVDGLRVLDLMLGLNYIPTEPKVILLINSLSR 182

Query: 202 ARMLEEAYSFCAALLSKSFNFQA-YVFNPVLWALCKFGQSFLALQLFYMMKKKGMTHNVC 261
             M  +A      LL    NFQ+ YV+NP+LW+LCK  Q   AL  F  +KKKG+ HNVC
Sbjct: 183 CGMTRDACVVFFKLLEIG-NFQSPYVYNPILWSLCKSDQISGALAFFCSLKKKGLVHNVC 242

Query: 262 SYTALLYGFGRERLWVDLYRCL 283
           SYTAL+YGFG++ L+ +   CL
Sbjct: 243 SYTALVYGFGQKGLFKEASGCL 263

BLAST of Cla97C08G156150 vs. TrEMBL
Match: tr|A0A1U8B6S7|A0A1U8B6S7_NELNU (pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isoform X1 OS=Nelumbo nucifera OX=4432 GN=LOC104607632 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 1.5e-32
Identity = 90/258 (34.88%), Positives = 135/258 (52.33%), Query Frame = 0

Query: 12  PHQLRRNVCFLMNYSSSCALSSLNIIE------ESNTQNWN-YLELQSRMQNYAASGDLA 71
           PH +   V  + + +++   + +   E       S +QN N +  LQ +M++YA  G   
Sbjct: 36  PHHIVNQVADIQSLAAALGSNEIEAQEFQEESSNSPSQNPNTFAALQCKMKDYATYGLAQ 95

Query: 72  EALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNI 131
           EA +TLN M+ V+GKP+VYDYNA  +  L SGN+ +E LV+V+  M+  GP+PN  TFN 
Sbjct: 96  EAWDTLNDMKRVSGKPTVYDYNAFLYYNLKSGNLSIEDLVEVHGRMRILGPSPNALTFNT 155

Query: 132 LLNGLMSSGYLRDAYFFAEEMTKSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLD 191
           LLNG +S G L  A++F +EM ++G                                 LD
Sbjct: 156 LLNGSLSLGSLEGAFYFTKEMCRNGFVPSFSFLSKLLKRSLELGDLVYSLDALELMLDLD 215

Query: 192 HLPTEPTLAMFVCMLCKARMLEEAYSFCAALLSKSF-NFQAYVFNPVLWALCKFGQSFLA 251
           + PTEPT  + V    K+  + EA    + L  K F     + +N ++WALCK GQ+ +A
Sbjct: 216 YFPTEPTSNLLVNSFIKSGKMHEACFLLSLLSDKCFLPSMHHSYNSIIWALCKSGQTCVA 275

Query: 252 LQLFYMMKKKGMTHNVCS 262
             LF  +KK+G+ HNVC+
Sbjct: 276 SALFCSLKKRGIGHNVCT 293

BLAST of Cla97C08G156150 vs. TrEMBL
Match: tr|A0A2I4GPB7|A0A2I4GPB7_9ROSI (pentatricopeptide repeat-containing protein At1g09900-like isoform X1 OS=Juglans regia OX=51240 GN=LOC109009621 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 2.5e-32
Identity = 76/143 (53.15%), Positives = 101/143 (70.63%), Query Frame = 0

Query: 6   IAAINHPHQLRRNVCFLMNYSSSCALSSLNIIEESNTQNWNYLELQSRMQNYAASGDLAE 65
           I  +++ H++ R V F +N+SS CAL +++ IE+S  ++ +Y+EL  RMQNYA SG   +
Sbjct: 71  IPTVSYTHRIFRFVRFFVNFSSYCALRTVDFIEDSKARDSDYIELHRRMQNYATSGYFRK 130

Query: 66  ALETLNSMRNVAGKPSVYDYNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNIL 125
           ALE L SM NV GKP+VYD NAL + YL S N L E L++VYIGMKR GP PN  TFN+L
Sbjct: 131 ALEILISMGNVPGKPTVYDCNALMYCYLKSRNELFEELLEVYIGMKRIGPPPNALTFNML 190

Query: 126 LNGLMSSGYLRDAYFFAEEMTKS 149
           LN ++S G L+DA F A+EM  S
Sbjct: 191 LNRMLSLGKLKDALFIAKEMCGS 213
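
The percent identity reported for each HSP is the number of identical positions divided by the alignment length. The sketch below recomputes it from the `Identity = a/b (p%)` field. The regular expression and the helper name `percent_identity` are illustrative assumptions about this page's text layout, not part of any BLAST tool.

```python
import re

# Matches the "Identity = 204/283 (72.08%)" field of an HSP summary line.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \((\d+\.\d+)%\)")

def percent_identity(line):
    """Return (recomputed, reported) percent identity for an HSP summary line."""
    match = IDENTITY_RE.search(line)
    if match is None:
        raise ValueError("no Identity field found")
    identical = int(match.group(1))
    aligned = int(match.group(2))
    reported = float(match.group(3))
    return round(100 * identical / aligned, 2), reported

computed, reported = percent_identity("Identity = 204/283 (72.08%), Query Frame = 0")
print(computed, reported)  # 72.08 72.08
```

The same check can be run on every HSP summary line to confirm that the reported percentages agree with the raw counts.
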

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_023002254.1  1.8e-111  72.08     pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isofor...
XP_022131368.1  2.4e-111  72.60     pentatricopeptide repeat-containing protein At1g09900-like [Momordica charantia]
XP_023522453.1  2.6e-110  71.38     pentatricopeptide repeat-containing protein At1g64583, mitochondrial-like isofor...
XP_023537059.1  2.6e-110  71.38     pentatricopeptide repeat-containing protein At1g09900-like [Cucurbita pepo subsp...
XP_022951234.1  1.6e-107  72.04     putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial is...
Match Name                      E-value   Identity  Description
tr|A0A2N9F0T1|A0A2N9F0T1_FAGSY  1.1e-40   64.44     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12468 PE=4 SV=1
tr|A0A2P4K714|A0A2P4K714_QUESU  3.2e-40   60.14     Putative pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 G...
tr|A0A124SBS2|A0A124SBS2_CYNCS  1.1e-37   39.31     Uncharacterized protein OS=Cynara cardunculus var. scolymus OX=59895 GN=Ccrd_006...
tr|A0A1U8B6S7|A0A1U8B6S7_NELNU  1.5e-32   34.88     pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isofor...
tr|A0A2I4GPB7|A0A2I4GPB7_9ROSI  2.5e-32   53.15     pentatricopeptide repeat-containing protein At1g09900-like isoform X1 OS=Juglans...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C08G156150.1  Cla97C08G156150.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 393..461, e-value: 5.3E-16, score: 60.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 219..392, e-value: 1.7E-39, score: 138.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 462..594, e-value: 3.0E-30, score: 107.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 28..218, e-value: 6.9E-23, score: 83.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 292..340, e-value: 6.0E-14, score: 51.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 502..551, e-value: 1.2E-11, score: 44.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 398..446, e-value: 7.3E-15, score: 54.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 365..395, e-value: 0.073, score: 13.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 470..500, e-value: 0.0022, score: 18.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 400..433, e-value: 6.8E-8, score: 30.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 261..294, e-value: 2.5E-6, score: 25.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 540..573, e-value: 0.0018, score: 16.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 121..153, e-value: 0.0015, score: 16.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 435..457, e-value: 0.0017, score: 16.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 470..504, e-value: 5.2E-5, score: 21.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 295..328, e-value: 3.3E-6, score: 24.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF13812           PPR_3                coord: 220..270, e-value: 7.1E-4, score: 19.5
IPR002885  Pentatricopeptide repeat                           PFAM     PF13812           PPR_3                coord: 79..127, e-value: 1.3E-4, score: 21.9
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 398..432, score: 11.992
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 538..572, score: 10.293
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 433..467, score: 7.837
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 328..362, score: 9.109
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 81..117, score: 7.366
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 258..292, score: 10.084
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 46..80, score: 6.5
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 153..187, score: 6.325
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 468..502, score: 11.071
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 188..222, score: 6.051
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 363..397, score: 9.251
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 223..257, score: 8.451
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 573..594, score: 5.59
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 503..537, score: 9.076
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 293..327, score: 11.466
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 118..152, score: 10.654
None       No IPR available                                   PANTHER  PTHR44149         FAMILY NOT NAMED     coord: 56..184
None       No IPR available                                   PANTHER  PTHR44149:SF1     SUBFAMILY NOT NAMED  coord: 243..434
None       No IPR available                                   PANTHER  PTHR44149:SF1     SUBFAMILY NOT NAMED  coord: 143..271
None       No IPR available                                   PANTHER  PTHR44149:SF1     SUBFAMILY NOT NAMED  coord: 56..184
None       No IPR available                                   PANTHER  PTHR44149:SF1     SUBFAMILY NOT NAMED  coord: 407..592
None       No IPR available                                   PANTHER  PTHR44149         FAMILY NOT NAMED     coord: 243..434
None       No IPR available                                   PANTHER  PTHR44149         FAMILY NOT NAMED     coord: 143..271
None       No IPR available                                   PANTHER  PTHR44149         FAMILY NOT NAMED     coord: 407..592
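
The PROSITE PS51375 hits above can be combined to see how much of the protein is covered by predicted PPR repeats. The sketch below merges the reported coordinate ranges, assuming they are 1-based inclusive residue positions and treating directly abutting ranges as contiguous. The function name `merge_intervals` is illustrative.

```python
def merge_intervals(intervals):
    """Merge inclusive intervals that overlap or directly abut each other."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# PROSITE PS51375 coordinates copied from the InterPro table of this record.
ppr_hits = [
    (398, 432), (538, 572), (433, 467), (328, 362), (81, 117), (258, 292),
    (46, 80), (153, 187), (468, 502), (188, 222), (363, 397), (223, 257),
    (573, 594), (503, 537), (293, 327), (118, 152),
]

merged = merge_intervals(ppr_hits)
covered = sum(end - start + 1 for start, end in merged)
print(len(ppr_hits), merged, covered)  # 16 [(46, 594)] 549
```

Under these assumptions the 16 PROSITE hits tile residues 46 to 594 without gaps, that is, 549 residues of the protein.
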

The following gene(s) are paralogous to this gene:

None

The following block(s) cover this gene:
Gene             Organism                      Block
Cla97C08G156150  Watermelon (97103) v2         wmbwmbB148
Cla97C08G156150  Watermelon (97103) v2         wmbwmbB165
Cla97C08G156150  Silver-seed gourd             carwmbB0061
Cla97C08G156150  Silver-seed gourd             carwmbB0207
Cla97C08G156150  Silver-seed gourd             carwmbB0666
Cla97C08G156150  Silver-seed gourd             carwmbB1035
Cla97C08G156150  Cucumber (Gy14) v2            cgybwmbB153
Cla97C08G156150  Cucumber (Gy14) v2            cgybwmbB239
Cla97C08G156150  Cucumber (Gy14) v1            cgywmbB282
Cla97C08G156150  Cucumber (Gy14) v1            cgywmbB546
Cla97C08G156150  Cucurbita maxima (Rimu)       cmawmbB044
Cla97C08G156150  Cucurbita maxima (Rimu)       cmawmbB201
Cla97C08G156150  Cucurbita maxima (Rimu)       cmawmbB503
Cla97C08G156150  Cucurbita maxima (Rimu)       cmawmbB505
Cla97C08G156150  Cucurbita maxima (Rimu)       cmawmbB830
Cla97C08G156150  Cucurbita moschata (Rifu)     cmowmbB031
Cla97C08G156150  Cucurbita moschata (Rifu)     cmowmbB184
Cla97C08G156150  Cucurbita moschata (Rifu)     cmowmbB485
Cla97C08G156150  Cucurbita moschata (Rifu)     cmowmbB487
Cla97C08G156150  Cucurbita moschata (Rifu)     cmowmbB800
Cla97C08G156150  Wild cucumber (PI 183967)     cpiwmbB162
Cla97C08G156150  Wild cucumber (PI 183967)     cpiwmbB257
Cla97C08G156150  Cucumber (Chinese Long) v3    cucwmbB160
Cla97C08G156150  Cucumber (Chinese Long) v3    cucwmbB251
Cla97C08G156150  Cucumber (Chinese Long) v2    cuwmbB159
Cla97C08G156150  Cucumber (Chinese Long) v2    cuwmbB249
Cla97C08G156150  Bottle gourd (USVL1VR-Ls)     lsiwmbB417
Cla97C08G156150  Melon (DHL92) v3.6.1          medwmbB293
Cla97C08G156150  Melon (DHL92) v3.6.1          medwmbB343
Cla97C08G156150  Melon (DHL92) v3.5.1          mewmbB306
Cla97C08G156150  Melon (DHL92) v3.5.1          mewmbB353
Cla97C08G156150  Watermelon (Charleston Gray)  wcgwmbB249
Cla97C08G156150  Watermelon (Charleston Gray)  wcgwmbB308
Cla97C08G156150  Watermelon (97103) v1         wmwmbB182
Cla97C08G156150  Watermelon (97103) v1         wmwmbB227
Cla97C08G156150  Wax gourd                     wgowmbB603