Cla97C03G052040 (gene) Watermelon (97103) v2

NameCla97C03G052040
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat
LocationCla97Chr03 : 1088302 .. 1089815 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAATTCGATGGTTTTCCGGGTCGAAATTGCCCATTTTTGCATCGGTATTCATACGATGTCTAATGAGGAGAGCAATTAGTGATATCTCATTACCAGTGAAATCGACTATTTCTGGAATTCAGCCGGATTTGAGCCGACCCTTATGGGGATTCCGGCTATATCACGACGGGAGGCCGCGGGGACCCCTTTGGAGAAGCCGGAAAGCAATTGGGAAAGAAGCGCTTTTCGTTATTCAGGGATTGAAGAGATTCAAGGAAGATGAAGAGAAACTCGAGAAGTTCATGAAGAGTCATGTATTGAGGCTCTTGAAATTGGATATGATTGCTGTGCTTGGTGAACTTGAGCGGCAGGAAGAGGTTGCTTTAGCTGTTAAGGTTTGTTCCCCCTTCCTTTTCTTCTATATGTTCTGTCCTTTTTGTAGCTCTTTCTGGATCTTGACATACATTTTAGATTTTACTTCATGAGTCGTCTCAAATAGTTAATGGGCTTCTGTAAGTTGATGCTATGAGATTTTTTTCTTCTTAACTGTTATGTTATTGATTGTTATTTACTTTAGGTGAATAACTTTGCTAATGCTGCATTCAAGCTTCTATTCTGCTCTTGAATTTCTAAGATTATGTTTCTCAGCTTCTAGCTTTCTTAGTATGAATGTGATGTTCTTGACGTTTGCCCTGTCTCTTTTCATGGAGCTATTGTGCAATTTTTGTCAAGTTGCAGCTCTCCCTCTCTTTATGCATGGTAGTAATTGAAGCATGCTTAAGGAAGGTCATAGAGGAAGTAAATTCATGTATTGGTGTTCCTCTAAATGATATAATGAGTTGTTTTCCCACTTCCTCTTATATGTTAAGCTTGTTTTATGTCGAGGAAGTTGCCCTTGCATTCAAGATTCTTCCATATGAATACTGCTTGTTTCTCCCTTGTTCAACTCTGATCTGCCCCTTTGAGTGCTTATCATATCTAGTTCAAACTTTTTGTATGAATTAAGATCTCAAATATGAAATCTATTTCACTCTGGTTGCATGTGGGAACATTTGCCTTTTATATTCTTTTATTTTTCAACTCAAGATTGACTTGATTTTATTTCATGGTGTCTGTGAATCTTACACGTTGAATTTTCCAGATATTTAGGTTGATTCGAAAGCAAGACTGGTACAAGCCTGATGTTTTTATGTATAAAGACTTGATTATTGCATTAGCTAGAAGTAAAAAGATGGATGATGCAATGAAATTATGGGAAAGCATGAGAGAGGAGAATTTATTTCCAGATTCTCAAACATATACCGAAGTCATCAGAGGTTTCTTGAGATATGGATCTCCTTCTGATGCAATGAATGTATATGAGGATATGAAGAAGTCTCCAGATCCACCAGATGAGTTGCCATTTAGAATTTTGTTGAAAGGCCTTTTGCCACACCCCCTTTTGAGAAACAGAGTAAAGCAAGATTTTGAGGAGCTCTTTCCTGACCAGCATGTGTATGATCCCCCAGAAGAGATATTCGGCCTACGTTGA

mRNA sequence

ATGGCAATTCGATGGTTTTCCGGGTCGAAATTGCCCATTTTTGCATCGGTATTCATACGATGTCTAATGAGGAGAGCAATTAGTGATATCTCATTACCAGTGAAATCGACTATTTCTGGAATTCAGCCGGATTTGAGCCGACCCTTATGGGGATTCCGGCTATATCACGACGGGAGGCCGCGGGGACCCCTTTGGAGAAGCCGGAAAGCAATTGGGAAAGAAGCGCTTTTCGTTATTCAGGGATTGAAGAGATTCAAGGAAGATGAAGAGAAACTCGAGAAGTTCATGAAGAGTCATGTATTGAGGCTCTTGAAATTGGATATGATTGCTGTGCTTGGTGAACTTGAGCGGCAGGAAGAGGTTGCTTTAGCTGTTAAGATATTTAGGTTGATTCGAAAGCAAGACTGGTACAAGCCTGATGTTTTTATGTATAAAGACTTGATTATTGCATTAGCTAGAAGTAAAAAGATGGATGATGCAATGAAATTATGGGAAAGCATGAGAGAGGAGAATTTATTTCCAGATTCTCAAACATATACCGAAGTCATCAGAGGTTTCTTGAGATATGGATCTCCTTCTGATGCAATGAATGTATATGAGGATATGAAGAAGTCTCCAGATCCACCAGATGAGTTGCCATTTAGAATTTTGTTGAAAGGCCTTTTGCCACACCCCCTTTTGAGAAACAGAGTAAAGCAAGATTTTGAGGAGCTCTTTCCTGACCAGCATGTGTATGATCCCCCAGAAGAGATATTCGGCCTACGTTGA

Coding sequence (CDS)

ATGGCAATTCGATGGTTTTCCGGGTCGAAATTGCCCATTTTTGCATCGGTATTCATACGATGTCTAATGAGGAGAGCAATTAGTGATATCTCATTACCAGTGAAATCGACTATTTCTGGAATTCAGCCGGATTTGAGCCGACCCTTATGGGGATTCCGGCTATATCACGACGGGAGGCCGCGGGGACCCCTTTGGAGAAGCCGGAAAGCAATTGGGAAAGAAGCGCTTTTCGTTATTCAGGGATTGAAGAGATTCAAGGAAGATGAAGAGAAACTCGAGAAGTTCATGAAGAGTCATGTATTGAGGCTCTTGAAATTGGATATGATTGCTGTGCTTGGTGAACTTGAGCGGCAGGAAGAGGTTGCTTTAGCTGTTAAGATATTTAGGTTGATTCGAAAGCAAGACTGGTACAAGCCTGATGTTTTTATGTATAAAGACTTGATTATTGCATTAGCTAGAAGTAAAAAGATGGATGATGCAATGAAATTATGGGAAAGCATGAGAGAGGAGAATTTATTTCCAGATTCTCAAACATATACCGAAGTCATCAGAGGTTTCTTGAGATATGGATCTCCTTCTGATGCAATGAATGTATATGAGGATATGAAGAAGTCTCCAGATCCACCAGATGAGTTGCCATTTAGAATTTTGTTGAAAGGCCTTTTGCCACACCCCCTTTTGAGAAACAGAGTAAAGCAAGATTTTGAGGAGCTCTTTCCTGACCAGCATGTGTATGATCCCCCAGAAGAGATATTCGGCCTACGTTGA

Protein sequence

MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRPRGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLFPDSQTYTEVIRGFLRYGSPSDAMNVYEDMKKSPDPPDELPFRILLKGLLPHPLLRNRVKQDFEELFPDQHVYDPPEEIFGLR
BLAST of Cla97C03G052040 vs. NCBI nr
Match: XP_008457435.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Cucumis melo])

HSP 1 Score: 351.7 bits (901), Expect = 2.1e-93
Identity = 234/255 (91.76%), Postives = 242/255 (94.90%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MAIRWFS SKLPIFA VF+R L RR I D+ LPV STI+G QPD SRP+WGFRLYHDGRP
Sbjct: 1   MAIRWFSRSKLPIFALVFLRGLTRRPIRDVPLPVTSTITGFQPDSSRPIWGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKL KFMKSHV RLLKLDM+AVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLGKFMKSHVSRLLKLDMLAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVKIFRLIRKQDWYKPDV++YKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX
Sbjct: 121 VALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHV+DPPEEIF LR
Sbjct: 241 DQHVFDPPEEIFSLR 255

BLAST of Cla97C03G052040 vs. NCBI nr
Match: XP_004145279.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Cucumis sativus] >KGN65758.1 hypothetical protein Csa_1G525290 [Cucumis sativus])

HSP 1 Score: 351.3 bits (900), Expect = 2.8e-93
Identity = 232/255 (90.98%), Postives = 242/255 (94.90%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MAIRWFS SKLP+FASVF++ L RR I D+ LPVKSTI+  QPD  RP+WGFRLYHDGRP
Sbjct: 1   MAIRWFSRSKLPVFASVFLQGLTRRPIRDVPLPVKSTITDFQPDSRRPIWGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEK EKFMKSHV RLLKLDM+AVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKFEKFMKSHVSRLLKLDMVAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVKIFRLIRKQDWYKPDV++YKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX
Sbjct: 121 VALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHV+DPPEEIF LR
Sbjct: 241 DQHVFDPPEEIFSLR 255

BLAST of Cla97C03G052040 vs. NCBI nr
Match: XP_022964699.1 (protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] >XP_022964700.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] >XP_022964701.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata])

HSP 1 Score: 339.3 bits (869), Expect = 1.1e-89
Identity = 227/255 (89.02%), Postives = 241/255 (94.51%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA+RWFS  KLPI  SVF+R LMRR I DI LPVKST++G QPD SRPL GFRLYHDGRP
Sbjct: 1   MAMRWFSKLKLPICYSVFLRGLMRRPIRDIPLPVKSTVAGFQPDSSRPLSGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKL+KFMK HVLRLLKLDMIAVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLQKFMKCHVLRLLKLDMIAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVK+FRLI+KQ+WYKPDVF+YKDLI+ALARSK+MDDAM+LWESMR+E  XXXXXXXX
Sbjct: 121 VALAVKVFRLIQKQEWYKPDVFLYKDLIVALARSKQMDDAMELWESMRKEXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEE+FP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEEIFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHVYDPPEEIFGLR
Sbjct: 241 DQHVYDPPEEIFGLR 255

BLAST of Cla97C03G052040 vs. NCBI nr
Match: XP_023521051.1 (protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo] >XP_023522412.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 338.6 bits (867), Expect = 1.9e-89
Identity = 226/255 (88.63%), Postives = 242/255 (94.90%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA+RWFS SKLPI  SVF+R L  R I DISLPVKST++G QPD S+PL GFRLYHDGRP
Sbjct: 1   MAMRWFSKSKLPICYSVFLRGLTTRPIRDISLPVKSTVAGFQPDSSQPLSGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKL+KFMK HVLRLLKLDMIAVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLQKFMKCHVLRLLKLDMIAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVK+FRLI+KQ+WYKPDVF+YKDLI+ALARSK+MD+AM+LWESMR+EN XXXXXXXX
Sbjct: 121 VALAVKVFRLIQKQEWYKPDVFLYKDLIVALARSKQMDEAMELWESMRKENXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEE+FP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEEIFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHVYDPPEEIFGLR
Sbjct: 241 DQHVYDPPEEIFGLR 255

BLAST of Cla97C03G052040 vs. NCBI nr
Match: XP_023519300.1 (protein THYLAKOID ASSEMBLY 8-like, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519301.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 337.4 bits (864), Expect = 4.2e-89
Identity = 226/255 (88.63%), Postives = 241/255 (94.51%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA+RWFS SKLPI  SVF+R L  R I DISLPVKST++G QPD S+PL GFRLYHDGRP
Sbjct: 1   MAMRWFSKSKLPICYSVFLRGLTTRPIRDISLPVKSTVAGFQPDSSQPLSGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKL+KFMK HVLRLLKLDMIAVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLQKFMKCHVLRLLKLDMIAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVK+FRLI+KQ+WYKPDVF+YKDLI+ALARSK+MDDAM+LWESMR+E  XXXXXXXX
Sbjct: 121 VALAVKVFRLIQKQEWYKPDVFLYKDLIVALARSKQMDDAMELWESMRKEXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEE+FP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEEIFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHVYDPPEEIFGLR
Sbjct: 241 DQHVYDPPEEIFGLR 255

BLAST of Cla97C03G052040 vs. TrEMBL
Match: tr|A0A1S3C5H5|A0A1S3C5H5_CUCME (pentatricopeptide repeat-containing protein At3g46870 OS=Cucumis melo OX=3656 GN=LOC103497124 PE=4 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 1.4e-93
Identity = 234/255 (91.76%), Postives = 242/255 (94.90%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MAIRWFS SKLPIFA VF+R L RR I D+ LPV STI+G QPD SRP+WGFRLYHDGRP
Sbjct: 1   MAIRWFSRSKLPIFALVFLRGLTRRPIRDVPLPVTSTITGFQPDSSRPIWGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKL KFMKSHV RLLKLDM+AVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLGKFMKSHVSRLLKLDMLAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVKIFRLIRKQDWYKPDV++YKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX
Sbjct: 121 VALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHV+DPPEEIF LR
Sbjct: 241 DQHVFDPPEEIFSLR 255

BLAST of Cla97C03G052040 vs. TrEMBL
Match: tr|A0A0A0LVE9|A0A0A0LVE9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525290 PE=4 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.8e-93
Identity = 232/255 (90.98%), Postives = 242/255 (94.90%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MAIRWFS SKLP+FASVF++ L RR I D+ LPVKSTI+  QPD  RP+WGFRLYHDGRP
Sbjct: 1   MAIRWFSRSKLPVFASVFLQGLTRRPIRDVPLPVKSTITDFQPDSRRPIWGFRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWRSRKAIGKEALFVIQGLKRFKEDEEK EKFMKSHV RLLKLDM+AVLGELERQEE
Sbjct: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKFEKFMKSHVSRLLKLDMVAVLGELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVKIFRLIRKQDWYKPDV++YKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX
Sbjct: 121 VALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           DQHV+DPPEEIF LR
Sbjct: 241 DQHVFDPPEEIFSLR 255

BLAST of Cla97C03G052040 vs. TrEMBL
Match: tr|A0A2P4ISK6|A0A2P4ISK6_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_34307 PE=4 SV=1)

HSP 1 Score: 289.3 bits (739), Expect = 8.6e-75
Identity = 199/255 (78.04%), Postives = 226/255 (88.63%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA R FS  K PI  S+ IR L R AI   SLP+K  ++ ++PD  +P+ GF+LYHDGRP
Sbjct: 1   MATRAFSRLKTPICTSILIRNLTRTAIKHHSLPLKPKVAALEPDCCKPICGFKLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWR +K IGKEALFVI GLKRFK+DEEKLEKF+K+HVLRLLK+D+IAVL ELERQEE
Sbjct: 61  RGPLWRGKKLIGKEALFVILGLKRFKDDEEKLEKFIKTHVLRLLKMDLIAVLSELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALAVK+F++IR+QDWY+PD F+YKDLIIALA+ +KMDDAM+LWE MR+EN XXXXXXXX
Sbjct: 121 VALAVKVFKVIRRQDWYRPDAFLYKDLIIALAKCQKMDDAMQLWEDMRKENXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PLLRNRVKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLLRNRVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFGLR 256
           ++HVYDPPEEIFGLR
Sbjct: 241 ERHVYDPPEEIFGLR 255

BLAST of Cla97C03G052040 vs. TrEMBL
Match: tr|A0A2I4FFU6|A0A2I4FFU6_9ROSI (pentatricopeptide repeat-containing protein At3g46870 OS=Juglans regia OX=51240 GN=LOC108998427 PE=4 SV=1)

HSP 1 Score: 275.8 bits (704), Expect = 9.8e-71
Identity = 200/253 (79.05%), Postives = 221/253 (87.35%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA R FS  K P FAS+ IR L    I + SLPVK  +    PD  +P   FRLYHDGRP
Sbjct: 1   MATRAFSRLKHPFFASILIRNLRIVPIREPSLPVKPEVLASVPDYCKP---FRLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLWR +K IGKEALFVI GLKRFK+DEEKLEKFMK+HVLRLLK+D+IAVL ELERQEE
Sbjct: 61  RGPLWRGKKLIGKEALFVILGLKRFKDDEEKLEKFMKTHVLRLLKMDIIAVLCELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
           VALA K+FR+I+KQDWYKPDV++YKDLIIALAR K+MDDAM+LWE MR+E+LXXXXXXXX
Sbjct: 121 VALAFKMFRVIQKQDWYKPDVYLYKDLIIALARRKQMDDAMQLWEDMRKEDLXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRN+VKQDFEELFP
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNKVKQDFEELFP 240

Query: 241 DQHVYDPPEEIFG 254
           ++HVYDPPEEIFG
Sbjct: 241 ERHVYDPPEEIFG 250

BLAST of Cla97C03G052040 vs. TrEMBL
Match: tr|A0A1U8PJZ9|A0A1U8PJZ9_GOSHI (pentatricopeptide repeat-containing protein At3g46870-like OS=Gossypium hirsutum OX=3635 GN=LOC107959151 PE=4 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 2.9e-70
Identity = 138/253 (54.55%), Postives = 172/253 (67.98%), Query Frame = 0

Query: 1   MAIRWFSGSKLPIFASVFIRCLMRRAISDISLPVKSTISGIQPDLSRPLWGFRLYHDGRP 60
           MA R FS SK PI  S+ ++ L + +I+     VK  I  +QPD+ +P+ GF+LYHDGRP
Sbjct: 1   MATRAFSKSKFPILTSILLQNLTKNSITRSPFLVKPQIPPVQPDIHKPISGFKLYHDGRP 60

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPLW+ +K IGKEALFVI GLKRFK+DE+K+ KF+K+HVLRLLK+D+IAVL ELERQEE
Sbjct: 61  RGPLWKGKKLIGKEALFVILGLKRFKDDEDKVLKFIKTHVLRLLKMDLIAVLSELERQEE 120

Query: 121 VALAVKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXX 180
            +LAVK+F +I+KQDWYKPDV++YKDLIIALA+ +KMD+AMKLWESMR+ENL        
Sbjct: 121 TSLAVKVFEVIQKQDWYKPDVYLYKDLIIALAKCRKMDEAMKLWESMRKENLFPDSQTYT 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFP 240
                                                      HPLLRN+VK+DFEELFP
Sbjct: 181 EIIRGFLRDGSPADAMNIYEDMIKSPDPPEELPFRILLKGLLPHPLLRNKVKKDFEELFP 240

Query: 241 DQHVYDPPEEIFG 254
           ++H YDPPEEIFG
Sbjct: 241 EKHAYDPPEEIFG 253

BLAST of Cla97C03G052040 vs. Swiss-Prot
Match: sp|Q9STF9|THA8L_ARATH (Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8L PE=1 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 5.1e-62
Identity = 128/249 (51.41%), Postives = 160/249 (64.26%), Query Frame = 0

Query: 10  KLPIFASVFIRCLMR----RAISDISLPVKSTISGIQPDLSRPLWGF-RLYHDGRPRGPL 69
           K P FAS+F + + R      IS  +L  K+ +  I P   +P   F   +HDGRPRGPL
Sbjct: 10  KFPTFASIFFQNITRNPSIHRISFSNLKPKTLLHPIPP---KPFTVFVSRFHDGRPRGPL 69

Query: 70  WRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALA 129
           WR +K IGKEALFVI GLKR KED+EKL+KF+K+HV RLLKLDM+AV+GELERQEE ALA
Sbjct: 70  WRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALA 129

Query: 130 VKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXXXXXX 189
           +K+F +I+KQ+WY+PDVFMYKDLI++LA+SK+MD+AM LWE M++ENL            
Sbjct: 130 IKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFPDSQTYTEVIR 189

Query: 190 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFPDQHV 249
                                                  HPLLRN+VK+DFEELFP++H 
Sbjct: 190 GFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKVKKDFEELFPEKHA 249

Query: 250 YDPPEEIFG 254
           YDPPEEIFG
Sbjct: 250 YDPPEEIFG 255

BLAST of Cla97C03G052040 vs. Swiss-Prot
Match: sp|Q1PFH7|PPR89_ARATH (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX=3702 GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 1.3e-25
Identity = 60/181 (33.15%), Postives = 96/181 (53.04%), Query Frame = 0

Query: 71  IGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALAVKIFRL 130
           + KE L   + LKR +    +L++F+ SHV RLLK D+++VL E +RQ +V L +K++ +
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 131 IRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXXXXXXXXXXXX 190
           +R++ WY+PD+F Y+D+++ LAR+KK+D+  K+WE +++E +                  
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 191 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFPDQHVYDPPEE 250
                                            +P LR +VK DF ELFP   VYDPPE+
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 251 I 252
           I
Sbjct: 181 I 181

BLAST of Cla97C03G052040 vs. Swiss-Prot
Match: sp|Q9LVW6|THA8_ARATH (Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8 PE=2 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 5.1e-06
Identity = 39/104 (37.50%), Postives = 60/104 (57.69%), Query Frame = 0

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPL + R  +  EA+  IQ LKR       L   ++  + RL+K D+I+VL EL RQ+ 
Sbjct: 39  RGPLLKGR-ILSTEAIQSIQSLKRAHRTGVSLSLTLRP-LRRLIKSDLISVLRELLRQDY 98

Query: 121 VALAVKIFRLIRKQDWYKP-DVFMYKDLIIALARSKKMDDAMKL 164
             LAV +   +R +  Y P D+ +Y D++ AL R+K+ D+  +L
Sbjct: 99  CTLAVHVLSTLRTE--YPPLDLVLYADIVNALTRNKEFDEIDRL 138

BLAST of Cla97C03G052040 vs. TAIR10
Match: AT3G46870.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 239.2 bits (609), Expect = 2.8e-63
Identity = 128/249 (51.41%), Postives = 160/249 (64.26%), Query Frame = 0

Query: 10  KLPIFASVFIRCLMR----RAISDISLPVKSTISGIQPDLSRPLWGF-RLYHDGRPRGPL 69
           K P FAS+F + + R      IS  +L  K+ +  I P   +P   F   +HDGRPRGPL
Sbjct: 10  KFPTFASIFFQNITRNPSIHRISFSNLKPKTLLHPIPP---KPFTVFVSRFHDGRPRGPL 69

Query: 70  WRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALA 129
           WR +K IGKEALFVI GLKR KED+EKL+KF+K+HV RLLKLDM+AV+GELERQEE ALA
Sbjct: 70  WRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALA 129

Query: 130 VKIFRLIRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXXXXXX 189
           +K+F +I+KQ+WY+PDVFMYKDLI++LA+SK+MD+AM LWE M++ENL            
Sbjct: 130 IKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFPDSQTYTEVIR 189

Query: 190 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFPDQHV 249
                                                  HPLLRN+VK+DFEELFP++H 
Sbjct: 190 GFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKVKKDFEELFPEKHA 249

Query: 250 YDPPEEIFG 254
           YDPPEEIFG
Sbjct: 250 YDPPEEIFG 255

BLAST of Cla97C03G052040 vs. TAIR10
Match: AT1G62350.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 118.2 bits (295), Expect = 7.2e-27
Identity = 60/181 (33.15%), Postives = 96/181 (53.04%), Query Frame = 0

Query: 71  IGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALAVKIFRL 130
           + KE L   + LKR +    +L++F+ SHV RLLK D+++VL E +RQ +V L +K++ +
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 131 IRKQDWYKPDVFMYKDLIIALARSKKMDDAMKLWESMREENLXXXXXXXXXXXXXXXXXX 190
           +R++ WY+PD+F Y+D+++ LAR+KK+D+  K+WE +++E +                  
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 191 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHPLLRNRVKQDFEELFPDQHVYDPPEE 250
                                            +P LR +VK DF ELFP   VYDPPE+
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 251 I 252
           I
Sbjct: 181 I 181

BLAST of Cla97C03G052040 vs. TAIR10
Match: AT5G09320.1 (Vacuolar sorting protein 9 (VPS9) domain)

HSP 1 Score: 71.2 bits (173), Expect = 1.0e-12
Identity = 37/79 (46.84%), Postives = 53/79 (67.09%), Query Frame = 0

Query: 92  LEKFMKSHVLRLLKLDMIAVLGELERQEEVALAVKIFRLIRKQDWYKPDVFMYKDLIIAL 151
           L++ + S   RLLK DM+AVL EL RQ E +LA+K+F  IRK+ WYKP V MY D+I  +
Sbjct: 541 LDRVIISKFRRLLKFDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVM 600

Query: 152 ARSKKMDDAMKLWESMREE 171
           A +  M++   L+ +M+ E
Sbjct: 601 ADNSLMEEVNYLYSAMKSE 619

BLAST of Cla97C03G052040 vs. TAIR10
Match: AT3G27750.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 53.1 bits (126), Expect = 2.8e-07
Identity = 39/104 (37.50%), Postives = 60/104 (57.69%), Query Frame = 0

Query: 61  RGPLWRSRKAIGKEALFVIQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEE 120
           RGPL + R  +  EA+  IQ LKR       L   ++  + RL+K D+I+VL EL RQ+ 
Sbjct: 39  RGPLLKGR-ILSTEAIQSIQSLKRAHRTGVSLSLTLRP-LRRLIKSDLISVLRELLRQDY 98

Query: 121 VALAVKIFRLIRKQDWYKP-DVFMYKDLIIALARSKKMDDAMKL 164
             LAV +   +R +  Y P D+ +Y D++ AL R+K+ D+  +L
Sbjct: 99  CTLAVHVLSTLRTE--YPPLDLVLYADIVNALTRNKEFDEIDRL 138

BLAST of Cla97C03G052040 vs. TAIR10
Match: AT3G53170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 42.7 bits (99), Expect = 3.8e-04
Identity = 29/94 (30.85%), Postives = 45/94 (47.87%), Query Frame = 0

Query: 79  IQGLKRFKEDEEKLEKFMKSHVLRLLKLDMIAVLGELERQEEVALAVKIFRLIRKQDWYK 138
           ++G++R    E+ L  + K+         ++  L E  ++     A+KIF L+RKQ WY+
Sbjct: 91  VKGIERKANSEKYLTLWPKA---------VLEALDEAIKENRWQSALKIFNLLRKQHWYE 150

Query: 139 PDVFMYKDLIIALARSKKMDDAMKLWESMREENL 173
           P    Y  L   L   K+ D A  L+E M  E L
Sbjct: 151 PRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEGL 175

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008457435.12.1e-9391.76PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Cucumis melo][more]
XP_004145279.12.8e-9390.98PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Cucumis sativu... [more]
XP_022964699.11.1e-8989.02protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] >XP_022964... [more]
XP_023521051.11.9e-8988.63protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo] >X... [more]
XP_023519300.14.2e-8988.63protein THYLAKOID ASSEMBLY 8-like, chloroplastic isoform X1 [Cucurbita pepo subs... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3C5H5|A0A1S3C5H5_CUCME1.4e-9391.76pentatricopeptide repeat-containing protein At3g46870 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A0A0LVE9|A0A0A0LVE9_CUCSA1.8e-9390.98Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525290 PE=4 SV=1[more]
tr|A0A2P4ISK6|A0A2P4ISK6_QUESU8.6e-7578.04Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_3... [more]
tr|A0A2I4FFU6|A0A2I4FFU6_9ROSI9.8e-7179.05pentatricopeptide repeat-containing protein At3g46870 OS=Juglans regia OX=51240 ... [more]
tr|A0A1U8PJZ9|A0A1U8PJZ9_GOSHI2.9e-7054.55pentatricopeptide repeat-containing protein At3g46870-like OS=Gossypium hirsutum... [more]
Match NameE-valueIdentityDescription
sp|Q9STF9|THA8L_ARATH5.1e-6251.41Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702... [more]
sp|Q1PFH7|PPR89_ARATH1.3e-2533.15Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX... [more]
sp|Q9LVW6|THA8_ARATH5.1e-0637.50Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=T... [more]
Match NameE-valueIdentityDescription
AT3G46870.12.8e-6351.41Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G62350.17.2e-2733.15Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G09320.11.0e-1246.84Vacuolar sorting protein 9 (VPS9) domain[more]
AT3G27750.12.8e-0737.50FUNCTIONS IN: molecular_function unknown[more]
AT3G53170.13.8e-0430.85Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C03G052040.1Cla97C03G052040.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 139..187
e-value: 2.2E-10
score: 40.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 178..205
e-value: 9.2E-5
score: 20.4
coord: 143..175
e-value: 3.6E-6
score: 24.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 140..174
score: 11.159
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 175..209
score: 10.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 69..255
e-value: 1.8E-53
score: 182.6
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..247
NoneNo IPR availablePANTHERPTHR24015:SF332SUBFAMILY NOT NAMEDcoord: 1..247

The following gene(s) are paralogous to this gene:

None