Cla97C02G033630 (gene) Watermelon (97103) v2

Name: Cla97C02G033630
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cla97Chr02 : 7182172 .. 7184670 (+)
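The location above implies a feature length that can be checked against the sequences listed further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); the helper name `span_length` is hypothetical.

```python
# Coordinates copied from the Location field (assumed 1-based, inclusive).
START, END = 7182172, 7184670


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


length = span_length(START, END)
# If the whole span is coding, its length should be a multiple of three.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # 2499 833 0
```

Under the further assumption that the gene has no introns, as the apparently identical gene, mRNA and CDS sequences suggest, 833 codons would correspond to 832 residues plus a stop codon.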
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGACAAGGTTGCTCTTCCCCTCCTCCTCCCTAATCCACCGCCCTCCAAACCCCTCTTCCCCGTCTTCCACCACCACCCCCCTTCACCTTCTTCTTCGCCGCCGCCGCCACTCACCTTTCCTCCGCCCCCCCAGCCGCCGCTCTCTTCCTCATCATCTCCCATAGCCCCTCTTCTCCAAGACCTCCTTCCCCACCAACACCCTTCTTCCACTCAACCCCATCTCCCCAAACCCACTTTCAGAACCCGCACACGAATCGGCCGGTCCCGCGACCCGAACCGCGGGAAGCCATGGTCGCACCACCGTCTCTCTACTCAAGGTCAGCGAATTCTCGATTCTTTACTTAACCCGGAATTCGATTCCTCCTCGTTGAATGAAATTTTACTTCAATTGTTTGAAACCAGTCCTGAAGGACTTAATTTCACCTCCAAGTCTGTTTCCTTCGACATTTTGGGGATAATCAAGGGCTTGGTGTTTAGCAAGAAGAATGAATTAGCCTTGCGTGTGTTTGATTTCGTTCGTAATCGGGTGGATTTTGCATCTATTTTGAGTAGCTCTGTGATTGCTGTGATTATTAGTGTACTCGGTAAAGAGGGTCGGGCTTCTTTTGCAGCTTCTCTGCTTCATGAGCTTCGAAATGATGGGGTAAATATTGATATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGTAATGGGAGGTATAGAGAGGCTGTGATGGTTTTTAAGAAACTGGAAGAAGACGGTTGTAGACCAACTTTAATTACCTATAATGTTATCTTGAATGTCTATGGGAAAATGGGAATGCCTTGGAGTAAAATTGCTGCTCTTGTTGATAGTATGAAGAGTTCCGGGGTTGCTCCGGATTTGTATACGTATAATACGCTTATTAGTAGTTGTCGCCGGGGGTCGTTGTATGAAGAAGCTGCAGAGGTTTTTGAGGAGATGAAAGCAGCTGGGTTTAGTCCTGATAAGGTGACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGGCGGCCTAAGGAAGCCATGGAGGTTTTGAAGGAGATGGAAGCAAGCGGGTTTGCACCTAGTATTGTCACTTACAATTCGTTGATATCGGCTTATGCACGGGATGGTTTGTTAGATGAAGCTATGGAGCTAAAAGCACAAATGGTGGAGAAGGGGATTAAGCCTGATGTTTTTACATACACCACATTGTTGTCTGGTTTCGAGAAGACGGGTAAGGATGATTATGCTATGAGAGTATTCGAGGAGATGAGGGTTGCAGGGTGCCAAGCGAATATATGCACCTTCAATGCACTGATTAAGATGCACGGTAACAGGGGTAATTTTGTGGAGATGATGAAGGTTTTTGAAGAAATTAAGAAATGTGAATGTGTGCCTGATATTGTTACTTGGAACACTCTTTTGGCAGTGTTTGGGCAAAATGGGATGGATTCTGAAGTCTCAGGAGTATTCAAAGAGATGAAGAGGGCAGGTTTTGTCCCTGAGAGGGACACTTTTAACACTCTTATTAGTGCCTATAGCAGGTGTGGTTCTTTTGATCAAGCAATGGCTATCTATAGAAGAATGTTGGATGCCGGGGTGACTCCCGATCTGTCAACTTACAATGCTGTTTTGGCAGCCTTGGCTCGGGGAGGCCTTTGGGAGCAGTCGGAGAGAGTACTTGCCGAAATGAAGGATGGTAGGTGTAAACCTAATGAGTTAACGTATTGTTCTTTACTTCATGCTTATGCCAATGGCAAAGACGTTGGGCGAATGTCTGCACTTGCTGAGGAGATCTATTCTGGCATTATTGAACCTCAAGCTGTGCTTTTGAAGACACTGGTTTTGGTTTGTAGTAAAAGTGATCTTTTGATGGAGACCGAACGTGCTTTCTTGGAACTTAGAAAAAGAGGTTTTTCGCCCGATATAACGACTCTAAATGCCATGGTTTCTATATATGGTAGGAGGAGGATGGTTTCAAAAACAAATGAAATTTTGAACTT
CATAAAGGACAGTGGGTTCACTCCAAGCTTGACAACATACAATAGCTTAATGTACATGTATAGCCGCACCGAGCACTTCGAGAAGTCAGAGGATATCCTAAGGGAAATTATTGGGAAAGGAATGAAGCCTGATATCATTTCGTTTAATACCGTTATTTTCGCCTATTGCCGAAATGGTCGAATGAAAGAGGCCTCGCGGATATTTGCAGAAATGAAGGACTTTGGGCTCGTCCCTGATGTAATTACATATAATACCTTCATTGCAAGCTATGCATCCGATTCAATGTTCATAGAGGCGATCGACGTGGTGCGATATATGATAAAGAATGGATGTAAGCCCAATCAGAATACATACAACTCTCTAGTAGATTGGTTTTGTAAACTTAATCGTCGGGATGAGGCAAGTAGTTTCGTTTCCAACCTTTGCAATCTAGACCCACATGTAACAAAAGAAGAGGAATGCAGGTTGCTGGAGCGTCTTCACAAGAAATGGTCATAG

mRNA sequence

ATGGCGGACAAGGTTGCTCTTCCCCTCCTCCTCCCTAATCCACCGCCCTCCAAACCCCTCTTCCCCGTCTTCCACCACCACCCCCCTTCACCTTCTTCTTCGCCGCCGCCGCCACTCACCTTTCCTCCGCCCCCCCAGCCGCCGCTCTCTTCCTCATCATCTCCCATAGCCCCTCTTCTCCAAGACCTCCTTCCCCACCAACACCCTTCTTCCACTCAACCCCATCTCCCCAAACCCACTTTCAGAACCCGCACACGAATCGGCCGGTCCCGCGACCCGAACCGCGGGAAGCCATGGTCGCACCACCGTCTCTCTACTCAAGGTCAGCGAATTCTCGATTCTTTACTTAACCCGGAATTCGATTCCTCCTCGTTGAATGAAATTTTACTTCAATTGTTTGAAACCAGTCCTGAAGGACTTAATTTCACCTCCAAGTCTGTTTCCTTCGACATTTTGGGGATAATCAAGGGCTTGGTGTTTAGCAAGAAGAATGAATTAGCCTTGCGTGTGTTTGATTTCGTTCGTAATCGGGTGGATTTTGCATCTATTTTGAGTAGCTCTGTGATTGCTGTGATTATTAGTGTACTCGGTAAAGAGGGTCGGGCTTCTTTTGCAGCTTCTCTGCTTCATGAGCTTCGAAATGATGGGGTAAATATTGATATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGTAATGGGAGGTATAGAGAGGCTGTGATGGTTTTTAAGAAACTGGAAGAAGACGGTTGTAGACCAACTTTAATTACCTATAATGTTATCTTGAATGTCTATGGGAAAATGGGAATGCCTTGGAGTAAAATTGCTGCTCTTGTTGATAGTATGAAGAGTTCCGGGGTTGCTCCGGATTTGTATACGTATAATACGCTTATTAGTAGTTGTCGCCGGGGGTCGTTGTATGAAGAAGCTGCAGAGGTTTTTGAGGAGATGAAAGCAGCTGGGTTTAGTCCTGATAAGGTGACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGGCGGCCTAAGGAAGCCATGGAGGTTTTGAAGGAGATGGAAGCAAGCGGGTTTGCACCTAGTATTGTCACTTACAATTCGTTGATATCGGCTTATGCACGGGATGGTTTGTTAGATGAAGCTATGGAGCTAAAAGCACAAATGGTGGAGAAGGGGATTAAGCCTGATGTTTTTACATACACCACATTGTTGTCTGGTTTCGAGAAGACGGGTAAGGATGATTATGCTATGAGAGTATTCGAGGAGATGAGGGTTGCAGGGTGCCAAGCGAATATATGCACCTTCAATGCACTGATTAAGATGCACGGTAACAGGGGTAATTTTGTGGAGATGATGAAGGTTTTTGAAGAAATTAAGAAATGTGAATGTGTGCCTGATATTGTTACTTGGAACACTCTTTTGGCAGTGTTTGGGCAAAATGGGATGGATTCTGAAGTCTCAGGAGTATTCAAAGAGATGAAGAGGGCAGGTTTTGTCCCTGAGAGGGACACTTTTAACACTCTTATTAGTGCCTATAGCAGGTGTGGTTCTTTTGATCAAGCAATGGCTATCTATAGAAGAATGTTGGATGCCGGGGTGACTCCCGATCTGTCAACTTACAATGCTGTTTTGGCAGCCTTGGCTCGGGGAGGCCTTTGGGAGCAGTCGGAGAGAGTACTTGCCGAAATGAAGGATGGTAGGTGTAAACCTAATGAGTTAACGTATTGTTCTTTACTTCATGCTTATGCCAATGGCAAAGACGTTGGGCGAATGTCTGCACTTGCTGAGGAGATCTATTCTGGCATTATTGAACCTCAAGCTGTGCTTTTGAAGACACTGGTTTTGGTTTGTAGTAAAAGTGATCTTTTGATGGAGACCGAACGTGCTTTCTTGGAACTTAGAAAAAGAGGTTTTTCGCCCGATATAACGACTCTAAATGCCATGGTTTCTATATATGGTAGGAGGAGGATGGTTTCAAAAACAAATGAAATTTTGAACTT
CATAAAGGACAGTGGGTTCACTCCAAGCTTGACAACATACAATAGCTTAATGTACATGTATAGCCGCACCGAGCACTTCGAGAAGTCAGAGGATATCCTAAGGGAAATTATTGGGAAAGGAATGAAGCCTGATATCATTTCGTTTAATACCGTTATTTTCGCCTATTGCCGAAATGGTCGAATGAAAGAGGCCTCGCGGATATTTGCAGAAATGAAGGACTTTGGGCTCGTCCCTGATGTAATTACATATAATACCTTCATTGCAAGCTATGCATCCGATTCAATGTTCATAGAGGCGATCGACGTGGTGCGATATATGATAAAGAATGGATGTAAGCCCAATCAGAATACATACAACTCTCTAGTAGATTGGTTTTGTAAACTTAATCGTCGGGATGAGGCAAGTAGTTTCGTTTCCAACCTTTGCAATCTAGACCCACATGTAACAAAAGAAGAGGAATGCAGGTTGCTGGAGCGTCTTCACAAGAAATGGTCATAG

Coding sequence (CDS)

ATGGCGGACAAGGTTGCTCTTCCCCTCCTCCTCCCTAATCCACCGCCCTCCAAACCCCTCTTCCCCGTCTTCCACCACCACCCCCCTTCACCTTCTTCTTCGCCGCCGCCGCCACTCACCTTTCCTCCGCCCCCCCAGCCGCCGCTCTCTTCCTCATCATCTCCCATAGCCCCTCTTCTCCAAGACCTCCTTCCCCACCAACACCCTTCTTCCACTCAACCCCATCTCCCCAAACCCACTTTCAGAACCCGCACACGAATCGGCCGGTCCCGCGACCCGAACCGCGGGAAGCCATGGTCGCACCACCGTCTCTCTACTCAAGGTCAGCGAATTCTCGATTCTTTACTTAACCCGGAATTCGATTCCTCCTCGTTGAATGAAATTTTACTTCAATTGTTTGAAACCAGTCCTGAAGGACTTAATTTCACCTCCAAGTCTGTTTCCTTCGACATTTTGGGGATAATCAAGGGCTTGGTGTTTAGCAAGAAGAATGAATTAGCCTTGCGTGTGTTTGATTTCGTTCGTAATCGGGTGGATTTTGCATCTATTTTGAGTAGCTCTGTGATTGCTGTGATTATTAGTGTACTCGGTAAAGAGGGTCGGGCTTCTTTTGCAGCTTCTCTGCTTCATGAGCTTCGAAATGATGGGGTAAATATTGATATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGTAATGGGAGGTATAGAGAGGCTGTGATGGTTTTTAAGAAACTGGAAGAAGACGGTTGTAGACCAACTTTAATTACCTATAATGTTATCTTGAATGTCTATGGGAAAATGGGAATGCCTTGGAGTAAAATTGCTGCTCTTGTTGATAGTATGAAGAGTTCCGGGGTTGCTCCGGATTTGTATACGTATAATACGCTTATTAGTAGTTGTCGCCGGGGGTCGTTGTATGAAGAAGCTGCAGAGGTTTTTGAGGAGATGAAAGCAGCTGGGTTTAGTCCTGATAAGGTGACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGGCGGCCTAAGGAAGCCATGGAGGTTTTGAAGGAGATGGAAGCAAGCGGGTTTGCACCTAGTATTGTCACTTACAATTCGTTGATATCGGCTTATGCACGGGATGGTTTGTTAGATGAAGCTATGGAGCTAAAAGCACAAATGGTGGAGAAGGGGATTAAGCCTGATGTTTTTACATACACCACATTGTTGTCTGGTTTCGAGAAGACGGGTAAGGATGATTATGCTATGAGAGTATTCGAGGAGATGAGGGTTGCAGGGTGCCAAGCGAATATATGCACCTTCAATGCACTGATTAAGATGCACGGTAACAGGGGTAATTTTGTGGAGATGATGAAGGTTTTTGAAGAAATTAAGAAATGTGAATGTGTGCCTGATATTGTTACTTGGAACACTCTTTTGGCAGTGTTTGGGCAAAATGGGATGGATTCTGAAGTCTCAGGAGTATTCAAAGAGATGAAGAGGGCAGGTTTTGTCCCTGAGAGGGACACTTTTAACACTCTTATTAGTGCCTATAGCAGGTGTGGTTCTTTTGATCAAGCAATGGCTATCTATAGAAGAATGTTGGATGCCGGGGTGACTCCCGATCTGTCAACTTACAATGCTGTTTTGGCAGCCTTGGCTCGGGGAGGCCTTTGGGAGCAGTCGGAGAGAGTACTTGCCGAAATGAAGGATGGTAGGTGTAAACCTAATGAGTTAACGTATTGTTCTTTACTTCATGCTTATGCCAATGGCAAAGACGTTGGGCGAATGTCTGCACTTGCTGAGGAGATCTATTCTGGCATTATTGAACCTCAAGCTGTGCTTTTGAAGACACTGGTTTTGGTTTGTAGTAAAAGTGATCTTTTGATGGAGACCGAACGTGCTTTCTTGGAACTTAGAAAAAGAGGTTTTTCGCCCGATATAACGACTCTAAATGCCATGGTTTCTATATATGGTAGGAGGAGGATGGTTTCAAAAACAAATGAAATTTTGAACTT
CATAAAGGACAGTGGGTTCACTCCAAGCTTGACAACATACAATAGCTTAATGTACATGTATAGCCGCACCGAGCACTTCGAGAAGTCAGAGGATATCCTAAGGGAAATTATTGGGAAAGGAATGAAGCCTGATATCATTTCGTTTAATACCGTTATTTTCGCCTATTGCCGAAATGGTCGAATGAAAGAGGCCTCGCGGATATTTGCAGAAATGAAGGACTTTGGGCTCGTCCCTGATGTAATTACATATAATACCTTCATTGCAAGCTATGCATCCGATTCAATGTTCATAGAGGCGATCGACGTGGTGCGATATATGATAAAGAATGGATGTAAGCCCAATCAGAATACATACAACTCTCTAGTAGATTGGTTTTGTAAACTTAATCGTCGGGATGAGGCAAGTAGTTTCGTTTCCAACCTTTGCAATCTAGACCCACATGTAACAAAAGAAGAGGAATGCAGGTTGCTGGAGCGTCTTCACAAGAAATGGTCATAG

Protein sequence

MADKVALPLLLPNPPPSKPLFPVFHHHPPSPSSSPPPPLTFPPPPQPPLSSSSSPIAPLLQDLLPHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYREAVMVFKKLEEDGCRPTLITYNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLISSCRRGSLYEEAAEVFEEMKAAGFSPDKVTYNALLDVYGKSRRPKEAMEVLKEMEASGFAPSIVTYNSLISAYARDGLLDEAMELKAQMVEKGIKPDVFTYTTLLSGFEKTGKDDYAMRVFEEMRVAGCQANICTFNALIKMHGNRGNFVEMMKVFEEIKKCECVPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERDTFNTLISAYSRCGSFDQAMAIYRRMLDAGVTPDLSTYNAVLAALARGGLWEQSERVLAEMKDGRCKPNELTYCSLLHAYANGKDVGRMSALAEEIYSGIIEPQAVLLKTLVLVCSKSDLLMETERAFLELRKRGFSPDITTLNAMVSIYGRRRMVSKTNEILNFIKDSGFTPSLTTYNSLMYMYSRTEHFEKSEDILREIIGKGMKPDIISFNTVIFAYCRNGRMKEASRIFAEMKDFGLVPDVITYNTFIASYASDSMFIEAIDVVRYMIKNGCKPNQNTYNSLVDWFCKLNRRDEASSFVSNLCNLDPHVTKEEECRLLERLHKKWS
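The protein sequence above can be spot-checked against the coding sequence by translating a few codons with the standard genetic code. This is a small sketch rather than a full pipeline: the names `CODON_TABLE` and `translate` are hypothetical, the two fragments are the first and last 18 nucleotides of the CDS as listed, and the tail fragment is assumed to be in frame.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = "ATGGCGGACAAGGTTGCT"  # first 18 nt of the CDS
tail = "CACAAGAAATGGTCATAG"  # last 18 nt of the CDS (assumed in frame)
print(translate(head))  # MADKVA
print(translate(tail))  # HKKWS*
```

Both fragments agree with the start (MADKVA) and end (HKKWS) of the protein sequence listed here.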
BLAST of Cla97C02G033630 vs. NCBI nr
Match: XP_008455020.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis melo])

HSP 1 Score: 431.8 bits (1109), Expect = 5.3e-117
Identity = 264/305 (86.56%), Positives = 271/305 (88.85%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADKVALPLLLPNP           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    L
Sbjct: 17  MADKVALPLLLPNPPPSKSLFPVFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL 76

Query: 61  QDLLPHQHPSST-QPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
           QDLLPHQHPSS+ QPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE
Sbjct: 77  QDLLPHQHPSSSAQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 136

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FDSSSL+EILLQLFETSP+GLNFTS SVSFDILGIIKGLVF+KKNELAL VFDFVRNR D
Sbjct: 137 FDSSSLDEILLQLFETSPDGLNFTSDSVSFDILGIIKGLVFNKKNELALGVFDFVRNRED 196

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYRE 240
           FASILS+SVIAVIISVLGKEGRASFAASLLHELRNDGV+IDIYAYTSLITAYASNGRYRE
Sbjct: 197 FASILSNSVIAVIISVLGKEGRASFAASLLHELRNDGVHIDIYAYTSLITAYASNGRYRE 256

Query: 241 AVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLI 300
           AVMVFK              NVILNVYGKMGMPWSKI+ALVDSMKSSGV PDLYTYNTLI
Sbjct: 257 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKISALVDSMKSSGVVPDLYTYNTLI 316

Query: 301 SSCRR 305
           SSCRR
Sbjct: 317 SSCRR 321
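
The percentages reported with each HSP follow from the match counts and the alignment length. A minimal sketch, using the counts from the Cucumis melo hit above (264 identical and 271 similar positions over 305 aligned columns); the helper name `percent` is hypothetical and not part of any BLAST tool.

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned, 2)


identity = percent(264, 305)
positives = percent(271, 305)
print(identity, positives)  # 86.56 88.85
```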

BLAST of Cla97C02G033630 vs. NCBI nr
Match: XP_023553783.1 (pentatricopeptide repeat-containing protein At5g02860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 413.3 bits (1061), Expect = 1.9e-111
Identity = 262/305 (85.90%), Positives = 275/305 (90.16%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADKVALPLLLPNP     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI+P+L
Sbjct: 1   MADKVALPLLLPNP-----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXISPIL 60

Query: 61  QDL-LPHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
           QDL LPH+  SS +PH+PK TF++R+RIGRSRDPNRGKPWSHHRLSTQGQRI DSLLNPE
Sbjct: 61  QDLFLPHKDSSSPRPHIPKSTFKSRSRIGRSRDPNRGKPWSHHRLSTQGQRIHDSLLNPE 120

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FD+SSLNEILLQLFETSPEGLNFT +SVS DILGIIKGLVF+KKNELALRVFDFVRNR D
Sbjct: 121 FDASSLNEILLQLFETSPEGLNFTPESVSLDILGIIKGLVFNKKNELALRVFDFVRNRED 180

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYRE 240
           FASILSSSVIAVIISVLGKEGRAS AASLLHELRNDGVNIDIYAYTSLITAYA+NGRYRE
Sbjct: 181 FASILSSSVIAVIISVLGKEGRASSAASLLHELRNDGVNIDIYAYTSLITAYANNGRYRE 240

Query: 241 AVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLI 300
           AVMVFK              NVILNVYGKMGMPWSKIAALVDSMKSSG+APD YTYNTLI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAALVDSMKSSGIAPDSYTYNTLI 300

Query: 301 SSCRR 305
           SSCRR
Sbjct: 301 SSCRR 300

BLAST of Cla97C02G033630 vs. NCBI nr
Match: XP_022952798.1 (pentatricopeptide repeat-containing protein At5g02860 [Cucurbita moschata])

HSP 1 Score: 412.9 bits (1060), Expect = 2.5e-111
Identity = 254/305 (83.28%), Positives = 265/305 (86.89%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADKVALPLLLPNP              XXXXXXXXXXXXXXXXXXXXXXXXXXXI+P+L
Sbjct: 1   MADKVALPLLLPNP-----PPSKPLFPVXXXXXXXXXXXXXXXXXXXXXXXXXXXISPIL 60

Query: 61  QD-LLPHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
           QD LLPH+  SS +PHLPK TF++R RIGRSRDPNRGKPWSHHRLSTQGQRI DSLLNPE
Sbjct: 61  QDLLLPHKDSSSPRPHLPKSTFKSRGRIGRSRDPNRGKPWSHHRLSTQGQRIHDSLLNPE 120

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FD+SSLNEILLQLFETSPEGLNFTS+SVS DILGIIKGLVF+KKNELALRVFDF RNR D
Sbjct: 121 FDASSLNEILLQLFETSPEGLNFTSESVSLDILGIIKGLVFNKKNELALRVFDFFRNRED 180

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYRE 240
           FASILSSSVIAVIISVLGKEGRAS AASLLHELRNDGVNIDIYAYTSLITAYA+NGRYRE
Sbjct: 181 FASILSSSVIAVIISVLGKEGRASSAASLLHELRNDGVNIDIYAYTSLITAYANNGRYRE 240

Query: 241 AVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLI 300
           AVMVFK              NVILNVYGKMGMPWSKIAALVDSMKSSG+APD YTYNTLI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAALVDSMKSSGIAPDSYTYNTLI 300

Query: 301 SSCRR 305
           SSCRR
Sbjct: 301 SSCRR 300

BLAST of Cla97C02G033630 vs. NCBI nr
Match: XP_022972454.1 (pentatricopeptide repeat-containing protein At5g02860 [Cucurbita maxima])

HSP 1 Score: 407.5 bits (1046), Expect = 1.1e-109
Identity = 251/305 (82.30%), Positives = 264/305 (86.56%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADKVALPLLLPNP              XXXXXXXXXXXXXXXXXXXXXXXXXXXI+P+L
Sbjct: 1   MADKVALPLLLPNP-----PPSKPLFPVXXXXXXXXXXXXXXXXXXXXXXXXXXXISPIL 60

Query: 61  QD-LLPHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
           QD LLPH+   S QPHLPK TF++R+RIGRSRDPNRGKPWSHHRLSTQGQRI DSLL+PE
Sbjct: 61  QDLLLPHKDSYSPQPHLPKSTFKSRSRIGRSRDPNRGKPWSHHRLSTQGQRIHDSLLHPE 120

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FD+SSLNEILLQLFETSPEGLNFTS+SVS DI  IIKGLVF+KKNELALRVFDFVRNR D
Sbjct: 121 FDASSLNEILLQLFETSPEGLNFTSESVSLDISAIIKGLVFNKKNELALRVFDFVRNRED 180

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYRE 240
           FASILSSSVIAVIISVLGKEGRAS A+SLLHELRNDGVNIDIYAYTSLITAYA+NGRYRE
Sbjct: 181 FASILSSSVIAVIISVLGKEGRASSASSLLHELRNDGVNIDIYAYTSLITAYANNGRYRE 240

Query: 241 AVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLI 300
           AVMVFK              NVILNVYGKMGMPWSKIAALVDSMKSSG+APD YTYNTLI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAALVDSMKSSGIAPDSYTYNTLI 300

Query: 301 SSCRR 305
           SSCRR
Sbjct: 301 SSCRR 300

BLAST of Cla97C02G033630 vs. NCBI nr
Match: XP_004137089.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis sativus])

HSP 1 Score: 389.0 bits (998), Expect = 3.9e-104
Identity = 203/231 (87.88%), Positives = 209/231 (90.48%), Query Frame = 0

Query: 74  PHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSLNEILLQLF 133
           PHLPKPTFRTRTRIGRS DPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSL+EILLQLF
Sbjct: 73  PHLPKPTFRTRTRIGRSHDPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSLDEILLQLF 132

Query: 134 ETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVDFASILSSSVIAVII 193
           ETS +GLNFTS SVSFDILGIIKGLVF KKNELAL VF FVRNR DFASILS+SV+AVII
Sbjct: 133 ETSSDGLNFTSDSVSFDILGIIKGLVFYKKNELALCVFYFVRNREDFASILSNSVVAVII 192

Query: 194 SVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYREAVMVFKXXXXXXXX 253
           SVLGKEGRASFAASLLH+LRNDGV+IDIYAYTSLITAYASNGRYREAVMVFK        
Sbjct: 193 SVLGKEGRASFAASLLHDLRNDGVHIDIYAYTSLITAYASNGRYREAVMVFKKLEEEGCR 252

Query: 254 XXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLISSCRR 305
                 NVILNVYGKMGMPWSKIA LVDSMKSSGVAPDLYTYNTLISSCRR
Sbjct: 253 PTLITYNVILNVYGKMGMPWSKIAGLVDSMKSSGVAPDLYTYNTLISSCRR 303

BLAST of Cla97C02G033630 vs. TrEMBL
Match: tr|A0A1S3C150|A0A1S3C150_CUCME (pentatricopeptide repeat-containing protein At5g02860 OS=Cucumis melo OX=3656 GN=LOC103495294 PE=4 SV=1)

HSP 1 Score: 431.8 bits (1109), Expect = 3.5e-117
Identity = 264/305 (86.56%), Positives = 271/305 (88.85%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADKVALPLLLPNP           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    L
Sbjct: 17  MADKVALPLLLPNPPPSKSLFPVFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL 76

Query: 61  QDLLPHQHPSST-QPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
           QDLLPHQHPSS+ QPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE
Sbjct: 77  QDLLPHQHPSSSAQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 136

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FDSSSL+EILLQLFETSP+GLNFTS SVSFDILGIIKGLVF+KKNELAL VFDFVRNR D
Sbjct: 137 FDSSSLDEILLQLFETSPDGLNFTSDSVSFDILGIIKGLVFNKKNELALGVFDFVRNRED 196

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYRE 240
           FASILS+SVIAVIISVLGKEGRASFAASLLHELRNDGV+IDIYAYTSLITAYASNGRYRE
Sbjct: 197 FASILSNSVIAVIISVLGKEGRASFAASLLHELRNDGVHIDIYAYTSLITAYASNGRYRE 256

Query: 241 AVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLI 300
           AVMVFK              NVILNVYGKMGMPWSKI+ALVDSMKSSGV PDLYTYNTLI
Sbjct: 257 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKISALVDSMKSSGVVPDLYTYNTLI 316

Query: 301 SSCRR 305
           SSCRR
Sbjct: 317 SSCRR 321

BLAST of Cla97C02G033630 vs. TrEMBL
Match: tr|A0A0A0K4H7|A0A0A0K4H7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G071530 PE=4 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 2.6e-104
Identity = 203/231 (87.88%), Positives = 209/231 (90.48%), Query Frame = 0

Query: 74  PHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSLNEILLQLF 133
           PHLPKPTFRTRTRIGRS DPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSL+EILLQLF
Sbjct: 73  PHLPKPTFRTRTRIGRSHDPNRGKPWSHHRLSTQGQRILDSLLNPEFDSSSLDEILLQLF 132

Query: 134 ETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVDFASILSSSVIAVII 193
           ETS +GLNFTS SVSFDILGIIKGLVF KKNELAL VF FVRNR DFASILS+SV+AVII
Sbjct: 133 ETSSDGLNFTSDSVSFDILGIIKGLVFYKKNELALCVFYFVRNREDFASILSNSVVAVII 192

Query: 194 SVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYREAVMVFKXXXXXXXX 253
           SVLGKEGRASFAASLLH+LRNDGV+IDIYAYTSLITAYASNGRYREAVMVFK        
Sbjct: 193 SVLGKEGRASFAASLLHDLRNDGVHIDIYAYTSLITAYASNGRYREAVMVFKKLEEEGCR 252

Query: 254 XXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLISSCRR 305
                 NVILNVYGKMGMPWSKIA LVDSMKSSGVAPDLYTYNTLISSCRR
Sbjct: 253 PTLITYNVILNVYGKMGMPWSKIAGLVDSMKSSGVAPDLYTYNTLISSCRR 303

BLAST of Cla97C02G033630 vs. TrEMBL
Match: tr|A0A2N9J6C3|A0A2N9J6C3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS60848 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 6.5e-79
Identity = 180/307 (58.63%), Positives = 213/307 (69.38%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MA+KVALPLLLPNP                           XXXXXXXX      I+PLL
Sbjct: 1   MAEKVALPLLLPNP-----------------PPSKPLFPNNXXXXXXXXLPSAPPISPLL 60

Query: 61  QDLL---PHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLN 120
           QDLL   P+    S   H P P  RTR RIG+SRDPNRGKPWSHHRLS +GQ+IL +LL+
Sbjct: 61  QDLLLQSPNTSSHSHSSHSPIP--RTRNRIGKSRDPNRGKPWSHHRLSLKGQQILQTLLD 120

Query: 121 PEFDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNR 180
           P FDS+ L+E+LLQLFE S E L+ + +S+SFD+ GIIKGL F+KK +LAL VF++VRNR
Sbjct: 121 PLFDSAKLDEVLLQLFEPSQEELSSSVESLSFDVSGIIKGLGFNKKCDLALSVFEWVRNR 180

Query: 181 VDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRY 240
            D   IL+ SV+AV+I +LG++GR S AASLLH L  DG++ID+Y YTSLITA ASNGRY
Sbjct: 181 KDCELILNGSVVAVVIGILGRQGRVSSAASLLHSLHKDGIDIDVYGYTSLITACASNGRY 240

Query: 241 REAVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNT 300
           REAV VFK              NVILNVYGKMGMPW+KI  +VD MKS+G+APD YTYNT
Sbjct: 241 REAVTVFKKMEEEGCKPTLITYNVILNVYGKMGMPWNKILDVVDGMKSAGIAPDSYTYNT 288

Query: 301 LISSCRR 305
           LIS CRR
Sbjct: 301 LISCCRR 288


HSP 2 Score: 58.5 bits (140), Expect = 8.1e-05
Identity = 25/36 (69.44%), Positives = 30/36 (83.33%), Query Frame = 0

Query: 796 NRRDEASSFVSNLCNLDPHVTKEEECRLLERLHKKW 832
           +R DEAS FV+NL  LDPH++KEEECRL ER+ KKW
Sbjct: 746 HRHDEASMFVNNLHKLDPHISKEEECRLSERIRKKW 781

BLAST of Cla97C02G033630 vs. TrEMBL
Match: tr|A0A2N9ID35|A0A2N9ID35_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50097 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 6.5e-79
Identity = 180/307 (58.63%), Positives = 213/307 (69.38%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MA+KVALPLLLPNP                           XXXXXXXX      I+PLL
Sbjct: 1   MAEKVALPLLLPNP-----------------PPSKPLFPNNXXXXXXXXLPSAPPISPLL 60

Query: 61  QDLL---PHQHPSSTQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLN 120
           QDLL   P+    S   H P P  RTR RIG+SRDPNRGKPWSHHRLS +GQ+IL +LL+
Sbjct: 61  QDLLLQSPNTSSHSHSSHSPIP--RTRNRIGKSRDPNRGKPWSHHRLSLKGQQILQTLLD 120

Query: 121 PEFDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNR 180
           P FDS+ L+E+LLQLFE S E L+ + +S+SFD+ GIIKGL F+KK +LAL VF++VRNR
Sbjct: 121 PLFDSAKLDEVLLQLFEPSQEELSSSVESLSFDVSGIIKGLGFNKKCDLALSVFEWVRNR 180

Query: 181 VDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGRY 240
            D   IL+ SV+AV+I +LG++GR S AASLLH L  DG++ID+Y YTSLITA ASNGRY
Sbjct: 181 KDCELILNGSVVAVVIGILGRQGRVSSAASLLHSLHKDGIDIDVYGYTSLITACASNGRY 240

Query: 241 REAVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNT 300
           REAV VFK              NVILNVYGKMGMPW+KI  +VD MKS+G+APD YTYNT
Sbjct: 241 REAVTVFKKMEEEGCKPTLITYNVILNVYGKMGMPWNKILDVVDGMKSAGIAPDSYTYNT 288

Query: 301 LISSCRR 305
           LIS CRR
Sbjct: 301 LISCCRR 288

BLAST of Cla97C02G033630 vs. TrEMBL
Match: tr|A0A2I4F7W7|A0A2I4F7W7_9ROSI (pentatricopeptide repeat-containing protein At5g02860 OS=Juglans regia OX=51240 GN=LOC108996347 PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 6.0e-77
Identity = 172/308 (55.84%), Positives = 207/308 (67.21%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MA+K+ALPL+L NP                                         + P L
Sbjct: 1   MAEKLALPLVLANP-----------------PPSRPIFPNIHQSQHHPPKPSAESVTPFL 60

Query: 61  QDLL-PHQHPSS--TQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLN 120
           QDLL  HQ+P+S    P  P P  R R RIG+SRDPNRGKPWSHHRLS +GQ+IL SL++
Sbjct: 61  QDLLINHQNPNSQPLSPQSPIPRTR-RKRIGKSRDPNRGKPWSHHRLSLKGQQILQSLID 120

Query: 121 PEFDSSSLNEILLQLFETSP-EGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRN 180
           P+FDS+ L E+LLQLFE SP E ++ +S+ +S D+LGI+KGL FSKK +LAL VF++VRN
Sbjct: 121 PQFDSAKLGEVLLQLFEPSPEEEVSSSSELLSLDVLGILKGLGFSKKCDLALSVFEWVRN 180

Query: 181 RVDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAYTSLITAYASNGR 240
           R D   IL+ S+IAV+IS+LGK+GR S AASLLH+L   G++ID+YAYTSLITA ASNGR
Sbjct: 181 RKDCELILNGSIIAVVISILGKDGRVSSAASLLHDLHKAGIDIDVYAYTSLITACASNGR 240

Query: 241 YREAVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYN 300
           YREAV VFK              NVILNVYGKMGMPW+KI  LV SMKS+GV PDLYTYN
Sbjct: 241 YREAVKVFKKMGEEGCEPTLITYNVILNVYGKMGMPWNKILDLVASMKSAGVYPDLYTYN 290

Query: 301 TLISSCRR 305
           TLIS CRR
Sbjct: 301 TLISCCRR 290

BLAST of Cla97C02G033630 vs. Swiss-Prot
Match: sp|Q9LYZ9|PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 2.4e-36
Identity = 101/224 (45.09%), Positives = 138/224 (61.61%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADK+ALPLLLP                                   XXXXXXXX+ PLL
Sbjct: 1   MADKLALPLLLP----------CTPSSKPYSHDQNHHISRTPFLTTSXXXXXXXXVEPLL 60

Query: 61  QDLLPHQHPSSTQPHLPKPT-FRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
            D+  HQ+P+S QP   + +  R RTRIG+SRDPN GKPWS+H LS QGQ++L SL+ P 
Sbjct: 61  HDVFLHQNPNSRQPISSQTSRNRNRTRIGKSRDPNLGKPWSYHGLSPQGQQVLRSLIEPN 120

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FDS  L+ +L +LFE   +      +S S ++L  +KGL F KK +LALR FD+   + D
Sbjct: 121 FDSGQLDSVLSELFEPFKD----KPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKD 180

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYA 224
           + S+L +SV+A+IIS+LGKEGR S AA++ + L+ DG ++D+Y+
Sbjct: 181 YQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYS 210

BLAST of Cla97C02G033630 vs. Swiss-Prot
Match: sp|O64624|PP163_ARATH (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 6.6e-18
Identity = 54/162 (33.33%), Positives = 94/162 (58.02%), Query Frame = 0

Query: 144 SKSVSFDILGIIKGLVFSKKNELALRVFDF-VRNRVDFASILSSSVIAVIISVLGKEGRA 203
           S+ +  D++ ++KGL  S   E A+ +F++ V +    A  L   VI + + +LG+E + 
Sbjct: 132 SELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQY 191

Query: 204 SFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYREAVMVFKXXXXXXXXXXXXXXNVI 263
           S AA LL ++      +D+ AYT+++ AY+  G+Y +A+ +F+              NVI
Sbjct: 192 SVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 251

Query: 264 LNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLISSCRR 305
           L+V+GKMG  W KI  ++D M+S G+  D +T +T++S+C R
Sbjct: 252 LDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAR 293

BLAST of Cla97C02G033630 vs. Swiss-Prot
Match: sp|Q9FKC3|PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 53.1 bits (126), Expect = 1.7e-05
Identity = 35/137 (25.55%), Positives = 66/137 (48.18%), Query Frame = 0

Query: 165 ELALRVFDFVRNRVDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAY 224
           E A++VF+ +R ++ +    +  +   +I +LGK  +   A  L  E+ N+G  ++   Y
Sbjct: 131 ESAIQVFELLREQLWYKP--NVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVY 190

Query: 225 TSLITAYASNGRYREAVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMK 284
           T+L++AY+ +GR+  A  + +                IL         + K+  L+  M+
Sbjct: 191 TALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMR 250

Query: 285 SSGVAPDLYTYNTLISS 302
             G+ P+  TYNTLI +
Sbjct: 251 RQGIRPNTITYNTLIDA 265

BLAST of Cla97C02G033630 vs. TAIR10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 155.6 bits (392), Expect = 1.3e-37
Identity = 101/224 (45.09%), Positives = 138/224 (61.61%), Query Frame = 0

Query: 1   MADKVALPLLLPNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIAPLL 60
           MADK+ALPLLLP                                   XXXXXXXX+ PLL
Sbjct: 1   MADKLALPLLLP----------CTPSSKPYSHDQNHHISRTPFLTTSXXXXXXXXVEPLL 60

Query: 61  QDLLPHQHPSSTQPHLPKPT-FRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNPE 120
            D+  HQ+P+S QP   + +  R RTRIG+SRDPN GKPWS+H LS QGQ++L SL+ P 
Sbjct: 61  HDVFLHQNPNSRQPISSQTSRNRNRTRIGKSRDPNLGKPWSYHGLSPQGQQVLRSLIEPN 120

Query: 121 FDSSSLNEILLQLFETSPEGLNFTSKSVSFDILGIIKGLVFSKKNELALRVFDFVRNRVD 180
           FDS  L+ +L +LFE   +      +S S ++L  +KGL F KK +LALR FD+   + D
Sbjct: 121 FDSGQLDSVLSELFEPFKD----KPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKD 180

Query: 181 FASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYA 224
           + S+L +SV+A+IIS+LGKEGR S AA++ + L+ DG ++D+Y+
Sbjct: 181 YQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYS 210

BLAST of Cla97C02G033630 vs. TAIR10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 94.4 bits (233), Expect = 3.6e-19
Identity = 54/162 (33.33%), Positives = 94/162 (58.02%), Query Frame = 0

Query: 144 SKSVSFDILGIIKGLVFSKKNELALRVFDF-VRNRVDFASILSSSVIAVIISVLGKEGRA 203
           S+ +  D++ ++KGL  S   E A+ +F++ V +    A  L   VI + + +LG+E + 
Sbjct: 132 SELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQY 191

Query: 204 SFAASLLHELRNDGVNIDIYAYTSLITAYASNGRYREAVMVFKXXXXXXXXXXXXXXNVI 263
           S AA LL ++      +D+ AYT+++ AY+  G+Y +A+ +F+              NVI
Sbjct: 192 SVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 251

Query: 264 LNVYGKMGMPWSKIAALVDSMKSSGVAPDLYTYNTLISSCRR 305
           L+V+GKMG  W KI  ++D M+S G+  D +T +T++S+C R
Sbjct: 252 LDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAR 293

BLAST of Cla97C02G033630 vs. TAIR10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 53.1 bits (126), Expect = 9.3e-07
Identity = 35/137 (25.55%), Positives = 66/137 (48.18%), Query Frame = 0

Query: 165 ELALRVFDFVRNRVDFASILSSSVIAVIISVLGKEGRASFAASLLHELRNDGVNIDIYAY 224
           E A++VF+ +R ++ +    +  +   +I +LGK  +   A  L  E+ N+G  ++   Y
Sbjct: 131 ESAIQVFELLREQLWYKP--NVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVY 190

Query: 225 TSLITAYASNGRYREAVMVFKXXXXXXXXXXXXXXNVILNVYGKMGMPWSKIAALVDSMK 284
           T+L++AY+ +GR+  A  + +                IL         + K+  L+  M+
Sbjct: 191 TALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMR 250

Query: 285 SSGVAPDLYTYNTLISS 302
             G+ P+  TYNTLI +
Sbjct: 251 RQGIRPNTITYNTLIDA 265

BLAST of Cla97C02G033630 vs. TAIR10
Match: AT5G09450.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 43.5 bits (101), Expect = 7.4e-04
Identity = 27/92 (29.35%), Positives = 48/92 (52.17%), Query Frame = 0

Query: 215 DGVNID---IYAYTSLITAYASNGRYREAVMVFK-XXXXXXXXXXXXXXNVILNVYGKMG 274
           +G++ID      YTSL+ AYA++ +   A  +FK               N ++ +Y  +G
Sbjct: 147 EGLDIDSKTAETYTSLLHAYAASKQTERAEALFKRIIESDSLTFGAITYNEMMTLYMSVG 206

Query: 275 MPWSKIAALVDSMKSSGVAPDLYTYNTLISSC 303
               K+  +++ +K   V+PD++TYN  +SSC
Sbjct: 207 QV-EKVPEVIEVLKQKKVSPDIFTYNLWLSSC 237

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
XP_008455020.1  5.3e-117  86.56         PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis melo]
XP_023553783.1  1.9e-111  85.90         pentatricopeptide repeat-containing protein At5g02860 [Cucurbita pepo subsp. pep...
XP_022952798.1  2.5e-111  83.28         pentatricopeptide repeat-containing protein At5g02860 [Cucurbita moschata]
XP_022972454.1  1.1e-109  82.30         pentatricopeptide repeat-containing protein At5g02860 [Cucurbita maxima]
XP_004137089.1  3.9e-104  87.88         PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis sativu...
TrEMBL
Match Name                      E-value   Identity (%)  Description
tr|A0A1S3C150|A0A1S3C150_CUCME  3.5e-117  86.56         pentatricopeptide repeat-containing protein At5g02860 OS=Cucumis melo OX=3656 GN...
tr|A0A0A0K4H7|A0A0A0K4H7_CUCSA  2.6e-104  87.88         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G071530 PE=4 SV=1
tr|A0A2N9J6C3|A0A2N9J6C3_FAGSY  6.5e-79   58.63         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS60848 PE=4 SV=1
tr|A0A2N9ID35|A0A2N9ID35_FAGSY  6.5e-79   58.63         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50097 PE=4 SV=1
tr|A0A2I4F7W7|A0A2I4F7W7_9ROSI  6.0e-77   55.84         pentatricopeptide repeat-containing protein At5g02860 OS=Juglans regia OX=51240 ...
Swiss-Prot
Match Name             E-value  Identity (%)  Description
sp|Q9LYZ9|PP362_ARATH  2.4e-36  45.09         Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX...
sp|O64624|PP163_ARATH  6.6e-18  33.33         Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
sp|Q9FKC3|PP424_ARATH  1.7e-05  25.55         Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop...
TAIR10
Match Name   E-value  Identity (%)  Description
AT5G02860.1  1.3e-37  45.09         Pentatricopeptide repeat (PPR) superfamily protein
AT2G18940.1  3.6e-19  33.33         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48730.1  9.3e-07  25.55         Pentatricopeptide repeat (PPR) superfamily protein
AT5G09450.1  7.4e-04  29.35         Tetratricopeptide repeat (TPR)-like superfamily protein
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0008568      microtubule-severing ATPase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G033630.1  Cla97C02G033630.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 279..339  e-value: 1.2E-13  score: 50.8
    coord: 207..269  e-value: 8.5E-21  score: 73.7
    coord: 632..687  e-value: 1.5E-6   score: 28.0
    coord: 453..512  e-value: 9.4E-10  score: 38.3
    coord: 384..445  e-value: 6.9E-19  score: 67.6
    coord: 344..372  e-value: 5.1E-6   score: 26.4
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 222..255  e-value: 3.6E-8   score: 31.1
    coord: 539..572  e-value: 1.5E-6   score: 26.0
    coord: 468..500  e-value: 1.4E-6   score: 26.1
    coord: 328..361  e-value: 1.6E-9   score: 35.3
    coord: 504..536  e-value: 4.1E-9   score: 34.1
    coord: 293..326  e-value: 9.0E-10  score: 36.1
    coord: 748..781  e-value: 9.3E-8   score: 29.8
    coord: 713..747  e-value: 7.8E-10  score: 36.3
    coord: 433..467  e-value: 1.5E-7   score: 29.2
    coord: 398..431  e-value: 6.0E-6   score: 24.1
    coord: 679..712  e-value: 1.1E-5   score: 23.3
    coord: 257..291  e-value: 2.5E-5   score: 22.1
    coord: 363..397  e-value: 3.9E-10  score: 37.3
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 776..805  e-value: 1.9E-6   score: 27.4
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 710..758  e-value: 1.9E-14  score: 53.4
    coord: 535..584  e-value: 1.2E-10  score: 41.3
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 501..535  score: 12.891
    coord: 220..254  score: 12.858
    coord: 536..570  score: 11.849
    coord: 255..290  score: 10.446
    coord: 571..605  score: 7.213
    coord: 396..430  score: 11.729
    coord: 746..780  score: 11.575
    coord: 606..640  score: 7.3
    coord: 711..745  score: 13.735
    coord: 361..395  score: 13.943
    coord: 185..219  score: 7.202
    coord: 431..465  score: 10.424
    coord: 466..500  score: 11.992
    coord: 781..815  score: 7.487
    coord: 676..710  score: 10.852
    coord: 291..325  score: 13.187
    coord: 641..675  score: 9.405
    coord: 326..360  score: 13.702
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 498..592  e-value: 2.7E-28  score: 100.5
    coord: 141..272  e-value: 7.5E-22  score: 79.5
    coord: 273..424  e-value: 1.7E-52  score: 180.6
    coord: 662..830  e-value: 8.3E-41  score: 142.3
    coord: 425..497  e-value: 1.7E-16  score: 62.3
None  No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction
    coord: 85..99
    coord: 65..79
    coord: 1..103
    coord: 9..52
None  No IPR available  PANTHER  PTHR24015:SF308  SUBFAMILY NOT NAMED
    coord: 1..644
    coord: 730..801
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 1..644
    coord: 730..801
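
The PROSITE PS51375 coordinates listed above appear in no particular order, which makes it hard to see how the repeats are arranged along the protein. The sketch below sorts them and groups intervals that follow each other without a gap. The coordinates are copied from the PROSITE rows of the table; the helper `tandem_runs` is a hypothetical name introduced here for illustration and is not part of InterPro or the database.

```python
# PS51375 (PPR) coordinates in protein residues, copied from the PROSITE rows above.
ppr_coords = [
    (501, 535), (220, 254), (536, 570), (255, 290), (571, 605), (396, 430),
    (746, 780), (606, 640), (711, 745), (361, 395), (185, 219), (431, 465),
    (466, 500), (781, 815), (676, 710), (291, 325), (641, 675), (326, 360),
]


def tandem_runs(coords):
    """Group intervals into runs in which each interval starts one residue
    after the previous one ends. Returns (first, last, count) per run."""
    runs = []
    for start, end in sorted(coords):
        if runs and start == runs[-1][1] + 1:
            first, _, count = runs[-1]
            runs[-1] = (first, end, count + 1)
        else:
            runs.append((start, end, 1))
    return runs


runs = tandem_runs(ppr_coords)
for first, last, count in runs:
    print(f"{count} repeats spanning {first}..{last}")
# prints "18 repeats spanning 185..815"
```

Under this grouping, the 18 PROSITE matches form a single uninterrupted array from residue 185 to residue 815.
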

The following gene(s) are paralogous to this gene:

None