Cla97C02G041510 (gene) Watermelon (97103) v2

Name: Cla97C02G041510
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr02 : 29452036 .. 29454054 (+)
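
The location can be turned into a feature length with a small calculation. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(29452036, 29454054)
codons = length // 3          # number of complete codons if the span is all coding
residues = codons - 1         # minus the stop codon

print(length, length % 3, residues)
```

Under that assumption the span is a whole number of codons, and the residue count agrees with the 672 positions covered by the full-length BLAST alignments reported further down.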
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAATTCAAATGGGTATTTCAAAAACTGAGCTCACGTCTTCCCTCTTGGGTCTCTTCTCTAACCTTCCCTCTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTGAAACATGTAAACCCAAGCTACCTTCTATCCATTTGTGGAAGAGAAGGGCATCTTCATTTGGGTTCTTCTCTCCATGCCTCCATCTTCAAGAGGTTCGAGCTCTCCAACCATGATCATGGGGTCGTCATAATGAATTCTCTCATCTCCATGTACGAGAGGTGTGGTAAGTTGCCCGATGCCATCAAGGTGTTTGACGAAATGCTCACAAGAGATACTATTTCATGGAACGCATTGATTGGTGGGTTTATGAGAAATGGGGAGTTTTGTGCTGGTTTTAGCTATTTTAAGGCTATGTGTTTAGTTGGTGATTGTAAATTTGACGAAGCTACTTTGACGATGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGTATTATTAAAATGATGCATGGTTTGGCGTTTCTGAGTGGGTATGAACGAGAAATTACCGTGGGAAATGCTCTGATTAGTTCATATTTTAAATATGGATGTGTTGATTTGGGGATGCAAGTTTTTTATGGGATGGGGGAGAGAAATGTGATTACTTGGACAGCTGTGATCTCTGGTTTGGCTCAAAATGGGCGTCATGAGCACAGCCTGAAGCTGTTTAGGGAGATGATGAGTTGTGGGTCTGTGGAGCCAAATTTTTTAACTTATTTGGGTTTACTCACTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGGAAGTGCCCTGATGGATATGTACTCAAAATCTGGAAGAATTGGAGATGCTTGGAAGATTTTTGAGTCGGCTGAGGAATTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAGATGGGGATCAAGATTGACGAAAATGTCATTTCAGCTGTTCTTGGGGTGTTTGGTGCTGAGACATCTTTGAGATTGGGTCAACAAGTTCACTCGTTTGTTGTCAAGAAGAACTTTAGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCACTGGATGAGTCAGTGAAGGTCTTTGATAGGATGCGGGAGAGGAACTCGGTGACATGGAACTCCATGATTGCAGGGTTTGCCCGACATGGAGATGGCTTGAAAGCTCTACACCTTTATGAGAATATGAAACTGGAAGATGCAAAGCCTACCGACGTCACGTTTCTATCGTTACTCCATGCTTGTAGCCATGTTGGGTTACTAAAAAAAGGAATGGAATTCCTCGAATCAATGACAAAAGATCACGGGATGAATCCAAGGAGCGAACATTATGCTTGTGTTGTAGACATGTTGGGTAGGGCAGGATTGCTGTCTGAAGCTAAAAACTTCATTGAGAAACTACCTGAACAGCCAGGTTTACGTGTGTGGCAGGCGTTGCTCGGTGCCTGCAGCCTCTATGGTGATTCTGAAACAGGGAAGTATGCAGCTGAGCATCTGTTTTCAGAAACTCCGCATAGTCCGGTCCCATATGTTCTGTTAGCCAACATATATTCTTCTAAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGAATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCTGAGATCATTTATGGAGTTTTGATGGAGCTATTTGTACTCATGGTAGATGAAGGATATGTACCAGATAAGAAGTTCATCCTCTACTGCTTGGATGATGACAGGAGGGATCCAATCGATAACGGTTGTACTAACCGTCAAAATGTCATAGAAAC
TGAAGTTGTTTGGGAGTAG

mRNA sequence

ATGAAATTCAAATGGGTATTTCAAAAACTGAGCTCACGTCTTCCCTCTTGGGTCTCTTCTCTAACCTTCCCTCTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTGAAACATGTAAACCCAAGCTACCTTCTATCCATTTGTGGAAGAGAAGGGCATCTTCATTTGGGTTCTTCTCTCCATGCCTCCATCTTCAAGAGGTTCGAGCTCTCCAACCATGATCATGGGGTCGTCATAATGAATTCTCTCATCTCCATGTACGAGAGGTGTGGTAAGTTGCCCGATGCCATCAAGGTGTTTGACGAAATGCTCACAAGAGATACTATTTCATGGAACGCATTGATTGGTGGGTTTATGAGAAATGGGGAGTTTTGTGCTGGTTTTAGCTATTTTAAGGCTATGTGTTTAGTTGGTGATTGTAAATTTGACGAAGCTACTTTGACGATGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGTATTATTAAAATGATGCATGGTTTGGCGTTTCTGAGTGGGTATGAACGAGAAATTACCGTGGGAAATGCTCTGATTAGTTCATATTTTAAATATGGATGTGTTGATTTGGGGATGCAAGTTTTTTATGGGATGGGGGAGAGAAATGTGATTACTTGGACAGCTGTGATCTCTGGTTTGGCTCAAAATGGGCGTCATGAGCACAGCCTGAAGCTGTTTAGGGAGATGATGAGTTGTGGGTCTGTGGAGCCAAATTTTTTAACTTATTTGGGTTTACTCACTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGGAAGTGCCCTGATGGATATGTACTCAAAATCTGGAAGAATTGGAGATGCTTGGAAGATTTTTGAGTCGGCTGAGGAATTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAGATGGGGATCAAGATTGACGAAAATGTCATTTCAGCTGTTCTTGGGGTGTTTGGTGCTGAGACATCTTTGAGATTGGGTCAACAAGTTCACTCGTTTGTTGTCAAGAAGAACTTTAGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCACTGGATGAGTCAGTGAAGGTCTTTGATAGGATGCGGGAGAGGAACTCGGTGACATGGAACTCCATGATTGCAGGGTTTGCCCGACATGGAGATGGCTTGAAAGCTCTACACCTTTATGAGAATATGAAACTGGAAGATGCAAAGCCTACCGACGTCACGTTTCTATCGTTACTCCATGCTTGTAGCCATGTTGGGTTACTAAAAAAAGGAATGGAATTCCTCGAATCAATGACAAAAGATCACGGGATGAATCCAAGGAGCGAACATTATGCTTGTGTTGTAGACATGTTGGGTAGGGCAGGATTGCTGTCTGAAGCTAAAAACTTCATTGAGAAACTACCTGAACAGCCAGGTTTACGTGTGTGGCAGGCGTTGCTCGGTGCCTGCAGCCTCTATGGTGATTCTGAAACAGGGAAGTATGCAGCTGAGCATCTGTTTTCAGAAACTCCGCATAGTCCGGTCCCATATGTTCTGTTAGCCAACATATATTCTTCTAAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGAATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCTGAGATCATTTATGGAGTTTTGATGGAGCTATTTGTACTCATGGTAGATGAAGGATATGTACCAGATAAGAAGTTCATCCTCTACTGCTTGGATGATGACAGGAGGGATCCAATCGATAACGGTTGTACTAACCGTCAAAATGTCATAGAAAC
TGAAGTTGTTTGGGAGTAG

Coding sequence (CDS)

ATGAAATTCAAATGGGTATTTCAAAAACTGAGCTCACGTCTTCCCTCTTGGGTCTCTTCTCTAACCTTCCCTCTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTGAAACATGTAAACCCAAGCTACCTTCTATCCATTTGTGGAAGAGAAGGGCATCTTCATTTGGGTTCTTCTCTCCATGCCTCCATCTTCAAGAGGTTCGAGCTCTCCAACCATGATCATGGGGTCGTCATAATGAATTCTCTCATCTCCATGTACGAGAGGTGTGGTAAGTTGCCCGATGCCATCAAGGTGTTTGACGAAATGCTCACAAGAGATACTATTTCATGGAACGCATTGATTGGTGGGTTTATGAGAAATGGGGAGTTTTGTGCTGGTTTTAGCTATTTTAAGGCTATGTGTTTAGTTGGTGATTGTAAATTTGACGAAGCTACTTTGACGATGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGTATTATTAAAATGATGCATGGTTTGGCGTTTCTGAGTGGGTATGAACGAGAAATTACCGTGGGAAATGCTCTGATTAGTTCATATTTTAAATATGGATGTGTTGATTTGGGGATGCAAGTTTTTTATGGGATGGGGGAGAGAAATGTGATTACTTGGACAGCTGTGATCTCTGGTTTGGCTCAAAATGGGCGTCATGAGCACAGCCTGAAGCTGTTTAGGGAGATGATGAGTTGTGGGTCTGTGGAGCCAAATTTTTTAACTTATTTGGGTTTACTCACTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGGAAGTGCCCTGATGGATATGTACTCAAAATCTGGAAGAATTGGAGATGCTTGGAAGATTTTTGAGTCGGCTGAGGAATTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAGATGGGGATCAAGATTGACGAAAATGTCATTTCAGCTGTTCTTGGGGTGTTTGGTGCTGAGACATCTTTGAGATTGGGTCAACAAGTTCACTCGTTTGTTGTCAAGAAGAACTTTAGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCACTGGATGAGTCAGTGAAGGTCTTTGATAGGATGCGGGAGAGGAACTCGGTGACATGGAACTCCATGATTGCAGGGTTTGCCCGACATGGAGATGGCTTGAAAGCTCTACACCTTTATGAGAATATGAAACTGGAAGATGCAAAGCCTACCGACGTCACGTTTCTATCGTTACTCCATGCTTGTAGCCATGTTGGGTTACTAAAAAAAGGAATGGAATTCCTCGAATCAATGACAAAAGATCACGGGATGAATCCAAGGAGCGAACATTATGCTTGTGTTGTAGACATGTTGGGTAGGGCAGGATTGCTGTCTGAAGCTAAAAACTTCATTGAGAAACTACCTGAACAGCCAGGTTTACGTGTGTGGCAGGCGTTGCTCGGTGCCTGCAGCCTCTATGGTGATTCTGAAACAGGGAAGTATGCAGCTGAGCATCTGTTTTCAGAAACTCCGCATAGTCCGGTCCCATATGTTCTGTTAGCCAACATATATTCTTCTAAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGAATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCTGAGATCATTTATGGAGTTTTGATGGAGCTATTTGTACTCATGGTAGATGAAGGATATGTACCAGATAAGAAGTTCATCCTCTACTGCTTGGATGATGACAGGAGGGATCCAATCGATAACGGTTGTACTAACCGTCAAAATGTCATAGAAAC
TGAAGTTGTTTGGGAGTAG

Protein sequence

MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGHLHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPIDNGCTNRQNVIETEVVWE
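
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; `CODON_TABLE` and `translate` are illustrative names, and only short fragments copied from the start and end of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":          # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 15 bases of the CDS shown on this page.
print(translate("ATGAAATTCAAATGGGTA"))   # start of the protein
print(translate("GTTGTTTGGGAGTAG"))      # end of the protein, before the TAG stop
```

The two fragments translate to the first six and last four residues of the protein sequence listed here.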
BLAST of Cla97C02G041510 vs. NCBI nr
Match: KGN60913.1 (hypothetical protein Csa_2G022820 [Cucumis sativus])

HSP 1 Score: 1214.5 bits (3141), Expect = 0.0e+00
Identity = 592/672 (88.10%), Positives = 625/672 (93.01%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQK SS LPSWV+SL  P RNQFHQNPF ETSSTFVL H++PS+LLSICGREG+
Sbjct: 1   MKLKWVFQKRSSHLPSWVTSLISPFRNQFHQNPFPETSSTFVLNHLDPSFLLSICGREGN 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           LHLGSSLHASI K FELSNH +GVVIMNSLISMY+RCGKLPDA+KVFDEM+TRDTISWNA
Sbjct: 61  LHLGSSLHASIIKSFELSNHYNGVVIMNSLISMYDRCGKLPDAVKVFDEMITRDTISWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGF+RNG+F AGFSYFKAMCLVGDC+FD+ATLT ILSACDGLE C IIKMMHGLAFLS
Sbjct: 121 LIGGFVRNGKFFAGFSYFKAMCLVGDCRFDKATLTTILSACDGLEFCWIIKMMHGLAFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GY +EITVGNALISSYFK GCV LGMQVFY MGERNVITWTAVISGLAQNG HEHSLKLF
Sbjct: 181 GYGQEITVGNALISSYFKCGCVGLGMQVFYEMGERNVITWTAVISGLAQNGYHEHSLKLF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           +EMMS GSVEPN LTYL LLTACSGLEAL+EGCQIHGLI+KLGIQSDLCIGSALMDMYSK
Sbjct: 241 KEMMSYGSVEPNSLTYLSLLTACSGLEALKEGCQIHGLIMKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
           SGRIG+AWKIFE AEE DMVSLTVILAGFT NGCEEEAIQIFLKMLKMGI+ID NV+S V
Sbjct: 301 SGRIGEAWKIFELAEELDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIEIDGNVVSVV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSFVVKKNF CNPFVSNGLINMYSKCGALDES+KVFDRMRERNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKKNFICNPFVSNGLINMYSKCGALDESMKVFDRMRERNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGD LKAL LYE+M+LE AKPTDVTFLSLLHACSH GL+KKGMEFL+S
Sbjct: 421 VTWNSMIAAFARHGDALKALQLYEDMQLEGAKPTDVTFLSLLHACSHAGLVKKGMEFLKS 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDHGMNPRSEH+ACVVDMLGRAG+LSEA+NFIEKLPEQPGL VWQALLGACSLYGDS+
Sbjct: 481 MTKDHGMNPRSEHHACVVDMLGRAGMLSEARNFIEKLPEQPGLLVWQALLGACSLYGDSK 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP SPVPYVLLANIYSS+GNWKERARTIRKMKEVG AKETGISWIEID
Sbjct: 541 IGKYAAEHLFSETPDSPVPYVLLANIYSSEGNWKERARTIRKMKEVGTAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPIDNGCTN 660
           KKVHSFTVGDKMHPQ E+IYGVL ELF+LMVDEGYVPDKKFILY LDDDRRDPI NG   
Sbjct: 601 KKVHSFTVGDKMHPQTEMIYGVLWELFILMVDEGYVPDKKFILYYLDDDRRDPIHNGQAT 660

Query: 661 RQNVIETEVVWE 673
            QN IETEVVWE
Sbjct: 661 HQNAIETEVVWE 672
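
The percentages in an HSP summary line are the ratio of matching positions to alignment length. The sketch below parses such a line and recomputes them; the regular expression and the `check_hsp_stats` name are assumptions made for illustration and are not part of any BLAST tool.

```python
import re

# Matches "<label> = <matches>/<length> (<percent>%)" for the identity and
# positive-match fields of an HSP summary line.
HSP_FIELD = re.compile(r"(Identity|Pos\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_stats(line: str) -> dict:
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, matches, length, pct in HSP_FIELD.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        recomputed = round(100 * int(matches) / int(length), 2)
        stats[key] = (int(matches), int(length), float(pct), recomputed)
    return stats


# Summary line of the hit against KGN60913.1 (Cucumis sativus).
line = "Identity = 592/672 (88.10%), Positives = 625/672 (93.01%), Query Frame = 0"
for key, (m, n, reported, recomputed) in check_hsp_stats(line).items():
    print(key, m, n, reported, recomputed)
```

For this hit the recomputed values agree with the reported ones to two decimal places.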

BLAST of Cla97C02G041510 vs. NCBI nr
Match: XP_022946256.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita moschata])

HSP 1 Score: 1209.1 bits (3127), Expect = 0.0e+00
Identity = 590/672 (87.80%), Positives = 625/672 (93.01%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQKLSS+LPSW +S   P RNQFHQNPFAETSSTFVL HV+PS+LLS+CGR+GH
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           L+LGSSLHASI K FELSNH++GVVIMNSLISMYERCGKLPDA+KVFDEM TRDT+SWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGFMRNGEFCAGFSYFKAMCLVGDCKFD+ATLT ILSACDGLE+CCIIKMMHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GY++EITVGNALISSYFK GCV  G QVFY M ERNVITWTAVISGLAQNG HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPN LTYL LLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
            G IGDAWKIFESAEE DMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENV+SAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGDGLKALHLYENMKLE AKPTD+TFLSLLHACSHVGL+ KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDH MNPRSEHYACVVDMLGRAGLLSEA+ FIEKLPEQPGL VWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP+S VPYVLLANIYSS+GNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLD-DDRRDPI--DNG 660
           KKVHSFTVGDK HPQA+IIYGVLM+LFV MVDEGYVPDKKFIL+ LD DD+++PI  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 CTNRQNVIETEV 670
             N  N+  + V
Sbjct: 661 RVNDPNLFPSLV 672

BLAST of Cla97C02G041510 vs. NCBI nr
Match: XP_022946255.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1208.0 bits (3124), Expect = 0.0e+00
Identity = 586/657 (89.19%), Positives = 619/657 (94.22%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQKLSS+LPSW +S   P RNQFHQNPFAETSSTFVL HV+PS+LLS+CGR+GH
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           L+LGSSLHASI K FELSNH++GVVIMNSLISMYERCGKLPDA+KVFDEM TRDT+SWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGFMRNGEFCAGFSYFKAMCLVGDCKFD+ATLT ILSACDGLE+CCIIKMMHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GY++EITVGNALISSYFK GCV  G QVFY M ERNVITWTAVISGLAQNG HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPN LTYL LLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
            G IGDAWKIFESAEE DMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENV+SAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGDGLKALHLYENMKLE AKPTD+TFLSLLHACSHVGL+ KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDH MNPRSEHYACVVDMLGRAGLLSEA+ FIEKLPEQPGL VWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP+S VPYVLLANIYSS+GNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLD-DDRRDPIDN 657
           KKVHSFTVGDK HPQA+IIYGVLM+LFV MVDEGYVPDKKFIL+ LD DD+++PIDN
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDN 657

BLAST of Cla97C02G041510 vs. NCBI nr
Match: XP_022946254.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1208.0 bits (3124), Expect = 0.0e+00
Identity = 586/657 (89.19%), Positives = 619/657 (94.22%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQKLSS+LPSW +S   P RNQFHQNPFAETSSTFVL HV+PS+LLS+CGR+GH
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           L+LGSSLHASI K FELSNH++GVVIMNSLISMYERCGKLPDA+KVFDEM TRDT+SWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGFMRNGEFCAGFSYFKAMCLVGDCKFD+ATLT ILSACDGLE+CCIIKMMHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GY++EITVGNALISSYFK GCV  G QVFY M ERNVITWTAVISGLAQNG HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPN LTYL LLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
            G IGDAWKIFESAEE DMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENV+SAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGDGLKALHLYENMKLE AKPTD+TFLSLLHACSHVGL+ KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDH MNPRSEHYACVVDMLGRAGLLSEA+ FIEKLPEQPGL VWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP+S VPYVLLANIYSS+GNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLD-DDRRDPIDN 657
           KKVHSFTVGDK HPQA+IIYGVLM+LFV MVDEGYVPDKKFIL+ LD DD+++PIDN
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDN 657

BLAST of Cla97C02G041510 vs. NCBI nr
Match: XP_022999024.1 (pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima])

HSP 1 Score: 1198.7 bits (3100), Expect = 0.0e+00
Identity = 582/657 (88.58%), Positives = 615/657 (93.61%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQ+LSS+LPSW SS   P RNQFHQNPFAETSSTFVL HV+PS+LLS+CGR+GH
Sbjct: 1   MKLKWVFQRLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           L+LGSSLHASI K FELSNH++GVVIMNSLISMYERCGKLPDA+KVFDEM TRDT+SWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGFMRNGEF AGFSYFKAMCLVGDCKFD+ATLT ILSACDG E+CCII+MMHGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGSEMCCIIEMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GYE+EITVGNALISSYFK GCV  G Q+FY M ERNVITWTAVISGLAQNG HEHSL+LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGRQLFYEMEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REMMSCGSVEPN LTYL LLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
            G IGDAWKIFESAEE DMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+ID NV+SAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDANVVSAV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGDG KALHLYENMKLE AKPTD+TFLSLLHACSHVGL+ KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDH MNPRSEHYACVVDMLGRAGLLSEA+ FIEKLPEQPGL VWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP+S VPYVLLANIYSS+GNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLD-DDRRDPIDN 657
           KKVHSFTVGDK HPQA+IIYGVLM+LFVLMVDEGYVPDK FIL+ LD DD+++PIDN
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVLMVDEGYVPDKNFILFYLDPDDKKEPIDN 657

BLAST of Cla97C02G041510 vs. TrEMBL
Match: tr|A0A0A0LGC8|A0A0A0LGC8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G022820 PE=4 SV=1)

HSP 1 Score: 1214.5 bits (3141), Expect = 0.0e+00
Identity = 592/672 (88.10%), Positives = 625/672 (93.01%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQK SS LPSWV+SL  P RNQFHQNPF ETSSTFVL H++PS+LLSICGREG+
Sbjct: 1   MKLKWVFQKRSSHLPSWVTSLISPFRNQFHQNPFPETSSTFVLNHLDPSFLLSICGREGN 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           LHLGSSLHASI K FELSNH +GVVIMNSLISMY+RCGKLPDA+KVFDEM+TRDTISWNA
Sbjct: 61  LHLGSSLHASIIKSFELSNHYNGVVIMNSLISMYDRCGKLPDAVKVFDEMITRDTISWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGF+RNG+F AGFSYFKAMCLVGDC+FD+ATLT ILSACDGLE C IIKMMHGLAFLS
Sbjct: 121 LIGGFVRNGKFFAGFSYFKAMCLVGDCRFDKATLTTILSACDGLEFCWIIKMMHGLAFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           GY +EITVGNALISSYFK GCV LGMQVFY MGERNVITWTAVISGLAQNG HEHSLKLF
Sbjct: 181 GYGQEITVGNALISSYFKCGCVGLGMQVFYEMGERNVITWTAVISGLAQNGYHEHSLKLF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           +EMMS GSVEPN LTYL LLTACSGLEAL+EGCQIHGLI+KLGIQSDLCIGSALMDMYSK
Sbjct: 241 KEMMSYGSVEPNSLTYLSLLTACSGLEALKEGCQIHGLIMKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
           SGRIG+AWKIFE AEE DMVSLTVILAGFT NGCEEEAIQIFLKMLKMGI+ID NV+S V
Sbjct: 301 SGRIGEAWKIFELAEELDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIEIDGNVVSVV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSFVVKKNF CNPFVSNGLINMYSKCGALDES+KVFDRMRERNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKKNFICNPFVSNGLINMYSKCGALDESMKVFDRMRERNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGD LKAL LYE+M+LE AKPTDVTFLSLLHACSH GL+KKGMEFL+S
Sbjct: 421 VTWNSMIAAFARHGDALKALQLYEDMQLEGAKPTDVTFLSLLHACSHAGLVKKGMEFLKS 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDHGMNPRSEH+ACVVDMLGRAG+LSEA+NFIEKLPEQPGL VWQALLGACSLYGDS+
Sbjct: 481 MTKDHGMNPRSEHHACVVDMLGRAGMLSEARNFIEKLPEQPGLLVWQALLGACSLYGDSK 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAAEHLFSETP SPVPYVLLANIYSS+GNWKERARTIRKMKEVG AKETGISWIEID
Sbjct: 541 IGKYAAEHLFSETPDSPVPYVLLANIYSSEGNWKERARTIRKMKEVGTAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPIDNGCTN 660
           KKVHSFTVGDKMHPQ E+IYGVL ELF+LMVDEGYVPDKKFILY LDDDRRDPI NG   
Sbjct: 601 KKVHSFTVGDKMHPQTEMIYGVLWELFILMVDEGYVPDKKFILYYLDDDRRDPIHNGQAT 660

Query: 661 RQNVIETEVVWE 673
            QN IETEVVWE
Sbjct: 661 HQNAIETEVVWE 672

BLAST of Cla97C02G041510 vs. TrEMBL
Match: tr|A0A1S4E0D4|A0A1S4E0D4_CUCME (pentatricopeptide repeat-containing protein At3g05340 OS=Cucumis melo OX=3656 GN=LOC103494955 PE=4 SV=1)

HSP 1 Score: 1186.4 bits (3068), Expect = 0.0e+00
Identity = 582/663 (87.78%), Positives = 613/663 (92.46%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWVFQK SS LPS V+SL FP RNQFHQNPFAETSSTFVL H++ S+LLSICGREG+
Sbjct: 1   MKLKWVFQKSSSHLPSLVTSLIFPFRNQFHQNPFAETSSTFVLNHLDVSFLLSICGREGN 60

Query: 61  LHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNA 120
           LHLGSSLHASI K FE SNH +GVVIMNSLISMY+RCGKL DA+KVFDEMLTRDTISWNA
Sbjct: 61  LHLGSSLHASIIKSFEPSNHYNGVVIMNSLISMYDRCGKLSDAVKVFDEMLTRDTISWNA 120

Query: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLS 180
           LIGGF+RNG+F AGFSYFKAMCLVGDCKFD+ATLT ILSACDGLE CCIIKMMHGLAFLS
Sbjct: 121 LIGGFVRNGKFFAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEFCCIIKMMHGLAFLS 180

Query: 181 GYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLF 240
           G+ +EITVGNAL+SSY K GCV LGMQVF  MGERNVITWTAVISGLA+NG HEHSLKLF
Sbjct: 181 GFGQEITVGNALVSSYLKCGCVGLGMQVFDEMGERNVITWTAVISGLARNGHHEHSLKLF 240

Query: 241 REMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           +EMMS GSVEPN LTYL LLTACSGLEAL+EGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 KEMMSYGSVEPNSLTYLSLLTACSGLEALKEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAV 360
           SGRIG+AWKIFESAEE DMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+ID NV+S V
Sbjct: 301 SGRIGEAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDGNVVSVV 360

Query: 361 LGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNS 420
           LGVFGA+TSLRLGQQVHSFVVKKNF CNPFVSNGLINMYSKCGALDES+KVFDRMRERNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKKNFICNPFVSNGLINMYSKCGALDESMKVFDRMRERNS 420

Query: 421 VTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLES 480
           VTWNSMIA FARHGD  KAL LYENM+LE AKPTDVTFLSLLHACSH GL+KKGMEFL+S
Sbjct: 421 VTWNSMIAAFARHGDASKALQLYENMQLEGAKPTDVTFLSLLHACSHAGLVKKGMEFLKS 480

Query: 481 MTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSE 540
           MTKDHGMNPRSEHYACVVDMLGRAG+LSEA+NFIEKLPEQPGL VWQALLGACSLYGDSE
Sbjct: 481 MTKDHGMNPRSEHYACVVDMLGRAGMLSEARNFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 TGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEID 600
            GKYAA+HLF ETPHS VPYVLLANIYSS+GNWKERARTIR+MKEVG AKETGISWIEID
Sbjct: 541 MGKYAADHLFLETPHSTVPYVLLANIYSSEGNWKERARTIRRMKEVGTAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPIDNGCTN 660
           KKVHSFTVGDKMHPQ EIIYGVL ELFVLMVDEGYVPDKKFILY LDDDRRDPI N   +
Sbjct: 601 KKVHSFTVGDKMHPQTEIIYGVLTELFVLMVDEGYVPDKKFILYYLDDDRRDPIHNDHND 660

Query: 661 RQN 664
             N
Sbjct: 661 TSN 663

BLAST of Cla97C02G041510 vs. TrEMBL
Match: tr|A0A2P5CFM5|A0A2P5CFM5_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_286550 PE=4 SV=1)

HSP 1 Score: 867.5 bits (2240), Expect = 2.0e-248
Identity = 434/655 (66.26%), Positives = 512/655 (78.17%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           M  +WVF  L++RLPSW S+L  P + + + NP ++T S FVL HV+ S +L  CGREGH
Sbjct: 1   MNSRWVFLSLNTRLPSWASTLITPFKTKIYDNPSSKT-SRFVLNHVDISRILPHCGREGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHG------VVIMNSLISMYERCGKLPDAIKVFDEMLTRD 120
            HL SS+HASIFK FE  + ++G      +V+ NSL+S Y +   L +A+K+FDEM  +D
Sbjct: 61  FHLCSSIHASIFKHFEFFDSENGGILRNALVVWNSLLSAYSKRDALSNALKLFDEMPMKD 120

Query: 121 TISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMH 180
           T+SWN LI  F+RNG+F  GF YFK M  +G C+FD+ATLT ILSA DG E  C+ KM+H
Sbjct: 121 TVSWNTLISAFLRNGQFDMGFGYFKRMRELGFCRFDKATLTSILSALDGPEFLCVNKMIH 180

Query: 181 GLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHE 240
           GL F SGYE E  VGN+LI+SYFK GC  L  +VF  M ERNVITWTA+ISGLAQN  +E
Sbjct: 181 GLVFQSGYELETAVGNSLITSYFKCGCSGLARRVFDEMFERNVITWTAMISGLAQNEYYE 240

Query: 241 HSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSAL 300
            SLKLF EMM  G+V PN LTYL  L ACSGL+AL+EG QIH LI KLGI+SDL + SAL
Sbjct: 241 ESLKLF-EMMRNGTVNPNSLTYLSSLMACSGLQALKEGRQIHALIWKLGIESDLHLESAL 300

Query: 301 MDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDE 360
           MDMYSK G + DA +IFE AEE D VS+TVIL GF QNG EEEAIQIF KM+K GIKID 
Sbjct: 301 MDMYSKCGSMEDALQIFEYAEELDEVSMTVILVGFAQNGFEEEAIQIFKKMVKAGIKIDP 360

Query: 361 NVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDR 420
           N++SA LGVFG +TSL LG+Q+H+ VVK NFS NP+VSNGLINMYSKCG L ES+ VF+ 
Sbjct: 361 NMVSAALGVFGVDTSLALGRQLHALVVKNNFSYNPYVSNGLINMYSKCGELKESINVFNA 420

Query: 421 MRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKG 480
           M +RNS++WNSMIA FARHGDG  AL LYE MK E  +PTDVTFLSLLHACSHVGL+KKG
Sbjct: 421 MPQRNSISWNSMIAAFARHGDGFGALQLYEKMKFEGVQPTDVTFLSLLHACSHVGLVKKG 480

Query: 481 MEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACS 540
           MEFL SM KDHG++PR+EHYACVVDMLGRAGLL+EAK+FI  LPE PG  VWQALLGACS
Sbjct: 481 MEFLNSMAKDHGLSPRTEHYACVVDMLGRAGLLTEAKDFIVALPENPGPLVWQALLGACS 540

Query: 541 LYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGI 600
            + +SE GK+AAE L    P SP PYVLLANIYSS+GNWKERA+TI++MKE+G+AKETGI
Sbjct: 541 FHSNSELGKFAAEQLVVAAPGSPAPYVLLANIYSSEGNWKERAKTIKRMKEMGVAKETGI 600

Query: 601 SWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDD 650
           SWIEI+KKVH F V D++HPQ+EIIYGVL +LF  M DEGYVPDKKFILY LD D
Sbjct: 601 SWIEIEKKVHGFVVWDRLHPQSEIIYGVLAQLFRPMADEGYVPDKKFILYYLDQD 653

BLAST of Cla97C02G041510 vs. TrEMBL
Match: tr|A0A2P5DSY6|A0A2P5DSY6_PARAD (DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_034590 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 4.7e-245
Identity = 429/654 (65.60%), Positives = 508/654 (77.68%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           M  +WVF  +++ LPSW S+L    + + + NP A+T S FVL HV+ S LL  CGREGH
Sbjct: 1   MNSRWVFLSVNTHLPSWASTLITTFKTKIYDNPSAKT-SRFVLNHVDISRLLPHCGREGH 60

Query: 61  LHLGSSLHASIFKRFELSNHDHG-----VVIMNSLISMYERCGKLPDAIKVFDEMLTRDT 120
            HL SS+HASI K FE  + ++G     +V+ NSL+S Y +   L DA+K+FDEM  +DT
Sbjct: 61  FHLCSSIHASIIKHFEFFDSENGGILNALVVWNSLLSAYSKPCALSDALKLFDEMPMKDT 120

Query: 121 ISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHG 180
           +SWN LI  F+RNG+F  GF YFK M  +G C+FD+ +LT ILSA DG E   + KM+HG
Sbjct: 121 VSWNTLISAFLRNGQFDMGFGYFKRMLELGFCRFDKVSLTTILSALDGPEFLYVNKMIHG 180

Query: 181 LAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEH 240
           L F SGYE E TVGN+LI+SYFK GC  L  +VF  M ERNVITWTA+ISGLAQN  +E 
Sbjct: 181 LVFQSGYELETTVGNSLITSYFKCGCSGLARRVFDEMFERNVITWTAMISGLAQNEYYEE 240

Query: 241 SLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALM 300
           SLKLF EMM  G+V+PN LTYL  L ACSGL+AL+EG QIH LI KLGI+SDL + SALM
Sbjct: 241 SLKLF-EMMRNGTVDPNSLTYLSSLMACSGLQALKEGRQIHALIWKLGIESDLHLESALM 300

Query: 301 DMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDEN 360
           DMYSK G + DA +IFE AEE D VS+TVIL GF QNG EEEAIQIF KM+K GIKID N
Sbjct: 301 DMYSKCGSMEDALQIFEYAEELDEVSMTVILVGFAQNGFEEEAIQIFKKMVKAGIKIDPN 360

Query: 361 VISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRM 420
           ++SA LGVFG +TSL LG+Q+H+ V+K NFS NP+V NGLINMYSKCG L ES+ VF+ M
Sbjct: 361 MVSAALGVFGVDTSLALGRQLHALVIKNNFSYNPYVGNGLINMYSKCGELKESINVFNEM 420

Query: 421 RERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGM 480
            +RNS++WNSMIA FARHGDG  AL LYE MK E  +PTDVTFLSLLHACSHVGL+K GM
Sbjct: 421 PQRNSISWNSMIAAFARHGDGFGALQLYEKMKFEGVQPTDVTFLSLLHACSHVGLVKMGM 480

Query: 481 EFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSL 540
           EFL SM KDHG++PR+EHYACVVDMLGRAGLL+EAK+FI  LPE PG  VWQALLGACS 
Sbjct: 481 EFLNSMAKDHGLSPRTEHYACVVDMLGRAGLLTEAKDFIVALPENPGPLVWQALLGACSF 540

Query: 541 YGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGIS 600
           + +SE GK+AAE L    P SP PYVLLANIYSSKGNWKERA+TI++MKE+G+AKETGIS
Sbjct: 541 HSNSELGKFAAEQLVVAAPGSPAPYVLLANIYSSKGNWKERAKTIKRMKEMGVAKETGIS 600

Query: 601 WIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDD 650
           WIEI++KVHSF V D++HPQ+EIIYGVL +LF  M DEGYVPDKKFILY LD D
Sbjct: 601 WIEIEQKVHSFVVWDRLHPQSEIIYGVLAQLFRPMADEGYVPDKKFILYYLDQD 652

BLAST of Cla97C02G041510 vs. TrEMBL
Match: tr|A0A2K1XZS0|A0A2K1XZS0_POPTR (Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_013G021100v3 PE=4 SV=1)

HSP 1 Score: 848.6 bits (2191), Expect = 9.7e-243
Identity = 422/653 (64.62%), Positives = 511/653 (78.25%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           MK KWV  KL+S +PSW +SLT PL+ + +  P ++TSS F+L HV+  +LLSICGREG+
Sbjct: 1   MKSKWVIHKLNSHIPSWATSLTSPLKAKTYHTPSSKTSS-FLLNHVDIGHLLSICGREGY 60

Query: 61  LHLGSSLHASIFKRFELSN--HDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISW 120
           LHLGSSLHASI K  E  N    +  VI NSL+SMY + G L DA K+FDEM  RDT+SW
Sbjct: 61  LHLGSSLHASIIKTHEFFNPLEQNAFVIWNSLLSMYAKNGVLTDAAKLFDEMPMRDTVSW 120

Query: 121 NALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAF 180
           N +I GF+++G F  GF +FK M  +G  + D+ATLT ILSACD  EL  + KM+H LA 
Sbjct: 121 NIMISGFLKDGSFDVGFGFFKQMQSLGFYRLDQATLTTILSACDRPELGFVNKMVHCLAV 180

Query: 181 LSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLK 240
           L+G++REI+VGNALI+SYFK G    GMQVF  M ERNVITWTA+ISGL Q+  +  SL+
Sbjct: 181 LNGFQREISVGNALITSYFKCGFSSSGMQVFDEMLERNVITWTAIISGLVQSELYRDSLR 240

Query: 241 LFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMY 300
           LF EM + G VEPN LTYL  L ACSGL+AL EGCQIHG + KLG+QSD C+ SALMDMY
Sbjct: 241 LFVEMTN-GLVEPNSLTYLSSLMACSGLQALREGCQIHGRVWKLGLQSDFCVESALMDMY 300

Query: 301 SKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVIS 360
           SK G +GD  +IFESA + D VS+T+ILAGF QNG EEEA+Q F+KML+ G +ID N++S
Sbjct: 301 SKCGSMGDTLQIFESAGQLDKVSMTIILAGFAQNGFEEEAMQFFVKMLEAGTEIDSNMVS 360

Query: 361 AVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRER 420
           AVLGVFGA+TSL LGQQ+HS V+K++F  NPFV NGLINMYSKCG L++S KVF RM   
Sbjct: 361 AVLGVFGADTSLGLGQQIHSLVIKRSFGSNPFVGNGLINMYSKCGDLEDSTKVFSRMPCM 420

Query: 421 NSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFL 480
           NSV+WNSMIA FARHGDG +AL LY+ M+L+  +PTDVTFLSLLHACSHVGL++KGMEFL
Sbjct: 421 NSVSWNSMIAAFARHGDGSRALQLYKEMRLKGVEPTDVTFLSLLHACSHVGLVEKGMEFL 480

Query: 481 ESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGD 540
           +SMT+ H + PR EHYACVVDMLGRAGLL+EAK FIE LP +P + VWQALLGAC ++GD
Sbjct: 481 KSMTEVHKLTPRMEHYACVVDMLGRAGLLNEAKTFIEGLPIKPDVLVWQALLGACGIHGD 540

Query: 541 SETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIE 600
            E GKYAAEHL    P  P PY+LLANIYSSKG WKERA+TI++MKE+ +AKETGISWIE
Sbjct: 541 PEMGKYAAEHLILSAPEKPSPYILLANIYSSKGRWKERAKTIKRMKEMCVAKETGISWIE 600

Query: 601 IDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRR 652
           I+  +HSF V DKMHPQAEIIYGVL ELF  M+DEGYVPDK++IL  ++ D +
Sbjct: 601 IENNLHSFVVEDKMHPQAEIIYGVLAELFGHMIDEGYVPDKRYILSYVNQDEK 651

BLAST of Cla97C02G041510 vs. Swiss-Prot
Match: sp|Q9MA85|PP215_ARATH (Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E83 PE=2 SV=2)

HSP 1 Score: 755.7 bits (1950), Expect = 4.2e-217
Identity = 390/661 (59.00%), Positives = 473/661 (71.56%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           M  +WV QKL+S LPS +S++  P +    Q+P  +  STF+L HV+ S LLSICGREG 
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQV-STFLLNHVDMSLLLSICGREGW 60

Query: 61  L-HLGSSLHASIFKRFELSN------HDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTR 120
             HLG  LHASI K  E         H + +V+ NSL+S+Y +CGKL DAIK+FDEM  R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 121 DTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMM 180
           D IS N +  GF+RN E  +GF   K M  +G   FD ATLT++LS CD  E C + KM+
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 181 HGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRH 240
           H LA LSGY++EI+VGN LI+SYFK GC   G  VF GM  RNVIT TAVISGL +N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 241 EHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSA 300
           E  L+LF  +M  G V PN +TYL  L ACSG + + EG QIH L+ K GI+S+LCI SA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 301 LMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKID 360
           LMDMYSK G I DAW IFES  E D VS+TVIL G  QNG EEEAIQ F++ML+ G++ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 361 ENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFD 420
            NV+SAVLGV   + SL LG+Q+HS V+K+ FS N FV+NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 421 RMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKK 480
           RM +RN V+WNSMIA FARHG GL AL LYE M   + KPTDVTFLSLLHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 481 GMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGAC 540
           G E L  M + HG+ PR+EHY C++DMLGRAGLL EAK+FI+ LP +P  ++WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 541 SLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETG 600
           S +GD+E G+YAAE LF   P S   ++L+ANIYSS+G WKERA+TI++MK +G+ KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 601 ISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDP 655
           IS IEI+ K HSF V DK+HPQAE IY VL  LF +MVDEGY PDK+FIL    DDR   
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDDRNGT 657

BLAST of Cla97C02G041510 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 3.1e-111
Identity = 219/579 (37.82%), Positives = 338/579 (58.38%), Query Frame = 0

Query: 81  DHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKA 140
           D  V I N L++MY +CG + DA +VF  M  +D++SWN++I G  +NG F      +K+
Sbjct: 346 DFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKS 405

Query: 141 MCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVGNALISSYFKYG 200
           M    D      TL   LS+C  L+   + + +HG +   G +  ++V NAL++ Y + G
Sbjct: 406 M-RRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETG 465

Query: 201 CVDLGMQVFYGMGERNVITWTAVISGLAQNGRH-EHSLKLFREMMSCGSVEPNFLTYLGL 260
            ++   ++F  M E + ++W ++I  LA++ R    ++  F      G  + N +T+  +
Sbjct: 466 YLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSV 525

Query: 261 LTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIF-ESAEEFD 320
           L+A S L   E G QIHGL LK  I  +    +AL+  Y K G +    KIF   AE  D
Sbjct: 526 LSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRD 585

Query: 321 MVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETSLRLGQQVHS 380
            V+   +++G+  N    +A+ +   ML+ G ++D  + + VL  F +  +L  G +VH+
Sbjct: 586 NVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHA 645

Query: 381 FVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAGFARHGDGLK 440
             V+     +  V + L++MYSKCG LD +++ F+ M  RNS +WNSMI+G+ARHG G +
Sbjct: 646 CSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEE 705

Query: 441 ALHLYENMKLEDAKPTD-VTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNPRSEHYACV 500
           AL L+E MKL+   P D VTF+ +L ACSH GLL++G +  ESM+  +G+ PR EH++C+
Sbjct: 706 ALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCM 765

Query: 501 VDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYG--DSETGKYAAEHLFSETPH 560
            D+LGRAG L + ++FIEK+P +P + +W+ +LGAC       +E GK AAE LF   P 
Sbjct: 766 ADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPE 825

Query: 561 SPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQ 620
           + V YVLL N+Y++ G W++  +  +KMK+  + KE G SW+ +   VH F  GDK HP 
Sbjct: 826 NAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPD 885

Query: 621 AEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPI 655
           A++IY  L EL   M D GYVP   F LY L+ + ++ I
Sbjct: 886 ADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEI 922

BLAST of Cla97C02G041510 vs. Swiss-Prot
Match: sp|Q9M1V3|PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 401.7 bits (1031), Expect = 1.6e-110
Identity = 222/653 (34.00%), Positives = 361/653 (55.28%), Query Frame = 0

Query: 10  LSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGHLHLGSSLHA 69
           LSS   S  S  T  L  + H    A  S T V         L+ C    +  LG  +HA
Sbjct: 256 LSSYSTSGKSLETLELFREMHMTGPAPNSYTIV-------SALTACDGFSYAKLGKEIHA 315

Query: 70  SIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNALIGGFMRNG 129
           S+ K    S H   + + N+LI+MY RCGK+P A ++  +M   D ++WN+LI G+++N 
Sbjct: 316 SVLKS---STHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNL 375

Query: 130 EFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVG 189
            +     +F  M   G  K DE ++T I++A   L        +H      G++  + VG
Sbjct: 376 MYKEALEFFSDMIAAGH-KSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVG 435

Query: 190 NALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSV 249
           N LI  Y K        + F  M ++++I+WT VI+G AQN  H  +L+LFR++     +
Sbjct: 436 NTLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAK-KRM 495

Query: 250 EPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWK 309
           E + +    +L A S L+++    +IH  IL+ G+  D  I + L+D+Y K   +G A +
Sbjct: 496 EIDEMILGSILRASSVLKSMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATR 555

Query: 310 IFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETS 369
           +FES +  D+VS T +++    NG E EA+++F +M++ G+  D   +  +L    + ++
Sbjct: 556 VFESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSA 615

Query: 370 LRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAG 429
           L  G+++H ++++K F     ++  +++MY+ CG L  +  VFDR+  +  + + SMI  
Sbjct: 616 LNKGREIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINA 675

Query: 430 FARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNP 489
           +  HG G  A+ L++ M+ E+  P  ++FL+LL+ACSH GLL +G  FL+ M  ++ + P
Sbjct: 676 YGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEP 735

Query: 490 RSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSETGKYAAEHL 549
             EHY C+VDMLGRA  + EA  F++ +  +P   VW ALL AC  + + E G+ AA+ L
Sbjct: 736 WPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRL 795

Query: 550 FSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVG 609
               P +P   VL++N+++ +G W +  +   KMK  GM K  G SWIE+D KVH FT  
Sbjct: 796 LELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTAR 855

Query: 610 DKMHPQAEIIYGVLMELFVLMVDE-GYVPDKKFILYCLDDDRRDPIDNGCTNR 662
           DK HP+++ IY  L E+   +  E GYV D KF+L+ +D+  +  + +G + R
Sbjct: 856 DKSHPESKEIYEKLSEVTRKLEREVGYVADTKFVLHNVDEGEKVQMLHGHSER 895

BLAST of Cla97C02G041510 vs. Swiss-Prot
Match: sp|Q5G1T1|PP272_ARATH (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 1.2e-107
Identity = 213/566 (37.63%), Positives = 322/566 (56.89%), Query Frame = 0

Query: 89  SLISMYERC-GKLPDAIKVFDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDC 148
           SLI M+ +      +A KVFD+M   + ++W  +I   M+ G       +F  M L G  
Sbjct: 207 SLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-F 266

Query: 149 KFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVGNALISSYFKY---GCVDL 208
           + D+ TL+ + SAC  LE   + K +H  A  SG   ++    +L+  Y K    G VD 
Sbjct: 267 ESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDVEC--SLVDMYAKCSADGSVDD 326

Query: 209 GMQVFYGMGERNVITWTAVISGLAQN-GRHEHSLKLFREMMSCGSVEPNFLTYLGLLTAC 268
             +VF  M + +V++WTA+I+G  +N      ++ LF EM++ G VEPN  T+     AC
Sbjct: 327 CRKVFDRMEDHSVMSWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKAC 386

Query: 269 SGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLT 328
             L     G Q+ G   K G+ S+  + ++++ M+ KS R+ DA + FES  E ++VS  
Sbjct: 387 GNLSDPRVGKQVLGQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYN 446

Query: 329 VILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETSLRLGQQVHSFVVKK 388
             L G  +N   E+A ++  ++ +  + +     +++L       S+R G+Q+HS VVK 
Sbjct: 447 TFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKL 506

Query: 389 NFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAGFARHGDGLKALHLY 448
             SCN  V N LI+MYSKCG++D + +VF+ M  RN ++W SMI GFA+HG  ++ L  +
Sbjct: 507 GLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETF 566

Query: 449 ENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNPRSEHYACVVDMLGR 508
             M  E  KP +VT++++L ACSHVGL+ +G     SM +DH + P+ EHYAC+VD+L R
Sbjct: 567 NQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCR 626

Query: 509 AGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSETGKYAAEHLFSETPHSPVPYVLL 568
           AGLL++A  FI  +P Q  + VW+  LGAC ++ ++E GK AA  +    P+ P  Y+ L
Sbjct: 627 AGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQL 686

Query: 569 ANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVL 628
           +NIY+  G W+E     RKMKE  + KE G SWIE+  K+H F VGD  HP A  IY  L
Sbjct: 687 SNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDEL 746

Query: 629 MELFVLMVDEGYVPDKKFILYCLDDD 650
             L   +   GYVPD   +L+ L+++
Sbjct: 747 DRLITEIKRCGYVPDTDLVLHKLEEE 769

BLAST of Cla97C02G041510 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 389.4 bits (999), Expect = 8.0e-107
Identity = 202/604 (33.44%), Positives = 336/604 (55.63%), Query Frame = 0

Query: 51  LLSICGREGHLHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEM 110
           L+  C  +G L  G  LHA   K    SN+     I  +L+++Y +C  +  A+  F E 
Sbjct: 395 LVVACSADGTLFRGQQLHAYTTKLGFASNNK----IEGALLNLYAKCADIETALDYFLET 454

Query: 111 LTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCII 170
              + + WN ++  +    +    F  F+ M  + +   ++ T   IL  C  L    + 
Sbjct: 455 EVENVVLWNVMLVAYGLLDDLRNSFRIFRQM-QIEEIVPNQYTYPSILKTCIRLGDLELG 514

Query: 171 KMMHGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQN 230
           + +H     + ++    V + LI  Y K G +D    +      ++V++WT +I+G  Q 
Sbjct: 515 EQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQY 574

Query: 231 GRHEHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCI 290
              + +L  FR+M+  G +  + +     ++AC+GL+AL+EG QIH      G  SDL  
Sbjct: 575 NFDDKALTTFRQMLDRG-IRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPF 634

Query: 291 GSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI 350
            +AL+ +YS+ G+I +++  FE  E  D ++   +++GF Q+G  EEA+++F++M + GI
Sbjct: 635 QNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGI 694

Query: 351 KIDENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVK 410
             +     + +       +++ G+QVH+ + K  +     V N LI+MY+KCG++ ++ K
Sbjct: 695 DNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEK 754

Query: 411 VFDRMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGL 470
            F  +  +N V+WN++I  +++HG G +AL  ++ M   + +P  VT + +L ACSH+GL
Sbjct: 755 QFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGL 814

Query: 471 LKKGMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALL 530
           + KG+ + ESM  ++G++P+ EHY CVVDML RAGLLS AK FI+++P +P   VW+ LL
Sbjct: 815 VDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLL 874

Query: 531 GACSLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAK 590
            AC ++ + E G++AA HL    P     YVLL+N+Y+    W  R  T +KMKE G+ K
Sbjct: 875 SACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKK 934

Query: 591 ETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDR 650
           E G SWIE+   +HSF VGD+ HP A+ I+    +L     + GYV D   +L  L  ++
Sbjct: 935 EPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQ 992

Query: 651 RDPI 655
           +DPI
Sbjct: 995 KDPI 992

BLAST of Cla97C02G041510 vs. TAIR10
Match: AT3G05340.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 755.7 bits (1950), Expect = 2.3e-218
Identity = 390/661 (59.00%), Positives = 473/661 (71.56%), Query Frame = 0

Query: 1   MKFKWVFQKLSSRLPSWVSSLTFPLRNQFHQNPFAETSSTFVLKHVNPSYLLSICGREGH 60
           M  +WV QKL+S LPS +S++  P +    Q+P  +  STF+L HV+ S LLSICGREG 
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQV-STFLLNHVDMSLLLSICGREGW 60

Query: 61  L-HLGSSLHASIFKRFELSN------HDHGVVIMNSLISMYERCGKLPDAIKVFDEMLTR 120
             HLG  LHASI K  E         H + +V+ NSL+S+Y +CGKL DAIK+FDEM  R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 121 DTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCIIKMM 180
           D IS N +  GF+RN E  +GF   K M  +G   FD ATLT++LS CD  E C + KM+
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 181 HGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQNGRH 240
           H LA LSGY++EI+VGN LI+SYFK GC   G  VF GM  RNVIT TAVISGL +N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 241 EHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSA 300
           E  L+LF  +M  G V PN +TYL  L ACSG + + EG QIH L+ K GI+S+LCI SA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 301 LMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKID 360
           LMDMYSK G I DAW IFES  E D VS+TVIL G  QNG EEEAIQ F++ML+ G++ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 361 ENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFD 420
            NV+SAVLGV   + SL LG+Q+HS V+K+ FS N FV+NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 421 RMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGLLKK 480
           RM +RN V+WNSMIA FARHG GL AL LYE M   + KPTDVTFLSLLHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 481 GMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGAC 540
           G E L  M + HG+ PR+EHY C++DMLGRAGLL EAK+FI+ LP +P  ++WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 541 SLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETG 600
           S +GD+E G+YAAE LF   P S   ++L+ANIYSS+G WKERA+TI++MK +G+ KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 601 ISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDP 655
           IS IEI+ K HSF V DK+HPQAE IY VL  LF +MVDEGY PDK+FIL    DDR   
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDDRNGT 657

BLAST of Cla97C02G041510 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 404.1 bits (1037), Expect = 1.7e-112
Identity = 219/579 (37.82%), Positives = 338/579 (58.38%), Query Frame = 0

Query: 81  DHGVVIMNSLISMYERCGKLPDAIKVFDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKA 140
           D  V I N L++MY +CG + DA +VF  M  +D++SWN++I G  +NG F      +K+
Sbjct: 346 DFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKS 405

Query: 141 MCLVGDCKFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVGNALISSYFKYG 200
           M    D      TL   LS+C  L+   + + +HG +   G +  ++V NAL++ Y + G
Sbjct: 406 M-RRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETG 465

Query: 201 CVDLGMQVFYGMGERNVITWTAVISGLAQNGRH-EHSLKLFREMMSCGSVEPNFLTYLGL 260
            ++   ++F  M E + ++W ++I  LA++ R    ++  F      G  + N +T+  +
Sbjct: 466 YLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSV 525

Query: 261 LTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIF-ESAEEFD 320
           L+A S L   E G QIHGL LK  I  +    +AL+  Y K G +    KIF   AE  D
Sbjct: 526 LSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRD 585

Query: 321 MVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETSLRLGQQVHS 380
            V+   +++G+  N    +A+ +   ML+ G ++D  + + VL  F +  +L  G +VH+
Sbjct: 586 NVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHA 645

Query: 381 FVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAGFARHGDGLK 440
             V+     +  V + L++MYSKCG LD +++ F+ M  RNS +WNSMI+G+ARHG G +
Sbjct: 646 CSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEE 705

Query: 441 ALHLYENMKLEDAKPTD-VTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNPRSEHYACV 500
           AL L+E MKL+   P D VTF+ +L ACSH GLL++G +  ESM+  +G+ PR EH++C+
Sbjct: 706 ALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCM 765

Query: 501 VDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYG--DSETGKYAAEHLFSETPH 560
            D+LGRAG L + ++FIEK+P +P + +W+ +LGAC       +E GK AAE LF   P 
Sbjct: 766 ADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPE 825

Query: 561 SPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQ 620
           + V YVLL N+Y++ G W++  +  +KMK+  + KE G SW+ +   VH F  GDK HP 
Sbjct: 826 NAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPD 885

Query: 621 AEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDRRDPI 655
           A++IY  L EL   M D GYVP   F LY L+ + ++ I
Sbjct: 886 ADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEI 922

BLAST of Cla97C02G041510 vs. TAIR10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 392.1 bits (1006), Expect = 6.8e-109
Identity = 213/566 (37.63%), Positives = 322/566 (56.89%), Query Frame = 0

Query: 89  SLISMYERC-GKLPDAIKVFDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDC 148
           SLI M+ +      +A KVFD+M   + ++W  +I   M+ G       +F  M L G  
Sbjct: 207 SLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-F 266

Query: 149 KFDEATLTMILSACDGLELCCIIKMMHGLAFLSGYEREITVGNALISSYFKY---GCVDL 208
           + D+ TL+ + SAC  LE   + K +H  A  SG   ++    +L+  Y K    G VD 
Sbjct: 267 ESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDVEC--SLVDMYAKCSADGSVDD 326

Query: 209 GMQVFYGMGERNVITWTAVISGLAQN-GRHEHSLKLFREMMSCGSVEPNFLTYLGLLTAC 268
             +VF  M + +V++WTA+I+G  +N      ++ LF EM++ G VEPN  T+     AC
Sbjct: 327 CRKVFDRMEDHSVMSWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKAC 386

Query: 269 SGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLT 328
             L     G Q+ G   K G+ S+  + ++++ M+ KS R+ DA + FES  E ++VS  
Sbjct: 387 GNLSDPRVGKQVLGQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYN 446

Query: 329 VILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVISAVLGVFGAETSLRLGQQVHSFVVKK 388
             L G  +N   E+A ++  ++ +  + +     +++L       S+R G+Q+HS VVK 
Sbjct: 447 TFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKL 506

Query: 389 NFSCNPFVSNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAGFARHGDGLKALHLY 448
             SCN  V N LI+MYSKCG++D + +VF+ M  RN ++W SMI GFA+HG  ++ L  +
Sbjct: 507 GLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETF 566

Query: 449 ENMKLEDAKPTDVTFLSLLHACSHVGLLKKGMEFLESMTKDHGMNPRSEHYACVVDMLGR 508
             M  E  KP +VT++++L ACSHVGL+ +G     SM +DH + P+ EHYAC+VD+L R
Sbjct: 567 NQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCR 626

Query: 509 AGLLSEAKNFIEKLPEQPGLRVWQALLGACSLYGDSETGKYAAEHLFSETPHSPVPYVLL 568
           AGLL++A  FI  +P Q  + VW+  LGAC ++ ++E GK AA  +    P+ P  Y+ L
Sbjct: 627 AGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQL 686

Query: 569 ANIYSSKGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVL 628
           +NIY+  G W+E     RKMKE  + KE G SWIE+  K+H F VGD  HP A  IY  L
Sbjct: 687 SNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDEL 746

Query: 629 MELFVLMVDEGYVPDKKFILYCLDDD 650
             L   +   GYVPD   +L+ L+++
Sbjct: 747 DRLITEIKRCGYVPDTDLVLHKLEEE 769

BLAST of Cla97C02G041510 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 389.4 bits (999), Expect = 4.4e-108
Identity = 202/604 (33.44%), Positives = 336/604 (55.63%), Query Frame = 0

Query: 51  LLSICGREGHLHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKVFDEM 110
           L+  C  +G L  G  LHA   K    SN+     I  +L+++Y +C  +  A+  F E 
Sbjct: 395 LVVACSADGTLFRGQQLHAYTTKLGFASNNK----IEGALLNLYAKCADIETALDYFLET 454

Query: 111 LTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLELCCII 170
              + + WN ++  +    +    F  F+ M  + +   ++ T   IL  C  L    + 
Sbjct: 455 EVENVVLWNVMLVAYGLLDDLRNSFRIFRQM-QIEEIVPNQYTYPSILKTCIRLGDLELG 514

Query: 171 KMMHGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISGLAQN 230
           + +H     + ++    V + LI  Y K G +D    +      ++V++WT +I+G  Q 
Sbjct: 515 EQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQY 574

Query: 231 GRHEHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQSDLCI 290
              + +L  FR+M+  G +  + +     ++AC+GL+AL+EG QIH      G  SDL  
Sbjct: 575 NFDDKALTTFRQMLDRG-IRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPF 634

Query: 291 GSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI 350
            +AL+ +YS+ G+I +++  FE  E  D ++   +++GF Q+G  EEA+++F++M + GI
Sbjct: 635 QNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGI 694

Query: 351 KIDENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVK 410
             +     + +       +++ G+QVH+ + K  +     V N LI+MY+KCG++ ++ K
Sbjct: 695 DNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEK 754

Query: 411 VFDRMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACSHVGL 470
            F  +  +N V+WN++I  +++HG G +AL  ++ M   + +P  VT + +L ACSH+GL
Sbjct: 755 QFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGL 814

Query: 471 LKKGMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVWQALL 530
           + KG+ + ESM  ++G++P+ EHY CVVDML RAGLLS AK FI+++P +P   VW+ LL
Sbjct: 815 VDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLL 874

Query: 531 GACSLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEVGMAK 590
            AC ++ + E G++AA HL    P     YVLL+N+Y+    W  R  T +KMKE G+ K
Sbjct: 875 SACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKK 934

Query: 591 ETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFILYCLDDDR 650
           E G SWIE+   +HSF VGD+ HP A+ I+    +L     + GYV D   +L  L  ++
Sbjct: 935 EPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQ 992

Query: 651 RDPI 655
           +DPI
Sbjct: 995 KDPI 992

BLAST of Cla97C02G041510 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 388.3 bits (996), Expect = 9.8e-108
Identity = 206/597 (34.51%), Positives = 329/597 (55.11%), Query Frame = 0

Query: 47  NPSYLLSICGREGHLHLGSSLHASIFKRFELSNHDHGVVIMNSLISMYERCGKLPDAIKV 106
           N +YLL +CG E  L +G  +H  + K    S     +  M  L +MY +C ++ +A KV
Sbjct: 137 NFTYLLKVCGDEAELRVGKEIHGLLVK----SGFSLDLFAMTGLENMYAKCRQVNEARKV 196

Query: 107 FDEMLTRDTISWNALIGGFMRNGEFCAGFSYFKAMCLVGDCKFDEATLTMILSACDGLEL 166
           FD M  RD +SWN ++ G+ +NG         K+MC   + K    T+  +L A   L L
Sbjct: 197 FDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMC-EENLKPSFITIVSVLPAVSALRL 256

Query: 167 CCIIKMMHGLAFLSGYEREITVGNALISSYFKYGCVDLGMQVFYGMGERNVITWTAVISG 226
             + K +HG A  SG++  + +  AL+  Y K G ++   Q+F GM ERNV++W ++I  
Sbjct: 257 ISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDA 316

Query: 227 LAQNGRHEHSLKLFREMMSCGSVEPNFLTYLGLLTACSGLEALEEGCQIHGLILKLGIQS 286
             QN   + ++ +F++M+  G V+P  ++ +G L AC+ L  LE G  IH L ++LG+  
Sbjct: 317 YVQNENPKEAMLIFQKMLDEG-VKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDR 376

Query: 287 DLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTQNGCEEEAIQIFLKML 346
           ++ + ++L+ MY K   +  A  +F   +   +VS   ++ GF QNG   +A+  F +M 
Sbjct: 377 NVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMR 436

Query: 347 KMGIKIDENVISAVLGVFGAETSLRLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALD 406
              +K D     +V+      +     + +H  V++     N FV+  L++MY+KCGA+ 
Sbjct: 437 SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIM 496

Query: 407 ESVKVFDRMRERNSVTWNSMIAGFARHGDGLKALHLYENMKLEDAKPTDVTFLSLLHACS 466
            +  +FD M ER+  TWN+MI G+  HG G  AL L+E M+    KP  VTFLS++ ACS
Sbjct: 497 IARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACS 556

Query: 467 HVGLLKKGMEFLESMTKDHGMNPRSEHYACVVDMLGRAGLLSEAKNFIEKLPEQPGLRVW 526
           H GL++ G++    M +++ +    +HY  +VD+LGRAG L+EA +FI ++P +P + V+
Sbjct: 557 HSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVY 616

Query: 527 QALLGACSLYGDSETGKYAAEHLFSETPHSPVPYVLLANIYSSKGNWKERARTIRKMKEV 586
            A+LGAC ++ +    + AAE LF   P     +VLLANIY +   W++  +    M   
Sbjct: 617 GAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQ 676

Query: 587 GMAKETGISWIEIDKKVHSFTVGDKMHPQAEIIYGVLMELFVLMVDEGYVPDKKFIL 644
           G+ K  G S +EI  +VHSF  G   HP ++ IY  L +L   + + GYVPD   +L
Sbjct: 677 GLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL 727
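
The identity and positive-match percentages in the HSP header lines are the match counts divided by the alignment length. The sketch below recomputes them from one header line as a consistency check. The function name and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Matches each "count/length (percent%)" pair as it appears in an HSP header line.
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_percentages(header: str) -> list[tuple[float, float]]:
    """Return (recomputed, reported) percentage pairs for each fraction in the header."""
    pairs = []
    for count, length, reported in FRACTION.findall(header):
        # Percentage = matches / alignment length, rounded to two decimals.
        recomputed = round(100 * int(count) / int(length), 2)
        pairs.append((recomputed, float(reported)))
    return pairs


# Header copied from the Swiss-Prot hit Q9MA85 (hypothetical usage).
header = "Identity = 390/661 (59.00%), Positives = 473/661 (71.56%), Query Frame = 0"
for recomputed, reported in hsp_percentages(header):
    print(recomputed, reported)
```

Because the pattern only looks at the fractions, it works regardless of how the field labels are spelled in the header.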

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
KGN60913.1      0.0e+00  88.10     hypothetical protein Csa_2G022820 [Cucumis sativus]
XP_022946256.1  0.0e+00  87.80     pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita mosc...
XP_022946255.1  0.0e+00  89.19     pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita mosc...
XP_022946254.1  0.0e+00  89.19     pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita mosc...
XP_022999024.1  0.0e+00  88.58     pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima]
Match Name                      E-value   Identity  Description
tr|A0A0A0LGC8|A0A0A0LGC8_CUCSA  0.0e+00   88.10     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G022820 PE=4 SV=1
tr|A0A1S4E0D4|A0A1S4E0D4_CUCME  0.0e+00   87.78     pentatricopeptide repeat-containing protein At3g05340 OS=Cucumis melo OX=3656 GN...
tr|A0A2P5CFM5|A0A2P5CFM5_9ROSA  2.0e-248  66.26     DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_286550 ...
tr|A0A2P5DSY6|A0A2P5DSY6_PARAD  4.7e-245  65.60     DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_034...
tr|A0A2K1XZS0|A0A2K1XZS0_POPTR  9.7e-243  64.62     Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_013G021100v3 PE=...
Match Name             E-value   Identity  Description
sp|Q9MA85|PP215_ARATH  4.2e-217  59.00     Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX...
sp|Q9FIB2|PP373_ARATH  3.1e-111  37.82     Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
sp|Q9M1V3|PP296_ARATH  1.6e-110  34.00     Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
sp|Q5G1T1|PP272_ARATH  1.2e-107  37.63     Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...
sp|Q9SVP7|PP307_ARATH  8.0e-107  33.44     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Match Name   E-value   Identity  Description
AT3G05340.1  2.3e-218  59.00     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G09950.1  1.7e-112  37.82     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G49170.1  6.8e-109  37.63     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1  4.4e-108  33.44     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1  9.8e-108  34.51     Pentatricopeptide repeat (PPR) superfamily protein
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0008270  zinc ion binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
IPR032867  DYW_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008568 microtubule-severing ATPase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G041510.1  Cla97C02G041510.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment (coord)	E-value	Score
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	593..651	2.0E-9	37.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	320..350	5.0E-4	20.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	116..141	0.16	12.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	85..113	1.9E-5	24.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	292..316	0.14	12.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	215..263	1.6E-9	37.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	419..466	2.8E-11	43.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	421..454	1.4E-6	26.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	87..113	0.0026	15.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	321..354	1.0E-4	20.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	218..252	3.5E-8	31.1
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	252..286	-	6.445
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	185..215	-	6.336
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	216..250	-	11.213
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	419..453	-	11.542
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	556..590	-	7.574
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	83..117	-	9.427
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	388..418	-	9.657
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	287..317	-	7.3
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	454..489	-	7.958
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	490..520	-	6.511
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	318..352	-	9.712
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	368..483	1.0E-27	99.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	30..165	5.9E-19	70.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	271..367	3.6E-14	54.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	169..270	5.5E-17	63.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	484..616	1.6E-9	39.8
None	No IPR available	PANTHER	PTHR24015:SF563	SUBFAMILY NOT NAMED	39..620	-	-
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	39..620	-	-
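
The PROSITE PS51375 hits listed above are reported in no particular order, which makes it hard to see how the PPR motifs are arranged along the protein. The following is a minimal sketch, not part of the original annotation pipeline, that sorts those eleven coordinate pairs and merges hits that sit back to back into tandem blocks. The function name merge_adjacent and the max_gap tolerance of one residue are assumptions chosen for illustration.

```python
# Coordinates (1-based, inclusive) copied from the PROSITE PS51375 rows above.
PROSITE_PPR_HITS = [
    (252, 286), (185, 215), (216, 250), (419, 453), (556, 590), (83, 117),
    (388, 418), (287, 317), (454, 489), (490, 520), (318, 352),
]


def merge_adjacent(hits, max_gap=1):
    """Merge hits separated by at most max_gap residues (hypothetical helper).

    max_gap is an illustrative tolerance, not a value taken from the source.
    """
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1] + 1 + max_gap:
            # Hit continues the current block: extend the block's end.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            # Hit starts a new block.
            merged.append((start, end))
    return merged


blocks = merge_adjacent(PROSITE_PPR_HITS)
for start, end in blocks:
    print(f"{start}..{end}")
```

Under these assumptions the eleven motifs collapse into four blocks: 83..117, 185..352, 388..520 and 556..590. The last block ends three residues before the Pfam DYW domain hit at 593..651.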

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cla97C02G041510	Silver-seed gourd	carwmbB0746
Cla97C02G041510	Cucurbita maxima (Rimu)	cmawmbB175
Cla97C02G041510	Cucurbita moschata (Rifu)	cmowmbB161
Cla97C02G041510	Melon (DHL92) v3.5.1	mewmbB376