Cla97C07G130920 (gene) Watermelon (97103) v2.5

Overview
NameCla97C07G130920
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr07: 2542587 .. 2544581 (+)
RNA-Seq ExpressionCla97C07G130920
SyntenyCla97C07G130920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGCAAGCCCGGCCGCAGTTTCGATGCCTTCTTCATCCCCTTACGTTCATTTACTGTGCGATCGCTCTCCACCAAGACTCCCTCTGCTTCTTTACAAGAATTCACCAGTCTCTGCGATGGCGGAGGCATAAGACAAGCCTATAACACGTTCAAATACGAGATATGGTCAGACCCATCTCTTTTTTCTCATCTGATCCAATCATGCATAAAACTAAGCTCGCTTTTTGGAGGAAAACAGGTCCATTCTTTGGTAATTACATCTGGGTGTTCCAGAGACAAGTTCATTTCTAATCACCTTTTAAACTTGTACTCCAAATTAGGACATTTGAAGTCTTCTTTGGTGCTGTTTAGTCAAATGCCACGAAGAAATATAATGTCATATAACATTTTGATCGATGGGTACCTGCAGCTTGGGGATTTGGAAAGCGCCCAGAAACTGTTTGATGAAATGTCTGAAAGAAACATTGCCACATGGAATGCGATGATTGCAGGTCTAACCCAGTTTGAATTTAACGAACAGGCTTTAAGTTTGTTTAAAGAAATGAATGGATTGGGTTTTTTGCCTGATGAGTTCACACTAGGCAGTGTACTTAGAGGTTGCGCTGGTTTAAGATCTTTAATTGCTGGTCAAGAGGTTCATGCTTGTCTGATGAAATGTGGATTTGAACTGAACTTGGTAGTGGGGAGTTCTCTAGCTCATATGTATATGAAGTCTGGTAGTTTATCTGATGGAGAGAAGTTAATTAAATCAATGCCAATTCGTAATGTAGTTGCTTGGAATACTCTTATTGCTGGAAAAGCTCAACATGGGCGTTCAGAAGAAGTGTTGAACCAGTATAATATGATGAAAATGTCAGGCTTTCGACCGGATAAAATAACATTTGTGAGTGTAATAAGTGCGTGTTCGGAACTGGCGACATTAGGACAAGGCCAGCAGATCCATGCTGAAGTGATCAAAGCTGGAGCTGGTTCAGTTGTAGCAGTTGTCAGTTCATTGATTAGTATGTATTCACGGTCTGGGTGTCTAGAGGACTCTGTGAAAGTCTTTTTGGAACGTGAAGATTCTGATGTTGTGTTATGGAGTGCTATGATTGCAGCTTATGGATTCCATGGGAGAGGAGAGGAAGCTATTGAGCTGTTTCACCAAATGGAAGATTTGAAAATAAAGGCAAATGAAGTGACCTTCTTGATTCTGCTTTATGCTTGTAGTCACTGTGGATTGAAAGAGAAAGGAACTGAGTATTTAGATTTGATGGTGAAGAAGTATAAACTCAAGCCTAGAATCGAACACTACACATGTGTGGTTGATCTGCTCGGTCGGGCTGGCTGCTTGGAGGAAGCAGAGGGTATGATAAGATCAATGCCTGTAAAAGCAGATGGCATCATTTGGAAAACTTTATTATCAGCCTGCAAACTCCATAAGAATGCAGAAATGGCCAAACGAATTTCTGAAGAAATTCTAAAGCTTGATCCTCTGGATGCTGCTTCCTATGTCCTGCTTTCAAACATCCATGCTTCTGCTAGAAATTGGCTCGACGTTTCCGAGATTAGGAAAACCATGAGAGATAGGAACGTCAGGAAGGAACCAGGCATTAGTTGGTTAGAACTCAAAAATTCGGTTCACCAATTTAGCATGGGGGGCAAATCTCACCCACAATACCTGGAGATTGATTCGTATTTGAAAGAACTAATGTCTGAACTGAAACTGCACGGTTACGTACCGGACTTAGCCTCAGTTTTGCACGACATGGACAATGAAGAAAAAGAATACAATTTGGCACAACACAGTGAGAAGTTAGCAATTGCTTTTGCACTGATGAACACTCCCGAGAATGTCCCAATAAGAGTGATGAAGAACTTGCGGGTATGCAATGACTGTCATAATGCCATTAAGTGCATATCAAAGATCAGAAACAGAGAGATTATTGTAAGAGATGCAAGTAGATTTCACCATTTCAAGGACGGTGAATGTTCTTGTCGTAATTACTGGTAG

mRNA sequence

ATGGGCAAGCCCGGCCGCAGTTTCGATGCCTTCTTCATCCCCTTACGTTCATTTACTGTGCGATCGCTCTCCACCAAGACTCCCTCTGCTTCTTTACAAGAATTCACCAGTCTCTGCGATGGCGGAGGCATAAGACAAGCCTATAACACGTTCAAATACGAGATATGGTCAGACCCATCTCTTTTTTCTCATCTGATCCAATCATGCATAAAACTAAGCTCGCTTTTTGGAGGAAAACAGGTCCATTCTTTGGTAATTACATCTGGGTGTTCCAGAGACAAGTTCATTTCTAATCACCTTTTAAACTTGTACTCCAAATTAGGACATTTGAAGTCTTCTTTGGTGCTGTTTAGTCAAATGCCACGAAGAAATATAATGTCATATAACATTTTGATCGATGGGTACCTGCAGCTTGGGGATTTGGAAAGCGCCCAGAAACTGTTTGATGAAATGTCTGAAAGAAACATTGCCACATGGAATGCGATGATTGCAGGTCTAACCCAGTTTGAATTTAACGAACAGGCTTTAAGTTTGTTTAAAGAAATGAATGGATTGGGTTTTTTGCCTGATGAGTTCACACTAGGCAGTGTACTTAGAGGTTGCGCTGGTTTAAGATCTTTAATTGCTGGTCAAGAGGTTCATGCTTGTCTGATGAAATGTGGATTTGAACTGAACTTGGTAGTGGGGAGTTCTCTAGCTCATATGTATATGAAGTCTGGTAGTTTATCTGATGGAGAGAAGTTAATTAAATCAATGCCAATTCGTAATGTAGTTGCTTGGAATACTCTTATTGCTGGAAAAGCTCAACATGGGCGTTCAGAAGAAGTGTTGAACCAGTATAATATGATGAAAATGTCAGGCTTTCGACCGGATAAAATAACATTTGTGAGTGTAATAAGTGCGTGTTCGGAACTGGCGACATTAGGACAAGGCCAGCAGATCCATGCTGAAGTGATCAAAGCTGGAGCTGGTTCAGTTGTAGCAGTTGTCAGTTCATTGATTAGTATGTATTCACGGTCTGGGTGTCTAGAGGACTCTGTGAAAGTCTTTTTGGAACGTGAAGATTCTGATGTTGTGTTATGGAGTGCTATGATTGCAGCTTATGGATTCCATGGGAGAGGAGAGGAAGCTATTGAGCTGTTTCACCAAATGGAAGATTTGAAAATAAAGGCAAATGAAGTGACCTTCTTGATTCTGCTTTATGCTTGTAGTCACTGTGGATTGAAAGAGAAAGGAACTGAGTATTTAGATTTGATGGTGAAGAAGTATAAACTCAAGCCTAGAATCGAACACTACACATGTGTGGTTGATCTGCTCGGTCGGGCTGGCTGCTTGGAGGAAGCAGAGGGTATGATAAGATCAATGCCTGTAAAAGCAGATGGCATCATTTGGAAAACTTTATTATCAGCCTGCAAACTCCATAAGAATGCAGAAATGGCCAAACGAATTTCTGAAGAAATTCTAAAGCTTGATCCTCTGGATGCTGCTTCCTATGTCCTGCTTTCAAACATCCATGCTTCTGCTAGAAATTGGCTCGACGTTTCCGAGATTAGGAAAACCATGAGAGATAGGAACGTCAGGAAGGAACCAGGCATTAGTTGGTTAGAACTCAAAAATTCGGTTCACCAATTTAGCATGGGGGGCAAATCTCACCCACAATACCTGGAGATTGATTCGTATTTGAAAGAACTAATGTCTGAACTGAAACTGCACGGTTACGTACCGGACTTAGCCTCAGTTTTGCACGACATGGACAATGAAGAAAAAGAATACAATTTGGCACAACACAGTGAGAAGTTAGCAATTGCTTTTGCACTGATGAACACTCCCGAGAATGTCCCAATAAGAGTGATGAAGAACTTGCGGGTATGCAATGACTGTCATAATGCCATTAAGTGCATATCAAAGATCAGAAACAGAGAGATTATTGTAAGAGATGCAAGTAGATTTCACCATTTCAAGGACGGTGAATGTTCTTGTCGTAATTACTGGTAG

Coding sequence (CDS)

ATGGGCAAGCCCGGCCGCAGTTTCGATGCCTTCTTCATCCCCTTACGTTCATTTACTGTGCGATCGCTCTCCACCAAGACTCCCTCTGCTTCTTTACAAGAATTCACCAGTCTCTGCGATGGCGGAGGCATAAGACAAGCCTATAACACGTTCAAATACGAGATATGGTCAGACCCATCTCTTTTTTCTCATCTGATCCAATCATGCATAAAACTAAGCTCGCTTTTTGGAGGAAAACAGGTCCATTCTTTGGTAATTACATCTGGGTGTTCCAGAGACAAGTTCATTTCTAATCACCTTTTAAACTTGTACTCCAAATTAGGACATTTGAAGTCTTCTTTGGTGCTGTTTAGTCAAATGCCACGAAGAAATATAATGTCATATAACATTTTGATCGATGGGTACCTGCAGCTTGGGGATTTGGAAAGCGCCCAGAAACTGTTTGATGAAATGTCTGAAAGAAACATTGCCACATGGAATGCGATGATTGCAGGTCTAACCCAGTTTGAATTTAACGAACAGGCTTTAAGTTTGTTTAAAGAAATGAATGGATTGGGTTTTTTGCCTGATGAGTTCACACTAGGCAGTGTACTTAGAGGTTGCGCTGGTTTAAGATCTTTAATTGCTGGTCAAGAGGTTCATGCTTGTCTGATGAAATGTGGATTTGAACTGAACTTGGTAGTGGGGAGTTCTCTAGCTCATATGTATATGAAGTCTGGTAGTTTATCTGATGGAGAGAAGTTAATTAAATCAATGCCAATTCGTAATGTAGTTGCTTGGAATACTCTTATTGCTGGAAAAGCTCAACATGGGCGTTCAGAAGAAGTGTTGAACCAGTATAATATGATGAAAATGTCAGGCTTTCGACCGGATAAAATAACATTTGTGAGTGTAATAAGTGCGTGTTCGGAACTGGCGACATTAGGACAAGGCCAGCAGATCCATGCTGAAGTGATCAAAGCTGGAGCTGGTTCAGTTGTAGCAGTTGTCAGTTCATTGATTAGTATGTATTCACGGTCTGGGTGTCTAGAGGACTCTGTGAAAGTCTTTTTGGAACGTGAAGATTCTGATGTTGTGTTATGGAGTGCTATGATTGCAGCTTATGGATTCCATGGGAGAGGAGAGGAAGCTATTGAGCTGTTTCACCAAATGGAAGATTTGAAAATAAAGGCAAATGAAGTGACCTTCTTGATTCTGCTTTATGCTTGTAGTCACTGTGGATTGAAAGAGAAAGGAACTGAGTATTTAGATTTGATGGTGAAGAAGTATAAACTCAAGCCTAGAATCGAACACTACACATGTGTGGTTGATCTGCTCGGTCGGGCTGGCTGCTTGGAGGAAGCAGAGGGTATGATAAGATCAATGCCTGTAAAAGCAGATGGCATCATTTGGAAAACTTTATTATCAGCCTGCAAACTCCATAAGAATGCAGAAATGGCCAAACGAATTTCTGAAGAAATTCTAAAGCTTGATCCTCTGGATGCTGCTTCCTATGTCCTGCTTTCAAACATCCATGCTTCTGCTAGAAATTGGCTCGACGTTTCCGAGATTAGGAAAACCATGAGAGATAGGAACGTCAGGAAGGAACCAGGCATTAGTTGGTTAGAACTCAAAAATTCGGTTCACCAATTTAGCATGGGGGGCAAATCTCACCCACAATACCTGGAGATTGATTCGTATTTGAAAGAACTAATGTCTGAACTGAAACTGCACGGTTACGTACCGGACTTAGCCTCAGTTTTGCACGACATGGACAATGAAGAAAAAGAATACAATTTGGCACAACACAGTGAGAAGTTAGCAATTGCTTTTGCACTGATGAACACTCCCGAGAATGTCCCAATAAGAGTGATGAAGAACTTGCGGGTATGCAATGACTGTCATAATGCCATTAAGTGCATATCAAAGATCAGAAACAGAGAGATTATTGTAAGAGATGCAAGTAGATTTCACCATTTCAAGGACGGTGAATGTTCTTGTCGTAATTACTGGTAG

Protein sequence

MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPSLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSCRNYW
Homology
BLAST of Cla97C07G130920 vs. NCBI nr
Match: XP_023553764.1 (pentatricopeptide repeat-containing protein At2g41080 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1221.5 bits (3159), Expect = 0.0e+00
Identity = 607/664 (91.42%), Postives = 630/664 (94.88%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MGKP RSFDAFF PL +  VRSLSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGKPSRSFDAFFSPLHALAVRSLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFS M
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCANDKFISNHLLNLYSKLGNLKSSLVLFSHM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQAL LF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALGLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLG LPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGILPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK F++RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFVDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKNS
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNS 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGY+PD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYMPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. NCBI nr
Match: XP_022969026.1 (pentatricopeptide repeat-containing protein At2g41080 [Cucurbita maxima])

HSP 1 Score: 1221.1 bits (3158), Expect = 0.0e+00
Identity = 609/664 (91.72%), Postives = 630/664 (94.88%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MGKP RSFDA F PL + TVR LSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGKPSRSFDALFNPLHALTVRLLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFSQM
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCAYDKFISNHLLNLYSKLGNLKSSLVLFSQM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQALSLF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALSLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLGFLPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGFLPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK FL+RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFLDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHK AEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKKAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGYVPD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYVPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. NCBI nr
Match: KAG7011986.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 607/664 (91.42%), Postives = 629/664 (94.73%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MGKP RSFDAFF PL +  VRSLSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGKPSRSFDAFFSPLHALAVRSLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFS M
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCANDKFISNHLLNLYSKLGNLKSSLVLFSHM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQAL LF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALGLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLG LPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGILPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK F++RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFVDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGYVPD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYVPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. NCBI nr
Match: XP_022953052.1 (pentatricopeptide repeat-containing protein At2g41080 [Cucurbita moschata] >KAG6572381.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 605/664 (91.11%), Postives = 628/664 (94.58%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MG+P RSFDA F PL +  VRSLSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGQPSRSFDALFNPLHALAVRSLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFS M
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCANDKFISNHLLNLYSKLGNLKSSLVLFSHM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQAL LF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALGLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLG LPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGILPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK F++RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFVDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGYVPD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYVPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. NCBI nr
Match: XP_038887926.1 (pentatricopeptide repeat-containing protein At2g41080 isoform X1 [Benincasa hispida])

HSP 1 Score: 1186.4 bits (3068), Expect = 0.0e+00
Identity = 592/664 (89.16%), Postives = 621/664 (93.52%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           M    RSF+ FF PL SFTVRSLSTKTPSASLQ+FTSLC+GG IRQAY+TFK EIWSDPS
Sbjct: 1   MSNLSRSFEVFFNPLHSFTVRSLSTKTPSASLQKFTSLCNGGRIRQAYDTFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCI L SLFGGKQVHSL+ITSGCS+D+FI NHLLN YSKLG L+S  VLFSQM
Sbjct: 61  LFSHLLQSCINLGSLFGGKQVHSLIITSGCSKDQFICNHLLNFYSKLGQLESPFVLFSQM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PR+NIMSYNILI+GYLQ G LE AQKLFDEMSERNIATWNAMIAGLTQF  NEQALSLFK
Sbjct: 121 PRKNIMSYNILINGYLQRGYLEKAQKLFDEMSERNIATWNAMIAGLTQFGSNEQALSLFK 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLGFLPDEFTLGSVLRGCAGLRSL+AGQEVHACL+KCGFEL+LVV SSLAHMYMKSG
Sbjct: 181 EMYGLGFLPDEFTLGSVLRGCAGLRSLLAGQEVHACLIKCGFELSLVVCSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMP R+VVAWNTLIAGKAQ+G  EEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPTRSVVAWNTLIAGKAQNGCPEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIK G GSVVAVVSSLISMYSRSG LEDSVK FL+REDSDVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKGGVGSVVAVVSSLISMYSRSGSLEDSVKAFLDREDSDVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           W+AMIAA+GFHGRGEEAIELFHQMEDLK++ANEVTFL LLYACSHCGL+EKGTEYLDLMV
Sbjct: 361 WTAMIAAHGFHGRGEEAIELFHQMEDLKMEANEVTFLSLLYACSHCGLEEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           KKYKLKPR+EHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLS+CKLHK AEMA
Sbjct: 421 KKYKLKPRVEHYTCVVDLLGRAGHLEEAEGMIRSMPVKADGIIWKTLLSSCKLHKKAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRI+EEILKLD LDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRIAEEILKLDSLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEI+SYLKELMSELKLHGYVPDL SVLHDMDNEEKEYNLA HSEKL
Sbjct: 541 VHQFSMGDKSHPQYLEIESYLKELMSELKLHGYVPDLGSVLHDMDNEEKEYNLAHHSEKL 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMNTPEN PIRVMKNLRVCNDCHNAIKC+SKIR REIIVRDASRFHHFKDGECSC
Sbjct: 601 AIAFALMNTPENFPIRVMKNLRVCNDCHNAIKCMSKIRKREIIVRDASRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. ExPASy Swiss-Prot
Match: Q8S9M4 (Pentatricopeptide repeat-containing protein At2g41080 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H29 PE=2 SV=2)

HSP 1 Score: 830.1 bits (2143), Expect = 1.8e-239
Identity = 408/649 (62.87%), Postives = 508/649 (78.27%), Query Frame = 0

Query: 17  SFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPSLFSHLIQSCIKLSSLF 76
           S  VR LS    +A      +LC  G +R+A+  F+  I+++ SLF+  IQSC    SL 
Sbjct: 6   SSVVRPLSVDPATA----IATLCSKGNLREAFQRFRLNIFTNTSLFTPFIQSCTTRQSLP 65

Query: 77  GGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYL 136
            GKQ+H L++ SG S DKFI NHL+++YSKLG   S++ ++ +M ++N MS NILI+GY+
Sbjct: 66  SGKQLHCLLVVSGFSSDKFICNHLMSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYV 125

Query: 137 QLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKEMNGLGFLPDEFTLGS 196
           + GDL +A+K+FDEM +R + TWNAMIAGL QFEFNE+ LSLF+EM+GLGF PDE+TLGS
Sbjct: 126 RAGDLVNARKVFDEMPDRKLTTWNAMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGS 185

Query: 197 VLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGSLSDGEKLIKSMPIRN 256
           V  G AGLRS+  GQ++H   +K G EL+LVV SSLAHMYM++G L DGE +I+SMP+RN
Sbjct: 186 VFSGSAGLRSVSIGQQIHGYTIKYGLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRN 245

Query: 257 VVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVISACSELATLGQGQQIHA 316
           +VAWNTLI G AQ+G  E VL  Y MMK+SG RP+KITFV+V+S+CS+LA  GQGQQIHA
Sbjct: 246 LVAWNTLIMGNAQNGCPETVLYLYKMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHA 305

Query: 317 EVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVLWSAMIAAYGFHGRGEE 376
           E IK GA SVVAVVSSLISMYS+ GCL D+ K F ERED D V+WS+MI+AYGFHG+G+E
Sbjct: 306 EAIKIGASSVVAVVSSLISMYSKCGCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDE 365

Query: 377 AIELFHQM-EDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCV 436
           AIELF+ M E   ++ NEV FL LLYACSH GLK+KG E  D+MV+KY  KP ++HYTCV
Sbjct: 366 AIELFNTMAEQTNMEINEVAFLNLLYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCV 425

Query: 437 VDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMAKRISEEILKLDPLDA 496
           VDLLGRAGCL++AE +IRSMP+K D +IWKTLLSAC +HKNAEMA+R+ +EIL++DP D+
Sbjct: 426 VDLLGRAGCLDQAEAIIRSMPIKTDIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDS 485

Query: 497 ASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNSVHQFSMGGKSHPQYL 556
           A YVLL+N+HASA+ W DVSE+RK+MRD+NV+KE GISW E K  VHQF MG +S  +  
Sbjct: 486 ACYVLLANVHASAKRWRDVSEVRKSMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSK 545

Query: 557 EIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKLAIAFALMNTPENVPI 616
           EI SYLKEL  E+KL GY PD ASVLHDMD EEKE +L QHSEKLA+AFALM  PE  PI
Sbjct: 546 EIYSYLKELTLEMKLKGYKPDTASVLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPI 605

Query: 617 RVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSCRNYW 665
           R++KNLRVC+DCH A K IS I+NREI +RD SRFHHF +G+CSC +YW
Sbjct: 606 RIIKNLRVCSDCHVAFKYISVIKNREITLRDGSRFHHFINGKCSCGDYW 650

BLAST of Cla97C07G130920 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 5.0e-141
Identity = 254/622 (40.84%), Postives = 387/622 (62.22%), Query Frame = 0

Query: 78  GKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQ 137
           GK+VHS ++  G   +  +SN LLN+Y+K G    +  +F +M  R+I S+N +I  ++Q
Sbjct: 165 GKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQ 224

Query: 138 LGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKEMNGLGFL-PDEFTLGS 197
           +G ++ A   F++M+ER+I TWN+MI+G  Q  ++ +AL +F +M     L PD FTL S
Sbjct: 225 VGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLAS 284

Query: 198 VLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGSLSDGEKLIK------ 257
           VL  CA L  L  G+++H+ ++  GF+++ +V ++L  MY + G +    +LI+      
Sbjct: 285 VLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKD 344

Query: 258 ---------------------------SMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMM 317
                                      S+  R+VVAW  +I G  QHG   E +N +  M
Sbjct: 345 LKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM 404

Query: 318 KMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCL 377
              G RP+  T  +++S  S LA+L  G+QIH   +K+G    V+V ++LI+MY+++G +
Sbjct: 405 VGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNI 464

Query: 378 EDSVKVF-LEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYA 437
             + + F L R + D V W++MI A   HG  EEA+ELF  M    ++ + +T++ +  A
Sbjct: 465 TSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSA 524

Query: 438 CSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGI 497
           C+H GL  +G +Y D+M    K+ P + HY C+VDL GRAG L+EA+  I  MP++ D +
Sbjct: 525 CTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVV 584

Query: 498 IWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMR 557
            W +LLSAC++HKN ++ K  +E +L L+P ++ +Y  L+N++++   W + ++IRK+M+
Sbjct: 585 TWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMK 644

Query: 558 DRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLH 617
           D  V+KE G SW+E+K+ VH F +   +HP+  EI   +K++  E+K  GYVPD ASVLH
Sbjct: 645 DGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLH 704

Query: 618 DMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREI 665
           D++ E KE  L  HSEKLAIAF L++TP+   +R+MKNLRVCNDCH AIK ISK+  REI
Sbjct: 705 DLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREI 764

BLAST of Cla97C07G130920 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 491.1 bits (1263), Expect = 2.0e-137
Identity = 243/612 (39.71%), Postives = 375/612 (61.27%), Query Frame = 0

Query: 58  DPSLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLV-- 117
           D ++F  +++SC  +  L  G+ VH  ++  G   D +  N L+N+Y+KL  + S +   
Sbjct: 104 DHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVG 163

Query: 118 -LFSQMPRR--NIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFN 177
            +F +MP+R  N    ++  +  +    ++S +++F+ M  +++ ++N +IAG  Q    
Sbjct: 164 NVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMY 223

Query: 178 EQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSL 237
           E AL + +EM      PD FTL SVL   +    +I G+E+H  +++ G + ++ +GSSL
Sbjct: 224 EDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSL 283

Query: 238 AHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDK 297
             MY KS  + D E++   +  R+ ++WN+L+AG  Q+GR  E L  +  M  +  +P  
Sbjct: 284 VDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGA 343

Query: 298 ITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLE 357
           + F SVI AC+ LATL  G+Q+H  V++ G GS + + S+L+ MYS+ G ++ + K+F  
Sbjct: 344 VAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDR 403

Query: 358 REDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKG 417
               D V W+A+I  +  HG G EA+ LF +M+   +K N+V F+ +L ACSH GL ++ 
Sbjct: 404 MNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEA 463

Query: 418 TEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACK 477
             Y + M K Y L   +EHY  V DLLGRAG LEEA   I  M V+  G +W TLLS+C 
Sbjct: 464 WGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCS 523

Query: 478 LHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGI 537
           +HKN E+A++++E+I  +D  +  +YVL+ N++AS   W +++++R  MR + +RK+P  
Sbjct: 524 VHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPAC 583

Query: 538 SWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYN 597
           SW+E+KN  H F  G +SHP   +I+ +LK +M +++  GYV D + VLHD+D E K   
Sbjct: 584 SWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKREL 643

Query: 598 LAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHH 657
           L  HSE+LA+AF ++NT     IRV KN+R+C DCH AIK ISKI  REIIVRD SRFHH
Sbjct: 644 LFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHH 703

Query: 658 FKDGECSCRNYW 665
           F  G CSC +YW
Sbjct: 704 FNRGNCSCGDYW 715

BLAST of Cla97C07G130920 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.8e-135
Identity = 243/608 (39.97%), Postives = 362/608 (59.54%), Query Frame = 0

Query: 57  SDPSLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVL 116
           +D   ++ L++ C     L  G+ VH+ ++ S    D  + N LLN+Y+K          
Sbjct: 58  ADRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAK---------- 117

Query: 117 FSQMPRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQAL 176
                                 G LE A+K+F++M +R+  TW  +I+G +Q +    AL
Sbjct: 118 ---------------------CGSLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDAL 177

Query: 177 SLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMY 236
             F +M   G+ P+EFTL SV++  A  R    G ++H   +KCGF+ N+ VGS+L  +Y
Sbjct: 178 LFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLY 237

Query: 237 MKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFV 296
            + G + D + +  ++  RN V+WN LIAG A+   +E+ L  +  M   GFRP   ++ 
Sbjct: 238 TRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYA 297

Query: 297 SVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDS 356
           S+  ACS    L QG+ +HA +IK+G   V    ++L+ MY++SG + D+ K+F      
Sbjct: 298 SLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKR 357

Query: 357 DVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYL 416
           DVV W++++ AY  HG G+EA+  F +M  + I+ NE++FL +L ACSH GL ++G  Y 
Sbjct: 358 DVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYY 417

Query: 417 DLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKN 476
           +LM KK  + P   HY  VVDLLGRAG L  A   I  MP++    IWK LL+AC++HKN
Sbjct: 418 ELM-KKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKN 477

Query: 477 AEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLE 536
            E+    +E + +LDP D   +V+L NI+AS   W D + +RK M++  V+KEP  SW+E
Sbjct: 478 TELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVE 537

Query: 537 LKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQH 596
           ++N++H F    + HPQ  EI    +E+++++K  GYVPD + V+  +D +E+E NL  H
Sbjct: 538 IENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYH 597

Query: 597 SEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDG 656
           SEK+A+AFAL+NTP    I + KN+RVC DCH AIK  SK+  REIIVRD +RFHHFKDG
Sbjct: 598 SEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDG 633

Query: 657 ECSCRNYW 665
            CSC++YW
Sbjct: 658 NCSCKDYW 633

BLAST of Cla97C07G130920 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 2.4e-135
Identity = 240/605 (39.67%), Postives = 375/605 (61.98%), Query Frame = 0

Query: 62  FSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMP 121
           F  +++SC K  +   G+Q+H  V+  GC  D ++   L+++Y + G L+ +  +F + P
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 122 RRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKE 181
            R+++SY  LI GY   G +E+AQKLFDE+  +++ +WNAMI+G  +    ++AL LFK+
Sbjct: 197 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 256

Query: 182 MNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGS 241
           M      PDE T+ +V+  CA   S+  G++VH  +   GF  NL + ++L  +Y K G 
Sbjct: 257 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 316

Query: 242 LSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVISA 301
           L     L + +P ++V++WNTLI G       +E L  +  M  SG  P+ +T +S++ A
Sbjct: 317 LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPA 376

Query: 302 CSELATLGQGQQIHAEVIK--AGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVV 361
           C+ L  +  G+ IH  + K   G  +  ++ +SLI MY++ G +E + +VF       + 
Sbjct: 377 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 436

Query: 362 LWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLM 421
            W+AMI  +  HGR + + +LF +M  + I+ +++TF+ LL ACSH G+ + G      M
Sbjct: 437 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 496

Query: 422 VKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEM 481
            + YK+ P++EHY C++DLLG +G  +EAE MI  M ++ DG+IW +LL ACK+H N E+
Sbjct: 497 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 556

Query: 482 AKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKN 541
            +  +E ++K++P +  SYVLLSNI+ASA  W +V++ R  + D+ ++K PG S +E+ +
Sbjct: 557 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 616

Query: 542 SVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEK 601
            VH+F +G K HP+  EI   L+E+   L+  G+VPD + VL +M+ E KE  L  HSEK
Sbjct: 617 VVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEK 676

Query: 602 LAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECS 661
           LAIAF L++T     + ++KNLRVC +CH A K ISKI  REII RD +RFHHF+DG CS
Sbjct: 677 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 736

Query: 662 CRNYW 665
           C +YW
Sbjct: 737 CNDYW 741

BLAST of Cla97C07G130920 vs. ExPASy TrEMBL
Match: A0A6J1HWJ8 (pentatricopeptide repeat-containing protein At2g41080 OS=Cucurbita maxima OX=3661 GN=LOC111468145 PE=3 SV=1)

HSP 1 Score: 1221.1 bits (3158), Expect = 0.0e+00
Identity = 609/664 (91.72%), Postives = 630/664 (94.88%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MGKP RSFDA F PL + TVR LSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGKPSRSFDALFNPLHALTVRLLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFSQM
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCAYDKFISNHLLNLYSKLGNLKSSLVLFSQM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQALSLF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALSLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLGFLPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGFLPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK FL+RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFLDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHK AEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKKAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGYVPD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYVPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. ExPASy TrEMBL
Match: A0A6J1GM53 (pentatricopeptide repeat-containing protein At2g41080 OS=Cucurbita moschata OX=3662 GN=LOC111455573 PE=3 SV=1)

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 605/664 (91.11%), Postives = 628/664 (94.58%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MG+P RSFDA F PL +  VRSLSTKT SASLQEFTSLC GG I QAY  FK EIWSDPS
Sbjct: 1   MGQPSRSFDALFNPLHALAVRSLSTKTSSASLQEFTSLCSGGRITQAYERFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCIKL SLFGG+QVHSLVITSGC+ DKFISNHLLNLYSKLG+LKSSLVLFS M
Sbjct: 61  LFSHLLQSCIKLGSLFGGEQVHSLVITSGCANDKFISNHLLNLYSKLGNLKSSLVLFSHM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFE+NEQAL LF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEYNEQALGLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLG LPDEFTLGSVLRGCAGLRSL+AGQEVHACLMKCGFELNLVVGSSLAHMYMKSG
Sbjct: 181 EMYGLGILPDEFTLGSVLRGCAGLRSLLAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEKLIKSMPIRNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVK F++RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKTFVDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEEAIELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEAIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           ++YKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA
Sbjct: 421 EQYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISEEILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDRNV+KEPGISWLELKN 
Sbjct: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRNVKKEPGISWLELKNL 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQFSMG KSHPQYLEIDSYLKELMSE+KLHGYVPD+ SVLHDMDNEEKEYNLA HSEK 
Sbjct: 541 VHQFSMGDKSHPQYLEIDSYLKELMSEMKLHGYVPDIGSVLHDMDNEEKEYNLAHHSEKF 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMN PE VPIRVMKNLRVCNDCH AIKCISKIRNREIIVRD SRFHHFKDGECSC
Sbjct: 601 AIAFALMNIPEGVPIRVMKNLRVCNDCHEAIKCISKIRNREIIVRDTSRFHHFKDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. ExPASy TrEMBL
Match: A0A1S3C1T8 (pentatricopeptide repeat-containing protein At2g41080 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495490 PE=3 SV=1)

HSP 1 Score: 1184.5 bits (3063), Expect = 0.0e+00
Identity = 591/665 (88.87%), Postives = 623/665 (93.68%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTK-TPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDP 60
           MGKP  SF+AF  P  SFTVRSLS K + SASLQEFTSLC+ G IRQAY+TFK EIWSDP
Sbjct: 1   MGKPSGSFNAFLNPFYSFTVRSLSMKISSSASLQEFTSLCNDGRIRQAYDTFKVEIWSDP 60

Query: 61  SLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQ 120
           SLFSHL+QSCIKL SLFGGKQVHSL+ITSG S+DKFISNHLLNLYSKLG  KSSLVLFS 
Sbjct: 61  SLFSHLLQSCIKLGSLFGGKQVHSLIITSGGSKDKFISNHLLNLYSKLGQFKSSLVLFSN 120

Query: 121 MPRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLF 180
           MPRRN+MS+NILI+GYLQLGDLE+AQKLFDEMSERNIATWNAMIAGLTQFEFN+QALSLF
Sbjct: 121 MPRRNLMSFNILINGYLQLGDLENAQKLFDEMSERNIATWNAMIAGLTQFEFNKQALSLF 180

Query: 181 KEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKS 240
           KEM GLGFLPDEFTLGSVLRGCAGLRSL+AGQEVHACL+KCGFEL+ VVGSSLAHMY+KS
Sbjct: 181 KEMYGLGFLPDEFTLGSVLRGCAGLRSLLAGQEVHACLLKCGFELSSVVGSSLAHMYIKS 240

Query: 241 GSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVI 300
           GSLSDGEKLIKSMPIR VVAWNTLIAGKAQ+G  EEVLNQYNMMKM+GFRPDKITFVSV+
Sbjct: 241 GSLSDGEKLIKSMPIRTVVAWNTLIAGKAQNGCPEEVLNQYNMMKMAGFRPDKITFVSVL 300

Query: 301 SACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVV 360
           SACSELATLGQGQQIHAEVIKAGA SV+AV+SSLISMYSRSGCLEDS+K F++RED DVV
Sbjct: 301 SACSELATLGQGQQIHAEVIKAGASSVLAVISSLISMYSRSGCLEDSIKAFVDREDFDVV 360

Query: 361 LWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLM 420
           LWS+MIAAYGFHGRGEEA+ELFHQMEDLK++ANEVTFL LLYACSH GLKEKGTEYLDLM
Sbjct: 361 LWSSMIAAYGFHGRGEEAVELFHQMEDLKMEANEVTFLSLLYACSHSGLKEKGTEYLDLM 420

Query: 421 VKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEM 480
           VKKYKLKPRIEHYTCVVDLLGRAG LEEAEGMIRSMPVK DGIIWKTLL+ACKLHK AEM
Sbjct: 421 VKKYKLKPRIEHYTCVVDLLGRAGRLEEAEGMIRSMPVKPDGIIWKTLLAACKLHKEAEM 480

Query: 481 AKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKN 540
           AKRISEEI+KLDPLDAASYVLLSNIHASARNW +VSEIRK MRDRNVRKEPGISWLELKN
Sbjct: 481 AKRISEEIIKLDPLDAASYVLLSNIHASARNWPNVSEIRKAMRDRNVRKEPGISWLELKN 540

Query: 541 SVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEK 600
            VHQFSMG KSHPQY EID YLKELMSELK HGYVPDL SVLHDMDNEEKEYNLA HSEK
Sbjct: 541 LVHQFSMGDKSHPQYFEIDLYLKELMSELKRHGYVPDLGSVLHDMDNEEKEYNLAHHSEK 600

Query: 601 LAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECS 660
            AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCIS+IRNREIIVRDASRFHHFKDGECS
Sbjct: 601 FAIAFALMNTSENVPIRVMKNLRVCNDCHNAIKCISRIRNREIIVRDASRFHHFKDGECS 660

Query: 661 CRNYW 665
           C NYW
Sbjct: 661 CGNYW 665

BLAST of Cla97C07G130920 vs. ExPASy TrEMBL
Match: A0A6J1D3P8 (pentatricopeptide repeat-containing protein At2g41080 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016714 PE=3 SV=1)

HSP 1 Score: 1176.4 bits (3042), Expect = 0.0e+00
Identity = 582/664 (87.65%), Postives = 621/664 (93.52%), Query Frame = 0

Query: 1   MGKPGRSFDAFFIPLRSFTVRSLSTKTPSASLQEFTSLCDGGGIRQAYNTFKYEIWSDPS 60
           MGKP  SF A F PL +  VR LSTKT S  LQEFT+LC+GG IRQAY +FK EIWSDPS
Sbjct: 1   MGKPILSFHALFNPLHAPAVRFLSTKTSSVYLQEFTNLCNGGRIRQAYESFKSEIWSDPS 60

Query: 61  LFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQM 120
           LFSHL+QSCI++ S+FGGKQ+HSLVITSGCS+DKFISNHLLNLYSKLG  K+SLVLFS M
Sbjct: 61  LFSHLLQSCIEIGSIFGGKQLHSLVITSGCSQDKFISNHLLNLYSKLGQFKTSLVLFSHM 120

Query: 121 PRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFK 180
           PRRNIMSYNILI+GYLQLGDLESAQKLFDEMSERNIATWNAMI GLTQFEFNEQALSLF+
Sbjct: 121 PRRNIMSYNILINGYLQLGDLESAQKLFDEMSERNIATWNAMITGLTQFEFNEQALSLFR 180

Query: 181 EMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSG 240
           EM GLGFLPDEFTLGSVLRGCAGLRS+ AGQEVHACLMKCGFELNLVVGSS+AHMYMKSG
Sbjct: 181 EMYGLGFLPDEFTLGSVLRGCAGLRSIRAGQEVHACLMKCGFELNLVVGSSVAHMYMKSG 240

Query: 241 SLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVIS 300
           SLSDGEK+IKSMP RNVVAWNTLIAGKAQ+G SEEVLNQYNMMKM+GFRPDKITFVSVIS
Sbjct: 241 SLSDGEKVIKSMPTRNVVAWNTLIAGKAQNGCSEEVLNQYNMMKMAGFRPDKITFVSVIS 300

Query: 301 ACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVVL 360
           ACSELATLGQGQQIHAE IKAGAGSVVAV+SSLISMYSRSGCL+DSVK FL+RED+DVVL
Sbjct: 301 ACSELATLGQGQQIHAEAIKAGAGSVVAVISSLISMYSRSGCLDDSVKAFLDREDADVVL 360

Query: 361 WSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLMV 420
           WSAMIAAYGFHGRGEE IELFHQME+LK++ANEVTFL LLYACSHCGLKEKGTEYLDLMV
Sbjct: 361 WSAMIAAYGFHGRGEEVIELFHQMEELKMEANEVTFLSLLYACSHCGLKEKGTEYLDLMV 420

Query: 421 KKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEMA 480
           KKYKLKPRIEHYTCVVDLLGRAG LEEAE  IRSMPVKADGIIWKTLLSACK+HK AEMA
Sbjct: 421 KKYKLKPRIEHYTCVVDLLGRAGRLEEAEATIRSMPVKADGIIWKTLLSACKIHKKAEMA 480

Query: 481 KRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKNS 540
           KRISE+ILKLDPLDAASYVLLSNIHASARNW DVSEIRK MRDR VRKEPGISWLELK++
Sbjct: 481 KRISEDILKLDPLDAASYVLLSNIHASARNWPDVSEIRKAMRDRQVRKEPGISWLELKST 540

Query: 541 VHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEKL 600
           VHQF+M  KSHP+YLEI+ YLKELM+E+KL GYVPD+ SVLHDMDNEEKEYNLA HSEKL
Sbjct: 541 VHQFNMSDKSHPKYLEIELYLKELMAEMKLQGYVPDIGSVLHDMDNEEKEYNLAHHSEKL 600

Query: 601 AIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECSC 660
           AIAFALMNTPE+ PIRVMKNLRVC+DCH+AIKCISKIRNREIIVRDASRFHHF+DGECSC
Sbjct: 601 AIAFALMNTPESAPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFRDGECSC 660

Query: 661 RNYW 665
            NYW
Sbjct: 661 GNYW 664

BLAST of Cla97C07G130920 vs. ExPASy TrEMBL
Match: A0A5A7SLF2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003190 PE=3 SV=1)

HSP 1 Score: 1157.9 bits (2994), Expect = 0.0e+00
Identity = 573/636 (90.09%), Postives = 604/636 (94.97%), Query Frame = 0

Query: 29  SASLQEFTSLCDGGGIRQAYNTFKYEIWSDPSLFSHLIQSCIKLSSLFGGKQVHSLVITS 88
           SASLQEFTSLC+ G IRQAY+TFK+EIWSDPSLFSHL+QSCIKL SLFGGKQVHSL+ITS
Sbjct: 6   SASLQEFTSLCNDGRIRQAYDTFKFEIWSDPSLFSHLLQSCIKLGSLFGGKQVHSLIITS 65

Query: 89  GCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQLGDLESAQKLF 148
           G S+DKFISNHLLNLYSKLG  KSSLVLFS MPRRN+MS+NILI+GYLQLGDLE+AQKLF
Sbjct: 66  GGSKDKFISNHLLNLYSKLGQFKSSLVLFSNMPRRNVMSFNILINGYLQLGDLENAQKLF 125

Query: 149 DEMSERNIATWNAMIAGLTQFEFNEQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLI 208
           DEMSERNIATWNAMIAGLTQFEFN+QALSLFKEM GLGFLPDEFTLGSVLRGCAGLRSL+
Sbjct: 126 DEMSERNIATWNAMIAGLTQFEFNKQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLL 185

Query: 209 AGQEVHACLMKCGFELNLVVGSSLAHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKA 268
           AGQEVHACL+KCGFEL+ VVGSSLAHMY+KSGSLSDGEKLIKSMPIR VVAWNTLIAGKA
Sbjct: 186 AGQEVHACLLKCGFELSSVVGSSLAHMYIKSGSLSDGEKLIKSMPIRTVVAWNTLIAGKA 245

Query: 269 QHGRSEEVLNQYNMMKMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVA 328
           Q+G  EEVLNQYNMMKM+GFRPDKITFVSV+SACSELATLGQGQQIHAEVIKAGA SV+A
Sbjct: 246 QNGCPEEVLNQYNMMKMAGFRPDKITFVSVLSACSELATLGQGQQIHAEVIKAGASSVLA 305

Query: 329 VVSSLISMYSRSGCLEDSVKVFLEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLK 388
           V+SSLISMYSRSGCLEDS+K F++RED DVVLWS+MIAAYGFHGRGEEA+ELFHQMEDLK
Sbjct: 306 VISSLISMYSRSGCLEDSIKAFVDREDFDVVLWSSMIAAYGFHGRGEEAVELFHQMEDLK 365

Query: 389 IKANEVTFLILLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEA 448
           ++ANEVTFL LLYACSH GLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAG LEEA
Sbjct: 366 MEANEVTFLSLLYACSHSGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGRLEEA 425

Query: 449 EGMIRSMPVKADGIIWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASA 508
           EGMIRSMPVK DGIIWKTLL+ACKLHK AEMAKRISEEI+KLDPLDAASYVLLSNIHASA
Sbjct: 426 EGMIRSMPVKPDGIIWKTLLAACKLHKEAEMAKRISEEIIKLDPLDAASYVLLSNIHASA 485

Query: 509 RNWLDVSEIRKTMRDRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSEL 568
           RNW +VSEIRK MRDRNVRKEPGISWLELKN VHQFSMG KSHPQY EID YLKELMSEL
Sbjct: 486 RNWPNVSEIRKAMRDRNVRKEPGISWLELKNLVHQFSMGDKSHPQYFEIDLYLKELMSEL 545

Query: 569 KLHGYVPDLASVLHDMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCH 628
           K HGYVPDL SVLHDMDNEEKEYNLA HSEK AIAFALMNT ENVPIRVMKNLRVCNDCH
Sbjct: 546 KRHGYVPDLGSVLHDMDNEEKEYNLAHHSEKFAIAFALMNTSENVPIRVMKNLRVCNDCH 605

Query: 629 NAIKCISKIRNREIIVRDASRFHHFKDGECSCRNYW 665
           NAIKCIS+IRNREIIVRDASRFHHFKDGECSC NYW
Sbjct: 606 NAIKCISRIRNREIIVRDASRFHHFKDGECSCGNYW 641

BLAST of Cla97C07G130920 vs. TAIR 10
Match: AT2G41080.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 765.4 bits (1975), Expect = 3.8e-221
Identity = 371/565 (65.66%), Postives = 458/565 (81.06%), Query Frame = 0

Query: 101 LNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWN 160
           +++YSKLG   S++ ++ +M ++N MS NILI+GY++ GDL +A+K+FDEM +R + TWN
Sbjct: 1   MSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYVRAGDLVNARKVFDEMPDRKLTTWN 60

Query: 161 AMIAGLTQFEFNEQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKC 220
           AMIAGL QFEFNE+ LSLF+EM+GLGF PDE+TLGSV  G AGLRS+  GQ++H   +K 
Sbjct: 61  AMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGSVFSGSAGLRSVSIGQQIHGYTIKY 120

Query: 221 GFELNLVVGSSLAHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQY 280
           G EL+LVV SSLAHMYM++G L DGE +I+SMP+RN+VAWNTLI G AQ+G  E VL  Y
Sbjct: 121 GLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVLYLY 180

Query: 281 NMMKMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRS 340
            MMK+SG RP+KITFV+V+S+CS+LA  GQGQQIHAE IK GA SVVAVVSSLISMYS+ 
Sbjct: 181 KMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMYSKC 240

Query: 341 GCLEDSVKVFLEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQM-EDLKIKANEVTFLIL 400
           GCL D+ K F ERED D V+WS+MI+AYGFHG+G+EAIELF+ M E   ++ NEV FL L
Sbjct: 241 GCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDEAIELFNTMAEQTNMEINEVAFLNL 300

Query: 401 LYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKA 460
           LYACSH GLK+KG E  D+MV+KY  KP ++HYTCVVDLLGRAGCL++AE +IRSMP+K 
Sbjct: 301 LYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGCLDQAEAIIRSMPIKT 360

Query: 461 DGIIWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRK 520
           D +IWKTLLSAC +HKNAEMA+R+ +EIL++DP D+A YVLL+N+HASA+ W DVSE+RK
Sbjct: 361 DIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDSACYVLLANVHASAKRWRDVSEVRK 420

Query: 521 TMRDRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLAS 580
           +MRD+NV+KE GISW E K  VHQF MG +S  +  EI SYLKEL  E+KL GY PD AS
Sbjct: 421 SMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPDTAS 480

Query: 581 VLHDMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRN 640
           VLHDMD EEKE +L QHSEKLA+AFALM  PE  PIR++KNLRVC+DCH A K IS I+N
Sbjct: 481 VLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPIRIIKNLRVCSDCHVAFKYISVIKN 540

Query: 641 REIIVRDASRFHHFKDGECSCRNYW 665
           REI +RD SRFHHF +G+CSC +YW
Sbjct: 541 REITLRDGSRFHHFINGKCSCGDYW 565

BLAST of Cla97C07G130920 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 503.1 bits (1294), Expect = 3.5e-142
Identity = 254/622 (40.84%), Postives = 387/622 (62.22%), Query Frame = 0

Query: 78  GKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQ 137
           GK+VHS ++  G   +  +SN LLN+Y+K G    +  +F +M  R+I S+N +I  ++Q
Sbjct: 165 GKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQ 224

Query: 138 LGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKEMNGLGFL-PDEFTLGS 197
           +G ++ A   F++M+ER+I TWN+MI+G  Q  ++ +AL +F +M     L PD FTL S
Sbjct: 225 VGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLAS 284

Query: 198 VLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGSLSDGEKLIK------ 257
           VL  CA L  L  G+++H+ ++  GF+++ +V ++L  MY + G +    +LI+      
Sbjct: 285 VLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKD 344

Query: 258 ---------------------------SMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMM 317
                                      S+  R+VVAW  +I G  QHG   E +N +  M
Sbjct: 345 LKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM 404

Query: 318 KMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCL 377
              G RP+  T  +++S  S LA+L  G+QIH   +K+G    V+V ++LI+MY+++G +
Sbjct: 405 VGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNI 464

Query: 378 EDSVKVF-LEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYA 437
             + + F L R + D V W++MI A   HG  EEA+ELF  M    ++ + +T++ +  A
Sbjct: 465 TSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSA 524

Query: 438 CSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGI 497
           C+H GL  +G +Y D+M    K+ P + HY C+VDL GRAG L+EA+  I  MP++ D +
Sbjct: 525 CTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVV 584

Query: 498 IWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMR 557
            W +LLSAC++HKN ++ K  +E +L L+P ++ +Y  L+N++++   W + ++IRK+M+
Sbjct: 585 TWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMK 644

Query: 558 DRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLH 617
           D  V+KE G SW+E+K+ VH F +   +HP+  EI   +K++  E+K  GYVPD ASVLH
Sbjct: 645 DGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLH 704

Query: 618 DMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREI 665
           D++ E KE  L  HSEKLAIAF L++TP+   +R+MKNLRVCNDCH AIK ISK+  REI
Sbjct: 705 DLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREI 764

BLAST of Cla97C07G130920 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 491.1 bits (1263), Expect = 1.4e-138
Identity = 243/612 (39.71%), Postives = 375/612 (61.27%), Query Frame = 0

Query: 58  DPSLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLV-- 117
           D ++F  +++SC  +  L  G+ VH  ++  G   D +  N L+N+Y+KL  + S +   
Sbjct: 104 DHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVG 163

Query: 118 -LFSQMPRR--NIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFN 177
            +F +MP+R  N    ++  +  +    ++S +++F+ M  +++ ++N +IAG  Q    
Sbjct: 164 NVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMY 223

Query: 178 EQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSL 237
           E AL + +EM      PD FTL SVL   +    +I G+E+H  +++ G + ++ +GSSL
Sbjct: 224 EDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSL 283

Query: 238 AHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDK 297
             MY KS  + D E++   +  R+ ++WN+L+AG  Q+GR  E L  +  M  +  +P  
Sbjct: 284 VDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGA 343

Query: 298 ITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSRSGCLEDSVKVFLE 357
           + F SVI AC+ LATL  G+Q+H  V++ G GS + + S+L+ MYS+ G ++ + K+F  
Sbjct: 344 VAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDR 403

Query: 358 REDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKG 417
               D V W+A+I  +  HG G EA+ LF +M+   +K N+V F+ +L ACSH GL ++ 
Sbjct: 404 MNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEA 463

Query: 418 TEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACK 477
             Y + M K Y L   +EHY  V DLLGRAG LEEA   I  M V+  G +W TLLS+C 
Sbjct: 464 WGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCS 523

Query: 478 LHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGI 537
           +HKN E+A++++E+I  +D  +  +YVL+ N++AS   W +++++R  MR + +RK+P  
Sbjct: 524 VHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPAC 583

Query: 538 SWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYN 597
           SW+E+KN  H F  G +SHP   +I+ +LK +M +++  GYV D + VLHD+D E K   
Sbjct: 584 SWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKREL 643

Query: 598 LAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHH 657
           L  HSE+LA+AF ++NT     IRV KN+R+C DCH AIK ISKI  REIIVRD SRFHH
Sbjct: 644 LFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHH 703

Query: 658 FKDGECSCRNYW 665
           F  G CSC +YW
Sbjct: 704 FNRGNCSCGDYW 715

BLAST of Cla97C07G130920 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 484.2 bits (1245), Expect = 1.7e-136
Identity = 240/605 (39.67%), Postives = 375/605 (61.98%), Query Frame = 0

Query: 62  FSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNHLLNLYSKLGHLKSSLVLFSQMP 121
           F  +++SC K  +   G+Q+H  V+  GC  D ++   L+++Y + G L+ +  +F + P
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 122 RRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNEQALSLFKE 181
            R+++SY  LI GY   G +E+AQKLFDE+  +++ +WNAMI+G  +    ++AL LFK+
Sbjct: 197 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 256

Query: 182 MNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMKCGFELNLVVGSSLAHMYMKSGS 241
           M      PDE T+ +V+  CA   S+  G++VH  +   GF  NL + ++L  +Y K G 
Sbjct: 257 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 316

Query: 242 LSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQYNMMKMSGFRPDKITFVSVISA 301
           L     L + +P ++V++WNTLI G       +E L  +  M  SG  P+ +T +S++ A
Sbjct: 317 LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPA 376

Query: 302 CSELATLGQGQQIHAEVIK--AGAGSVVAVVSSLISMYSRSGCLEDSVKVFLEREDSDVV 361
           C+ L  +  G+ IH  + K   G  +  ++ +SLI MY++ G +E + +VF       + 
Sbjct: 377 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 436

Query: 362 LWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLILLYACSHCGLKEKGTEYLDLM 421
            W+AMI  +  HGR + + +LF +M  + I+ +++TF+ LL ACSH G+ + G      M
Sbjct: 437 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 496

Query: 422 VKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKADGIIWKTLLSACKLHKNAEM 481
            + YK+ P++EHY C++DLLG +G  +EAE MI  M ++ DG+IW +LL ACK+H N E+
Sbjct: 497 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 556

Query: 482 AKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRKTMRDRNVRKEPGISWLELKN 541
            +  +E ++K++P +  SYVLLSNI+ASA  W +V++ R  + D+ ++K PG S +E+ +
Sbjct: 557 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 616

Query: 542 SVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLASVLHDMDNEEKEYNLAQHSEK 601
            VH+F +G K HP+  EI   L+E+   L+  G+VPD + VL +M+ E KE  L  HSEK
Sbjct: 617 VVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEK 676

Query: 602 LAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRNREIIVRDASRFHHFKDGECS 661
           LAIAF L++T     + ++KNLRVC +CH A K ISKI  REII RD +RFHHF+DG CS
Sbjct: 677 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 736

Query: 662 CRNYW 665
           C +YW
Sbjct: 737 CNDYW 741

BLAST of Cla97C07G130920 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 480.7 bits (1236), Expect = 1.9e-135
Identity = 247/625 (39.52%), Postives = 374/625 (59.84%), Query Frame = 0

Query: 44   IRQAYNTFKY----EIWSDPSLFSHLIQSCIKLSSLFGGKQVHSLVITSGCSRDKFISNH 103
            +R ++  F+     EI  +   +  ++++CI+L  L  G+Q+HS +I          +N 
Sbjct: 471  LRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIK---------TNF 530

Query: 104  LLNLYSKLGHLKSSLVLFSQMPRRNIMSYNILIDGYLQLGDLESAQKLFDEMSERNIATW 163
             LN Y                        ++LID Y +LG L++A  +    + +++ +W
Sbjct: 531  QLNAY----------------------VCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSW 590

Query: 164  NAMIAGLTQFEFNEQALSLFKEMNGLGFLPDEFTLGSVLRGCAGLRSLIAGQEVHACLMK 223
              MIAG TQ+ F+++AL+ F++M   G   DE  L + +  CAGL++L  GQ++HA    
Sbjct: 591  TTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACV 650

Query: 224  CGFELNLVVGSSLAHMYMKSGSLSDGEKLIKSMPIRNVVAWNTLIAGKAQHGRSEEVLNQ 283
             GF  +L   ++L  +Y + G + +     +     + +AWN L++G  Q G +EE L  
Sbjct: 651  SGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRV 710

Query: 284  YNMMKMSGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGAGSVVAVVSSLISMYSR 343
            +  M   G   +  TF S + A SE A + QG+Q+HA + K G  S   V ++LISMY++
Sbjct: 711  FVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAK 770

Query: 344  SGCLEDSVKVFLEREDSDVVLWSAMIAAYGFHGRGEEAIELFHQMEDLKIKANEVTFLIL 403
             G + D+ K FLE    + V W+A+I AY  HG G EA++ F QM    ++ N VT + +
Sbjct: 771  CGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGV 830

Query: 404  LYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLEEAEGMIRSMPVKA 463
            L ACSH GL +KG  Y + M  +Y L P+ EHY CVVD+L RAG L  A+  I+ MP+K 
Sbjct: 831  LSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKP 890

Query: 464  DGIIWKTLLSACKLHKNAEMAKRISEEILKLDPLDAASYVLLSNIHASARNWLDVSEIRK 523
            D ++W+TLLSAC +HKN E+ +  +  +L+L+P D+A+YVLLSN++A ++ W      R+
Sbjct: 891  DALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQ 950

Query: 524  TMRDRNVRKEPGISWLELKNSVHQFSMGGKSHPQYLEIDSYLKELMSELKLHGYVPDLAS 583
             M+++ V+KEPG SW+E+KNS+H F +G ++HP   EI  Y ++L       GYV D  S
Sbjct: 951  KMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFS 1010

Query: 584  VLHDMDNEEKEYNLAQHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISKIRN 643
            +L+++ +E+K+  +  HSEKLAI+F L++ P  VPI VMKNLRVCNDCH  IK +SK+ N
Sbjct: 1011 LLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVSN 1064

Query: 644  REIIVRDASRFHHFKDGECSCRNYW 665
            REIIVRDA RFHHF+ G CSC++YW
Sbjct: 1071 REIIVRDAYRFHHFEGGACSCKDYW 1064

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023553764.10.0e+0091.42pentatricopeptide repeat-containing protein At2g41080 [Cucurbita pepo subsp. pep... [more]
XP_022969026.10.0e+0091.72pentatricopeptide repeat-containing protein At2g41080 [Cucurbita maxima][more]
KAG7011986.10.0e+0091.42Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022953052.10.0e+0091.11pentatricopeptide repeat-containing protein At2g41080 [Cucurbita moschata] >KAG6... [more]
XP_038887926.10.0e+0089.16pentatricopeptide repeat-containing protein At2g41080 isoform X1 [Benincasa hisp... [more]
Match NameE-valueIdentityDescription
Q8S9M41.8e-23962.87Pentatricopeptide repeat-containing protein At2g41080 OS=Arabidopsis thaliana OX... [more]
Q9SHZ85.0e-14140.84Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q9LW632.0e-13739.71Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9LIQ71.8e-13539.97Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
Q9LN012.4e-13539.67Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1HWJ80.0e+0091.72pentatricopeptide repeat-containing protein At2g41080 OS=Cucurbita maxima OX=366... [more]
A0A6J1GM530.0e+0091.11pentatricopeptide repeat-containing protein At2g41080 OS=Cucurbita moschata OX=3... [more]
A0A1S3C1T80.0e+0088.87pentatricopeptide repeat-containing protein At2g41080 isoform X1 OS=Cucumis melo... [more]
A0A6J1D3P80.0e+0087.65pentatricopeptide repeat-containing protein At2g41080 isoform X1 OS=Momordica ch... [more]
A0A5A7SLF20.0e+0090.09Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT2G41080.13.8e-22165.66Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G22070.13.5e-14240.84pentatricopeptide (PPR) repeat-containing protein [more]
AT3G23330.11.4e-13839.71Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.11.7e-13639.67Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G13650.11.9e-13539.52Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 127..156
e-value: 9.4E-7
score: 26.6
coord: 359..392
e-value: 3.4E-5
score: 21.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 124..152
e-value: 8.3E-7
score: 28.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 155..202
e-value: 4.8E-9
score: 36.3
coord: 357..404
e-value: 1.7E-10
score: 41.0
coord: 255..302
e-value: 8.1E-11
score: 42.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 431..456
e-value: 0.011
score: 15.9
coord: 331..352
e-value: 0.063
score: 13.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 256..290
score: 10.939435
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 357..391
score: 10.862706
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 124..158
score: 12.035565
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 530..654
e-value: 1.4E-41
score: 141.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 37..172
e-value: 1.4E-21
score: 79.2
coord: 173..323
e-value: 1.3E-27
score: 99.0
coord: 328..556
e-value: 1.4E-36
score: 128.5
NoneNo IPR availablePANTHERPTHR24015:SF1956OS03G0816600 PROTEINcoord: 19..654
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 19..654

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C07G130920.1Cla97C07G130920.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding