CaUC01G016520 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G016520
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Ciama_Chr01: 29964192 .. 29966114 (-)
RNA-Seq Expression: CaUC01G016520
Synteny: CaUC01G016520
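
The Location coordinates can be checked against the sequences listed below with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the function name is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Location field (Ciama_Chr01, minus strand).
gene_len = span_length(29964192, 29966114)

# Split the span into whole codons; a remainder of 0 means it is a multiple of 3.
codons, remainder = divmod(gene_len, 3)
print(gene_len, codons, remainder)
```

Under that assumption the span is 1923 bp, or 641 codons with no remainder, which is consistent with an intronless gene encoding a 640-residue protein plus a stop codon.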
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATCTTCTTCCTTTTTCACAGCTTCACTTCCATCGCCGCCGCCAATTTTCACCACTAACCGTTCATCTGTCTTACCATCTTCTTCTTCTACCGCCAGAACCTCCGATCGTTCCCAACAAGTAGAAAGGTTCGCCTTACTAATCGATAAATCGAAATCCGTCGACCGCCTGCTTCAAATCCACGCTTCTCTTCTCCGCCACGGGCTTTACCACAACCCCATCTTAAATTTCAAGCTCCAACGCTCATACGCCGCTCTCGGACGCCTTGATTACTCGGTGGCCGTTTTTGACACCTCCGATGAACCCAATGTCTTTTCATTTAGTGCAATTATCCACAGCCACGTACAATGTCGATTATTTGACCGAGCTCTCGGTTACTATTCGCAAATGCTAACCCGTGGTGTGGAGCCCAATGTTTTCACCTTCTCTTCAGTCTTGAAATCCTGCTCACTCGAACCCGGAAAAGTCCTCCATTGTCAAGCAATAAAACTCGGATTCGATTCTGATTTGTATGTCAGAACGGGACTGGTCGATGTATACGCGAGAGGGGGCGATGTGGTATGCGCACGCCAACTGTTTGATAAAATGCCTGAGAGAAGTTTGGTTTCTCAAACGGCCATGCTTGCTTGCTACGCGAAGCTGGGGGAGCTTGACGAGGCACGGGCTTTGTTCGACGGGATGAAACAGAGGGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTACGCGCAGAATGGTCTTCCCAATGAATCGTTGAAGCTTTTCAGGCGAATGTTGGTGGCGAAAGCTATGCCCAATGAAATAACAATTCTGGCTGTGCTTTCGGCCTGTGGGCAACTCGGAGCCCTCGAATCTGGAAGGTGGGTTCATTCGTATATTGAGAATAAGGGCATTCAGATAAACGTTCATGTGGGCACTGCACTTATTGATATGTATAGCAAATGTGGTCGCTTGGAAGATGCACGTTTGGTATTTGATGGAATTCGGGATAAGGATGTGGTCGCCTGGAATTCGATGATTGTTGGGTATTCCATGCATGGATTTAGCCAAAATGCATTGAAGTTGTTCGAGGAAATGACTGAGACCGGATACCAGCCAACTGATATCACTTTCATTGGCCTTTTGAGTGCCTGCAGCCATGGCGGCTTGGTGGAGGAGGGAAGGAGTTTCTTCAGATTGATGAGAGACAAATACCTAATCGAACCAAAGGTCGAGCATTATGGATGTATGGTTAATCTTCTAGGACGTGCGGGACATCTAGAAGAAGCATATGAACTTGTGAAAAACATGTCGATTGCAGCTGACCCGGTTTTGTGGGGATCATTACTAGGGTCTTGTAGACTTCACGGTAACATTAAATTGGGAGAGGAAATTGCTGAGTTTCTTGTTGATCAGAAGCTTGCAAATTCAGGGACGTATGTTCTTCTTTCCAATATATATGCTGCAACAGGCAATTGGGAAGGGGTTGCAAAGATGAGGACGTTGATGAAAGAGCATGGCATTGAAAAGGAGCCTGGTTGCAGCTCAATTGAAGTTAACAATAAGGTGCATGAGTTCCTTGCTGGTGAGAGGAAACATCCCAAAAGCAAAGAAATCTATGTGATGTTGAATGAGATAAATAGCTGGCTCAAAGCTCGCAGATACACACCCCAGATCGGTGTTGTTTTACATGATCTCGAGGAGGAACAGAAGAAGCAATCGCTCGAAGTTCACAGTGAGAAGCTTGCTATTGCCTTTGGGCTTATCAGTACTCAACCAGGCACCACTATCAAGATTGTGAAGAACCTCCGAGTTTGTTCAGATTGCCACGCTGTAATGAAGTTGATATCAGAAATCACTGGACGTAAAATTGTAATGAGGGACCGGAATCGTTTCCATCACTTTGAAAATGGATTATGTTCTTGTGGAGATTATTGGTGA

mRNA sequence

ATGTCATCTTCTTCCTTTTTCACAGCTTCACTTCCATCGCCGCCGCCAATTTTCACCACTAACCGTTCATCTGTCTTACCATCTTCTTCTTCTACCGCCAGAACCTCCGATCGTTCCCAACAAGTAGAAAGGTTCGCCTTACTAATCGATAAATCGAAATCCGTCGACCGCCTGCTTCAAATCCACGCTTCTCTTCTCCGCCACGGGCTTTACCACAACCCCATCTTAAATTTCAAGCTCCAACGCTCATACGCCGCTCTCGGACGCCTTGATTACTCGGTGGCCGTTTTTGACACCTCCGATGAACCCAATGTCTTTTCATTTAGTGCAATTATCCACAGCCACGTACAATGTCGATTATTTGACCGAGCTCTCGGTTACTATTCGCAAATGCTAACCCGTGGTGTGGAGCCCAATGTTTTCACCTTCTCTTCAGTCTTGAAATCCTGCTCACTCGAACCCGGAAAAGTCCTCCATTGTCAAGCAATAAAACTCGGATTCGATTCTGATTTGTATGTCAGAACGGGACTGGTCGATGTATACGCGAGAGGGGGCGATGTGGTATGCGCACGCCAACTGTTTGATAAAATGCCTGAGAGAAGTTTGGTTTCTCAAACGGCCATGCTTGCTTGCTACGCGAAGCTGGGGGAGCTTGACGAGGCACGGGCTTTGTTCGACGGGATGAAACAGAGGGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTACGCGCAGAATGGTCTTCCCAATGAATCGTTGAAGCTTTTCAGGCGAATGTTGGTGGCGAAAGCTATGCCCAATGAAATAACAATTCTGGCTGTGCTTTCGGCCTGTGGGCAACTCGGAGCCCTCGAATCTGGAAGGTGGGTTCATTCGTATATTGAGAATAAGGGCATTCAGATAAACGTTCATGTGGGCACTGCACTTATTGATATGTATAGCAAATGTGGTCGCTTGGAAGATGCACGTTTGGTATTTGATGGAATTCGGGATAAGGATGTGGTCGCCTGGAATTCGATGATTGTTGGGTATTCCATGCATGGATTTAGCCAAAATGCATTGAAGTTGTTCGAGGAAATGACTGAGACCGGATACCAGCCAACTGATATCACTTTCATTGGCCTTTTGAGTGCCTGCAGCCATGGCGGCTTGGTGGAGGAGGGAAGGAGTTTCTTCAGATTGATGAGAGACAAATACCTAATCGAACCAAAGGTCGAGCATTATGGATGTATGGTTAATCTTCTAGGACGTGCGGGACATCTAGAAGAAGCATATGAACTTGTGAAAAACATGTCGATTGCAGCTGACCCGGTTTTGTGGGGATCATTACTAGGGTCTTGTAGACTTCACGGTAACATTAAATTGGGAGAGGAAATTGCTGAGTTTCTTGTTGATCAGAAGCTTGCAAATTCAGGGACGTATGTTCTTCTTTCCAATATATATGCTGCAACAGGCAATTGGGAAGGGGTTGCAAAGATGAGGACGTTGATGAAAGAGCATGGCATTGAAAAGGAGCCTGGTTGCAGCTCAATTGAAGTTAACAATAAGGTGCATGAGTTCCTTGCTGGTGAGAGGAAACATCCCAAAAGCAAAGAAATCTATGTGATGTTGAATGAGATAAATAGCTGGCTCAAAGCTCGCAGATACACACCCCAGATCGGTGTTGTTTTACATGATCTCGAGGAGGAACAGAAGAAGCAATCGCTCGAAGTTCACAGTGAGAAGCTTGCTATTGCCTTTGGGCTTATCAGTACTCAACCAGGCACCACTATCAAGATTGTGAAGAACCTCCGAGTTTGTTCAGATTGCCACGCTGTAATGAAGTTGATATCAGAAATCACTGGACGTAAAATTGTAATGAGGGACCGGAATCGTTTCCATCACTTTGAAAATGGATTATGTTCTTGTGGAGATTATTGGTGA

Coding sequence (CDS)

ATGTCATCTTCTTCCTTTTTCACAGCTTCACTTCCATCGCCGCCGCCAATTTTCACCACTAACCGTTCATCTGTCTTACCATCTTCTTCTTCTACCGCCAGAACCTCCGATCGTTCCCAACAAGTAGAAAGGTTCGCCTTACTAATCGATAAATCGAAATCCGTCGACCGCCTGCTTCAAATCCACGCTTCTCTTCTCCGCCACGGGCTTTACCACAACCCCATCTTAAATTTCAAGCTCCAACGCTCATACGCCGCTCTCGGACGCCTTGATTACTCGGTGGCCGTTTTTGACACCTCCGATGAACCCAATGTCTTTTCATTTAGTGCAATTATCCACAGCCACGTACAATGTCGATTATTTGACCGAGCTCTCGGTTACTATTCGCAAATGCTAACCCGTGGTGTGGAGCCCAATGTTTTCACCTTCTCTTCAGTCTTGAAATCCTGCTCACTCGAACCCGGAAAAGTCCTCCATTGTCAAGCAATAAAACTCGGATTCGATTCTGATTTGTATGTCAGAACGGGACTGGTCGATGTATACGCGAGAGGGGGCGATGTGGTATGCGCACGCCAACTGTTTGATAAAATGCCTGAGAGAAGTTTGGTTTCTCAAACGGCCATGCTTGCTTGCTACGCGAAGCTGGGGGAGCTTGACGAGGCACGGGCTTTGTTCGACGGGATGAAACAGAGGGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTACGCGCAGAATGGTCTTCCCAATGAATCGTTGAAGCTTTTCAGGCGAATGTTGGTGGCGAAAGCTATGCCCAATGAAATAACAATTCTGGCTGTGCTTTCGGCCTGTGGGCAACTCGGAGCCCTCGAATCTGGAAGGTGGGTTCATTCGTATATTGAGAATAAGGGCATTCAGATAAACGTTCATGTGGGCACTGCACTTATTGATATGTATAGCAAATGTGGTCGCTTGGAAGATGCACGTTTGGTATTTGATGGAATTCGGGATAAGGATGTGGTCGCCTGGAATTCGATGATTGTTGGGTATTCCATGCATGGATTTAGCCAAAATGCATTGAAGTTGTTCGAGGAAATGACTGAGACCGGATACCAGCCAACTGATATCACTTTCATTGGCCTTTTGAGTGCCTGCAGCCATGGCGGCTTGGTGGAGGAGGGAAGGAGTTTCTTCAGATTGATGAGAGACAAATACCTAATCGAACCAAAGGTCGAGCATTATGGATGTATGGTTAATCTTCTAGGACGTGCGGGACATCTAGAAGAAGCATATGAACTTGTGAAAAACATGTCGATTGCAGCTGACCCGGTTTTGTGGGGATCATTACTAGGGTCTTGTAGACTTCACGGTAACATTAAATTGGGAGAGGAAATTGCTGAGTTTCTTGTTGATCAGAAGCTTGCAAATTCAGGGACGTATGTTCTTCTTTCCAATATATATGCTGCAACAGGCAATTGGGAAGGGGTTGCAAAGATGAGGACGTTGATGAAAGAGCATGGCATTGAAAAGGAGCCTGGTTGCAGCTCAATTGAAGTTAACAATAAGGTGCATGAGTTCCTTGCTGGTGAGAGGAAACATCCCAAAAGCAAAGAAATCTATGTGATGTTGAATGAGATAAATAGCTGGCTCAAAGCTCGCAGATACACACCCCAGATCGGTGTTGTTTTACATGATCTCGAGGAGGAACAGAAGAAGCAATCGCTCGAAGTTCACAGTGAGAAGCTTGCTATTGCCTTTGGGCTTATCAGTACTCAACCAGGCACCACTATCAAGATTGTGAAGAACCTCCGAGTTTGTTCAGATTGCCACGCTGTAATGAAGTTGATATCAGAAATCACTGGACGTAAAATTGTAATGAGGGACCGGAATCGTTTCCATCACTTTGAAAATGGATTATGTTCTTGTGGAGATTATTGGTGA

Protein sequence

MSSSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQINVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW
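
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database. The two test strings are the first 30 and last 15 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


print(translate("ATGTCATCTTCTTCCTTTTTCACAGCTTCA"))  # MSSSSFFTAS
print(translate("GGAGATTATTGGTGA"))                  # GDYW*
```

The outputs match the first ten residues (MSSSSFFTAS) and the last four residues (GDYW) of the protein sequence, followed by the TGA stop codon.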
Homology
BLAST of CaUC01G016520 vs. NCBI nr
Match: XP_038882525.1 (pentatricopeptide repeat-containing protein ELI1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 598/640 (93.44%), Positives = 621/640 (97.03%), Query Frame = 0

Query: 1   MSSSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQ 60
           M SSSF TAS P PPPIFTTNRSSVLPSSSSTARTSD  QQVERFALLIDKSKS+ RLLQ
Sbjct: 1   MPSSSFLTASPPPPPPIFTTNRSSVLPSSSSTARTSDPFQQVERFALLIDKSKSIPRLLQ 60

Query: 61  IHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRL 120
           IHASLLRH LYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQCRL
Sbjct: 61  IHASLLRHRLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCRL 120

Query: 121 FDRALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV 180
           FD+ALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV
Sbjct: 121 FDQALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV 180

Query: 181 YARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 240
           YARGGDVVCARQLFDKMPERSLVS TAML CYAKLGELDEARALFDGMK+RDVVCWNVMI
Sbjct: 181 YARGGDVVCARQLFDKMPERSLVSLTAMLTCYAKLGELDEARALFDGMKERDVVCWNVMI 240

Query: 241 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 300
           GGYAQNGLPNESLKLFRRMLVAKA+PNE+T+LAVLSACGQLGALESGRWVHSYIEN+GIQ
Sbjct: 241 GGYAQNGLPNESLKLFRRMLVAKAIPNEVTVLAVLSACGQLGALESGRWVHSYIENEGIQ 300

Query: 301 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 360
           INVHVGTAL+DMYSKCG LEDARL+FDGIRDKDVVAWNSMIVGY++HGFSQ+AL+LFEEM
Sbjct: 301 INVHVGTALVDMYSKCGSLEDARLIFDGIRDKDVVAWNSMIVGYALHGFSQHALQLFEEM 360

Query: 361 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 420
           TETGY+PTDITFIG+LSACSHGGLVEEGRSFFRLMRDKY IEPKVEHYGCMVNLLGRAGH
Sbjct: 361 TETGYRPTDITFIGILSACSHGGLVEEGRSFFRLMRDKYGIEPKVEHYGCMVNLLGRAGH 420

Query: 421 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 480
           LEEAYELVKNM+IAADPVLWG+LLGSCRLHGNIKLGEEIAEFLVDQ+LANSGTYVLLSN 
Sbjct: 421 LEEAYELVKNMTIAADPVLWGTLLGSCRLHGNIKLGEEIAEFLVDQRLANSGTYVLLSNT 480

Query: 481 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 540
           YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEF+AGERKHPKSKEIY+MLNEI
Sbjct: 481 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFVAGERKHPKSKEIYMMLNEI 540

Query: 541 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 600
           N WLKARRYTPQ  VVLHDL EEQK+QSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC
Sbjct: 541 NGWLKARRYTPQTDVVLHDLREEQKEQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 600

Query: 601 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
            DCHAVMKLISEIT RKIVMRDRNRFHHFE+GLCSCGDYW
Sbjct: 601 LDCHAVMKLISEITKRKIVMRDRNRFHHFEDGLCSCGDYW 640
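
The identity and positive counts in an HSP header can be related to the alignment rows themselves. The sketch below assumes the usual BLAST protein midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function names are hypothetical. The example midline is the last row of the alignment above.

```python
def count_midline(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline row."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")  # positives include identities
    return identities, positives, len(midline)


def percent(matches: int, aligned_length: int) -> float:
    """Percentage as shown in the HSP header, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


# Midline for query residues 601-640, including its leading space.
last_row = " DCHAVMKLISEIT RKIVMRDRNRFHHFE+GLCSCGDYW"
print(count_midline(last_row))                   # (37, 38, 40)
print(percent(598, 640), percent(621, 640))      # 93.44 97.03
```

Summing the per-row counts over all rows of an HSP should reproduce the header totals, provided the midline whitespace has been preserved exactly.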

BLAST of CaUC01G016520 vs. NCBI nr
Match: KAG6604154.1 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1184.5 bits (3063), Expect = 0.0e+00
Identity = 581/638 (91.07%), Positives = 606/638 (94.98%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL SS STARTSDR QQVE FALLIDKSKSVDRLLQIH
Sbjct: 2   TSSFLTASPASPPPIFTTNCSSVLRSSCSTARTSDRFQQVESFALLIDKSKSVDRLLQIH 61

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 62  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 121

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY QML  GVEPNVFTFSSVLKSCSLE GKVLHCQAIKLGFDSDLYVRTGLVDVYA
Sbjct: 122 RALNYYLQMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAIKLGFDSDLYVRTGLVDVYA 181

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EAR LFDGMK+RDVVCWNVMIGG
Sbjct: 182 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARTLFDGMKERDVVCWNVMIGG 241

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+AK MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 242 YAQNGVPNESLKLFRRMLMAKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 301

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFEEM+E
Sbjct: 302 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQDALQLFEEMSE 361

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGC+VNLLGRAGHLE
Sbjct: 362 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCIVNLLGRAGHLE 421

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 422 EAYELVKNMRVAADPVIWGTLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 481

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 482 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 541

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA+AFGLISTQPGTTIKIVKNLRVCSD
Sbjct: 542 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAVAFGLISTQPGTTIKIVKNLRVCSD 601

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE GLCSCG YW
Sbjct: 602 CHAVMKLISKITGRKIVMRDRNRFHHFEEGLCSCGGYW 639

BLAST of CaUC01G016520 vs. NCBI nr
Match: XP_022950238.1 (pentatricopeptide repeat-containing protein ELI1, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 580/638 (90.91%), Positives = 607/638 (95.14%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL SS STARTSDR QQVE FA+LIDKSKSVDRLLQIH
Sbjct: 2   TSSFLTASPASPPPIFTTNCSSVLRSSCSTARTSDRFQQVESFAMLIDKSKSVDRLLQIH 61

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 62  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 121

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY QML  GVEPNVFTFSSVLKSCSLE GKVLHCQAIKLGFDSDLYVRTGLVDVYA
Sbjct: 122 RALDYYLQMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAIKLGFDSDLYVRTGLVDVYA 181

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EAR LFDGMK+RDVVCWNVMIGG
Sbjct: 182 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARTLFDGMKERDVVCWNVMIGG 241

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+AK MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 242 YAQNGVPNESLKLFRRMLMAKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 301

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFEEM+E
Sbjct: 302 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQDALQLFEEMSE 361

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGC+VNLLGRAGHLE
Sbjct: 362 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCIVNLLGRAGHLE 421

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 422 EAYELVKNMRVAADPVIWGTLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 481

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 482 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 541

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA+AFGLISTQPGTTIKIVKNLRVCSD
Sbjct: 542 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAVAFGLISTQPGTTIKIVKNLRVCSD 601

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE+GLCSCG YW
Sbjct: 602 CHAVMKLISKITGRKIVMRDRNRFHHFEDGLCSCGGYW 639

BLAST of CaUC01G016520 vs. NCBI nr
Match: KAG7034315.1 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1182.9 bits (3059), Expect = 0.0e+00
Identity = 580/638 (90.91%), Positives = 605/638 (94.83%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL SS STARTSDR QQVE FALLIDKSKSVDRLLQIH
Sbjct: 2   TSSFLTASPASPPPIFTTNCSSVLRSSCSTARTSDRFQQVESFALLIDKSKSVDRLLQIH 61

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 62  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 121

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY QML  GVEPNVFTFSSVLKSCSLE GKVLHCQAIK GFDSDLYVRTGLVDVYA
Sbjct: 122 RALNYYLQMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAIKFGFDSDLYVRTGLVDVYA 181

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EAR LFDGMK+RDVVCWNVMIGG
Sbjct: 182 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARTLFDGMKERDVVCWNVMIGG 241

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+AK MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 242 YAQNGVPNESLKLFRRMLMAKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 301

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFEEM+E
Sbjct: 302 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQDALQLFEEMSE 361

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGC+VNLLGRAGHLE
Sbjct: 362 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCIVNLLGRAGHLE 421

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 422 EAYELVKNMRVAADPVIWGTLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 481

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 482 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 541

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA+AFGLISTQPGTTIKIVKNLRVCSD
Sbjct: 542 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAVAFGLISTQPGTTIKIVKNLRVCSD 601

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE GLCSCG YW
Sbjct: 602 CHAVMKLISKITGRKIVMRDRNRFHHFEEGLCSCGGYW 639

BLAST of CaUC01G016520 vs. NCBI nr
Match: XP_023543384.1 (pentatricopeptide repeat-containing protein ELI1, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1182.9 bits (3059), Expect = 0.0e+00
Identity = 580/638 (90.91%), Positives = 607/638 (95.14%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL SS STARTSDR QQVE FALLIDKSKSVDRLLQIH
Sbjct: 2   TSSFLTASPASPPPIFTTNCSSVLRSSCSTARTSDRFQQVESFALLIDKSKSVDRLLQIH 61

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 62  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 121

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY +ML  GVEPNVFTFSSVLKSCSLE GKVLHCQAIK GFDSDLYVRTGLVDVYA
Sbjct: 122 RALNYYLKMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAIKFGFDSDLYVRTGLVDVYA 181

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EARALFDGMK+RDVVCWNVMIGG
Sbjct: 182 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARALFDGMKERDVVCWNVMIGG 241

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+AK MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 242 YAQNGVPNESLKLFRRMLMAKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 301

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+ALKLFEEM+E
Sbjct: 302 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQHALKLFEEMSE 361

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGC+VNLLGRAGHLE
Sbjct: 362 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCIVNLLGRAGHLE 421

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHG+IKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 422 EAYELVKNMRVAADPVIWGTLLGSCRLHGDIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 481

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 482 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 541

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA+AFGLISTQPGTTIKIVKNLRVCSD
Sbjct: 542 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAVAFGLISTQPGTTIKIVKNLRVCSD 601

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE+GLCSCG YW
Sbjct: 602 CHAVMKLISKITGRKIVMRDRNRFHHFEDGLCSCGGYW 639

BLAST of CaUC01G016520 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 2.0e-219
Identity = 372/630 (59.05%), Positives = 483/630 (76.67%), Query Frame = 0

Query: 16  PIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLY---H 75
           P+  T+      S+++TAR   R    E+ A+LIDKS+SVD +LQIHA++LRH L     
Sbjct: 5   PLLATSLPQNQLSTTATARF--RLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPR 64

Query: 76  NPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRALGYYSQML 135
            P+LN KL R+YA+ G++ +S+A+F  + +P++F F+A I++     L D+A   Y Q+L
Sbjct: 65  YPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLL 124

Query: 136 TRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQ 195
           +  + PN FTFSS+LKSCS + GK++H   +K G   D YV TGLVDVYA+GGDVV A++
Sbjct: 125 SSEINPNEFTFSSLLKSCSTKSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQK 184

Query: 196 LFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNES 255
           +FD+MPERSLVS TAM+ CYAK G ++ ARALFD M +RD+V WNVMI GYAQ+G PN++
Sbjct: 185 VFDRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDA 244

Query: 256 LKLFRRMLV-AKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQINVHVGTALID 315
           L LF+++L   K  P+EIT++A LSAC Q+GALE+GRW+H ++++  I++NV V T LID
Sbjct: 245 LMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLID 304

Query: 316 MYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE-TGYQPTDI 375
           MYSKCG LE+A LVF+    KD+VAWN+MI GY+MHG+SQ+AL+LF EM   TG QPTDI
Sbjct: 305 MYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDI 364

Query: 376 TFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKN 435
           TFIG L AC+H GLV EG   F  M  +Y I+PK+EHYGC+V+LLGRAG L+ AYE +KN
Sbjct: 365 TFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKN 424

Query: 436 MSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGV 495
           M++ AD VLW S+LGSC+LHG+  LG+EIAE+L+   + NSG YVLLSNIYA+ G++EGV
Sbjct: 425 MNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGV 484

Query: 496 AKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYT 555
           AK+R LMKE GI KEPG S+IE+ NKVHEF AG+R+H KSKEIY ML +I+  +K+  Y 
Sbjct: 485 AKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYV 544

Query: 556 PQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSDCHAVMKLI 615
           P    VL DLEE +K+QSL+VHSE+LAIA+GLIST+PG+ +KI KNLRVCSDCH V KLI
Sbjct: 545 PNTNTVLQDLEETEKEQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLI 604

Query: 616 SEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           S+ITGRKIVMRDRNRFHHF +G CSCGD+W
Sbjct: 605 SKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CaUC01G016520 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 530.0 bits (1364), Expect = 3.7e-149
Identity = 280/640 (43.75%), Positives = 402/640 (62.81%), Query Frame = 0

Query: 27  PSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQRSYAA 86
           P+SS  +  S    Q       I+  +++  L QIHA  ++ G   + +   ++ R  A 
Sbjct: 15  PASSPASHPSSLFPQ-------INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCAT 74

Query: 87  LG----RLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRAL----GYYSQMLTRGVEP 146
                  LDY+  +F+   + N FS++ II    +    D+AL     +Y  M    VEP
Sbjct: 75  SDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESD-EDKALIAITLFYEMMSDEFVEP 134

Query: 147 NVFTFSSVLKSCS----LEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLF 206
           N FTF SVLK+C+    ++ GK +H  A+K GF  D +V + LV +Y   G +  AR LF
Sbjct: 135 NRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLF 194

Query: 207 DK-MPERSLVSQT-------------AMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 266
            K + E+ +V  T              M+  Y +LG+   AR LFD M+QR VV WN MI
Sbjct: 195 YKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMI 254

Query: 267 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 326
            GY+ NG   +++++FR M      PN +T+++VL A  +LG+LE G W+H Y E+ GI+
Sbjct: 255 SGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIR 314

Query: 327 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 386
           I+  +G+ALIDMYSKCG +E A  VF+ +  ++V+ W++MI G+++HG + +A+  F +M
Sbjct: 315 IDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKM 374

Query: 387 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 446
            + G +P+D+ +I LL+ACSHGGLVEEGR +F  M     +EP++EHYGCMV+LLGR+G 
Sbjct: 375 RQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGL 434

Query: 447 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 506
           L+EA E + NM I  D V+W +LLG+CR+ GN+++G+ +A  L+D    +SG YV LSN+
Sbjct: 435 LDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNM 494

Query: 507 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 566
           YA+ GNW  V++MR  MKE  I K+PGCS I+++  +HEF+  +  HPK+KEI  ML EI
Sbjct: 495 YASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEI 554

Query: 567 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 626
           +  L+   Y P    VL +LEEE K+  L  HSEK+A AFGLIST PG  I+IVKNLR+C
Sbjct: 555 SDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRIC 614

Query: 627 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
            DCH+ +KLIS++  RKI +RDR RFHHF++G CSC DYW
Sbjct: 615 EDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CaUC01G016520 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 1.4e-148
Identity = 286/728 (39.29%), Positives = 407/728 (55.91%), Query Frame = 0

Query: 26  LPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQR--S 85
           LPSSS     S R+        L+   K++  L  IHA +++ GL++    N+ L +   
Sbjct: 20  LPSSSDPPYDSIRNHP---SLSLLHNCKTLQSLRIIHAQMIKIGLHNT---NYALSKLIE 79

Query: 86  YAALG----RLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPN 145
           +  L      L Y+++VF T  EPN+  ++ +   H        AL  Y  M++ G+ PN
Sbjct: 80  FCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPN 139

Query: 146 VFTFSSVLKSC----SLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFD 205
            +TF  VLKSC    + + G+ +H   +KLG D DLYV T L+ +Y + G +  A ++FD
Sbjct: 140 SYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD 199

Query: 206 KMPERSLVSQTAML---------------------------------------------- 265
           K P R +VS TA++                                              
Sbjct: 200 KSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 259

Query: 266 ---------------------AC----------------------------------YAK 325
                                AC                                  Y+K
Sbjct: 260 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 319

Query: 326 LGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMPNEITILAV 385
            GEL+ A  LF+ +  +DV+ WN +IGGY    L  E+L LF+ ML +   PN++T+L++
Sbjct: 320 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 379

Query: 386 LSACGQLGALESGRWVHSYIEN--KGIQINVHVGTALIDMYSKCGRLEDARLVFDGIRDK 445
           L AC  LGA++ GRW+H YI+   KG+     + T+LIDMY+KCG +E A  VF+ I  K
Sbjct: 380 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 439

Query: 446 DVVAWNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEGRSFF 505
            + +WN+MI G++MHG +  +  LF  M + G QP DITF+GLLSACSH G+++ GR  F
Sbjct: 440 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 499

Query: 506 RLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCRLHGN 565
           R M   Y + PK+EHYGCM++LLG +G  +EA E++  M +  D V+W SLL +C++HGN
Sbjct: 500 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 559

Query: 566 IKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGCSSIE 625
           ++LGE  AE L+  +  N G+YVLLSNIYA+ G W  VAK R L+ + G++K PGCSSIE
Sbjct: 560 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 619

Query: 626 VNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQKKQSLEVH 641
           +++ VHEF+ G++ HP+++EIY ML E+   L+   + P    VL ++EEE K+ +L  H
Sbjct: 620 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHH 679

BLAST of CaUC01G016520 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 5.5e-145
Identity = 255/605 (42.15%), Positives = 383/605 (63.31%), Query Frame = 0

Query: 45  FALLIDKSKSVDRL---LQIHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSD 104
           F  LI  +  V  L     +H   ++  +  +  +   L   Y + G LD +  VF T  
Sbjct: 134 FPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIK 193

Query: 105 EPNVFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPNVFTFSSVLKSC----SLEPGKV 164
           E +V S++++I+  VQ    D+AL  + +M +  V+ +  T   VL +C    +LE G+ 
Sbjct: 194 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 253

Query: 165 LHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGE 224
           +     +   + +L +   ++D+Y + G +  A++LFD M E+  V+ T ML  YA   +
Sbjct: 254 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 313

Query: 225 LDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMP-NEITILAVLS 284
            + AR + + M Q+D+V WN +I  Y QNG PNE+L +F  + + K M  N+IT+++ LS
Sbjct: 314 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 373

Query: 285 ACGQLGALESGRWVHSYIENKGIQINVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVA 344
           AC Q+GALE GRW+HSYI+  GI++N HV +ALI MYSKCG LE +R VF+ +  +DV  
Sbjct: 374 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 433

Query: 345 WNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMR 404
           W++MI G +MHG    A+ +F +M E   +P  +TF  +  ACSH GLV+E  S F  M 
Sbjct: 434 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 493

Query: 405 DKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLG 464
             Y I P+ +HY C+V++LGR+G+LE+A + ++ M I     +WG+LLG+C++H N+ L 
Sbjct: 494 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 553

Query: 465 EEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNK 524
           E     L++ +  N G +VLLSNIYA  G WE V+++R  M+  G++KEPGCSSIE++  
Sbjct: 554 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 613

Query: 525 VHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQ-KKQSLEVHSEK 584
           +HEFL+G+  HP S+++Y  L+E+   LK+  Y P+I  VL  +EEE+ K+QSL +HSEK
Sbjct: 614 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 673

Query: 585 LAIAFGLISTQPGTTIKIVKNLRVCSDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCS 641
           LAI +GLIST+    I+++KNLRVC DCH+V KLIS++  R+I++RDR RFHHF NG CS
Sbjct: 674 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 733

BLAST of CaUC01G016520 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 1.3e-143
Identity = 262/672 (38.99%), Positives = 393/672 (58.48%), Query Frame = 0

Query: 45  FALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPN 104
           +A LID +    +L QIHA LL  GL  +  L  KL  + ++ G + ++  VFD    P 
Sbjct: 24  YASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQ 83

Query: 105 VFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPNVFTFSSVLKSCS----LEPGKVLHC 164
           +F ++AII  + +   F  AL  YS M    V P+ FTF  +LK+CS    L+ G+ +H 
Sbjct: 84  IFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHA 143

Query: 165 QAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFD--KMPERSLVSQTAMLAC------- 224
           Q  +LGFD+D++V+ GL+ +YA+   +  AR +F+   +PER++VS TA+++        
Sbjct: 144 QVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEP 203

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 204 MEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISL 263

Query: 285 ---YAKLGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMPNE 344
              YAK G++  A+ LFD MK  +++ WN MI GYA+NG   E++ +F  M+     P+ 
Sbjct: 264 NTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDT 323

Query: 345 ITILAVLSACGQLGALESGRWVHSYIENKGIQINVHVGTALIDMYSKCGRLEDARLVFDG 404
           I+I + +SAC Q+G+LE  R ++ Y+     + +V + +ALIDM++KCG +E ARLVFD 
Sbjct: 324 ISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDR 383

Query: 405 IRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEG 464
             D+DVV W++MIVGY +HG ++ A+ L+  M   G  P D+TF+GLL AC+H G+V EG
Sbjct: 384 TLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREG 443

Query: 465 RSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCR 524
             FF  M D + I P+ +HY C+++LLGRAGHL++AYE++K M +     +WG+LL +C+
Sbjct: 444 WWFFNRMAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACK 503

Query: 525 LHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGC 584
            H +++LGE  A+ L     +N+G YV LSN+YAA   W+ VA++R  MKE G+ K+ GC
Sbjct: 504 KHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGC 563

Query: 585 SSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQKKQS 641
           S +EV  ++  F  G++ HP+ +EI   +  I S LK   +       LHDL +E+ +++
Sbjct: 564 SWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEET 623
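
The identity and positives percentages in each HSP header are ratios of matching positions to alignment length. The sketch below recomputes them from one header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool; the pattern also accepts the spelling "Postives" in case an export carries it.

```python
import re

# Matches "Identity = a/b (p%)" and "Positives = a/b (p%)".
# Assumption: the field may also be spelled "Postives" in some exports.
FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_percentages(header):
    """Return {field: (matches, alignment_length, recomputed_percent)}."""
    out = {}
    for name, hits, length, _reported in FIELD.findall(header):
        key = "Identity" if name == "Identity" else "Positives"
        # Percentage = matching positions / alignment length, to two decimals.
        out[key] = (int(hits), int(length), round(100 * int(hits) / int(length), 2))
    return out

header = "Identity = 262/672 (38.99%), Positives = 393/672 (58.48%), Query Frame = 0"
print(hsp_percentages(header))
```

For the Q9LTV8 hit above, 262/672 and 393/672 recompute to the reported 38.99% and 58.48%.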

BLAST of CaUC01G016520 vs. ExPASy TrEMBL
Match: A0A6J1GED1 (pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111453386 PE=3 SV=1)

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 580/638 (90.91%), Positives = 607/638 (95.14%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL SS STARTSDR QQVE FA+LIDKSKSVDRLLQIH
Sbjct: 2   TSSFLTASPASPPPIFTTNCSSVLRSSCSTARTSDRFQQVESFAMLIDKSKSVDRLLQIH 61

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 62  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 121

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY QML  GVEPNVFTFSSVLKSCSLE GKVLHCQAIKLGFDSDLYVRTGLVDVYA
Sbjct: 122 RALDYYLQMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAIKLGFDSDLYVRTGLVDVYA 181

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EAR LFDGMK+RDVVCWNVMIGG
Sbjct: 182 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARTLFDGMKERDVVCWNVMIGG 241

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+AK MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 242 YAQNGVPNESLKLFRRMLMAKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 301

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFEEM+E
Sbjct: 302 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQDALQLFEEMSE 361

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGC+VNLLGRAGHLE
Sbjct: 362 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCIVNLLGRAGHLE 421

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 422 EAYELVKNMRVAADPVIWGTLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 481

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 482 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 541

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA+AFGLISTQPGTTIKIVKNLRVCSD
Sbjct: 542 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAVAFGLISTQPGTTIKIVKNLRVCSD 601

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE+GLCSCG YW
Sbjct: 602 CHAVMKLISKITGRKIVMRDRNRFHHFEDGLCSCGGYW 639

BLAST of CaUC01G016520 vs. ExPASy TrEMBL
Match: A0A6J1IMD2 (pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111477796 PE=3 SV=1)

HSP 1 Score: 1176.0 bits (3041), Expect = 0.0e+00
Identity = 576/638 (90.28%), Positives = 604/638 (94.67%), Query Frame = 0

Query: 3   SSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIH 62
           +SSF TAS  SPPPIFTTN SSVL +S STARTSDR QQVE FALLIDKSKSVDRLLQIH
Sbjct: 8   TSSFLTASPASPPPIFTTNCSSVLRTSCSTARTSDRFQQVESFALLIDKSKSVDRLLQIH 67

Query: 63  ASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFD 122
           AS+LRHGLYHNPILNFKLQRSYAALGRLDYSVAVF+TSDEPNVFSFSAIIHSHVQC LFD
Sbjct: 68  ASVLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFNTSDEPNVFSFSAIIHSHVQCGLFD 127

Query: 123 RALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYA 182
           RAL YY QML  GVEPNVFTFSSVLKSCSLE GKVLHCQA+K GFDSDLYVRTGLVDVYA
Sbjct: 128 RALNYYLQMLNGGVEPNVFTFSSVLKSCSLELGKVLHCQAMKFGFDSDLYVRTGLVDVYA 187

Query: 183 RGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGG 242
           RGGDVVCARQLFDKM ERSLVS TAML CY KLGEL EARALFDGMK+RDVVCWNVMIGG
Sbjct: 188 RGGDVVCARQLFDKMSERSLVSLTAMLTCYVKLGELGEARALFDGMKERDVVCWNVMIGG 247

Query: 243 YAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQIN 302
           YAQNG+PNESLKLFRRML+ K MPNE+T+LAVLSACGQLGALESGRWVHSYIENKGIQ+N
Sbjct: 248 YAQNGVPNESLKLFRRMLMVKVMPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIQMN 307

Query: 303 VHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE 362
           VHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFE+M+E
Sbjct: 308 VHVGTALVDMYSKCGSLEDARLVFDQIRDKDVVAWNSMIVGYAMHGFSQHALQLFEQMSE 367

Query: 363 TGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLE 422
            GYQPTDITFIG+LSACSHGGLVEEGRS+FRLMRDKY IEPKVEHYGCMVNLLGRAGHLE
Sbjct: 368 IGYQPTDITFIGILSACSHGGLVEEGRSYFRLMRDKYRIEPKVEHYGCMVNLLGRAGHLE 427

Query: 423 EAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 482
           EAYELVKNM +AADPV+WG+LLGSCRLHG+IKLGEEIAEFLVDQKLANSGTYVLLSNIYA
Sbjct: 428 EAYELVKNMRVAADPVIWGTLLGSCRLHGDIKLGEEIAEFLVDQKLANSGTYVLLSNIYA 487

Query: 483 ATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 542
           ATGNWEGV+KMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS
Sbjct: 488 ATGNWEGVSKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINS 547

Query: 543 WLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSD 602
           WLKA RYTPQ  VVLHDL EEQK++SLEVHSEKLA AFGLISTQPGTTIKIVKNLRVC D
Sbjct: 548 WLKAHRYTPQTDVVLHDLGEEQKERSLEVHSEKLAAAFGLISTQPGTTIKIVKNLRVCPD 607

Query: 603 CHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           CHAVMKLIS+ITGRKIVMRDRNRFHHFE+GLCSCG YW
Sbjct: 608 CHAVMKLISKITGRKIVMRDRNRFHHFEDGLCSCGGYW 645

BLAST of CaUC01G016520 vs. ExPASy TrEMBL
Match: A0A1S3B0U7 (pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484813 PE=3 SV=1)

HSP 1 Score: 1167.9 bits (3020), Expect = 0.0e+00
Identity = 575/640 (89.84%), Positives = 604/640 (94.38%), Query Frame = 0

Query: 1   MSSSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQ 60
           MSSSSFFTAS PSPP IFTTNRSSV PSSSSTA+TS R QQVERFALLIDKSKSV  LLQ
Sbjct: 6   MSSSSFFTASPPSPPSIFTTNRSSVSPSSSSTAKTSGRFQQVERFALLIDKSKSVAHLLQ 65

Query: 61  IHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRL 120
           IHASLLRHGLYHNPILNFKLQRSYAALGRLD SV VF+T DEPNVFSFSAIIHSHVQ RL
Sbjct: 66  IHASLLRHGLYHNPILNFKLQRSYAALGRLDCSVVVFNTFDEPNVFSFSAIIHSHVQSRL 125

Query: 121 FDRALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV 180
           FDRA GYYSQML+RGVEPN FTFSSVLKSCSLE GKVLHCQAIKLG  SDLYVRTGLVDV
Sbjct: 126 FDRAFGYYSQMLSRGVEPNAFTFSSVLKSCSLESGKVLHCQAIKLGLGSDLYVRTGLVDV 185

Query: 181 YARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 240
           YARGGDVVCARQLFDKMPERSLVS T ML CY+K+GELDEAR+LF+GMK+RDVVCWNVMI
Sbjct: 186 YARGGDVVCARQLFDKMPERSLVSLTTMLTCYSKMGELDEARSLFEGMKERDVVCWNVMI 245

Query: 241 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 300
           GGYAQNG+PNESLKLFRRMLV+KA+PNE+T+LAVLSACGQLGALESGRWVHSYIENK IQ
Sbjct: 246 GGYAQNGVPNESLKLFRRMLVSKAIPNEVTVLAVLSACGQLGALESGRWVHSYIENKSIQ 305

Query: 301 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 360
           INVHVGTAL+DMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LF EM
Sbjct: 306 INVHVGTALVDMYSKCGSLEDARLVFDRIRDKDVVAWNSMIVGYAMHGFSQHALQLFGEM 365

Query: 361 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 420
           TETG+QPTDITFIG+LSAC+HGGLVEEGRS FRLMRDKY IEPK+EHYGCMVNLLGRAGH
Sbjct: 366 TETGHQPTDITFIGILSACAHGGLVEEGRSLFRLMRDKYGIEPKIEHYGCMVNLLGRAGH 425

Query: 421 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 480
           LEEAY LVKNM+IAADPVLWG+LLGSCRLH NIKLGEEIAEFLVDQKLA+SGTYVLLSN+
Sbjct: 426 LEEAYALVKNMTIAADPVLWGTLLGSCRLHVNIKLGEEIAEFLVDQKLAHSGTYVLLSNM 485

Query: 481 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 540
           YAATGNWEGVAKMRTLMKEHGIEKE GCSSIEVNNKVHEF+AGERKHPKSKEIYVM NEI
Sbjct: 486 YAATGNWEGVAKMRTLMKEHGIEKEHGCSSIEVNNKVHEFVAGERKHPKSKEIYVMSNEI 545

Query: 541 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 600
           NSWLKAR YT Q  VVLHDL EEQK+Q LEVHSEKLAIAFGLIST+PGTTIKIVKNLRVC
Sbjct: 546 NSWLKARGYTSQTDVVLHDLREEQKEQLLEVHSEKLAIAFGLISTKPGTTIKIVKNLRVC 605

Query: 601 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           SDCH VMKLISEITGRKIVMRDRNRFHHFE+GLCSCGDYW
Sbjct: 606 SDCHTVMKLISEITGRKIVMRDRNRFHHFEDGLCSCGDYW 645

BLAST of CaUC01G016520 vs. ExPASy TrEMBL
Match: A0A0A0KGF1 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G492280 PE=3 SV=1)

HSP 1 Score: 1167.1 bits (3018), Expect = 0.0e+00
Identity = 573/640 (89.53%), Positives = 606/640 (94.69%), Query Frame = 0

Query: 1   MSSSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQ 60
           MSSSS FTAS PSPP IFTTNRSSVLPSSSSTARTSDR Q+VERFA LIDKSKSV  LLQ
Sbjct: 6   MSSSSIFTASHPSPPSIFTTNRSSVLPSSSSTARTSDRFQEVERFASLIDKSKSVAHLLQ 65

Query: 61  IHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRL 120
           IHASLLR GLYHNPILNFKLQRSYAALGRLD SV VF+T DEPNVFSFSAIIHSHVQ RL
Sbjct: 66  IHASLLRRGLYHNPILNFKLQRSYAALGRLDCSVFVFNTFDEPNVFSFSAIIHSHVQSRL 125

Query: 121 FDRALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV 180
           FDRA GYYSQML+ GVEPN FTFSSVLKSCSLE GKVLHCQAIKLG  SDLYVRTGLVDV
Sbjct: 126 FDRAFGYYSQMLSCGVEPNAFTFSSVLKSCSLESGKVLHCQAIKLGLGSDLYVRTGLVDV 185

Query: 181 YARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 240
           YARGGDVVCARQLFDKMPERSLVS T ML CY+K+GELD+AR+LF+GMK+RDVVCWNVMI
Sbjct: 186 YARGGDVVCARQLFDKMPERSLVSLTTMLTCYSKMGELDKARSLFEGMKERDVVCWNVMI 245

Query: 241 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 300
           GGYAQ+G+PNESLKLFRRMLVAKA+PNE+T+LAVLSACGQLGALESGRW+HSYIENKGIQ
Sbjct: 246 GGYAQSGVPNESLKLFRRMLVAKAIPNEVTVLAVLSACGQLGALESGRWIHSYIENKGIQ 305

Query: 301 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 360
           INVHVGTALIDMYSKCG LEDARLVFD IRDKDVVAWNSMIVGY+MHGFSQ+AL+LFEEM
Sbjct: 306 INVHVGTALIDMYSKCGSLEDARLVFDRIRDKDVVAWNSMIVGYAMHGFSQHALQLFEEM 365

Query: 361 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 420
           TETG++PTDITFIG+LSAC HGGLVEEGRSFFRLMRDKY IEPK+EHYGCMVNLLGRAGH
Sbjct: 366 TETGHKPTDITFIGILSACGHGGLVEEGRSFFRLMRDKYGIEPKIEHYGCMVNLLGRAGH 425

Query: 421 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 480
           LEEAY LVKNM+IAADPVLWG+LLG CRLH NIKLGEEIA+FLVDQKLANSGTYVLLSN+
Sbjct: 426 LEEAYGLVKNMTIAADPVLWGTLLGCCRLHVNIKLGEEIAKFLVDQKLANSGTYVLLSNM 485

Query: 481 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 540
           YAATGNWEGVAKMRTLMKEHGIEKE GCSSIEV+NKVHEF+AGERKHPKSKEIYVMLNEI
Sbjct: 486 YAATGNWEGVAKMRTLMKEHGIEKEHGCSSIEVDNKVHEFVAGERKHPKSKEIYVMLNEI 545

Query: 541 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 600
           NSWLKAR YTPQ  VVLHDL EEQK+QSLEVHSEKLAIAFGLIST+PGTT+KIVKNLRVC
Sbjct: 546 NSWLKARGYTPQTDVVLHDLREEQKEQSLEVHSEKLAIAFGLISTKPGTTVKIVKNLRVC 605

Query: 601 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           SDCH VMK+ISEITGRKIVMRDRNRFHHFE+GLCSCGDYW
Sbjct: 606 SDCHTVMKMISEITGRKIVMRDRNRFHHFEDGLCSCGDYW 645

BLAST of CaUC01G016520 vs. ExPASy TrEMBL
Match: A0A6J1BWF5 (pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111005343 PE=3 SV=1)

HSP 1 Score: 1166.4 bits (3016), Expect = 0.0e+00
Identity = 565/640 (88.28%), Positives = 603/640 (94.22%), Query Frame = 0

Query: 1   MSSSSFFTASLPSPPPIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQ 60
           MSSSSFFTA  PSPPP+ TTNRSS+LPSS STA TSDRSQQVERFA LIDKS+SV RLLQ
Sbjct: 1   MSSSSFFTAPPPSPPPLVTTNRSSLLPSSPSTATTSDRSQQVERFASLIDKSRSVIRLLQ 60

Query: 61  IHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRL 120
           IHASLLR GLYH+PILNFKLQRSYA +GRLDYSVAVF+TSD+PNVFS+S+IIHSH QC L
Sbjct: 61  IHASLLRQGLYHHPILNFKLQRSYATVGRLDYSVAVFNTSDDPNVFSYSSIIHSHAQCGL 120

Query: 121 FDRALGYYSQMLTRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDV 180
           FD+AL +YSQMLTRGVEPNVFTFSSVLKSCSLEPGK LHCQAIKLGFDSDLYVRTGLVDV
Sbjct: 121 FDQALDFYSQMLTRGVEPNVFTFSSVLKSCSLEPGKALHCQAIKLGFDSDLYVRTGLVDV 180

Query: 181 YARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 240
           YARGG+VVCARQLFDKMPERSLVS TAML CY KLGELDEARALFDGMK+RDVVCWNVMI
Sbjct: 181 YARGGEVVCARQLFDKMPERSLVSLTAMLTCYTKLGELDEARALFDGMKERDVVCWNVMI 240

Query: 241 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 300
           GGYAQNG+PNESLKLFRRMLVAK  PNE+T+LAVLSACGQLGALESGRWVHSYIENKGI+
Sbjct: 241 GGYAQNGVPNESLKLFRRMLVAKVKPNEVTVLAVLSACGQLGALESGRWVHSYIENKGIE 300

Query: 301 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 360
           +NVHVGTAL+DMYSKCG LEDARLVFDGIRDKDVVAWNSMIVGY+MHGF Q+ L+LFEEM
Sbjct: 301 MNVHVGTALVDMYSKCGSLEDARLVFDGIRDKDVVAWNSMIVGYAMHGFFQHVLQLFEEM 360

Query: 361 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 420
           T  GYQPTDITFIG+LSACSHGG+VEEGR F  LMR +Y IEPKVEHYGCMVNLLG AGH
Sbjct: 361 TAIGYQPTDITFIGILSACSHGGMVEEGRRFLSLMRKEYGIEPKVEHYGCMVNLLGHAGH 420

Query: 421 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 480
           LEEAY LVKNM++AADPVLWG+LLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI
Sbjct: 421 LEEAYNLVKNMTVAADPVLWGTLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 480

Query: 481 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 540
           YAATGNWEGVAK+RTLMKEHGIEKEPGCSSIEVNNKVHEF+AGERKHPK+KEIY+MLNEI
Sbjct: 481 YAATGNWEGVAKIRTLMKEHGIEKEPGCSSIEVNNKVHEFVAGERKHPKTKEIYMMLNEI 540

Query: 541 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 600
           N WL+A  Y PQ  +VLHDL EEQK+QSLEVHSEKLAIAFGLISTQPGTT+KIVKNLRVC
Sbjct: 541 NRWLRAHGYAPQTDIVLHDLGEEQKEQSLEVHSEKLAIAFGLISTQPGTTVKIVKNLRVC 600

Query: 601 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           SDCHAVMKLIS+ITGRKIVMRDRNRFHHFE+GLCSCGDYW
Sbjct: 601 SDCHAVMKLISKITGRKIVMRDRNRFHHFEDGLCSCGDYW 640

BLAST of CaUC01G016520 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 763.5 bits (1970), Expect = 1.4e-220
Identity = 372/630 (59.05%), Positives = 483/630 (76.67%), Query Frame = 0

Query: 16  PIFTTNRSSVLPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLY---H 75
           P+  T+      S+++TAR   R    E+ A+LIDKS+SVD +LQIHA++LRH L     
Sbjct: 5   PLLATSLPQNQLSTTATARF--RLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPR 64

Query: 76  NPILNFKLQRSYAALGRLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRALGYYSQML 135
            P+LN KL R+YA+ G++ +S+A+F  + +P++F F+A I++     L D+A   Y Q+L
Sbjct: 65  YPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLL 124

Query: 136 TRGVEPNVFTFSSVLKSCSLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQ 195
           +  + PN FTFSS+LKSCS + GK++H   +K G   D YV TGLVDVYA+GGDVV A++
Sbjct: 125 SSEINPNEFTFSSLLKSCSTKSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQK 184

Query: 196 LFDKMPERSLVSQTAMLACYAKLGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNES 255
           +FD+MPERSLVS TAM+ CYAK G ++ ARALFD M +RD+V WNVMI GYAQ+G PN++
Sbjct: 185 VFDRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDA 244

Query: 256 LKLFRRMLV-AKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQINVHVGTALID 315
           L LF+++L   K  P+EIT++A LSAC Q+GALE+GRW+H ++++  I++NV V T LID
Sbjct: 245 LMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLID 304

Query: 316 MYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTE-TGYQPTDI 375
           MYSKCG LE+A LVF+    KD+VAWN+MI GY+MHG+SQ+AL+LF EM   TG QPTDI
Sbjct: 305 MYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDI 364

Query: 376 TFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKN 435
           TFIG L AC+H GLV EG   F  M  +Y I+PK+EHYGC+V+LLGRAG L+ AYE +KN
Sbjct: 365 TFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKN 424

Query: 436 MSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGV 495
           M++ AD VLW S+LGSC+LHG+  LG+EIAE+L+   + NSG YVLLSNIYA+ G++EGV
Sbjct: 425 MNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGV 484

Query: 496 AKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYT 555
           AK+R LMKE GI KEPG S+IE+ NKVHEF AG+R+H KSKEIY ML +I+  +K+  Y 
Sbjct: 485 AKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYV 544

Query: 556 PQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVCSDCHAVMKLI 615
           P    VL DLEE +K+QSL+VHSE+LAIA+GLIST+PG+ +KI KNLRVCSDCH V KLI
Sbjct: 545 PNTNTVLQDLEETEKEQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLI 604

Query: 616 SEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
           S+ITGRKIVMRDRNRFHHF +G CSCGD+W
Sbjct: 605 SKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CaUC01G016520 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 530.0 bits (1364), Expect = 2.6e-150
Identity = 280/640 (43.75%), Positives = 402/640 (62.81%), Query Frame = 0

Query: 27  PSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQRSYAA 86
           P+SS  +  S    Q       I+  +++  L QIHA  ++ G   + +   ++ R  A 
Sbjct: 15  PASSPASHPSSLFPQ-------INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCAT 74

Query: 87  LG----RLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRAL----GYYSQMLTRGVEP 146
                  LDY+  +F+   + N FS++ II    +    D+AL     +Y  M    VEP
Sbjct: 75  SDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESD-EDKALIAITLFYEMMSDEFVEP 134

Query: 147 NVFTFSSVLKSCS----LEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLF 206
           N FTF SVLK+C+    ++ GK +H  A+K GF  D +V + LV +Y   G +  AR LF
Sbjct: 135 NRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLF 194

Query: 207 DK-MPERSLVSQT-------------AMLACYAKLGELDEARALFDGMKQRDVVCWNVMI 266
            K + E+ +V  T              M+  Y +LG+   AR LFD M+QR VV WN MI
Sbjct: 195 YKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMI 254

Query: 267 GGYAQNGLPNESLKLFRRMLVAKAMPNEITILAVLSACGQLGALESGRWVHSYIENKGIQ 326
            GY+ NG   +++++FR M      PN +T+++VL A  +LG+LE G W+H Y E+ GI+
Sbjct: 255 SGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIR 314

Query: 327 INVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVAWNSMIVGYSMHGFSQNALKLFEEM 386
           I+  +G+ALIDMYSKCG +E A  VF+ +  ++V+ W++MI G+++HG + +A+  F +M
Sbjct: 315 IDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKM 374

Query: 387 TETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGH 446
            + G +P+D+ +I LL+ACSHGGLVEEGR +F  M     +EP++EHYGCMV+LLGR+G 
Sbjct: 375 RQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGL 434

Query: 447 LEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLGEEIAEFLVDQKLANSGTYVLLSNI 506
           L+EA E + NM I  D V+W +LLG+CR+ GN+++G+ +A  L+D    +SG YV LSN+
Sbjct: 435 LDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNM 494

Query: 507 YAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNKVHEFLAGERKHPKSKEIYVMLNEI 566
           YA+ GNW  V++MR  MKE  I K+PGCS I+++  +HEF+  +  HPK+KEI  ML EI
Sbjct: 495 YASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEI 554

Query: 567 NSWLKARRYTPQIGVVLHDLEEEQKKQSLEVHSEKLAIAFGLISTQPGTTIKIVKNLRVC 626
           +  L+   Y P    VL +LEEE K+  L  HSEK+A AFGLIST PG  I+IVKNLR+C
Sbjct: 555 SDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRIC 614

Query: 627 SDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCSCGDYW 641
            DCH+ +KLIS++  RKI +RDR RFHHF++G CSC DYW
Sbjct: 615 EDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CaUC01G016520 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 528.1 bits (1359), Expect = 9.9e-150
Identity = 286/728 (39.29%), Positives = 407/728 (55.91%), Query Frame = 0

Query: 26  LPSSSSTARTSDRSQQVERFALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQR--S 85
           LPSSS     S R+        L+   K++  L  IHA +++ GL++    N+ L +   
Sbjct: 20  LPSSSDPPYDSIRNHP---SLSLLHNCKTLQSLRIIHAQMIKIGLHNT---NYALSKLIE 79

Query: 86  YAALG----RLDYSVAVFDTSDEPNVFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPN 145
           +  L      L Y+++VF T  EPN+  ++ +   H        AL  Y  M++ G+ PN
Sbjct: 80  FCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPN 139

Query: 146 VFTFSSVLKSC----SLEPGKVLHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFD 205
            +TF  VLKSC    + + G+ +H   +KLG D DLYV T L+ +Y + G +  A ++FD
Sbjct: 140 SYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD 199

Query: 206 KMPERSLVSQTAML---------------------------------------------- 265
           K P R +VS TA++                                              
Sbjct: 200 KSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 259

Query: 266 ---------------------AC----------------------------------YAK 325
                                AC                                  Y+K
Sbjct: 260 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 319

Query: 326 LGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMPNEITILAV 385
            GEL+ A  LF+ +  +DV+ WN +IGGY    L  E+L LF+ ML +   PN++T+L++
Sbjct: 320 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 379

Query: 386 LSACGQLGALESGRWVHSYIEN--KGIQINVHVGTALIDMYSKCGRLEDARLVFDGIRDK 445
           L AC  LGA++ GRW+H YI+   KG+     + T+LIDMY+KCG +E A  VF+ I  K
Sbjct: 380 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 439

Query: 446 DVVAWNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEGRSFF 505
            + +WN+MI G++MHG +  +  LF  M + G QP DITF+GLLSACSH G+++ GR  F
Sbjct: 440 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 499

Query: 506 RLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCRLHGN 565
           R M   Y + PK+EHYGCM++LLG +G  +EA E++  M +  D V+W SLL +C++HGN
Sbjct: 500 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 559

Query: 566 IKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGCSSIE 625
           ++LGE  AE L+  +  N G+YVLLSNIYA+ G W  VAK R L+ + G++K PGCSSIE
Sbjct: 560 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 619

Query: 626 VNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQKKQSLEVH 641
           +++ VHEF+ G++ HP+++EIY ML E+   L+   + P    VL ++EEE K+ +L  H
Sbjct: 620 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHH 679

BLAST of CaUC01G016520 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 516.2 bits (1328), Expect = 3.9e-146
Identity = 255/605 (42.15%), Positives = 383/605 (63.31%), Query Frame = 0

Query: 45  FALLIDKSKSVDRL---LQIHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSD 104
           F  LI  +  V  L     +H   ++  +  +  +   L   Y + G LD +  VF T  
Sbjct: 134 FPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIK 193

Query: 105 EPNVFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPNVFTFSSVLKSC----SLEPGKV 164
           E +V S++++I+  VQ    D+AL  + +M +  V+ +  T   VL +C    +LE G+ 
Sbjct: 194 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 253

Query: 165 LHCQAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFDKMPERSLVSQTAMLACYAKLGE 224
           +     +   + +L +   ++D+Y + G +  A++LFD M E+  V+ T ML  YA   +
Sbjct: 254 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 313

Query: 225 LDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMP-NEITILAVLS 284
            + AR + + M Q+D+V WN +I  Y QNG PNE+L +F  + + K M  N+IT+++ LS
Sbjct: 314 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 373

Query: 285 ACGQLGALESGRWVHSYIENKGIQINVHVGTALIDMYSKCGRLEDARLVFDGIRDKDVVA 344
           AC Q+GALE GRW+HSYI+  GI++N HV +ALI MYSKCG LE +R VF+ +  +DV  
Sbjct: 374 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 433

Query: 345 WNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEGRSFFRLMR 404
           W++MI G +MHG    A+ +F +M E   +P  +TF  +  ACSH GLV+E  S F  M 
Sbjct: 434 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 493

Query: 405 DKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCRLHGNIKLG 464
             Y I P+ +HY C+V++LGR+G+LE+A + ++ M I     +WG+LLG+C++H N+ L 
Sbjct: 494 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 553

Query: 465 EEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGCSSIEVNNK 524
           E     L++ +  N G +VLLSNIYA  G WE V+++R  M+  G++KEPGCSSIE++  
Sbjct: 554 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 613

Query: 525 VHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQ-KKQSLEVHSEK 584
           +HEFL+G+  HP S+++Y  L+E+   LK+  Y P+I  VL  +EEE+ K+QSL +HSEK
Sbjct: 614 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 673

Query: 585 LAIAFGLISTQPGTTIKIVKNLRVCSDCHAVMKLISEITGRKIVMRDRNRFHHFENGLCS 641
           LAI +GLIST+    I+++KNLRVC DCH+V KLIS++  R+I++RDR RFHHF NG CS
Sbjct: 674 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 733

BLAST of CaUC01G016520 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 511.5 bits (1316), Expect = 9.6e-145
Identity = 262/672 (38.99%), Positives = 393/672 (58.48%), Query Frame = 0

Query: 45  FALLIDKSKSVDRLLQIHASLLRHGLYHNPILNFKLQRSYAALGRLDYSVAVFDTSDEPN 104
           +A LID +    +L QIHA LL  GL  +  L  KL  + ++ G + ++  VFD    P 
Sbjct: 24  YASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQ 83

Query: 105 VFSFSAIIHSHVQCRLFDRALGYYSQMLTRGVEPNVFTFSSVLKSCS----LEPGKVLHC 164
           +F ++AII  + +   F  AL  YS M    V P+ FTF  +LK+CS    L+ G+ +H 
Sbjct: 84  IFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHA 143

Query: 165 QAIKLGFDSDLYVRTGLVDVYARGGDVVCARQLFD--KMPERSLVSQTAMLAC------- 224
           Q  +LGFD+D++V+ GL+ +YA+   +  AR +F+   +PER++VS TA+++        
Sbjct: 144 QVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEP 203

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 204 MEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISL 263

Query: 285 ---YAKLGELDEARALFDGMKQRDVVCWNVMIGGYAQNGLPNESLKLFRRMLVAKAMPNE 344
              YAK G++  A+ LFD MK  +++ WN MI GYA+NG   E++ +F  M+     P+ 
Sbjct: 264 NTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDT 323

Query: 345 ITILAVLSACGQLGALESGRWVHSYIENKGIQINVHVGTALIDMYSKCGRLEDARLVFDG 404
           I+I + +SAC Q+G+LE  R ++ Y+     + +V + +ALIDM++KCG +E ARLVFD 
Sbjct: 324 ISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDR 383

Query: 405 IRDKDVVAWNSMIVGYSMHGFSQNALKLFEEMTETGYQPTDITFIGLLSACSHGGLVEEG 464
             D+DVV W++MIVGY +HG ++ A+ L+  M   G  P D+TF+GLL AC+H G+V EG
Sbjct: 384 TLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREG 443

Query: 465 RSFFRLMRDKYLIEPKVEHYGCMVNLLGRAGHLEEAYELVKNMSIAADPVLWGSLLGSCR 524
             FF  M D + I P+ +HY C+++LLGRAGHL++AYE++K M +     +WG+LL +C+
Sbjct: 444 WWFFNRMAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACK 503

Query: 525 LHGNIKLGEEIAEFLVDQKLANSGTYVLLSNIYAATGNWEGVAKMRTLMKEHGIEKEPGC 584
            H +++LGE  A+ L     +N+G YV LSN+YAA   W+ VA++R  MKE G+ K+ GC
Sbjct: 504 KHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGC 563

Query: 585 SSIEVNNKVHEFLAGERKHPKSKEIYVMLNEINSWLKARRYTPQIGVVLHDLEEEQKKQS 641
           S +EV  ++  F  G++ HP+ +EI   +  I S LK   +       LHDL +E+ +++
Sbjct: 564 SWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEET 623

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038882525.1  0.0e+00   93.44     pentatricopeptide repeat-containing protein ELI1, chloroplastic [Benincasa hispi...
KAG6604154.1    0.0e+00   91.07     Pentatricopeptide repeat-containing protein ELI1, chloroplastic, partial [Cucurb...
XP_022950238.1  0.0e+00   90.91     pentatricopeptide repeat-containing protein ELI1, chloroplastic [Cucurbita mosch...
KAG7034315.1    0.0e+00   90.91     Pentatricopeptide repeat-containing protein ELI1, chloroplastic, partial [Cucurb...
XP_023543384.1  0.0e+00   90.91     pentatricopeptide repeat-containing protein ELI1, chloroplastic [Cucurbita pepo ...

Match Name      E-value   Identity  Description
Q9SZT8          2.0e-219  59.05     Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...
Q9FI80          3.7e-149  43.75     Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
Q9LN01          1.4e-148  39.29     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380          5.5e-145  42.15     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LTV8          1.3e-143  38.99     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A6J1GED1      0.0e+00   90.91     pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucurbita mos...
A0A6J1IMD2      0.0e+00   90.28     pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucurbita max...
A0A1S3B0U7      0.0e+00   89.84     pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Cucumis melo ...
A0A0A0KGF1      0.0e+00   89.53     DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G4922...
A0A6J1BWF5      0.0e+00   88.28     pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Momordica cha...

Match Name      E-value   Identity  Description
AT4G37380.1     1.4e-220  59.05     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1     2.6e-150  43.75     Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1     9.9e-150  39.29     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1     3.9e-146  42.15     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1     9.6e-145  38.99     mitochondrial editing factor 22

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                    Source       Source Term      Source Description                           Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10       Tetratricopeptide repeat domain              coord: 310..573; e-value: 5.0E-40; score: 139.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10       Tetratricopeptide repeat domain              coord: 24..156; e-value: 3.3E-14; score: 54.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10       Tetratricopeptide repeat domain              coord: 157..288; e-value: 1.7E-27; score: 98.0
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041          PPR_2                                        coord: 231..278; e-value: 4.0E-8; score: 33.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041          PPR_2                                        coord: 103..150; e-value: 3.0E-10; score: 40.1
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041          PPR_2                                        coord: 332..380; e-value: 4.9E-11; score: 42.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535          PPR                                          coord: 176..200; e-value: 0.063; score: 13.6
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535          PPR                                          coord: 407..431; e-value: 0.011; score: 16.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                    coord: 234..267; e-value: 7.5E-6; score: 23.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                    coord: 335..368; e-value: 1.8E-5; score: 22.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                    coord: 107..140; e-value: 4.8E-6; score: 24.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                    coord: 307..335; e-value: 0.0019; score: 16.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                    coord: 204..234; e-value: 2.8E-6; score: 25.1
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                          coord: 232..266; score: 11.334042
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                          coord: 333..367; score: 12.386327
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                          coord: 201..231; score: 9.04313
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                          coord: 104..138; score: 10.577712
IPR032867  DYW domain                                         PFAM         PF14432          DYW_deaminase                                coord: 506..630; e-value: 9.2E-39; score: 132.2
None       No IPR available                                   MOBIDB_LITE  mobidb-lite      disorder_prediction                          coord: 1..37
None       No IPR available                                   PANTHER      PTHR47926        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN  coord: 29..626
None       No IPR available                                   PANTHER      PTHR47926:SF226  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN ELI1, CHLOROPLASTIC  coord: 29..626
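
Each "coord: a..b" entry in the InterPro table marks a span on the protein sequence. The sketch below extracts those spans and their lengths from a line of text. It assumes the coordinates are 1-based, inclusive residue positions (the usual InterProScan convention), and the helper name `span_lengths` is hypothetical.

```python
import re

# Matches a coordinate range such as "coord: 506..630".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' range in text.

    Assumption: positions are 1-based and inclusive, so length = end - start + 1.
    """
    return [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD.findall(text)]

# The DYW domain hit (PF14432) from the table above.
print(span_lengths("PF14432 DYW_deaminase coord: 506..630"))
```

Under that assumption, the DYW_deaminase hit at 506..630 spans 125 residues.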

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC01G016520.1  CaUC01G016520.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding