CmUC02G038770 (gene) Watermelon (USVL531) v1

Overview
NameCmUC02G038770
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionpentatricopeptide repeat-containing protein At2g01390-like
LocationCmU531Chr02: 26288814 .. 26291641 (+)
RNA-Seq ExpressionCmUC02G038770
SyntenyCmUC02G038770
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CGAGCGTTGCCTCGGTGGTGGTTATTTTTTTTTAATTTGATTCAAGTCAACTTGGTTGGTATGGGAAAAAATTTGGTAGTCTTTTAGGGTTTACAATCTGCCGCCTACAAACACCCGCCGCCGACCAAATCCGCCGCCGACTTCATCTGTTCGTAGATCTGACCGAAGAAACTCACCTCTGCTATCAGATCTGCTGCTGTCGACCAAGATCCAAGCTATTCCGTCCGTTCGTCGCGCTCTCCGTTGCGGTTCCTGTCGTCCGACTCGTCTGCTGTTGACCAAGATCCAACGCCGACACATTCATCGCCGTCCGTTCATCGCGCGTCAAGTTCTCCGTCGCTCTCGGTCTTCCTCTTTCGCCGGCAAAAAACGCAGTAACAGTTGCTTTTTGAATTGCGTCCAAGTAGTCTCGGAGTCCTCTCTTCCTCCCATTGATGCCATTCCAGCGCTTCTCCTTCTAGACACAGCACGGCCGCGTCGAGTTATCCTTCTCCGACAGCCGATTGACCAGGAAGTAACGCTCCACTCGATGGAACCATCCGAGCGGATCCTCCCCTACCTCTCCTTTGTAGATGGGTATTTCTAAAAACCTTATTTTGTTGTCCATTCCATGCATTGTTCTATTAGTTTTTCTTTACTTCTGAGTCACTATGTGGTTACATCTGCCATCCATAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTTCACCAATACAAACAAGAGAAACCCACCACACGATTCAGTAGAAAGTCGAGGAAGGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGTGAGGAACATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACACCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCTGAAAACACATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACGTACACGACGATGCTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCGGGAGATGTTGAGGGAGCAATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACGGTGGTTTCTTATACTGCTTATATAAAGATTTTGTTGGACATTGGCCAAATTAAGGAGGCCACTGATGCATACAAGGAGTTGCTTCAATCTGGGCTATCTCCAAGTTGTTGTACTTACACCATCTTAATGGAATACCTTATTGGAGAGGGTAAGTTTCTCTATTCATCCATGTTAGTGAGCTTCAAAATTTGCTGTTTATTTATTTTTTATTTTTTATTTTTTATTTTTTTTGGTTAGGTTAGGTTAACAGTTGTTTCTTTCATTGTACTTCCAGGTAAATGCAAAGAAGCCCTTGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCAGCTTGCAATATATTGATTCAGAAATGCTGTAAATCAGGGGAGAGGCTAGTAATGACACAAATTCTTGAATACATGAAAGAAAAACACCTTGTGCTTCGATACCCTGTGTTTCTTGAAGCACATGAAACTTTAAAAAGTTGTTCTTTTAGTTGTAACCTACTCAGGCAAGTTAATCCTCATATAGAAATTGAATCAGTCAGTAAGGGCGAGGTTGTGGATGTTAGTACAAGTTCTAATGTTGTTCCTTCCGATGTAGATTATGAGCTTGTGGCAATTCTGTTGAAGGAGAATAAACTTACTGCTATTGACTACATGCTCATTGGGACAGTAGATAAAAACATACGGTTGGATTCTTCAATTATTTTATCCATCATTGAGGTGAATTGCAAACATAATCGACCCAACGGTGCTCTACTGGCTTTCAACTATGGTTTTAAAAATGGTGTTAACATTGAGAGAAATCTGTATCTTTGCTTGATCGGAATTCTGATACGTTCGAGTATATATCCGAAGTTGTTGGAAATTGTTCAGAAAATGTATATGCAAGGGCATTGCCTTGGACTCTATTATGCCACACTTATACTTCATAGTCTTGGTAAAGCTGGAAAACCTCAATATGCTAGGAAAGTTTTCAATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGATGCTTATTTCTCTGCTGGAAGTTCTGGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAAAGGATTTACACCATCTTTAGGCACGTATAATGTGCTGTTAACTGGTCTTGTGAAGAATGGTAGAGTCTGTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATTAGTCATCATTCTCATCCCAATACAGTATTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGAGGAGAATCGGAGGAAGAAAGGATTTGTGATCTTCTTTTCGGAGAATTGGTATCTTGAGGAGGATCGGAGGAAGAAAGGATTTGCGAACTATGCAAGCCCAACGTAGAATGGATCAGCGAGGCTACGGAATTCATTGTTTTCGATTTCAGCATCACCAACAGGGTGGGACACAGTTTTATTATTGCCAATGTTATGAAGCAACGGGCTATTCTTCTGTATGCTGTCGCGACTTGGTGATCAGTTGACGTGCACAAGGTTCCTCATTAGTCGATGCACCCCATCTGACGAGGTTCCATTACAGTTGGCTTTGGACATTCGTAATTAA

mRNA sequence

CGAGCGTTGCCTCGGTGGTGGTTATTTTTTTTTAATTTGATTCAAGTCAACTTGGTTGGTATGGGAAAAAATTTGGTAGTCTTTTAGGGTTTACAATCTGCCGCCTACAAACACCCGCCGCCGACCAAATCCGCCGCCGACTTCATCTGTTCGTAGATCTGACCGAAGAAACTCACCTCTGCTATCAGATCTGCTGCTGTCGACCAAGATCCAAGCTATTCCGTCCGTTCGTCGCGCTCTCCGTTGCGGTTCCTGTCGTCCGACTCGTCTGCTGTTGACCAAGATCCAACGCCGACACATTCATCGCCGTCCGTTCATCGCGCGTCAAGTTCTCCGTCGCTCTCGGTCTTCCTCTTTCGCCGGCAAAAAACGCAGTAACAGTTGCTTTTTGAATTGCGTCCAAGTAGTCTCGGAGTCCTCTCTTCCTCCCATTGATGCCATTCCAGCGCTTCTCCTTCTAGACACAGCACGGCCGCGTCGAGTTATCCTTCTCCGACAGCCGATTGACCAGGAAGTAACGCTCCACTCGATGGAACCATCCGAGCGGATCCTCCCCTACCTCTCCTTTGTAGATGGGTATTTCTAAAAACCTTATTTTGTTGTCCATTCCATGCATTGTTCTATTAGTTTTTCTTTACTTCTGAGTCACTATGTGGTTACATCTGCCATCCATAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTTCACCAATACAAACAAGAGAAACCCACCACACGATTCAGTAGAAAGTCGAGGAAGGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGTGAGGAACATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACACCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCTGAAAACACATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACGTACACGACGATGCTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCGGGAGATGTTGAGGGAGCAATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACGGTGGTTTCTTATACTGCTTATATAAAGATTTTGTTGGACATTGGCCAAATTAAGGAGGCCACTGATGCATACAAGGAGTTGCTTCAATCTGGGCTATCTCCAAGTTGTTGTACTTACACCATCTTAATGGAATACCTTATTGGAGAGGGTAAATGCAAAGAAGCCCTTGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCAGCTTGCAATATATTGATTCAGAAATGCTGTAAATCAGGGGAGAGGCTAGTAATGACACAAATTCTTGAATACATGAAAGAAAAACACCTTGTGCTTCGATACCCTGTGTTTCTTGAAGCACATGAAACTTTAAAAAGTTGTTCTTTTAGTTGTAACCTACTCAGGCAAGTTAATCCTCATATAGAAATTGAATCAGTCAGTAAGGGCGAGGTTGTGGATGTTAGTACAAGTTCTAATGTTGTTCCTTCCGATGTAGATTATGAGCTTGTGGCAATTCTGTTGAAGGAGAATAAACTTACTGCTATTGACTACATGCTCATTGGGACAGTAGATAAAAACATACGGTTGGATTCTTCAATTATTTTATCCATCATTGAGGTGAATTGCAAACATAATCGACCCAACGGTGCTCTACTGGCTTTCAACTATGGTTTTAAAAATGGTGTTAACATTGAGAGAAATCTGTATCTTTGCTTGATCGGAATTCTGATACGTTCGAGTATATATCCGAAGTTGTTGGAAATTGTTCAGAAAATGTATATGCAAGGGCATTGCCTTGGACTCTATTATGCCACACTTATACTTCATAGTCTTGGTAAAGCTGGAAAACCTCAATATGCTAGGAAAGTTTTCAATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGATGCTTATTTCTCTGCTGGAAGTTCTGGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAAAGGATTTACACCATCTTTAGGCACGTATAATGTGCTGTTAACTGGTCTTGTGAAGAATGGTAGAGTCTGTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATTAGTCATCATTCTCATCCCAATACAGTATTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGTTGACGTGCACAAGGTTCCTCATTAGTCGATGCACCCCATCTGACGAGGTTCCATTACAGTTGGCTTTGGACATTCGTAATTAA

Coding sequence (CDS)

ATGCATTGTTCTATTAGTTTTTCTTTACTTCTGAGTCACTATGTGGTTACATCTGCCATCCATAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTTCACCAATACAAACAAGAGAAACCCACCACACGATTCAGTAGAAAGTCGAGGAAGGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGTGAGGAACATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACACCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCTGAAAACACATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACGTACACGACGATGCTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCGGGAGATGTTGAGGGAGCAATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACGGTGGTTTCTTATACTGCTTATATAAAGATTTTGTTGGACATTGGCCAAATTAAGGAGGCCACTGATGCATACAAGGAGTTGCTTCAATCTGGGCTATCTCCAAGTTGTTGTACTTACACCATCTTAATGGAATACCTTATTGGAGAGGGTAAATGCAAAGAAGCCCTTGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCAGCTTGCAATATATTGATTCAGAAATGCTGTAAATCAGGGGAGAGGCTAGTAATGACACAAATTCTTGAATACATGAAAGAAAAACACCTTGTGCTTCGATACCCTGTGTTTCTTGAAGCACATGAAACTTTAAAAAGTTGTTCTTTTAGTTGTAACCTACTCAGGCAAGTTAATCCTCATATAGAAATTGAATCAGTCAGTAAGGGCGAGGTTGTGGATGTTAGTACAAGTTCTAATGTTGTTCCTTCCGATGTAGATTATGAGCTTGTGGCAATTCTGTTGAAGGAGAATAAACTTACTGCTATTGACTACATGCTCATTGGGACAGTAGATAAAAACATACGGTTGGATTCTTCAATTATTTTATCCATCATTGAGGTGAATTGCAAACATAATCGACCCAACGGTGCTCTACTGGCTTTCAACTATGGTTTTAAAAATGGTGTTAACATTGAGAGAAATCTGTATCTTTGCTTGATCGGAATTCTGATACGTTCGAGTATATATCCGAAGTTGTTGGAAATTGTTCAGAAAATGTATATGCAAGGGCATTGCCTTGGACTCTATTATGCCACACTTATACTTCATAGTCTTGGTAAAGCTGGAAAACCTCAATATGCTAGGAAAGTTTTCAATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGATGCTTATTTCTCTGCTGGAAGTTCTGGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAAAGGATTTACACCATCTTTAGGCACGTATAATGTGCTGTTAACTGGTCTTGTGAAGAATGGTAGAGTCTGTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATTAGTCATCATTCTCATCCCAATACAGTATTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGTTGACGTGCACAAGGTTCCTCATTAGTCGATGCACCCCATCTGACGAGGTTCCATTACAGTTGGCTTTGGACATTCGTAATTAA

Protein sequence

MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSDVDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYGFKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKPQYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLTCTRFLISRCTPSDEVPLQLALDIRN
Homology
BLAST of CmUC02G038770 vs. NCBI nr
Match: XP_038901985.1 (pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida])

HSP 1 Score: 1037.3 bits (2681), Expect = 5.2e-299
Identity = 520/587 (88.59%), Postives = 545/587 (92.84%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MHCS SFS LLS+YVVTSAI KRIYQNISSK LHSFHQYKQEKP  +F+RKSRKGTKVVK
Sbjct: 1   MHCSNSFSFLLSNYVVTSAIGKRIYQNISSKCLHSFHQYKQEKPIKQFNRKSRKGTKVVK 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEVD R YTRDTVRNI NILR CSWGSAQ HLEMLPIRWDSYLINQVLKTHPPLEKTWL
Sbjct: 61  KEEVDLRRYTRDTVRNIYNILRQCSWGSAQEHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK IKIDAVTYTSLMHWR
Sbjct: 121 FFNWASRLQMFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKRIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLD  QIKEATD YKE+LQSGL P+C
Sbjct: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDSDQIKEATDTYKEMLQSGLPPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGETLVMTQILEY 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MK+K LVLRYPVF+EAHETLKSCS S  LLRQVNPHIEIESVSKGEVV+VST SN+VP +
Sbjct: 301 MKDKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVSKGEVVNVSTRSNIVPPN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD+EL+AILLKENKLTAIDYML G VD+NI+LDSSIILSI EVNCK NRPNGALLAFNY 
Sbjct: 361 VDHELLAILLKENKLTAIDYMLTGIVDRNIQLDSSIILSIFEVNCKSNRPNGALLAFNYC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            K+GVNIER LYL LIGILIRSSIYPKLLEIVQKMY QGHCLGLY+ATLIL+ LGKAGKP
Sbjct: 421 LKDGVNIERKLYLDLIGILIRSSIYPKLLEIVQKMYTQGHCLGLYHATLILYRLGKAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFN+LPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGF PSLGTYNVLL 
Sbjct: 481 QYARKVFNVLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFAPSLGTYNVLLA 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELL 588
           GL K GR+ EL IYR+E+KSFEISHHSH  T+LEEERICDLL+GEL+
Sbjct: 541 GLAKCGRIDELHIYRKERKSFEISHHSHLYTILEEERICDLLYGELV 587

BLAST of CmUC02G038770 vs. NCBI nr
Match: XP_011649371.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucumis sativus] >KGN62065.1 hypothetical protein Csa_006695 [Cucumis sativus])

HSP 1 Score: 1017.3 bits (2629), Expect = 5.5e-293
Identity = 506/588 (86.05%), Postives = 542/588 (92.18%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MH    FSLLLS+YVV+SAI KRIYQNISSK LHS HQYK++KP +RFSR+SRKGTKV K
Sbjct: 1   MHFCNIFSLLLSNYVVSSAIRKRIYQNISSKCLHSLHQYKRDKPISRFSRQSRKGTKVAK 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV PRLYTRDTVRNICNILRNCSW SAQ HLEMLPIRWDSYLINQVLKTHPPLEKTWL
Sbjct: 61  KEEVIPRLYTRDTVRNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWAS LQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWASTLQVFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAIK+WKEMKANGC+PTVVSYTAYIKILLD GQI EAT  YK++LQSGLSP+C
Sbjct: 181 SNSGDVDGAIKLWKEMKANGCHPTVVSYTAYIKILLDNGQINEATATYKKMLQSGLSPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKE   VLRYPVF+EAHETLKSCS S  LL+QVNPH+EIES+SKGEVVDVST SN VP +
Sbjct: 301 MKENRFVLRYPVFVEAHETLKSCSVSYALLKQVNPHMEIESISKGEVVDVSTGSNTVPPN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD EL+A+LLK+NKLTA+D+MLIG VDKNI+LDSSII SIIEVNCK NRPN ALLAF+Y 
Sbjct: 361 VDNELLAMLLKDNKLTAVDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSALLAFDYC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KN VNI+R LYL LIGILIRSSIYPKLLEIVQ+MY QGHCLGLY+ATLIL SLGKAGKP
Sbjct: 421 LKNSVNIKRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCLGLYHATLILCSLGKAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALVD YFSAGSSGKGLKI+ETMRKKGFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNMLPEELKCTATYTALVDGYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GL KNGR  EL+IYRREKKSFEISHHS  NT+L++ERICDLLFGEL++
Sbjct: 541 GLAKNGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of CmUC02G038770 vs. NCBI nr
Match: XP_016902133.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis melo])

HSP 1 Score: 1005.7 bits (2599), Expect = 1.7e-289
Identity = 499/588 (84.86%), Postives = 539/588 (91.67%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MH    FSLLLS+YVV SAI KRIYQNIS K LHS HQYK+EKP +RFSR SRKGTKVVK
Sbjct: 1   MHFCNRFSLLLSNYVVISAIRKRIYQNISCKCLHSLHQYKREKPISRFSRNSRKGTKVVK 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV PR+YTRDTV NICNILRNCSW SAQ HLEMLPIRWDSYLINQVLKTHPPLEKTWL
Sbjct: 61  KEEVIPRVYTRDTVCNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRL++FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWR
Sbjct: 121 FFNWASRLKVFKHDQYTYTTMLDIFGEAGRISSMNYLFQQMKEKGIKIDAATYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAIKVWKEMKANGC+PTVVSYTAYIKILLD GQ KEAT  YKE+L++GLSP+C
Sbjct: 181 SNSGDVDGAIKVWKEMKANGCHPTVVSYTAYIKILLDNGQSKEATATYKEMLKTGLSPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKE   VLRYPVF+EAHE LKSCS    LLRQVNPHIEIES+SKGEV+DVST SN VP +
Sbjct: 301 MKENRFVLRYPVFVEAHENLKSCSVGHALLRQVNPHIEIESISKGEVLDVSTGSNTVPPN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD EL+A+LLK+NKLTAID+MLIG VDKNI+LDSSII SIIEVNCK NRPN A+LAF+Y 
Sbjct: 361 VDNELLAMLLKDNKLTAIDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSAMLAFDYC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGVNI R LYL LIGILIRSSIYPKLLEIVQ+MY QGHC+GLY+ATLIL+SLG+AGKP
Sbjct: 421 LKNGVNIGRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCIGLYHATLILYSLGRAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFN+LPEELKCTATYT+LVDAYFSAGSSGKGLKI+ETMRKKGFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNILPEELKCTATYTSLVDAYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GL K+GR  EL+IYRREKKSFEISHHS  NT+L++ERICDLLFGEL++
Sbjct: 541 GLAKSGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of CmUC02G038770 vs. NCBI nr
Match: XP_022971714.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] >XP_022971715.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima])

HSP 1 Score: 1001.5 bits (2588), Expect = 3.1e-288
Identity = 502/588 (85.37%), Postives = 534/588 (90.82%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           M CS SFS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VK
Sbjct: 5   MRCSNSFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV+   YTRDTVRNI NILRNCSWGSAQGH+E LPIRWDSYLINQVLKTHPPLEK WL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWGSAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATD YKE+LQSGLSP+C
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDTYKEMLQSGLSPNC 244

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYT+LMEYLIGE K KEALDIFHKMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKEK LVLRYPVF+EAHE LKSCS S  LL QVNPHIEIESVSKGEVVDVSTS NV+   
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VDYELVA LLKE KL A+D++LIG  DKNI+LDSSIILSIIEVNCK NRPNGALLAF+Y 
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGV +ERNLYL LIG+LIRSSIY  LLEIVQ MY +GHCLGLY+ATLIL+ LGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQDMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALV AYFSAGS GKGLKIYETMRKKGFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GLVK+ RV ELDIYRREKK FEISHHSH  T+LEEERICDLLFGEL++
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of CmUC02G038770 vs. NCBI nr
Match: XP_023512200.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 995.7 bits (2573), Expect = 1.7e-286
Identity = 495/588 (84.18%), Postives = 532/588 (90.48%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           M CS  FS  +S+YVVTSAI KR+YQNISSK LHS HQYKQEKP +RF+RK RKGTK VK
Sbjct: 5   MRCSNHFSFFMSNYVVTSAICKRVYQNISSKCLHSSHQYKQEKPFSRFNRKLRKGTKGVK 64

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEE+DP  YTRDTVRNI NILRNCSWG AQGH+E LPIRWDSYLINQVLKTHPPLEK WL
Sbjct: 65  KEELDPTPYTRDTVRNIYNILRNCSWGFAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFRHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATDAYKE+LQSGLSP+C
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDAYKEMLQSGLSPNC 244

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKSGE LVMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGAYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKEK LVLRYPVF+EAHE LKSCS S  LL QVNPHIEIESVSKGEVVDVSTS NV+   
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VDYELVA LLKE KL A+D++LIG  DKNI+LDSSIILSIIEVNCK NRPNGALLAF+Y 
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGV +ERNLYL LIG+LIRSSIY KLLE+VQ+MY +GHCLGLY+ATL L+ LGKAGKP
Sbjct: 425 LKNGVKVERNLYLGLIGLLIRSSIYSKLLEVVQEMYTKGHCLGLYHATLTLYRLGKAGKP 484

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALV AYFSAGS GKGLKIYETMRKKGFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GLVK+ RV ELDIYRREKK FEISHHSH  T+LEEERICDLLFGE ++
Sbjct: 545 GLVKSDRVAELDIYRREKKIFEISHHSHHGTILEEERICDLLFGEFVS 592

BLAST of CmUC02G038770 vs. ExPASy Swiss-Prot
Match: Q9ZU29 (Pentatricopeptide repeat-containing protein At2g01390 OS=Arabidopsis thaliana OX=3702 GN=At2g01390/At2g01380 PE=2 SV=2)

HSP 1 Score: 554.7 bits (1428), Expect = 1.3e-156
Identity = 288/559 (51.52%), Postives = 393/559 (70.30%), Query Frame = 0

Query: 29  SSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEV-DPRLYTRDTVRNICNILRNCSWG 88
           S K LHS  + K    + RFS+K     K+VK + + DP +YTRD V NI NIL+  +W 
Sbjct: 20  SVKLLHSLPRLKPTN-SKRFSQK----PKLVKTQTLPDPSVYTRDIVSNIYNILKYSNWD 79

Query: 89  SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGE 148
           SAQ  L  L +RWDS++IN+VLK HPP++K WLFFNWA++++ FKHD +TYTTMLDIFGE
Sbjct: 80  SAQEQLPHLGVRWDSHIINRVLKAHPPMQKAWLFFNWAAQIKGFKHDHFTYTTMLDIFGE 139

Query: 149 AGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVS 208
           AGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDV+GA+++W+EM+ NGC PTVVS
Sbjct: 140 AGRIQSMYSVFHLMKEKGVLIDTVTYTSLIHWVSSSGDVDGAMRLWEEMRDNGCEPTVVS 199

Query: 209 YTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGKCKEALDIFHKMQ 268
           YTAY+K+L   G+++EAT+ YKE+L+S +SP+C TYT+LMEYL+  GKC+EALDIF KMQ
Sbjct: 200 YTAYMKMLFADGRVEEATEVYKEMLRSRVSPNCHTYTVLMEYLVATGKCEEALDIFFKMQ 259

Query: 269 DAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKHLVLRYPVFLEAHETLKSCSFSC 328
           + GV PDKAACNILI K  K GE   MT++L YMKE  +VLRYP+F+EA ETLK+   S 
Sbjct: 260 EIGVQPDKAACNILIAKALKFGETSFMTRVLVYMKENGVVLRYPIFVEALETLKAAGESD 319

Query: 329 NLLRQVNPHIEIESVSKGEVVDVSTS--SNVVPSDVDYELVAILLKENKLTAIDYMLIGT 388
           +LLR+VN HI +ES+   ++ +  T+  ++   SD    + ++LL +  L A+D +L   
Sbjct: 320 DLLREVNSHISVESLCSSDIDETPTAEVNDTKNSDDSRVISSVLLMKQNLVAVDILLNQM 379

Query: 389 VDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYGFKNGVNIERNLYLCLIGILIRSSIY 448
            D+NI+LDS ++ +IIE NC   R  GA LAF+Y  + G++++++ YL LIG  +RS+  
Sbjct: 380 RDRNIKLDSFVVSAIIETNCDRCRTEGASLAFDYSLEMGIHLKKSAYLALIGNFLRSNEL 439

Query: 449 PKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKPQYARKVFNMLPEELKCTATYTALVD 508
           PK++E+V++M    H LG Y   +++H LG   +P+ A  VF++LP++ K  A YTAL+D
Sbjct: 440 PKVIEVVKEMVKAQHSLGCYQGAMLIHRLGFGRRPRLAADVFDLLPDDQKGVAAYTALMD 499

Query: 509 AYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKNGRV-CELDIYRREKKSFEIS 568
            Y SAGS  K +KI   MR++   PSLGTY+VLL+GL K      E+ + R+EKKS   S
Sbjct: 500 VYISAGSPEKAMKILREMREREIMPSLGTYDVLLSGLEKTSDFQKEVALLRKEKKSLVAS 559

Query: 569 HHSHPNTVLEEERICDLLF 584
                N V  E++ICDLLF
Sbjct: 560 ARFREN-VHVEDKICDLLF 572

BLAST of CmUC02G038770 vs. ExPASy Swiss-Prot
Match: Q9SSF9 (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX=3702 GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 2.4e-28
Identity = 78/238 (32.77%), Postives = 116/238 (48.74%), Query Frame = 0

Query: 48  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGHLEMLPIRWDSYL 107
           F + SR+  KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y 
Sbjct: 269 FGKPSREMMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQ 328

Query: 108 INQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK 167
            NQVLK          FF W  R   FKHD +TYTTM+   G A +   +N +  +M   
Sbjct: 329 ANQVLKQMDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRD 388

Query: 168 GIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEA 227
           G K + VTY  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I    G +  A
Sbjct: 389 GCKPNTVTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIA 448

Query: 228 TDAYKELLQSGLSPSCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILI 283
            D Y+ + ++GLSP   TY++++  L   G    A  +F +M   G  P+    NI+I
Sbjct: 449 MDMYQRMQEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMI 506

BLAST of CmUC02G038770 vs. ExPASy Swiss-Prot
Match: Q8GYP6 (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX=3702 GN=At1g18900 PE=2 SV=1)

HSP 1 Score: 124.4 bits (311), Expect = 4.4e-27
Identity = 71/216 (32.87%), Postives = 109/216 (50.46%), Query Frame = 0

Query: 74  VRNICNILRNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFK 133
           V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  +  A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIG 253
           + +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 EGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKS 289
            G    A  +F +M D G  P+    NI++    K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of CmUC02G038770 vs. ExPASy Swiss-Prot
Match: Q9C6S6 (Putative pentatricopeptide repeat-containing protein At1g31840 OS=Arabidopsis thaliana OX=3702 GN=At1g31840 PE=3 SV=2)

HSP 1 Score: 121.7 bits (304), Expect = 2.9e-26
Identity = 106/457 (23.19%), Postives = 195/457 (42.67%), Query Frame = 0

Query: 134 DQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVW 193
           D   Y+T++D + +AG +   + +F Q   KG+K+D V ++S +     SGD+  A  V+
Sbjct: 320 DLIAYSTLIDGYFKAGMLGMGHKLFSQALHKGVKLDVVVFSSTIDVYVKSGDLATASVVY 379

Query: 194 KEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGE 253
           K M   G  P VV+YT  IK L   G+I EA   Y ++L+ G+ PS  TY+ L++     
Sbjct: 380 KRMLCQGISPNVVTYTILIKGLCQDGRIYEAFGMYGQILKRGMEPSIVTYSSLIDGFCKC 439

Query: 254 GKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKHLVLRYPVF 313
           G  +    ++  M   G  PD     +L+    K G  L   +    M  + + L   VF
Sbjct: 440 GNLRSGFALYEDMIKMGYPPDVVIYGVLVDGLSKQGLMLHAMRFSVKMLGQSIRLNVVVF 499

Query: 314 ---LEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSDVDYELVAILL 373
              ++    L     +  + R +  +        G   DV+T + V         + + +
Sbjct: 500 NSLIDGWCRLNRFDEALKVFRLMGIY--------GIKPDVATFTTV---------MRVSI 559

Query: 374 KENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYGFKNGVNIERN 433
            E +L    ++        +  D+    ++I+  CKH +P   L  F+   +N ++ +  
Sbjct: 560 MEGRLEEALFLFFRMFKMGLEPDALAYCTLIDAFCKHMKPTIGLQLFDLMQRNKISADIA 619

Query: 434 LYLCLIGILIR-------SSIYPKLLE-------IVQKMYMQGHCL-------------- 493
           +   +I +L +       S  +  L+E       +     + G+C               
Sbjct: 620 VCNVVIHLLFKCHRIEDASKFFNNLIEGKMEPDIVTYNTMICGYCSLRRLDEAERIFELL 679

Query: 494 -------GLYYATLILHSLGKAGKPQYARKVFNMLPEE--LKCTATYTALVDAYFSAGSS 551
                       T+++H L K      A ++F+++ E+       TY  L+D +  +   
Sbjct: 680 KVTPFGPNTVTLTILIHVLCKNNDMDGAIRMFSIMAEKGSKPNAVTYGCLMDWFSKSVDI 739

BLAST of CmUC02G038770 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 5.4e-25
Identity = 109/441 (24.72%), Postives = 189/441 (42.86%), Query Frame = 0

Query: 134 DQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVW 193
           D  +YTT+++ F + G        + +M ++GI  D VTY S++     +  ++ A++V 
Sbjct: 195 DVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVL 254

Query: 194 KEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGE 253
             M  NG  P  ++Y + +      GQ KEA    K++   G+ P   TY++LM+YL   
Sbjct: 255 NTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKN 314

Query: 254 GKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEK-----HLVL 313
           G+C EA  IF  M   G+ P+      L+Q     G  + M  +L+ M        H V 
Sbjct: 315 GRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVF 374

Query: 314 RYPVFLEAHE-TLKSCSFSCNLLRQ--VNPHIEIESVSKGEVVDVSTSSNVV-------- 373
              +   A +  +       + +RQ  +NP+    +V+ G V+ +   S  V        
Sbjct: 375 SILICAYAKQGKVDQAMLVFSKMRQQGLNPN----AVTYGAVIGILCKSGRVEDAMLYFE 434

Query: 374 --------PSDVDY-ELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHN 433
                   P ++ Y  L+  L   NK    + +++  +D+ I L++    SII+ +CK  
Sbjct: 435 QMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEG 494

Query: 434 RPNGALLAFNYGFKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYAT 493
           R               +  E+     L  +++R  + P +  I     + G+CL      
Sbjct: 495 RV--------------IESEK-----LFELMVRIGVKPNV--ITYNTLINGYCL------ 554

Query: 494 LILHSLGKAGKPQYARKVFN-MLPEELK-CTATYTALVDAYFSAGSSGKGLKIYETMRKK 548
                   AGK   A K+ + M+   LK  T TY+ L++ Y         L +++ M   
Sbjct: 555 --------AGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRMEDALVLFKEMESS 596

BLAST of CmUC02G038770 vs. ExPASy TrEMBL
Match: A0A0A0LJM3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G295410 PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 2.7e-293
Identity = 506/588 (86.05%), Postives = 542/588 (92.18%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MH    FSLLLS+YVV+SAI KRIYQNISSK LHS HQYK++KP +RFSR+SRKGTKV K
Sbjct: 1   MHFCNIFSLLLSNYVVSSAIRKRIYQNISSKCLHSLHQYKRDKPISRFSRQSRKGTKVAK 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV PRLYTRDTVRNICNILRNCSW SAQ HLEMLPIRWDSYLINQVLKTHPPLEKTWL
Sbjct: 61  KEEVIPRLYTRDTVRNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWAS LQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWASTLQVFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAIK+WKEMKANGC+PTVVSYTAYIKILLD GQI EAT  YK++LQSGLSP+C
Sbjct: 181 SNSGDVDGAIKLWKEMKANGCHPTVVSYTAYIKILLDNGQINEATATYKKMLQSGLSPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKE   VLRYPVF+EAHETLKSCS S  LL+QVNPH+EIES+SKGEVVDVST SN VP +
Sbjct: 301 MKENRFVLRYPVFVEAHETLKSCSVSYALLKQVNPHMEIESISKGEVVDVSTGSNTVPPN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD EL+A+LLK+NKLTA+D+MLIG VDKNI+LDSSII SIIEVNCK NRPN ALLAF+Y 
Sbjct: 361 VDNELLAMLLKDNKLTAVDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSALLAFDYC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KN VNI+R LYL LIGILIRSSIYPKLLEIVQ+MY QGHCLGLY+ATLIL SLGKAGKP
Sbjct: 421 LKNSVNIKRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCLGLYHATLILCSLGKAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALVD YFSAGSSGKGLKI+ETMRKKGFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNMLPEELKCTATYTALVDGYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GL KNGR  EL+IYRREKKSFEISHHS  NT+L++ERICDLLFGEL++
Sbjct: 541 GLAKNGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of CmUC02G038770 vs. ExPASy TrEMBL
Match: A0A1S4E1N2 (pentatricopeptide repeat-containing protein At2g01390-like OS=Cucumis melo OX=3656 GN=LOC103497457 PE=4 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 8.0e-290
Identity = 499/588 (84.86%), Postives = 539/588 (91.67%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MH    FSLLLS+YVV SAI KRIYQNIS K LHS HQYK+EKP +RFSR SRKGTKVVK
Sbjct: 1   MHFCNRFSLLLSNYVVISAIRKRIYQNISCKCLHSLHQYKREKPISRFSRNSRKGTKVVK 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV PR+YTRDTV NICNILRNCSW SAQ HLEMLPIRWDSYLINQVLKTHPPLEKTWL
Sbjct: 61  KEEVIPRVYTRDTVCNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRL++FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWR
Sbjct: 121 FFNWASRLKVFKHDQYTYTTMLDIFGEAGRISSMNYLFQQMKEKGIKIDAATYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAIKVWKEMKANGC+PTVVSYTAYIKILLD GQ KEAT  YKE+L++GLSP+C
Sbjct: 181 SNSGDVDGAIKVWKEMKANGCHPTVVSYTAYIKILLDNGQSKEATATYKEMLKTGLSPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKE   VLRYPVF+EAHE LKSCS    LLRQVNPHIEIES+SKGEV+DVST SN VP +
Sbjct: 301 MKENRFVLRYPVFVEAHENLKSCSVGHALLRQVNPHIEIESISKGEVLDVSTGSNTVPPN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD EL+A+LLK+NKLTAID+MLIG VDKNI+LDSSII SIIEVNCK NRPN A+LAF+Y 
Sbjct: 361 VDNELLAMLLKDNKLTAIDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSAMLAFDYC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGVNI R LYL LIGILIRSSIYPKLLEIVQ+MY QGHC+GLY+ATLIL+SLG+AGKP
Sbjct: 421 LKNGVNIGRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCIGLYHATLILYSLGRAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFN+LPEELKCTATYT+LVDAYFSAGSSGKGLKI+ETMRKKGFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNILPEELKCTATYTSLVDAYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GL K+GR  EL+IYRREKKSFEISHHS  NT+L++ERICDLLFGEL++
Sbjct: 541 GLAKSGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of CmUC02G038770 vs. ExPASy TrEMBL
Match: A0A6J1I9C5 (pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita maxima OX=3661 GN=LOC111470381 PE=4 SV=1)

HSP 1 Score: 1001.5 bits (2588), Expect = 1.5e-288
Identity = 502/588 (85.37%), Postives = 534/588 (90.82%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           M CS SFS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VK
Sbjct: 5   MRCSNSFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV+   YTRDTVRNI NILRNCSWGSAQGH+E LPIRWDSYLINQVLKTHPPLEK WL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWGSAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATD YKE+LQSGLSP+C
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDTYKEMLQSGLSPNC 244

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYT+LMEYLIGE K KEALDIFHKMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKEK LVLRYPVF+EAHE LKSCS S  LL QVNPHIEIESVSKGEVVDVSTS NV+   
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VDYELVA LLKE KL A+D++LIG  DKNI+LDSSIILSIIEVNCK NRPNGALLAF+Y 
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGV +ERNLYL LIG+LIRSSIY  LLEIVQ MY +GHCLGLY+ATLIL+ LGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQDMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALV AYFSAGS GKGLKIYETMRKKGFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GLVK+ RV ELDIYRREKK FEISHHSH  T+LEEERICDLLFGEL++
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of CmUC02G038770 vs. ExPASy TrEMBL
Match: A0A6J1EIW0 (pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita moschata OX=3662 GN=LOC111434967 PE=4 SV=1)

HSP 1 Score: 993.4 bits (2567), Expect = 4.1e-286
Identity = 498/588 (84.69%), Postives = 532/588 (90.48%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           M CS  FS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VK
Sbjct: 5   MRCSNHFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEV+   YTRDTVRNI NILRNCSW SAQGH+E LPIRWDSYLINQVLKTHPPLEK WL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWASAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFRHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           SNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD  ++++ATDAYKE+LQSGLSP+C
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNVRVRKATDAYKEMLQSGLSPNC 244

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKSGE LVMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGAYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKEK LVLRYPVF+EAHE LKSCS S  LL QVNPHIEIESVSKGEVVDVSTS NV+   
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VDYELVA LLKE KL A+D++LIG  DKNI+LDSSIILSIIEVNCK NRPNGALLAF+Y 
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            KNGV +ERNLYL LIG+LIRSSIY  LLEIVQ+MY +GHCLGLY+ATLIL+ LGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQEMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYARKVFNMLPEELKCTATYTALV AYFSAGS GKGLKIYETMRKKGFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GLVK+ RV ELDIYRREKK FEISHHSH  T+LEEERICDLLFGEL++
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of CmUC02G038770 vs. ExPASy TrEMBL
Match: A0A6J1DMJ2 (pentatricopeptide repeat-containing protein At2g01390 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022612 PE=4 SV=1)

HSP 1 Score: 961.1 bits (2483), Expect = 2.3e-276
Identity = 474/588 (80.61%), Postives = 526/588 (89.46%), Query Frame = 0

Query: 1   MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVK 60
           MH S SFSLLLS+YVV SAI K+IY NIS KALHS  QYKQEKP   FSRK RKG KVV+
Sbjct: 1   MHYSNSFSLLLSNYVVISAIRKKIYHNISIKALHSLRQYKQEKPIKLFSRKLRKGAKVVE 60

Query: 61  KEEVDPRLYTRDTVRNICNILRNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120
           KEEVDP+LYTRDTVRNI NILRN SW SAQ HLE LP+RWDSYLINQV+KTHPPLEK WL
Sbjct: 61  KEEVDPKLYTRDTVRNIYNILRNFSWSSAQEHLERLPMRWDSYLINQVMKTHPPLEKAWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWA RL+ FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWACRLRTFKHDQYTYTTMLDIFGEAGRISSMNYIFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSC 240
           S SGDV+GAIKVWKEMK NGCYPTVVSYTAYIKILLD  Q+KEATD YKE+LQSGLSP+C
Sbjct: 181 SKSGDVDGAIKVWKEMKTNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNC 240

Query: 241 CTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY 300
           CTYT+LMEYLIG GKCKEALDIFHKMQDAGVYPDKAACNILI KCC+SGE LVMT ILEY
Sbjct: 241 CTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILILKCCRSGEMLVMTPILEY 300

Query: 301 MKEKHLVLRYPVFLEAHETLKSCSFSCNLLRQVNPHIEIESVSKGEVVDVSTSSNVVPSD 360
           MKE   VLRYPVF+EAH+TLKSCS S  LLRQVNPHIE ESVSK EV+ V TSS ++PS+
Sbjct: 301 MKENRFVLRYPVFVEAHQTLKSCSVSETLLRQVNPHIETESVSKDEVIHVITSSTIIPSN 360

Query: 361 VDYELVAILLKENKLTAIDYMLIGTVDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYG 420
           VD+EL+ ILLK+ KL A+DY+L G VDKNI+LDS+II +IIEVNCKHNRP+GALL F++ 
Sbjct: 361 VDHELMEILLKKEKLIAVDYLLTGMVDKNIQLDSAIISTIIEVNCKHNRPDGALLVFDHC 420

Query: 421 FKNGVNIERNLYLCLIGILIRSSIYPKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKP 480
            K+GVN++RNLYL LIG+LIRSSIY KLLEIV +MY QGHCLGLY+ATLIL+ LGKAGKP
Sbjct: 421 LKSGVNMKRNLYLGLIGVLIRSSIYSKLLEIVLEMYRQGHCLGLYHATLILYRLGKAGKP 480

Query: 481 QYARKVFNMLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLT 540
           QYA K+FN+LPEELKCTATYTALV AYFSAGSSGKGLKIYETMRKKGF+PSLGTYNVLLT
Sbjct: 481 QYAVKIFNVLPEELKCTATYTALVGAYFSAGSSGKGLKIYETMRKKGFSPSLGTYNVLLT 540

Query: 541 GLVKNGRVCELDIYRREKKSFEISHHSHPNTVLEEERICDLLFGELLT 589
           GL K+GRV EL+IYRREKKSFEI ++SH + +LEE+RICDLL+GE+++
Sbjct: 541 GLEKSGRVVELEIYRREKKSFEIGYNSHHHIILEEDRICDLLYGEMIS 588

BLAST of CmUC02G038770 vs. TAIR 10
Match: AT2G01390.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 554.7 bits (1428), Expect = 9.4e-158
Identity = 288/559 (51.52%), Postives = 393/559 (70.30%), Query Frame = 0

Query: 29  SSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEV-DPRLYTRDTVRNICNILRNCSWG 88
           S K LHS  + K    + RFS+K     K+VK + + DP +YTRD V NI NIL+  +W 
Sbjct: 20  SVKLLHSLPRLKPTN-SKRFSQK----PKLVKTQTLPDPSVYTRDIVSNIYNILKYSNWD 79

Query: 89  SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGE 148
           SAQ  L  L +RWDS++IN+VLK HPP++K WLFFNWA++++ FKHD +TYTTMLDIFGE
Sbjct: 80  SAQEQLPHLGVRWDSHIINRVLKAHPPMQKAWLFFNWAAQIKGFKHDHFTYTTMLDIFGE 139

Query: 149 AGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVS 208
           AGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDV+GA+++W+EM+ NGC PTVVS
Sbjct: 140 AGRIQSMYSVFHLMKEKGVLIDTVTYTSLIHWVSSSGDVDGAMRLWEEMRDNGCEPTVVS 199

Query: 209 YTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGKCKEALDIFHKMQ 268
           YTAY+K+L   G+++EAT+ YKE+L+S +SP+C TYT+LMEYL+  GKC+EALDIF KMQ
Sbjct: 200 YTAYMKMLFADGRVEEATEVYKEMLRSRVSPNCHTYTVLMEYLVATGKCEEALDIFFKMQ 259

Query: 269 DAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKHLVLRYPVFLEAHETLKSCSFSC 328
           + GV PDKAACNILI K  K GE   MT++L YMKE  +VLRYP+F+EA ETLK+   S 
Sbjct: 260 EIGVQPDKAACNILIAKALKFGETSFMTRVLVYMKENGVVLRYPIFVEALETLKAAGESD 319

Query: 329 NLLRQVNPHIEIESVSKGEVVDVSTS--SNVVPSDVDYELVAILLKENKLTAIDYMLIGT 388
           +LLR+VN HI +ES+   ++ +  T+  ++   SD    + ++LL +  L A+D +L   
Sbjct: 320 DLLREVNSHISVESLCSSDIDETPTAEVNDTKNSDDSRVISSVLLMKQNLVAVDILLNQM 379

Query: 389 VDKNIRLDSSIILSIIEVNCKHNRPNGALLAFNYGFKNGVNIERNLYLCLIGILIRSSIY 448
            D+NI+LDS ++ +IIE NC   R  GA LAF+Y  + G++++++ YL LIG  +RS+  
Sbjct: 380 RDRNIKLDSFVVSAIIETNCDRCRTEGASLAFDYSLEMGIHLKKSAYLALIGNFLRSNEL 439

Query: 449 PKLLEIVQKMYMQGHCLGLYYATLILHSLGKAGKPQYARKVFNMLPEELKCTATYTALVD 508
           PK++E+V++M    H LG Y   +++H LG   +P+ A  VF++LP++ K  A YTAL+D
Sbjct: 440 PKVIEVVKEMVKAQHSLGCYQGAMLIHRLGFGRRPRLAADVFDLLPDDQKGVAAYTALMD 499

Query: 509 AYFSAGSSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKNGRV-CELDIYRREKKSFEIS 568
            Y SAGS  K +KI   MR++   PSLGTY+VLL+GL K      E+ + R+EKKS   S
Sbjct: 500 VYISAGSPEKAMKILREMREREIMPSLGTYDVLLSGLEKTSDFQKEVALLRKEKKSLVAS 559

Query: 569 HHSHPNTVLEEERICDLLF 584
                N V  E++ICDLLF
Sbjct: 560 ARFREN-VHVEDKICDLLF 572

BLAST of CmUC02G038770 vs. TAIR 10
Match: AT1G74750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 128.6 bits (322), Expect = 1.7e-29
Identity = 78/238 (32.77%), Postives = 116/238 (48.74%), Query Frame = 0

Query: 48  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGHLEMLPIRWDSYL 107
           F + SR+  KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y 
Sbjct: 269 FGKPSREMMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQ 328

Query: 108 INQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK 167
            NQVLK          FF W  R   FKHD +TYTTM+   G A +   +N +  +M   
Sbjct: 329 ANQVLKQMDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRD 388

Query: 168 GIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEA 227
           G K + VTY  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I    G +  A
Sbjct: 389 GCKPNTVTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIA 448

Query: 228 TDAYKELLQSGLSPSCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILI 283
            D Y+ + ++GLSP   TY++++  L   G    A  +F +M   G  P+    NI+I
Sbjct: 449 MDMYQRMQEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMI 506

BLAST of CmUC02G038770 vs. TAIR 10
Match: AT1G18900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 124.4 bits (311), Expect = 3.2e-28
Identity = 71/216 (32.87%), Postives = 109/216 (50.46%), Query Frame = 0

Query: 74  VRNICNILRNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFK 133
           V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  +  A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIG 253
           + +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 EGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKS 289
            G    A  +F +M D G  P+    NI++    K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of CmUC02G038770 vs. TAIR 10
Match: AT1G18900.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 124.4 bits (311), Expect = 3.2e-28
Identity = 71/216 (32.87%), Postives = 109/216 (50.46%), Query Frame = 0

Query: 74  VRNICNILRNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFK 133
           V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  +  A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIG 253
           + +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 EGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKS 289
            G    A  +F +M D G  P+    NI++    K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of CmUC02G038770 vs. TAIR 10
Match: AT1G18900.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 124.4 bits (311), Expect = 3.2e-28
Identity = 71/216 (32.87%), Postives = 109/216 (50.46%), Query Frame = 0

Query: 74  VRNICNILRNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFK 133
           V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  +  A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIG 253
           + +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 EGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKS 289
            G    A  +F +M D G  P+    NI++    K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901985.15.2e-29988.59pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida][more]
XP_011649371.15.5e-29386.05pentatricopeptide repeat-containing protein At2g01390 [Cucumis sativus] >KGN6206... [more]
XP_016902133.11.7e-28984.86PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis m... [more]
XP_022971714.13.1e-28885.37pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] >XP_022... [more]
XP_023512200.11.7e-28684.18pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9ZU291.3e-15651.52Pentatricopeptide repeat-containing protein At2g01390 OS=Arabidopsis thaliana OX... [more]
Q9SSF92.4e-2832.77Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX... [more]
Q8GYP64.4e-2732.87Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX... [more]
Q9C6S62.9e-2623.19Putative pentatricopeptide repeat-containing protein At1g31840 OS=Arabidopsis th... [more]
Q76C995.4e-2524.72Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV... [more]
Match NameE-valueIdentityDescription
A0A0A0LJM32.7e-29386.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G295410 PE=4 SV=1[more]
A0A1S4E1N28.0e-29084.86pentatricopeptide repeat-containing protein At2g01390-like OS=Cucumis melo OX=36... [more]
A0A6J1I9C51.5e-28885.37pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita maxima OX=366... [more]
A0A6J1EIW04.1e-28684.69pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita moschata OX=3... [more]
A0A6J1DMJ22.3e-27680.61pentatricopeptide repeat-containing protein At2g01390 isoform X1 OS=Momordica ch... [more]
Match NameE-valueIdentityDescription
AT2G01390.19.4e-15851.52Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G74750.11.7e-2932.77Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.13.2e-2832.87Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.23.2e-2832.87Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.33.2e-2832.87Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 157..214
e-value: 2.6E-13
score: 49.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 238..287
e-value: 3.6E-11
score: 43.1
coord: 496..544
e-value: 7.3E-10
score: 38.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 171..205
e-value: 9.1E-9
score: 33.0
coord: 499..531
e-value: 1.7E-6
score: 25.8
coord: 241..274
e-value: 4.5E-7
score: 27.6
coord: 136..170
e-value: 4.7E-8
score: 30.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 204..238
score: 9.514466
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 239..273
score: 10.194067
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 496..530
score: 10.862706
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 134..168
score: 10.89559
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 169..203
score: 11.783455
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 28..218
e-value: 5.0E-25
score: 89.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 229..306
e-value: 1.5E-14
score: 55.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 387..580
e-value: 1.1E-19
score: 72.9
NoneNo IPR availablePANTHERPTHR47938:SF18OS10G0358700 PROTEINcoord: 44..586
NoneNo IPR availablePANTHERPTHR47938RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATEDcoord: 44..586

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC02G038770.1CmUC02G038770.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding