CaUC10G191670 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC10G191670
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein DOT4
Location: Ciama_Chr10: 26041293 .. 26043290 (+)
RNA-Seq Expression: CaUC10G191670
Synteny: CaUC10G191670
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATCTTCTTCTCTCCACCCACATTCATCCTCTCCCCATTACTCAAAAAACCAATCACGCATACGGTCGCCACCCACCATTTAATAATCCCCCTCATGTTCGTACTATAACTACCGAGAATTATGCTAATTTTAGCGTAGCCCACCAACTGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTGATTTCAACATATCAACAGATGTTGTTTCGTGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGTGCTACCCGCCAGTATGGCGATCTGCAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTGCTTACTTCCTTAATTGAATTGTACGGGATTCTTGACAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCGCTTGCAGAAACTCTGTTTCTTGGACTATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCATAGACGTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCTACGGCCATTGGTGCCTGTGGTGCACTCAAAATCCTGCAACATGGAAGAAACATCCACCATCTCGCAAGAATTCATGGGTTGGAATCTAATGTCTTGGTTAGTAATTCTCTGTTGAAAATGTACCTTGACTGTGATAGTATCAAAGATGCTCGGGGTTTCTTCGACCAAATGCCATCCAAAGATGTCATTTCGTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCGTTTAAGCTGTTTCGACAGATGAATTTGGATGGAGGATTGAAGCCTGATCCTCTTACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAAAGAGATTCATGGATACGTGCTTAAAAATGCTTTTGACGAGAATCTCATTGTCCAAAATGCTTTGGTGGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAGAGATATGGTTTCATGGACCATCATGACTTTGGGCTACAGCTTACATGGCCAAGGAAAACTTGGAGTCAATTTGTTCCGTGAGATGGAGAGGAACTTGAGGATGCATAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTTTTACTTCAGTTGCATTACCGAACCGACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGGCACCTAGATGAAGCAAGGACATTTGTAGAAAAAAATAAACTTGGCAAGCATGCTGAGGTTTTGAGAGCATTGCTCGATGGATGCAGGAACCGCCATCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCTGAGAATTACGTTCTACTTTCGAACTGGTATGCGTGCAACGAAAAATGGGACATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCTAAGAAGGCTTACAGTTGGATGGAGTTCTGCAACAAAATTCATGTGTTTGGGACGGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCAAAGCCAAATCCAGATTTCAGTTTTCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTCTTGGCAATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATGGTTGGCCGAGAAATCATAGTTAGAGATCCTTACGTTTTCCATCATTTCAAAGATGGTTGCTGTTCTTGTGAAGACTTTTGTTAA

mRNA sequence

ATGAATCTTCTTCTCTCCACCCACATTCATCCTCTCCCCATTACTCAAAAAACCAATCACGCATACGGTCGCCACCCACCATTTAATAATCCCCCTCATGTTCGTACTATAACTACCGAGAATTATGCTAATTTTAGCGTAGCCCACCAACTGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTGATTTCAACATATCAACAGATGTTGTTTCGTGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGTGCTACCCGCCAGTATGGCGATCTGCAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTGCTTACTTCCTTAATTGAATTGTACGGGATTCTTGACAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCGCTTGCAGAAACTCTGTTTCTTGGACTATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCATAGACGTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCTACGGCCATTGGTGCCTGTGGTGCACTCAAAATCCTGCAACATGGAAGAAACATCCACCATCTCGCAAGAATTCATGGGTTGGAATCTAATGTCTTGGTTAGTAATTCTCTGTTGAAAATGTACCTTGACTGTGATAGTATCAAAGATGCTCGGGGTTTCTTCGACCAAATGCCATCCAAAGATGTCATTTCGTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCGTTTAAGCTGTTTCGACAGATGAATTTGGATGGAGGATTGAAGCCTGATCCTCTTACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAAAGAGATTCATGGATACGTGCTTAAAAATGCTTTTGACGAGAATCTCATTGTCCAAAATGCTTTGGTGGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAGAGATATGGTTTCATGGACCATCATGACTTTGGGCTACAGCTTACATGGCCAAGGAAAACTTGGAGTCAATTTGTTCCGTGAGATGGAGAGGAACTTGAGGATGCATAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTTTTACTTCAGTTGCATTACCGAACCGACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGGCACCTAGATGAAGCAAGGACATTTGTAGAAAAAAATAAACTTGGCAAGCATGCTGAGGTTTTGAGAGCATTGCTCGATGGATGCAGGAACCGCCATCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCTGAGAATTACGTTCTACTTTCGAACTGGTATGCGTGCAACGAAAAATGGGACATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCTAAGAAGGCTTACAGTTGGATGGAGTTCTGCAACAAAATTCATGTGTTTGGGACGGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCAAAGCCAAATCCAGATTTCAGTTTTCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTCTTGGCAATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATGGTTGGCCGAGAAATCATAGTTAGAGATCCTTACGTTTTCCATCATTTCAAAGATGGTTGCTGTTCTTGTGAAGACTTTTGTTAA

Coding sequence (CDS)

ATGAATCTTCTTCTCTCCACCCACATTCATCCTCTCCCCATTACTCAAAAAACCAATCACGCATACGGTCGCCACCCACCATTTAATAATCCCCCTCATGTTCGTACTATAACTACCGAGAATTATGCTAATTTTAGCGTAGCCCACCAACTGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTGATTTCAACATATCAACAGATGTTGTTTCGTGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGTGCTACCCGCCAGTATGGCGATCTGCAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTGCTTACTTCCTTAATTGAATTGTACGGGATTCTTGACAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCGCTTGCAGAAACTCTGTTTCTTGGACTATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCATAGACGTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCTACGGCCATTGGTGCCTGTGGTGCACTCAAAATCCTGCAACATGGAAGAAACATCCACCATCTCGCAAGAATTCATGGGTTGGAATCTAATGTCTTGGTTAGTAATTCTCTGTTGAAAATGTACCTTGACTGTGATAGTATCAAAGATGCTCGGGGTTTCTTCGACCAAATGCCATCCAAAGATGTCATTTCGTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCGTTTAAGCTGTTTCGACAGATGAATTTGGATGGAGGATTGAAGCCTGATCCTCTTACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAAAGAGATTCATGGATACGTGCTTAAAAATGCTTTTGACGAGAATCTCATTGTCCAAAATGCTTTGGTGGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAGAGATATGGTTTCATGGACCATCATGACTTTGGGCTACAGCTTACATGGCCAAGGAAAACTTGGAGTCAATTTGTTCCGTGAGATGGAGAGGAACTTGAGGATGCATAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTTTTACTTCAGTTGCATTACCGAACCGACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGGCACCTAGATGAAGCAAGGACATTTGTAGAAAAAAATAAACTTGGCAAGCATGCTGAGGTTTTGAGAGCATTGCTCGATGGATGCAGGAACCGCCATCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCTGAGAATTACGTTCTACTTTCGAACTGGTATGCGTGCAACGAAAAATGGGACATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCTAAGAAGGCTTACAGTTGGATGGAGTTCTGCAACAAAATTCATGTGTTTGGGACGGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCAAAGCCAAATCCAGATTTCAGTTTTCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTCTTGGCAATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATGGTTGGCCGAGAAATCATAGTTAGAGATCCTTACGTTTTCCATCATTTCAAAGATGGTTGCTGTTCTTGTGAAGACTTTTGTTAA

Protein sequence

MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGDFYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCSCEDFC
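
The coding and protein sequences listed above can be cross-checked against each other with a short script. The following is a minimal sketch, assuming the standard genetic code and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
# Assumption: the standard (nuclear) code applies to this CDS.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGAATCTTCTTCTCTCCACCCAC"))  # MNLLLSTH
print(translate("GAAGACTTTTGTTAA"))           # EDFC

# Gene span on Ciama_Chr10, using the inclusive coordinates from the Overview.
gene_length = 26043290 - 26041293 + 1
print(gene_length, gene_length // 3)          # 1998 666
```

Passing the full CDS string to `translate` should reproduce the protein sequence if the record is internally consistent; the 1998 bp span corresponds to 666 codons, the last of which is the TAA stop.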
Homology
BLAST of CaUC10G191670 vs. NCBI nr
Match: XP_004137884.2 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus])

HSP 1 Score: 1293.1 bits (3345), Expect = 0.0e+00
Identity = 613/665 (92.18%), Positives = 639/665 (96.09%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH H LPITQK NHAY RHPPFNN PHVRT+T ENYAN  VAHQ+FD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTY+QMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           +D+FYQMVELADDIDAVALATAIGACGALK+L HGRNIHHLAR+HGLE N+LVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           Y+DCDSIKDARGFFDQMPSKD+ISWTELIHMYVKKGGINEAFKLFRQMN+DG LKPDP T
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYV+KNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSW+IMTLGYSLHGQGKLGV+LFREME+N +M RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
            YFSCIT+PTVAH ALKVALLARAG LDEARTFVEK KL KH E+LRALLDGCRN  QQK
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSW+EFC
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDGSKPNPDFS HDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIV+DPYVFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CE+FC
Sbjct: 661 CENFC 665

BLAST of CaUC10G191670 vs. NCBI nr
Match: XP_038905218.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1280.8 bits (3313), Expect = 0.0e+00
Identity = 617/666 (92.64%), Positives = 640/666 (96.10%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH+H LPITQ+T+H      PFNNPPHVRT T +N AN SVAHQLFDEIPIWDT
Sbjct: 1   MNLLLSTHVHCLPITQETSH----RQPFNNPPHVRTTTAKNSANLSVAHQLFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYG+LQ GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGNLQFGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSSNLYVLTSLIE YGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIEFYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSCA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELADDIDAVALATAIGACGALK+LQHGRNIH LARIHGLE NVLVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDAVALATAIGACGALKMLQHGRNIHLLARIHGLEFNVLVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDCDSIKDARGFFD+MPSKDVISWTELIHMYVKKGGINEAFKLFRQMN DGGLKPDPLT
Sbjct: 241 YLDCDSIKDARGFFDRMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNKDGGLKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSAS+TFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASETFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMH-RDEITYTAVLHACTTANMVDEG 420
           E+DMVSWTIMTLGYSLHGQGKLGV+LFRE+ERNLRMH RD+ITYTAVLHACTTANMVDEG
Sbjct: 361 EKDMVSWTIMTLGYSLHGQGKLGVSLFREIERNLRMHNRDQITYTAVLHACTTANMVDEG 420

Query: 421 DFYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQ 480
           DFYFSCITEPTVAH ALKVALLARAG LDEA TFVEKNKL KHA +LRALLDGCR  HQ+
Sbjct: 421 DFYFSCITEPTVAHIALKVALLARAGRLDEATTFVEKNKLDKHAVILRALLDGCRKHHQR 480

Query: 481 KLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEF 540
           KLGK+IIE+LCDLEPLNAENY+LLSNWYACN+KWDMVEKLRET+RDMGLRPKKAYSWMEF
Sbjct: 481 KLGKQIIEKLCDLEPLNAENYILLSNWYACNKKWDMVEKLRETMRDMGLRPKKAYSWMEF 540

Query: 541 CNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE 600
           CNKIHVFGTGDVSHPRS+NIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE
Sbjct: 541 CNKIHVFGTGDVSHPRSRNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE 600

Query: 601 LLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCC 660
           LLAISFGLIST+AGRTIRITKNLRVCHSCHESAKFISKMVGREIIV+DPYVFHHFKDGCC
Sbjct: 601 LLAISFGLISTKAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCC 660

Query: 661 SCEDFC 666
           SCED C
Sbjct: 661 SCEDVC 662

BLAST of CaUC10G191670 vs. NCBI nr
Match: XP_008465161.1 (PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1280.4 bits (3312), Expect = 0.0e+00
Identity = 611/665 (91.88%), Positives = 634/665 (95.34%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH H LPITQK  HAY RHPPFNN PHVRT T ENYA+  VAHQ+FDEIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD GHVIS Y+QMLFRGVRPDKHTLPRIICATRQYGDL VGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSS+LYVLTSLIELYGILDSADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFA
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELADDID+VALATAIGACGALK+L HGRNIHHLARIHGLE N+LVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMN+DG LKPDPLT
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYVLKN FDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSW+IMTLGYSLHGQGKLGV LFREME+NL+MHRDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYFS IT+PTVAH ALKVALLARAG LDEARTFVEK KL KH E+LRALLDGCRN  QQK
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLN ENY+LLSNWYACN+KWDMVE+LRETIRDMGLRPKKAYSW+EFC
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSK NP+FS HDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIV+DPYVFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CE+FC
Sbjct: 661 CENFC 665

BLAST of CaUC10G191670 vs. NCBI nr
Match: XP_038905219.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1197.2 bits (3096), Expect = 0.0e+00
Identity = 584/666 (87.69%), Positives = 607/666 (91.14%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH+H LPITQ+T+H      PFNNPPHVRT T +N AN SVAHQLFDEIPIWDT
Sbjct: 1   MNLLLSTHVHCLPITQETSH----RQPFNNPPHVRTTTAKNSANLSVAHQLFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYG+LQ GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGNLQFGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG                                  NSVSWTMLAKLYLMEDKPS A
Sbjct: 121 AFKLG----------------------------------NSVSWTMLAKLYLMEDKPSCA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELADDIDAVALATAIGACGALK+LQHGRNIH LARIHGLE NVLVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDAVALATAIGACGALKMLQHGRNIHLLARIHGLEFNVLVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDCDSIKDARGFFD+MPSKDVISWTELIHMYVKKGGINEAFKLFRQMN DGGLKPDPLT
Sbjct: 241 YLDCDSIKDARGFFDRMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNKDGGLKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSAS+TFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASETFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMH-RDEITYTAVLHACTTANMVDEG 420
           E+DMVSWTIMTLGYSLHGQGKLGV+LFRE+ERNLRMH RD+ITYTAVLHACTTANMVDEG
Sbjct: 361 EKDMVSWTIMTLGYSLHGQGKLGVSLFREIERNLRMHNRDQITYTAVLHACTTANMVDEG 420

Query: 421 DFYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQ 480
           DFYFSCITEPTVAH ALKVALLARAG LDEA TFVEKNKL KHA +LRALLDGCR  HQ+
Sbjct: 421 DFYFSCITEPTVAHIALKVALLARAGRLDEATTFVEKNKLDKHAVILRALLDGCRKHHQR 480

Query: 481 KLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEF 540
           KLGK+IIE+LCDLEPLNAENY+LLSNWYACN+KWDMVEKLRET+RDMGLRPKKAYSWMEF
Sbjct: 481 KLGKQIIEKLCDLEPLNAENYILLSNWYACNKKWDMVEKLRETMRDMGLRPKKAYSWMEF 540

Query: 541 CNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE 600
           CNKIHVFGTGDVSHPRS+NIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE
Sbjct: 541 CNKIHVFGTGDVSHPRSRNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE 600

Query: 601 LLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCC 660
           LLAISFGLIST+AGRTIRITKNLRVCHSCHESAKFISKMVGREIIV+DPYVFHHFKDGCC
Sbjct: 601 LLAISFGLISTKAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCC 628

Query: 661 SCEDFC 666
           SCED C
Sbjct: 661 SCEDVC 628

BLAST of CaUC10G191670 vs. NCBI nr
Match: XP_022158739.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Momordica charantia])

HSP 1 Score: 1182.5 bits (3058), Expect = 0.0e+00
Identity = 567/665 (85.26%), Positives = 606/665 (91.13%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           M+LLLSTH   LPIT KT+  Y R  PFNNPPHVRT  TENYAN   AH  FDEIP WDT
Sbjct: 1   MDLLLSTHFRRLPITPKTDLTYRRRRPFNNPPHVRTAITENYANLCEAHHPFDEIPTWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD+G VISTY+QML RGVRPD HTLPRII A+RQ GDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDVGLVISTYEQMLLRGVRPDNHTLPRIIGASRQCGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
            FKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFA
Sbjct: 121 VFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTMLAKLYVMEDKPSFA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELA DIDAVALATAIGACG+LK+LQHGRNIH LAR HGLE +VLVSNSLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGSLKLLQHGRNIHLLARTHGLEFDVLVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDC SI+DARGFF++MPSKDVISWTELI  YVKKGGINE FKLFRQMN+DGGLKPDP+T
Sbjct: 241 YLDCGSIRDARGFFNRMPSKDVISWTELIQAYVKKGGINEGFKLFRQMNMDGGLKPDPIT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHG+EIHGYVLK+A D NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRMAAHKHGREIHGYVLKSAIDVNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+D +SWT+M LGYSLHGQGKLGV+LFR MERNLRMHRDEITYT+VLHAC+TA++V+EGD
Sbjct: 361 EKDAISWTVMILGYSLHGQGKLGVSLFRLMERNLRMHRDEITYTSVLHACSTASLVEEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYF+CI EPT +HFALKVALLARAG LDEAR FVE++KL KH E+LRALLDGCR    +K
Sbjct: 421 FYFNCIMEPTFSHFALKVALLARAGRLDEARAFVEQHKLDKHPEILRALLDGCRTHRDKK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYACN K DMVEK RE +RDMGLRPKKAYSWMEF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNGKLDMVEKSREIVRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNL+CLMKKME+DG KP PDFSFHDVDEERECV IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLECLMKKMEDDGLKPKPDFSFHDVDEERECVLIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTI ITKNLRVCHSCHESAKFISK+VGREIIV+DPYVFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTICITKNLRVCHSCHESAKFISKIVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CaUC10G191670 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 438.3 bits (1126), Expect = 1.5e-121
Identity = 228/623 (36.60%), Positives = 356/623 (57.14%), Query Frame = 0

Query: 48  AHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A ++FDE+   D  +WN++I  +++NG     +S + QML  G+  D  T+  +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 108 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTML 167
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 168 AKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGL 227
              Y  E     A+ +F +M E     D   +   +  C   ++L  G+ +H   + + L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 228 ESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQ 287
             ++ VSN+L+ MY  C S+++A   F +M  KD+ISW  +I  Y K    NEA  LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 288 MNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSG 347
           +  +    PD  T++ +LPAC  ++A   G+EIHGY+++N +  +  V N+LVDMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 348 CIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVL 407
            +  A   F  +  +D+VSWT+M  GY +HG GK  + LF +M R   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 408 HACTTANMVDEGDFYFS-----CITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKH 467
           +AC+ + +VDEG  +F+     C  EPTV H+A  V +LAR G L +A  F+E   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 468 AEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRET 527
           A +  ALL GCR  H  KL +++ E++ +LEP N   YVL++N YA  EKW+ V++LR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 528 IRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 587
           I   GLR     SW+E   ++++F  GD S+P ++NI   L+ +  +M E+G  P   ++
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 588 FHDVDE-ERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGR 647
             D +E E+E    GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 648 EIIVRDPYVFHHFKDGCCSCEDF 665
           EI++RD   FH FKDG CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of CaUC10G191670 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 422.5 bits (1085), Expect = 8.6e-117
Identity = 221/633 (34.91%), Positives = 354/633 (55.92%), Query Frame = 0

Query: 39  TENYANFSVAHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTL 98
           + ++ + + A Q+FD++P    F WN +I+ +  N      +  Y  M    V PD  T 
Sbjct: 63  SSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTF 122

Query: 99  PRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSAC 158
           P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +    
Sbjct: 123 PHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPL 182

Query: 159 --RNSVSWTMLAKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGR 218
             R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR
Sbjct: 183 PERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGR 242

Query: 219 NIHHLARIHGLESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKG 278
           +IH      GLE    +  SL  MY  C  +  A+  FD+M S ++I W  +I  Y K G
Sbjct: 243 SIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNG 302

Query: 279 GINEAFKLFRQMNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQ 338
              EA  +F +M ++  ++PD ++I+S + AC ++ + +  + ++ YV ++ + +++ + 
Sbjct: 303 YAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 362

Query: 339 NALVDMYVKSGCIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRM 398
           +AL+DM+ K G ++ A   F    +RD+V W+ M +GY LHG+ +  ++L+R MER   +
Sbjct: 363 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GV 422

Query: 399 HRDEITYTAVLHACTTANMVDEGDFYFSCITE----PTVAHFALKVALLARAGHLDEART 458
           H +++T+  +L AC  + MV EG ++F+ + +    P   H+A  + LL RAGHLD+A  
Sbjct: 423 HPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYE 482

Query: 459 FVEKNKLGKHAEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEK 518
            ++   +     V  ALL  C+     +LG+   +QL  ++P N  +YV LSN YA    
Sbjct: 483 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 542

Query: 519 WDMVEKLRETIRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEE 578
           WD V ++R  +++ GL      SW+E   ++  F  GD SHPR + I   ++ +  +++E
Sbjct: 543 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKE 602

Query: 579 DGSKPNPDFSFHDV-DEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHES 638
            G   N D S HD+ DEE E     HSE +AI++GLIST  G  +RITKNLR C +CH +
Sbjct: 603 GGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAA 662

Query: 639 AKFISKMVGREIIVRDPYVFHHFKDGCCSCEDF 665
            K ISK+V REI+VRD   FHHFKDG CSC D+
Sbjct: 663 TKLISKLVDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of CaUC10G191670 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 9.5e-108
Identity = 228/687 (33.19%), Positives = 351/687 (51.09%), Query Frame = 0

Query: 50  QLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYG 109
           + FD++P  D+ +W  +I  +   G     I     M+  G+ P + TL  ++ +     
Sbjct: 101 EFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATR 160

Query: 110 DLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAK 169
            ++ GK++H+   KLG   N+ V  SL+ +Y        AK++ D+   R+  SW  +  
Sbjct: 161 CMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIA 220

Query: 170 LYLMEDKPSFAIDVFYQMVE-----------------------------LADDI---DAV 229
           L++   +   A+  F QM E                             L D +   D  
Sbjct: 221 LHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRF 280

Query: 230 ALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKMYLDCDSIKDARGFFDQ- 289
            LA+ + AC  L+ L  G+ IH      G + + +V N+L+ MY  C  ++ AR   +Q 
Sbjct: 281 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 340

Query: 290 --------------------------------MPSKDVISWTELIHMYVKKGGINEAFKL 349
                                           +  +DV++WT +I  Y + G   EA  L
Sbjct: 341 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 400

Query: 350 FRQMNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYV 409
           FR M + GG +P+  T++++L     +A+  HGK+IHG  +K+    ++ V NAL+ MY 
Sbjct: 401 FRSM-VGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYA 460

Query: 410 KSGCIQSASKTFSMMK-ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITY 469
           K+G I SAS+ F +++ ERD VSWT M +  + HG  +  + LF  M     +  D ITY
Sbjct: 461 KAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITY 520

Query: 470 TAVLHACTTANMVDEGDFYFSCITE-----PTVAHFALKVALLARAGHLDEARTFVEKNK 529
             V  ACT A +V++G  YF  + +     PT++H+A  V L  RAG L EA+ F+EK  
Sbjct: 521 VGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMP 580

Query: 530 LGKHAEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEK 589
           +        +LL  CR      LGK   E+L  LEP N+  Y  L+N Y+   KW+   K
Sbjct: 581 IEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAK 640

Query: 590 LRETIRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPN 649
           +R++++D  ++ ++ +SW+E  +K+HVFG  D +HP    IY  ++ +  ++++ G  P+
Sbjct: 641 IRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPD 700

Query: 650 PDFSFHDVDEE-RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISK 665
                HD++EE +E +   HSE LAI+FGLIST    T+RI KNLRVC+ CH + KFISK
Sbjct: 701 TASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISK 760

BLAST of CaUC10G191670 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.4e-106
Identity = 229/621 (36.88%), Positives = 335/621 (53.95%), Query Frame = 0

Query: 48  AHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A  LFD +P  D  +WN +I  +  NG     +  +  M    V PD  TL  +I A   
Sbjct: 250 ARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACEL 309

Query: 108 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTML 167
            GD ++G+ +HA     GF+ ++ V  SL ++Y    S   A+ L  +   ++ VSWT +
Sbjct: 310 LGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTM 369

Query: 168 AKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGL 227
              Y     P  AID +  M + +   D + +A  + AC  L  L  G  +H LA    L
Sbjct: 370 ISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARL 429

Query: 228 ESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQ 287
            S V+V+N+L+ MY  C  I  A   F  +P K+VISWT +I          EA    RQ
Sbjct: 430 ISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQ 489

Query: 288 MNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSG 347
           M +   L+P+ +T+++ L AC R+ A   GKEIH +VL+     +  + NAL+DMYV+ G
Sbjct: 490 MKMT--LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCG 549

Query: 348 CIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVL 407
            + +A   F+  K +D+ SW I+  GYS  GQG + V LF  M ++ R+  DEIT+ ++L
Sbjct: 550 RMNTAWSQFNSQK-KDVTSWNILLTGYSERGQGSMVVELFDRMVKS-RVRPDEITFISLL 609

Query: 408 HACTTANMVDEGDFYFSCITE----PTVAHFALKVALLARAGHLDEARTFVEKNKLGKHA 467
             C+ + MV +G  YFS + +    P + H+A  V LL RAG L EA  F++K  +    
Sbjct: 610 CGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDP 669

Query: 468 EVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETI 527
            V  ALL+ CR  H+  LG+   + + +L+  +   Y+LL N YA   KW  V K+R  +
Sbjct: 670 AVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMM 729

Query: 528 RDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG-SKPNPDFS 587
           ++ GL      SW+E   K+H F + D  HP+++ I   L+   +KM E G +K +   S
Sbjct: 730 KENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGLTKISESSS 789

Query: 588 FHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGRE 647
             + +  R+ +  GHSE  AI+FGLI+T  G  I +TKNL +C +CH++ KFISK V RE
Sbjct: 790 MDETEISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKFISKTVRRE 849

Query: 648 IIVRDPYVFHHFKDGCCSCED 664
           I VRD   FHHFKDG CSC D
Sbjct: 850 ISVRDAEHFHHFKDGECSCGD 866

BLAST of CaUC10G191670 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 4.4e-105
Identity = 233/679 (34.32%), Positives = 353/679 (51.99%), Query Frame = 0

Query: 36  TITTENYANFSVAHQ---LFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVR 95
           +I    Y N  + H+   LF  +      AW ++I+           ++++ +M   G  
Sbjct: 43  SIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRC 102

Query: 96  PDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGIL---DSADTA 155
           PD +  P ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L    S  + 
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 156 KWLHDKSACR--NSVSWTMLAKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACG 215
             + D+   R  NS    + A+  +M     F ID   ++ E+    D V+  T I    
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIM----PFGIDSVRRVFEVMPRKDVVSYNTIIAGYA 222

Query: 216 -------ALKILQH----------------------------GRNIHHLARIHGLESNVL 275
                  AL++++                             G+ IH      G++S+V 
Sbjct: 223 QSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVY 282

Query: 276 VSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDG 335
           + +SL+ MY     I+D+   F ++  +D ISW  L+  YV+ G  NEA +LFRQM +  
Sbjct: 283 IGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTA 342

Query: 336 GLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSA 395
            +KP  +  SS++PAC  +A    GK++HGYVL+  F  N+ + +ALVDMY K G I++A
Sbjct: 343 KVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAA 402

Query: 396 SKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTT 455
            K F  M   D VSWT + +G++LHG G   V+LF EM+R   +  +++ + AVL AC+ 
Sbjct: 403 RKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSH 462

Query: 456 ANMVDEGDFYFSCITE-----PTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLR 515
             +VDE   YF+ +T+       + H+A    LL RAG L+EA  F+ K  +     V  
Sbjct: 463 VGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWS 522

Query: 516 ALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMG 575
            LL  C      +L +++ E++  ++  N   YVL+ N YA N +W  + KLR  +R  G
Sbjct: 523 TLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKG 582

Query: 576 LRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVD 635
           LR K A SW+E  NK H F +GD SHP    I   L+ +M++ME++G   +     HDVD
Sbjct: 583 LRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVD 642

Query: 636 EE--RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIV 665
           EE  RE +  GHSE LA++FG+I+TE G TIR+TKN+R+C  CH + KFISK+  REIIV
Sbjct: 643 EEHKRELL-FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIV 702

BLAST of CaUC10G191670 vs. ExPASy TrEMBL
Match: A0A1S3CPR5 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502829 PE=3 SV=1)

HSP 1 Score: 1280.4 bits (3312), Expect = 0.0e+00
Identity = 611/665 (91.88%), Positives = 634/665 (95.34%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH H LPITQK  HAY RHPPFNN PHVRT T ENYA+  VAHQ+FDEIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD GHVIS Y+QMLFRGVRPDKHTLPRIICATRQYGDL VGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSS+LYVLTSLIELYGILDSADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFA
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELADDID+VALATAIGACGALK+L HGRNIHHLARIHGLE N+LVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMN+DG LKPDPLT
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYVLKN FDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSW+IMTLGYSLHGQGKLGV LFREME+NL+MHRDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYFS IT+PTVAH ALKVALLARAG LDEARTFVEK KL KH E+LRALLDGCRN  QQK
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLN ENY+LLSNWYACN+KWDMVE+LRETIRDMGLRPKKAYSW+EFC
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSK NP+FS HDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIV+DPYVFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CE+FC
Sbjct: 661 CENFC 665

BLAST of CaUC10G191670 vs. ExPASy TrEMBL
Match: A0A0A0L9N4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G722890 PE=3 SV=1)

HSP 1 Score: 1200.3 bits (3104), Expect = 0.0e+00
Identity = 573/624 (91.83%), Positives = 598/624 (95.83%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           MNLLLSTH H LPITQK NHAY RHPPFNN PHVRT+T ENYAN  VAHQ+FD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTY+QMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           +D+FYQMVELADDIDAVALATAIGACGALK+L HGRNIHHLAR+HGLE N+LVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           Y+DCDSIKDARGFFDQMPSKD+ISWTELIHMYVKKGGINEAFKLFRQMN+DG LKPDP T
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKEIHGYV+KNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSW+IMTLGYSLHGQGKLGV+LFREME+N +M RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
            YFSCIT+PTVAH ALKVALLARAG LDEARTFVEK KL KH E+LRALLDGCRN  QQK
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSW+EFC
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDGSKPNPDFS HDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRV 625
           LAISFGLISTEAGRTIRITKNLR+
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRM 624

BLAST of CaUC10G191670 vs. ExPASy TrEMBL
Match: A0A6J1E0A4 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111025202 PE=3 SV=1)

HSP 1 Score: 1182.5 bits (3058), Expect = 0.0e+00
Identity = 567/665 (85.26%), Positives = 606/665 (91.13%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           M+LLLSTH   LPIT KT+  Y R  PFNNPPHVRT  TENYAN   AH  FDEIP WDT
Sbjct: 1   MDLLLSTHFRRLPITPKTDLTYRRRRPFNNPPHVRTAITENYANLCEAHHPFDEIPTWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD+G VISTY+QML RGVRPD HTLPRII A+RQ GDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDVGLVISTYEQMLLRGVRPDNHTLPRIIGASRQCGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
            FKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFA
Sbjct: 121 VFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTMLAKLYVMEDKPSFA 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELA DIDAVALATAIGACG+LK+LQHGRNIH LAR HGLE +VLVSNSLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGSLKLLQHGRNIHLLARTHGLEFDVLVSNSLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDC SI+DARGFF++MPSKDVISWTELI  YVKKGGINE FKLFRQMN+DGGLKPDP+T
Sbjct: 241 YLDCGSIRDARGFFNRMPSKDVISWTELIQAYVKKGGINEGFKLFRQMNMDGGLKPDPIT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHG+EIHGYVLK+A D NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRMAAHKHGREIHGYVLKSAIDVNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+D +SWT+M LGYSLHGQGKLGV+LFR MERNLRMHRDEITYT+VLHAC+TA++V+EGD
Sbjct: 361 EKDAISWTVMILGYSLHGQGKLGVSLFRLMERNLRMHRDEITYTSVLHACSTASLVEEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYF+CI EPT +HFALKVALLARAG LDEAR FVE++KL KH E+LRALLDGCR    +K
Sbjct: 421 FYFNCIMEPTFSHFALKVALLARAGRLDEARAFVEQHKLDKHPEILRALLDGCRTHRDKK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYACN K DMVEK RE +RDMGLRPKKAYSWMEF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNGKLDMVEKSREIVRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNL+CLMKKME+DG KP PDFSFHDVDEERECV IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLECLMKKMEDDGLKPKPDFSFHDVDEERECVLIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTI ITKNLRVCHSCHESAKFISK+VGREIIV+DPYVFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTICITKNLRVCHSCHESAKFISKIVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CaUC10G191670 vs. ExPASy TrEMBL
Match: A0A6J1I9E1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111470851 PE=3 SV=1)

HSP 1 Score: 1181.0 bits (3054), Expect = 0.0e+00
Identity = 560/665 (84.21%), Positives = 606/665 (91.13%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           M+LLLST IH LP+TQK NH Y RH  FNNPPHVRT T E  A+  VAHQLFD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTYQQML RGVRPD HTLPR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           +D+FYQMVELA DIDAVALATAIGACGA K+LQHGRNIHH+ARIHGLE +VLVSN LLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDC SIKDARG F++MP +D+ISWT+LIH YVK GGINEA KLFRQMN+DG LKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGR+AAHKHG+EIHGYVLKN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSWT+M  GYSLHGQGKLGV LFREM+RN R+HRDEITYTAVL +C+TA+MV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYF+CITEPT+AHF LKVALL RAG  DEARTFV+K+KL K++E+LRALLDGCR  HQ K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYA NE+W+MVEKLR+TIRDMGLRPKKAYSWMEF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N DF FHDVDEEREC PIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CaUC10G191670 vs. ExPASy TrEMBL
Match: A0A6J1EXC6 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111439085 PE=3 SV=1)

HSP 1 Score: 1172.1 bits (3031), Expect = 0.0e+00
Identity = 557/665 (83.76%), Positives = 602/665 (90.53%), Query Frame = 0

Query: 1   MNLLLSTHIHPLPITQKTNHAYGRHPPFNNPPHVRTITTENYANFSVAHQLFDEIPIWDT 60
           M+LLLST IH LP+TQK NH Y RH  FNNPPHVRT T E  A+  VAHQLFD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTYQQML RGVRPD HTLPR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKM 240
           ID+FYQMVELA DIDAVALATAIGACGA K+LQHGRNIHH+ARIHGLE ++LVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240

Query: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDGGLKPDPLT 300
           YLDC SIKDARG F++MP +D+ISWT+LIH YVK GGINEA KLFRQMN+DG LKPDPLT
Sbjct: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGR+ AHKHG+EIHGYVLKN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTTANMVDEGD 420
           E+DMVSWT++  GYSLHGQGKLGV LFREM+RN  +HRDEITYTAVL AC+TA+MV+EGD
Sbjct: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFSCITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLRALLDGCRNRHQQK 480
           FYF+CITEPT+AHF LKVALL RAG  +EARTFV+K+KL K+ E+LRALLDGCR  HQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWMEFC 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYA NE+W+MVEKLR+TIRDMGLRPKKAYSWMEF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N DF FHDVDEEREC  IGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVRDPYVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRI KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CaUC10G191670 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 438.3 bits (1126), Expect = 1.1e-122
Identity = 228/623 (36.60%), Positives = 356/623 (57.14%), Query Frame = 0

Query: 48  AHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A ++FDE+   D  +WN++I  +++NG     +S + QML  G+  D  T+  +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 108 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTML 167
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 168 AKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGL 227
              Y  E     A+ +F +M E     D   +   +  C   ++L  G+ +H   + + L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 228 ESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQ 287
             ++ VSN+L+ MY  C S+++A   F +M  KD+ISW  +I  Y K    NEA  LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 288 MNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSG 347
           +  +    PD  T++ +LPAC  ++A   G+EIHGY+++N +  +  V N+LVDMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 348 CIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVL 407
            +  A   F  +  +D+VSWT+M  GY +HG GK  + LF +M R   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 408 HACTTANMVDEGDFYFS-----CITEPTVAHFALKVALLARAGHLDEARTFVEKNKLGKH 467
           +AC+ + +VDEG  +F+     C  EPTV H+A  V +LAR G L +A  F+E   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 468 AEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRET 527
           A +  ALL GCR  H  KL +++ E++ +LEP N   YVL++N YA  EKW+ V++LR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 528 IRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 587
           I   GLR     SW+E   ++++F  GD S+P ++NI   L+ +  +M E+G  P   ++
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 588 FHDVDE-ERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGR 647
             D +E E+E    GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 648 EIIVRDPYVFHHFKDGCCSCEDF 665
           EI++RD   FH FKDG CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of CaUC10G191670 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 422.5 bits (1085), Expect = 6.1e-118
Identity = 221/633 (34.91%), Positives = 354/633 (55.92%), Query Frame = 0

Query: 39  TENYANFSVAHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTL 98
           + ++ + + A Q+FD++P    F WN +I+ +  N      +  Y  M    V PD  T 
Sbjct: 63  SSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTF 122

Query: 99  PRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSAC 158
           P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +    
Sbjct: 123 PHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPL 182

Query: 159 --RNSVSWTMLAKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGR 218
             R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR
Sbjct: 183 PERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGR 242

Query: 219 NIHHLARIHGLESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKG 278
           +IH      GLE    +  SL  MY  C  +  A+  FD+M S ++I W  +I  Y K G
Sbjct: 243 SIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNG 302

Query: 279 GINEAFKLFRQMNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQ 338
              EA  +F +M ++  ++PD ++I+S + AC ++ + +  + ++ YV ++ + +++ + 
Sbjct: 303 YAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 362

Query: 339 NALVDMYVKSGCIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRM 398
           +AL+DM+ K G ++ A   F    +RD+V W+ M +GY LHG+ +  ++L+R MER   +
Sbjct: 363 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GV 422

Query: 399 HRDEITYTAVLHACTTANMVDEGDFYFSCITE----PTVAHFALKVALLARAGHLDEART 458
           H +++T+  +L AC  + MV EG ++F+ + +    P   H+A  + LL RAGHLD+A  
Sbjct: 423 HPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYE 482

Query: 459 FVEKNKLGKHAEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEK 518
            ++   +     V  ALL  C+     +LG+   +QL  ++P N  +YV LSN YA    
Sbjct: 483 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 542

Query: 519 WDMVEKLRETIRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEE 578
           WD V ++R  +++ GL      SW+E   ++  F  GD SHPR + I   ++ +  +++E
Sbjct: 543 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKE 602

Query: 579 DGSKPNPDFSFHDV-DEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHES 638
            G   N D S HD+ DEE E     HSE +AI++GLIST  G  +RITKNLR C +CH +
Sbjct: 603 GGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAA 662

Query: 639 AKFISKMVGREIIVRDPYVFHHFKDGCCSCEDF 665
            K ISK+V REI+VRD   FHHFKDG CSC D+
Sbjct: 663 TKLISKLVDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of CaUC10G191670 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 392.5 bits (1007), Expect = 6.7e-109
Identity = 228/687 (33.19%), Positives = 351/687 (51.09%), Query Frame = 0

Query: 50  QLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYG 109
           + FD++P  D+ +W  +I  +   G     I     M+  G+ P + TL  ++ +     
Sbjct: 101 EFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATR 160

Query: 110 DLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTMLAK 169
            ++ GK++H+   KLG   N+ V  SL+ +Y        AK++ D+   R+  SW  +  
Sbjct: 161 CMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIA 220

Query: 170 LYLMEDKPSFAIDVFYQMVE-----------------------------LADDI---DAV 229
           L++   +   A+  F QM E                             L D +   D  
Sbjct: 221 LHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRF 280

Query: 230 ALATAIGACGALKILQHGRNIHHLARIHGLESNVLVSNSLLKMYLDCDSIKDARGFFDQ- 289
            LA+ + AC  L+ L  G+ IH      G + + +V N+L+ MY  C  ++ AR   +Q 
Sbjct: 281 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 340

Query: 290 --------------------------------MPSKDVISWTELIHMYVKKGGINEAFKL 349
                                           +  +DV++WT +I  Y + G   EA  L
Sbjct: 341 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 400

Query: 350 FRQMNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYV 409
           FR M + GG +P+  T++++L     +A+  HGK+IHG  +K+    ++ V NAL+ MY 
Sbjct: 401 FRSM-VGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYA 460

Query: 410 KSGCIQSASKTFSMMK-ERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITY 469
           K+G I SAS+ F +++ ERD VSWT M +  + HG  +  + LF  M     +  D ITY
Sbjct: 461 KAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITY 520

Query: 470 TAVLHACTTANMVDEGDFYFSCITE-----PTVAHFALKVALLARAGHLDEARTFVEKNK 529
             V  ACT A +V++G  YF  + +     PT++H+A  V L  RAG L EA+ F+EK  
Sbjct: 521 VGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMP 580

Query: 530 LGKHAEVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEK 589
           +        +LL  CR      LGK   E+L  LEP N+  Y  L+N Y+   KW+   K
Sbjct: 581 IEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAK 640

Query: 590 LRETIRDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPN 649
           +R++++D  ++ ++ +SW+E  +K+HVFG  D +HP    IY  ++ +  ++++ G  P+
Sbjct: 641 IRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPD 700

Query: 650 PDFSFHDVDEE-RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISK 665
                HD++EE +E +   HSE LAI+FGLIST    T+RI KNLRVC+ CH + KFISK
Sbjct: 701 TASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISK 760

BLAST of CaUC10G191670 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 388.7 bits (997), Expect = 9.7e-108
Identity = 229/621 (36.88%), Positives = 335/621 (53.95%), Query Frame = 0

Query: 48  AHQLFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A  LFD +P  D  +WN +I  +  NG     +  +  M    V PD  TL  +I A   
Sbjct: 250 ARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACEL 309

Query: 108 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSACRNSVSWTML 167
            GD ++G+ +HA     GF+ ++ V  SL ++Y    S   A+ L  +   ++ VSWT +
Sbjct: 310 LGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTM 369

Query: 168 AKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACGALKILQHGRNIHHLARIHGL 227
              Y     P  AID +  M + +   D + +A  + AC  L  L  G  +H LA    L
Sbjct: 370 ISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARL 429

Query: 228 ESNVLVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQ 287
            S V+V+N+L+ MY  C  I  A   F  +P K+VISWT +I          EA    RQ
Sbjct: 430 ISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQ 489

Query: 288 MNLDGGLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSG 347
           M +   L+P+ +T+++ L AC R+ A   GKEIH +VL+     +  + NAL+DMYV+ G
Sbjct: 490 MKMT--LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCG 549

Query: 348 CIQSASKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVL 407
            + +A   F+  K +D+ SW I+  GYS  GQG + V LF  M ++ R+  DEIT+ ++L
Sbjct: 550 RMNTAWSQFNSQK-KDVTSWNILLTGYSERGQGSMVVELFDRMVKS-RVRPDEITFISLL 609

Query: 408 HACTTANMVDEGDFYFSCITE----PTVAHFALKVALLARAGHLDEARTFVEKNKLGKHA 467
             C+ + MV +G  YFS + +    P + H+A  V LL RAG L EA  F++K  +    
Sbjct: 610 CGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDP 669

Query: 468 EVLRALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETI 527
            V  ALL+ CR  H+  LG+   + + +L+  +   Y+LL N YA   KW  V K+R  +
Sbjct: 670 AVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMM 729

Query: 528 RDMGLRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG-SKPNPDFS 587
           ++ GL      SW+E   K+H F + D  HP+++ I   L+   +KM E G +K +   S
Sbjct: 730 KENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGLTKISESSS 789

Query: 588 FHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGRE 647
             + +  R+ +  GHSE  AI+FGLI+T  G  I +TKNL +C +CH++ KFISK V RE
Sbjct: 790 MDETEISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKFISKTVRRE 849

Query: 648 IIVRDPYVFHHFKDGCCSCED 664
           I VRD   FHHFKDG CSC D
Sbjct: 850 ISVRDAEHFHHFKDGECSCGD 866

BLAST of CaUC10G191670 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 383.6 bits (984), Expect = 3.1e-106
Identity = 233/679 (34.32%), Positives = 353/679 (51.99%), Query Frame = 0

Query: 36  TITTENYANFSVAHQ---LFDEIPIWDTFAWNNLIQTHLTNGDLGHVISTYQQMLFRGVR 95
           +I    Y N  + H+   LF  +      AW ++I+           ++++ +M   G  
Sbjct: 43  SIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRC 102

Query: 96  PDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGIL---DSADTA 155
           PD +  P ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L    S  + 
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 156 KWLHDKSACR--NSVSWTMLAKLYLMEDKPSFAIDVFYQMVELADDIDAVALATAIGACG 215
             + D+   R  NS    + A+  +M     F ID   ++ E+    D V+  T I    
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIM----PFGIDSVRRVFEVMPRKDVVSYNTIIAGYA 222

Query: 216 -------ALKILQH----------------------------GRNIHHLARIHGLESNVL 275
                  AL++++                             G+ IH      G++S+V 
Sbjct: 223 QSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVY 282

Query: 276 VSNSLLKMYLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNLDG 335
           + +SL+ MY     I+D+   F ++  +D ISW  L+  YV+ G  NEA +LFRQM +  
Sbjct: 283 IGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTA 342

Query: 336 GLKPDPLTISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSA 395
            +KP  +  SS++PAC  +A    GK++HGYVL+  F  N+ + +ALVDMY K G I++A
Sbjct: 343 KVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAA 402

Query: 396 SKTFSMMKERDMVSWTIMTLGYSLHGQGKLGVNLFREMERNLRMHRDEITYTAVLHACTT 455
            K F  M   D VSWT + +G++LHG G   V+LF EM+R   +  +++ + AVL AC+ 
Sbjct: 403 RKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSH 462

Query: 456 ANMVDEGDFYFSCITE-----PTVAHFALKVALLARAGHLDEARTFVEKNKLGKHAEVLR 515
             +VDE   YF+ +T+       + H+A    LL RAG L+EA  F+ K  +     V  
Sbjct: 463 VGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWS 522

Query: 516 ALLDGCRNRHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYACNEKWDMVEKLRETIRDMG 575
            LL  C      +L +++ E++  ++  N   YVL+ N YA N +W  + KLR  +R  G
Sbjct: 523 TLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKG 582

Query: 576 LRPKKAYSWMEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSFHDVD 635
           LR K A SW+E  NK H F +GD SHP    I   L+ +M++ME++G   +     HDVD
Sbjct: 583 LRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVD 642

Query: 636 EE--RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIV 665
           EE  RE +  GHSE LA++FG+I+TE G TIR+TKN+R+C  CH + KFISK+  REIIV
Sbjct: 643 EEHKRELL-FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIV 702
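
Each HSP header above reports identical and positive positions as counts over the alignment length, followed by the percentage in parentheses. As a minimal sketch, the percentages can be recomputed from those counts in Python. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; they are not part of this database or of BLAST.

```python
import re

# Matches the statistics line of an HSP header as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percentages are counts divided by alignment length, to 2 decimals.
        "identity_pct": round(100.0 * identities / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }


if __name__ == "__main__":
    header = "Identity = 233/679 (34.32%), Positives = 353/679 (51.99%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

The recomputed values can be compared against the printed percentages as a sanity check when the alignments are copied into other tools.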

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004137884.2  0.0e+00   92.18         pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus...
XP_038905218.1  0.0e+00   92.64         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 ...
XP_008465161.1  0.0e+00   91.88         PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like ...
XP_038905219.1  0.0e+00   87.69         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X2 ...
XP_022158739.1  0.0e+00   85.26         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Momordica ...

Match Name      E-value   Identity (%)  Description
Q9SN39          1.5e-121  36.60         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9LTV8          8.6e-117  34.91         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9SHZ8          9.5e-108  33.19         Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9M9E2          1.4e-106  36.88         Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Q9LW63          4.4e-105  34.32         Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...

Match Name      E-value   Identity (%)  Description
A0A1S3CPR5      0.0e+00   91.88         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis ...
A0A0A0L9N4      0.0e+00   91.83         DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G7228...
A0A6J1E0A4      0.0e+00   85.26         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordic...
A0A6J1I9E1      0.0e+00   84.21         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...
A0A6J1EXC6      0.0e+00   83.76         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...

Match Name      E-value   Identity (%)  Description
AT4G18750.1     1.1e-122  36.60         Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1     6.1e-118  34.91         mitochondrial editing factor 22
AT2G22070.1     6.7e-109  33.19         pentatricopeptide (PPR) repeat-containing protein
AT1G15510.1     9.7e-108  36.88         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G23330.1     3.1e-106  34.32         Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 318..420; e-value: 1.1E-16; score: 62.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 38..141; e-value: 2.6E-10; score: 41.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 169..315; e-value: 6.8E-24; score: 86.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 365..393; e-value: 0.014; score: 15.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 162..189; e-value: 0.41; score: 11.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 337..363; e-value: 0.011; score: 15.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 260..308; e-value: 1.6E-9; score: 37.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 62..94; e-value: 0.0021; score: 16.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 335..365; e-value: 0.0018; score: 16.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 263..297; e-value: 4.1E-4; score: 18.3
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 261..295; score: 9.733692
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 59..93; score: 9.843305
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 332..366; score: 8.549871
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 534..655; e-value: 7.8E-32; score: 109.8
None	No IPR available	PANTHER	PTHR24015:SF1853	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 1..657
None	No IPR available	PANTHER	PTHR24015	OS07G0578800 PROTEIN-RELATED	coord: 1..657
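
Several of the IPR002885 (pentatricopeptide repeat) hits listed above come from different member databases and cover overlapping stretches of the protein. As a rough illustration only, the Python sketch below merges those overlapping coordinates into distinct regions. The coordinates are copied from the table; the variable and function names are made up for this example, and the merged spans are not a count of individual PPR motifs.

```python
# Coordinates of the IPR002885 hits, copied from the InterPro table.
# The labels and names used here are illustrative, not part of the database.
ppr_hits = [
    ("PFAM PF01535", 365, 393),
    ("PFAM PF01535", 162, 189),
    ("PFAM PF01535", 337, 363),
    ("PFAM PF13041", 260, 308),
    ("TIGRFAM TIGR00756", 62, 94),
    ("TIGRFAM TIGR00756", 335, 365),
    ("TIGRFAM TIGR00756", 263, 297),
    ("PROSITE PS51375", 261, 295),
    ("PROSITE PS51375", 59, 93),
    ("PROSITE PS51375", 332, 366),
]


def merge_intervals(intervals):
    """Merge overlapping closed intervals (start, end) into a sorted list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals([(start, end) for _, start, end in ppr_hits])
for start, end in regions:
    print(f"{start}..{end}")
```

With the coordinates above, the hits collapse into four regions: 59..94, 162..189, 260..308 and 332..393. All of them lie upstream of the DYW domain hit at 534..655.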

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CaUC10G191670.1	CaUC10G191670.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding