CcUC01G003740 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC01G003740
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCicolChr01: 3758121 .. 3760061 (-)
RNA-Seq ExpressionCcUC01G003740
SyntenyCcUC01G003740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACATTGGTAAGGATAACAAAGTTGAAGAGAAGTGAAAAGAGAAAGGCGTTTCAGTTGCATCCTACCCGACGTGCATGCTTCTGGCAGTTGATGTATTTCTTCTAATTCATCAATTTACACGTAGGACACATAACGAAAAGGATGCTCGTGCACAATTTCAGATGTTGTCTATAGGGACATAAAGAAGTGCTTCCAGTTCTTATGCTCGCTTCCGAGCTAAAAGTCCTCTTGAACCGCTCTGTAAATGTTAAGCAAGCTACTCAAATTCATGCCTACATTCTCGTCAATGGGCTTCCAAATCTCGAATCTTGCTTGGTCCGTCAACTCACTCGTTCTGAGTTCACTTGCGCCAGAATCGTATCCCGTTATCTCCAACGAATTCTTCACCATTCGCAAAACCCAGATGCTTTCACATGGGCCTGCACCGTTCGGTTCTTTTCCCAGAATGGCCACTTTATGGAAGCAATCGCCCATTATGTTCAGATGCAAAGATTGGGACTGCATCCGAGCACTTTTGCTGTATCCTCGACTTTGAGAGCTTGTGGTAGGATTATGTGTAAGTTTGGTGGGAGCTATGTTCACACTCAGGTTTATAAGTTGGGGTTTTGTCGCTGTGTTTACGTGCAAACTGCCCTTGTGGATTTTTACTCAAAATTGGGTGATATGGGTTTTGCACAGAAGGTGTTTGATGATATGACTGAGAAAAATGTGGTTTCATGGAACTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGATGCTCAGAAAGTGTTTGATGTAATGCCTGTGAAAGATGTCATATCTTGGAATTCGATGTTGACTGGATTCGCCAATTCTGGAAATATGGATCGAGCGTGGTGTTTATTTCAACAAATGGGGGAGAAAAGTTCGGCTTCTTGGAATGCAATGATCAGTGGCTACGTGAACTGTGGAGACATGAAGTCTGCGAGAAAGTTGTTTGATGTAATGCCTAATAGGAATAATGTTTCACGGATTACATTGATTGCTGGTTATTCGAAGCTTGGGGAGGTTAATTCTGCTTGTGAGCTATTTGATAAGATGGGAGAGAAGGAGCTTCTCTCATTTAATGCCATGATTGCTTGCTATTCACAAAATAGTCTGCCTAACAAAGCATTGGAGTTATTCAACCAAATGCTTCAACCCCATGTGAATATCCAACCCGATGAGATGACTTTTGCTAGCATTATATCTGCTTGTACTCAACTGGGAAATATGAATTATGGTACTTGGATTGAGTCATATATGGAAAAACTTGGAATTGAGTTGGATGATCATTTGGCAACTGCATTAGTAGATCTATATGCAAAATCCGGGAATATCGAAAGGGCGATCGAGCTGTTCAATGGTCTGGAAAAGAGGGATTTAGTCGCTTATTCAGCTATGATCTTTGGATGTGGGATAAATGGTAAGGCGTACGAAGCAATAAGGTTATTCAAAGAGATGCTAAGAGTTAACATCTCCCCTAATCTAGTGACGTATGCTGGTCTTCTTACAGCGTATAACCATGCCGGTTTAGTTGATGAAGGTTACCTTTGCTTCTCATACATGAAGGACCATGGGCTTGAGGCAATGGCCGATCATTATGGAATCATGGTGGATTTGTTGGGCAGGGCAGGGCGGTTAGAAGAAGCATATGAGCTTATACATAGTATGCCAGTGCAGCCAAATGCTGGTGTTTGGGGAGCCTTGCTTCATGCTTGTAAATTACATAACAATGTTGAGCTTGGTGAGATAGCAGCTCGTAATTGCTCGAAGTTAGTGACTGATACAGCTGGATATCGGTCGCTTCTAGCCAACATTTACTCTTCTATGGAGAGGTGGGATGATGCTAAGAGGCTGAGAAAAGCCATGGGAAATAAAGTATTTGCCAAGATATCTGGTTGTAGTTGGATGGAGCAATTAGAAAGCTGA

mRNA sequence

ATGACATTGGTAAGGATAACAAAGTTGAAGAGAAGTGAAAAGAGAAAGGCGTTTCAGTTGCATCCTACCCGACGTGCATGCGGACATAAAGAAGTGCTTCCAGTTCTTATGCTCGCTTCCGAGCTAAAAGTCCTCTTGAACCGCTCTGTAAATGTTAAGCAAGCTACTCAAATTCATGCCTACATTCTCGTCAATGGGCTTCCAAATCTCGAATCTTGCTTGGTCCGTCAACTCACTCGTTCTGAGTTCACTTGCGCCAGAATCGTATCCCGTTATCTCCAACGAATTCTTCACCATTCGCAAAACCCAGATGCTTTCACATGGGCCTGCACCGTTCGGTTCTTTTCCCAGAATGGCCACTTTATGGAAGCAATCGCCCATTATGTTCAGATGCAAAGATTGGGACTGCATCCGAGCACTTTTGCTGTATCCTCGACTTTGAGAGCTTGTGGTAGGATTATGTGTAAGTTTGGTGGGAGCTATGTTCACACTCAGGTTTATAAGTTGGGGTTTTGTCGCTGTGTTTACGTGCAAACTGCCCTTGTGGATTTTTACTCAAAATTGGGTGATATGGGTTTTGCACAGAAGGTGTTTGATGATATGACTGAGAAAAATGTGGTTTCATGGAACTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGATGCTCAGAAAGTGTTTGATGTAATGCCTGTGAAAGATGTCATATCTTGGAATTCGATGTTGACTGGATTCGCCAATTCTGGAAATATGGATCGAGCGTGGTGTTTATTTCAACAAATGGGGGAGAAAAGTTCGGCTTCTTGGAATGCAATGATCAGTGGCTACGTGAACTGTGGAGACATGAAGTCTGCGAGAAAGTTGTTTGATGTAATGCCTAATAGGAATAATGTTTCACGGATTACATTGATTGCTGGTTATTCGAAGCTTGGGGAGGTTAATTCTGCTTGTGAGCTATTTGATAAGATGGGAGAGAAGGAGCTTCTCTCATTTAATGCCATGATTGCTTGCTATTCACAAAATAGTCTGCCTAACAAAGCATTGGAGTTATTCAACCAAATGCTTCAACCCCATGTGAATATCCAACCCGATGAGATGACTTTTGCTAGCATTATATCTGCTTGTACTCAACTGGGAAATATGAATTATGGTACTTGGATTGAGTCATATATGGAAAAACTTGGAATTGAGTTGGATGATCATTTGGCAACTGCATTAGTAGATCTATATGCAAAATCCGGGAATATCGAAAGGGCGATCGAGCTGTTCAATGGTCTGGAAAAGAGGGATTTAGTCGCTTATTCAGCTATGATCTTTGGATGTGGGATAAATGGTAAGGCGTACGAAGCAATAAGGTTATTCAAAGAGATGCTAAGAGTTAACATCTCCCCTAATCTAGTGACGTATGCTGGTCTTCTTACAGCGTATAACCATGCCGGTTTAGTTGATGAAGGTTACCTTTGCTTCTCATACATGAAGGACCATGGGCTTGAGGCAATGGCCGATCATTATGGAATCATGGTGGATTTGTTGGGCAGGGCAGGGCGGTTAGAAGAAGCATATGAGCTTATACATAGTATGCCAGTGCAGCCAAATGCTGGTGTTTGGGGAGCCTTGCTTCATGCTTGTAAATTACATAACAATGTTGAGCTTGGTGAGATAGCAGCTCGTAATTGCTCGAAGTTAGTGACTGATACAGCTGGATATCGGTCGCTTCTAGCCAACATTTACTCTTCTATGGAGAGGTGGGATGATGCTAAGAGGCTGAGAAAAGCCATGGGAAATAAAGTATTTGCCAAGATATCTGGTTGTAGTTGGATGGAGCAATTAGAAAGCTGA

Coding sequence (CDS)

ATGACATTGGTAAGGATAACAAAGTTGAAGAGAAGTGAAAAGAGAAAGGCGTTTCAGTTGCATCCTACCCGACGTGCATGCGGACATAAAGAAGTGCTTCCAGTTCTTATGCTCGCTTCCGAGCTAAAAGTCCTCTTGAACCGCTCTGTAAATGTTAAGCAAGCTACTCAAATTCATGCCTACATTCTCGTCAATGGGCTTCCAAATCTCGAATCTTGCTTGGTCCGTCAACTCACTCGTTCTGAGTTCACTTGCGCCAGAATCGTATCCCGTTATCTCCAACGAATTCTTCACCATTCGCAAAACCCAGATGCTTTCACATGGGCCTGCACCGTTCGGTTCTTTTCCCAGAATGGCCACTTTATGGAAGCAATCGCCCATTATGTTCAGATGCAAAGATTGGGACTGCATCCGAGCACTTTTGCTGTATCCTCGACTTTGAGAGCTTGTGGTAGGATTATGTGTAAGTTTGGTGGGAGCTATGTTCACACTCAGGTTTATAAGTTGGGGTTTTGTCGCTGTGTTTACGTGCAAACTGCCCTTGTGGATTTTTACTCAAAATTGGGTGATATGGGTTTTGCACAGAAGGTGTTTGATGATATGACTGAGAAAAATGTGGTTTCATGGAACTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGATGCTCAGAAAGTGTTTGATGTAATGCCTGTGAAAGATGTCATATCTTGGAATTCGATGTTGACTGGATTCGCCAATTCTGGAAATATGGATCGAGCGTGGTGTTTATTTCAACAAATGGGGGAGAAAAGTTCGGCTTCTTGGAATGCAATGATCAGTGGCTACGTGAACTGTGGAGACATGAAGTCTGCGAGAAAGTTGTTTGATGTAATGCCTAATAGGAATAATGTTTCACGGATTACATTGATTGCTGGTTATTCGAAGCTTGGGGAGGTTAATTCTGCTTGTGAGCTATTTGATAAGATGGGAGAGAAGGAGCTTCTCTCATTTAATGCCATGATTGCTTGCTATTCACAAAATAGTCTGCCTAACAAAGCATTGGAGTTATTCAACCAAATGCTTCAACCCCATGTGAATATCCAACCCGATGAGATGACTTTTGCTAGCATTATATCTGCTTGTACTCAACTGGGAAATATGAATTATGGTACTTGGATTGAGTCATATATGGAAAAACTTGGAATTGAGTTGGATGATCATTTGGCAACTGCATTAGTAGATCTATATGCAAAATCCGGGAATATCGAAAGGGCGATCGAGCTGTTCAATGGTCTGGAAAAGAGGGATTTAGTCGCTTATTCAGCTATGATCTTTGGATGTGGGATAAATGGTAAGGCGTACGAAGCAATAAGGTTATTCAAAGAGATGCTAAGAGTTAACATCTCCCCTAATCTAGTGACGTATGCTGGTCTTCTTACAGCGTATAACCATGCCGGTTTAGTTGATGAAGGTTACCTTTGCTTCTCATACATGAAGGACCATGGGCTTGAGGCAATGGCCGATCATTATGGAATCATGGTGGATTTGTTGGGCAGGGCAGGGCGGTTAGAAGAAGCATATGAGCTTATACATAGTATGCCAGTGCAGCCAAATGCTGGTGTTTGGGGAGCCTTGCTTCATGCTTGTAAATTACATAACAATGTTGAGCTTGGTGAGATAGCAGCTCGTAATTGCTCGAAGTTAGTGACTGATACAGCTGGATATCGGTCGCTTCTAGCCAACATTTACTCTTCTATGGAGAGGTGGGATGATGCTAAGAGGCTGAGAAAAGCCATGGGAAATAAAGTATTTGCCAAGATATCTGGTTGTAGTTGGATGGAGCAATTAGAAAGCTGA

Protein sequence

MTLVRITKLKRSEKRKAFQLHPTRRACGHKEVLPVLMLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRILHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES
Homology
BLAST of CcUC01G003740 vs. NCBI nr
Match: XP_038906935.1 (pentatricopeptide repeat-containing protein At4g22760 [Benincasa hispida])

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 536/578 (92.73%), Postives = 550/578 (95.16%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           MLASELKV LN S+N KQATQIHA+ILVNGLPNLESCLVRQ+TRSEFTCARIVS YLQ+I
Sbjct: 1   MLASELKVFLNSSINFKQATQIHAHILVNGLPNLESCLVRQITRSEFTCARIVSLYLQQI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHSQNPDAFTWAC VRFFSQNG FMEAI+HYVQMQRLGLHP TFAVSSTLRACGRIMCK
Sbjct: 61  LHHSQNPDAFTWACAVRFFSQNGQFMEAISHYVQMQRLGLHPGTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGGSYVH QVYK GFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGSYVHAQVYKFGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQKVFD MP+KDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS
Sbjct: 181 VKIGNLVDAQKVFDEMPLKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNCGD+KSAR LFDVMPNRNNVS ITLIAGYSKLGEVNSA ELFDKMGEKELLSFNAM
Sbjct: 241 GYVNCGDIKSARNLFDVMPNRNNVSWITLIAGYSKLGEVNSAYELFDKMGEKELLSFNAM 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACYSQNSLPNKALELFNQMLQP VNIQPDEMTFASIISACTQLGN+N GTWIESYMEKL
Sbjct: 301 IACYSQNSLPNKALELFNQMLQPDVNIQPDEMTFASIISACTQLGNLNCGTWIESYMEKL 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GIELDDHLATALVDLYAKSGNIERA ELFNGL+KRDL+AYSAMIFGCGINGKAY+AIRLF
Sbjct: 361 GIELDDHLATALVDLYAKSGNIERAFELFNGLKKRDLIAYSAMIFGCGINGKAYKAIRLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
           KEML VNI PNLVTYAGLLTAYNH GLVDEGYLCFS MKDHGLE +ADHYGIMVDLLGRA
Sbjct: 421 KEMLSVNICPNLVTYAGLLTAYNHGGLVDEGYLCFSSMKDHGLEPLADHYGIMVDLLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEAYE+I SMPVQPNAGVWGALLHACKLHNNVELGEIAA+N  K VTDT GYRSLLA
Sbjct: 481 GRLEEAYEIIQSMPVQPNAGVWGALLHACKLHNNVELGEIAAQNSLKSVTDTTGYRSLLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           NIYSSMERWDDAKRLRKAMGNKVF KISGCSWMEQ ES
Sbjct: 541 NIYSSMERWDDAKRLRKAMGNKVFVKISGCSWMEQSES 578

BLAST of CcUC01G003740 vs. NCBI nr
Match: XP_008437157.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g22760 [Cucumis melo])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 530/580 (91.38%), Postives = 551/580 (95.00%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA+ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHSQNPDAFTW+C VRFFS+NG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSQNPDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGG  +H QVYKLGFCRCVYVQT+LVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGRCIHAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQKVFD MPVKDVISWNSMLTGF+NSGNMDRA CLFQQM EKSSASWNAMIS
Sbjct: 181 VKIGNLVDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMG--EKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSACELFDKMG  EKEL SFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQN LPNKALELFNQMLQPH+NIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI+RA ELFN L+KRDLVAYSAMIFGCGINGKA+EAI 
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNIDRAFELFNDLKKRDLVAYSAMIFGCGINGKAHEAIG 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGY CFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYFCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRLEEAYELI SMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSK VTDT GYRSL
Sbjct: 481 RAGRLEEAYELIRSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKSVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAKR+RKAM NKVFAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKRMRKAMSNKVFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. NCBI nr
Match: XP_004147606.1 (pentatricopeptide repeat-containing protein At4g22760 [Cucumis sativus] >XP_031741502.1 pentatricopeptide repeat-containing protein At4g22760 [Cucumis sativus] >XP_031741503.1 pentatricopeptide repeat-containing protein At4g22760 [Cucumis sativus] >KGN50191.1 hypothetical protein Csa_000399 [Cucumis sativus])

HSP 1 Score: 1082.8 bits (2799), Expect = 0.0e+00
Identity = 530/580 (91.38%), Postives = 550/580 (94.83%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAQILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHS+NPDAFTWAC VRFFSQNG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSRNPDAFTWACAVRFFSQNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           F G  +H QVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFD+MTEKNVVSWNSILSGY
Sbjct: 121 FRGWCIHAQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDEMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQK+FD MPVKD ISWNSMLTGF+NSGNMDRA CLFQQMGEKSSASWNAMI 
Sbjct: 181 VKIGNLVDAQKLFDEMPVKDAISWNSMLTGFSNSGNMDRACCLFQQMGEKSSASWNAMIG 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKM--GEKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSA ELFDKM   EKELLSFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSAYELFDKMEESEKELLSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQNS+PNKALELFN MLQPHVNIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNSMPNKALELFNLMLQPHVNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI RA ELFNGL+KRDLVAYSAMIFGCGIN KA+EAIR
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNINRAFELFNGLKKRDLVAYSAMIFGCGINSKAHEAIR 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNI PNLVTYAGLLTAYNHAGLVDEGYLCFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNICPNLVTYAGLLTAYNHAGLVDEGYLCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDT GYRSL
Sbjct: 481 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAKR+RKAMGNK+FAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKRMRKAMGNKIFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. NCBI nr
Match: KAA0042843.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 528/580 (91.03%), Postives = 550/580 (94.83%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA+ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHSQNPDAFTW+C VRFFS+NG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSQNPDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGG  +H QVYKLGFCRCVYVQT+LVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGRCIHAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQKVFD MPVKDVISWNSMLTGF+NSGNMDRA CLFQQM EKSSASWNAMIS
Sbjct: 181 VKIGNLVDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMG--EKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSACELFDKMG  EKEL SFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQN LPNKALELFNQMLQPH+NIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI+RA ELFN L+KRDLVAYSAMIFGCGINGKA+EAI 
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNIDRAFELFNDLKKRDLVAYSAMIFGCGINGKAHEAIG 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGY CFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYFCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRL+EAYELI SMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSK VTDT GYRSL
Sbjct: 481 RAGRLKEAYELIRSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKSVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAK +RKAM NKVFAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKSMRKAMSNKVFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. NCBI nr
Match: KAG6579446.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016919.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1061.6 bits (2744), Expect = 2.6e-306
Identity = 515/577 (89.25%), Postives = 545/577 (94.45%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           MLASELK+ LN+ VNVKQATQIHA+ILVNGL NLESCLVRQ+TRSEFTCARIVSRYLQRI
Sbjct: 1   MLASELKIFLNKPVNVKQATQIHAHILVNGLRNLESCLVRQITRSEFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           L HS+NPD F+W C VRFFSQNG FMEAI+HYVQMQRLGLHPSTFAVSSTLRACGRI+CK
Sbjct: 61  LRHSKNPDYFSWGCAVRFFSQNGQFMEAISHYVQMQRLGLHPSTFAVSSTLRACGRIICK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           F GS VH QVYKLGFCRCVYVQTALVDFYSKLGDMGFA+KVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FSGSSVHAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNL DAQKVFD MPVKDVISWNSMLTGFANSGNMDRA CLFQQ+GE+SSASWNAMIS
Sbjct: 181 VKIGNLDDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNCGDMKSAR +FD MPNRNNVS ITLIAGYSKLGEV SACELF+ MGEKE+LS+NAM
Sbjct: 241 GYVNCGDMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAM 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACYSQN +PN+AL+LFNQMLQPHVNIQPDEMTFASIISACTQLGN+NYG WIESYMEKL
Sbjct: 301 IACYSQNGMPNEALKLFNQMLQPHVNIQPDEMTFASIISACTQLGNLNYGAWIESYMEKL 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GIELDDHLATALVD YAKSGNIERA ELFN L+K+DLV+YSAMIFGCGINGKA+EAIRLF
Sbjct: 361 GIELDDHLATALVDFYAKSGNIERAFELFNDLKKKDLVSYSAMIFGCGINGKAFEAIRLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
           +EML VNI PNLVTYAGLLT+YNHAGLVDEGY+CF  MKDHGLE +ADHYGIMVDLLGRA
Sbjct: 421 EEMLSVNICPNLVTYAGLLTSYNHAGLVDEGYVCFLSMKDHGLEPLADHYGIMVDLLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEA+ELI SMPVQPNAGVWGALLHAC+LHNNVELGEIAARNCSKLVTDT GYRSLLA
Sbjct: 481 GRLEEAHELIQSMPVQPNAGVWGALLHACRLHNNVELGEIAARNCSKLVTDTTGYRSLLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLE 614
           NIYSS+ERWDDAKRLRKAMGNKVFAKISGCSWMEQ E
Sbjct: 541 NIYSSVERWDDAKRLRKAMGNKVFAKISGCSWMEQSE 577

BLAST of CcUC01G003740 vs. ExPASy Swiss-Prot
Match: P0C8Q5 (Pentatricopeptide repeat-containing protein At4g22760 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E6 PE=2 SV=1)

HSP 1 Score: 631.7 bits (1628), Expect = 8.6e-180
Identity = 311/574 (54.18%), Postives = 404/574 (70.38%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+L+  L R V ++QA Q+HA ++VN   +LE  LV Q        +R +  Y++RI
Sbjct: 1   MLDSKLRFFLQRCVVLEQAKQVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           L      D+F+W C VRF SQ+  F E +  Y+ M   G+ PS+ AV+S LRACG++   
Sbjct: 61  LKGFNGHDSFSWGCLVRFLSQHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENM 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
             G  +H Q  K G C CVYVQT LV  YS+LG +  A+K FDD+ EKN VSWNS+L GY
Sbjct: 121 VDGKPIHAQALKNGLCGCVYVQTGLVGLYSRLGYIELAKKAFDDIAEKNTVSWNSLLHGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           ++ G L +A++VFD +P KD +SWN +++ +A  G+M  A  LF  M  KS ASWN +I 
Sbjct: 181 LESGELDEARRVFDKIPEKDAVSWNLIISSYAKKGDMGNACSLFSAMPLKSPASWNILIG 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNC +MK AR  FD MP +N VS IT+I+GY+KLG+V SA ELF  M +K+ L ++AM
Sbjct: 241 GYVNCREMKLARTYFDAMPQKNGVSWITMISGYTKLGDVQSAEELFRLMSKKDKLVYDAM 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACY+QN  P  AL+LF QML+ +  IQPDE+T +S++SA +QLGN ++GTW+ESY+ + 
Sbjct: 301 IACYTQNGKPKDALKLFAQMLERNSYIQPDEITLSSVVSANSQLGNTSFGTWVESYITEH 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GI++DD L+T+L+DLY K G+  +A ++F+ L K+D V+YSAMI GCGING A EA  LF
Sbjct: 361 GIKIDDLLSTSLIDLYMKGGDFAKAFKMFSNLNKKDTVSYSAMIMGCGINGMATEANSLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
             M+   I PN+VT+ GLL+AY+H+GLV EGY CF+ MKDH LE  ADHYGIMVD+LGRA
Sbjct: 421 TAMIEKKIPPNVVTFTGLLSAYSHSGLVQEGYKCFNSMKDHNLEPSADHYGIMVDMLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEAYELI SMP+QPNAGVWGALL A  LHNNVE GEIA  +C KL TD  GY S LA
Sbjct: 481 GRLEEAYELIKSMPMQPNAGVWGALLLASGLHNNVEFGEIACSHCVKLETDPTGYLSHLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
            IYSS+ RWDDA+ +R ++  K   K  GCSW+E
Sbjct: 541 MIYSSVGRWDDARTVRDSIKEKKLCKTLGCSWVE 574

BLAST of CcUC01G003740 vs. ExPASy Swiss-Prot
Match: O22137 (Pentatricopeptide repeat-containing protein At2g45350, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR4 PE=1 SV=2)

HSP 1 Score: 363.6 bits (932), Expect = 4.4e-99
Identity = 191/511 (37.38%), Postives = 303/511 (59.30%), Query Frame = 0

Query: 104 DAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGSYVH 163
           D F W   ++  S      +A+     M   G+    F++S  L+AC R+    GG  +H
Sbjct: 85  DPFLWNAVIKSHSHGKDPRQALLLLCLMLENGVSVDKFSLSLVLKACSRLGFVKGGMQIH 144

Query: 164 TQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNLV 223
             + K G    +++Q  L+  Y K G +G ++++FD M +++ VS+NS++ GYVK G +V
Sbjct: 145 GFLKKTGLWSDLFLQNCLIGLYLKCGCLGLSRQMFDRMPKRDSVSYNSMIDGYVKCGLIV 204

Query: 224 DAQKVFDVMP--VKDVISWNSMLTGFA-NSGNMDRAWCLFQQMGEKSSASWNAMISGYVN 283
            A+++FD+MP  +K++ISWNSM++G+A  S  +D A  LF  M EK   SWN+MI GYV 
Sbjct: 205 SARELFDLMPMEMKNLISWNSMISGYAQTSDGVDIASKLFADMPEKDLISWNSMIDGYVK 264

Query: 284 CGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACY 343
            G ++ A+ LFDVMP R+ V+  T+I GY+KLG V+ A  LFD+M  ++++++N+M+A Y
Sbjct: 265 HGRIEDAKGLFDVMPRRDVVTWATMIDGYAKLGFVHHAKTLFDQMPHRDVVAYNSMMAGY 324

Query: 344 SQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIEL 403
            QN    +ALE+F+ M +   ++ PD+ T   ++ A  QLG ++    +  Y+ +    L
Sbjct: 325 VQNKYHMEALEIFSDM-EKESHLLPDDTTLVIVLPAIAQLGRLSKAIDMHLYIVEKQFYL 384

Query: 404 DDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEML 463
              L  AL+D+Y+K G+I+ A+ +F G+E + +  ++AMI G  I+G    A  +  ++ 
Sbjct: 385 GGKLGVALIDMYSKCGSIQHAMLVFEGIENKSIDHWNAMIGGLAIHGLGESAFDMLLQIE 444

Query: 464 RVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYM-KDHGLEAMADHYGIMVDLLGRAGRL 523
           R+++ P+ +T+ G+L A +H+GLV EG LCF  M + H +E    HYG MVD+L R+G +
Sbjct: 445 RLSLKPDDITFVGVLNACSHSGLVKEGLLCFELMRRKHKIEPRLQHYGCMVDILSRSGSI 504

Query: 524 EEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIY 583
           E A  LI  MPV+PN  +W   L AC  H   E GE+ A++             LL+N+Y
Sbjct: 505 ELAKNLIEEMPVEPNDVIWRTFLTACSHHKEFETGELVAKHLILQAGYNPSSYVLLSNMY 564

Query: 584 SSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
           +S   W D +R+R  M  +   KI GCSW+E
Sbjct: 565 ASFGMWKDVRRVRTMMKERKIEKIPGCSWIE 594

BLAST of CcUC01G003740 vs. ExPASy Swiss-Prot
Match: Q9SJZ3 (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 3.7e-98
Identity = 219/595 (36.81%), Postives = 327/595 (54.96%), Query Frame = 0

Query: 34  PVLMLASELKVLLNRSVNVKQATQIHAYILVNGL---PNLESCLVRQLTRSEFTCARIVS 93
           P+L L  + K+LL+         QI A +++NGL   P   S L+         CA   S
Sbjct: 55  PLLSLLEKCKLLLH-------LKQIQAQMIINGLILDPFASSRLIA-------FCALSES 114

Query: 94  RYLQ---RILHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGL---HPSTFAVS 153
           RYL    +IL   +NP+ F+W  T+R FS++ +  E+   Y QM R G     P  F   
Sbjct: 115 RYLDYSVKILKGIENPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYP 174

Query: 154 STLRACGRIMCKFGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEK 213
              + C  +     G  +   V KL      +V  A +  ++  GDM  A+KVFD+   +
Sbjct: 175 VLFKVCADLRLSSLGHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVR 234

Query: 214 NVVSWNSILSGYVKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANS----GNMDRAWCLF 273
           ++VSWN +++GY KIG    A  V+ +M  + V   +  + G  +S    G+++R    +
Sbjct: 235 DLVSWNCLINGYKKIGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFY 294

Query: 274 QQMGEKSSASW----NAMISGYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVN 333
           + + E          NA++  +  CGD+  AR++FD +  R  VS  T+I+GY++ G ++
Sbjct: 295 EYVKENGLRMTIPLVNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLD 354

Query: 334 SACELFDKMGEKELLSFNAMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISA 393
            + +LFD M EK+++ +NAMI    Q      AL LF +M     N +PDE+T    +SA
Sbjct: 355 VSRKLFDDMEEKDVVLWNAMIGGSVQAKRGQDALALFQEMQTS--NTKPDEITMIHCLSA 414

Query: 394 CTQLGNMNYGTWIESYMEKLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAY 453
           C+QLG ++ G WI  Y+EK  + L+  L T+LVD+YAK GNI  A+ +F+G++ R+ + Y
Sbjct: 415 CSQLGALDVGIWIHRYIEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTY 474

Query: 454 SAMIFGCGINGKAYEAIRLFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKD 513
           +A+I G  ++G A  AI  F EM+   I+P+ +T+ GLL+A  H G++  G   FS MK 
Sbjct: 475 TAIIGGLALHGDASTAISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKS 534

Query: 514 H-GLEAMADHYGIMVDLLGRAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGE 573
              L     HY IMVDLLGRAG LEEA  L+ SMP++ +A VWGALL  C++H NVELGE
Sbjct: 535 RFNLNPQLKHYSIMVDLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGE 594

Query: 574 IAARNCSKLVTDTAGYRSLLANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
            AA+   +L    +G   LL  +Y     W+DAKR R+ M  +   KI GCS +E
Sbjct: 595 KAAKKLLELDPSDSGIYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIE 633

BLAST of CcUC01G003740 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 354.8 bits (909), Expect = 2.0e-96
Identity = 198/568 (34.86%), Postives = 317/568 (55.81%), Query Frame = 0

Query: 46  LNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRILHHSQNPDA 105
           L +  N+ Q  Q+HA I+   L + +  +  +L  +   C +  +    R+ +  Q P+ 
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNL-HEDLHIAPKLISALSLCRQ--TNLAVRVFNQVQEPNV 85

Query: 106 FTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGSYVHTQ 165
                 +R  +QN    +A   + +MQR GL    F     L+AC           +H  
Sbjct: 86  HLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNH 145

Query: 166 VYKLGFCRCVYVQTALVDFYSKLGDMGF--AQKVFDDMTEKNVVSWNSILSGYVKIGNLV 225
           + KLG    +YV  AL+D YS+ G +G   A K+F+ M+E++ VSWNS+L G VK G L 
Sbjct: 146 IEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELR 205

Query: 226 DAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCGD 285
           DA+++FD MP +D+ISWN+ML G+A    M +A+ LF++M E+++ SW+ M+ GY   GD
Sbjct: 206 DARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGD 265

Query: 286 MKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACYSQN 345
           M+ AR +FD MP                             +  K ++++  +IA Y++ 
Sbjct: 266 MEMARVMFDKMP-----------------------------LPAKNVVTWTIIIAGYAEK 325

Query: 346 SLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIELDDH 405
            L  +A  L +QM+     ++ D     SI++ACT+ G ++ G  I S +++  +  + +
Sbjct: 326 GLLKEADRLVDQMVAS--GLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAY 385

Query: 406 LATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEMLRVN 465
           +  AL+D+YAK GN+++A ++FN + K+DLV+++ M+ G G++G   EAI LF  M R  
Sbjct: 386 VLNALLDMYAKCGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREG 445

Query: 466 ISPNLVTYAGLLTAYNHAGLVDEGY-LCFSYMKDHGLEAMADHYGIMVDLLGRAGRLEEA 525
           I P+ VT+  +L + NHAGL+DEG    +S  K + L    +HYG +VDLLGR GRL+EA
Sbjct: 446 IRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEA 505

Query: 526 YELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIYSSM 585
            +++ +MP++PN  +WGALL AC++HN V++ +    N  KL     G  SLL+NIY++ 
Sbjct: 506 IKVVQTMPMEPNVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAA 559

Query: 586 ERWDDAKRLRKAMGNKVFAKISGCSWME 611
           E W+    +R  M +    K SG S +E
Sbjct: 566 EDWEGVADIRSKMKSMGVEKPSGASSVE 559

BLAST of CcUC01G003740 vs. ExPASy Swiss-Prot
Match: Q9FHR3 (Putative pentatricopeptide repeat-containing protein At5g37570 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E37 PE=3 SV=1)

HSP 1 Score: 347.8 bits (891), Expect = 2.5e-94
Identity = 193/558 (34.59%), Postives = 312/558 (55.91%), Query Frame = 0

Query: 57  QIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRILHHSQNPDAFTWACTVRFFS 116
           QIHA I+  GL   ++ +   ++ S  + + +   Y   +     +P  + W   ++ +S
Sbjct: 28  QIHARIIRKGLEQDQNLISIFISSSSSSSSSL--SYSSSVFERVPSPGTYLWNHLIKGYS 87

Query: 117 QNGHFMEAIAHYVQMQRLGL-HPSTFAVSSTLRACGRIMCKFGGSYVHTQVYKLGFCRCV 176
               F E ++  ++M R GL  P  +     ++ C        GS VH  V ++GF + V
Sbjct: 88  NKFLFFETVSILMRMMRTGLARPDEYTFPLVMKVCSNNGQVRVGSSVHGLVLRIGFDKDV 147

Query: 177 YVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNLVDAQKVFDVMPVK 236
            V T+ VDFY K  D+  A+KVF +M E+N VSW +++  YVK G L +A+ +FD+MP  
Sbjct: 148 VVGTSFVDFYGKCKDLFSARKVFGEMPERNAVSWTALVVAYVKSGELEEAKSMFDLMP-- 207

Query: 237 DVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCGDMKSARKLFDVMP 296
                                        E++  SWNA++ G V  GD+ +A+KLFD MP
Sbjct: 208 -----------------------------ERNLGSWNALVDGLVKSGDLVNAKKLFDEMP 267

Query: 297 NRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACYSQNSLPNKALELFNQ 356
            R+ +S  ++I GY+K G++ SA +LF++    ++ +++A+I  Y+QN  PN+A ++F++
Sbjct: 268 KRDIISYTSMIDGYAKGGDMVSARDLFEEARGVDVRAWSALILGYAQNGQPNEAFKVFSE 327

Query: 357 MLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIELDDH-LATALVDLYAK 416
           M     N++PDE     ++SAC+Q+G       ++SY+ +   +   H +  AL+D+ AK
Sbjct: 328 MCAK--NVKPDEFIMVGLMSACSQMGCFELCEKVDSYLHQRMNKFSSHYVVPALIDMNAK 387

Query: 417 SGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEMLRVNISPNLVTYAGL 476
            G+++RA +LF  + +RDLV+Y +M+ G  I+G   EAIRLF++M+   I P+ V +  +
Sbjct: 388 CGHMDRAAKLFEEMPQRDLVSYCSMMEGMAIHGCGSEAIRLFEKMVDEGIVPDEVAFTVI 447

Query: 477 LTAYNHAGLVDEGYLCFSYM-KDHGLEAMADHYGIMVDLLGRAGRLEEAYELIHSMPVQP 536
           L     + LV+EG   F  M K + + A  DHY  +V+LL R G+L+EAYELI SMP + 
Sbjct: 448 LKVCGQSRLVEEGLRYFELMRKKYSILASPDHYSCIVNLLSRTGKLKEAYELIKSMPFEA 507

Query: 537 NAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIYSSMERWDDAKRLRK 596
           +A  WG+LL  C LH N E+ E+ AR+  +L   +AG   LL+NIY++++RW D   LR 
Sbjct: 508 HASAWGSLLGGCSLHGNTEIAEVVARHLFELEPQSAGSYVLLSNIYAALDRWTDVAHLRD 550

Query: 597 AMGNKVFAKISGCSWMEQ 612
            M      KI G SW+ +
Sbjct: 568 KMNENGITKICGRSWISR 550

BLAST of CcUC01G003740 vs. ExPASy TrEMBL
Match: A0A1S3ATB6 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucumis melo OX=3656 GN=LOC103482665 PE=4 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 530/580 (91.38%), Postives = 551/580 (95.00%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA+ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHSQNPDAFTW+C VRFFS+NG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSQNPDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGG  +H QVYKLGFCRCVYVQT+LVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGRCIHAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQKVFD MPVKDVISWNSMLTGF+NSGNMDRA CLFQQM EKSSASWNAMIS
Sbjct: 181 VKIGNLVDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMG--EKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSACELFDKMG  EKEL SFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQN LPNKALELFNQMLQPH+NIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI+RA ELFN L+KRDLVAYSAMIFGCGINGKA+EAI 
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNIDRAFELFNDLKKRDLVAYSAMIFGCGINGKAHEAIG 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGY CFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYFCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRLEEAYELI SMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSK VTDT GYRSL
Sbjct: 481 RAGRLEEAYELIRSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKSVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAKR+RKAM NKVFAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKRMRKAMSNKVFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. ExPASy TrEMBL
Match: A0A0A0KR38 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G157980 PE=4 SV=1)

HSP 1 Score: 1082.8 bits (2799), Expect = 0.0e+00
Identity = 530/580 (91.38%), Postives = 550/580 (94.83%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAQILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHS+NPDAFTWAC VRFFSQNG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSRNPDAFTWACAVRFFSQNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           F G  +H QVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFD+MTEKNVVSWNSILSGY
Sbjct: 121 FRGWCIHAQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDEMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQK+FD MPVKD ISWNSMLTGF+NSGNMDRA CLFQQMGEKSSASWNAMI 
Sbjct: 181 VKIGNLVDAQKLFDEMPVKDAISWNSMLTGFSNSGNMDRACCLFQQMGEKSSASWNAMIG 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKM--GEKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSA ELFDKM   EKELLSFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSAYELFDKMEESEKELLSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQNS+PNKALELFN MLQPHVNIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNSMPNKALELFNLMLQPHVNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI RA ELFNGL+KRDLVAYSAMIFGCGIN KA+EAIR
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNINRAFELFNGLKKRDLVAYSAMIFGCGINSKAHEAIR 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNI PNLVTYAGLLTAYNHAGLVDEGYLCFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNICPNLVTYAGLLTAYNHAGLVDEGYLCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDT GYRSL
Sbjct: 481 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAKR+RKAMGNK+FAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKRMRKAMGNKIFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. ExPASy TrEMBL
Match: A0A5A7THR1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G003540 PE=4 SV=1)

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 528/580 (91.03%), Postives = 550/580 (94.83%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+LK  LN SV+VKQATQIHA+ILVNGLPNLESCLVRQ+TRS+FTCARIVSRYLQRI
Sbjct: 1   MLGSDLKFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           LHHSQNPDAFTW+C VRFFS+NG FMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK
Sbjct: 61  LHHSQNPDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGG  +H QVYKLGFCRCVYVQT+LVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGRCIHAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNLVDAQKVFD MPVKDVISWNSMLTGF+NSGNMDRA CLFQQM EKSSASWNAMIS
Sbjct: 181 VKIGNLVDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMG--EKELLSFN 336
           GYVNCGDMK+AR LFDVMPNRNNV+RITLIAGYSKLGEVNSACELFDKMG  EKEL SFN
Sbjct: 241 GYVNCGDMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFN 300

Query: 337 AMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYME 396
           AMIACYSQN LPNKALELFNQMLQPH+NIQPDEMTFAS+ISACTQLGN++YGTWIESYME
Sbjct: 301 AMIACYSQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYME 360

Query: 397 KLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIR 456
           KLGIELDDHLATALVDLYAKSGNI+RA ELFN L+KRDLVAYSAMIFGCGINGKA+EAI 
Sbjct: 361 KLGIELDDHLATALVDLYAKSGNIDRAFELFNDLKKRDLVAYSAMIFGCGINGKAHEAIG 420

Query: 457 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLG 516
           LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGY CFS MKDHGL  +ADHYGIMVDLLG
Sbjct: 421 LFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYFCFSSMKDHGLAPLADHYGIMVDLLG 480

Query: 517 RAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSL 576
           RAGRL+EAYELI SMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSK VTDT GYRSL
Sbjct: 481 RAGRLKEAYELIRSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKSVTDTTGYRSL 540

Query: 577 LANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLES 615
           LANIYSSMERWDDAK +RKAM NKVFAKISGCSWMEQ ES
Sbjct: 541 LANIYSSMERWDDAKSMRKAMSNKVFAKISGCSWMEQSES 580

BLAST of CcUC01G003740 vs. ExPASy TrEMBL
Match: A0A6J1E2U4 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita moschata OX=3662 GN=LOC111430313 PE=4 SV=1)

HSP 1 Score: 1057.7 bits (2734), Expect = 1.8e-305
Identity = 512/577 (88.73%), Postives = 544/577 (94.28%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           MLASELK+ LN+ VNVKQA QIHA+ILVNGL NLESCLVRQ+TRSEFTCARIVSRYLQRI
Sbjct: 1   MLASELKIFLNKPVNVKQAAQIHAHILVNGLRNLESCLVRQITRSEFTCARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           L HSQNPD+F+W C VRFFS+NG FMEAI+HYVQMQRLGLHPSTFAVSSTLRACGRI+CK
Sbjct: 61  LRHSQNPDSFSWGCAVRFFSRNGQFMEAISHYVQMQRLGLHPSTFAVSSTLRACGRIICK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           F GS VH QVYKLGFCRCVYVQTALVDFYSKLGDMGFA+KVFDD+TEKNVVSWNSILSGY
Sbjct: 121 FSGSSVHAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDITEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIGNL DAQKVFD MPVKDVISWNSMLTGFANSGNMDRA CLFQQ+GE+SSASWNAMIS
Sbjct: 181 VKIGNLDDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNCGDMKSAR +FD MPNRNNVS ITLIAGYSKLGEV SACELF+ MGEKE+LS+NAM
Sbjct: 241 GYVNCGDMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAM 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACYSQN +PN+AL+LFNQMLQPHVNIQPDEMTFASIISACTQLGN+NYG WIESYMEKL
Sbjct: 301 IACYSQNGMPNEALKLFNQMLQPHVNIQPDEMTFASIISACTQLGNLNYGAWIESYMEKL 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GIELDDHLATALVD YAKSGNIERA ELFN L+K+D V+YSAMIFGCGINGKA+EAIRLF
Sbjct: 361 GIELDDHLATALVDFYAKSGNIERAFELFNDLKKKDFVSYSAMIFGCGINGKAFEAIRLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
           +EML VNI PNLVTYAGLLT+YNHAGLVDEGY+CF  MKDHGLE +ADHYGIMVDLLGRA
Sbjct: 421 EEMLSVNICPNLVTYAGLLTSYNHAGLVDEGYVCFLSMKDHGLEPLADHYGIMVDLLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEA+ELI SMPVQPNAGVWGALLHAC+LHNNVELGEIAARNCSKLVTDT GYRSLLA
Sbjct: 481 GRLEEAHELIQSMPVQPNAGVWGALLHACRLHNNVELGEIAARNCSKLVTDTTGYRSLLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLE 614
           NIYSS+ERWDDAKRLRKAMGNKVFAKISGCSWMEQ E
Sbjct: 541 NIYSSVERWDDAKRLRKAMGNKVFAKISGCSWMEQSE 577

BLAST of CcUC01G003740 vs. ExPASy TrEMBL
Match: A0A6J1HXF1 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita maxima OX=3661 GN=LOC111468920 PE=4 SV=1)

HSP 1 Score: 1049.7 bits (2713), Expect = 4.9e-303
Identity = 510/577 (88.39%), Postives = 543/577 (94.11%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           MLASELK+ LN+ VNVKQATQIHA ILVNGL NLESCLVRQ+TRSEF+ ARIVSRYLQRI
Sbjct: 1   MLASELKIFLNKPVNVKQATQIHAQILVNGLRNLESCLVRQITRSEFSRARIVSRYLQRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           L HSQNPD+F+W C VRFFSQNG FME I+HYVQMQRLGLHPSTFAVSSTLRACGRI+CK
Sbjct: 61  LRHSQNPDSFSWGCAVRFFSQNGQFMETISHYVQMQRLGLHPSTFAVSSTLRACGRIICK 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
           FGGS VH QVYKLGFCRCVYVQTALVDFYSKLGDMGFA+KVFDDMTEKNVVSWNSILSGY
Sbjct: 121 FGGSSVHAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDMTEKNVVSWNSILSGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           VKIG L DAQKVFD MPVKDVISWNSMLTGFANSGNMDRA CLFQQ+GE+SSASWNAMIS
Sbjct: 181 VKIGILDDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMIS 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNCGDMKSAR +FD MPNRNNVS ITLIAGYSKLGEV SACELF+ MGEKE+LS+NA+
Sbjct: 241 GYVNCGDMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAL 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACYSQN +PN+AL+LFNQMLQPHV+IQPDEMTFASIISACTQLGN+NYG WIESYMEKL
Sbjct: 301 IACYSQNGMPNEALKLFNQMLQPHVDIQPDEMTFASIISACTQLGNLNYGAWIESYMEKL 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GIELDDHLATALVD YAKSGNI+RA ELFN L+K+DLV+YSAMIFGCGINGKA+EAIRLF
Sbjct: 361 GIELDDHLATALVDFYAKSGNIKRAFELFNDLKKKDLVSYSAMIFGCGINGKAFEAIRLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
           +EML VNI PNLVTYAGLLT+YNHAGLVDEGY+CF  MKDHGLE +ADHYGIMVDLLGRA
Sbjct: 421 EEMLSVNICPNLVTYAGLLTSYNHAGLVDEGYVCFLSMKDHGLEPLADHYGIMVDLLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEA+ELI SMPVQPNAGVWGALLHAC+LHNNVELGEIAARNCSKLVTDT GYRSLLA
Sbjct: 481 GRLEEAHELIQSMPVQPNAGVWGALLHACRLHNNVELGEIAARNCSKLVTDTTGYRSLLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWMEQLE 614
           NIYSS+ERWDDAKRLRKAMGNKVFAKISGCSWMEQ E
Sbjct: 541 NIYSSVERWDDAKRLRKAMGNKVFAKISGCSWMEQSE 577

BLAST of CcUC01G003740 vs. TAIR 10
Match: AT4G22760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 631.7 bits (1628), Expect = 6.1e-181
Identity = 311/574 (54.18%), Postives = 404/574 (70.38%), Query Frame = 0

Query: 37  MLASELKVLLNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRI 96
           ML S+L+  L R V ++QA Q+HA ++VN   +LE  LV Q        +R +  Y++RI
Sbjct: 1   MLDSKLRFFLQRCVVLEQAKQVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRI 60

Query: 97  LHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCK 156
           L      D+F+W C VRF SQ+  F E +  Y+ M   G+ PS+ AV+S LRACG++   
Sbjct: 61  LKGFNGHDSFSWGCLVRFLSQHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENM 120

Query: 157 FGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGY 216
             G  +H Q  K G C CVYVQT LV  YS+LG +  A+K FDD+ EKN VSWNS+L GY
Sbjct: 121 VDGKPIHAQALKNGLCGCVYVQTGLVGLYSRLGYIELAKKAFDDIAEKNTVSWNSLLHGY 180

Query: 217 VKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMIS 276
           ++ G L +A++VFD +P KD +SWN +++ +A  G+M  A  LF  M  KS ASWN +I 
Sbjct: 181 LESGELDEARRVFDKIPEKDAVSWNLIISSYAKKGDMGNACSLFSAMPLKSPASWNILIG 240

Query: 277 GYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAM 336
           GYVNC +MK AR  FD MP +N VS IT+I+GY+KLG+V SA ELF  M +K+ L ++AM
Sbjct: 241 GYVNCREMKLARTYFDAMPQKNGVSWITMISGYTKLGDVQSAEELFRLMSKKDKLVYDAM 300

Query: 337 IACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKL 396
           IACY+QN  P  AL+LF QML+ +  IQPDE+T +S++SA +QLGN ++GTW+ESY+ + 
Sbjct: 301 IACYTQNGKPKDALKLFAQMLERNSYIQPDEITLSSVVSANSQLGNTSFGTWVESYITEH 360

Query: 397 GIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLF 456
           GI++DD L+T+L+DLY K G+  +A ++F+ L K+D V+YSAMI GCGING A EA  LF
Sbjct: 361 GIKIDDLLSTSLIDLYMKGGDFAKAFKMFSNLNKKDTVSYSAMIMGCGINGMATEANSLF 420

Query: 457 KEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKDHGLEAMADHYGIMVDLLGRA 516
             M+   I PN+VT+ GLL+AY+H+GLV EGY CF+ MKDH LE  ADHYGIMVD+LGRA
Sbjct: 421 TAMIEKKIPPNVVTFTGLLSAYSHSGLVQEGYKCFNSMKDHNLEPSADHYGIMVDMLGRA 480

Query: 517 GRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLA 576
           GRLEEAYELI SMP+QPNAGVWGALL A  LHNNVE GEIA  +C KL TD  GY S LA
Sbjct: 481 GRLEEAYELIKSMPMQPNAGVWGALLLASGLHNNVEFGEIACSHCVKLETDPTGYLSHLA 540

Query: 577 NIYSSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
            IYSS+ RWDDA+ +R ++  K   K  GCSW+E
Sbjct: 541 MIYSSVGRWDDARTVRDSIKEKKLCKTLGCSWVE 574

BLAST of CcUC01G003740 vs. TAIR 10
Match: AT2G45350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 363.6 bits (932), Expect = 3.1e-100
Identity = 191/511 (37.38%), Postives = 303/511 (59.30%), Query Frame = 0

Query: 104 DAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGSYVH 163
           D F W   ++  S      +A+     M   G+    F++S  L+AC R+    GG  +H
Sbjct: 85  DPFLWNAVIKSHSHGKDPRQALLLLCLMLENGVSVDKFSLSLVLKACSRLGFVKGGMQIH 144

Query: 164 TQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNLV 223
             + K G    +++Q  L+  Y K G +G ++++FD M +++ VS+NS++ GYVK G +V
Sbjct: 145 GFLKKTGLWSDLFLQNCLIGLYLKCGCLGLSRQMFDRMPKRDSVSYNSMIDGYVKCGLIV 204

Query: 224 DAQKVFDVMP--VKDVISWNSMLTGFA-NSGNMDRAWCLFQQMGEKSSASWNAMISGYVN 283
            A+++FD+MP  +K++ISWNSM++G+A  S  +D A  LF  M EK   SWN+MI GYV 
Sbjct: 205 SARELFDLMPMEMKNLISWNSMISGYAQTSDGVDIASKLFADMPEKDLISWNSMIDGYVK 264

Query: 284 CGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACY 343
            G ++ A+ LFDVMP R+ V+  T+I GY+KLG V+ A  LFD+M  ++++++N+M+A Y
Sbjct: 265 HGRIEDAKGLFDVMPRRDVVTWATMIDGYAKLGFVHHAKTLFDQMPHRDVVAYNSMMAGY 324

Query: 344 SQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIEL 403
            QN    +ALE+F+ M +   ++ PD+ T   ++ A  QLG ++    +  Y+ +    L
Sbjct: 325 VQNKYHMEALEIFSDM-EKESHLLPDDTTLVIVLPAIAQLGRLSKAIDMHLYIVEKQFYL 384

Query: 404 DDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEML 463
              L  AL+D+Y+K G+I+ A+ +F G+E + +  ++AMI G  I+G    A  +  ++ 
Sbjct: 385 GGKLGVALIDMYSKCGSIQHAMLVFEGIENKSIDHWNAMIGGLAIHGLGESAFDMLLQIE 444

Query: 464 RVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYM-KDHGLEAMADHYGIMVDLLGRAGRL 523
           R+++ P+ +T+ G+L A +H+GLV EG LCF  M + H +E    HYG MVD+L R+G +
Sbjct: 445 RLSLKPDDITFVGVLNACSHSGLVKEGLLCFELMRRKHKIEPRLQHYGCMVDILSRSGSI 504

Query: 524 EEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIY 583
           E A  LI  MPV+PN  +W   L AC  H   E GE+ A++             LL+N+Y
Sbjct: 505 ELAKNLIEEMPVEPNDVIWRTFLTACSHHKEFETGELVAKHLILQAGYNPSSYVLLSNMY 564

Query: 584 SSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
           +S   W D +R+R  M  +   KI GCSW+E
Sbjct: 565 ASFGMWKDVRRVRTMMKERKIEKIPGCSWIE 594

BLAST of CcUC01G003740 vs. TAIR 10
Match: AT2G22410.1 (SLOW GROWTH 1 )

HSP 1 Score: 360.5 bits (924), Expect = 2.6e-99
Identity = 219/595 (36.81%), Postives = 327/595 (54.96%), Query Frame = 0

Query: 34  PVLMLASELKVLLNRSVNVKQATQIHAYILVNGL---PNLESCLVRQLTRSEFTCARIVS 93
           P+L L  + K+LL+         QI A +++NGL   P   S L+         CA   S
Sbjct: 55  PLLSLLEKCKLLLH-------LKQIQAQMIINGLILDPFASSRLIA-------FCALSES 114

Query: 94  RYLQ---RILHHSQNPDAFTWACTVRFFSQNGHFMEAIAHYVQMQRLGL---HPSTFAVS 153
           RYL    +IL   +NP+ F+W  T+R FS++ +  E+   Y QM R G     P  F   
Sbjct: 115 RYLDYSVKILKGIENPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYP 174

Query: 154 STLRACGRIMCKFGGSYVHTQVYKLGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEK 213
              + C  +     G  +   V KL      +V  A +  ++  GDM  A+KVFD+   +
Sbjct: 175 VLFKVCADLRLSSLGHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVR 234

Query: 214 NVVSWNSILSGYVKIGNLVDAQKVFDVMPVKDVISWNSMLTGFANS----GNMDRAWCLF 273
           ++VSWN +++GY KIG    A  V+ +M  + V   +  + G  +S    G+++R    +
Sbjct: 235 DLVSWNCLINGYKKIGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFY 294

Query: 274 QQMGEKSSASW----NAMISGYVNCGDMKSARKLFDVMPNRNNVSRITLIAGYSKLGEVN 333
           + + E          NA++  +  CGD+  AR++FD +  R  VS  T+I+GY++ G ++
Sbjct: 295 EYVKENGLRMTIPLVNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLD 354

Query: 334 SACELFDKMGEKELLSFNAMIACYSQNSLPNKALELFNQMLQPHVNIQPDEMTFASIISA 393
            + +LFD M EK+++ +NAMI    Q      AL LF +M     N +PDE+T    +SA
Sbjct: 355 VSRKLFDDMEEKDVVLWNAMIGGSVQAKRGQDALALFQEMQTS--NTKPDEITMIHCLSA 414

Query: 394 CTQLGNMNYGTWIESYMEKLGIELDDHLATALVDLYAKSGNIERAIELFNGLEKRDLVAY 453
           C+QLG ++ G WI  Y+EK  + L+  L T+LVD+YAK GNI  A+ +F+G++ R+ + Y
Sbjct: 415 CSQLGALDVGIWIHRYIEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTY 474

Query: 454 SAMIFGCGINGKAYEAIRLFKEMLRVNISPNLVTYAGLLTAYNHAGLVDEGYLCFSYMKD 513
           +A+I G  ++G A  AI  F EM+   I+P+ +T+ GLL+A  H G++  G   FS MK 
Sbjct: 475 TAIIGGLALHGDASTAISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKS 534

Query: 514 H-GLEAMADHYGIMVDLLGRAGRLEEAYELIHSMPVQPNAGVWGALLHACKLHNNVELGE 573
              L     HY IMVDLLGRAG LEEA  L+ SMP++ +A VWGALL  C++H NVELGE
Sbjct: 535 RFNLNPQLKHYSIMVDLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGE 594

Query: 574 IAARNCSKLVTDTAGYRSLLANIYSSMERWDDAKRLRKAMGNKVFAKISGCSWME 611
            AA+   +L    +G   LL  +Y     W+DAKR R+ M  +   KI GCS +E
Sbjct: 595 KAAKKLLELDPSDSGIYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIE 633

BLAST of CcUC01G003740 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 354.8 bits (909), Expect = 1.4e-97
Identity = 198/568 (34.86%), Postives = 317/568 (55.81%), Query Frame = 0

Query: 46  LNRSVNVKQATQIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRILHHSQNPDA 105
           L +  N+ Q  Q+HA I+   L + +  +  +L  +   C +  +    R+ +  Q P+ 
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNL-HEDLHIAPKLISALSLCRQ--TNLAVRVFNQVQEPNV 85

Query: 106 FTWACTVRFFSQNGHFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGSYVHTQ 165
                 +R  +QN    +A   + +MQR GL    F     L+AC           +H  
Sbjct: 86  HLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNH 145

Query: 166 VYKLGFCRCVYVQTALVDFYSKLGDMGF--AQKVFDDMTEKNVVSWNSILSGYVKIGNLV 225
           + KLG    +YV  AL+D YS+ G +G   A K+F+ M+E++ VSWNS+L G VK G L 
Sbjct: 146 IEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELR 205

Query: 226 DAQKVFDVMPVKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCGD 285
           DA+++FD MP +D+ISWN+ML G+A    M +A+ LF++M E+++ SW+ M+ GY   GD
Sbjct: 206 DARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGD 265

Query: 286 MKSARKLFDVMPNRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACYSQN 345
           M+ AR +FD MP                             +  K ++++  +IA Y++ 
Sbjct: 266 MEMARVMFDKMP-----------------------------LPAKNVVTWTIIIAGYAEK 325

Query: 346 SLPNKALELFNQMLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIELDDH 405
            L  +A  L +QM+     ++ D     SI++ACT+ G ++ G  I S +++  +  + +
Sbjct: 326 GLLKEADRLVDQMVAS--GLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAY 385

Query: 406 LATALVDLYAKSGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEMLRVN 465
           +  AL+D+YAK GN+++A ++FN + K+DLV+++ M+ G G++G   EAI LF  M R  
Sbjct: 386 VLNALLDMYAKCGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREG 445

Query: 466 ISPNLVTYAGLLTAYNHAGLVDEGY-LCFSYMKDHGLEAMADHYGIMVDLLGRAGRLEEA 525
           I P+ VT+  +L + NHAGL+DEG    +S  K + L    +HYG +VDLLGR GRL+EA
Sbjct: 446 IRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEA 505

Query: 526 YELIHSMPVQPNAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIYSSM 585
            +++ +MP++PN  +WGALL AC++HN V++ +    N  KL     G  SLL+NIY++ 
Sbjct: 506 IKVVQTMPMEPNVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAA 559

Query: 586 ERWDDAKRLRKAMGNKVFAKISGCSWME 611
           E W+    +R  M +    K SG S +E
Sbjct: 566 EDWEGVADIRSKMKSMGVEKPSGASSVE 559

BLAST of CcUC01G003740 vs. TAIR 10
Match: AT5G37570.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 347.8 bits (891), Expect = 1.8e-95
Identity = 193/558 (34.59%), Postives = 312/558 (55.91%), Query Frame = 0

Query: 57  QIHAYILVNGLPNLESCLVRQLTRSEFTCARIVSRYLQRILHHSQNPDAFTWACTVRFFS 116
           QIHA I+  GL   ++ +   ++ S  + + +   Y   +     +P  + W   ++ +S
Sbjct: 28  QIHARIIRKGLEQDQNLISIFISSSSSSSSSL--SYSSSVFERVPSPGTYLWNHLIKGYS 87

Query: 117 QNGHFMEAIAHYVQMQRLGL-HPSTFAVSSTLRACGRIMCKFGGSYVHTQVYKLGFCRCV 176
               F E ++  ++M R GL  P  +     ++ C        GS VH  V ++GF + V
Sbjct: 88  NKFLFFETVSILMRMMRTGLARPDEYTFPLVMKVCSNNGQVRVGSSVHGLVLRIGFDKDV 147

Query: 177 YVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNLVDAQKVFDVMPVK 236
            V T+ VDFY K  D+  A+KVF +M E+N VSW +++  YVK G L +A+ +FD+MP  
Sbjct: 148 VVGTSFVDFYGKCKDLFSARKVFGEMPERNAVSWTALVVAYVKSGELEEAKSMFDLMP-- 207

Query: 237 DVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCGDMKSARKLFDVMP 296
                                        E++  SWNA++ G V  GD+ +A+KLFD MP
Sbjct: 208 -----------------------------ERNLGSWNALVDGLVKSGDLVNAKKLFDEMP 267

Query: 297 NRNNVSRITLIAGYSKLGEVNSACELFDKMGEKELLSFNAMIACYSQNSLPNKALELFNQ 356
            R+ +S  ++I GY+K G++ SA +LF++    ++ +++A+I  Y+QN  PN+A ++F++
Sbjct: 268 KRDIISYTSMIDGYAKGGDMVSARDLFEEARGVDVRAWSALILGYAQNGQPNEAFKVFSE 327

Query: 357 MLQPHVNIQPDEMTFASIISACTQLGNMNYGTWIESYMEKLGIELDDH-LATALVDLYAK 416
           M     N++PDE     ++SAC+Q+G       ++SY+ +   +   H +  AL+D+ AK
Sbjct: 328 MCAK--NVKPDEFIMVGLMSACSQMGCFELCEKVDSYLHQRMNKFSSHYVVPALIDMNAK 387

Query: 417 SGNIERAIELFNGLEKRDLVAYSAMIFGCGINGKAYEAIRLFKEMLRVNISPNLVTYAGL 476
            G+++RA +LF  + +RDLV+Y +M+ G  I+G   EAIRLF++M+   I P+ V +  +
Sbjct: 388 CGHMDRAAKLFEEMPQRDLVSYCSMMEGMAIHGCGSEAIRLFEKMVDEGIVPDEVAFTVI 447

Query: 477 LTAYNHAGLVDEGYLCFSYM-KDHGLEAMADHYGIMVDLLGRAGRLEEAYELIHSMPVQP 536
           L     + LV+EG   F  M K + + A  DHY  +V+LL R G+L+EAYELI SMP + 
Sbjct: 448 LKVCGQSRLVEEGLRYFELMRKKYSILASPDHYSCIVNLLSRTGKLKEAYELIKSMPFEA 507

Query: 537 NAGVWGALLHACKLHNNVELGEIAARNCSKLVTDTAGYRSLLANIYSSMERWDDAKRLRK 596
           +A  WG+LL  C LH N E+ E+ AR+  +L   +AG   LL+NIY++++RW D   LR 
Sbjct: 508 HASAWGSLLGGCSLHGNTEIAEVVARHLFELEPQSAGSYVLLSNIYAALDRWTDVAHLRD 550

Query: 597 AMGNKVFAKISGCSWMEQ 612
            M      KI G SW+ +
Sbjct: 568 KMNENGITKICGRSWISR 550

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038906935.10.0e+0092.73pentatricopeptide repeat-containing protein At4g22760 [Benincasa hispida][more]
XP_008437157.10.0e+0091.38PREDICTED: pentatricopeptide repeat-containing protein At4g22760 [Cucumis melo][more]
XP_004147606.10.0e+0091.38pentatricopeptide repeat-containing protein At4g22760 [Cucumis sativus] >XP_0317... [more]
KAA0042843.10.0e+0091.03pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
KAG6579446.12.6e-30689.25Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
P0C8Q58.6e-18054.18Pentatricopeptide repeat-containing protein At4g22760 OS=Arabidopsis thaliana OX... [more]
O221374.4e-9937.38Pentatricopeptide repeat-containing protein At2g45350, chloroplastic OS=Arabidop... [more]
Q9SJZ33.7e-9836.81Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop... [more]
Q9LS722.0e-9634.86Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
Q9FHR32.5e-9434.59Putative pentatricopeptide repeat-containing protein At5g37570 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A1S3ATB60.0e+0091.38pentatricopeptide repeat-containing protein At4g22760 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0KR380.0e+0091.38Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G157980 PE=4 SV=1[more]
A0A5A7THR10.0e+0091.03Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1E2U41.8e-30588.73pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita moschata OX=3... [more]
A0A6J1HXF14.9e-30388.39pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT4G22760.16.1e-18154.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G45350.13.1e-10037.38Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G22410.12.6e-9936.81SLOW GROWTH 1 [more]
AT3G29230.11.4e-9734.86Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G37570.11.8e-9534.59Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 47..156
e-value: 1.2E-6
score: 30.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 485..612
e-value: 1.2E-11
score: 46.7
coord: 236..298
e-value: 1.0E-14
score: 56.7
coord: 299..391
e-value: 4.0E-18
score: 67.9
coord: 157..235
e-value: 8.7E-12
score: 47.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 392..484
e-value: 6.2E-21
score: 76.6
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 303..590
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 105..301
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 332..377
e-value: 3.5E-8
score: 33.5
coord: 432..478
e-value: 7.7E-9
score: 35.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 238..266
e-value: 1.4E-5
score: 25.0
coord: 179..206
e-value: 0.017
score: 15.3
coord: 505..529
e-value: 0.0022
score: 18.1
coord: 304..328
e-value: 3.8E-4
score: 20.5
coord: 207..235
e-value: 3.4E-5
score: 23.8
coord: 270..297
e-value: 7.8E-7
score: 28.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 406..434
e-value: 0.0029
score: 15.7
coord: 506..529
e-value: 9.7E-4
score: 17.2
coord: 207..234
e-value: 0.0021
score: 16.1
coord: 304..327
e-value: 9.6E-4
score: 17.2
coord: 270..297
e-value: 1.4E-5
score: 22.9
coord: 332..357
e-value: 1.6E-5
score: 22.7
coord: 434..467
e-value: 1.2E-5
score: 23.1
coord: 238..265
e-value: 8.0E-6
score: 23.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 432..466
score: 11.246351
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 467..501
score: 9.470621
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 104..138
score: 9.54735
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 205..235
score: 9.481582
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 329..363
score: 9.04313
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 10.89559
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 265..610
NoneNo IPR availablePANTHERPTHR24015:SF1771OS08G0162200 PROTEINcoord: 202..265
coord: 265..610
coord: 91..201
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 202..265
coord: 91..201

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC01G003740.1CcUC01G003740.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding