CaUC06G117920 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC06G117920
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Ciama_Chr06: 22490499 .. 22492685 (+)
RNA-Seq Expression: CaUC06G117920
Synteny: CaUC06G117920
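
The Location field above packs a sequence ID, a start, an end, and a strand into one string. The following is a minimal sketch of how such a string could be split and the span length computed. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr06: 22490499 .. 22492685 (+)")
print(seqid, strand, span_length(start, end))  # Ciama_Chr06 + 2187
```

Under that assumption the feature spans 2187 bp on the plus strand of Ciama_Chr06.
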
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCTTTCTGCCCCTCCTCTGCTTCTTTCACCCTCTTCCGATCCGCCTTACAGACTCCTTCAAGACCACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGTATTCAAACTCTCAAACAAATCCACGCTCAGATCATCAAGACCGGCCTTCACAACACACAGTTCGCCCTCAGCAAGCTTATCGAGTTTTCCGCTGTTTCTCGCTTTGGTAGTATCTCTTACGCCATTTCCCTATTTAATTCCATCGAAGAGCCCAATTTATTCATTTGGAATTCCATGATTCGAGGCCTTTCGATGAGTCTGTCGCCGGTTCTGGCCTTGGTTTTCTTTGTCAGAATGATTTATTCTGGGGTAGAGCCGAATTCTTATACGTTTCCTTTTCTTTTGAAGTCTTGCGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATCTTTTGAAGCTTGGATTTGTGTCTGATGTGTTCATTCATACTTCGCTTATTAATATGTACGCGCAGAGTGGGGAAATGAATAATGCCCAATTGGTTTTTGATCAAAGTAAATTCAGGGATGCAATTTCTTTCACTGCATTAATTGCTGGTTATGCTTTGTGGGGTTACATGGACCGCTCTCGGAAACTGTTTGACGAAATGCCTGTGAGAGATGTGGTGTCTTGGAATGCTATGATTGCTGGGTATGCACAGACTGGTCGTTCCAAAGAGGCCTTGTTATTGTTTGAAGAAATGAGGAAAGCAAATGTCCCCCCAAATGAGAGTACTATTGTGTCTGTTCTTTCTGCTTGTGCTCAGTCAAACGCTCTAGATTTAGGAAACTCAATGCGCTCTTGGATTGAAGAACGCGGGCTTCGTTCAAATCTTAAGCTTGTTAATGCCCTTATTGACATGTACTCAAAGTGCGGTGATCTTAGGACTGCTCGTGAATTGTTTGATGATATGCCTGAAAGAGATGTGATCTCATGGAATGTTATGATCGGAGGTTACACTCACATGTGCAGCTATAAAGAAGCTTTGGCACTCTTCCGCGAGATGCTAGCCTCAGGTGTTGAGCCTACTGACATAACTTTCCTTAACATTCTTCCATCTTGTGCTCATCTAGGTGCTATTGACCTTGGTAAGTGGATACATGCTTATATAAACAAAAACTTCAACTCAGCCAGTGCCTCTCTTTGGACGAGTCTGATAGATTTGTACGCTAAATGCGGTGATATAGAGGCAGCACGACAGGTCTTTCATGGTATGAATATCAAAACCTTGGCTTCTTGGAATGCTATGATATGTGGGTTAGCAATGCATGGACAAGCAGATGAGGCTTTTGAGCTTTTCTCAAAAATGTCTAGTGATGGAATTGAACCAAATGAGATAACATTTGTCGGTATTCTATCAGCTTGTAAACACGCTGGTTTGGTTGATCTCGGACGCCAATTTTTCAGCTCTATGGTTCAAGACTATAAGATATCTCCTAAATCCCAACATTATGGATGCATGATAGATCTTCTTGGGCGAGCTGGGTTGTTTGAGGAAGCAGAGTCCTTGATACAGAATATGGAAATGAAACCAGATGGAGCCATATGGGGTTCCCTTCTTGGGGCATGTAGAGACCATGGACGAGTCGAGTTGGGAGAACTAGTTGCAGAACGTCTTTTCAAACTTGAGCCTGATAATCCTGGAGCTTATGTGCTTTTATCTAACATATATGCAGGAGCTGGTAAATGGGATGATGTAGCAAGAATAAGAACAAGATTGAACGACAGGGGAATGAAGAAAGTTCCTGGTTGTACCACCATTGAAGTCGATAACGTCGTCCATGAGTTCCTAGTAGGAGACAAAGTTCATCCACAAAGTGAAGATATTTACAAGATGCTGGAAGAAGTAGACAGACAATTAAAGGTGTTTGGATTTGTGGCAGATACATCTGAGGTATTGTATGACATGGATGAAGAATGGAAAGAAGGAGCTTTAAGCCACCACAGTGAGAAATTAGCAATTGC
TTTTGGATTGATAAGTACAAAACCAGGAACACCAATTAGAATCATTAAAAATCTTCGTGTCTGTCGTAATTGTCATTCTGCTACAAAGCTAATATCTAAGATATTCAATAGAGAGATTATTGCTAGAGATAGAAACCGTTTCCATCACTTCAAAGATGGTTCTTGCTCATGTAATGACTATTGGTGA

mRNA sequence

ATGGCGCTTTCTGCCCCTCCTCTGCTTCTTTCACCCTCTTCCGATCCGCCTTACAGACTCCTTCAAGACCACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGTATTCAAACTCTCAAACAAATCCACGCTCAGATCATCAAGACCGGCCTTCACAACACACAGTTCGCCCTCAGCAAGCTTATCGAGTTTTCCGCTGTTTCTCGCTTTGGTAGTATCTCTTACGCCATTTCCCTATTTAATTCCATCGAAGAGCCCAATTTATTCATTTGGAATTCCATGATTCGAGGCCTTTCGATGAGTCTGTCGCCGGTTCTGGCCTTGGTTTTCTTTGTCAGAATGATTTATTCTGGGGTAGAGCCGAATTCTTATACGTTTCCTTTTCTTTTGAAGTCTTGCGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATCTTTTGAAGCTTGGATTTGTGTCTGATGTGTTCATTCATACTTCGCTTATTAATATGTACGCGCAGAGTGGGGAAATGAATAATGCCCAATTGGTTTTTGATCAAAGTAAATTCAGGGATGCAATTTCTTTCACTGCATTAATTGCTGGTTATGCTTTGTGGGGTTACATGGACCGCTCTCGGAAACTGTTTGACGAAATGCCTGTGAGAGATGTGGTGTCTTGGAATGCTATGATTGCTGGGTATGCACAGACTGGTCGTTCCAAAGAGGCCTTGTTATTGTTTGAAGAAATGAGGAAAGCAAATGTCCCCCCAAATGAGAGTACTATTGTGTCTGTTCTTTCTGCTTGTGCTCAGTCAAACGCTCTAGATTTAGGAAACTCAATGCGCTCTTGGATTGAAGAACGCGGGCTTCGTTCAAATCTTAAGCTTGTTAATGCCCTTATTGACATGTACTCAAAGTGCGGTGATCTTAGGACTGCTCGTGAATTGTTTGATGATATGCCTGAAAGAGATGTGATCTCATGGAATGTTATGATCGGAGGTTACACTCACATGTGCAGCTATAAAGAAGCTTTGGCACTCTTCCGCGAGATGCTAGCCTCAGGTGTTGAGCCTACTGACATAACTTTCCTTAACATTCTTCCATCTTGTGCTCATCTAGGTGCTATTGACCTTGGTAAGTGGATACATGCTTATATAAACAAAAACTTCAACTCAGCCAGTGCCTCTCTTTGGACGAGTCTGATAGATTTGTACGCTAAATGCGGTGATATAGAGGCAGCACGACAGGTCTTTCATGGTATGAATATCAAAACCTTGGCTTCTTGGAATGCTATGATATGTGGGTTAGCAATGCATGGACAAGCAGATGAGGCTTTTGAGCTTTTCTCAAAAATGTCTAGTGATGGAATTGAACCAAATGAGATAACATTTGTCGGTATTCTATCAGCTTGTAAACACGCTGGTTTGGTTGATCTCGGACGCCAATTTTTCAGCTCTATGGTTCAAGACTATAAGATATCTCCTAAATCCCAACATTATGGATGCATGATAGATCTTCTTGGGCGAGCTGGGTTGTTTGAGGAAGCAGAGTCCTTGATACAGAATATGGAAATGAAACCAGATGGAGCCATATGGGGTTCCCTTCTTGGGGCATGTAGAGACCATGGACGAGTCGAGTTGGGAGAACTAGTTGCAGAACGTCTTTTCAAACTTGAGCCTGATAATCCTGGAGCTTATGTGCTTTTATCTAACATATATGCAGGAGCTGGTAAATGGGATGATGTAGCAAGAATAAGAACAAGATTGAACGACAGGGGAATGAAGAAAGTTCCTGGTTGTACCACCATTGAAGTCGATAACGTCGTCCATGAGTTCCTAGTAGGAGACAAAGTTCATCCACAAAGTGAAGATATTTACAAGATGCTGGAAGAAGTAGACAGACAATTAAAGGTGTTTGGATTTGTGGCAGATACATCTGAGGTATTGTATGACATGGATGAAGAATGGAAAGAAGGAGCTTTAAGCCACCACAGTGAGAAATTAGCAATTGC
TTTTGGATTGATAAGTACAAAACCAGGAACACCAATTAGAATCATTAAAAATCTTCGTGTCTGTCGTAATTGTCATTCTGCTACAAAGCTAATATCTAAGATATTCAATAGAGAGATTATTGCTAGAGATAGAAACCGTTTCCATCACTTCAAAGATGGTTCTTGCTCATGTAATGACTATTGGTGA

Coding sequence (CDS)

ATGGCGCTTTCTGCCCCTCCTCTGCTTCTTTCACCCTCTTCCGATCCGCCTTACAGACTCCTTCAAGACCACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGTATTCAAACTCTCAAACAAATCCACGCTCAGATCATCAAGACCGGCCTTCACAACACACAGTTCGCCCTCAGCAAGCTTATCGAGTTTTCCGCTGTTTCTCGCTTTGGTAGTATCTCTTACGCCATTTCCCTATTTAATTCCATCGAAGAGCCCAATTTATTCATTTGGAATTCCATGATTCGAGGCCTTTCGATGAGTCTGTCGCCGGTTCTGGCCTTGGTTTTCTTTGTCAGAATGATTTATTCTGGGGTAGAGCCGAATTCTTATACGTTTCCTTTTCTTTTGAAGTCTTGCGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATCTTTTGAAGCTTGGATTTGTGTCTGATGTGTTCATTCATACTTCGCTTATTAATATGTACGCGCAGAGTGGGGAAATGAATAATGCCCAATTGGTTTTTGATCAAAGTAAATTCAGGGATGCAATTTCTTTCACTGCATTAATTGCTGGTTATGCTTTGTGGGGTTACATGGACCGCTCTCGGAAACTGTTTGACGAAATGCCTGTGAGAGATGTGGTGTCTTGGAATGCTATGATTGCTGGGTATGCACAGACTGGTCGTTCCAAAGAGGCCTTGTTATTGTTTGAAGAAATGAGGAAAGCAAATGTCCCCCCAAATGAGAGTACTATTGTGTCTGTTCTTTCTGCTTGTGCTCAGTCAAACGCTCTAGATTTAGGAAACTCAATGCGCTCTTGGATTGAAGAACGCGGGCTTCGTTCAAATCTTAAGCTTGTTAATGCCCTTATTGACATGTACTCAAAGTGCGGTGATCTTAGGACTGCTCGTGAATTGTTTGATGATATGCCTGAAAGAGATGTGATCTCATGGAATGTTATGATCGGAGGTTACACTCACATGTGCAGCTATAAAGAAGCTTTGGCACTCTTCCGCGAGATGCTAGCCTCAGGTGTTGAGCCTACTGACATAACTTTCCTTAACATTCTTCCATCTTGTGCTCATCTAGGTGCTATTGACCTTGGTAAGTGGATACATGCTTATATAAACAAAAACTTCAACTCAGCCAGTGCCTCTCTTTGGACGAGTCTGATAGATTTGTACGCTAAATGCGGTGATATAGAGGCAGCACGACAGGTCTTTCATGGTATGAATATCAAAACCTTGGCTTCTTGGAATGCTATGATATGTGGGTTAGCAATGCATGGACAAGCAGATGAGGCTTTTGAGCTTTTCTCAAAAATGTCTAGTGATGGAATTGAACCAAATGAGATAACATTTGTCGGTATTCTATCAGCTTGTAAACACGCTGGTTTGGTTGATCTCGGACGCCAATTTTTCAGCTCTATGGTTCAAGACTATAAGATATCTCCTAAATCCCAACATTATGGATGCATGATAGATCTTCTTGGGCGAGCTGGGTTGTTTGAGGAAGCAGAGTCCTTGATACAGAATATGGAAATGAAACCAGATGGAGCCATATGGGGTTCCCTTCTTGGGGCATGTAGAGACCATGGACGAGTCGAGTTGGGAGAACTAGTTGCAGAACGTCTTTTCAAACTTGAGCCTGATAATCCTGGAGCTTATGTGCTTTTATCTAACATATATGCAGGAGCTGGTAAATGGGATGATGTAGCAAGAATAAGAACAAGATTGAACGACAGGGGAATGAAGAAAGTTCCTGGTTGTACCACCATTGAAGTCGATAACGTCGTCCATGAGTTCCTAGTAGGAGACAAAGTTCATCCACAAAGTGAAGATATTTACAAGATGCTGGAAGAAGTAGACAGACAATTAAAGGTGTTTGGATTTGTGGCAGATACATCTGAGGTATTGTATGACATGGATGAAGAATGGAAAGAAGGAGCTTTAAGCCACCACAGTGAGAAATTAGCAATTGC
TTTTGGATTGATAAGTACAAAACCAGGAACACCAATTAGAATCATTAAAAATCTTCGTGTCTGTCGTAATTGTCATTCTGCTACAAAGCTAATATCTAAGATATTCAATAGAGAGATTATTGCTAGAGATAGAAACCGTTTCCATCACTTCAAAGATGGTTCTTGCTCATGTAATGACTATTGGTGA

Protein sequence

MALSAPPLLLSPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSCNDYW
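
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the page itself does not say which table was used, and the function name `translate` is hypothetical. Only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1),
# written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the polypeptide
        protein.append(aa)
    return "".join(protein)


# First 24 and last 18 nucleotides of the CDS listed on this page.
print(translate("ATGGCGCTTTCTGCCCCTCCTCTG"))  # MALSAPPL
print(translate("TGTAATGACTATTGGTGA"))        # CNDYW
```

Both fragments reproduce the first and last residues of the protein sequence shown above.
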
Homology
BLAST of CaUC06G117920 vs. NCBI nr
Match: XP_004150015.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus])

HSP 1 Score: 1392.5 bits (3603), Expect = 0.0e+00
Identity = 689/734 (93.87%), Positives = 710/734 (96.73%), Query Frame = 0

Query: 1   MALSAPPLLLS------PSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNT 60
           MALS+P LLLS      PSSDPPYR+LQ+HPSLKLLSKCQSI+T KQIHA IIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60

Query: 61  QFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120
            FALSKLIEFSAVSR G ISYAISLFNSIEEPNLFIWNSMIRGLSMSLSP LALVFFVRM
Sbjct: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTG 240
           NNAQLVFDQS FRDAISFTALIAGYALWGYMDR+R+LFDEMPV+DVVSWNAMIAGYAQ G
Sbjct: 181 NNAQLVFDQSNFRDAISFTALIAGYALWGYMDRARQLFDEMPVKDVVSWNAMIAGYAQMG 240

Query: 241 RSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVN 300
           RSKEALLLFE+MRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGL SNLKLVN
Sbjct: 241 RSKEALLLFEDMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLCSNLKLVN 300

Query: 301 ALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360
           ALIDMYSKCGDL+TARELFDDM ERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP
Sbjct: 301 ALIDMYSKCGDLQTARELFDDMLERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360

Query: 361 TDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVF 420
           T+ITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS S SL TSLIDLYAKCG+I AARQVF
Sbjct: 361 TEITFLSILPSCAHLGAIDLGKWIHAYINKNFNSVSTSLSTSLIDLYAKCGNIVAARQVF 420

Query: 421 HGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480
            GM IK+LASWNAMICGLAMHGQAD+AFELFSKMSSDGIEPNEITFVGILSACKHAGLVD
Sbjct: 421 DGMKIKSLASWNAMICGLAMHGQADKAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480

Query: 481 LGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGA 540
           LG+QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESL+QNME+KPDGAIWGSLLGA
Sbjct: 481 LGQQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLLQNMEVKPDGAIWGSLLGA 540

Query: 541 CRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600
           CRDHGRVELGELVAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP
Sbjct: 541 CRDHGRVELGELVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600

Query: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKE 660
           GCTTIEVDNVVHEFLVGDKVHPQSEDIY+MLEEVD QLKVFGFVADTSEVLYDMDEEWKE
Sbjct: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYRMLEEVDEQLKVFGFVADTSEVLYDMDEEWKE 660

Query: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720
           GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF
Sbjct: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720

Query: 721 HHFKDGSCSCNDYW 729
           HHFKDGSCSCNDYW
Sbjct: 721 HHFKDGSCSCNDYW 734
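
The identity and positive percentages in each HSP summary are the matched-column counts divided by the alignment length. The following is a minimal sketch that re-derives them from a summary line; the helper names `parse_fraction_fields` and `percent` are hypothetical, and the fields are read by position so the label spelling does not matter.

```python
import re


def parse_fraction_fields(line):
    """Return [(label, matches, length), ...] for fields like 'Identity = 689/734'."""
    return [
        (label, int(num), int(den))
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    ]


def percent(matches, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


summary = "Identity = 689/734 (93.87%), Positives = 710/734 (96.73%), Query Frame = 0"
for label, matches, length in parse_fraction_fields(summary):
    print(label, percent(matches, length))
# Identity 93.87
# Positives 96.73
```

The denominator is the alignment length including gap columns, which is why it can exceed the query length.
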

BLAST of CaUC06G117920 vs. NCBI nr
Match: KAA0046752.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK29694.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1391.7 bits (3601), Expect = 0.0e+00
Identity = 686/734 (93.46%), Positives = 709/734 (96.59%), Query Frame = 0

Query: 1   MALSAPPLLLS------PSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNT 60
           MALS+P LLLS      PSSDPPYR+LQ+HP+LKLLSKCQ+I+T KQIHA IIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPALKLLSKCQNIRTFKQIHAHIIKTGLHNT 60

Query: 61  QFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120
            FALSKLIEFSAVSR G ISYAISLF+SIE+PNLFIWNSMIRGLSMSLSPVLALVFFVRM
Sbjct: 61  HFALSKLIEFSAVSRSGDISYAISLFSSIEDPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTG 240
           NNAQL+FDQS FRDAISFTALIAGYALWGYMDR+R+LFDEMPV+DVVSWNAMIAGYAQ G
Sbjct: 181 NNAQLIFDQSNFRDAISFTALIAGYALWGYMDRARQLFDEMPVKDVVSWNAMIAGYAQMG 240

Query: 241 RSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVN 300
           RSKEALLLFE+MRK NVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVN
Sbjct: 241 RSKEALLLFEDMRKENVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVN 300

Query: 301 ALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360
           ALIDMYSKCGDL TARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP
Sbjct: 301 ALIDMYSKCGDLPTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360

Query: 361 TDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVF 420
           T+ITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS S SL TSLIDLYAKCG+I AARQVF
Sbjct: 361 TEITFLSILPSCAHLGAIDLGKWIHAYINKNFNSVSTSLSTSLIDLYAKCGNIVAARQVF 420

Query: 421 HGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480
            GMNIK+LASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVG+LSACKHAGLVD
Sbjct: 421 DGMNIKSLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVD 480

Query: 481 LGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGA 540
           LG Q FSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGA
Sbjct: 481 LGHQIFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEVKPDGAIWGSLLGA 540

Query: 541 CRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600
           CRDHGRVELGELVAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP
Sbjct: 541 CRDHGRVELGELVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600

Query: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKE 660
           GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVD+QLKVFGFVADTSEVLYDMDEEWKE
Sbjct: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDKQLKVFGFVADTSEVLYDMDEEWKE 660

Query: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720
           G LSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF
Sbjct: 661 GTLSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720

Query: 721 HHFKDGSCSCNDYW 729
           HHFKDGSCSCNDYW
Sbjct: 721 HHFKDGSCSCNDYW 734

BLAST of CaUC06G117920 vs. NCBI nr
Match: XP_023515625.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 678/731 (92.75%), Positives = 710/731 (97.13%), Query Frame = 0

Query: 1   MALSAPPLLL---SPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFA 60
           MA SAP L+L   SPSSDPPYRLLQDHPSLKL+SKC+SI+TL+QIHAQIIKTGLHNTQFA
Sbjct: 1   MATSAPSLVLSPTSPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNTQFA 60

Query: 61  LSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYS 120
           LSKLIEFSAVSR+  ISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSPVLALVFFVRMI++
Sbjct: 61  LSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFVRMIHA 120

Query: 121 GVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNA 180
           GVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGE+N A
Sbjct: 121 GVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEINYA 180

Query: 181 QLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240
           QLVFDQS FRDAISFTALIAGY LWGYMDR+RKLFDEMPVRDVVSWNAMIAGYAQTGRSK
Sbjct: 181 QLVFDQSNFRDAISFTALIAGYVLWGYMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240

Query: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALI 300
           EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVNALI
Sbjct: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVNALI 300

Query: 301 DMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDI 360
           DMYSKCGDL+TA ELFD+MPERDVISWNVMIGGYTHMCSYKEALALFREMLASG+EPTDI
Sbjct: 301 DMYSKCGDLQTACELFDEMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGIEPTDI 360

Query: 361 TFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGM 420
           TFLN+LPSCA LGAIDLGKWIHAYINKNFNSAS SLWTSLID+YAKCG+I+AARQVF+GM
Sbjct: 361 TFLNVLPSCACLGAIDLGKWIHAYINKNFNSASTSLWTSLIDMYAKCGNIDAARQVFNGM 420

Query: 421 NIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGR 480
           NIK+LASWNAMICGLAMHGQA+EA ELFSKMSS+GIEPNEITFVG+LSACKHAG VDLGR
Sbjct: 421 NIKSLASWNAMICGLAMHGQANEALELFSKMSSNGIEPNEITFVGVLSACKHAGFVDLGR 480

Query: 481 QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540
            FFSSM+QDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD
Sbjct: 481 LFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540

Query: 541 HGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCT 600
           HGRVELGE+VAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRT+LND GMKKVPGCT
Sbjct: 541 HGRVELGEIVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTKLNDMGMKKVPGCT 600

Query: 601 TIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGAL 660
           TIEVDNVVHEFLVGDKVHPQ+E+IYKMLEEVDRQLK FGFV DTSEVLYDMDEEWKEGAL
Sbjct: 601 TIEVDNVVHEFLVGDKVHPQTENIYKMLEEVDRQLKEFGFVPDTSEVLYDMDEEWKEGAL 660

Query: 661 SHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHF 720
           SHHSEKLAIAFGLISTKPGTPI IIKNLRVCRNCH+ATKLISKIFNREIIARDRNRFHHF
Sbjct: 661 SHHSEKLAIAFGLISTKPGTPITIIKNLRVCRNCHAATKLISKIFNREIIARDRNRFHHF 720

Query: 721 KDGSCSCNDYW 729
           KDGSCSCNDYW
Sbjct: 721 KDGSCSCNDYW 731

BLAST of CaUC06G117920 vs. NCBI nr
Match: XP_022987625.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1382.1 bits (3576), Expect = 0.0e+00
Identity = 675/731 (92.34%), Positives = 709/731 (96.99%), Query Frame = 0

Query: 1   MALSAPPLLL---SPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFA 60
           MA SAP L+L   SPSSDPPYRLLQDHPSLKL+SKC+SI+TL+QIHAQIIKTGLHNTQFA
Sbjct: 1   MATSAPSLVLSPTSPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNTQFA 60

Query: 61  LSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYS 120
           LSKLIEFSAVSR+  ISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSPVLALVFF RMI++
Sbjct: 61  LSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFARMIHA 120

Query: 121 GVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNA 180
           GVEPNSYTFPFLLKSCA+LASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGE+N A
Sbjct: 121 GVEPNSYTFPFLLKSCARLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEINYA 180

Query: 181 QLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240
           QLVFDQS FRDAISFTALIAGY LWGYMDR+RKLFDEMPVRDVVSWNAMIAGYAQTGRSK
Sbjct: 181 QLVFDQSNFRDAISFTALIAGYVLWGYMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240

Query: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALI 300
           EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVNALI
Sbjct: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVNALI 300

Query: 301 DMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDI 360
           DMYSKCG L+TARELFD+MPERDVISWNVMIGGYTHMCSYKEALALFREMLASG+EPTDI
Sbjct: 301 DMYSKCGALQTARELFDEMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGIEPTDI 360

Query: 361 TFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGM 420
           TFLN+LPSCA LGAIDLGKWIHAYINKNFNSAS SLWTSLID+YAKCG+IEAARQVF+GM
Sbjct: 361 TFLNVLPSCACLGAIDLGKWIHAYINKNFNSASTSLWTSLIDMYAKCGNIEAARQVFNGM 420

Query: 421 NIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGR 480
           NIK+LASWNAMICGLAMHGQA+EA ELFSK++SDGIEPNEITFVG+LSACKHAG VDLGR
Sbjct: 421 NIKSLASWNAMICGLAMHGQANEALELFSKLTSDGIEPNEITFVGVLSACKHAGFVDLGR 480

Query: 481 QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540
            FFSSM+QDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD
Sbjct: 481 LFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540

Query: 541 HGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCT 600
           HGRVELGE+VAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVA IRT+LND+GMKKVPGCT
Sbjct: 541 HGRVELGEIVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVATIRTKLNDKGMKKVPGCT 600

Query: 601 TIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGAL 660
           TIEVDNVVHEFLVGDKVHPQSE+IYKMLEEVDRQLK FGFV DTSEVLYDMDEEWKEGAL
Sbjct: 601 TIEVDNVVHEFLVGDKVHPQSENIYKMLEEVDRQLKEFGFVPDTSEVLYDMDEEWKEGAL 660

Query: 661 SHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHF 720
           SHHSEKLAIAFGLISTKPGTPI IIKNLRVCRNCH+ATKLISKIFNREIIARDRNRFHHF
Sbjct: 661 SHHSEKLAIAFGLISTKPGTPITIIKNLRVCRNCHAATKLISKIFNREIIARDRNRFHHF 720

Query: 721 KDGSCSCNDYW 729
           KDGSCSCNDYW
Sbjct: 721 KDGSCSCNDYW 731

BLAST of CaUC06G117920 vs. NCBI nr
Match: XP_022961045.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 676/731 (92.48%), Positives = 709/731 (96.99%), Query Frame = 0

Query: 1   MALSAPPLLL---SPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFA 60
           MA SAP L+L   SPSSDPPYRLLQDHPSLKL+SKC+SI+TL+QIHAQIIKTGLHNTQFA
Sbjct: 1   MATSAPSLVLSPTSPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNTQFA 60

Query: 61  LSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYS 120
           LSKLIEFSAVSR+  ISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSPVLALVFFVRMI++
Sbjct: 61  LSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFVRMIHA 120

Query: 121 GVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNA 180
           GVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGE+N A
Sbjct: 121 GVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEINYA 180

Query: 181 QLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240
           QLVFDQS FRDAISFTALIAGY LWGYMDR+RKLFDEMPVRDVVSWNAMIAGYAQTGRSK
Sbjct: 181 QLVFDQSNFRDAISFTALIAGYVLWGYMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240

Query: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALI 300
           EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGL SNLKLVNALI
Sbjct: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLCSNLKLVNALI 300

Query: 301 DMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDI 360
           DMYSKCGDL+TARELFD+MPERDVISWNVMIGGYTHMCSYKEALALFREMLASG+EPTDI
Sbjct: 301 DMYSKCGDLQTARELFDEMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGIEPTDI 360

Query: 361 TFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGM 420
           TFLN+LPSCA LGAID+GKWIHAYINKNFNSAS SLWTSLID+YAKCG+I+AARQVF+GM
Sbjct: 361 TFLNVLPSCACLGAIDVGKWIHAYINKNFNSASTSLWTSLIDMYAKCGNIDAARQVFNGM 420

Query: 421 NIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGR 480
           N K+LASWNAMICGLAMHGQA+EA ELFSKMSSDGIEPNEITFVG+LSACKHAG VDLGR
Sbjct: 421 NFKSLASWNAMICGLAMHGQANEALELFSKMSSDGIEPNEITFVGVLSACKHAGFVDLGR 480

Query: 481 QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540
            FFSSM+QDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD
Sbjct: 481 LFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540

Query: 541 HGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCT 600
           HGRVELGE+VAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRT+LND+GMKKVPGCT
Sbjct: 541 HGRVELGEIVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTKLNDKGMKKVPGCT 600

Query: 601 TIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGAL 660
           TIEVDNVVHEFLVGDKVH QSE+IYKMLEEVDRQLK FGFV DTSEVLYDMDEEWKEGAL
Sbjct: 601 TIEVDNVVHEFLVGDKVHLQSENIYKMLEEVDRQLKEFGFVPDTSEVLYDMDEEWKEGAL 660

Query: 661 SHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHF 720
           SHHSEKLAIAFGLISTKPGTPI IIKNLRVCRNCH+ATKLISKIFNREIIARDRNRFHHF
Sbjct: 661 SHHSEKLAIAFGLISTKPGTPITIIKNLRVCRNCHAATKLISKIFNREIIARDRNRFHHF 720

Query: 721 KDGSCSCNDYW 729
           KDGSCSCNDYW
Sbjct: 721 KDGSCSCNDYW 731

BLAST of CaUC06G117920 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 986.1 bits (2548), Expect = 2.1e-286
Identity = 467/728 (64.15%), Positives = 588/728 (80.77%), Query Frame = 0

Query: 4   SAPPLLLSPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIE 63
           S P   L  SSDPPY  +++HPSL LL  C+++Q+L+ IHAQ+IK GLHNT +ALSKLIE
Sbjct: 14  SYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIE 73

Query: 64  FSAVS-RFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEPN 123
           F  +S  F  + YAIS+F +I+EPNL IWN+M RG ++S  PV AL  +V MI  G+ PN
Sbjct: 74  FCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPN 133

Query: 124 SYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFD 183
           SYTFPF+LKSCAK  +  EG+QIH H+LKLG   D+++HTSLI+MY Q+G + +A  VFD
Sbjct: 134 SYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD 193

Query: 184 QSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLL 243
           +S  RD +S+TALI GYA  GY++ ++KLFDE+PV+DVVSWNAMI+GYA+TG  KEAL L
Sbjct: 194 KSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 253

Query: 244 FEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSK 303
           F++M K NV P+EST+V+V+SACAQS +++LG  +  WI++ G  SNLK+VNALID+YSK
Sbjct: 254 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 313

Query: 304 CGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLNI 363
           CG+L TA  LF+ +P +DVISWN +IGGYTHM  YKEAL LF+EML SG  P D+T L+I
Sbjct: 314 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 373

Query: 364 LPSCAHLGAIDLGKWIHAYINKNFNSA--SASLWTSLIDLYAKCGDIEAARQVFHGMNIK 423
           LP+CAHLGAID+G+WIH YI+K       ++SL TSLID+YAKCGDIEAA QVF+ +  K
Sbjct: 374 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 433

Query: 424 TLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFF 483
           +L+SWNAMI G AMHG+AD +F+LFS+M   GI+P++ITFVG+LSAC H+G++DLGR  F
Sbjct: 434 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 493

Query: 484 SSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGR 543
            +M QDYK++PK +HYGCMIDLLG +GLF+EAE +I  MEM+PDG IW SLL AC+ HG 
Sbjct: 494 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 553

Query: 544 VELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIE 603
           VELGE  AE L K+EP+NPG+YVLLSNIYA AG+W++VA+ R  LND+GMKKVPGC++IE
Sbjct: 554 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 613

Query: 604 VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHH 663
           +D+VVHEF++GDK HP++ +IY MLEE++  L+  GFV DTSEVL +M+EEWKEGAL HH
Sbjct: 614 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHH 673

Query: 664 SEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDG 723
           SEKLAIAFGLISTKPGT + I+KNLRVCRNCH ATKLISKI+ REIIARDR RFHHF+DG
Sbjct: 674 SEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDG 733

Query: 724 SCSCNDYW 729
            CSCNDYW
Sbjct: 734 VCSCNDYW 741

BLAST of CaUC06G117920 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 6.0e-172
Identity = 301/736 (40.90%), Positives = 447/736 (60.73%), Query Frame = 0

Query: 27  LKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGSISYAISLFNSIEEP 86
           + L+ +C S++ LKQ H  +I+TG  +  ++ SKL   +A+S F S+ YA  +F+ I +P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 87  NLFIWNSMIRGLSMSLSPVLALVFFVRMI-YSGVEPNSYTFPFLLKSCAKLASAHEGKQI 146
           N F WN++IR  +    PVL++  F+ M+  S   PN YTFPFL+K+ A+++S   G+ +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 147 HAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGYM 206
           H   +K    SDVF+  SLI+ Y   G+                               +
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGD-------------------------------L 213

Query: 207 DRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSAC 266
           D + K+F  +  +DVVSWN+MI G+ Q G   +AL LF++M   +V  +  T+V VLSAC
Sbjct: 214 DSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSAC 273

Query: 267 AQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSKCGDLRTARELFD----------- 326
           A+   L+ G  + S+IEE  +  NL L NA++DMY+KCG +  A+ LFD           
Sbjct: 274 AKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWT 333

Query: 327 --------------------DMPERDVISWNVMIGGYTHMCSYKEALALFREM-LASGVE 386
                                MP++D+++WN +I  Y       EAL +F E+ L   ++
Sbjct: 334 TMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMK 393

Query: 387 PTDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQV 446
              IT ++ L +CA +GA++LG+WIH+YI K+    +  + ++LI +Y+KCGD+E +R+V
Sbjct: 394 LNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREV 453

Query: 447 FHGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLV 506
           F+ +  + +  W+AMI GLAMHG  +EA ++F KM    ++PN +TF  +  AC H GLV
Sbjct: 454 FNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLV 513

Query: 507 DLGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLG 566
           D     F  M  +Y I P+ +HY C++D+LGR+G  E+A   I+ M + P  ++WG+LLG
Sbjct: 514 DEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLG 573

Query: 567 ACRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKV 626
           AC+ H  + L E+   RL +LEP N GA+VLLSNIYA  GKW++V+ +R  +   G+KK 
Sbjct: 574 ACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKE 633

Query: 627 PGCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDM-DEEW 686
           PGC++IE+D ++HEFL GD  HP SE +Y  L EV  +LK  G+  + S+VL  + +EE 
Sbjct: 634 PGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEM 693

Query: 687 KEGALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRN 729
           KE +L+ HSEKLAI +GLIST+    IR+IKNLRVC +CHS  KLIS++++REII RDR 
Sbjct: 694 KEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRY 738

BLAST of CaUC06G117920 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 3.0e-163
Identity = 306/835 (36.65%), Positives = 466/835 (55.81%), Query Frame = 0

Query: 3   LSAPPLLLSPSSDPPYRLLQDHPSLKL----LSKCQSIQTLKQIHAQIIKTGLHNTQFAL 62
           L   P++L+ ++     LL      K     L  C++I  LK  H  + K GL N    +
Sbjct: 8   LHLSPMVLATTTTTKPSLLNQSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTI 67

Query: 63  SKLIEFSA-VSRFGSISYAISLF-NSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIY 122
           +KL+  S  +    S+S+A  +F NS      F++NS+IRG + S     A++ F+RM+ 
Sbjct: 68  TKLVARSCELGTRESLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMN 127

Query: 123 SGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNN 182
           SG+ P+ YTFPF L +CAK  +   G QIH  ++K+G+  D+F+  SL++ YA+ GE+++
Sbjct: 128 SGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDS 187

Query: 183 AQLVFDQSKFRDAISFTALIAGYALWGY-------------------------------- 242
           A+ VFD+   R+ +S+T++I GYA   +                                
Sbjct: 188 ARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACA 247

Query: 243 ---------------------------------------MDRSRKLFDEMPVRDVVSWNA 302
                                                  +D +++LFDE    ++   NA
Sbjct: 248 KLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNA 307

Query: 303 MIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERG 362
           M + Y + G ++EAL +F  M  + V P+  +++S +S+C+Q   +  G S   ++   G
Sbjct: 308 MASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNG 367

Query: 363 LRSNLKLVNALIDMYSKC-------------------------------GDLRTARELFD 422
             S   + NALIDMY KC                               G++  A E F+
Sbjct: 368 FESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFE 427

Query: 423 DMPERDVISWNVMIGGYTHMCSYKEALALFREMLA-SGVEPTDITFLNILPSCAHLGAID 482
            MPE++++SWN +I G      ++EA+ +F  M +  GV    +T ++I  +C HLGA+D
Sbjct: 428 TMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALD 487

Query: 483 LGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKTLASWNAMICGLA 542
           L KWI+ YI KN       L T+L+D++++CGD E+A  +F+ +  + +++W A I  +A
Sbjct: 488 LAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMA 547

Query: 543 MHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFSSMVQDYKISPKS 602
           M G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G++ F SM++ + +SP+ 
Sbjct: 548 MAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPED 607

Query: 603 QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGELVAERLFK 662
            HYGCM+DLLGRAGL EEA  LI++M M+P+  IW SLL ACR  G VE+    AE++  
Sbjct: 608 VHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQV 667

Query: 663 LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEVDNVVHEFLVGDK 722
           L P+  G+YVLLSN+YA AG+W+D+A++R  + ++G++K PG ++I++    HEF  GD+
Sbjct: 668 LAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDE 727

Query: 723 VHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHSEKLAIAFGLIST 729
            HP+  +I  ML+EV ++    G V D S VL D+DE+ K   LS HSEKLA+A+GLIS+
Sbjct: 728 SHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISS 787

BLAST of CaUC06G117920 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 8.6e-163
Identity = 299/727 (41.13%), Positives = 442/727 (60.80%), Query Frame = 0

Query: 5   APPLLLSPS---SDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKL 64
           A PLL + S   SD  Y  L D  + K          LKQIHA+++  GL  + F ++KL
Sbjct: 8   ASPLLYTNSGIHSDSFYASLIDSATHK--------AQLKQIHARLLVLGLQFSGFLITKL 67

Query: 65  IEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEP 124
           I   A S FG I++A  +F+ +  P +F WN++IRG S +     AL+ +  M  + V P
Sbjct: 68  IH--ASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSP 127

Query: 125 NSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVF 184
           +S+TFP LLK+C+ L+    G+ +HA + +LGF +DVF+   LI +YA+   + +A+ VF
Sbjct: 128 DSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVF 187

Query: 185 DQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALL 244
           +                                +P R +VSW A+++ YAQ G   EAL 
Sbjct: 188 EGL-----------------------------PLPERTIVSWTAIVSAYAQNGEPMEALE 247

Query: 245 LFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYS 304
           +F +MRK +V P+   +VSVL+A      L  G S+ + + + GL     L+ +L  MY+
Sbjct: 248 IFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYA 307

Query: 305 KCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLN 364
           KCG + TA+ LFD M   ++I WN MI GY      +EA+ +F EM+   V P  I+  +
Sbjct: 308 KCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITS 367

Query: 365 ILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKT 424
            + +CA +G+++  + ++ Y+ ++       + ++LID++AKCG +E AR VF     + 
Sbjct: 368 AISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRD 427

Query: 425 LASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFS 484
           +  W+AMI G  +HG+A EA  L+  M   G+ PN++TF+G+L AC H+G+V  G  FF+
Sbjct: 428 VVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFN 487

Query: 485 SMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRV 544
            M  D+KI+P+ QHY C+IDLLGRAG  ++A  +I+ M ++P   +WG+LL AC+ H  V
Sbjct: 488 RMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHV 547

Query: 545 ELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEV 604
           ELGE  A++LF ++P N G YV LSN+YA A  WD VA +R R+ ++G+ K  GC+ +EV
Sbjct: 548 ELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEV 607

Query: 605 DNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHS 664
              +  F VGDK HP+ E+I + +E ++ +LK  GFVA+    L+D+++E  E  L  HS
Sbjct: 608 RGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHS 667

Query: 665 EKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGS 724
           E++AIA+GLIST  GTP+RI KNLR C NCH+ATKLISK+ +REI+ RD NRFHHFKDG 
Sbjct: 668 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

Query: 725 CSCNDYW 729
           CSC DYW
Sbjct: 728 CSCGDYW 694

BLAST of CaUC06G117920 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 1.3e-158
Identity = 296/776 (38.14%), Positives = 430/776 (55.41%), Query Frame = 0

Query: 23  DHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGSISYAISLFNS 82
           +HP+  LL +C S++ L+QI   + K GL+   F  +KL+  S   R+GS+  A  +F  
Sbjct: 37  EHPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLV--SLFCRYGSVDEAARVFEP 96

Query: 83  IEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEG 142
           I+     ++++M++G +       AL FFVRM Y  VEP  Y F +LLK C   A    G
Sbjct: 97  IDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVG 156

Query: 143 KQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQSKFRDAISFTALIAG---- 202
           K+IH  L+K GF  D+F  T L NMYA+  ++N A+ VFD+   RD +S+  ++AG    
Sbjct: 157 KEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 203 ------------------------------------------------------------ 262
                                                                       
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 263 ------YALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMRKANVP 322
                 YA  G ++ +R+LFD M  R+VVSWN+MI  Y Q    KEA+L+F++M    V 
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVK 336

Query: 323 PNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSKCGDLRTAREL 382
           P + +++  L ACA    L+ G  +     E GL  N+ +VN+LI MY KC ++ TA  +
Sbjct: 337 PTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASM 396

Query: 383 FDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLNILPSCAHLGAI 442
           F  +  R ++SWN MI G+       +AL  F +M +  V+P   T+++++ + A L   
Sbjct: 397 FGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSIT 456

Query: 443 DLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKTLASWNAMICGL 502
              KWIH  + ++    +  + T+L+D+YAKCG I  AR +F  M+ + + +WNAMI G 
Sbjct: 457 HHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGY 516

Query: 503 AMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFSSMVQDYKISPK 562
             HG    A ELF +M    I+PN +TF+ ++SAC H+GLV+ G + F  M ++Y I   
Sbjct: 517 GTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELS 576

Query: 563 SQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGELVAERLF 622
             HYG M+DLLGRAG   EA   I  M +KP   ++G++LGAC+ H  V   E  AERLF
Sbjct: 577 MDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLF 636

Query: 623 KLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEVDNVVHEFLVGD 682
           +L PD+ G +VLL+NIY  A  W+ V ++R  +  +G++K PGC+ +E+ N VH F  G 
Sbjct: 637 ELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGS 696

Query: 683 KVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHSEKLAIAFGLIS 729
             HP S+ IY  LE++   +K  G+V DT+ VL  ++ + KE  LS HSEKLAI+FGL++
Sbjct: 697 TAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLN 756

BLAST of CaUC06G117920 vs. ExPASy TrEMBL
Match: A0A0A0LU28 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G306800 PE=3 SV=1)

HSP 1 Score: 1392.5 bits (3603), Expect = 0.0e+00
Identity = 689/734 (93.87%), Positives = 710/734 (96.73%), Query Frame = 0

Query: 1   MALSAPPLLLS------PSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNT 60
           MALS+P LLLS      PSSDPPYR+LQ+HPSLKLLSKCQSI+T KQIHA IIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60

Query: 61  QFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120
            FALSKLIEFSAVSR G ISYAISLFNSIEEPNLFIWNSMIRGLSMSLSP LALVFFVRM
Sbjct: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTG 240
           NNAQLVFDQS FRDAISFTALIAGYALWGYMDR+R+LFDEMPV+DVVSWNAMIAGYAQ G
Sbjct: 181 NNAQLVFDQSNFRDAISFTALIAGYALWGYMDRARQLFDEMPVKDVVSWNAMIAGYAQMG 240

Query: 241 RSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVN 300
           RSKEALLLFE+MRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGL SNLKLVN
Sbjct: 241 RSKEALLLFEDMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLCSNLKLVN 300

Query: 301 ALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360
           ALIDMYSKCGDL+TARELFDDM ERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP
Sbjct: 301 ALIDMYSKCGDLQTARELFDDMLERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360

Query: 361 TDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVF 420
           T+ITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS S SL TSLIDLYAKCG+I AARQVF
Sbjct: 361 TEITFLSILPSCAHLGAIDLGKWIHAYINKNFNSVSTSLSTSLIDLYAKCGNIVAARQVF 420

Query: 421 HGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480
            GM IK+LASWNAMICGLAMHGQAD+AFELFSKMSSDGIEPNEITFVGILSACKHAGLVD
Sbjct: 421 DGMKIKSLASWNAMICGLAMHGQADKAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480

Query: 481 LGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGA 540
           LG+QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESL+QNME+KPDGAIWGSLLGA
Sbjct: 481 LGQQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLLQNMEVKPDGAIWGSLLGA 540

Query: 541 CRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600
           CRDHGRVELGELVAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP
Sbjct: 541 CRDHGRVELGELVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600

Query: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKE 660
           GCTTIEVDNVVHEFLVGDKVHPQSEDIY+MLEEVD QLKVFGFVADTSEVLYDMDEEWKE
Sbjct: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYRMLEEVDEQLKVFGFVADTSEVLYDMDEEWKE 660

Query: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720
           GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF
Sbjct: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720

Query: 721 HHFKDGSCSCNDYW 729
           HHFKDGSCSCNDYW
Sbjct: 721 HHFKDGSCSCNDYW 734

BLAST of CaUC06G117920 vs. ExPASy TrEMBL
Match: A0A5A7TTJ1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold64G00470 PE=3 SV=1)

HSP 1 Score: 1391.7 bits (3601), Expect = 0.0e+00
Identity = 686/734 (93.46%), Positives = 709/734 (96.59%), Query Frame = 0

Query: 1   MALSAPPLLLS------PSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNT 60
           MALS+P LLLS      PSSDPPYR+LQ+HP+LKLLSKCQ+I+T KQIHA IIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPALKLLSKCQNIRTFKQIHAHIIKTGLHNT 60

Query: 61  QFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120
            FALSKLIEFSAVSR G ISYAISLF+SIE+PNLFIWNSMIRGLSMSLSPVLALVFFVRM
Sbjct: 61  HFALSKLIEFSAVSRSGDISYAISLFSSIEDPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTG 240
           NNAQL+FDQS FRDAISFTALIAGYALWGYMDR+R+LFDEMPV+DVVSWNAMIAGYAQ G
Sbjct: 181 NNAQLIFDQSNFRDAISFTALIAGYALWGYMDRARQLFDEMPVKDVVSWNAMIAGYAQMG 240

Query: 241 RSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVN 300
           RSKEALLLFE+MRK NVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVN
Sbjct: 241 RSKEALLLFEDMRKENVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVN 300

Query: 301 ALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360
           ALIDMYSKCGDL TARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP
Sbjct: 301 ALIDMYSKCGDLPTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360

Query: 361 TDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVF 420
           T+ITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS S SL TSLIDLYAKCG+I AARQVF
Sbjct: 361 TEITFLSILPSCAHLGAIDLGKWIHAYINKNFNSVSTSLSTSLIDLYAKCGNIVAARQVF 420

Query: 421 HGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480
            GMNIK+LASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVG+LSACKHAGLVD
Sbjct: 421 DGMNIKSLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVD 480

Query: 481 LGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGA 540
           LG Q FSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGA
Sbjct: 481 LGHQIFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEVKPDGAIWGSLLGA 540

Query: 541 CRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600
           CRDHGRVELGELVAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP
Sbjct: 541 CRDHGRVELGELVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600

Query: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKE 660
           GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVD+QLKVFGFVADTSEVLYDMDEEWKE
Sbjct: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDKQLKVFGFVADTSEVLYDMDEEWKE 660

Query: 661 GALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720
           G LSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF
Sbjct: 661 GTLSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRF 720

Query: 721 HHFKDGSCSCNDYW 729
           HHFKDGSCSCNDYW
Sbjct: 721 HHFKDGSCSCNDYW 734

BLAST of CaUC06G117920 vs. ExPASy TrEMBL
Match: A0A6J1JHE6 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111485125 PE=3 SV=1)

HSP 1 Score: 1382.1 bits (3576), Expect = 0.0e+00
Identity = 675/731 (92.34%), Positives = 709/731 (96.99%), Query Frame = 0

Query: 1   MALSAPPLLL---SPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFA 60
           MA SAP L+L   SPSSDPPYRLLQDHPSLKL+SKC+SI+TL+QIHAQIIKTGLHNTQFA
Sbjct: 1   MATSAPSLVLSPTSPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNTQFA 60

Query: 61  LSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYS 120
           LSKLIEFSAVSR+  ISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSPVLALVFF RMI++
Sbjct: 61  LSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFARMIHA 120

Query: 121 GVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNA 180
           GVEPNSYTFPFLLKSCA+LASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGE+N A
Sbjct: 121 GVEPNSYTFPFLLKSCARLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEINYA 180

Query: 181 QLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240
           QLVFDQS FRDAISFTALIAGY LWGYMDR+RKLFDEMPVRDVVSWNAMIAGYAQTGRSK
Sbjct: 181 QLVFDQSNFRDAISFTALIAGYVLWGYMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240

Query: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALI 300
           EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVNALI
Sbjct: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVNALI 300

Query: 301 DMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDI 360
           DMYSKCG L+TARELFD+MPERDVISWNVMIGGYTHMCSYKEALALFREMLASG+EPTDI
Sbjct: 301 DMYSKCGALQTARELFDEMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGIEPTDI 360

Query: 361 TFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGM 420
           TFLN+LPSCA LGAIDLGKWIHAYINKNFNSAS SLWTSLID+YAKCG+IEAARQVF+GM
Sbjct: 361 TFLNVLPSCACLGAIDLGKWIHAYINKNFNSASTSLWTSLIDMYAKCGNIEAARQVFNGM 420

Query: 421 NIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGR 480
           NIK+LASWNAMICGLAMHGQA+EA ELFSK++SDGIEPNEITFVG+LSACKHAG VDLGR
Sbjct: 421 NIKSLASWNAMICGLAMHGQANEALELFSKLTSDGIEPNEITFVGVLSACKHAGFVDLGR 480

Query: 481 QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540
            FFSSM+QDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD
Sbjct: 481 LFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540

Query: 541 HGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCT 600
           HGRVELGE+VAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVA IRT+LND+GMKKVPGCT
Sbjct: 541 HGRVELGEIVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVATIRTKLNDKGMKKVPGCT 600

Query: 601 TIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGAL 660
           TIEVDNVVHEFLVGDKVHPQSE+IYKMLEEVDRQLK FGFV DTSEVLYDMDEEWKEGAL
Sbjct: 601 TIEVDNVVHEFLVGDKVHPQSENIYKMLEEVDRQLKEFGFVPDTSEVLYDMDEEWKEGAL 660

Query: 661 SHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHF 720
           SHHSEKLAIAFGLISTKPGTPI IIKNLRVCRNCH+ATKLISKIFNREIIARDRNRFHHF
Sbjct: 661 SHHSEKLAIAFGLISTKPGTPITIIKNLRVCRNCHAATKLISKIFNREIIARDRNRFHHF 720

Query: 721 KDGSCSCNDYW 729
           KDGSCSCNDYW
Sbjct: 721 KDGSCSCNDYW 731

BLAST of CaUC06G117920 vs. ExPASy TrEMBL
Match: A0A6J1HAU9 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111461670 PE=3 SV=1)

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 676/731 (92.48%), Positives = 709/731 (96.99%), Query Frame = 0

Query: 1   MALSAPPLLL---SPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFA 60
           MA SAP L+L   SPSSDPPYRLLQDHPSLKL+SKC+SI+TL+QIHAQIIKTGLHNTQFA
Sbjct: 1   MATSAPSLVLSPTSPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNTQFA 60

Query: 61  LSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYS 120
           LSKLIEFSAVSR+  ISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSPVLALVFFVRMI++
Sbjct: 61  LSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFVRMIHA 120

Query: 121 GVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNA 180
           GVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGE+N A
Sbjct: 121 GVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEINYA 180

Query: 181 QLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240
           QLVFDQS FRDAISFTALIAGY LWGYMDR+RKLFDEMPVRDVVSWNAMIAGYAQTGRSK
Sbjct: 181 QLVFDQSNFRDAISFTALIAGYVLWGYMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSK 240

Query: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALI 300
           EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGL SNLKLVNALI
Sbjct: 241 EALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLCSNLKLVNALI 300

Query: 301 DMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDI 360
           DMYSKCGDL+TARELFD+MPERDVISWNVMIGGYTHMCSYKEALALFREMLASG+EPTDI
Sbjct: 301 DMYSKCGDLQTARELFDEMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGIEPTDI 360

Query: 361 TFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGM 420
           TFLN+LPSCA LGAID+GKWIHAYINKNFNSAS SLWTSLID+YAKCG+I+AARQVF+GM
Sbjct: 361 TFLNVLPSCACLGAIDVGKWIHAYINKNFNSASTSLWTSLIDMYAKCGNIDAARQVFNGM 420

Query: 421 NIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGR 480
           N K+LASWNAMICGLAMHGQA+EA ELFSKMSSDGIEPNEITFVG+LSACKHAG VDLGR
Sbjct: 421 NFKSLASWNAMICGLAMHGQANEALELFSKMSSDGIEPNEITFVGVLSACKHAGFVDLGR 480

Query: 481 QFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540
            FFSSM+QDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD
Sbjct: 481 LFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRD 540

Query: 541 HGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCT 600
           HGRVELGE+VAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRT+LND+GMKKVPGCT
Sbjct: 541 HGRVELGEIVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTKLNDKGMKKVPGCT 600

Query: 601 TIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGAL 660
           TIEVDNVVHEFLVGDKVH QSE+IYKMLEEVDRQLK FGFV DTSEVLYDMDEEWKEGAL
Sbjct: 601 TIEVDNVVHEFLVGDKVHLQSENIYKMLEEVDRQLKEFGFVPDTSEVLYDMDEEWKEGAL 660

Query: 661 SHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHF 720
           SHHSEKLAIAFGLISTKPGTPI IIKNLRVCRNCH+ATKLISKIFNREIIARDRNRFHHF
Sbjct: 661 SHHSEKLAIAFGLISTKPGTPITIIKNLRVCRNCHAATKLISKIFNREIIARDRNRFHHF 720

Query: 721 KDGSCSCNDYW 729
           KDGSCSCNDYW
Sbjct: 721 KDGSCSCNDYW 731

BLAST of CaUC06G117920 vs. ExPASy TrEMBL
Match: A0A1S3CMX0 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502738 PE=3 SV=1)

HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 633/681 (92.95%), Positives = 656/681 (96.33%), Query Frame = 0

Query: 1   MALSAPPLLLS------PSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNT 60
           MALS+P LLLS      PSSDPPYR+LQ+HP+LKLLSKCQ+I+T KQIHA IIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPALKLLSKCQNIRTFKQIHAHIIKTGLHNT 60

Query: 61  QFALSKLIEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120
            FALSKLIEFSAVSR G ISYAISLF+SIE+PNLFIWNSMIRGLSMSLSPVLALVFFVRM
Sbjct: 61  HFALSKLIEFSAVSRSGDISYAISLFSSIEDPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASA EGKQIHAH+LKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTG 240
           NNAQL+FDQS FRDAISFTALIAGYALWGYMDR+R+LFDEMPV+DVVSWNAMIAGYAQ G
Sbjct: 181 NNAQLIFDQSNFRDAISFTALIAGYALWGYMDRARQLFDEMPVKDVVSWNAMIAGYAQMG 240

Query: 241 RSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVN 300
           RSKEALLLFE+MRK NVPPNESTIVSVLSACAQSNALDLGNSMRSWIE+RGLRSNLKLVN
Sbjct: 241 RSKEALLLFEDMRKENVPPNESTIVSVLSACAQSNALDLGNSMRSWIEDRGLRSNLKLVN 300

Query: 301 ALIDMYSKCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360
           ALIDMYSKCGDL TARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP
Sbjct: 301 ALIDMYSKCGDLPTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEP 360

Query: 361 TDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVF 420
           T+ITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS S SL TSLIDLYAKCG+I AARQVF
Sbjct: 361 TEITFLSILPSCAHLGAIDLGKWIHAYINKNFNSVSTSLSTSLIDLYAKCGNIVAARQVF 420

Query: 421 HGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVD 480
            GMNIK+LASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVG+LSACKHAGLVD
Sbjct: 421 DGMNIKSLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVD 480

Query: 481 LGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGA 540
           LG Q FSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGA
Sbjct: 481 LGHQIFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEVKPDGAIWGSLLGA 540

Query: 541 CRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600
           CRDHGRVELGELVAERLF+LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP
Sbjct: 541 CRDHGRVELGELVAERLFELEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVP 600

Query: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKE 660
           GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVD+QLKVFGFVADTSEVLYDMDEEWKE
Sbjct: 601 GCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDKQLKVFGFVADTSEVLYDMDEEWKE 660

Query: 661 GALSHHSEKLAIAFGLISTKP 676
           G LSHHSEKLAIAFGLISTKP
Sbjct: 661 GTLSHHSEKLAIAFGLISTKP 681

BLAST of CaUC06G117920 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 986.1 bits (2548), Expect = 1.5e-287
Identity = 467/728 (64.15%), Positives = 588/728 (80.77%), Query Frame = 0

Query: 4   SAPPLLLSPSSDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIE 63
           S P   L  SSDPPY  +++HPSL LL  C+++Q+L+ IHAQ+IK GLHNT +ALSKLIE
Sbjct: 14  SYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIE 73

Query: 64  FSAVS-RFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEPN 123
           F  +S  F  + YAIS+F +I+EPNL IWN+M RG ++S  PV AL  +V MI  G+ PN
Sbjct: 74  FCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPN 133

Query: 124 SYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFD 183
           SYTFPF+LKSCAK  +  EG+QIH H+LKLG   D+++HTSLI+MY Q+G + +A  VFD
Sbjct: 134 SYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD 193

Query: 184 QSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLL 243
           +S  RD +S+TALI GYA  GY++ ++KLFDE+PV+DVVSWNAMI+GYA+TG  KEAL L
Sbjct: 194 KSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 253

Query: 244 FEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSK 303
           F++M K NV P+EST+V+V+SACAQS +++LG  +  WI++ G  SNLK+VNALID+YSK
Sbjct: 254 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 313

Query: 304 CGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLNI 363
           CG+L TA  LF+ +P +DVISWN +IGGYTHM  YKEAL LF+EML SG  P D+T L+I
Sbjct: 314 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 373

Query: 364 LPSCAHLGAIDLGKWIHAYINKNFNSA--SASLWTSLIDLYAKCGDIEAARQVFHGMNIK 423
           LP+CAHLGAID+G+WIH YI+K       ++SL TSLID+YAKCGDIEAA QVF+ +  K
Sbjct: 374 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 433

Query: 424 TLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFF 483
           +L+SWNAMI G AMHG+AD +F+LFS+M   GI+P++ITFVG+LSAC H+G++DLGR  F
Sbjct: 434 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 493

Query: 484 SSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGR 543
            +M QDYK++PK +HYGCMIDLLG +GLF+EAE +I  MEM+PDG IW SLL AC+ HG 
Sbjct: 494 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 553

Query: 544 VELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIE 603
           VELGE  AE L K+EP+NPG+YVLLSNIYA AG+W++VA+ R  LND+GMKKVPGC++IE
Sbjct: 554 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 613

Query: 604 VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHH 663
           +D+VVHEF++GDK HP++ +IY MLEE++  L+  GFV DTSEVL +M+EEWKEGAL HH
Sbjct: 614 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHH 673

Query: 664 SEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDG 723
           SEKLAIAFGLISTKPGT + I+KNLRVCRNCH ATKLISKI+ REIIARDR RFHHF+DG
Sbjct: 674 SEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDG 733

Query: 724 SCSCNDYW 729
            CSCNDYW
Sbjct: 734 VCSCNDYW 741

BLAST of CaUC06G117920 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 605.9 bits (1561), Expect = 4.2e-173
Identity = 301/736 (40.90%), Positives = 447/736 (60.73%), Query Frame = 0

Query: 27  LKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGSISYAISLFNSIEEP 86
           + L+ +C S++ LKQ H  +I+TG  +  ++ SKL   +A+S F S+ YA  +F+ I +P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 87  NLFIWNSMIRGLSMSLSPVLALVFFVRMI-YSGVEPNSYTFPFLLKSCAKLASAHEGKQI 146
           N F WN++IR  +    PVL++  F+ M+  S   PN YTFPFL+K+ A+++S   G+ +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 147 HAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGYM 206
           H   +K    SDVF+  SLI+ Y   G+                               +
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGD-------------------------------L 213

Query: 207 DRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSAC 266
           D + K+F  +  +DVVSWN+MI G+ Q G   +AL LF++M   +V  +  T+V VLSAC
Sbjct: 214 DSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSAC 273

Query: 267 AQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYSKCGDLRTARELFD----------- 326
           A+   L+ G  + S+IEE  +  NL L NA++DMY+KCG +  A+ LFD           
Sbjct: 274 AKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWT 333

Query: 327 --------------------DMPERDVISWNVMIGGYTHMCSYKEALALFREM-LASGVE 386
                                MP++D+++WN +I  Y       EAL +F E+ L   ++
Sbjct: 334 TMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMK 393

Query: 387 PTDITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQV 446
              IT ++ L +CA +GA++LG+WIH+YI K+    +  + ++LI +Y+KCGD+E +R+V
Sbjct: 394 LNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREV 453

Query: 447 FHGMNIKTLASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLV 506
           F+ +  + +  W+AMI GLAMHG  +EA ++F KM    ++PN +TF  +  AC H GLV
Sbjct: 454 FNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLV 513

Query: 507 DLGRQFFSSMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLG 566
           D     F  M  +Y I P+ +HY C++D+LGR+G  E+A   I+ M + P  ++WG+LLG
Sbjct: 514 DEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLG 573

Query: 567 ACRDHGRVELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKV 626
           AC+ H  + L E+   RL +LEP N GA+VLLSNIYA  GKW++V+ +R  +   G+KK 
Sbjct: 574 ACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKE 633

Query: 627 PGCTTIEVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDM-DEEW 686
           PGC++IE+D ++HEFL GD  HP SE +Y  L EV  +LK  G+  + S+VL  + +EE 
Sbjct: 634 PGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEM 693

Query: 687 KEGALSHHSEKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRN 729
           KE +L+ HSEKLAI +GLIST+    IR+IKNLRVC +CHS  KLIS++++REII RDR 
Sbjct: 694 KEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRY 738

BLAST of CaUC06G117920 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1).)

HSP 1 Score: 577.0 bits (1486), Expect = 2.1e-164
Identity = 306/835 (36.65%), Positives = 466/835 (55.81%), Query Frame = 0

Query: 3   LSAPPLLLSPSSDPPYRLLQDHPSLKL----LSKCQSIQTLKQIHAQIIKTGLHNTQFAL 62
           L   P++L+ ++     LL      K     L  C++I  LK  H  + K GL N    +
Sbjct: 8   LHLSPMVLATTTTTKPSLLNQSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTI 67

Query: 63  SKLIEFSA-VSRFGSISYAISLF-NSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIY 122
           +KL+  S  +    S+S+A  +F NS      F++NS+IRG + S     A++ F+RM+ 
Sbjct: 68  TKLVARSCELGTRESLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMN 127

Query: 123 SGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNN 182
           SG+ P+ YTFPF L +CAK  +   G QIH  ++K+G+  D+F+  SL++ YA+ GE+++
Sbjct: 128 SGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDS 187

Query: 183 AQLVFDQSKFRDAISFTALIAGYALWGY-------------------------------- 242
           A+ VFD+   R+ +S+T++I GYA   +                                
Sbjct: 188 ARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACA 247

Query: 243 ---------------------------------------MDRSRKLFDEMPVRDVVSWNA 302
                                                  +D +++LFDE    ++   NA
Sbjct: 248 KLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNA 307

Query: 303 MIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERG 362
           M + Y + G ++EAL +F  M  + V P+  +++S +S+C+Q   +  G S   ++   G
Sbjct: 308 MASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNG 367

Query: 363 LRSNLKLVNALIDMYSKC-------------------------------GDLRTARELFD 422
             S   + NALIDMY KC                               G++  A E F+
Sbjct: 368 FESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFE 427

Query: 423 DMPERDVISWNVMIGGYTHMCSYKEALALFREMLA-SGVEPTDITFLNILPSCAHLGAID 482
            MPE++++SWN +I G      ++EA+ +F  M +  GV    +T ++I  +C HLGA+D
Sbjct: 428 TMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALD 487

Query: 483 LGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKTLASWNAMICGLA 542
           L KWI+ YI KN       L T+L+D++++CGD E+A  +F+ +  + +++W A I  +A
Sbjct: 488 LAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMA 547

Query: 543 MHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFSSMVQDYKISPKS 602
           M G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G++ F SM++ + +SP+ 
Sbjct: 548 MAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPED 607

Query: 603 QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGELVAERLFK 662
            HYGCM+DLLGRAGL EEA  LI++M M+P+  IW SLL ACR  G VE+    AE++  
Sbjct: 608 VHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQV 667

Query: 663 LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEVDNVVHEFLVGDK 722
           L P+  G+YVLLSN+YA AG+W+D+A++R  + ++G++K PG ++I++    HEF  GD+
Sbjct: 668 LAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDE 727

Query: 723 VHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHSEKLAIAFGLIST 729
            HP+  +I  ML+EV ++    G V D S VL D+DE+ K   LS HSEKLA+A+GLIS+
Sbjct: 728 SHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISS 787

BLAST of CaUC06G117920 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 575.5 bits (1482), Expect = 6.1e-164
Identity = 299/727 (41.13%), Positives = 442/727 (60.80%), Query Frame = 0

Query: 5   APPLLLSPS---SDPPYRLLQDHPSLKLLSKCQSIQTLKQIHAQIIKTGLHNTQFALSKL 64
           A PLL + S   SD  Y  L D  + K          LKQIHA+++  GL  + F ++KL
Sbjct: 8   ASPLLYTNSGIHSDSFYASLIDSATHK--------AQLKQIHARLLVLGLQFSGFLITKL 67

Query: 65  IEFSAVSRFGSISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIYSGVEP 124
           I   A S FG I++A  +F+ +  P +F WN++IRG S +     AL+ +  M  + V P
Sbjct: 68  IH--ASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSP 127

Query: 125 NSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVF 184
           +S+TFP LLK+C+ L+    G+ +HA + +LGF +DVF+   LI +YA+   + +A+ VF
Sbjct: 128 DSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVF 187

Query: 185 DQSKFRDAISFTALIAGYALWGYMDRSRKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALL 244
           +                                +P R +VSW A+++ YAQ G   EAL 
Sbjct: 188 EGL-----------------------------PLPERTIVSWTAIVSAYAQNGEPMEALE 247

Query: 245 LFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERGLRSNLKLVNALIDMYS 304
           +F +MRK +V P+   +VSVL+A      L  G S+ + + + GL     L+ +L  MY+
Sbjct: 248 IFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYA 307

Query: 305 KCGDLRTARELFDDMPERDVISWNVMIGGYTHMCSYKEALALFREMLASGVEPTDITFLN 364
           KCG + TA+ LFD M   ++I WN MI GY      +EA+ +F EM+   V P  I+  +
Sbjct: 308 KCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITS 367

Query: 365 ILPSCAHLGAIDLGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKT 424
            + +CA +G+++  + ++ Y+ ++       + ++LID++AKCG +E AR VF     + 
Sbjct: 368 AISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRD 427

Query: 425 LASWNAMICGLAMHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFS 484
           +  W+AMI G  +HG+A EA  L+  M   G+ PN++TF+G+L AC H+G+V  G  FF+
Sbjct: 428 VVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFN 487

Query: 485 SMVQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRV 544
            M  D+KI+P+ QHY C+IDLLGRAG  ++A  +I+ M ++P   +WG+LL AC+ H  V
Sbjct: 488 RMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHV 547

Query: 545 ELGELVAERLFKLEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEV 604
           ELGE  A++LF ++P N G YV LSN+YA A  WD VA +R R+ ++G+ K  GC+ +EV
Sbjct: 548 ELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEV 607

Query: 605 DNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHS 664
              +  F VGDK HP+ E+I + +E ++ +LK  GFVA+    L+D+++E  E  L  HS
Sbjct: 608 RGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHS 667

Query: 665 EKLAIAFGLISTKPGTPIRIIKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGS 724
           E++AIA+GLIST  GTP+RI KNLR C NCH+ATKLISK+ +REI+ RD NRFHHFKDG 
Sbjct: 668 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

Query: 725 CSCNDYW 729
           CSC DYW
Sbjct: 728 CSCGDYW 694
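
The percentages in an HSP summary line such as the one for AT3G12770.1 above are plain ratios of the identity and positive counts over the alignment length. The following is a minimal Python sketch for re-deriving them, assuming the line layout shown on this page; the helper names `parse_hsp_summary` and `summary_is_consistent` are hypothetical and not part of any BLAST toolkit.

```python
import re

# Hypothetical helper, written for the line layout shown on this page.
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def summary_is_consistent(summary, tol=0.005):
    """Check that each reported percentage equals count / length * 100,
    allowing for rounding to two decimal places."""
    ident = 100.0 * summary["identities"] / summary["alignment_length"]
    pos = 100.0 * summary["positives"] / summary["positives_length"]
    return (
        abs(ident - summary["identity_pct"]) <= tol + 1e-9
        and abs(pos - summary["positives_pct"]) <= tol + 1e-9
    )


if __name__ == "__main__":
    line = "Identity = 299/727 (41.13%), Positives = 442/727 (60.80%), Query Frame = 0"
    summary = parse_hsp_summary(line)
    print(summary["identities"], summary["alignment_length"],
          summary_is_consistent(summary))
```

The alignment length in the denominator counts gap columns, so it can exceed the length of either sequence.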

BLAST of CaUC06G117920 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink).)

HSP 1 Score: 572.8 bits (1475), Expect = 4.0e-163
Identity = 305/834 (36.57%), Positives = 465/834 (55.76%), Query Frame = 0

Query: 3   LSAPPLLLSPSSDPPYRLLQDHPSLKL----LSKCQSIQTLKQIHAQIIKTGLHNTQFAL 62
           L   P++L+ ++     LL      K     L  C++I  LK  H  + K GL N    +
Sbjct: 8   LHLSPMVLATTTTTKPSLLNQSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTI 67

Query: 63  SKLIEFSA-VSRFGSISYAISLF-NSIEEPNLFIWNSMIRGLSMSLSPVLALVFFVRMIY 122
           +KL+  S  +    S+S+A  +F NS      F++NS+IRG + S     A++ F+RM+ 
Sbjct: 68  TKLVARSCELGTRESLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMN 127

Query: 123 SGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHLLKLGFVSDVFIHTSLINMYAQSGEMNN 182
           SG+ P+ YTFPF L +CAK  +   G QIH  ++K+G+  D+F+  SL++ YA+ GE+++
Sbjct: 128 SGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDS 187

Query: 183 AQLVFDQSKFRDAISFTALIAGYALWGY-------------------------------- 242
           A+ VFD+   R+ +S+T++I GYA   +                                
Sbjct: 188 ARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACA 247

Query: 243 ---------------------------------------MDRSRKLFDEMPVRDVVSWNA 302
                                                  +D +++LFDE    ++   NA
Sbjct: 248 KLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNA 307

Query: 303 MIAGYAQTGRSKEALLLFEEMRKANVPPNESTIVSVLSACAQSNALDLGNSMRSWIEERG 362
           M + Y + G ++EAL +F  M  + V P+  +++S +S+C+Q   +  G S   ++   G
Sbjct: 308 MASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNG 367

Query: 363 LRSNLKLVNALIDMYSKC-------------------------------GDLRTARELFD 422
             S   + NALIDMY KC                               G++  A E F+
Sbjct: 368 FESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFE 427

Query: 423 DMPERDVISWNVMIGGYTHMCSYKEALALFREMLA-SGVEPTDITFLNILPSCAHLGAID 482
            MPE++++SWN +I G      ++EA+ +F  M +  GV    +T ++I  +C HLGA+D
Sbjct: 428 TMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALD 487

Query: 483 LGKWIHAYINKNFNSASASLWTSLIDLYAKCGDIEAARQVFHGMNIKTLASWNAMICGLA 542
           L KWI+ YI KN       L T+L+D++++CGD E+A  +F+ +  + +++W A I  +A
Sbjct: 488 LAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMA 547

Query: 543 MHGQADEAFELFSKMSSDGIEPNEITFVGILSACKHAGLVDLGRQFFSSMVQDYKISPKS 602
           M G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G++ F SM++ + +SP+ 
Sbjct: 548 MAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPED 607

Query: 603 QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGELVAERLFK 662
            HYGCM+DLLGRAGL EEA  LI++M M+P+  IW SLL ACR  G VE+    AE++  
Sbjct: 608 VHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQV 667

Query: 663 LEPDNPGAYVLLSNIYAGAGKWDDVARIRTRLNDRGMKKVPGCTTIEVDNVVHEFLVGDK 722
           L P+  G+YVLLSN+YA AG+W+D+A++R  + ++G++K PG ++I++    HEF  GD+
Sbjct: 668 LAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDE 727

Query: 723 VHPQSEDIYKMLEEVDRQLKVFGFVADTSEVLYDMDEEWKEGALSHHSEKLAIAFGLIST 728
            HP+  +I  ML+EV ++    G V D S VL D+DE+ K   LS HSEKLA+A+GLIS+
Sbjct: 728 SHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISS 787

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004150015.1	0.0e+00	93.87	pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sa...
KAA0046752.1	0.0e+00	93.46	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK29694...
XP_023515625.1	0.0e+00	92.75	pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...
XP_022987625.1	0.0e+00	92.34	pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...
XP_022961045.1	0.0e+00	92.48	pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...

Match Name	E-value	Identity	Description
Q9LN01	2.1e-286	64.15	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380	6.0e-172	40.90	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LUJ2	3.0e-163	36.65	Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9LTV8	8.6e-163	41.13	Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q3E6Q1	1.3e-158	38.14	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
A0A0A0LU28	0.0e+00	93.87	DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G3068...
A0A5A7TTJ1	0.0e+00	93.46	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1JHE6	0.0e+00	92.34	pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucurbit...
A0A6J1HAU9	0.0e+00	92.48	pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucurbit...
A0A1S3CMX0	0.0e+00	92.95	pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucumis ...

Match Name	E-value	Identity	Description
AT1G08070.1	1.5e-287	64.15	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1	4.2e-173	40.90	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.2	2.1e-164	36.65	INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT3G12770.1	6.1e-164	41.13	mitochondrial editing factor 22
AT3G22690.1	4.0e-163	36.57	CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 166..271; e-value: 3.5E-27; score: 96.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 27..149; e-value: 3.2E-6; score: 28.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 272..393; e-value: 9.6E-25; score: 89.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 395..613; e-value: 1.3E-41; score: 145.0
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 384..559
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 160..584
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 594..718; e-value: 1.6E-39; score: 134.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 424..457; e-value: 1.1E-9; score: 35.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 322..355; e-value: 3.8E-8; score: 31.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 190..220; e-value: 7.6E-4; score: 17.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 221..255; e-value: 3.9E-9; score: 34.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 496..520; e-value: 7.9E-4; score: 17.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 294..321; e-value: 4.3E-5; score: 21.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 393..417; e-value: 1.8E-4; score: 19.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 319..366; e-value: 8.2E-10; score: 38.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 218..266; e-value: 3.2E-14; score: 52.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 86..135; e-value: 1.8E-7; score: 31.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 423..467; e-value: 5.2E-12; score: 45.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 161..183; e-value: 0.25; score: 11.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 393..417; e-value: 2.3E-4; score: 21.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 496..521; e-value: 4.7E-4; score: 20.2
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 421..455; score: 12.495939
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 219..253; score: 13.723605
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 87..121; score: 9.097937
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 390..420; score: 8.681407
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 320..354; score: 12.386327
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 289..319; score: 9.470621
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 306..722
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 28..328
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 306..722
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 28..328
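
The coord ranges in the InterPro annotations above can be turned into domain lengths and total residue coverage with a few lines of Python. This is a minimal sketch that assumes the ranges are 1-based, inclusive positions on the predicted protein (the page does not state this explicitly); `parse_coords`, `span_length`, and `merged_coverage` are hypothetical helper names.

```python
import re

# Assumption: "coord: start..end" gives 1-based, inclusive protein positions.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in order."""
    return [(int(a), int(b)) for a, b in COORD_RE.findall(text)]


def span_length(start, end):
    """Length of an inclusive range."""
    return end - start + 1


def merged_coverage(ranges):
    """Count residues covered by at least one range, counting overlaps once."""
    covered = 0
    current_end = 0  # last position already counted
    for start, end in sorted(ranges):
        if end <= current_end:
            continue  # fully inside what has already been counted
        start = max(start, current_end + 1)
        covered += end - start + 1
        current_end = end
    return covered


if __name__ == "__main__":
    dyw = parse_coords("coord: 594..718")
    print(span_length(*dyw[0]))
    panther = parse_coords("coord: 306..722 coord: 28..328")
    print(merged_coverage(panther))
```

Overlapping hits, such as the two PANTHER ranges, are counted once by `merged_coverage`, so the result is the size of their union rather than the sum of their lengths.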

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CaUC06G117920.1	CaUC06G117920.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding