CsaV3_7G002540 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_7G002540
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr7 : 1989487 .. 1993211 (+)
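
The location field can be read programmatically. The sketch below is a minimal, hypothetical parser for a string of the form `chr7 : 1989487 .. 1993211 (+)`. It assumes the coordinates are 1-based and inclusive, which is a common convention for genome browsers but is not stated on this page. The function names `parse_location` and `span_length` are illustrative only.

```python
import re


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr7 : 1989487 .. 1993211 (+)")
print(seqid, start, end, strand, span_length(start, end))
# prints: chr7 1989487 1993211 + 3725
```

Under the inclusive-coordinate assumption the feature spans 3,725 bp on the forward strand of chr7.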
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGAAGGAAGTCATGGGAGATGGTCTCAAACCCAACGAGTATACATTTGGTAGTTTAATATCTGCTACTTGTTCTTTGGCTAATTCTGGATTGGTTTTGCTTGAACAGCTGCTGACCAGGGTGGAAAAATCTGGTTTCTTGCATGATTTGTATGTAGGTAGTGCTTTAGTTAGTGGTTTTGCAAAGGCTGGGTCAATTGGTTATGCCAAAAATATTTTTCAGAAGATGAGTTATAGAAATGTAGTATCCTTGAACGGTTTGATAATTGGACTGGTAAGACAGAAAAGAGGGGAAGAGGCAGTTGAACTTTTCATGGAAATGAAGGATTCTGTTGAACTAAACCCTAATTCTTATATGATCATTTTGACTGCTTTTCCCGAGTTCCATGTTCTGGAAAATGGAAAACGGAAAGGTAGTGAGGTTCATGCGTTCCTCATCCGATCAGGCTTACTCAACGCCCAGATTGCAATAGGGAATGGTCTTATAAATATGTATGCTAAATGTGGAGCAATCAATGATGCTTGTGTGGTTTTTAGGCTCATGGATAATAAGGATTCAGTTACATGGAACTCCATGATCACCGGTCTTGACCAAAATAAACAGTTTTTAGAAGCGGTTAAAACTTTTCAAGAAATGAGAAGAACAGAATTATATCCTTCAAATTTCACAATGATTAGTGCTTTAAGTTCCTGTGCAAGCTTAGGGTGGATCTCTGTTGGAGAACAATTACATTGTGAAGGACTTAAACTGGGGCTTGATTTGGATGTTTCTGTTTCAAATGCTCTTTTAGCATTGTATGGTGAGTGTGGGTATGTAAAGGAATGCCAGAAAGCTTTTTCTTTGATGCTCGATTACGATCATGTTTCATGGAACTCTTTGATTGGCGCTCTAGCAGATTCAGAACCATCAATGCTTGAAGCTGTGGAAAGTTTCCTTGTTATGATGCGTGCTGGTTGGGATCCTAATAGAGTGACCTTCATTACCATACTTGCAGCAGTGTCTTCTCTTTCACTTCATGAACTAGGCAAGCAAATTCATGCTTTAGTCTTAAAGCGTAATGTTGCAGCTGACACTGCTATAGAGAATGCACTTTTGGCTTGCTATGGGAAGTGTGGAGATATGGGTTACTGTGAGAATATCTTTTCGAGAATGTCTGATAGACAAGACGAAGTGAGTTGGAATTCTATGATTTCGGGTTATATACATAACGAGCTCTTGCCCAAGGCCATGGACATGGTCTGGTTTATGATGCAAAAAGGCCAGAGATTGGATGGTTTCACCTTTGCAACTGTGCTTAGTGCTTGTGCAACCGTCGCAACATTAGAGCGTGGCATGGAAGTTCATGGATGCAGTGTAAGAGCTTGTTTGGAATCTGATATTGTTATTGGGAGCGCACTTGTGGACATGTATGCCAAATGTGGAAGGATAGACTATGCTTCAAGATTCTTTGAAATGATGCCAGCCAGAAACTTGTATTCTTGGAACTCCATGATTTCAGGGTATGCGCGCCATGGACACGGAACAAAGTCTTTGGATCTTTTCGCCCAAATGAAGTTACAAGGTCCACTACCAGATCATGTAACTTTTGTTGGAGTTCTGTCAGCATGTAGTCACGCAGGTTTAGTCAATGAAGGGTTCAGTCATTTTGATTCAATGAGTGAAATATACGGATTAGCTCCTCGCATGGAACACTTTTCATGTATGGTAGATCTTCTTGGTCGTGTCGGGGAGCTAAACAAAATGGAGGATTTTCTCAATCAGATGCCAGTAAAGCCTAATGTTCTTATATGGAGGACTGTTTTAGGGGCCTGTTGCAGAGCCAACGGTCGAAACACAGCACTTGGGAGGAGAGCAGCTGAAATGCTGCTGGAAATGGAACCAACAAATGCAGTGAACTACATTCTTCTCTCAAATATGTATGCTTCTGGCGGAAAGTGGGATGACGTAGCGAAAACGAGGGTGGCAATGAGGAAAGCATTCGTGAAGAAGGAAGCTGG
ATGCAGTTGGGTGACAATGAAGGATGGTGTTCATGTGTTTGTTGCAGGAGACAAATCACATCCCGAAAAGGACTTAATATATGAAAAACTGAAGGAACTAAATGGGAAAATGAGGCTAGCAGGGTACATACCAGAGACGAGATTCGCACTCTACGACCTTGAAGGGGAGAGTAAGGAGGAGTTATTGAGCTATCATAGTGAGAAAATTGCCGTTGCTTTTGTTCTTACTCGTCCATCAAAAATGCCGATTAGAATATTGAAGAACCTTAGAGTTTGTGGGGATTGCCACTCTGCTTTCAAATATATTTCACAGATCGTTGAAAGGCAAATAGTTTTGAGAGATTCAAATAGATTTCACCACTTTGAAAATGGCAAATGTTCATGTGGAGATTTTTGGTAGAACAGATTCTTTCTTTTGAAGATATTTATTGATATTCAAAATGTGTTATTAATATTATTGCCCACACAAATTAGTTCTCTTTCTTTCATTTTTCTTTAATCCATCACCATTTTTTCTTCGGTCAGAAGTTAAACTTGATAAAGTCATCTAGTCCCTTGTTTTTTAGAATCGTTGTAACGAATATGAATTCAACTTCAAGACAAATGCACAATATCTCTTTCTTATGCGGCCTCAGTCGCAACACAATTTAGAAAGGTCCGCACAATATCTCTTTCTTATGCGGCCTCAGTCGCAACACAATTTAGAAAGGTCCGCACAATATCTCTTTCTTATGCGGCCTCAGTCGCAACACAATTTATAAAGGTCCAAAAACAAAGCATAGGAAGACGCAAAGAATATTGCGTGGAATAAAATACTCCTTGTATATAGTTCAAAGAAGATTTCGAAGGTCCAAATGATCTGGGGGACTAATCTTTATTCATATTAGAAGTTATTCACTCCCACATAAAGAAATATGAAAAATGGGTAAATATTTTCAAGGTTTCAACATCCCATGTAGATGTGGATGCAACACTATGTCGAGACCCATTAATCGACATGACAATATAGATCAACTGTTTGTTTTAGACTTTTTTTACGTTAACTAAAAGTTTAGTCCTTAAACTTTCAAAATTTTGAGTTTTTTCTATTTGATCTCGAATTTTAAAAAGTATCTTATAGGCCACTAACTTTCAAACTTGTGTTTAATAGATATTTAACATATTTAATTATCACTATCTATCACATACAAACTTTGATAGTCTACTATTATCTATCACTAATAGACACTAATAGTAGTTTATTAAGTTTTACTACTAATGGGGAGCGACATTATGCAATTTTTACAGGTATTTGAAATATTCACGTGGCTTCAAATAATTACCCTTAATTTAAGGGATTTAAATCAAATTTTATGTTTAATGCAAGTTTTGTGTCTAATATGTTTTAAGTTATTTAATGTGTAAATAATGGATTGGACACAAAAGTAAACATTAAGCACTTTTGCTTAATTGTATTAAAGAGAAAAAAAAAATTCACAGATCTATTAAAAACAAAATTAAAAACTTACCCGACTTATTGGATAATTTTTTTTAATTAAGAGACCAAATACACTAAACATAAAGATACAAGTCTAATTTTCATTTAGCCTCTAAGAACTTGTTAGCTAATTGAGTTCAAAAGGTAGCTTTGAGCCTTTCACTATATATTATATTGTGCGATACGTTATATTAATTGAACCTCATTTTGTAGGATTAGGATTAGGAAGACATCTTAACATGATTTAA

mRNA sequence

ATGCAGAAGGAAGTCATGGGAGATGGTCTCAAACCCAACGAGTATACATTTGGTAGTTTAATATCTGCTACTTGTTCTTTGGCTAATTCTGGATTGGTTTTGCTTGAACAGCTGCTGACCAGGGTGGAAAAATCTGGTTTCTTGCATGATTTGTATGTAGGTAGTGCTTTAGTTAGTGGTTTTGCAAAGGCTGGGTCAATTGGTTATGCCAAAAATATTTTTCAGAAGATGAGTTATAGAAATGTAGTATCCTTGAACGGTTTGATAATTGGACTGGTAAGACAGAAAAGAGGGGAAGAGGCAGTTGAACTTTTCATGGAAATGAAGGATTCTGTTGAACTAAACCCTAATTCTTATATGATCATTTTGACTGCTTTTCCCGAGTTCCATGTTCTGGAAAATGGAAAACGGAAAGGTAGTGAGGTTCATGCGTTCCTCATCCGATCAGGCTTACTCAACGCCCAGATTGCAATAGGGAATGGTCTTATAAATATGTATGCTAAATGTGGAGCAATCAATGATGCTTGTGTGGTTTTTAGGCTCATGGATAATAAGGATTCAGTTACATGGAACTCCATGATCACCGGTCTTGACCAAAATAAACAGTTTTTAGAAGCGGTTAAAACTTTTCAAGAAATGAGAAGAACAGAATTATATCCTTCAAATTTCACAATGATTAGTGCTTTAAGTTCCTGTGCAAGCTTAGGGTGGATCTCTGTTGGAGAACAATTACATTGTGAAGGACTTAAACTGGGGCTTGATTTGGATGTTTCTGTTTCAAATGCTCTTTTAGCATTGTATGGTGAGTGTGGGTATGTAAAGGAATGCCAGAAAGCTTTTTCTTTGATGCTCGATTACGATCATGTTTCATGGAACTCTTTGATTGGCGCTCTAGCAGATTCAGAACCATCAATGCTTGAAGCTGTGGAAAGTTTCCTTGTTATGATGCGTGCTGGTTGGGATCCTAATAGAGTGACCTTCATTACCATACTTGCAGCAGTGTCTTCTCTTTCACTTCATGAACTAGGCAAGCAAATTCATGCTTTAGTCTTAAAGCGTAATGTTGCAGCTGACACTGCTATAGAGAATGCACTTTTGGCTTGCTATGGGAAGTGTGGAGATATGGGTTACTGTGAGAATATCTTTTCGAGAATGTCTGATAGACAAGACGAAGTGAGTTGGAATTCTATGATTTCGGGATTAGGATTAGGAAGACATCTTAACATGATTTAA

Coding sequence (CDS)

ATGCAGAAGGAAGTCATGGGAGATGGTCTCAAACCCAACGAGTATACATTTGGTAGTTTAATATCTGCTACTTGTTCTTTGGCTAATTCTGGATTGGTTTTGCTTGAACAGCTGCTGACCAGGGTGGAAAAATCTGGTTTCTTGCATGATTTGTATGTAGGTAGTGCTTTAGTTAGTGGTTTTGCAAAGGCTGGGTCAATTGGTTATGCCAAAAATATTTTTCAGAAGATGAGTTATAGAAATGTAGTATCCTTGAACGGTTTGATAATTGGACTGGTAAGACAGAAAAGAGGGGAAGAGGCAGTTGAACTTTTCATGGAAATGAAGGATTCTGTTGAACTAAACCCTAATTCTTATATGATCATTTTGACTGCTTTTCCCGAGTTCCATGTTCTGGAAAATGGAAAACGGAAAGGTAGTGAGGTTCATGCGTTCCTCATCCGATCAGGCTTACTCAACGCCCAGATTGCAATAGGGAATGGTCTTATAAATATGTATGCTAAATGTGGAGCAATCAATGATGCTTGTGTGGTTTTTAGGCTCATGGATAATAAGGATTCAGTTACATGGAACTCCATGATCACCGGTCTTGACCAAAATAAACAGTTTTTAGAAGCGGTTAAAACTTTTCAAGAAATGAGAAGAACAGAATTATATCCTTCAAATTTCACAATGATTAGTGCTTTAAGTTCCTGTGCAAGCTTAGGGTGGATCTCTGTTGGAGAACAATTACATTGTGAAGGACTTAAACTGGGGCTTGATTTGGATGTTTCTGTTTCAAATGCTCTTTTAGCATTGTATGGTGAGTGTGGGTATGTAAAGGAATGCCAGAAAGCTTTTTCTTTGATGCTCGATTACGATCATGTTTCATGGAACTCTTTGATTGGCGCTCTAGCAGATTCAGAACCATCAATGCTTGAAGCTGTGGAAAGTTTCCTTGTTATGATGCGTGCTGGTTGGGATCCTAATAGAGTGACCTTCATTACCATACTTGCAGCAGTGTCTTCTCTTTCACTTCATGAACTAGGCAAGCAAATTCATGCTTTAGTCTTAAAGCGTAATGTTGCAGCTGACACTGCTATAGAGAATGCACTTTTGGCTTGCTATGGGAAGTGTGGAGATATGGGTTACTGTGAGAATATCTTTTCGAGAATGTCTGATAGACAAGACGAAGTGAGTTGGAATTCTATGATTTCGGGATTAGGATTAGGAAGACATCTTAACATGATTTAA

Protein sequence

MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYMIILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISGLGLGRHLNMI
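
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code and reading frame 0, and it only translates the first and last few codons of the CDS rather than the whole sequence. The helper name `translate` is hypothetical and not part of any library.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 17 codons and last 13 codons of the CDS listed above.
cds_head = "ATGCAGAAGGAAGTCATGGGAGATGGTCTCAAACCCAACGAGTATACATTT"
cds_tail = "TCGGGATTAGGATTAGGAAGACATCTTAACATGATTTAA"

print(translate(cds_head))  # prints: MQKEVMGDGLKPNEYTF
print(translate(cds_tail))  # prints: SGLGLGRHLNMI
```

Both fragments agree with the start and end of the protein sequence shown above, and the CDS ends in the stop codon TAA.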
BLAST of CsaV3_7G002540 vs. NCBI nr
Match: XP_004144619.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At5g09950 [Cucumis sativus] >KGN43412.1 hypothetical protein Csa_7G031730 [Cucumis sativus])

HSP 1 Score: 794.3 bits (2050), Expect = 2.0e-226
Identity = 400/400 (100.00%), Positives = 400/400 (100.00%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG
Sbjct: 269 MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 328

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM
Sbjct: 329 FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 388

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR
Sbjct: 389 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 448

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV
Sbjct: 449 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 508

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD
Sbjct: 509 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 568

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA
Sbjct: 569 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 628

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG
Sbjct: 629 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 668

BLAST of CsaV3_7G002540 vs. NCBI nr
Match: XP_008462071.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At5g09950 [Cucumis melo])

HSP 1 Score: 753.1 bits (1943), Expect = 5.1e-214
Identity = 382/400 (95.50%), Positives = 387/400 (96.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQKEVM DGLKPNEYTFGSLISATCSL NSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG
Sbjct: 269 MQKEVMRDGLKPNEYTFGSLISATCSLPNSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 328

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAKAGSI YAKNIFQKMSYRNVVSLNGLIIGLVRQ RGEEAVELFMEMKDSVELNPNSYM
Sbjct: 329 FAKAGSINYAKNIFQKMSYRNVVSLNGLIIGLVRQNRGEEAVELFMEMKDSVELNPNSYM 388

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           IILTAFPEF+VLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAK GAINDACVVFR
Sbjct: 389 IILTAFPEFYVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKFGAINDACVVFR 448

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
            MD KDSVTWNSMI+GLDQNKQFLEAVKTFQEMRRTEL+PSNFTMISALSSCASLGWISV
Sbjct: 449 FMDTKDSVTWNSMISGLDQNKQFLEAVKTFQEMRRTELFPSNFTMISALSSCASLGWISV 508

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYD VSWNSLIGALAD
Sbjct: 509 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDQVSWNSLIGALAD 568

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SEPSMLEAVESF+VMMRAGW PNRVTFI+ILAAVSSLSLHELGKQIHALVLK NVAADTA
Sbjct: 569 SEPSMLEAVESFIVMMRAGWHPNRVTFISILAAVSSLSLHELGKQIHALVLKHNVAADTA 628

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDM  CENIFSRMSDRQDE SWNSMISG
Sbjct: 629 IENALLACYGKCGDMVNCENIFSRMSDRQDEASWNSMISG 668

BLAST of CsaV3_7G002540 vs. NCBI nr
Match: XP_022136280.1 (putative pentatricopeptide repeat-containing protein At5g09950 [Momordica charantia])

HSP 1 Score: 686.8 bits (1771), Expect = 4.5e-194
Identity = 345/400 (86.25%), Positives = 372/400 (93.00%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           +Q+EVMGDGLKPNEYTFGSLISATCSL +SGL+LLEQ+L+RVEKSGF HDLYVGSALVSG
Sbjct: 276 VQQEVMGDGLKPNEYTFGSLISATCSLVDSGLILLEQILSRVEKSGFSHDLYVGSALVSG 335

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAK GSI YAK+IFQ+MSYRN VS+NGLIIGLVRQ RGEEAVELFMEMKDSVELN +SY+
Sbjct: 336 FAKFGSINYAKDIFQQMSYRNAVSMNGLIIGLVRQNRGEEAVELFMEMKDSVELNLDSYV 395

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           IILTAFPEF+VLENGKR GSEVHA+LIR+GLL A+IAIGNGLINMYAKCGAI DAC VFR
Sbjct: 396 IILTAFPEFYVLENGKRMGSEVHAYLIRTGLLXAKIAIGNGLINMYAKCGAIGDACTVFR 455

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LM++KDSVTWNSMITGLDQN+ FL+AV+TFQEMRRT L+PSNFTMISALSS ASLGWI +
Sbjct: 456 LMNDKDSVTWNSMITGLDQNEHFLDAVRTFQEMRRTGLFPSNFTMISALSSSASLGWIMI 515

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALL+LYGE GYVKECQKAFSLM +YD VSWNSLIGALAD
Sbjct: 516 GEQLHCEGLKLGLDLDVSVSNALLSLYGEAGYVKECQKAFSLMPEYDQVSWNSLIGALAD 575

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE SMLEAV++FLVMMRAGW PNRVTFI+ILAAVSSLSLHEL  QIH L LK NVAADTA
Sbjct: 576 SESSMLEAVDNFLVMMRAGWRPNRVTFISILAAVSSLSLHELSXQIHVLXLKYNVAADTA 635

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDM  CENIF RMSDRQDEVSWNSMISG
Sbjct: 636 IENALLACYGKCGDMSDCENIFLRMSDRQDEVSWNSMISG 675

BLAST of CsaV3_7G002540 vs. NCBI nr
Match: XP_022928551.1 (putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moschata] >XP_022928552.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moschata] >XP_022928554.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moschata] >XP_022928555.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moschata] >XP_022928556.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moschata])

HSP 1 Score: 677.9 bits (1748), Expect = 2.1e-191
Identity = 342/400 (85.50%), Positives = 371/400 (92.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           +QKE+MGD L+PNEYTFGSLISAT S  +SGL LL+Q+L+ VEKSGF HDLYVGSALVSG
Sbjct: 271 VQKEIMGDRLRPNEYTFGSLISATISFVDSGLTLLKQMLSMVEKSGFSHDLYVGSALVSG 330

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAK GSI YAK+IFQ+MSYRN VS+NGLIIGLVRQ RGEEAVELF EMKDSVE+N +SY+
Sbjct: 331 FAKFGSINYAKDIFQRMSYRNAVSMNGLIIGLVRQSRGEEAVELFAEMKDSVEINLDSYV 390

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           I+LTAFPEF VLE+GKRKGSEVHA+LIR+GLLNA+IAIGNGLINMYAKCGAINDA  VFR
Sbjct: 391 ILLTAFPEFCVLEDGKRKGSEVHAYLIRTGLLNAKIAIGNGLINMYAKCGAINDASTVFR 450

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LMDNKDSVTWNSMITGLDQN+ FL+AV+TFQEMRRT L+PSNFTMISALSS ASLGWI V
Sbjct: 451 LMDNKDSVTWNSMITGLDQNEHFLDAVETFQEMRRTVLFPSNFTMISALSSSASLGWIRV 510

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGE GYV+ECQKAFSLML YD VSWNSLIGALAD
Sbjct: 511 GEQLHCEGLKLGLDLDVSVSNALLALYGEAGYVEECQKAFSLMLKYDQVSWNSLIGALAD 570

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE S+LEAVE+FLVMMR+GW PNRVTFI+ILAAVSSLSLH LGKQIHALVLK NVAADTA
Sbjct: 571 SESSLLEAVENFLVMMRSGWRPNRVTFISILAAVSSLSLHALGKQIHALVLKHNVAADTA 630

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDM  CENIFSRMS+R+DEVSWNSMISG
Sbjct: 631 IENALLACYGKCGDMRDCENIFSRMSNRRDEVSWNSMISG 670

BLAST of CsaV3_7G002540 vs. NCBI nr
Match: XP_023529590.1 (putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita pepo subsp. pepo] >XP_023529591.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita pepo subsp. pepo] >XP_023529592.1 putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 675.2 bits (1741), Expect = 1.4e-190
Identity = 341/400 (85.25%), Positives = 370/400 (92.50%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           +QKEVMGD L+PNEYTFGSLISAT S  +SGL LL+Q+L+ VEKSGF HDLYVGSALVSG
Sbjct: 271 VQKEVMGDRLRPNEYTFGSLISATISFVDSGLTLLKQMLSMVEKSGFSHDLYVGSALVSG 330

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAK G I YAK+IFQ+MSYRN VS+NGLIIGLVRQ RGEEAVELF EMKDSVE+N +SY+
Sbjct: 331 FAKFGLISYAKDIFQRMSYRNAVSMNGLIIGLVRQSRGEEAVELFAEMKDSVEINLDSYV 390

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           I+LTAFPEF VLE+GKRKGSEVHA+LIR+GLLNA+IAIGNGLINMYAKCGAINDA  VFR
Sbjct: 391 ILLTAFPEFCVLEDGKRKGSEVHAYLIRTGLLNAKIAIGNGLINMYAKCGAINDASTVFR 450

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LMDNKDSVTWNSMITGLDQN+ FL+AV+TFQ+MRRT L+PSNFTMISALSS ASLGWI V
Sbjct: 451 LMDNKDSVTWNSMITGLDQNEHFLDAVETFQDMRRTGLFPSNFTMISALSSSASLGWIRV 510

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGE GYV+ECQKAFSLML+YD VSWNSLIGALAD
Sbjct: 511 GEQLHCEGLKLGLDLDVSVSNALLALYGETGYVEECQKAFSLMLEYDQVSWNSLIGALAD 570

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE S+LEAVE+FLVMMRAGW PNRVTFI+ILAAVSSLSLH LGKQIH LVLK NVAADTA
Sbjct: 571 SESSLLEAVENFLVMMRAGWRPNRVTFISILAAVSSLSLHALGKQIHGLVLKHNVAADTA 630

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDM  CENIFSRMS+R+DEVSWNSMISG
Sbjct: 631 IENALLACYGKCGDMRDCENIFSRMSNRRDEVSWNSMISG 670

BLAST of CsaV3_7G002540 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 478.0 bits (1229), Expect = 5.8e-135
Identity = 232/394 (58.88%), Positives = 304/394 (77.16%), Query Frame = 0

Query: 8   DGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSI 67
           DG +P EYTFGSL++  CSL    + LLEQ++  ++KSG L DL+VGS LVS FAK+GS+
Sbjct: 200 DGSRPTEYTFGSLVTTACSLTEPDVRLLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSL 259

Query: 68  GYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYMIILTAFP 127
            YA+ +F +M  RN V+LNGL++GLVRQK GEEA +LFM+M   ++++P SY+I+L++FP
Sbjct: 260 SYARKVFNQMETRNAVTLNGLMVGLVRQKWGEEATKLFMDMNSMIDVSPESYVILLSSFP 319

Query: 128 EFHVLEN-GKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKD 187
           E+ + E  G +KG EVH  +I +GL++  + IGNGL+NMYAKCG+I DA  VF  M +KD
Sbjct: 320 EYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKD 379

Query: 188 SVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHC 247
           SV+WNSMITGLDQN  F+EAV+ ++ MRR ++ P +FT+IS+LSSCASL W  +G+Q+H 
Sbjct: 380 SVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHG 439

Query: 248 EGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSML 307
           E LKLG+DL+VSVSNAL+ LY E GY+ EC+K FS M ++D VSWNS+IGALA SE S+ 
Sbjct: 440 ESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLP 499

Query: 308 EAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALL 367
           EAV  FL   RAG   NR+TF ++L+AVSSLS  ELGKQIH L LK N+A +   ENAL+
Sbjct: 500 EAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALI 559

Query: 368 ACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           ACYGKCG+M  CE IFSRM++R+D V+WNSMISG
Sbjct: 560 ACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISG 593

BLAST of CsaV3_7G002540 vs. TAIR10
Match: AT2G33680.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 200.3 bits (508), Expect = 2.3e-51
Identity = 123/402 (30.60%), Positives = 204/402 (50.75%), Query Frame = 0

Query: 3   KEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFA 62
           +E+    + PN YT   +  A  SL +S   +  Q    V K     D+YV ++LV  + 
Sbjct: 107 REMRAQDILPNAYTLAGIFKAESSLQSS--TVGRQAHALVVKMSSFGDIYVDTSLVGMYC 166

Query: 63  KAGSIGYAKNIFQKMSYRNVVSLN---GLIIGLVRQKRGEEAVELFMEMKDSVELNPNSY 122
           KAG +     +F  M  RN  + +                    LF+  K+    +   +
Sbjct: 167 KAGLVEDGLKVFAYMPERNTYTWSTXXXXXXXXXXXXXXXXXXNLFLREKEEGSDSDYVF 226

Query: 123 MIILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVF 182
             +L++      +  G+    ++H   I++GLL   +A+ N L+ MY+KC ++N+AC +F
Sbjct: 227 TAVLSSLAATIYVGLGR----QIHCITIKNGLLGF-VALSNALVTMYSKCESLNEACKMF 286

Query: 183 RLMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWIS 242
               +++S+TW++M+TG  QN + LEAVK F  M    + PS +T++  L++C+ + ++ 
Sbjct: 287 DSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLE 346

Query: 243 VGEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALA 302
            G+QLH   LKLG +  +  + AL+ +Y + G + + +K F  + + D   W SLI    
Sbjct: 347 EGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYV 406

Query: 303 DSEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADT 362
            +  +  EA+  +  M  AG  PN  T  ++L A SSL+  ELGKQ+H   +K     + 
Sbjct: 407 QNSDNE-EALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEV 466

Query: 363 AIENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISGL 402
            I +AL   Y KCG +    N+  R +  +D VSWN+MISGL
Sbjct: 467 PIGSALSTMYSKCGSL-EDGNLVFRRTPNKDVVSWNAMISGL 499

BLAST of CsaV3_7G002540 vs. TAIR10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 197.6 bits (501), Expect = 1.5e-50
Identity = 124/393 (31.55%), Positives = 207/393 (52.67%), Query Frame = 0

Query: 10  LKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGY 69
           +K    T GS++SA   +AN  L L+  +     K G   ++YVGS+LVS ++K   +  
Sbjct: 323 VKSTRSTLGSVLSAIGIVANLDLGLV--VHAEAIKLGLASNIYVGSSLVSMYSKCEKMEA 382

Query: 70  AKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDS-VELNPNSYMIILTAFPE 129
           A  +F+ +  +N V  N +I G        + +ELFM+MK S   ++  ++  +L+    
Sbjct: 383 AAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAA 442

Query: 130 FHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSV 189
            H LE     GS+ H+ +I+  L    + +GN L++MYAKCGA+ DA  +F  M ++D+V
Sbjct: 443 SHDLE----MGSQFHSIIIKKKLAK-NLFVGNALVDMYAKCGALEDARQIFERMCDRDNV 502

Query: 190 TWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEG 249
           TWN++I    Q++   EA   F+ M    +      + S L +C  +  +  G+Q+HC  
Sbjct: 503 TWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACTHVHGLYQGKQVHCLS 562

Query: 250 LKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSMLEA 309
           +K GLD D+   ++L+ +Y +CG +K+ +K FS + ++  VS N+LI     S+ ++ EA
Sbjct: 563 VKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNALIAGY--SQNNLEEA 622

Query: 310 VESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAAD-TAIENALLA 369
           V  F  M+  G +P+ +TF TI+ A        LG Q H  + KR  +++   +  +LL 
Sbjct: 623 VVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGFSSEGEYLGISLLG 682

Query: 370 CYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
            Y     M     +FS +S  +  V W  M+SG
Sbjct: 683 MYMNSRGMTEACALFSELSSPKSIVLWTGMMSG 706

BLAST of CsaV3_7G002540 vs. TAIR10
Match: AT3G47840.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 194.9 bits (494), Expect = 9.8e-50
Identity = 120/383 (31.33%), Positives = 200/383 (52.22%), Query Frame = 0

Query: 19  SLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGYAKNIFQKMS 78
           S++   C   +S +   E L     K+  L  +YVGS+L+  + + G I  +  +F +M 
Sbjct: 112 SVVLKACG-QSSNIAYGESLHAYAVKTSLLSSVYVGSSLLDMYKRVGKIDKSCRVFSEMP 171

Query: 79  YRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPN-SYMIILTAFPEFHVLENGKR 138
           +RN V+   +I GLV   R +E +  F EM  S EL+   ++ I L A      ++ GK 
Sbjct: 172 FRNAVTWTAIITGLVHAGRYKEGLTYFSEMSRSEELSDTYTFAIALKACAGLRQVKYGK- 231

Query: 139 KGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSVTWNSMITGL 198
               +H  +I  G +   + + N L  MY +CG + D   +F  M  +D V+W S+I   
Sbjct: 232 ---AIHTHVIVRGFVTT-LCVANSLATMYTECGEMQDGLCLFENMSERDVVSWTSLIVAY 291

Query: 199 DQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEGLKLGLDLDV 258
            +  Q ++AV+TF +MR +++ P+  T  S  S+CASL  +  GEQLHC  L LGL+  +
Sbjct: 292 KRIGQEVKAVETFIKMRNSQVPPNEQTFASMFSACASLSRLVWGEQLHCNVLSLGLNDSL 351

Query: 259 SVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSMLEAVESFLVMMR 318
           SVSN+++ +Y  CG +      F  M   D +SW+++IG    +     E  + F  M +
Sbjct: 352 SVSNSMMKMYSTCGNLVSASVLFQGMRCRDIISWSTIIGGYCQAGFGE-EGFKYFSWMRQ 411

Query: 319 AGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALLACYGKCGDMGY 378
           +G  P      ++L+   ++++ E G+Q+HAL L   +  ++ + ++L+  Y KCG +  
Sbjct: 412 SGTKPTDFALASLLSVSGNMAVIEGGRQVHALALCFGLEQNSTVRSSLINMYSKCGSIKE 471

Query: 379 CENIFSRMSDRQDEVSWNSMISG 401
              IF   +DR D VS  +MI+G
Sbjct: 472 ASMIFGE-TDRDDIVSLTAMING 486

BLAST of CsaV3_7G002540 vs. TAIR10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 193.4 bits (490), Expect = 2.8e-49
Identity = 127/394 (32.23%), Positives = 211/394 (53.55%), Query Frame = 0

Query: 12  PNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGYAK 71
           PNEY   S I A   L   G  ++ QL + + KSGF  D+YVG+ L+  + K G+I YA+
Sbjct: 144 PNEYILSSFIQACSGLDGRGRWMVFQLQSFLVKSGFDRDVYVGTLLIDFYLKDGNIDYAR 203

Query: 72  NIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFME-MKDSVELNPNSYMI--ILTAFPE 131
            +F  +  ++ V+   +I G V+  R   +++LF + M+D+V   P+ Y++  +L+A   
Sbjct: 204 LVFDALPEKSTVTWTTMISGCVKMGRSYVSLQLFYQLMEDNVV--PDGYILSTVLSACSI 263

Query: 132 FHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSV 191
              LE GK    ++HA ++R G L    ++ N LI+ Y KCG +  A  +F  M NK+ +
Sbjct: 264 LPFLEGGK----QIHAHILRYG-LEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNII 323

Query: 192 TWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEG 251
           +W ++++G  QN    EA++ F  M +  L P  +   S L+SCASL  +  G Q+H   
Sbjct: 324 SWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGFGTQVHAYT 383

Query: 252 LKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALA--DSEPSML 311
           +K  L  D  V+N+L+ +Y +C  + + +K F +    D V +N++I   +   ++  + 
Sbjct: 384 IKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGYSRLGTQWELH 443

Query: 312 EAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALL 371
           EA+  F  M      P+ +TF+++L A +SL+   L KQIH L+ K  +  D    +AL+
Sbjct: 444 EALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALI 503

Query: 372 ACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
             Y  C  +     +F  M  + D V WNSM +G
Sbjct: 504 DVYSNCYCLKDSRLVFDEMKVK-DLVIWNSMFAG 529

BLAST of CsaV3_7G002540 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 1.0e-133
Identity = 232/394 (58.88%), Positives = 304/394 (77.16%), Query Frame = 0

Query: 8   DGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSI 67
           DG +P EYTFGSL++  CSL    + LLEQ++  ++KSG L DL+VGS LVS FAK+GS+
Sbjct: 200 DGSRPTEYTFGSLVTTACSLTEPDVRLLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSL 259

Query: 68  GYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYMIILTAFP 127
            YA+ +F +M  RN V+LNGL++GLVRQK GEEA +LFM+M   ++++P SY+I+L++FP
Sbjct: 260 SYARKVFNQMETRNAVTLNGLMVGLVRQKWGEEATKLFMDMNSMIDVSPESYVILLSSFP 319

Query: 128 EFHVLEN-GKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKD 187
           E+ + E  G +KG EVH  +I +GL++  + IGNGL+NMYAKCG+I DA  VF  M +KD
Sbjct: 320 EYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKD 379

Query: 188 SVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHC 247
           SV+WNSMITGLDQN  F+EAV+ ++ MRR ++ P +FT+IS+LSSCASL W  +G+Q+H 
Sbjct: 380 SVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHG 439

Query: 248 EGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSML 307
           E LKLG+DL+VSVSNAL+ LY E GY+ EC+K FS M ++D VSWNS+IGALA SE S+ 
Sbjct: 440 ESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLP 499

Query: 308 EAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALL 367
           EAV  FL   RAG   NR+TF ++L+AVSSLS  ELGKQIH L LK N+A +   ENAL+
Sbjct: 500 EAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALI 559

Query: 368 ACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           ACYGKCG+M  CE IFSRM++R+D V+WNSMISG
Sbjct: 560 ACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISG 593

BLAST of CsaV3_7G002540 vs. Swiss-Prot
Match: sp|P93005|PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 4.2e-50
Identity = 123/402 (30.60%), Positives = 204/402 (50.75%), Query Frame = 0

Query: 3   KEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFA 62
           +E+    + PN YT   +  A  SL +S   +  Q    V K     D+YV ++LV  + 
Sbjct: 107 REMRAQDILPNAYTLAGIFKAESSLQSS--TVGRQAHALVVKMSSFGDIYVDTSLVGMYC 166

Query: 63  KAGSIGYAKNIFQKMSYRNVVSLN---GLIIGLVRQKRGEEAVELFMEMKDSVELNPNSY 122
           KAG +     +F  M  RN  + +                    LF+  K+    +   +
Sbjct: 167 KAGLVEDGLKVFAYMPERNTYTWSTXXXXXXXXXXXXXXXXXXNLFLREKEEGSDSDYVF 226

Query: 123 MIILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVF 182
             +L++      +  G+    ++H   I++GLL   +A+ N L+ MY+KC ++N+AC +F
Sbjct: 227 TAVLSSLAATIYVGLGR----QIHCITIKNGLLGF-VALSNALVTMYSKCESLNEACKMF 286

Query: 183 RLMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWIS 242
               +++S+TW++M+TG  QN + LEAVK F  M    + PS +T++  L++C+ + ++ 
Sbjct: 287 DSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLE 346

Query: 243 VGEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALA 302
            G+QLH   LKLG +  +  + AL+ +Y + G + + +K F  + + D   W SLI    
Sbjct: 347 EGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYV 406

Query: 303 DSEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADT 362
            +  +  EA+  +  M  AG  PN  T  ++L A SSL+  ELGKQ+H   +K     + 
Sbjct: 407 QNSDNE-EALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEV 466

Query: 363 AIENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISGL 402
            I +AL   Y KCG +    N+  R +  +D VSWN+MISGL
Sbjct: 467 PIGSALSTMYSKCGSL-EDGNLVFRRTPNKDVVSWNAMISGL 499

BLAST of CsaV3_7G002540 vs. Swiss-Prot
Match: sp|Q9SS83|PP220_ARATH (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.7e-49
Identity = 124/393 (31.55%), Positives = 207/393 (52.67%), Query Frame = 0

Query: 10  LKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGY 69
           +K    T GS++SA   +AN  L L+  +     K G   ++YVGS+LVS ++K   +  
Sbjct: 323 VKSTRSTLGSVLSAIGIVANLDLGLV--VHAEAIKLGLASNIYVGSSLVSMYSKCEKMEA 382

Query: 70  AKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDS-VELNPNSYMIILTAFPE 129
           A  +F+ +  +N V  N +I G        + +ELFM+MK S   ++  ++  +L+    
Sbjct: 383 AAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAA 442

Query: 130 FHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSV 189
            H LE     GS+ H+ +I+  L    + +GN L++MYAKCGA+ DA  +F  M ++D+V
Sbjct: 443 SHDLE----MGSQFHSIIIKKKLAK-NLFVGNALVDMYAKCGALEDARQIFERMCDRDNV 502

Query: 190 TWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEG 249
           TWN++I    Q++   EA   F+ M    +      + S L +C  +  +  G+Q+HC  
Sbjct: 503 TWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACTHVHGLYQGKQVHCLS 562

Query: 250 LKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSMLEA 309
           +K GLD D+   ++L+ +Y +CG +K+ +K FS + ++  VS N+LI     S+ ++ EA
Sbjct: 563 VKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNALIAGY--SQNNLEEA 622

Query: 310 VESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAAD-TAIENALLA 369
           V  F  M+  G +P+ +TF TI+ A        LG Q H  + KR  +++   +  +LL 
Sbjct: 623 VVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGFSSEGEYLGISLLG 682

Query: 370 CYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
            Y     M     +FS +S  +  V W  M+SG
Sbjct: 683 MYMNSRGMTEACALFSELSSPKSIVLWTGMMSG 706

BLAST of CsaV3_7G002540 vs. Swiss-Prot
Match: sp|Q9STS9|PP268_ARATH (Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E43 PE=3 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 1.8e-48
Identity = 120/383 (31.33%), Positives = 200/383 (52.22%), Query Frame = 0

Query: 19  SLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGYAKNIFQKMS 78
           S++   C   +S +   E L     K+  L  +YVGS+L+  + + G I  +  +F +M 
Sbjct: 112 SVVLKACG-QSSNIAYGESLHAYAVKTSLLSSVYVGSSLLDMYKRVGKIDKSCRVFSEMP 171

Query: 79  YRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPN-SYMIILTAFPEFHVLENGKR 138
           +RN V+   +I GLV   R +E +  F EM  S EL+   ++ I L A      ++ GK 
Sbjct: 172 FRNAVTWTAIITGLVHAGRYKEGLTYFSEMSRSEELSDTYTFAIALKACAGLRQVKYGK- 231

Query: 139 KGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSVTWNSMITGL 198
               +H  +I  G +   + + N L  MY +CG + D   +F  M  +D V+W S+I   
Sbjct: 232 ---AIHTHVIVRGFVTT-LCVANSLATMYTECGEMQDGLCLFENMSERDVVSWTSLIVAY 291

Query: 199 DQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEGLKLGLDLDV 258
            +  Q ++AV+TF +MR +++ P+  T  S  S+CASL  +  GEQLHC  L LGL+  +
Sbjct: 292 KRIGQEVKAVETFIKMRNSQVPPNEQTFASMFSACASLSRLVWGEQLHCNVLSLGLNDSL 351

Query: 259 SVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALADSEPSMLEAVESFLVMMR 318
           SVSN+++ +Y  CG +      F  M   D +SW+++IG    +     E  + F  M +
Sbjct: 352 SVSNSMMKMYSTCGNLVSASVLFQGMRCRDIISWSTIIGGYCQAGFGE-EGFKYFSWMRQ 411

Query: 319 AGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALLACYGKCGDMGY 378
           +G  P      ++L+   ++++ E G+Q+HAL L   +  ++ + ++L+  Y KCG +  
Sbjct: 412 SGTKPTDFALASLLSVSGNMAVIEGGRQVHALALCFGLEQNSTVRSSLINMYSKCGSIKE 471

Query: 379 CENIFSRMSDRQDEVSWNSMISG 401
              IF   +DR D VS  +MI+G
Sbjct: 472 ASMIFGE-TDRDDIVSLTAMING 486
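
The identity and positives percentages in the HSP header above are ratios of matching columns to the alignment length (the denominator printed in the header). A minimal sketch of that arithmetic follows; the helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header of the sp|Q9STS9|PP268_ARATH hit above.
identity = percent(120, 383)
positives = percent(200, 383)
print(f"identity = {identity:.2f}%, positives = {positives:.2f}%")
```

The same helper applied to the other HSP headers on this page (for example 127/394 and 211/394) should reproduce the percentages printed next to them.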

BLAST of CsaV3_7G002540 vs. Swiss-Prot
Match: sp|Q9SVA5|PP357_ARATH (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 5.1e-48
Identity = 127/394 (32.23%), Positives = 211/394 (53.55%), Query Frame = 0

Query: 12  PNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSGFAKAGSIGYAK 71
           PNEY   S I A   L   G  ++ QL + + KSGF  D+YVG+ L+  + K G+I YA+
Sbjct: 144 PNEYILSSFIQACSGLDGRGRWMVFQLQSFLVKSGFDRDVYVGTLLIDFYLKDGNIDYAR 203

Query: 72  NIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFME-MKDSVELNPNSYMI--ILTAFPE 131
            +F  +  ++ V+   +I G V+  R   +++LF + M+D+V   P+ Y++  +L+A   
Sbjct: 204 LVFDALPEKSTVTWTTMISGCVKMGRSYVSLQLFYQLMEDNVV--PDGYILSTVLSACSI 263

Query: 132 FHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFRLMDNKDSV 191
              LE GK    ++HA ++R G L    ++ N LI+ Y KCG +  A  +F  M NK+ +
Sbjct: 264 LPFLEGGK----QIHAHILRYG-LEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNII 323

Query: 192 TWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISVGEQLHCEG 251
           +W ++++G  QN    EA++ F  M +  L P  +   S L+SCASL  +  G Q+H   
Sbjct: 324 SWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGFGTQVHAYT 383

Query: 252 LKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALA--DSEPSML 311
           +K  L  D  V+N+L+ +Y +C  + + +K F +    D V +N++I   +   ++  + 
Sbjct: 384 IKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGYSRLGTQWELH 443

Query: 312 EAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTAIENALL 371
           EA+  F  M      P+ +TF+++L A +SL+   L KQIH L+ K  +  D    +AL+
Sbjct: 444 EALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALI 503

Query: 372 ACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
             Y  C  +     +F  M  + D V WNSM +G
Sbjct: 504 DVYSNCYCLKDSRLVFDEMKVK-DLVIWNSMFAG 529

BLAST of CsaV3_7G002540 vs. TrEMBL
Match: tr|A0A0A0K552|A0A0A0K552_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G031730 PE=4 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 1.3e-226
Identity = 400/400 (100.00%), Positives = 400/400 (100.00%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG
Sbjct: 269 MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 328

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM
Sbjct: 329 FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 388

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR
Sbjct: 389 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 448

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV
Sbjct: 449 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 508

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD
Sbjct: 509 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 568

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA
Sbjct: 569 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 628

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG
Sbjct: 629 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 668

BLAST of CsaV3_7G002540 vs. TrEMBL
Match: tr|A0A1S3CHK4|A0A1S3CHK4_CUCME (putative pentatricopeptide repeat-containing protein At5g09950 OS=Cucumis melo OX=3656 GN=LOC103500513 PE=4 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 3.4e-214
Identity = 382/400 (95.50%), Positives = 387/400 (96.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQKEVM DGLKPNEYTFGSLISATCSL NSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG
Sbjct: 269 MQKEVMRDGLKPNEYTFGSLISATCSLPNSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 328

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FAKAGSI YAKNIFQKMSYRNVVSLNGLIIGLVRQ RGEEAVELFMEMKDSVELNPNSYM
Sbjct: 329 FAKAGSINYAKNIFQKMSYRNVVSLNGLIIGLVRQNRGEEAVELFMEMKDSVELNPNSYM 388

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           IILTAFPEF+VLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAK GAINDACVVFR
Sbjct: 389 IILTAFPEFYVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKFGAINDACVVFR 448

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
            MD KDSVTWNSMI+GLDQNKQFLEAVKTFQEMRRTEL+PSNFTMISALSSCASLGWISV
Sbjct: 449 FMDTKDSVTWNSMISGLDQNKQFLEAVKTFQEMRRTELFPSNFTMISALSSCASLGWISV 508

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYD VSWNSLIGALAD
Sbjct: 509 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDQVSWNSLIGALAD 568

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SEPSMLEAVESF+VMMRAGW PNRVTFI+ILAAVSSLSLHELGKQIHALVLK NVAADTA
Sbjct: 569 SEPSMLEAVESFIVMMRAGWHPNRVTFISILAAVSSLSLHELGKQIHALVLKHNVAADTA 628

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCGDM  CENIFSRMSDRQDE SWNSMISG
Sbjct: 629 IENALLACYGKCGDMVNCENIFSRMSDRQDEASWNSMISG 668

BLAST of CsaV3_7G002540 vs. TrEMBL
Match: tr|M5WQY7|M5WQY7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001014mg PE=4 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 2.6e-153
Identity = 271/400 (67.75%), Positives = 335/400 (83.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQK+     L+PNEYTFGSLI+A CSLA++GL LL+Q+LTRV KSG L DLYVGSALVSG
Sbjct: 136 MQKDGSAFSLQPNEYTFGSLITAACSLAHAGLSLLQQILTRVNKSGILQDLYVGSALVSG 195

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FA+ G I YA+ IF++MS RN +S+NGL++ LVRQKRG+EA E+FMEMK  V +N +S +
Sbjct: 196 FARFGLIDYARKIFEQMSERNAISMNGLMVALVRQKRGKEATEVFMEMKGLVGINLDSLV 255

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           ++L++F EF VLE GKRKG EVHA++I +GL+  ++AIGNGLINMYAKCGAI+DAC VFR
Sbjct: 256 VLLSSFAEFSVLEEGKRKGREVHAYVIGAGLIYRKVAIGNGLINMYAKCGAISDACSVFR 315

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
            M +KD ++WNS+I+GLDQN+ F +AV  F+EM+R+E  PSNFT+ISALSSCASLGWI +
Sbjct: 316 HMMDKDLISWNSLISGLDQNEFFEDAVMNFREMKRSEFMPSNFTLISALSSCASLGWIIL 375

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           G+Q+HCE LKLGLDLDVSVSNALLALY + G++ EC+  F LM DYD VSWNS+IGALA 
Sbjct: 376 GQQIHCEALKLGLDLDVSVSNALLALYSDTGHLSECRNVFFLMQDYDQVSWNSIIGALAG 435

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE S+LEAVE FL MM++GW+ NRVTF++ILAAVSSLSL +LG+QIHA+VLK N A D A
Sbjct: 436 SEASVLEAVEYFLDMMQSGWELNRVTFMSILAAVSSLSLPDLGQQIHAVVLKYNAAEDCA 495

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENAL+ CYGKCG +  CE IFSRMS+R+DE+SWNSMISG
Sbjct: 496 IENALITCYGKCGGIDDCEKIFSRMSERRDEISWNSMISG 535

BLAST of CsaV3_7G002540 vs. TrEMBL
Match: tr|A0A251Q2S2|A0A251Q2S2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G198300 PE=4 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 2.6e-153
Identity = 271/400 (67.75%), Positives = 335/400 (83.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQK+     L+PNEYTFGSLI+A CSLA++GL LL+Q+LTRV KSG L DLYVGSALVSG
Sbjct: 283 MQKDGSAFSLQPNEYTFGSLITAACSLAHAGLSLLQQILTRVNKSGILQDLYVGSALVSG 342

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FA+ G I YA+ IF++MS RN +S+NGL++ LVRQKRG+EA E+FMEMK  V +N +S +
Sbjct: 343 FARFGLIDYARKIFEQMSERNAISMNGLMVALVRQKRGKEATEVFMEMKGLVGINLDSLV 402

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           ++L++F EF VLE GKRKG EVHA++I +GL+  ++AIGNGLINMYAKCGAI+DAC VFR
Sbjct: 403 VLLSSFAEFSVLEEGKRKGREVHAYVIGAGLIYRKVAIGNGLINMYAKCGAISDACSVFR 462

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
            M +KD ++WNS+I+GLDQN+ F +AV  F+EM+R+E  PSNFT+ISALSSCASLGWI +
Sbjct: 463 HMMDKDLISWNSLISGLDQNEFFEDAVMNFREMKRSEFMPSNFTLISALSSCASLGWIIL 522

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           G+Q+HCE LKLGLDLDVSVSNALLALY + G++ EC+  F LM DYD VSWNS+IGALA 
Sbjct: 523 GQQIHCEALKLGLDLDVSVSNALLALYSDTGHLSECRNVFFLMQDYDQVSWNSIIGALAG 582

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE S+LEAVE FL MM++GW+ NRVTF++ILAAVSSLSL +LG+QIHA+VLK N A D A
Sbjct: 583 SEASVLEAVEYFLDMMQSGWELNRVTFMSILAAVSSLSLPDLGQQIHAVVLKYNAAEDCA 642

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENAL+ CYGKCG +  CE IFSRMS+R+DE+SWNSMISG
Sbjct: 643 IENALITCYGKCGGIDDCEKIFSRMSERRDEISWNSMISG 682

BLAST of CsaV3_7G002540 vs. TrEMBL
Match: tr|A0A061DL19|A0A061DL19_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_002398 PE=4 SV=1)

HSP 1 Score: 537.3 bits (1383), Expect = 2.9e-149
Identity = 271/400 (67.75%), Positives = 327/400 (81.75%), Query Frame = 0

Query: 1   MQKEVMGDGLKPNEYTFGSLISATCSLANSGLVLLEQLLTRVEKSGFLHDLYVGSALVSG 60
           MQKE +G   +PNEYTFGSLI+A CS  + GL LL+Q+L+R+ KSGFL DLYVGSALVSG
Sbjct: 278 MQKEGIGFSFEPNEYTFGSLITAACSSMDFGLCLLQQMLSRITKSGFLSDLYVGSALVSG 337

Query: 61  FAKAGSIGYAKNIFQKMSYRNVVSLNGLIIGLVRQKRGEEAVELFMEMKDSVELNPNSYM 120
           FA+ G   YA  IF +MS RN VS+NGL++GLVRQK GE+A E+FMEM + V++N +SY+
Sbjct: 338 FARLGLSNYAMKIFGQMSQRNAVSMNGLMVGLVRQKFGEDAAEVFMEMTNLVDINFDSYV 397

Query: 121 IILTAFPEFHVLENGKRKGSEVHAFLIRSGLLNAQIAIGNGLINMYAKCGAINDACVVFR 180
           I+L++F EF  LE G+RKG EVH +LIR GL +A +AIGNGLINMYAKCG I  +  VFR
Sbjct: 398 ILLSSFAEFSALEQGRRKGREVHGYLIRRGLNDAVVAIGNGLINMYAKCGDIVASTSVFR 457

Query: 181 LMDNKDSVTWNSMITGLDQNKQFLEAVKTFQEMRRTELYPSNFTMISALSSCASLGWISV 240
           LM NKD V+WNSMI+GLDQN+ F +AV +F  MRRT L PSN+T+ISALSSCASLGW  +
Sbjct: 458 LMLNKDLVSWNSMISGLDQNECFEDAVTSFCAMRRTGLMPSNYTVISALSSCASLGWSML 517

Query: 241 GEQLHCEGLKLGLDLDVSVSNALLALYGECGYVKECQKAFSLMLDYDHVSWNSLIGALAD 300
           G Q+H EG+KLGLD+DVSVSNALLALY   G + EC+  FSLMLD+D VSWNS+IGALAD
Sbjct: 518 GLQIHGEGMKLGLDVDVSVSNALLALYATIGCLSECKNIFSLMLDHDQVSWNSVIGALAD 577

Query: 301 SEPSMLEAVESFLVMMRAGWDPNRVTFITILAAVSSLSLHELGKQIHALVLKRNVAADTA 360
           SE S+LEAV+ FL MMR GWDPNR+TFI ILAAVSSLSL EL +QIH L++K ++A D++
Sbjct: 578 SESSVLEAVKYFLDMMRTGWDPNRITFINILAAVSSLSLSELSRQIHTLIIKYHLANDSS 637

Query: 361 IENALLACYGKCGDMGYCENIFSRMSDRQDEVSWNSMISG 401
           IENALLACYGKCG+M  CE IFSRMS+R+DEVSWNSMISG
Sbjct: 638 IENALLACYGKCGEMDECEKIFSRMSERRDEVSWNSMISG 677

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004144619.1  2.0e-226  100.00    PREDICTED: putative pentatricopeptide repeat-containing protein At5g09950 [Cucum...
XP_008462071.1  5.1e-214  95.50     PREDICTED: putative pentatricopeptide repeat-containing protein At5g09950 [Cucum...
XP_022136280.1  4.5e-194  86.25     putative pentatricopeptide repeat-containing protein At5g09950 [Momordica charan...
XP_022928551.1  2.1e-191  85.50     putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita moscha...
XP_023529590.1  1.4e-190  85.25     putative pentatricopeptide repeat-containing protein At5g09950 [Cucurbita pepo s...

Match Name   E-value   Identity  Description
AT5G09950.1  5.8e-135  58.88     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G33680.1  2.3e-51   30.60     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09040.1  1.5e-50   31.55     Pentatricopeptide repeat (PPR) superfamily protein
AT3G47840.1  9.8e-50   31.33     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G39530.1  2.8e-49   32.23     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity  Description
sp|Q9FIB2|PP373_ARATH  1.0e-133  58.88     Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
sp|P93005|PP181_ARATH  4.2e-50   30.60     Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX...
sp|Q9SS83|PP220_ARATH  2.7e-49   31.55     Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
sp|Q9STS9|PP268_ARATH  1.8e-48   31.33     Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis th...
sp|Q9SVA5|PP357_ARATH  5.1e-48   32.23     Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX...

Match Name                      E-value   Identity  Description
tr|A0A0A0K552|A0A0A0K552_CUCSA  1.3e-226  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G031730 PE=4 SV=1
tr|A0A1S3CHK4|A0A1S3CHK4_CUCME  3.4e-214  95.50     putative pentatricopeptide repeat-containing protein At5g09950 OS=Cucumis melo O...
tr|M5WQY7|M5WQY7_PRUPE          2.6e-153  67.75     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001014mg PE=4 SV=1
tr|A0A251Q2S2|A0A251Q2S2_PRUPE  2.6e-153  67.75     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G198300 PE=4 SV=1
tr|A0A061DL19|A0A061DL19_THECC  2.9e-149  67.75     Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao OX=3641 GN=...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_7G002540.1  CsaV3_7G002540.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description   Source   Source Term   Source Description
    Alignment (indented below each entry)

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 83..117   e-value: 7.4E-5  score: 20.7
    coord: 188..221  e-value: 2.2E-5  score: 22.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 261..285  e-value: 0.44    score: 10.8
    coord: 363..387  e-value: 0.023   score: 14.9
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 185..233  e-value: 1.5E-9  score: 37.8
    coord: 80..126   e-value: 5.6E-8  score: 32.7
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 358..392  score: 7.41
    coord: 186..220  score: 11.477
    coord: 323..357  score: 5.393
    coord: 81..111   score: 8.955
    coord: 256..286  score: 6.577
    coord: 155..185  score: 6.369
    coord: 13..49    score: 5.864
    coord: 50..80    score: 7.026
    coord: 287..322  score: 9.098
    coord: 221..255  score: 5.305
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 1..140    e-value: 8.9E-19  score: 70.0
    coord: 303..409  e-value: 1.0E-14  score: 56.6
    coord: 141..247  e-value: 8.4E-16  score: 59.8
None  No IPR available  PANTHER  PTHR24015:SF909  SUBFAMILY NOT NAMED
    coord: 185..315
    coord: 116..241
    coord: 10..111
    coord: 254..402
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 185..315
    coord: 116..241
    coord: 10..111
    coord: 254..402
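
The PROSITE PS51375 coordinates above are listed out of order. As a rough sketch of how they could be read, the snippet below sorts them and groups repeats that directly abut (next start equals previous end plus one). The function name `contiguous_runs` is hypothetical, and the grouping rule is an assumption made for illustration, not part of the InterPro analysis.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table above.
ppr_coords = [(358, 392), (186, 220), (323, 357), (81, 111), (256, 286),
              (155, 185), (13, 49), (50, 80), (287, 322), (221, 255)]


def contiguous_runs(coords):
    """Sort intervals by start and group those that abut (start == previous end + 1)."""
    runs = []
    for start, end in sorted(coords):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))
        else:
            runs.append([(start, end)])
    return runs


runs = contiguous_runs(ppr_coords)
for run in runs:
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeats")
```

Under that grouping rule the ten predicted repeats fall into two tandem blocks, residues 13..111 and 155..392.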

The following gene(s) are paralogous to this gene:

None