CsaV3_5G029290 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_5G029290
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: pentatricopeptide repeat-containing protein At2g19280
Location: chr5 : 24327940 .. 24329865 (+)
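
The location string can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which the page does not state explicitly); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'chr5 : 24327940 .. 24329865 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr5 : 24327940 .. 24329865 (+)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature spans 1926 bp on the plus strand of chr5.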
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGAGGACTGCACAAACTATGATGTTAACTCTGATGAACGAAGTTATGTTGGCAATGAAGTTGAAGTTTCTAAAGGTCAAAAAACTGATGAGGATGAAATGGAAACGATAAAATTGATACTTGGGAACCGTGGGTTTAATCTTGGTTCGTGTCCGAAACAATTGGAGATCATAAGGATTTTGGACGTTTTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTACTACTTCAAATGGTCAGGATGTTTATCTGGATCTAATCAGTCGCTGGAGTCAATTTGTAGGATGGCACATATTTTGGTTGCTGGGAATATGAATCATAGGGCAGTTGATTTAATATCACACCTTGTTAAAAACTATGGTTGTACAGAGGGATCTTCAAGCATATTGTTGAAAGTTTTCTGTGAAACGCATAATGGAAGGAAAACTTTGGAAACCACATGCAGCATGATGGTTAACTGTTATATCAAGGAAAGAATGGTAACCTCTGCTCTTATCTTGATTGATCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATAAAAGCTTTATTACAAACCAATCAGTCAGGCATGGCCTGGGATCTTCTAGAAGAAATGCACCGGCAAGGTGTAAGTTTAAATTATTCAATTAATTTATTTATTCATCATTATTGTTCAGAAGGTAATCTGGGCAAGGGATGGAAAGTGCTTTTGGAGTTGAGGAATTTTGGATCTAAGCCAGATGTAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAGTCTCTCTGTTAAAAGAAGCCACCGCCCTGTTGTTTAAAATGATTACTTTTGGTGTCTCTCCTGATTTAGTTACAATGAGTTCTATTATTGATGGTCATTGTAAAGTAGGAAAGTCAGATATAGCTTGTAAAATACTAAAGTACTTTAGGCTTCCCCTAAATATTTTCATATACAATAGCTTTATAACAAAGTTATCTACGGAAGGAGACATGGTAAAAGCTTCAAAAGTTTTTCTTGAAATGACTGAGGTGGGCTTAGTTCCAGATTGTATTAGTTACACAACCATGATAGGAGGTTATTGTAAAGTGGGAAACATAAACATAGCATTTTCTTACCTGAGCAAGATGTTAAAAAGTGGGATACAACCATCTGTTATTACGTATACTTTGTTCCTTGATTACTTTTGTGAGTGTAGAGATGTGGAAATGGCTGAAGTTATGTTTGAAAAGATGATTGTTGAGGGTTTAAAACCTGACGTTGTCGTGTATAATATATTGATGGATGCATATGGAAAGAAGGGCTACATGCACAAGGCTTTTAAACTCCTTGATATGATGAGATCTACAAATGTTACTCCTGACGTTGTGACGTATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTCAAGAAGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTGTAGATGTTGTCACATACACTAATATCATACATGGATATTCCACAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGGCTGAGAACTGTGTAACGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAAAAGCGTATGGATGAAGCAAATGCACTATTTTGTAAAATGCTGGACATTGGGTTAAAACCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTAGATGAAGGTTGCAATCTGGTAAAGAAGATGATTGAAAGCAGCATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTTGGATTTCAGAAAAAGAGAGTTACGGATCCAATACAGAGTGCCACTTCTAAACTCCAAGAAATCTTGATTGCATATGATCTTCAGATTGATGCCATTGGATATATCTAA

mRNA sequence

ATGGATGAGGACTGCACAAACTATGATGTTAACTCTGATGAACGAAGTTATGTTGGCAATGAAGTTGAAGTTTCTAAAGGTCAAAAAACTGATGAGGATGAAATGGAAACGATAAAATTGATACTTGGGAACCGTGGGTTTAATCTTGGTTCGTGTCCGAAACAATTGGAGATCATAAGGATTTTGGACGTTTTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTACTACTTCAAATGGTCAGGATGTTTATCTGGATCTAATCAGTCGCTGGAGTCAATTTGTAGGATGGCACATATTTTGGTTGCTGGGAATATGAATCATAGGGCAGTTGATTTAATATCACACCTTGTTAAAAACTATGGTTGTACAGAGGGATCTTCAAGCATATTGTTGAAAGTTTTCTGTGAAACGCATAATGGAAGGAAAACTTTGGAAACCACATGCAGCATGATGGTTAACTGTTATATCAAGGAAAGAATGGTAACCTCTGCTCTTATCTTGATTGATCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATAAAAGCTTTATTACAAACCAATCAGTCAGGCATGGCCTGGGATCTTCTAGAAGAAATGCACCGGCAAGGTGTAAGTTTAAATTATTCAATTAATTTATTTATTCATCATTATTGTTCAGAAGGTAATCTGGGCAAGGGATGGAAAGTGCTTTTGGAGTTGAGGAATTTTGGATCTAAGCCAGATGTAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAGTCTCTCTGTTAAAAGAAGCCACCGCCCTGTTGTTTAAAATGATTACTTTTGGTGTCTCTCCTGATTTAGTTACAATGAGTTCTATTATTGATGGTCATTGTAAAGTAGGAAAGTCAGATATAGCTTGTAAAATACTAAAGTACTTTAGGCTTCCCCTAAATATTTTCATATACAATAGCTTTATAACAAAGTTATCTACGGAAGGAGACATGGTAAAAGCTTCAAAAGTTTTTCTTGAAATGACTGAGGTGGGCTTAGTTCCAGATTGTATTAGTTACACAACCATGATAGGAGGTTATTGTAAAGTGGGAAACATAAACATAGCATTTTCTTACCTGAGCAAGATGTTAAAAAGTGGGATACAACCATCTGTTATTACGTATACTTTGTTCCTTGATTACTTTTGTGAGTGTAGAGATGTGGAAATGGCTGAAGTTATGTTTGAAAAGATGATTGTTGAGGGTTTAAAACCTGACGTTGTCGTGTATAATATATTGATGGATGCATATGGAAAGAAGGGCTACATGCACAAGGCTTTTAAACTCCTTGATATGATGAGATCTACAAATGTTACTCCTGACGTTGTGACGTATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTCAAGAAGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTGTAGATGTTGTCACATACACTAATATCATACATGGATATTCCACAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGGCTGAGAACTGTGTAACGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAAAAGCGTATGGATGAAGCAAATGCACTATTTTGTAAAATGCTGGACATTGGGTTAAAACCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTAGATGAAGGTTGCAATCTGGTAAAGAAGATGATTGAAAGCAGCATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTTGGATTTCAGAAAAAGAGAGTTACGGATCCAATACAGAGTGCCACTTCTAAACTCCAAGAAATCTTGATTGCATATGATCTTCAGATTGATGCCATTGGATATATCTAA

Coding sequence (CDS)

ATGGATGAGGACTGCACAAACTATGATGTTAACTCTGATGAACGAAGTTATGTTGGCAATGAAGTTGAAGTTTCTAAAGGTCAAAAAACTGATGAGGATGAAATGGAAACGATAAAATTGATACTTGGGAACCGTGGGTTTAATCTTGGTTCGTGTCCGAAACAATTGGAGATCATAAGGATTTTGGACGTTTTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTACTACTTCAAATGGTCAGGATGTTTATCTGGATCTAATCAGTCGCTGGAGTCAATTTGTAGGATGGCACATATTTTGGTTGCTGGGAATATGAATCATAGGGCAGTTGATTTAATATCACACCTTGTTAAAAACTATGGTTGTACAGAGGGATCTTCAAGCATATTGTTGAAAGTTTTCTGTGAAACGCATAATGGAAGGAAAACTTTGGAAACCACATGCAGCATGATGGTTAACTGTTATATCAAGGAAAGAATGGTAACCTCTGCTCTTATCTTGATTGATCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATAAAAGCTTTATTACAAACCAATCAGTCAGGCATGGCCTGGGATCTTCTAGAAGAAATGCACCGGCAAGGTGTAAGTTTAAATTATTCAATTAATTTATTTATTCATCATTATTGTTCAGAAGGTAATCTGGGCAAGGGATGGAAAGTGCTTTTGGAGTTGAGGAATTTTGGATCTAAGCCAGATGTAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAGTCTCTCTGTTAAAAGAAGCCACCGCCCTGTTGTTTAAAATGATTACTTTTGGTGTCTCTCCTGATTTAGTTACAATGAGTTCTATTATTGATGGTCATTGTAAAGTAGGAAAGTCAGATATAGCTTGTAAAATACTAAAGTACTTTAGGCTTCCCCTAAATATTTTCATATACAATAGCTTTATAACAAAGTTATCTACGGAAGGAGACATGGTAAAAGCTTCAAAAGTTTTTCTTGAAATGACTGAGGTGGGCTTAGTTCCAGATTGTATTAGTTACACAACCATGATAGGAGGTTATTGTAAAGTGGGAAACATAAACATAGCATTTTCTTACCTGAGCAAGATGTTAAAAAGTGGGATACAACCATCTGTTATTACGTATACTTTGTTCCTTGATTACTTTTGTGAGTGTAGAGATGTGGAAATGGCTGAAGTTATGTTTGAAAAGATGATTGTTGAGGGTTTAAAACCTGACGTTGTCGTGTATAATATATTGATGGATGCATATGGAAAGAAGGGCTACATGCACAAGGCTTTTAAACTCCTTGATATGATGAGATCTACAAATGTTACTCCTGACGTTGTGACGTATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTCAAGAAGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTGTAGATGTTGTCACATACACTAATATCATACATGGATATTCCACAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGGCTGAGAACTGTGTAACGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAAAAGCGTATGGATGAAGCAAATGCACTATTTTGTAAAATGCTGGACATTGGGTTAAAACCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTAGATGAAGGTTGCAATCTGGTAAAGAAGATGATTGAAAGCAGCATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTTGGATTTCAGAAAAAGAGAGTTACGGATCCAATACAGAGTGCCACTTCTAAACTCCAAGAAATCTTGATTGCATATGATCTTCAGATTGATGCCATTGGATATATCTAA

Protein sequence

MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIRILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTTVINSLCKVSLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKVGNINIAFSYLSKMLKSGIQPSVITYTLFLDYFCECRDVEMAEVMFEKMIVEGLKPDVVVYNILMDAYGKKGYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILDELIRRGFSVDVVTYTNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREKRMDEANALFCKMLDIGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVTDPIQSATSKLQEILIAYDLQIDAIGYI
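
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating short excerpts with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the two excerpts are the first 30 and last 18 nucleotides of the CDS as listed, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate in frame 0 and stop at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGATGAGGACTGCACAAACTATGATGTT"  # first 30 nt of the CDS
cds_end = "GCCATTGGATATATCTAA"                # last 18 nt, ending in the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MDEDCTNYDV and AIGYI, matching the first ten and last five residues of the protein sequence.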
BLAST of CsaV3_5G029290 vs. NCBI nr
Match: XP_011655513.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus] >XP_011655514.1 PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus] >KGN51586.1 hypothetical protein Csa_5G581700 [Cucumis sativus])

HSP 1 Score: 639.0 bits (1647), Expect = 1.7e-179
Identity = 366/366 (100.00%), Positives = 366/366 (100.00%), Query Frame = 0

Query: 1   MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 60
           MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR
Sbjct: 38  MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 97

Query: 61  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 120
           ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK
Sbjct: 98  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 157

Query: 121 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 180
           NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP
Sbjct: 158 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 217

Query: 181 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 240
           SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL
Sbjct: 218 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 277

Query: 241 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 278 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 337

Query: 301 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 360
           XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI
Sbjct: 338 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 397

Query: 361 GGYCKV 367
           GGYCKV
Sbjct: 398 GGYCKV 403

BLAST of CsaV3_5G029290 vs. NCBI nr
Match: XP_008445921.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo])

HSP 1 Score: 563.1 bits (1450), Expect = 1.2e-156
Identity = 324/366 (88.52%), Positives = 343/366 (93.72%), Query Frame = 0

Query: 1   MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 60
           MDEDCTNYDV+SDERSY GNEVEVSKG+KTDED+ME IKLILGNRGF LGS PKQLE +R
Sbjct: 38  MDEDCTNYDVDSDERSYFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVR 97

Query: 61  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 120
           ILD+LFEDSSD  LCLYYFKWSGCLSGSNQSLESICRMAHILVAGN NH AVDLISHLVK
Sbjct: 98  ILDILFEDSSDPELCLYYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVK 157

Query: 121 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 180
           NYGC EGSSSILL+VF +THN RKTLETTC MM+NCYIKE MVTSA+ILIDQM+ LN+FP
Sbjct: 158 NYGCKEGSSSILLEVFYDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFP 217

Query: 181 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 240
           SIWVYKSVIKALLQTN+  MAWDLLEEM RQG+SL+YSINLFIHHYCSEGNLGKGWKVLL
Sbjct: 218 SIWVYKSVIKALLQTNRFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLL 277

Query: 241 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ELRNFGSKPDVVDYTTV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 278 ELRNFGSKPDVVDYTTVINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 337

Query: 301 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 360
           XSDIACKILKYF++PLNIFIYNSFIT+L  EGD VKASKVFLEM+EVGLVPDC+SYTTMI
Sbjct: 338 XSDIACKILKYFKIPLNIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMI 397

Query: 361 GGYCKV 367
           GGYCKV
Sbjct: 398 GGYCKV 403

BLAST of CsaV3_5G029290 vs. NCBI nr
Match: XP_022957015.1 (pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957016.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957017.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957018.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957019.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata])

HSP 1 Score: 479.6 bits (1233), Expect = 1.7e-131
Identity = 287/362 (79.28%), Positives = 318/362 (87.85%), Query Frame = 0

Query: 2   DEDC--------TNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCP 61
           DEDC        TN DV+S+E++Y GN+V+VSKG K D+DEM+ IKLILGN GFNLGS P
Sbjct: 39  DEDCFTSELPAATNSDVDSEEQNYFGNDVQVSKGLKADDDEMKLIKLILGNHGFNLGSHP 98

Query: 62  KQLEIIRILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVD 121
           KQLEI+RILD+LFE+SSDA LCLYYFKWSGCLSGSN+SLESICRM HILVAGNMNHRAVD
Sbjct: 99  KQLEIVRILDILFEESSDARLCLYYFKWSGCLSGSNRSLESICRMIHILVAGNMNHRAVD 158

Query: 122 LISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQM 181
           L+SHLVKNYG  EG S+ILLK+F ETH+ RKTLETTCSM+V+CYIKERMVT+ALIL+ QM
Sbjct: 159 LMSHLVKNYGSKEGFSTILLKLFYETHHERKTLETTCSMLVDCYIKERMVTAALILMGQM 218

Query: 182 KHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLG 241
           K  +IFPSIWVYKSVI+ALLQTNQS  AWDLLEEMHRQG+SLNYSINLFI+HYC++GNL 
Sbjct: 219 KSFDIFPSIWVYKSVIQALLQTNQSESAWDLLEEMHRQGISLNYSINLFIYHYCAKGNLS 278

Query: 242 KGWKVLLELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 301
           +GWKVLLELR FGSKPD VDYT V   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 279 RGWKVLLELRKFGSKPDAVDYTIVINSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 338

Query: 302 XXXXXXXXSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDC 356
           XXXXXXXX      ILKYFR PLNIFIYNSFITKL  EG+ VKAS+VFLEM+EVGLVPDC
Sbjct: 339 XXXXXXXXXXXXXXILKYFRRPLNIFIYNSFITKLCMEGNTVKASEVFLEMSEVGLVPDC 398

BLAST of CsaV3_5G029290 vs. NCBI nr
Match: XP_022139130.1 (pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia] >XP_022139131.1 pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia] >XP_022139132.1 pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia])

HSP 1 Score: 450.3 bits (1157), Expect = 1.1e-122
Identity = 278/362 (76.80%), Positives = 306/362 (84.53%), Query Frame = 0

Query: 1   MDEDC--------TNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSC 60
           +D+DC         NYDV+ DE+ Y  N  E  KGQK D+D M+ IKLIL N G NLGS 
Sbjct: 65  VDDDCFTYEYPVAANYDVDFDEKIYFRN--EDPKGQKVDDDRMKMIKLILRNHGLNLGSH 124

Query: 61  PKQLEIIRILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAV 120
           PKQLEI+RILD LFEDSSDAGL LYYFKWSGCLSGSNQSL+SICRM  IL+ GNMNHRAV
Sbjct: 125 PKQLEIVRILDTLFEDSSDAGLSLYYFKWSGCLSGSNQSLQSICRMIRILITGNMNHRAV 184

Query: 121 DLISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQ 180
           DL+SH+V+NYG  EGSSS+LLK+F E  N RKTLET CSM+V CYIKERMVT+ALIL+ Q
Sbjct: 185 DLMSHIVENYGSKEGSSSMLLKLFFEMVNERKTLETACSMLVYCYIKERMVTAALILMGQ 244

Query: 181 MKHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNL 240
           MKHL IFPSIWVY+SVI+ LL+TNQ  +AWDLLEEM+ QGVSLNYSINLFIHHYC+EGNL
Sbjct: 245 MKHLKIFPSIWVYRSVIQTLLETNQLELAWDLLEEMYIQGVSLNYSINLFIHHYCAEGNL 304

Query: 241 GKGWKVLLELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           G GWKVLLELRNFGSKPD VDYT V XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 305 GMGWKVLLELRNFGSKPDAVDYTIVIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 364

Query: 301 XXXXXXXXXSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPD 355
           XXXXXXXXX      ILKYFRLPLNIF YNSFITKL TEG+MV AS+VFLEM+EVGL+PD
Sbjct: 365 XXXXXXXXXXXXXXXILKYFRLPLNIFTYNSFITKLCTEGNMVSASEVFLEMSEVGLLPD 424

BLAST of CsaV3_5G029290 vs. NCBI nr
Match: XP_022957020.1 (pentatricopeptide repeat-containing protein At2g19280 isoform X2 [Cucurbita moschata])

HSP 1 Score: 354.4 bits (908), Expect = 8.3e-94
Identity = 180/262 (68.70%), Positives = 209/262 (79.77%), Query Frame = 0

Query: 2   DEDC--------TNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCP 61
           DEDC        TN DV+S+E++Y GN+V+VSKG K D+DEM+ IKLILGN GFNLGS P
Sbjct: 39  DEDCFTSELPAATNSDVDSEEQNYFGNDVQVSKGLKADDDEMKLIKLILGNHGFNLGSHP 98

Query: 62  KQLEIIRILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVD 121
           KQLEI+RILD+LFE+SSDA LCLYYFKWSGCLSGSN+SLESICRM HILVAGNMNHRAVD
Sbjct: 99  KQLEIVRILDILFEESSDARLCLYYFKWSGCLSGSNRSLESICRMIHILVAGNMNHRAVD 158

Query: 122 LISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQM 181
           L+SHLVKNYG  EG S+ILLK+F ETH+ RKTLETTCSM+V+CYIKERMVT+ALIL+ QM
Sbjct: 159 LMSHLVKNYGSKEGFSTILLKLFYETHHERKTLETTCSMLVDCYIKERMVTAALILMGQM 218

Query: 182 KHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLG 241
           K  +IFPSIWVYKSVI+ALLQTNQS  AWDLLEEMHRQG             YC  GN+ 
Sbjct: 219 KSFDIFPSIWVYKSVIQALLQTNQSESAWDLLEEMHRQG------------GYCKVGNIN 278

Query: 242 KGWKVLLELRNFGSKPDVVDYT 256
           + +  L ++   G +P V+ YT
Sbjct: 279 RAFSYLGKMLKSGIRPSVITYT 288


HSP 2 Score: 53.1 bits (126), Expect = 4.0e-03
Identity = 25/30 (83.33%), Positives = 28/30 (93.33%), Query Frame = 0

Query: 612 RVTDPIQSATSKLQEILIAYDLQIDAIGYI 642
           +V DPI+SATSKLQEIL+AYDLQIDA GYI
Sbjct: 508 KVIDPIESATSKLQEILLAYDLQIDANGYI 537

BLAST of CsaV3_5G029290 vs. TAIR10
Match: AT2G19280.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 142.9 bits (359), Expect = 6.9e-34
Identity = 87/246 (35.37%), Positives = 136/246 (55.28%), Query Frame = 0

Query: 11  NSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGF------NLGSCPKQLEIIRILDV 70
           +S  + +  + V + K      D +ETI+ +L    +         +   Q  +IRILD 
Sbjct: 59  HSSSKHFGEDFVSILKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDD 118

Query: 71  LFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGC 130
           LFE++ DA + LY+F+WS    G   S  SI RM HILV+GNMN+RAVD++  LVK    
Sbjct: 119 LFEETLDASIVLYFFRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSG 178

Query: 131 TEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWV 190
            E S              R+ LET  S++++C I+ER V  AL L  ++    IFPS  V
Sbjct: 179 EERSLXXXXXXXXXXXIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGV 238

Query: 191 YKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSI-NLFIHHYCSEGNLGKGWKVLLELR 250
             S++K +L+ +   +A + +E M  +G  LN ++ +LFI  YCS+G   KGW++L+ ++
Sbjct: 239 CISLLKEILRVHGLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMK 298

BLAST of CsaV3_5G029290 vs. TAIR10
Match: AT4G11690.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 50.8 bits (120), Expect = 3.6e-06
Identity = 33/160 (20.62%), Positives = 70/160 (43.75%), Query Frame = 0

Query: 93  ESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSM 152
           ESI  +  +L++GN+   A  L+  ++     ++  +S  L  +       KT      +
Sbjct: 40  ESISILLRLLLSGNLFSHAQSLLLQVISGKIHSQFFTSSSLLHYLTESETSKTKFRLYEV 99

Query: 153 MVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQG 212
           ++N Y++ + +  ++   ++M      P    +  ++  ++ ++     W    E   + 
Sbjct: 100 IINSYVQSQSLNLSISYFNEMVDNGFVPGSNCFNYLLTFVVGSSSFNQWWSFFNENKSKV 159

Query: 213 VSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVV 253
           V   YS  + I   C  G + K + +L+EL  FG  P+VV
Sbjct: 160 VLDVYSFGILIKGCCEAGEIEKSFDLLIELTEFGFSPNVV 199

BLAST of CsaV3_5G029290 vs. Swiss-Prot
Match: sp|Q6NKW7|PP164_ARATH (Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana OX=3702 GN=At2g19280 PE=2 SV=2)

HSP 1 Score: 142.9 bits (359), Expect = 1.2e-32
Identity = 87/246 (35.37%), Positives = 136/246 (55.28%), Query Frame = 0

Query: 11  NSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGF------NLGSCPKQLEIIRILDV 70
           +S  + +  + V + K      D +ETI+ +L    +         +   Q  +IRILD 
Sbjct: 59  HSSSKHFGEDFVSILKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDD 118

Query: 71  LFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGC 130
           LFE++ DA + LY+F+WS    G   S  SI RM HILV+GNMN+RAVD++  LVK    
Sbjct: 119 LFEETLDASIVLYFFRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSG 178

Query: 131 TEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWV 190
            E S              R+ LET  S++++C I+ER V  AL L  ++    IFPS  V
Sbjct: 179 EERSLXXXXXXXXXXXIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGV 238

Query: 191 YKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSI-NLFIHHYCSEGNLGKGWKVLLELR 250
             S++K +L+ +   +A + +E M  +G  LN ++ +LFI  YCS+G   KGW++L+ ++
Sbjct: 239 CISLLKEILRVHGLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMK 298

BLAST of CsaV3_5G029290 vs. Swiss-Prot
Match: sp|Q9T0D6|PP306_ARATH (Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana OX=3702 GN=At4g11690 PE=2 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 6.4e-05
Identity = 33/160 (20.62%), Positives = 70/160 (43.75%), Query Frame = 0

Query: 93  ESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSM 152
           ESI  +  +L++GN+   A  L+  ++     ++  +S  L  +       KT      +
Sbjct: 40  ESISILLRLLLSGNLFSHAQSLLLQVISGKIHSQFFTSSSLLHYLTESETSKTKFRLYEV 99

Query: 153 MVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQG 212
           ++N Y++ + +  ++   ++M      P    +  ++  ++ ++     W    E   + 
Sbjct: 100 IINSYVQSQSLNLSISYFNEMVDNGFVPGSNCFNYLLTFVVGSSSFNQWWSFFNENKSKV 159

Query: 213 VSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVV 253
           V   YS  + I   C  G + K + +L+EL  FG  P+VV
Sbjct: 160 VLDVYSFGILIKGCCEAGEIEKSFDLLIELTEFGFSPNVV 199

BLAST of CsaV3_5G029290 vs. TrEMBL
Match: tr|A0A0A0KV38|A0A0A0KV38_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G581700 PE=4 SV=1)

HSP 1 Score: 639.0 bits (1647), Expect = 1.1e-179
Identity = 366/366 (100.00%), Positives = 366/366 (100.00%), Query Frame = 0

Query: 1   MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 60
           MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR
Sbjct: 38  MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 97

Query: 61  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 120
           ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK
Sbjct: 98  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 157

Query: 121 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 180
           NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP
Sbjct: 158 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 217

Query: 181 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 240
           SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL
Sbjct: 218 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 277

Query: 241 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 278 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 337

Query: 301 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 360
           XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI
Sbjct: 338 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 397

Query: 361 GGYCKV 367
           GGYCKV
Sbjct: 398 GGYCKV 403

BLAST of CsaV3_5G029290 vs. TrEMBL
Match: tr|A0A1S3BDC2|A0A1S3BDC2_CUCME (pentatricopeptide repeat-containing protein At2g19280 OS=Cucumis melo OX=3656 GN=LOC103488803 PE=4 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 7.8e-157
Identity = 324/366 (88.52%), Positives = 343/366 (93.72%), Query Frame = 0

Query: 1   MDEDCTNYDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIR 60
           MDEDCTNYDV+SDERSY GNEVEVSKG+KTDED+ME IKLILGNRGF LGS PKQLE +R
Sbjct: 38  MDEDCTNYDVDSDERSYFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVR 97

Query: 61  ILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVK 120
           ILD+LFEDSSD  LCLYYFKWSGCLSGSNQSLESICRMAHILVAGN NH AVDLISHLVK
Sbjct: 98  ILDILFEDSSDPELCLYYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVK 157

Query: 121 NYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFP 180
           NYGC EGSSSILL+VF +THN RKTLETTC MM+NCYIKE MVTSA+ILIDQM+ LN+FP
Sbjct: 158 NYGCKEGSSSILLEVFYDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFP 217

Query: 181 SIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLL 240
           SIWVYKSVIKALLQTN+  MAWDLLEEM RQG+SL+YSINLFIHHYCSEGNLGKGWKVLL
Sbjct: 218 SIWVYKSVIKALLQTNRFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLL 277

Query: 241 ELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ELRNFGSKPDVVDYTTV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 278 ELRNFGSKPDVVDYTTVINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 337

Query: 301 XSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMI 360
           XSDIACKILKYF++PLNIFIYNSFIT+L  EGD VKASKVFLEM+EVGLVPDC+SYTTMI
Sbjct: 338 XSDIACKILKYFKIPLNIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMI 397

Query: 361 GGYCKV 367
           GGYCKV
Sbjct: 398 GGYCKV 403

BLAST of CsaV3_5G029290 vs. TrEMBL
Match: tr|A0A1Q3CJL3|A0A1Q3CJL3_CEPFO (PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_23769 PE=4 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 6.3e-58
Identity = 173/350 (49.43%), Positives = 231/350 (66.00%), Query Frame = 0

Query: 24  VSKGQKTDEDEMETIKLILGNRGFNLG------SCPKQLEIIRILDVLFEDSSDAGLCLY 83
           + K QK D D+M  IK +L NRG+N+G          +  II+IL+ LFE++ DA L LY
Sbjct: 92  IFKNQKVD-DDMNVIKSVLKNRGWNVGYENGFQVDLDEFNIIQILNDLFEETLDAALALY 151

Query: 84  YFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVFC 143
           +F+WS    GS  ++ S+C+M HILV+GNMNHRAVDL+ H+V++    E   +++LK+  
Sbjct: 152 FFRWSEYYIGSEHTIRSVCKMIHILVSGNMNHRAVDLVVHIVRS-NIKEVFHNLVLKILY 211

Query: 144 ETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTNQ 203
           ETH  R+ LE   +M+V+CYIKE MV  AL +  +MK LN+FP+I +  S++KALL + +
Sbjct: 212 ETHTKREVLEIVYNMLVDCYIKENMVNVALEIKCKMKQLNLFPTIKLCNSLLKALLGSQK 271

Query: 204 SGMAWDLLEEMHRQGVSLN-YSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 263
             +AW+L+EEM  QG+SLN Y I+LFI  Y S+GN+          + FG  PDVV YT 
Sbjct: 272 LELAWELVEEMMTQGISLNVYIISLFIAAY-SKGNVEXXXXXXXXXKRFGIYPDVVAYTI 331

Query: 264 VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDIACKILKYFRLPL 323
           V          XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        KYF L  
Sbjct: 332 VVDCLCKMYCLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKYFNLTP 391

Query: 324 NIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKV 367
           N+F+YNS    L +   +V+ + +F EM E+G +PDC++YTT+IG YCKV
Sbjct: 392 NVFVYNS----LMSNSRLVEVAFLFNEMHELGFLPDCVNYTTIIGSYCKV 434

BLAST of CsaV3_5G029290 vs. TrEMBL
Match: tr|A0A251QBI9|A0A251QBI9_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G051600 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 1.7e-55
Identity = 184/335 (54.93%), Positives = 226/335 (67.46%), Query Frame = 0

Query: 25  SKGQKTDEDEMETIKLILGNRGFNLGSCP-------KQLEIIRILDVLFEDSSDAGLCLY 84
           S  ++ DEDEM+ + LIL  RG+NLG C         QL  I +L+ LFE+S DA L LY
Sbjct: 101 SINERPDEDEMKRLMLILAKRGWNLG-CQNGYNIYLNQLNTIELLNDLFEESFDAKLVLY 160

Query: 85  YFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVFC 144
           +FKWS C SGS  +L++ICRM HILV+GN+NHRAVDLI  LV+N+G  E  +S LL+V  
Sbjct: 161 FFKWSECCSGSKHTLQTICRMIHILVSGNLNHRAVDLILRLVRNHGDEESCNS-LLEVLD 220

Query: 145 ETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTNQ 204
           ETH+  + LETTCSM+VN YI+E MV  AL +  QMKHLNIFPS  V  S+++ALL + Q
Sbjct: 221 ETHSEIRVLETTCSMLVNGYIQEGMVNMALKIACQMKHLNIFPSNGVCNSLLQALLGSKQ 280

Query: 205 SGMAWDLLEEMHRQGVSLNYS-INLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 264
             +AWD LE M  +G+ LN + ++LFI+ YC                             
Sbjct: 281 LELAWDFLEVMRTRGMGLNAAMMSLFINKYCXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 340

Query: 265 VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDIACKILKYFRLPL 324
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX       LK F  PL
Sbjct: 341 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKIFNTPL 400

Query: 325 NIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVP 352
           NIFIYNSFI+KL T+G+M +AS +F EM+ +GL+P
Sbjct: 401 NIFIYNSFISKLCTDGNMAEASSLFHEMSMLGLLP 433

BLAST of CsaV3_5G029290 vs. TrEMBL
Match: tr|A0A2P5D9D3|A0A2P5D9D3_9ROSA (Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_258450 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 1.7e-55
Identity = 185/347 (53.31%), Positives = 229/347 (65.99%), Query Frame = 0

Query: 13  DERSYVG-NEVEVSK------GQKTDEDEMETIKLILGNRGFNLGSCP------KQLEII 72
           DE S+ G +EVEV K       +K +  E+  I  IL NRG+NL SC        +L I+
Sbjct: 94  DELSFGGKDEVEVVKDVLFFNNKKPEVPEVIRIIRILTNRGWNLTSCNGFRINLNELNIM 153

Query: 73  RILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLV 132
           RI++ LF++SSDA L  Y+FKWS C  GS  S+ S+CRM HIL +GNMNHR +DL+  LV
Sbjct: 154 RIMNDLFQESSDAALAFYFFKWSECCIGSKHSVRSVCRMIHILASGNMNHRVMDLMLRLV 213

Query: 133 KNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIF 192
           + Y   E S S+LL V CETH+ +K LET CSM+VNCYIKE MV  AL L  Q+KHLNIF
Sbjct: 214 RQY-AEEDSHSLLLTVLCETHSEKKILETVCSMLVNCYIKENMVDVALKLTSQLKHLNIF 273

Query: 193 PSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQGVSLNYS-INLFIHHYCSEGNLGKGWKV 252
           PS  V+ ++++ L+ +NQ  +AW  LE++H  G  ++ S I+LFIHHYC EG        
Sbjct: 274 PSDRVFHALLRELVGSNQLELAWVWLEDIHSIGRGISASTISLFIHHYCKEGXXXXXXXX 333

Query: 253 LLELRNFGSKPDVVDYTTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 312
                              XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 334 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 393

Query: 313 XXXSDIACKILKYFRLPLNIFIYNSFITKLSTEGDMVKASKVFLEMT 346
           XXX + A  ILK F LP N F+YNS I KL  +G+MV+ +K+F EMT
Sbjct: 394 XXXLEKAINILKVFNLPHNNFMYNSVIYKLCWDGNMVEVAKLFYEMT 439
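
The identity and positives percentages in the HSP headers above are ratios of matching alignment columns to alignment length. The following sketch recomputes them from a header line as a consistency check; `check_hsp` is a hypothetical helper, and the regular expression tolerates a missing "i" in "Positives" so that it also accepts raw database exports.

```python
import re

HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp(line):
    """Return (recomputed, reported) percentages for identity and positives."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident_calc = round(100 * int(ident) / int(ident_len), 2)
    pos_calc = round(100 * int(pos) / int(pos_len), 2)
    return (ident_calc, float(ident_pct)), (pos_calc, float(pos_pct))

# Header of the Cucumis melo hit (XP_008445921.1).
line = "Identity = 324/366 (88.52%), Positives = 343/366 (93.72%), Query Frame = 0"
identity, positives = check_hsp(line)
print(identity, positives)
```

For the Cucumis melo hit the recomputed values agree with the reported 88.52% identity and 93.72% positives.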

The following BLAST results are available for this feature:
BLAST vs. NCBI nr:
Match Name                      E-value   Identity  Description
XP_011655513.1                  1.7e-179  100.00    PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativu...
XP_008445921.1                  1.2e-156  88.52     PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo]
XP_022957015.1                  1.7e-131  79.28     pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita mosc...
XP_022139130.1                  1.1e-122  76.80     pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica...
XP_022957020.1                  8.3e-94   68.70     pentatricopeptide repeat-containing protein At2g19280 isoform X2 [Cucurbita mosc...

BLAST vs. TAIR10:
Match Name                      E-value   Identity  Description
AT2G19280.1                     6.9e-34   35.37     Pentatricopeptide repeat (PPR) superfamily protein
AT4G11690.1                     3.6e-06   20.63     Pentatricopeptide repeat (PPR-like) superfamily protein

BLAST vs. Swiss-Prot:
Match Name                      E-value   Identity  Description
sp|Q6NKW7|PP164_ARATH           1.2e-32   35.37     Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana OX...
sp|Q9T0D6|PP306_ARATH           6.4e-05   20.63     Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana OX...

BLAST vs. TrEMBL:
Match Name                      E-value   Identity  Description
tr|A0A0A0KV38|A0A0A0KV38_CUCSA  1.1e-179  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G581700 PE=4 SV=1
tr|A0A1S3BDC2|A0A1S3BDC2_CUCME  7.8e-157  88.52     pentatricopeptide repeat-containing protein At2g19280 OS=Cucumis melo OX=3656 GN...
tr|A0A1Q3CJL3|A0A1Q3CJL3_CEPFO  6.3e-58   49.43     PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-conta...
tr|A0A251QBI9|A0A251QBI9_PRUPE  1.7e-55   54.93     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G051600 PE=4 SV=1
tr|A0A2P5D9D3|A0A2P5D9D3_9ROSA  1.7e-55   53.31     Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_5G029290.1  CsaV3_5G029290.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source       Source Term       Source Description
    Alignment (coord, e-value, score)
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10
    coord: 244..311  e-value: 7.6E-16  score: 60.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10
    coord: 109..243  e-value: 5.5E-16  score: 60.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10
    coord: 506..628  e-value: 2.6E-30  score: 107.1
    coord: 399..505  e-value: 1.6E-34  score: 120.9
    coord: 312..398  e-value: 4.1E-23  score: 83.6
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR
    coord: 184..213  e-value: 0.015  score: 15.4
    coord: 149..177  e-value: 0.03  score: 14.5
    coord: 320..348  e-value: 0.0062  score: 16.6
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2
    coord: 249..298  e-value: 6.0E-10  score: 39.0
    coord: 351..399  e-value: 3.1E-13  score: 49.6
    coord: 492..540  e-value: 2.7E-13  score: 49.8
    coord: 421..468  e-value: 5.1E-17  score: 61.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756
    coord: 424..458  e-value: 4.8E-9  score: 33.8
    coord: 565..597  e-value: 2.5E-8  score: 31.6
    coord: 494..528  e-value: 3.9E-6  score: 24.7
    coord: 529..562  e-value: 1.5E-7  score: 29.1
    coord: 254..285  e-value: 2.1E-5  score: 22.4
    coord: 320..353  e-value: 3.9E-5  score: 21.5
    coord: 184..216  e-value: 2.1E-4  score: 19.2
    coord: 354..388  e-value: 2.9E-8  score: 31.4
    coord: 149..181  e-value: 0.0013  score: 16.7
    coord: 389..423  e-value: 6.3E-7  score: 27.2
    coord: 459..493  e-value: 5.3E-8  score: 30.6
    coord: 217..251  e-value: 2.1E-4  score: 19.2
IPR002885  Pentatricopeptide repeat                           PFAM         PF12854           PPR_1
    coord: 558..589  e-value: 5.5E-11  score: 41.9
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 217..249  score: 6.785
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 285..315  score: 6.873
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 527..561  score: 13.088
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 146..180  score: 8.342
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 181..215  score: 8.977
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 457..491  score: 11.762
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 422..456  score: 12.342
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 250..284  score: 10.457
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 492..526  score: 11.071
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 387..421  score: 10.907
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 562..596  score: 12.068
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 352..386  score: 12.244
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR
    coord: 317..351  score: 10.26
None       No IPR available                                   PANTHER      PTHR44149:SF1     SUBFAMILY NOT NAMED
    coord: 327..503
    coord: 113..310
    coord: 476..614
None       No IPR available                                   PANTHER      PTHR44149         FAMILY NOT NAMED
    coord: 327..503
    coord: 113..310
    coord: 476..614
None       No IPR available                                   PANTHER      PTHR44149         FAMILY NOT NAMED
    coord: 220..365
None       No IPR available                                   PANTHER      PTHR44149:SF1     SUBFAMILY NOT NAMED
    coord: 220..365
None       No IPR available                                   SUPERFAMILY  SSF81901          HCP-like
    coord: 328..520
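
The PROSITE PS51375 hits above are listed out of positional order. Sorting them makes the tandem arrangement of the pentatricopeptide repeats easier to see. This is a minimal sketch, assuming the coordinates are 1-based inclusive residue positions (not stated on the page); the variable names are illustrative.

```python
# PROSITE PS51375 (PPR) hit coordinates, in the order they appear above.
ps51375_hits = [
    (217, 249), (285, 315), (527, 561), (146, 180), (181, 215),
    (457, 491), (422, 456), (250, 284), (492, 526), (387, 421),
    (562, 596), (352, 386), (317, 351),
]

repeats = sorted(ps51375_hits)
# Assumption: inclusive coordinates, so length is end - start + 1.
lengths = [end - start + 1 for start, end in repeats]
# Residues left uncovered between one repeat and the next.
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(repeats, repeats[1:])]

for (start, end), length in zip(repeats, lengths):
    print(f"{start:>3}..{end:<3}  length {length}")
print("repeats:", len(repeats), "largest gap:", max(gaps))
```

Sorted this way, the 13 hits form a near-contiguous array from residue 146 to residue 596, with at most one uncovered residue between neighbouring repeats.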

The following gene(s) are paralogous to this gene:

None