CsGy3G017420.1 (mRNA) Cucumber (Gy14) v2

NameCsGy3G017420.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionpentatricopeptide repeat-containing protein isoform X2
LocationChr3 : 13391187 .. 13393788 (+)
Sequence length1355
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAATAGTGAAATTATCTTTTGAAAAATGGGTTTTGAGATATTTGGGGGCAATTTTCCGTAGAAAAATTGGGAATGCTAATCATTTTCACTTCGATGACACACTCTCTAGCCCAAGCAACGCTCTCGTCCTTCTTGGCTTCCTACCGGTAAGCTCTCCCTTTCTCCTTATCTCTATTTCTCTCAACTTTCGTGCTTGGTTACTGAGAAAAATTCAAAAAGAAAACGAGAAATTTGTTCTTATATACGAAGTATGCTTTGTGTGGACGATGAAATTTATTCATTTCCTTGAATAGCGATTTCGTGTTTTGCCTGGTTCCTCCTAGAGCTGGAAATTTTGATTTAAATTCGTTGAAATCTATTGACATTTGATCTCATTTGCTGGTTCAGGATGCTTACTTTAGTTTACACTTTTCCAGTCACGTCCAAAAGGATAGAATCTGTAAATTTTTCCTGGTGTCCAAGCAGCTCGGTTGTAAGTCTTCTCGTCAGTTACTTCGCTGCTAGCGATGATAAACGGAAATTGGGTTTAATTTTTACGCAATTCCTGAAATTTTCTTTCACTTCTTACCCAAGAGTTATTTTGTAGGTATGCGCTGCAAAGGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAAAAAGAGGATCGGGACTATATCCAAGGCAGCAAAACTTGTTGATTGTGTAAGCTCTCTTGTTCCTGTCTCACCAATCCTATATATCCTCCATCCATAAGTAGGGTGACATGATGCCTATTCAAATTGGATTTTCATAGTAGAAAATCTTTTCTTTCATCTTCATTGGGGGTATATTTGGGATGCATTTTCAGACTTTTGGAATTATTTTTGACAGCATAACCAAACATCAAGCAAATTTTAAATAGTACTTTTGATTTCCAACATAAATTTTCACCATATTTAAATTCAACACCAAAGTCACCCTTAATGTAATGCAGTCTCTTTTTATTTGTTATATGATGTAACTCTAAATTTACCTTCATTCATGAGCTTGAGCTTTTGGTTTCAATGGGTGATTTAACATGGTATCTAAGTGGATGGTCTAGGTGATCCTCTATTCAAATCCATGCAACATTAGTTCCTTTTCAATCAATATGTTAGTTCCTTTTCAATTAATATTGATTTTCACTTGTTCGATAGTCTATATATTTCAAGCCACAAGTGAGGGGAGTGTTAGATGGAAAATTCAATGATCTAACATTGCTTCTTGTTGCTACTTTTGAACTTGTTAACTTCTGGGTCGACCTTTTTAAGTACTGACTTCGCATTTTAGGTAAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGGGCTCTTGATTCCTTCATTGCCTGGGAACTAGAGTTTCCTCTTATTACAGTTAAGAAAGCCCTGAAGACCTTAGAGAACCAAAGAGAATGGAAGAGGATAATTCAGGTACATTCTTACATGTTTCTTTCATGTCCATTCTTGCTCTGCCCTTTTGTCCTGAGGATGTTGCTGACGAAATGAGTTAAGCAAAATGAGTATACTATTGTAGTTGACGAAATGGATGCTAAGTAAAGGCCAGGGAAGAACAATGGGAAGCTATTTTACACTATTAAACGCATTAGCTGAAGATGGAAGACTTGATGAAGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATTCCTCGCATATTCTTTCATAAAATGATATCCCTCTACTACGATCAGGCAATGCATGACAAGTTGTTTGAGGTATTTCGTTTTATTGCTACAGAAGTAGTTTCAACTGCATTGGTCACAAATCTCTAACCATTCCTTCCACTCCATATTGATTAAAAAAGGGATAAAAGTTATGATACATGGTGCCGATATATCATTTTTAGTTCAATCATATTGAGTTCCATGGGTAACGAAATTTAACTATGATTGACTTAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAGCCAAATATGGCAATTGTCACTAAGGTTGGAAACGTTTTCCAAGAGTTGGGTATGCTCGATAAATACAAAAAACTGATGAAGAAATATCCCCCACCGAAATGGGAATATCGTTACATCAAAGGAAAACGCGTCAAAATACGAGCTAAGTATCTGTCTGAAAATGGTAATTCAAACAATGGTTTAAGTGAGCATGCCAAAATGGAGCATAGTTCAACAAATTCAATAGATGAAGCTGAAATAACTTCCGAAGATTCCAGTCTCGAAGATGATGAGGATATGAGCGAAGATCCAGATGAAATTCTGGAAGATGAACATATGTGGAGCAAATCCAATTTTGAGCATGATTTCATGGGGCTTGGGCAATTGTAAATATATACACATTGTAGTTCCTTGTCGAAGTTTGTCATAGCGTAGACTTTTATATCAGTTATTATTTGTAGTTTTAATAGACGAGATGACACATTGATCATAGAGCCAATTCCCTTTATTTCATCCTTGAATATTATAGTCCCCGATCCAACATATGGCCTAGGCTTATGATGAAACAAGGGAACAATTCTTGGTTTAAAATTTTCATTAACAAAATATACGAACGTGGTCTTATAACTCATGAGGAAAAAAAAGGTTGCTG

mRNA sequence

AAAAAAATAGTGAAATTATCTTTTGAAAAATGGGTTTTGAGATATTTGGGGGCAATTTTCCGTAGAAAAATTGGGAATGCTAATCATTTTCACTTCGATGACACACTCTCTAGCCCAAGCAACGCTCTCGTCCTTCTTGGCTTCCTACCGGATGCTTACTTTAGTTTACACTTTTCCAGTCACGTCCAAAAGGATAGAATCTGTAAATTTTTCCTGGTGTCCAAGCAGCTCGGTTGTATGCGCTGCAAAGGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAAAAAGAGGATCGGGACTATATCCAAGGCAGCAAAACTTGTTGATTGTGTAAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGGGCTCTTGATTCCTTCATTGCCTGGGAACTAGAGTTTCCTCTTATTACAGTTAAGAAAGCCCTGAAGACCTTAGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGCTAAGTAAAGGCCAGGGAAGAACAATGGGAAGCTATTTTACACTATTAAACGCATTAGCTGAAGATGGAAGACTTGATGAAGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATTCCTCGCATATTCTTTCATAAAATGATATCCCTCTACTACGATCAGGCAATGCATGACAAGTTGTTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAGCCAAATATGGCAATTGTCACTAAGGTTGGAAACGTTTTCCAAGAGTTGGGTATGCTCGATAAATACAAAAAACTGATGAAGAAATATCCCCCACCGAAATGGGAATATCGTTACATCAAAGGAAAACGCGTCAAAATACGAGCTAAGTATCTGTCTGAAAATGGTAATTCAAACAATGGTTTAAGTGAGCATGCCAAAATGGAGCATAGTTCAACAAATTCAATAGATGAAGCTGAAATAACTTCCGAAGATTCCAGTCTCGAAGATGATGAGGATATGAGCGAAGATCCAGATGAAATTCTGGAAGATGAACATATGTGGAGCAAATCCAATTTTGAGCATGATTTCATGGGGCTTGGGCAATTGTAAATATATACACATTGTAGTTCCTTGTCGAAGTTTGTCATAGCGTAGACTTTTATATCAGTTATTATTTGTAGTTTTAATAGACGAGATGACACATTGATCATAGAGCCAATTCCCTTTATTTCATCCTTGAATATTATAGTCCCCGATCCAACATATGGCCTAGGCTTATGATGAAACAAGGGAACAATTCTTGGTTTAAAATTTTCATTAACAAAATATACGAACGTGGTCTTATAACTCATGAGGAAAAAAAAGGTTGCTG

Coding sequence (CDS)

ATGCTAATCATTTTCACTTCGATGACACACTCTCTAGCCCAAGCAACGCTCTCGTCCTTCTTGGCTTCCTACCGGATGCTTACTTTAGTTTACACTTTTCCAGTCACGTCCAAAAGGATAGAATCTGTAAATTTTTCCTGGTGTCCAAGCAGCTCGGTTGTATGCGCTGCAAAGGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAAAAAGAGGATCGGGACTATATCCAAGGCAGCAAAACTTGTTGATTGTGTAAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGGGCTCTTGATTCCTTCATTGCCTGGGAACTAGAGTTTCCTCTTATTACAGTTAAGAAAGCCCTGAAGACCTTAGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGCTAAGTAAAGGCCAGGGAAGAACAATGGGAAGCTATTTTACACTATTAAACGCATTAGCTGAAGATGGAAGACTTGATGAAGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATTCCTCGCATATTCTTTCATAAAATGATATCCCTCTACTACGATCAGGCAATGCATGACAAGTTGTTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAGCCAAATATGGCAATTGTCACTAAGGTTGGAAACGTTTTCCAAGAGTTGGGTATGCTCGATAAATACAAAAAACTGATGAAGAAATATCCCCCACCGAAATGGGAATATCGTTACATCAAAGGAAAACGCGTCAAAATACGAGCTAAGTATCTGTCTGAAAATGGTAATTCAAACAATGGTTTAAGTGAGCATGCCAAAATGGAGCATAGTTCAACAAATTCAATAGATGAAGCTGAAATAACTTCCGAAGATTCCAGTCTCGAAGATGATGAGGATATGAGCGAAGATCCAGATGAAATTCTGGAAGATGAACATATGTGGAGCAAATCCAATTTTGAGCATGATTTCATGGGGCTTGGGCAATTGTAA

Protein sequence

MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITSEDSSLEDDEDMSEDPDEILEDEHMWSKSNFEHDFMGLGQL
BLAST of CsGy3G017420.1 vs. NCBI nr
Match: XP_004140747.2 (PREDICTED: pentatricopeptide repeat-containing protein At4g21190 [Cucumis sativus] >KGN57423.1 hypothetical protein Csa_3G184050 [Cucumis sativus])

HSP 1 Score: 623.6 bits (1607), Expect = 3.9e-175
Identity = 338/338 (100.00%), Postives = 338/338 (100.00%), Query Frame = 0

Query: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60
           MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP
Sbjct: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60

Query: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120
           RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Sbjct: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120

Query: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180
           KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE
Sbjct: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180

Query: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240
           SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK
Sbjct: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240

Query: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300
           LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX
Sbjct: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 339
           XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL
Sbjct: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 338

BLAST of CsGy3G017420.1 vs. NCBI nr
Match: XP_008439301.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21190 isoform X2 [Cucumis melo])

HSP 1 Score: 595.1 bits (1533), Expect = 1.5e-166
Identity = 301/338 (89.05%), Postives = 309/338 (91.42%), Query Frame = 0

Query: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60
           M IIFTSMTH LAQATL+SF ASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP
Sbjct: 1   MRIIFTSMTHYLAQATLASFSASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60

Query: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120
           RPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Sbjct: 61  RPRYPRVWKTRKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120

Query: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180
           KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LE
Sbjct: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALVEDGRLDEAEELWNKLFSQYLE 180

Query: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240
           S+PRIFFHKMISLYYD+AMHDKLFEVFADMEELGVQPNMAIVTKVGN+FQELGMLDKY+K
Sbjct: 181 SMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTKVGNIFQELGMLDKYEK 240

Query: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300
           LMKKYPPPKWEYRYIKGKRVKIR KYLSENGNS NGLSE  KMEHSSTNS+DEAEIT   
Sbjct: 241 LMKKYPPPKWEYRYIKGKRVKIRTKYLSENGNSMNGLSEQNKMEHSSTNSLDEAEITSED 300

Query: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 339
                             DEHMWSKSNFEHDFMGLGQL
Sbjct: 301 SSLEDDEEIGKDPDEILEDEHMWSKSNFEHDFMGLGQL 338

BLAST of CsGy3G017420.1 vs. NCBI nr
Match: XP_016899030.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21190 isoform X1 [Cucumis melo])

HSP 1 Score: 564.7 bits (1454), Expect = 2.1e-157
Identity = 281/315 (89.21%), Postives = 289/315 (91.75%), Query Frame = 0

Query: 24  YRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKL 83
           +RMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKL
Sbjct: 40  FRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTRKRIGTISKAAKL 99

Query: 84  VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG 143
           VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG
Sbjct: 100 VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG 159

Query: 144 QGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDKL 203
           QGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LES+PRIFFHKMISLYYD+AMHDKL
Sbjct: 160 QGRTMGSYFTLLNALVEDGRLDEAEELWNKLFSQYLESMPRIFFHKMISLYYDRAMHDKL 219

Query: 204 FEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIR 263
           FEVFADMEELGVQPNMAIVTKVGN+FQELGMLDKY+KLMKKYPPPKWEYRYIKGKRVKIR
Sbjct: 220 FEVFADMEELGVQPNMAIVTKVGNIFQELGMLDKYEKLMKKYPPPKWEYRYIKGKRVKIR 279

Query: 264 AKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXXXXXXXXXXXXXXXXXXXXDEHMW 323
            KYLSENGNS NGLSE  KMEHSSTNS+DEAEIT                     DEHMW
Sbjct: 280 TKYLSENGNSMNGLSEQNKMEHSSTNSLDEAEITSEDSSLEDDEEIGKDPDEILEDEHMW 339

Query: 324 SKSNFEHDFMGLGQL 339
           SKSNFEHDFMGLGQL
Sbjct: 340 SKSNFEHDFMGLGQL 354

BLAST of CsGy3G017420.1 vs. NCBI nr
Match: XP_022945794.1 (pentatricopeptide repeat-containing protein At4g21190 [Cucurbita moschata])

HSP 1 Score: 488.0 bits (1255), Expect = 2.5e-134
Identity = 277/332 (83.43%), Postives = 292/332 (87.95%), Query Frame = 0

Query: 8   MTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRV 67
           M HSL  A+LSS LAS RM TL+Y+FPV SK IESV FS   SSSVVCAAKGPRPRYPRV
Sbjct: 1   MAHSLVPASLSSALASNRMHTLIYSFPVISKGIESVKFSLIASSSVVCAAKGPRPRYPRV 60

Query: 68  WKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQR 127
           WKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE QR
Sbjct: 61  WKTRKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEIQR 120

Query: 128 EWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFF 187
           EWKRIIQLTKWMLSKGQGRTMGSYFTLLNALA DGRLDEAEELWNKLFSQHLES+PRIFF
Sbjct: 121 EWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAADGRLDEAEELWNKLFSQHLESMPRIFF 180

Query: 188 HKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPP 247
           HKMISLYY++ MHDKLFE+FADMEELGVQP+MAIVTK+GNVFQ+LGMLDKY+KL KKYPP
Sbjct: 181 HKMISLYYERDMHDKLFEIFADMEELGVQPSMAIVTKLGNVFQKLGMLDKYEKLKKKYPP 240

Query: 248 PKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXXXXXXXXX 307
           PKWEYRYI+GKRVKIRAK L ENG+SNNG  E  K EHSST  +      XXXXXXXXXX
Sbjct: 241 PKWEYRYIRGKRVKIRAKNLHENGSSNNGSGELDKKEHSSTEELXXXXXXXXXXXXXXXX 300

Query: 308 XXXXXXXXXXXDEHMWSK-SNFEHDFMGLGQL 339
           XXXXXXXXXXX      K S FEHDFMG GQL
Sbjct: 301 XXXXXXXXXXXXXXXXXKESTFEHDFMGFGQL 332

BLAST of CsGy3G017420.1 vs. NCBI nr
Match: XP_022140817.1 (pentatricopeptide repeat-containing protein At4g21190 [Momordica charantia])

HSP 1 Score: 483.4 bits (1243), Expect = 6.2e-133
Identity = 259/336 (77.08%), Postives = 282/336 (83.93%), Query Frame = 0

Query: 3   IIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRP 62
           ++ +SM HS  QA+LSS  AS RM TL+Y+FPV SKRIESV FSW  SS+VVCAAKGPRP
Sbjct: 8   LLRSSMAHSPVQASLSSSSASNRMRTLIYSFPVISKRIESVKFSWSASSTVVCAAKGPRP 67

Query: 63  RYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT 122
           RYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
Sbjct: 68  RYPRVWKTRKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT 127

Query: 123 LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESI 182
           LE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQ+LES+
Sbjct: 128 LEIQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQYLESM 187

Query: 183 PRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLM 242
           PR+FFHKMISLYYDQ MHDKLFEVFADMEELGVQPN  IVT +GNVFQELGM DKY+KL 
Sbjct: 188 PRMFFHKMISLYYDQGMHDKLFEVFADMEELGVQPNTEIVTMIGNVFQELGMFDKYEKLK 247

Query: 243 KKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXXXX 302
           KKYPP KWEYRY+KGKRV+IRAKYL+E GNSNNG SE  + + SS   ++EAE T     
Sbjct: 248 KKYPPLKWEYRYVKGKRVRIRAKYLNEYGNSNNGSSELDQKDDSSIKLLEEAE-TVSKDS 307

Query: 303 XXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 339
                    XXXXXXX         FE++FMG G+L
Sbjct: 308 SLEDEEMREXXXXXXXXXXXXXXXXFEYNFMGYGRL 342

BLAST of CsGy3G017420.1 vs. TAIR10
Match: AT4G21190.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 365.9 bits (938), Expect = 2.6e-101
Identity = 180/256 (70.31%), Postives = 217/256 (84.77%), Query Frame = 0

Query: 26  MLTLVYTFP---VTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAK 85
           ML+L Y+ P   + ++   +  F+  P++ VVCAA+GPRPR PRVWKT+KRIGTISKAAK
Sbjct: 1   MLSLRYSLPYLLLQTRESSTKLFTKKPNNVVVCAARGPRPRSPRVWKTRKRIGTISKAAK 60

Query: 86  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSK 145
           ++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  LE+++EWK+IIQ+TKWMLSK
Sbjct: 61  MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120

Query: 146 GQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDK 205
           GQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY + MH K
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180

Query: 206 LFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKI 265
           LFEVFADMEELGV+PN+AIV+ VG VF +L M DKY+KLMKKYPPP+WE+RYIKG+RVK+
Sbjct: 181 LFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYPPPQWEFRYIKGRRVKV 240

Query: 266 RAKYLSENGNSNNGLS 279
           +AK L+E      GLS
Sbjct: 241 KAKQLNELSEGEGGLS 256

BLAST of CsGy3G017420.1 vs. TAIR10
Match: AT4G18975.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 185.3 bits (469), Expect = 6.4e-47
Identity = 96/204 (47.06%), Postives = 131/204 (64.22%), Query Frame = 0

Query: 67  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQ 126
           +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  +
Sbjct: 85  LWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKR 144

Query: 127 REWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIF 186
            +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DEAE LWN +   H  SIPR  
Sbjct: 145 SQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRL 204

Query: 187 FHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYP 246
           F +MI+LY    +HDK+ EVFADMEEL V P+     +V   F+EL   +  K ++++Y 
Sbjct: 205 FARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKLILRRY- 264

Query: 247 PPKWEYRYIKGKRVKIRAKYLSEN 271
             +++Y Y  G+RV+++ +Y SE+
Sbjct: 265 LSEYKYIYFNGERVRVK-RYFSED 286

BLAST of CsGy3G017420.1 vs. TAIR10
Match: AT1G04590.2 (BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4))

HSP 1 Score: 124.0 bits (310), Expect = 1.7e-28
Identity = 68/185 (36.76%), Postives = 110/185 (59.46%), Query Frame = 0

Query: 65  PRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE 124
           PR  +  + I    K   LV+ +  + + KE VYGALD+++AWE  FP+ ++K  + +LE
Sbjct: 131 PRKHQIGENIPKKDKIKFLVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLE 190

Query: 125 NQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPR 184
            + +W R++Q+ KW+LSKGQG TMG+Y  L+ AL  D R +EA  +W K     L S+P 
Sbjct: 191 KEHQWHRMVQVIKWILSKGQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPW 250

Query: 185 IFFHKMISLYYDQAMHDKLFEV---FADMEELGVQ-PNMAIVTKVGNVFQELGMLDKYKK 244
               +M+ +Y+   M  +L +V   F D+E    + P+  IV  V + ++ LGMLD+ ++
Sbjct: 251 QLCLQMMRIYFRNNMLQELVKVMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKER 310

Query: 245 LMKKY 246
           ++ KY
Sbjct: 311 VVTKY 315

BLAST of CsGy3G017420.1 vs. Swiss-Prot
Match: sp|Q8LG95|PP332_ARATH (Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX=3702 GN=EMB1417 PE=2 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 4.7e-100
Identity = 180/256 (70.31%), Postives = 217/256 (84.77%), Query Frame = 0

Query: 26  MLTLVYTFP---VTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAK 85
           ML+L Y+ P   + ++   +  F+  P++ VVCAA+GPRPR PRVWKT+KRIGTISKAAK
Sbjct: 1   MLSLRYSLPYLLLQTRESSTKLFTKKPNNVVVCAARGPRPRSPRVWKTRKRIGTISKAAK 60

Query: 86  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSK 145
           ++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  LE+++EWK+IIQ+TKWMLSK
Sbjct: 61  MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120

Query: 146 GQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDK 205
           GQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY + MH K
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180

Query: 206 LFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKI 265
           LFEVFADMEELGV+PN+AIV+ VG VF +L M DKY+KLMKKYPPP+WE+RYIKG+RVK+
Sbjct: 181 LFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYPPPQWEFRYIKGRRVKV 240

Query: 266 RAKYLSENGNSNNGLS 279
           +AK L+E      GLS
Sbjct: 241 KAKQLNELSEGEGGLS 256

BLAST of CsGy3G017420.1 vs. Swiss-Prot
Match: sp|Q2V3H0|PP322_ARATH (Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g18975 PE=2 SV=2)

HSP 1 Score: 185.3 bits (469), Expect = 1.1e-45
Identity = 96/204 (47.06%), Postives = 131/204 (64.22%), Query Frame = 0

Query: 67  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQ 126
           +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  +
Sbjct: 85  LWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKR 144

Query: 127 REWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIF 186
            +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DEAE LWN +   H  SIPR  
Sbjct: 145 SQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRL 204

Query: 187 FHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYP 246
           F +MI+LY    +HDK+ EVFADMEEL V P+     +V   F+EL   +  K ++++Y 
Sbjct: 205 FARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKLILRRY- 264

Query: 247 PPKWEYRYIKGKRVKIRAKYLSEN 271
             +++Y Y  G+RV+++ +Y SE+
Sbjct: 265 LSEYKYIYFNGERVRVK-RYFSED 286

BLAST of CsGy3G017420.1 vs. TrEMBL
Match: tr|A0A0A0L6Q9|A0A0A0L6Q9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G184050 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 2.6e-175
Identity = 338/338 (100.00%), Postives = 338/338 (100.00%), Query Frame = 0

Query: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60
           MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP
Sbjct: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60

Query: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120
           RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Sbjct: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120

Query: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180
           KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE
Sbjct: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180

Query: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240
           SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK
Sbjct: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240

Query: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300
           LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX
Sbjct: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 339
           XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL
Sbjct: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 338

BLAST of CsGy3G017420.1 vs. TrEMBL
Match: tr|A0A1S3AY30|A0A1S3AY30_CUCME (pentatricopeptide repeat-containing protein At4g21190 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484127 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 9.7e-167
Identity = 301/338 (89.05%), Postives = 309/338 (91.42%), Query Frame = 0

Query: 1   MLIIFTSMTHSLAQATLSSFLASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60
           M IIFTSMTH LAQATL+SF ASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP
Sbjct: 1   MRIIFTSMTHYLAQATLASFSASYRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGP 60

Query: 61  RPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120
           RPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Sbjct: 61  RPRYPRVWKTRKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL 120

Query: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLE 180
           KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LE
Sbjct: 121 KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALVEDGRLDEAEELWNKLFSQYLE 180

Query: 181 SIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKK 240
           S+PRIFFHKMISLYYD+AMHDKLFEVFADMEELGVQPNMAIVTKVGN+FQELGMLDKY+K
Sbjct: 181 SMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTKVGNIFQELGMLDKYEK 240

Query: 241 LMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXX 300
           LMKKYPPPKWEYRYIKGKRVKIR KYLSENGNS NGLSE  KMEHSSTNS+DEAEIT   
Sbjct: 241 LMKKYPPPKWEYRYIKGKRVKIRTKYLSENGNSMNGLSEQNKMEHSSTNSLDEAEITSED 300

Query: 301 XXXXXXXXXXXXXXXXXXDEHMWSKSNFEHDFMGLGQL 339
                             DEHMWSKSNFEHDFMGLGQL
Sbjct: 301 SSLEDDEEIGKDPDEILEDEHMWSKSNFEHDFMGLGQL 338

BLAST of CsGy3G017420.1 vs. TrEMBL
Match: tr|A0A1S4DTK8|A0A1S4DTK8_CUCME (pentatricopeptide repeat-containing protein At4g21190 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484127 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 1.4e-157
Identity = 281/315 (89.21%), Postives = 289/315 (91.75%), Query Frame = 0

Query: 24  YRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKL 83
           +RMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKL
Sbjct: 40  FRMLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTRKRIGTISKAAKL 99

Query: 84  VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG 143
           VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG
Sbjct: 100 VDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKG 159

Query: 144 QGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDKL 203
           QGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LES+PRIFFHKMISLYYD+AMHDKL
Sbjct: 160 QGRTMGSYFTLLNALVEDGRLDEAEELWNKLFSQYLESMPRIFFHKMISLYYDRAMHDKL 219

Query: 204 FEVFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIR 263
           FEVFADMEELGVQPNMAIVTKVGN+FQELGMLDKY+KLMKKYPPPKWEYRYIKGKRVKIR
Sbjct: 220 FEVFADMEELGVQPNMAIVTKVGNIFQELGMLDKYEKLMKKYPPPKWEYRYIKGKRVKIR 279

Query: 264 AKYLSENGNSNNGLSEHAKMEHSSTNSIDEAEITXXXXXXXXXXXXXXXXXXXXXDEHMW 323
            KYLSENGNS NGLSE  KMEHSSTNS+DEAEIT                     DEHMW
Sbjct: 280 TKYLSENGNSMNGLSEQNKMEHSSTNSLDEAEITSEDSSLEDDEEIGKDPDEILEDEHMW 339

Query: 324 SKSNFEHDFMGLGQL 339
           SKSNFEHDFMGLGQL
Sbjct: 340 SKSNFEHDFMGLGQL 354

BLAST of CsGy3G017420.1 vs. TrEMBL
Match: tr|A0A2I4G3X3|A0A2I4G3X3_9ROSI (pentatricopeptide repeat-containing protein At4g21190 OS=Juglans regia OX=51240 GN=LOC109004499 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 2.4e-109
Identity = 198/270 (73.33%), Postives = 230/270 (85.19%), Query Frame = 0

Query: 26  MLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVD 85
           MLTL Y+ P+  KR+ES+      S  VVCA+KGPRPRYPRVWKT+KRIGTISK++KL+D
Sbjct: 1   MLTLTYSLPIVIKRLESIKIPKSTSRVVVCASKGPRPRYPRVWKTRKRIGTISKSSKLID 60

Query: 86  CVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQG 145
           C+  LSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEN+ EWKRIIQ+TKWMLSKGQG
Sbjct: 61  CIMKLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENEGEWKRIIQVTKWMLSKGQG 120

Query: 146 RTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDKLFE 205
           RTMGSYFTLLNALAEDGRLDEAEELW KLF ++LES PRIFF KMIS+YY +AMH+K+FE
Sbjct: 121 RTMGSYFTLLNALAEDGRLDEAEELWTKLFKENLESTPRIFFDKMISIYYKRAMHEKMFE 180

Query: 206 VFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIRAK 265
           +FADMEELGV+PN++IV+ VGNVF+ELGM+DKY KL KKYPPPKWEY+YIKGKR+KI+AK
Sbjct: 181 IFADMEELGVRPNVSIVSMVGNVFKELGMMDKYNKLKKKYPPPKWEYQYIKGKRIKIKAK 240

Query: 266 YLSENGNSNNGLSEHAKMEHSSTNSIDEAE 296
           +L E   SN   S   K + +     +EAE
Sbjct: 241 HLDEYHVSNEEASRREKTDQTLNVMHEEAE 270

BLAST of CsGy3G017420.1 vs. TrEMBL
Match: tr|M5X1I2|M5X1I2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G302200 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 4.2e-109
Identity = 200/262 (76.34%), Postives = 228/262 (87.02%), Query Frame = 0

Query: 26  MLTLVYTFPVTSKRIESVNFSWCPSSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVD 85
           MLTL Y+ PV ++R+E +  S   SS V+CAAKGPRPRYPRVWK  KRIGTISK+ KLV+
Sbjct: 1   MLTLTYSLPVFTRRLEFIKISHSRSSVVLCAAKGPRPRYPRVWKANKRIGTISKSIKLVE 60

Query: 86  CVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQG 145
            +KGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQ+EWKRIIQ++KWMLSKGQG
Sbjct: 61  SIKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQKEWKRIIQVSKWMLSKGQG 120

Query: 146 RTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESIPRIFFHKMISLYYDQAMHDKLFE 205
           RTMG+YFTLLNALAEDGR++EAEELW KLFSQ+LES+PR+FF KMIS+YY   +HDK+FE
Sbjct: 121 RTMGTYFTLLNALAEDGRVEEAEELWTKLFSQYLESMPRMFFDKMISIYYRHGIHDKMFE 180

Query: 206 VFADMEELGVQPNMAIVTKVGNVFQELGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIRAK 265
           +FADMEELGVQPN++IVTKVGNVFQELGMLDKY KL +KYPPPKWEYRYIKGKRVKIRA 
Sbjct: 181 IFADMEELGVQPNVSIVTKVGNVFQELGMLDKYHKLKQKYPPPKWEYRYIKGKRVKIRAN 240

Query: 266 YLSENGNSNNGLSEHAKMEHSS 288
           Y  EN  +    S+  +  HSS
Sbjct: 241 Y--ENDGAEKMPSQEKETVHSS 260

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004140747.23.9e-175100.00PREDICTED: pentatricopeptide repeat-containing protein At4g21190 [Cucumis sativu... [more]
XP_008439301.11.5e-16689.05PREDICTED: pentatricopeptide repeat-containing protein At4g21190 isoform X2 [Cuc... [more]
XP_016899030.12.1e-15789.21PREDICTED: pentatricopeptide repeat-containing protein At4g21190 isoform X1 [Cuc... [more]
XP_022945794.12.5e-13483.43pentatricopeptide repeat-containing protein At4g21190 [Cucurbita moschata][more]
XP_022140817.16.2e-13377.08pentatricopeptide repeat-containing protein At4g21190 [Momordica charantia][more]
Match NameE-valueIdentityDescription
AT4G21190.12.6e-10170.31Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G18975.16.4e-4747.06Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G04590.21.7e-2836.76BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) super... [more]
Match NameE-valueIdentityDescription
sp|Q8LG95|PP332_ARATH4.7e-10070.31Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX... [more]
sp|Q2V3H0|PP322_ARATH1.1e-4547.06Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0L6Q9|A0A0A0L6Q9_CUCSA2.6e-175100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G184050 PE=4 SV=1[more]
tr|A0A1S3AY30|A0A1S3AY30_CUCME9.7e-16789.05pentatricopeptide repeat-containing protein At4g21190 isoform X2 OS=Cucumis melo... [more]
tr|A0A1S4DTK8|A0A1S4DTK8_CUCME1.4e-15789.21pentatricopeptide repeat-containing protein At4g21190 isoform X1 OS=Cucumis melo... [more]
tr|A0A2I4G3X3|A0A2I4G3X3_9ROSI2.4e-10973.33pentatricopeptide repeat-containing protein At4g21190 OS=Juglans regia OX=51240 ... [more]
tr|M5X1I2|M5X1I2_PRUPE4.2e-10976.34Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G302200 PE=4 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsGy3G017420CsGy3G017420gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy3G017420.1.five_prime_UTR.1CsGy3G017420.1.five_prime_UTR.1five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy3G017420.1.exon.1CsGy3G017420.1.exon.1exon
CsGy3G017420.1.exon.2CsGy3G017420.1.exon.2exon
CsGy3G017420.1.exon.3CsGy3G017420.1.exon.3exon
CsGy3G017420.1.exon.4CsGy3G017420.1.exon.4exon
CsGy3G017420.1.exon.5CsGy3G017420.1.exon.5exon
CsGy3G017420.1.exon.6CsGy3G017420.1.exon.6exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy3G017420.1.CDS.1CsGy3G017420.1.CDS.1CDS
CsGy3G017420.1.CDS.2CsGy3G017420.1.CDS.2CDS
CsGy3G017420.1.CDS.3CsGy3G017420.1.CDS.3CDS
CsGy3G017420.1.CDS.4CsGy3G017420.1.CDS.4CDS
CsGy3G017420.1.CDS.5CsGy3G017420.1.CDS.5CDS
CsGy3G017420.1.CDS.6CsGy3G017420.1.CDS.6CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy3G017420.1.three_prime_UTR.1CsGy3G017420.1.three_prime_UTR.1three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsGy3G017420.1CsGy3G017420.1-proteinpolypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 73..310
e-value: 4.0E-16
score: 61.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 271..296
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 271..323
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 297..317
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 27..322
NoneNo IPR availablePANTHERPTHR24015:SF964SUBFAMILY NOT NAMEDcoord: 27..322
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 183..217
score: 8.144
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 147..181
score: 8.977