CsGy3G021830 (gene) Cucumber (Gy14) v2

Name: CsGy3G021830
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat-containing family protein
Location: Chr3 : 20095296 .. 20099326 (+)
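The location field can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr3 : 20095296 .. 20099326 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr3 : 20095296 .. 20099326 (+)")
# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(chrom, strand, span)  # Chr3 + 4031
```

Under that assumption the gene spans 4031 bp on the forward strand of Chr3.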
The following sequences are available for this feature:

Gene sequence (with intron)

GACCGTTTGAAAGTCCAAACAAGAGAGGAACCTTTGACCTAAAACTACAATCGTATTTAAAATTTCCCCTGTTTTCAATCCGTTGAAAATAGGAGAGGGCGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTTGATTATTTTTAAGTGAATAACTCGACTTCTCAACTTCTATATTCTCCAGAATGTTTCAATTGAATGCGAAAGAGTAGCTGCGCCCTGCAGGGATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTCCTTCAATCTCAATGCCTCTTCAACCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATCAAAAACTGTTCCACCATAAACGAACTGCATGGTTTATGTGCTTCCATGATCAAAACTAATGCAATCCAAGATTGTTTTCTGGTGCATCACTTTATTAGCGCGTCTTTTGCTCTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACTGTGGGTACCCATTTCGTGCTCTACAATGTTATGTACATATGTTGGAAGAATCGAACGTCTTGCCAACTAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCAAAGTTGGAGATACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTGCTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATTCCGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACCAAGGATATAATCTCCTGGACAACCATGATCACTTGTTATTCTCAGAACAAACAATATCAAGATGCATTGGCGATTTATAGTGAGATGAGATTGAATGGGATTATTCCCGATGAGGTAACAATGTCAACTGTTGCTTCAGCTTGCGCCCACATTGGAGCTCTTGAACTAGGAAAAGAGATACATCATTATGTAATGTCTCAGGGGCTTAATCTTGACGTTTATATTGGTTCTGCATTAGTTGATATGTATGCTAAGTGTGGGAGTTTAGATTTGTCTCTTTTGATTTTCTTCAAATTGACAGATAAAAATTTATATTGCTGGAATGCAGTAATTGAAGGACTTGCTGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAG
TATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGATCAACATTAATTGAGGCCATACCGTCATAGTGAGATCGAATGTTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTATATTGAAGTGAAAATTCTCGAGGTCAAGTGCTAAATGACAAAGCTGGGCTACTATAGGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATCAAGAGCCATGGTAATCTTAAGTTTAATCTTACGTTTTGCTAGAATGTCTTCTCATTTTAAGTAAATTCTCAATATTTGTTATAATGAAGTGAATGAGTACTTCTTAGTCTTTTACATGCACAATTGAGCATCTTGATGTTGGCGTTCACCGGAACATAATTCACTTGAATCCTTCTTTCATGTGCGCTCTTCGTACTCCCCATTCTACATGCACAATTGAGAATCTTGATGATGCATTTCTTTTCAATGTTTGGCTTGTATCATTGAGGCTACCTATTCATATTAAGAATATATTTGATGATTAAATTTAATCTACAGTCGAAAAAATGAAGTGAATGGTTACTTCTTAGTTTTTTACGTGCACGATTGAGCATCTTGATGCATTTTCTTTCTCTGTTTAACTTGTCTAATTGAGGCTATCTATTCATATTAAGAATATATTGACCGATTATATCTAACCAACTCTTTTAGTGCTTAAGTTTTTAGATTAGTGATGGTTGAAACTCTATGATATGGTTAAAGTTGTTACCAATCTCAACATCTAAACTCAAACTCGTGCAATAGTTTGACTAATGAGGCCCAGTAACTATGTTATTGATATTGAAATAAAAGAGGAACAACTCTAAAAATCTTTCAACTTAGGCATGTAAAGTTCGAATTCCCCACTCTTGTTGATCGAAGTAGATTCAATCATTTTCTTTATGCTTTCAGTGTACTCGCATTCATTTTGTGATTTTTGCTCCAGCTATCAAAGAAAGGGCAGGGACTAAATTTATAACGAAATAACGTACCACTATTGTACAACCGGAGCTGACTCAGTGAAGTGATTTGATCGAAGAACCCAAGCTAACAGGATATCCTGTAACAGGTCGGTCCTTTTTCCTTCTTTGTACAATACTTTCATATGTTTCAAACACACATGACCATATTGTCTGCATTATGCACTTCTTTAGAAACAAGCTAGGTTTTGAGGAAAATGGAAACCAAGACTTATCAGAATCTTCACGTATAATCTCATTCATGACAATAAATTGCCAATACTGTCTGTACCGTGTGATAATATTTCCAAAACATCCAACTTGTCCCACAGATGGTTTCTTAGGTTGGAACGGCGTGGAGAAAAGAAAACTTCTAGTTTCTTCTTATGTTGTTTTAATTTTCATTTTCTAAACGAAGAACATGCCTAAGTCTTCTCTTTCCTACTTCAATGGTGTTCTTAATCCATCAATATGGCATTGGAGAATGCTAAACACTGGCTAAATTAAATGGAAGTGCGACTCTCTTCTTCAAGCACTAGCCATTTTTATTATTATGGAAGTAGCTTGCTTTGTTCTGACAAGATGCATGGTTTAAGGGATAGAGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCTGTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATGTACTTAATGAAAAATTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTATTATTCCACTATTTTGACTTTCCCTTACCAAGTTGGTGGGGGGAAATCCTACCCAAATCTCGCATTAGTCTTTCTCATACTCTGCCAAAGTTCACTATTCTGAAACTTCATCTCAAACAGAAGTATGCTTTCTGACCTTGCTATAAACTTACATCCACAAAGGTCATAAAAATTTGTAACATAGAGGTGGATTATTATTCAGGTTGATTATAGAGATAGAAGTGGCCTCTGTTTACCAACAAGTTGGTATTGTATTTTCTTGAGCATTTTATCA
CAATAAGAAAGGAAAAAAAAAATAATCAAAA

mRNA sequence

GACCGTTTGAAAGTCCAAACAAGAGAGGAACCTTTGACCTAAAACTACAATCGTATTTAAAATTTCCCCTGTTTTCAATCCGTTGAAAATAGGAGAGGGCGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTTGATTATTTTTAAGTGAATAACTCGACTTCTCAACTTCTATATTCTCCAGAATGTTTCAATTGAATGCGAAAGAGTAGCTGCGCCCTGCAGGGATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAGTATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGATCAACATTAATTGAGGCCATACCGTCATAGTGAGATCGAATGTTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTATATTGAAGTGAAAATTCTCGAGGTCAAGTGCTAAATGACAAAGCTGGGCTACTATAGGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATCAAGAGCCATGCTATCAAAGAAAGGGCAGGGACTAAATTTATAACGAAATAACGTACCACTATTGTACAACCGGAGCTGACTCAGTGAAGTGATTTGATCGAAGAACCCAAGCTAACAGGATATCCTGTAACAGATGGTTTCTTAGGTTGGAACGGCGTGGAGAAAAGAAAACTTCTAGTTTCTTCTTATGTTGTTTTAATTTTCATTTTCTAAACGAAGAACATGCCTAAGTCTTCTCTTTCCTACTTCAATGGTGTTCTTAATCCATCAATATGGCATTGGAGAATGCTAAACACTGGCTAAATTAAATGGAAGTGCGACTCTCTTCTTCAAGCACTAGCCATTTTTATTATTATGGAAGTAGCTTGCTTTGTTCTGACAAGATGCATGGTTTAAGGGATAGAGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCTGTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATGTACTTAATGAAAAATTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTATTATTCCACTATTTTGACTTTCCCTTACCAAGTTGGTGGGGGGAAATCCTACCCAAATCTCGCATTAGTCTTTCTCATACTCTGCCAAAGTTCACTATTCTGAAACTTCATCTCAAACAGAAGTATGCTTTCTGACCTTGCTATAAACTTACATCCACAAAGGTCATAAAAATTTGTAACATAGAGGTGGATTATTATTCAGGTTGATTATAGAGATAGAAGTGGCCTCTGTTTACCAACAAGTTGGTATTGTATTTTCTTGAGCATTTTATCACAATAAGAAAGGAAAAAAAAAATAATCAAAA

Coding sequence (CDS)

ATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAGTATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGA

Protein sequence

MFSFVTTNALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI
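The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below assumes the standard genetic code (reasonable for a nuclear gene, but not stated on the page); `translate` and `CODON_TABLE` are illustrative names. Only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGTTCTCATTTGTGACTACCAACGCTCTTAAACAG"  # first 12 codons of the CDS
cds_end = "GAGGAAATTTGA"                           # last 4 codons, including the stop
print(translate(cds_start))  # MFSFVTTNALKQ
print(translate(cds_end))    # EEI
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, which begins with MFSFVTTNALKQ and ends with EEI.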
BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_011651448.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis sativus] >XP_011651449.1 PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis sativus] >XP_011651450.1 PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis sativus] >KGN57932.1 hypothetical protein Csa_3G395920 [Cucumis sativus])

HSP 1 Score: 416.0 bits (1068), Expect = 8.1e-113
Identity = 200/203 (98.52%), Positives = 201/203 (99.01%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA
Sbjct: 398 IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 457

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE
Sbjct: 458 LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 517

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           KDWMEV HIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQ
Sbjct: 518 KDWMEVAHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQ 577

Query: 201 LKLAGYILEPSVCSTGLLFSEEI 224
           LKLAGYILEPSVCST LLFSEEI
Sbjct: 578 LKLAGYILEPSVCSTALLFSEEI 600
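The identity and positive counts in an HSP header can be recovered from the middle line of each alignment block. The sketch below assumes the usual BLAST convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `hsp_counts` and `percent` are hypothetical helper names, and the middle-line strings must be copied without trimming trailing spaces.

```python
def hsp_counts(midline_segments):
    """Count identities, positives and aligned length from BLAST middle lines."""
    identities = positives = length = 0
    for segment in midline_segments:
        for ch in segment:
            length += 1
            if ch.isalpha():      # identical residue
                identities += 1
                positives += 1
            elif ch == "+":       # conservative substitution
                positives += 1
    return identities, positives, length

def percent(count, total):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * count / total, 2)

# Last two middle-line segments of the first HSP above.
segments = [
    "KDWMEV HIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQ",
    "LKLAGYILEPSVCST LLFSEEI",
]
print(hsp_counts(segments))                   # (80, 81, 83)
print(percent(200, 203), percent(201, 203))   # 98.52 99.01
```

The first two segments of that HSP are fully identical (60 residues each), so adding them gives 200 identities and 201 positives over 203 aligned positions, matching the header.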

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_008447444.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis melo])

HSP 1 Score: 381.7 bits (979), Expect = 1.7e-102
Identity = 184/203 (90.64%), Positives = 192/203 (94.58%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           I+PNGVTFISILSACTHAGLV+EGRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EA
Sbjct: 397 ILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEA 456

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELIKSMEFEPNSIIWGALLNGCKLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEE
Sbjct: 457 LELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEE 516

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           KDWMEV HIR MMKE+GVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQ
Sbjct: 517 KDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQ 576

Query: 201 LKLAGYILEPSVCSTGLLFSEEI 224
           LKLAGYILEPSVCST L+F EEI
Sbjct: 577 LKLAGYILEPSVCSTALVFPEEI 599

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_023554768.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 346.7 bits (888), Expect = 6.0e-92
Identity = 167/192 (86.98%), Positives = 174/192 (90.62%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           IMPNGVTFISILSACTHAGLV EGRSRF SM RDY IRP++ HYGCMVDMLSK+G L+EA
Sbjct: 396 IMPNGVTFISILSACTHAGLVIEGRSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLDEA 455

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELI  MEFEPNSIIWGALLNGCKLHGN EIA+DAV QL ILEP NSGHYNLLVSMYAEE
Sbjct: 456 LELINGMEFEPNSIIWGALLNGCKLHGNSEIAKDAVRQLTILEPKNSGHYNLLVSMYAEE 515

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           K WMEV HIR+MMKE GVEKKYPGSSWIELEG IHQFSASAD HPDSDKIYFILTELDGQ
Sbjct: 516 KHWMEVAHIRAMMKENGVEKKYPGSSWIELEGRIHQFSASADCHPDSDKIYFILTELDGQ 575

Query: 201 LKLAGYILEPSV 213
           LKLAG +LEPSV
Sbjct: 576 LKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_022963550.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita moschata])

HSP 1 Score: 343.6 bits (880), Expect = 5.1e-91
Identity = 164/192 (85.42%), Positives = 175/192 (91.15%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           IMPNGVTFISILSACTHAGLV EGRSRF SM RDY IRP++ HYGCMVDMLSK+G L+EA
Sbjct: 396 IMPNGVTFISILSACTHAGLVIEGRSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLDEA 455

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELI  MEFEPNSIIWGALLNGCKLHGN EIA+DAV+QL +LEP NSGHYNLLVSMYAEE
Sbjct: 456 LELINGMEFEPNSIIWGALLNGCKLHGNSEIAKDAVQQLTVLEPKNSGHYNLLVSMYAEE 515

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           K WM+V HIR+MMKE GVEKKYPGSSWIELEG IHQFSASA+ HPDSDKIYFILTELDGQ
Sbjct: 516 KHWMKVAHIRAMMKENGVEKKYPGSSWIELEGRIHQFSASANCHPDSDKIYFILTELDGQ 575

Query: 201 LKLAGYILEPSV 213
           LKLAG +LEPSV
Sbjct: 576 LKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_022967388.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima])

HSP 1 Score: 343.6 bits (880), Expect = 5.1e-91
Identity = 165/192 (85.94%), Positives = 174/192 (90.62%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           IMPNGVTFISILSACTHAGLV EGRSRFLSM RDY I P++ HYGCMVDMLSK+G L+EA
Sbjct: 396 IMPNGVTFISILSACTHAGLVVEGRSRFLSMIRDYGIHPEVEHYGCMVDMLSKAGLLDEA 455

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELI  MEFEPNSIIWGALLNGCKLHGN EIA+DAV +L ILEP NSGHYNLLVSMYAEE
Sbjct: 456 LELINGMEFEPNSIIWGALLNGCKLHGNSEIAKDAVRRLNILEPKNSGHYNLLVSMYAEE 515

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           K W+EV HIR+MMKE GVEKKYPGSSWIELEG IHQFSASAD HPDSDKIYFILTELDGQ
Sbjct: 516 KHWIEVAHIRAMMKENGVEKKYPGSSWIELEGRIHQFSASADCHPDSDKIYFILTELDGQ 575

Query: 201 LKLAGYILEPSV 213
           LKLAG +LEPSV
Sbjct: 576 LKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. TAIR10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 220.3 bits (560), Expect = 1.2e-57
Identity = 106/201 (52.74%), Positives = 141/201 (70.15%), Query Frame = 0

Query: 9    ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
            ALK   +     + PN VTF+S+ +ACTHAGLVDEGR  + SM  DY I  ++ HYG MV
Sbjct: 1117 ALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSIVSNVEHYGGMV 1176

Query: 69   DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
             + SK+G + EALELI +MEFEPN++IWGALL+GC++H N  IAE A  +LM+LEPMNSG
Sbjct: 1177 HLFSKAGLIYEALELIGNMEFEPNAVIWGALLDGCRIHKNLVIAEIAFNKLMVLEPMNSG 1236

Query: 129  HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
            +Y LLVSMYAE+  W +V  IR  M+E G+EK  PG+S I ++   H F+A+  SH  SD
Sbjct: 1237 YYFLLVSMYAEQNRWRDVAEIRGRMRELGIEKICPGTSSIRIDKRDHLFAAADKSHSASD 1296

Query: 189  KIYFILTELDGQLKLAGYILE 210
            ++  +L E+  Q+ LAGY+ E
Sbjct: 1297 EVCLLLDEIYDQMGLAGYVQE 1317

BLAST of CsGy3G021830 vs. TAIR10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 199.5 bits (506), Expect = 2.2e-51
Identity = 89/192 (46.35%), Positives = 132/192 (68.75%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           + P+  T +++LSAC+H GLVD+GR  F +MT+DY + P+ +HY CMVD+L ++G L +A
Sbjct: 506 LKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDA 565

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
             L+K+M FEP++ IWG LL   ++HGN E+AE A +++  +EP NSG Y LL ++YA  
Sbjct: 566 HNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASS 625

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
             W +V  +R  M++KGV KK PG SWIE++   H FS   + HP+ D+I+  L ELD +
Sbjct: 626 GRWGDVGKLRVRMRDKGV-KKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLR 685

Query: 201 LKLAGYILEPSV 213
           +K AGY+ + SV
Sbjct: 686 MKKAGYVSKTSV 696

BLAST of CsGy3G021830 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 185.3 bits (469), Expect = 4.2e-47
Identity = 86/197 (43.65%), Positives = 128/197 (64.97%), Query Frame = 0

Query: 15  RSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKS 74
           R IG  I P+ +TF+ +LSAC+H+G++D GR  F +MT+DY + P + HYGCM+D+L  S
Sbjct: 462 RKIG--IQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHS 521

Query: 75  GYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLV 134
           G   EA E+I  ME EP+ +IW +LL  CK+HGN E+ E   E L+ +EP N G Y LL 
Sbjct: 522 GLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLS 581

Query: 135 SMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFIL 194
           ++YA    W EV   R+++ +KG+ KK PG S IE++  +H+F      HP + +IY +L
Sbjct: 582 NIYASAGRWNEVAKTRALLNDKGM-KKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 641

Query: 195 TELDGQLKLAGYILEPS 212
            E++  L+ AG++ + S
Sbjct: 642 EEMEVLLEKAGFVPDTS 655

BLAST of CsGy3G021830 vs. TAIR10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 184.9 bits (468), Expect = 5.5e-47
Identity = 86/189 (45.50%), Positives = 129/189 (68.25%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           I PN VTF+ IL AC H GL++EG+S F  M  ++ I P I+HYGCMVD+  +SG + EA
Sbjct: 298 INPNSVTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEA 357

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
              I SM  EP+ +IWG+LL+G ++ G+ +  E A+++L+ L+PMNSG Y LL ++YA+ 
Sbjct: 358 ESFIASMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKT 417

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
             WMEV  IR  M+ KG+  K PG S++E+EG +H+F    +S  +S++IY +L E+  +
Sbjct: 418 GRWMEVKCIRHEMEVKGI-NKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQR 477

Query: 201 LKLAGYILE 210
           L+ AGY+ +
Sbjct: 478 LREAGYVTD 485

BLAST of CsGy3G021830 vs. TAIR10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 184.5 bits (467), Expect = 7.2e-47
Identity = 86/189 (45.50%), Positives = 128/189 (67.72%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           A++  +R     I P+ VTFI++L +C HAGL+DEG   F SM + YD+ P + HYGC+V
Sbjct: 400 AIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLV 459

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
           D+L + G L EA++++++M  EPN +IWGALL  C++H   +IA++ ++ L+ L+P + G
Sbjct: 460 DLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPG 519

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           +Y+LL ++YA  +DW  V  IRS MK  GVEK   G+S +ELE  IH+F+    SHP SD
Sbjct: 520 NYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKP-SGASSVELEDGIHEFTVFDKSHPKSD 579

Query: 189 KIYFILTEL 198
           +IY +L  L
Sbjct: 580 QIYQMLGSL 587

BLAST of CsGy3G021830 vs. Swiss-Prot
Match: sp|Q56X05|PPR15_ARATH (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 220.3 bits (560), Expect = 2.1e-56
Identity = 106/201 (52.74%), Positives = 141/201 (70.15%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           ALK   +     + PN VTF+S+ +ACTHAGLVDEGR  + SM  DY I  ++ HYG MV
Sbjct: 372 ALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSIVSNVEHYGGMV 431

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
            + SK+G + EALELI +MEFEPN++IWGALL+GC++H N  IAE A  +LM+LEPMNSG
Sbjct: 432 HLFSKAGLIYEALELIGNMEFEPNAVIWGALLDGCRIHKNLVIAEIAFNKLMVLEPMNSG 491

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           +Y LLVSMYAE+  W +V  IR  M+E G+EK  PG+S I ++   H F+A+  SH  SD
Sbjct: 492 YYFLLVSMYAEQNRWRDVAEIRGRMRELGIEKICPGTSSIRIDKRDHLFAAADKSHSASD 551

Query: 189 KIYFILTELDGQLKLAGYILE 210
           ++  +L E+  Q+ LAGY+ E
Sbjct: 552 EVCLLLDEIYDQMGLAGYVQE 572

BLAST of CsGy3G021830 vs. Swiss-Prot
Match: sp|Q9SY02|PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 3.9e-50
Identity = 89/192 (46.35%), Positives = 132/192 (68.75%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           + P+  T +++LSAC+H GLVD+GR  F +MT+DY + P+ +HY CMVD+L ++G L +A
Sbjct: 506 LKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDA 565

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
             L+K+M FEP++ IWG LL   ++HGN E+AE A +++  +EP NSG Y LL ++YA  
Sbjct: 566 HNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASS 625

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
             W +V  +R  M++KGV KK PG SWIE++   H FS   + HP+ D+I+  L ELD +
Sbjct: 626 GRWGDVGKLRVRMRDKGV-KKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLR 685

Query: 201 LKLAGYILEPSV 213
           +K AGY+ + SV
Sbjct: 686 MKKAGYVSKTSV 696

BLAST of CsGy3G021830 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 7.6e-46
Identity = 86/197 (43.65%), Positives = 128/197 (64.97%), Query Frame = 0

Query: 15  RSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKS 74
           R IG  I P+ +TF+ +LSAC+H+G++D GR  F +MT+DY + P + HYGCM+D+L  S
Sbjct: 462 RKIG--IQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHS 521

Query: 75  GYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLV 134
           G   EA E+I  ME EP+ +IW +LL  CK+HGN E+ E   E L+ +EP N G Y LL 
Sbjct: 522 GLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLS 581

Query: 135 SMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFIL 194
           ++YA    W EV   R+++ +KG+ KK PG S IE++  +H+F      HP + +IY +L
Sbjct: 582 NIYASAGRWNEVAKTRALLNDKGM-KKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 641

Query: 195 TELDGQLKLAGYILEPS 212
            E++  L+ AG++ + S
Sbjct: 642 EEMEVLLEKAGFVPDTS 655

BLAST of CsGy3G021830 vs. Swiss-Prot
Match: sp|Q683I9|PP295_ARATH (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 9.9e-46
Identity = 86/189 (45.50%), Positives = 129/189 (68.25%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           I PN VTF+ IL AC H GL++EG+S F  M  ++ I P I+HYGCMVD+  +SG + EA
Sbjct: 298 INPNSVTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEA 357

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
              I SM  EP+ +IWG+LL+G ++ G+ +  E A+++L+ L+PMNSG Y LL ++YA+ 
Sbjct: 358 ESFIASMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKT 417

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
             WMEV  IR  M+ KG+  K PG S++E+EG +H+F    +S  +S++IY +L E+  +
Sbjct: 418 GRWMEVKCIRHEMEVKGI-NKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQR 477

Query: 201 LKLAGYILE 210
           L+ AGY+ +
Sbjct: 478 LREAGYVTD 485

BLAST of CsGy3G021830 vs. Swiss-Prot
Match: sp|Q9LS72|PP261_ARATH (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 1.3e-45
Identity = 86/189 (45.50%), Positives = 128/189 (67.72%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           A++  +R     I P+ VTFI++L +C HAGL+DEG   F SM + YD+ P + HYGC+V
Sbjct: 400 AIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLV 459

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
           D+L + G L EA++++++M  EPN +IWGALL  C++H   +IA++ ++ L+ L+P + G
Sbjct: 460 DLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPG 519

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           +Y+LL ++YA  +DW  V  IRS MK  GVEK   G+S +ELE  IH+F+    SHP SD
Sbjct: 520 NYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKP-SGASSVELEDGIHEFTVFDKSHPKSD 579

Query: 189 KIYFILTEL 198
           +IY +L  L
Sbjct: 580 QIYQMLGSL 587

BLAST of CsGy3G021830 vs. TrEMBL
Match: tr|A0A0A0LB99|A0A0A0LB99_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 5.3e-113
Identity = 200/203 (98.52%), Positives = 201/203 (99.01%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA
Sbjct: 398 IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 457

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE
Sbjct: 458 LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 517

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           KDWMEV HIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQ
Sbjct: 518 KDWMEVAHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQ 577

Query: 201 LKLAGYILEPSVCSTGLLFSEEI 224
           LKLAGYILEPSVCST LLFSEEI
Sbjct: 578 LKLAGYILEPSVCSTALLFSEEI 600

BLAST of CsGy3G021830 vs. TrEMBL
Match: tr|A0A1S3BHH1|A0A1S3BHH1_CUCME (pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=3656 GN=LOC103489889 PE=4 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 1.1e-102
Identity = 184/203 (90.64%), Positives = 192/203 (94.58%), Query Frame = 0

Query: 21  IMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEA 80
           I+PNGVTFISILSACTHAGLV+EGRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EA
Sbjct: 397 ILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEA 456

Query: 81  LELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEE 140
           LELIKSMEFEPNSIIWGALLNGCKLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEE
Sbjct: 457 LELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEE 516

Query: 141 KDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQ 200
           KDWMEV HIR MMKE+GVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQ
Sbjct: 517 KDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQ 576

Query: 201 LKLAGYILEPSVCSTGLLFSEEI 224
           LKLAGYILEPSVCST L+F EEI
Sbjct: 577 LKLAGYILEPSVCSTALVFPEEI 599

BLAST of CsGy3G021830 vs. TrEMBL
Match: tr|A0A2N9GP83|A0A2N9GP83_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29033 PE=4 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 2.5e-70
Identity = 133/201 (66.17%), Positives = 158/201 (78.61%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           ALK   R     I PNG+TFIS+LSACTHAGLV+EGR  FLSMT DY I P++ HYGCMV
Sbjct: 131 ALKMFRRMEREKIKPNGITFISVLSACTHAGLVNEGRRMFLSMTDDYSIPPEVGHYGCMV 190

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
           D+LSK+G L +AL+LI+SM+ +PNSIIWGALL GCKLH N EIA+ AV +L+ILEP NSG
Sbjct: 191 DLLSKAGLLKDALKLIRSMKVKPNSIIWGALLGGCKLHRNLEIAQVAVNELLILEPNNSG 250

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           H+NLLV+MYAE   W EV  IR+ MK  GVEKK PGSSWIE+E  IHQF+AS  SHP S+
Sbjct: 251 HHNLLVNMYAEVNRWGEVAKIRAAMKNLGVEKKCPGSSWIEMERKIHQFAASDKSHPASN 310

Query: 189 KIYFILTELDGQLKLAGYILE 210
           +IY +L ELDG+LKLA Y+ E
Sbjct: 311 EIYLLLAELDGKLKLASYVPE 331

BLAST of CsGy3G021830 vs. TrEMBL
Match: tr|A0A2N9FGE3|A0A2N9FGE3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14125 PE=4 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 2.5e-70
Identity = 133/201 (66.17%), Positives = 158/201 (78.61%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           ALK   R     I PNG+TFIS+LSACTHAGLV+EGR  FLSMT DY I P++ HYGCMV
Sbjct: 188 ALKMFRRMEREKIKPNGITFISVLSACTHAGLVNEGRRMFLSMTDDYSIPPEVGHYGCMV 247

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
           D+LSK+G L +AL+LI+SM+ +PNSIIWGALL GCKLH N EIA+ AV +L+ILEP NSG
Sbjct: 248 DLLSKAGLLKDALKLIRSMKVKPNSIIWGALLGGCKLHRNLEIAQVAVNELLILEPNNSG 307

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           H+NLLV+MYAE   W EV  IR+ MK  GVEKK PGSSWIE+E  IHQF+AS  SHP S+
Sbjct: 308 HHNLLVNMYAEVNRWGEVAKIRAAMKNLGVEKKCPGSSWIEMERKIHQFAASDKSHPASN 367

Query: 189 KIYFILTELDGQLKLAGYILE 210
           +IY +L ELDG+LKLA Y+ E
Sbjct: 368 EIYLLLAELDGKLKLASYVPE 388

BLAST of CsGy3G021830 vs. TrEMBL
Match: tr|A0A2N9G7B2|A0A2N9G7B2_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26418 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.1e-69
Identity = 130/201 (64.68%), Positives = 158/201 (78.61%), Query Frame = 0

Query: 9   ALKQLTRSIGNFIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMV 68
           ALK   R     + PNG+TFIS+LSACTHAGLV+EGR  FLSMT DY I P++ HYGCMV
Sbjct: 258 ALKMFRRMEREKMKPNGITFISVLSACTHAGLVNEGRRMFLSMTDDYSIPPEVGHYGCMV 317

Query: 69  DMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSG 128
           D+LSK+G L +AL+LI+SM+ +PNSIIWGAL  GCKLH N EIA+ AV +L+ILEP NSG
Sbjct: 318 DLLSKAGLLKDALKLIRSMKVKPNSIIWGALFGGCKLHRNLEIAQVAVNELLILEPNNSG 377

Query: 129 HYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSD 188
           H+NLLV+MYAE   W EV  +R+ MK  GVEKK PGSSWIE++  IHQF+AS  SHP S+
Sbjct: 378 HHNLLVNMYAEVNRWGEVAKMRAAMKNLGVEKKCPGSSWIEMKRKIHQFAASDKSHPASN 437

Query: 189 KIYFILTELDGQLKLAGYILE 210
           +IY +L ELDG+LKLAGY+ E
Sbjct: 438 EIYLLLAELDGKLKLAGYVPE 458

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name                      E-value    Identity (%)  Description
XP_011651448.1                  8.1e-113   98.52         PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis s...
XP_008447444.1                  1.7e-102   90.64         PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis m...
XP_023554768.1                  6.0e-92    86.98         pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pep...
XP_022963550.1                  5.1e-91    85.42         pentatricopeptide repeat-containing protein At1g06143 [Cucurbita moschata]
XP_022967388.1                  5.1e-91    85.94         pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima]

vs. TAIR10
Match Name                      E-value    Identity (%)  Description
AT1G06150.1                     1.2e-57    52.74         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G02750.1                     2.2e-51    46.35         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1                     4.2e-47    43.65         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G62890.1                     5.5e-47    45.50         Pentatricopeptide repeat (PPR) superfamily protein
AT3G29230.1                     7.2e-47    45.50         Tetratricopeptide repeat (TPR)-like superfamily protein

vs. Swiss-Prot
Match Name                      E-value    Identity (%)  Description
sp|Q56X05|PPR15_ARATH           2.1e-56    52.74         Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX...
sp|Q9SY02|PP301_ARATH           3.9e-50    46.35         Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
sp|Q9LN01|PPR21_ARATH           7.6e-46    43.65         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q683I9|PP295_ARATH           9.9e-46    45.50         Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX...
sp|Q9LS72|PP261_ARATH           1.3e-45    45.50         Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX...

vs. TrEMBL
Match Name                      E-value    Identity (%)  Description
tr|A0A0A0LB99|A0A0A0LB99_CUCSA  5.3e-113   98.52         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1
tr|A0A1S3BHH1|A0A1S3BHH1_CUCME  1.1e-102   90.64         pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=36...
tr|A0A2N9GP83|A0A2N9GP83_FAGSY  2.5e-70    66.17         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29033 PE=4 SV=1
tr|A0A2N9FGE3|A0A2N9FGE3_FAGSY  2.5e-70    66.17         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14125 PE=4 SV=1
tr|A0A2N9G7B2|A0A2N9G7B2_FAGSY  2.1e-69    64.68         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26418 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0043231 intracellular membrane-bounded organelle
cellular_component GO:0005575 cellular_component
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy3G021830.1  CsGy3G021830.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description   Source   Source Term   Source Description
    Alignment

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 26..61    e-value: 0.0026   score: 15.8
    coord: 64..88    e-value: 0.0024   score: 15.9
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 63..88    e-value: 8.0E-4   score: 19.4
    coord: 129..158  e-value: 0.0081   score: 16.2
    coord: 26..53    e-value: 0.28     score: 11.4
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 126..160  score: 9.219
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 60..90    score: 8.243
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 24..59    score: 7.487
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 92..122   score: 5.568
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 6..164    e-value: 4.2E-26  score: 94.0
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 7..192
None  No IPR available  PANTHER  PTHR24015:SF593  SUBFAMILY NOT NAMED
    coord: 7..192
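
The `coord:` fields above give the residue range of each domain hit on the protein. As a rough illustration, the sketch below parses those ranges and reports the length of each PROSITE PPR hit, assuming the coordinates are 1-based and inclusive (not stated on the page). `parse_coord` and `hit_length` are hypothetical helper names.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a field such as 'coord: 26..61'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates in: {text!r}")
    return int(match.group(1)), int(match.group(2))

def hit_length(text):
    """Length of a hit, assuming both endpoints are included."""
    start, end = parse_coord(text)
    return end - start + 1

# The four PROSITE PS51375 (PPR) hits listed above.
prosite_hits = ["coord: 126..160", "coord: 60..90", "coord: 24..59", "coord: 92..122"]

ranges = sorted(parse_coord(hit) for hit in prosite_hits)
lengths = [end - start + 1 for start, end in ranges]
# True when each hit starts after the previous one ends.
non_overlapping = all(prev[1] < nxt[0] for prev, nxt in zip(ranges, ranges[1:]))

print(ranges)           # [(24, 59), (60, 90), (92, 122), (126, 160)]
print(lengths)          # [36, 31, 31, 35]
print(non_overlapping)  # True
```

Under that assumption, the four PROSITE hits cover non-overlapping stretches of 31 to 36 residues between positions 24 and 160.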