CsGy1G014900 (gene) Cucumber (Gy14) v2

NameCsGy1G014900
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionPentatricopeptide repeat
LocationChr1 : 10806707 .. 10808284 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTCTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATATGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGGTTTTTTATAACCTGGGTGTGAAAAACTCTGTCTCTTGGAACACTATGCTCAATGGTTTTATGAGGAATGGAGAGATTGAGTTGGCCATTCAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAACGGTCTTTTGAAACATGGTTACTCGGAACAAGCATTGGAGTGCTTCCATCAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGGTGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGA

mRNA sequence

ATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTCTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATATGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGGTGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGA

Coding sequence (CDS)

ATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTCTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATATGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGGTGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGA

Protein sequence

MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKMQRSGVAADYVSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD
BLAST of CsGy1G014900 vs. NCBI nr
Match: XP_004139593.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus] >KGN64971.1 hypothetical protein Csa_1G169950 [Cucumis sativus])

HSP 1 Score: 902.1 bits (2330), Expect = 7.6e-259
Identity = 457/525 (87.05%), Postives = 458/525 (87.24%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW
Sbjct: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA
Sbjct: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
           CKYGLDTGHVMVGTALIDMYSKCAQLGHARK+                            
Sbjct: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                  RSGVAADYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSGVAADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI
Sbjct: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK
Sbjct: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 460
           HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV
Sbjct: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. NCBI nr
Match: XP_008458940.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo])

HSP 1 Score: 777.3 bits (2006), Expect = 2.8e-221
Identity = 404/524 (77.10%), Postives = 420/524 (80.15%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSH A+PSQLQ PP   SSIPLSNPTK+NFPRSP SPH NI SKF  NSV PIV W
Sbjct: 1   MSSIPSHIASPSQLQQPP--SSSIPLSNPTKVNFPRSPKSPHCNIFSKFTANSVHPIVQW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSS+ARYC NGQL EAAAEFTRMRLAGVEPNHITFITLLS CADFPSES FFASSLHGYA
Sbjct: 61  TSSIARYCGNGQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSES-FFASSLHGYA 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
           CK+GLDTGHVMVGTALIDMYSKC+QLG A+K+                            
Sbjct: 121 CKFGLDTGHVMVGTALIDMYSKCSQLGLAKKVFDYLGVKNSVSWNTMXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                     V ADYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVVADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALT GLWV+RFVM QEFKDN++ISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTSGLWVNRFVMQQEFKDNVRISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFA NGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VHKI
Sbjct: 301 VGFAFNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TP IEHYGCIVDLYGRAGRLEDA N+IEEMPMKPNEVVLGSLLAACRTHGDV LAERLMK
Sbjct: 361 TPGIEHYGCIVDLYGRAGRLEDASNVIEEMPMKPNEVVLGSLLAACRTHGDVRLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 459
           H+FKLD  GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKK GYSSVEIDGKVHEFV
Sbjct: 421 HIFKLDSVGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKRGYSSVEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. NCBI nr
Match: XP_022142716.1 (pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia])

HSP 1 Score: 729.2 bits (1881), Expect = 8.9e-207
Identity = 374/524 (71.37%), Postives = 410/524 (78.24%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIP++TA   QLQ  P   +SIPL NP  +NFPRS NS +R+ISSK   NS+DPIVLW
Sbjct: 1   MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSS+ARYCRNGQL+EAAAEFT MRLAGVEPNH+T ITLLS CADFPSES +F SSLHGYA
Sbjct: 61  TSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYA 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
            K GLDT HVMVGT+++DMY+KCAQLG AR++                            
Sbjct: 121 RKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNXXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                   SG+  DYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCSGIKPDYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           G LTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCI FARQVF +M+KRTLVSWNSII
Sbjct: 241 GTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VG+A NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMK VH+I
Sbjct: 301 VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
            PRIEHYGCIVDLYGRAGRLEDAL++IE+MPMKPNEVVLGSLLAACRTHGDV+LAERLMK
Sbjct: 361 IPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 459
           HL KLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SSVEIDGKVHEFV
Sbjct: 421 HLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. NCBI nr
Match: XP_023521260.1 (pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 695.3 bits (1793), Expect = 1.4e-196
Identity = 358/524 (68.32%), Postives = 397/524 (75.76%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSS+P+HTA P Q QL  ++      SNP+ LNFPR PNS           N + PIVLW
Sbjct: 1   MSSVPAHTAVPFQFQLQQYSN-----SNPSNLNFPRYPNS----------SNPIKPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSS+ARYCRN QL EAAAEFTRMRLAGVEPNHITFITLLS CADFPS S  F +SLHGY 
Sbjct: 61  TSSIARYCRNDQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYV 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
            K GLDTGHVMVGTALI MY+KCAQLG AR +                            
Sbjct: 121 RKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNXXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                   SG+  DYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCSGIEPDYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GAL+ GLWV+RF+M QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNS+I
Sbjct: 241 GALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMI 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFA+NGFADESLEFF AMQKEGFK DGVSYTGALTACSHAGLVNKGLELFDNMK VH+I
Sbjct: 301 VGFAINGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TPRIEHYGCIVDLY RAGRL++ALN+IE MPMKPNEVVLGSLLAACRTHGDV+LAERL+K
Sbjct: 361 TPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 459
           +LF+LDP GD+ YVLLSNIYAA+G+W+GAN VRRTMKARGVQKKPG+SS+EIDGKVHEFV
Sbjct: 421 YLFELDPGGDSSYVLLSNIYAAVGRWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. NCBI nr
Match: XP_022967078.1 (pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima])

HSP 1 Score: 693.0 bits (1787), Expect = 7.0e-196
Identity = 357/526 (67.87%), Postives = 398/526 (75.67%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFT--PSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIV 60
           MSS+PSHT  P Q QL  ++  PS IP SNP+ L+FPR+PNS           N + PIV
Sbjct: 1   MSSVPSHTGIPFQFQLQQYSNPPSPIPHSNPSNLSFPRTPNS----------SNPIKPIV 60

Query: 61  LWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHG 120
           LWTSS+ARYCRN QL+EAAAEFTRMRLAGVEPNHITFITLLS CADFPS S  F +SLHG
Sbjct: 61  LWTSSIARYCRNAQLAEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHG 120

Query: 121 YACKYGLDTGHVMVGTALIDMYSKCAQLGHARKM-------------------------- 180
           Y  K GLDTGHVMVGTALI MY+KCAQLG AR +                          
Sbjct: 121 YVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLAMKNSVTWNTMXXXXXXXXXXX 180

Query: 181 ----------------------------------------QRSGVAADYVSIIAVLAACA 240
                                                     SG+  DYVSIIAVLAACA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCSGIEPDYVSIIAVLAACA 240

Query: 241 DLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNS 300
           DLGAL+ GLWV+RF+M QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM+K TLVSWNS
Sbjct: 241 DLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMSKHTLVSWNS 300

Query: 301 IIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVH 360
           +IVGFA+NGFADESLEFF AMQKEGF  DGVSYTGALTACSHAGLVNKGLELFDNMK VH
Sbjct: 301 MIVGFAINGFADESLEFFDAMQKEGFMADGVSYTGALTACSHAGLVNKGLELFDNMKRVH 360

Query: 361 KITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERL 420
           +ITPRIEHYGCIVDLY RAGRL++ALN+IE MPMKPNEVVLGSLLAACRTHGDV+LAERL
Sbjct: 361 RITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERL 420

Query: 421 MKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHE 459
           +K+LF+LDP GD+ YVLLSNIYAA+G+W+GAN VRRTMKARGVQKKPG+SS+EIDGKVHE
Sbjct: 421 IKYLFELDPGGDSSYVLLSNIYAAVGRWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHE 480

BLAST of CsGy1G014900 vs. TAIR10
Match: AT1G05750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 468.8 bits (1205), Expect = 3.9e-132
Identity = 240/463 (51.84%), Postives = 303/463 (65.44%), Query Frame = 0

Query: 48  KFNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPS 107
           + N ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNHITFI LLS C DF S
Sbjct: 27  RHNQSTSETTVSWTSRINLLTRNGRLAEAAKEFSDMTLAGVEPNHITFIALLSGCGDFTS 86

Query: 108 ESFFFASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKM--------------- 167
            S      LHGYACK GLD  HVMVGTA+I MYSK  +   AR +               
Sbjct: 87  GSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRFKKARLVFDYMEDKNSVTWNXX 146

Query: 168 ---------------------------------------------------QRSGVAADY 227
                                                              Q SGV  DY
Sbjct: 147 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQISGVKPDY 206

Query: 228 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 287
           V+IIA L AC +LGAL+ GLWVHR+V+ Q+FK+N+++SNSLID+Y RCGC+EFARQVF  
Sbjct: 207 VAIIAALNACTNLGALSFGLWVHRYVLSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFYN 266

Query: 288 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 347
           M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTACSH GLV +G
Sbjct: 267 MEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEKGFKPDAVTFTGALTACSHVGLVEEG 326

Query: 348 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 407
           L  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC 
Sbjct: 327 LRYFQIMKCDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSMPMKPNEVVIGSLLAACS 386

Query: 408 THG-DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPG 444
            HG ++ LAERLMKHL  L+ +  + YV+LSN+YAA GKW+GA+ +RR MK  G++K+PG
Sbjct: 387 NHGNNIVLAERLMKHLTDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKQPG 446

BLAST of CsGy1G014900 vs. TAIR10
Match: AT3G49142.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 303.1 bits (775), Expect = 2.8e-82
Identity = 153/402 (38.06%), Postives = 233/402 (57.96%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V W S +  Y +N +  +A      M    +  +  T  +LL A ++  +E+  +   +
Sbjct: 206 VVSWNSLVVGYAQNQRFDDALEVCREMESVKISHDAGTMASLLPAVSNTTTENVMYVKDM 265

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHA----RKMQRSGVAADYVSIIAVLAACA 176
                K  L + +VM+G     +Y K A    A     +M+  G   D VSI +VL AC 
Sbjct: 266 FFKMGKKSLVSWNVMIG-----VYMKNAMPVEAVELYSRMEADGFEPDAVSITSVLPACG 325

Query: 177 DLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNS 236
           D  AL+LG  +H ++  ++   N+ + N+LIDMY++CGC+E AR VF  M  R +VSW +
Sbjct: 326 DTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCGCLEKARDVFENMKSRDVVSWTA 385

Query: 237 IIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVH 296
           +I  +  +G   +++  F  +Q  G  PD +++   L ACSHAGL+ +G   F  M   +
Sbjct: 386 MISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHY 445

Query: 297 KITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERL 356
           KITPR+EH  C+VDL GRAG++++A   I++M M+PNE V G+LL ACR H D ++    
Sbjct: 446 KITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNERVWGALLGACRVHSDTDIGLLA 505

Query: 357 MKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHE 416
              LF+L PE   YYVLLSNIYA  G+W+   N+R  MK++G++K PG S+VE++  +H 
Sbjct: 506 ADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHT 565

Query: 417 FVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKE 455
           F+ GD  H  +D IY  LD+L  ++K  GYVP S++ L+  E
Sbjct: 566 FLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSESALHDVE 602

BLAST of CsGy1G014900 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 288.9 bits (738), Expect = 5.5e-78
Identity = 150/404 (37.13%), Postives = 234/404 (57.92%), Query Frame = 0

Query: 88  VEPNHITFITLLSACADFPSESFFFASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLG 147
           V P+  T +T++SACA   S S      +H +   +G  + ++ +  ALID+YSKC +L 
Sbjct: 262 VRPDESTMVTVVSACAQ--SGSIELGRQVHLWIDDHGFGS-NLKIVNALIDLYSKCGELE 321

Query: 148 HA-----------------------------------RKMQRSGVAADYVSIIAVLAACA 207
            A                                   ++M RSG   + V+++++L ACA
Sbjct: 322 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 381

Query: 208 DLGALTLGLWVHRFV--MPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSW 267
            LGA+ +G W+H ++    +   +   +  SLIDMY++CG IE A QVF  +  ++L SW
Sbjct: 382 HLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSW 441

Query: 268 NSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKS 327
           N++I GFA++G AD S + F  M+K G +PD +++ G L+ACSH+G+++ G  +F  M  
Sbjct: 442 NAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQ 501

Query: 328 VHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAE 387
            +K+TP++EHYGC++DL G +G  ++A  MI  M M+P+ V+  SLL AC+ HG+V L E
Sbjct: 502 DYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGE 561

Query: 388 RLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKV 447
              ++L K++PE    YVLLSNIYA+ G+W+     R  +  +G++K PG SS+EID  V
Sbjct: 562 SFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVV 621

Query: 448 HEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKE 455
           HEF+ GD +H     IY ML+ +   L+  G+VP +  +L   E
Sbjct: 622 HEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEME 662

BLAST of CsGy1G014900 vs. TAIR10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 286.2 bits (731), Expect = 3.6e-77
Identity = 159/454 (35.02%), Postives = 233/454 (51.32%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V W S +  + +NG   EA   F  M  + VEP+ +T  +++SACA     +      +
Sbjct: 218 VVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL--SAIKVGQEV 277

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKMQRS--------------------- 176
           HG   K       +++  A +DMY+KC+++  AR +  S                     
Sbjct: 278 HGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIXXXXXXXXXXXXXXXXXX 337

Query: 177 ---------------------------------------------GVAADYVSIIAVLAA 236
                                                         V   + S   +L A
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVCPTHYSFANILKA 397

Query: 237 CADLGALTLGLWVHRFVMPQEFK------DNIKISNSLIDMYSRCGCIEFARQVFVKMAK 296
           CADL  L LG+  H  V+   FK      D+I + NSLIDMY +CGC+E    VF KM +
Sbjct: 398 CADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMME 457

Query: 297 RTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL 356
           R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Sbjct: 458 RDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHY 517

Query: 357 FDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHG 416
           F +M     + P  +HY C+VDL GRAG LE+A +MIEEMPM+P+ V+ GSLLAAC+ H 
Sbjct: 518 FSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHR 577

Query: 417 DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSV 439
           ++ L + + + L +++P     YVLLSN+YA +GKW+   NVR++M+  GV K+PG S +
Sbjct: 578 NITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 637

BLAST of CsGy1G014900 vs. TAIR10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 285.0 bits (728), Expect = 8.0e-77
Identity = 150/431 (34.80%), Postives = 232/431 (53.83%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V WT+ ++RY + G  SEA   F  M  +  +PN  TF T+L++C    +        +
Sbjct: 118 VVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSC--IRASGLGLGKQI 177

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHAR-------------------------- 176
           HG   K+  D+ H+ VG++L+DMY+K  Q+  AR                          
Sbjct: 178 HGLIVKWNYDS-HIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGL 237

Query: 177 ---------KMQRSGVAADYVSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNS 236
                    ++   G++ +YV+  ++L A + L  L  G   H  V+ +E      + NS
Sbjct: 238 DEEALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNS 297

Query: 237 LIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKE-GFKP 296
           LIDMYS+CG + +AR++F  M +RT +SWN+++VG++ +G   E LE F  M+ E   KP
Sbjct: 298 LIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKP 357

Query: 297 DGVSYTGALTACSHAGLVNKGLELFDNM-KSVHKITPRIEHYGCIVDLYGRAGRLEDALN 356
           D V+    L+ CSH  + + GL +FD M    +   P  EHYGCIVD+ GRAGR+++A  
Sbjct: 358 DAVTLLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFE 417

Query: 357 MIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGK 416
            I+ MP KP   VLGSLL ACR H  V++ E + + L +++PE    YV+LSN+YA+ G+
Sbjct: 418 FIKRMPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGR 477

Query: 417 WDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKV 451
           W   NNVR  M  + V K+PG S ++ +  +H F A D  H   + + + +  +  ++K 
Sbjct: 478 WADVNNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQ 537

BLAST of CsGy1G014900 vs. Swiss-Prot
Match: sp|Q9MA50|PPR13_ARATH (Pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PDE247 PE=2 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 7.1e-131
Identity = 240/463 (51.84%), Postives = 303/463 (65.44%), Query Frame = 0

Query: 48  KFNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPS 107
           + N ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNHITFI LLS C DF S
Sbjct: 27  RHNQSTSETTVSWTSRINLLTRNGRLAEAAKEFSDMTLAGVEPNHITFIALLSGCGDFTS 86

Query: 108 ESFFFASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKM--------------- 167
            S      LHGYACK GLD  HVMVGTA+I MYSK  +   AR +               
Sbjct: 87  GSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRFKKARLVFDYMEDKNSVTWNXX 146

Query: 168 ---------------------------------------------------QRSGVAADY 227
                                                              Q SGV  DY
Sbjct: 147 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQISGVKPDY 206

Query: 228 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 287
           V+IIA L AC +LGAL+ GLWVHR+V+ Q+FK+N+++SNSLID+Y RCGC+EFARQVF  
Sbjct: 207 VAIIAALNACTNLGALSFGLWVHRYVLSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFYN 266

Query: 288 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 347
           M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTACSH GLV +G
Sbjct: 267 MEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEKGFKPDAVTFTGALTACSHVGLVEEG 326

Query: 348 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 407
           L  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC 
Sbjct: 327 LRYFQIMKCDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSMPMKPNEVVIGSLLAACS 386

Query: 408 THG-DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPG 444
            HG ++ LAERLMKHL  L+ +  + YV+LSN+YAA GKW+GA+ +RR MK  G++K+PG
Sbjct: 387 NHGNNIVLAERLMKHLTDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKQPG 446

BLAST of CsGy1G014900 vs. Swiss-Prot
Match: sp|P0C899|PP271_ARATH (Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H77 PE=3 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 5.1e-81
Identity = 153/402 (38.06%), Postives = 233/402 (57.96%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V W S +  Y +N +  +A      M    +  +  T  +LL A ++  +E+  +   +
Sbjct: 206 VVSWNSLVVGYAQNQRFDDALEVCREMESVKISHDAGTMASLLPAVSNTTTENVMYVKDM 265

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHA----RKMQRSGVAADYVSIIAVLAACA 176
                K  L + +VM+G     +Y K A    A     +M+  G   D VSI +VL AC 
Sbjct: 266 FFKMGKKSLVSWNVMIG-----VYMKNAMPVEAVELYSRMEADGFEPDAVSITSVLPACG 325

Query: 177 DLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNS 236
           D  AL+LG  +H ++  ++   N+ + N+LIDMY++CGC+E AR VF  M  R +VSW +
Sbjct: 326 DTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCGCLEKARDVFENMKSRDVVSWTA 385

Query: 237 IIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVH 296
           +I  +  +G   +++  F  +Q  G  PD +++   L ACSHAGL+ +G   F  M   +
Sbjct: 386 MISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHY 445

Query: 297 KITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERL 356
           KITPR+EH  C+VDL GRAG++++A   I++M M+PNE V G+LL ACR H D ++    
Sbjct: 446 KITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNERVWGALLGACRVHSDTDIGLLA 505

Query: 357 MKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHE 416
              LF+L PE   YYVLLSNIYA  G+W+   N+R  MK++G++K PG S+VE++  +H 
Sbjct: 506 ADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHT 565

Query: 417 FVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKE 455
           F+ GD  H  +D IY  LD+L  ++K  GYVP S++ L+  E
Sbjct: 566 FLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSESALHDVE 602

BLAST of CsGy1G014900 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 1.0e-76
Identity = 150/404 (37.13%), Postives = 234/404 (57.92%), Query Frame = 0

Query: 88  VEPNHITFITLLSACADFPSESFFFASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLG 147
           V P+  T +T++SACA   S S      +H +   +G  + ++ +  ALID+YSKC +L 
Sbjct: 262 VRPDESTMVTVVSACAQ--SGSIELGRQVHLWIDDHGFGS-NLKIVNALIDLYSKCGELE 321

Query: 148 HA-----------------------------------RKMQRSGVAADYVSIIAVLAACA 207
            A                                   ++M RSG   + V+++++L ACA
Sbjct: 322 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 381

Query: 208 DLGALTLGLWVHRFV--MPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSW 267
            LGA+ +G W+H ++    +   +   +  SLIDMY++CG IE A QVF  +  ++L SW
Sbjct: 382 HLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSW 441

Query: 268 NSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKS 327
           N++I GFA++G AD S + F  M+K G +PD +++ G L+ACSH+G+++ G  +F  M  
Sbjct: 442 NAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQ 501

Query: 328 VHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAE 387
            +K+TP++EHYGC++DL G +G  ++A  MI  M M+P+ V+  SLL AC+ HG+V L E
Sbjct: 502 DYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGE 561

Query: 388 RLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKV 447
              ++L K++PE    YVLLSNIYA+ G+W+     R  +  +G++K PG SS+EID  V
Sbjct: 562 SFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVV 621

Query: 448 HEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKE 455
           HEF+ GD +H     IY ML+ +   L+  G+VP +  +L   E
Sbjct: 622 HEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEME 662

BLAST of CsGy1G014900 vs. Swiss-Prot
Match: sp|Q9SIT7|PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 6.5e-76
Identity = 159/454 (35.02%), Postives = 233/454 (51.32%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V W S +  + +NG   EA   F  M  + VEP+ +T  +++SACA     +      +
Sbjct: 218 VVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL--SAIKVGQEV 277

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKMQRS--------------------- 176
           HG   K       +++  A +DMY+KC+++  AR +  S                     
Sbjct: 278 HGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIXXXXXXXXXXXXXXXXXX 337

Query: 177 ---------------------------------------------GVAADYVSIIAVLAA 236
                                                         V   + S   +L A
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVCPTHYSFANILKA 397

Query: 237 CADLGALTLGLWVHRFVMPQEFK------DNIKISNSLIDMYSRCGCIEFARQVFVKMAK 296
           CADL  L LG+  H  V+   FK      D+I + NSLIDMY +CGC+E    VF KM +
Sbjct: 398 CADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMME 457

Query: 297 RTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL 356
           R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Sbjct: 458 RDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHY 517

Query: 357 FDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHG 416
           F +M     + P  +HY C+VDL GRAG LE+A +MIEEMPM+P+ V+ GSLLAAC+ H 
Sbjct: 518 FSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHR 577

Query: 417 DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSV 439
           ++ L + + + L +++P     YVLLSN+YA +GKW+   NVR++M+  GV K+PG S +
Sbjct: 578 NITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 637

BLAST of CsGy1G014900 vs. Swiss-Prot
Match: sp|Q9LIC3|PP227_ARATH (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.4e-75
Identity = 150/431 (34.80%), Postives = 232/431 (53.83%), Query Frame = 0

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSL 116
           +V WT+ ++RY + G  SEA   F  M  +  +PN  TF T+L++C    +        +
Sbjct: 118 VVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSC--IRASGLGLGKQI 177

Query: 117 HGYACKYGLDTGHVMVGTALIDMYSKCAQLGHAR-------------------------- 176
           HG   K+  D+ H+ VG++L+DMY+K  Q+  AR                          
Sbjct: 178 HGLIVKWNYDS-HIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGL 237

Query: 177 ---------KMQRSGVAADYVSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNS 236
                    ++   G++ +YV+  ++L A + L  L  G   H  V+ +E      + NS
Sbjct: 238 DEEALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNS 297

Query: 237 LIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKE-GFKP 296
           LIDMYS+CG + +AR++F  M +RT +SWN+++VG++ +G   E LE F  M+ E   KP
Sbjct: 298 LIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKP 357

Query: 297 DGVSYTGALTACSHAGLVNKGLELFDNM-KSVHKITPRIEHYGCIVDLYGRAGRLEDALN 356
           D V+    L+ CSH  + + GL +FD M    +   P  EHYGCIVD+ GRAGR+++A  
Sbjct: 358 DAVTLLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFE 417

Query: 357 MIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGK 416
            I+ MP KP   VLGSLL ACR H  V++ E + + L +++PE    YV+LSN+YA+ G+
Sbjct: 418 FIKRMPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGR 477

Query: 417 WDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKV 451
           W   NNVR  M  + V K+PG S ++ +  +H F A D  H   + + + +  +  ++K 
Sbjct: 478 WADVNNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQ 537

BLAST of CsGy1G014900 vs. TrEMBL
Match: tr|A0A0A0LYD6|A0A0A0LYD6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G169950 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 5.0e-259
Identity = 457/525 (87.05%), Postives = 458/525 (87.24%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW
Sbjct: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA
Sbjct: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
           CKYGLDTGHVMVGTALIDMYSKCAQLGHARK+                            
Sbjct: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                  RSGVAADYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSGVAADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI
Sbjct: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK
Sbjct: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 460
           HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV
Sbjct: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. TrEMBL
Match: tr|A0A1S3C956|A0A1S3C956_CUCME (pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498199 PE=4 SV=1)

HSP 1 Score: 777.3 bits (2006), Expect = 1.9e-221
Identity = 404/524 (77.10%), Postives = 420/524 (80.15%), Query Frame = 0

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSH A+PSQLQ PP   SSIPLSNPTK+NFPRSP SPH NI SKF  NSV PIV W
Sbjct: 1   MSSIPSHIASPSQLQQPP--SSSIPLSNPTKVNFPRSPKSPHCNIFSKFTANSVHPIVQW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120
           TSS+ARYC NGQL EAAAEFTRMRLAGVEPNHITFITLLS CADFPSES FFASSLHGYA
Sbjct: 61  TSSIARYCGNGQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSES-FFASSLHGYA 120

Query: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKM---------------------------- 180
           CK+GLDTGHVMVGTALIDMYSKC+QLG A+K+                            
Sbjct: 121 CKFGLDTGHVMVGTALIDMYSKCSQLGLAKKVFDYLGVKNSVSWNTMXXXXXXXXXXXXX 180

Query: 181 --------------------------------------QRSGVAADYVSIIAVLAACADL 240
                                                     V ADYVSIIAVLAACADL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVVADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALT GLWV+RFVM QEFKDN++ISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTSGLWVNRFVMQQEFKDNVRISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFA NGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VHKI
Sbjct: 301 VGFAFNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TP IEHYGCIVDLYGRAGRLEDA N+IEEMPMKPNEVVLGSLLAACRTHGDV LAERLMK
Sbjct: 361 TPGIEHYGCIVDLYGRAGRLEDASNVIEEMPMKPNEVVLGSLLAACRTHGDVRLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 459
           H+FKLD  GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKK GYSSVEIDGKVHEFV
Sbjct: 421 HIFKLDSVGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKRGYSSVEIDGKVHEFV 480

BLAST of CsGy1G014900 vs. TrEMBL
Match: tr|A0A2P4HUZ8|A0A2P4HUZ8_QUESU (Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=58331 GN=CFP56_03618 PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 2.8e-169
Identity = 315/520 (60.58%), Postives = 372/520 (71.54%), Query Frame = 0

Query: 6   SHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTSSLA 65
           ++TATP+QL  PP TP     + P+ L     PN  HR    + N  ++DP+V WTSS+A
Sbjct: 7   TNTATPTQLSSPPKTPKXXLPTQPSSL-----PNHRHRVSLPQTNRPTIDPVVSWTSSIA 66

Query: 66  RYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYACKYGL 125
           R+CRN QL EAAAEF RMRLAGVEPNH+TFITLLSACADFP ES  F SS+H Y  K+GL
Sbjct: 67  RHCRNAQLPEAAAEFQRMRLAGVEPNHVTFITLLSACADFPLESVAFGSSIHAYVRKFGL 126

Query: 126 DTGHVMVGTALIDMYSKCAQLGHARKM--------------------------------- 185
            T +VMVGTAL+DMY+KC ++  AR +                                 
Sbjct: 127 LTNNVMVGTALLDMYAKCRRVDLARMVFDEMGVKNSVSXXXXXXXXXXXXXXXXXXXXXX 186

Query: 186 ---------------------------------QRSGVAADYVSIIAVLAACADLGALTL 245
                                              + V  DYV+IIAVL+ACA+LG L L
Sbjct: 187 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLARVEPDYVTIIAVLSACANLGTLGL 246

Query: 246 GLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAV 305
           GLW++RFVM Q+FKDNI+ISNSLIDMYSRCGCIEFA Q+F KM KRTLVSWNSIIVGFAV
Sbjct: 247 GLWMNRFVMKQDFKDNIRISNSLIDMYSRCGCIEFAHQIFEKMTKRTLVSWNSIIVGFAV 306

Query: 306 NGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITPRIE 365
           NG  +E+LEFF  MQKEGFKPDG+S+TGALTACSHAGLV++GL+ FD MK VH+ITPRIE
Sbjct: 307 NGHVEEALEFFGLMQKEGFKPDGISFTGALTACSHAGLVDEGLKFFDEMKRVHRITPRIE 366

Query: 366 HYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHLFKL 425
           HYGCIVDLY RAGRLEDALN+IE MPMKPNEVVLGSLLAACRTHG+V+LAERLM +L +L
Sbjct: 367 HYGCIVDLYSRAGRLEDALNVIENMPMKPNEVVLGSLLAACRTHGNVSLAERLMNYLSEL 426

Query: 426 DPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAGDNY 460
           DP GD+ YVLL+N+YAAIG+WDGA+ +RRTMKARG+QKKPG SS+EID  +HEFVAGDN 
Sbjct: 427 DPSGDSNYVLLANMYAAIGRWDGASKIRRTMKARGIQKKPGSSSIEIDCSIHEFVAGDNC 486

BLAST of CsGy1G014900 vs. TrEMBL
Match: tr|A0A2N9FND9|A0A2N9FND9_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16654 PE=4 SV=1)

HSP 1 Score: 598.2 bits (1541), Expect = 1.6e-167
Identity = 308/523 (58.89%), Postives = 372/523 (71.13%), Query Frame = 0

Query: 3   SIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTS 62
           +IP++T TP+QL  PP  P S+P +N   L   R  N P            +DP+V WTS
Sbjct: 2   NIPTYTPTPTQLSSPPKPPLSLP-NNRVSL---RQSNKP------------IDPVVSWTS 61

Query: 63  SLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYACK 122
           S+AR+CRNGQL++AAAEF RMRLAG+EPNH+TF+TL S CADFP ES  F SS+H Y  K
Sbjct: 62  SIARHCRNGQLAQAAAEFQRMRLAGIEPNHVTFVTLFSGCADFPLESVVFGSSIHAYVRK 121

Query: 123 YGLDTGHVMVGTALIDMYSKCAQLGHARKM------------------------------ 182
           +GL T +VMVGTAL+DMY+KC+++  AR +                              
Sbjct: 122 FGLLTNNVMVGTALLDMYAKCSRVDLARMVFDEMGVKNSVSWXXXXXXXXXXXXXXXXXX 181

Query: 183 ------------------------------------QRSGVAADYVSIIAVLAACADLGA 242
                                               Q SGV  DYV+IIAVL+ACA+LG 
Sbjct: 182 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQLSGVEPDYVTIIAVLSACANLGT 241

Query: 243 LTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVG 302
           L LGLW++RFVM Q+  DNI+ISNSLIDMYSRCGCIEFARQ+F KM  RTLVSWNSIIVG
Sbjct: 242 LGLGLWMNRFVMKQDMGDNIRISNSLIDMYSRCGCIEFARQIFEKMPTRTLVSWNSIIVG 301

Query: 303 FAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITP 362
           FAVNG+ +E+L+FF  MQKEGFKPDG+S+TGALTACSHAGLV++GL+ FD+MK VH+ITP
Sbjct: 302 FAVNGYVEEALQFFSLMQKEGFKPDGISFTGALTACSHAGLVDEGLQFFDDMKRVHRITP 361

Query: 363 RIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHL 422
           RIEHYGCIVDLY RAGRLEDALN+IE MPMKPNEVVLGSLLAACRTHG+V+LAERLM +L
Sbjct: 362 RIEHYGCIVDLYSRAGRLEDALNVIENMPMKPNEVVLGSLLAACRTHGNVSLAERLMNYL 421

Query: 423 FKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAG 460
           F+LDP GD+ YVLL+NIYAA+G+WDGA  +RR MKARG+QKKPG SS+EID  +HEFVAG
Sbjct: 422 FELDPGGDSNYVLLANIYAALGRWDGAGKIRRKMKARGIQKKPGSSSIEIDCSIHEFVAG 481

BLAST of CsGy1G014900 vs. TrEMBL
Match: tr|A0A2P5BEP9|A0A2P5BEP9_9ROSA (Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_323900 PE=4 SV=1)

HSP 1 Score: 589.7 bits (1519), Expect = 5.5e-165
Identity = 310/526 (58.94%), Postives = 375/526 (71.29%), Query Frame = 0

Query: 3   SIPSHTATPSQLQ---LPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVL 62
           S+P++T TP+QL     PP  P S+P S+         P S   ++S +     ++P++ 
Sbjct: 2   SLPANTVTPAQLSHSTKPP--PLSLP-SSXXXXXXXXKPRSKDLSVSQRDLHKPIEPVIK 61

Query: 63  WTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGY 122
           WTSS+AR CR+G+L+EAA +F++MRLAG+EPNHITFITLLS CADFPSES  F +S+HGY
Sbjct: 62  WTSSIARLCRSGRLAEAAVQFSQMRLAGLEPNHITFITLLSGCADFPSESLKFGASIHGY 121

Query: 123 ACKYGLDTGHVMVGTALIDMYSKCAQLGHARKM--------------------------- 182
           A K GLDT HVMVGTA++DMY+KC  +G AR +                           
Sbjct: 122 ARKLGLDTSHVMVGTAILDMYAKCGLVGFARVVFDELMVKNSVTWXXXXXXXXXXXXXXX 181

Query: 183 ---------------------------------------QRSGVAADYVSIIAVLAACAD 242
                                                    SGV  DYV++IAVLAACAD
Sbjct: 182 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSGVEPDYVTVIAVLAACAD 241

Query: 243 LGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSI 302
           LG + LGLW++RFV+ +EFKDNIKISNSLIDMYSRCGCIEFARQVF KM  RTLVSWNS+
Sbjct: 242 LGTVGLGLWMNRFVINEEFKDNIKISNSLIDMYSRCGCIEFARQVFEKMPIRTLVSWNSM 301

Query: 303 IVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHK 362
           IVGFAVNG A+E+LEFF  MQKEGFKPDGVS+TGALTACSHAGLV++GL  F+NMK VH 
Sbjct: 302 IVGFAVNGLAEEALEFFNLMQKEGFKPDGVSFTGALTACSHAGLVDEGLLFFENMKRVHG 361

Query: 363 ITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLM 422
           I  RIEHYGCIVDLY RAGRLEDALN+IE MPMKPNEVVLGSLLAACRTHGD+ LAERLM
Sbjct: 362 IRHRIEHYGCIVDLYSRAGRLEDALNVIENMPMKPNEVVLGSLLAACRTHGDITLAERLM 421

Query: 423 KHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEF 460
           K+LF+LDP GD+ YVLL+N+YAA+G+WDGAN VR+ MKARG+QK PG+SS+EID  +HEF
Sbjct: 422 KYLFELDPGGDSNYVLLANVYAAVGRWDGANKVRKRMKARGIQKTPGFSSIEIDCNIHEF 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004139593.17.6e-25987.05PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ... [more]
XP_008458940.12.8e-22177.10PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ... [more]
XP_022142716.18.9e-20771.37pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica ... [more]
XP_023521260.11.4e-19668.32pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita ... [more]
XP_022967078.17.0e-19667.87pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
AT1G05750.13.9e-13251.84Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G49142.12.8e-8238.06Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G08070.15.5e-7837.13Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G13600.13.6e-7735.02Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G13770.18.0e-7734.80Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9MA50|PPR13_ARATH7.1e-13151.84Pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Arabidop... [more]
sp|P0C899|PP271_ARATH5.1e-8138.06Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis th... [more]
sp|Q9LN01|PPR21_ARATH1.0e-7637.13Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|Q9SIT7|PP151_ARATH6.5e-7635.02Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
sp|Q9LIC3|PP227_ARATH1.4e-7534.80Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LYD6|A0A0A0LYD6_CUCSA5.0e-25987.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G169950 PE=4 SV=1[more]
tr|A0A1S3C956|A0A1S3C956_CUCME1.9e-22177.10pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Cucumis ... [more]
tr|A0A2P4HUZ8|A0A2P4HUZ8_QUESU2.8e-16960.58Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=5... [more]
tr|A0A2N9FND9|A0A2N9FND9_FAGSY1.6e-16758.89Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16654 PE=4 SV=1[more]
tr|A0A2P5BEP9|A0A2P5BEP9_9ROSA5.5e-16558.94Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G014900.1CsGy1G014900.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 278..425
e-value: 3.8E-19
score: 71.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 43..124
e-value: 1.1E-8
score: 36.5
coord: 152..277
e-value: 2.5E-22
score: 81.1
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 304..386
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 227..273
e-value: 4.0E-7
score: 30.0
coord: 57..103
e-value: 8.0E-8
score: 32.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 263..290
e-value: 0.0025
score: 15.9
coord: 301..324
e-value: 6.6E-4
score: 17.7
coord: 228..261
e-value: 1.3E-5
score: 23.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 200..225
e-value: 0.001
score: 19.1
coord: 300..325
e-value: 9.3E-4
score: 19.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 363..397
score: 6.862
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 195..225
score: 8.254
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 261..291
score: 7.859
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 226..260
score: 11.082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 297..331
score: 8.166
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 56..90
score: 9.668
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 27..48
NoneNo IPR availablePANTHERPTHR24015:SF778SUBFAMILY NOT NAMEDcoord: 154..430
NoneNo IPR availablePANTHERPTHR24015:SF778SUBFAMILY NOT NAMEDcoord: 45..152
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 154..430
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 45..152