CsGy2G016890 (gene) Cucumber (Gy14) v2

Name: CsGy2G016890
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat
Location: Chr2: 27172924..27175194 (+)
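
The length of the feature can be derived from the Location coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome records, but not stated on this page); the function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Return the length in bp of a feature.

    Assumes 1-based, inclusive coordinates (an assumption, not confirmed by the record).
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
length_bp = feature_length(27172924, 27175194)
print(length_bp)
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the assumption matters when comparing against the sequence lengths listed below.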
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCCTTCGCTGGAATTCTTCAATTCTGTACAATTTCTTCATCCAGTCAAGAACCCAATACCCACTTCTTCTTCATCGCTCATTTCACCTGGTCCGTCAATGTGCAACACCAGAAGCGATTGTTTCAGCTTTACTCATCGCTGTTAACTCTTGCCCTTCCATCTCCAATTGCCGGGAAATTCATGCCCGAGTATTCAAATCTTTGCTTTATAGAGATGGCTTCATTGGGGATCAGCTGGTTACTTGTTATAATAAACTGGGCTATGCTGAAGATGCACTGAAGCTGTTTGATGATATGCCTCATAAAGATTTGGTCTCTTGGAACTCACTGATTTCTGGTTTTTCTCGTTGTCTTCATATGAGCCTCACAGCATTTTATACCATGAAGTTTGAGATGTCAGTTAAACCCAATGAGGTCACAATTCTGTCGATGATATCAGCTTGCAATGGAGCTTTGGATGCAGGGAAGTATATTCATGGTTTTGGAATTAAAGTTGGTGGTACTTTGGAAGTTAAGGTTGCTAATTCTCTCATTAACATGTATGGAAAGTCTGGAGATTTAACATCAGCTTGTAGATTGTTTGAGGCCATTCCAGACCCGGATACAGTATCGTGGAATTCAATCATTGCTGCTCAAGTCACTAATGGCTGTGCACGAGAAGGAATTGATTATTTTAATAAAATGAGAAGGCTTGGAATTGAGCAGGATGAAGGAACTATCCTGGCCCTGCTTCAAGCTTGCCTACATTTGGGTGTAGGAAAATTGGCAGAAAGCATTCATGGTTTAATGTTCTGCACTGGTTTTGGCGCAAAGATCACCATAGCAACTGCACTTTTAGATACCTATGCGAAATTGGGAAGATTAAGTGCTTCATATGGCGTCTTTACGGAGGTGGGTTTTGCAGACAGAGTTGCTTGGACCGCCATGCTTGCAGGATATGCTGCTCATGGATTAGGTAGGGAAGCAATCAAGCTTTTCGAGAGCATGGCCAATAAAGGCTTGGAGCCTGATCATGTGACTTTTACTCATTTGCTTAGCGCATGTAGTCATTCAGGGCTAGTCAATGAGGGGAAAAGTTACTTCAATGTGATGTCTGAAGTGTATGGAATTGAGCCCAGGGTAGATCATTATTCATGTATGGTTGATCTACTCGGTCGCTGCGGCCTTTTGAATGATGCTTATGAGGTGATACAAAACATGCCCATGGAGCCTAATGCTGGTGTGTGGGGTGCGCTTCTCGGTGCTTGTAGGGTTCATGGTAACATTGAACTTGGTAAGGAAGTTGCAGAGCATTTGATTAATATGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCCGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTGAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATATAGCTCCATTGAATATGGAAACAAGAACCATCACTTCTTCGTGGGCGATCGATCTCACCCTGAGACGGAGAAGATCTATTCCAAGCTCGAAGAATTGCTCGGAAAAATAAGGAAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAGACGTTGAAGAGGAAGTCAAGGAGGATATGATAAACAAGCATAGCGAGAAGTTAGCCATTGCTTTTGGGCTTTTGGTGAGTAAAGAAGGTGAAGCTTTAATCATAACAAAGAATCTTAGAATTTGTGGAGATTGTCATAGCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCACCATTTCTCTGATGGATTCTGTTCTTGTGCAGATTACTGGTAAGTTTTTACTGTACTTCACAATGGTGCTAATGCGTATTTATACCTTACATAGAGCTAACCCATGTGCATCATAACCATATTCTTTGTTTAGTTTTGCTAATTTAATATGGTTTTCAGTCTGATGTATAAGAGCTTGATTTAGGCTTCTCTTATATTCTTTTATTGTGT
CATTAAAATAAGTATGATCGTAGTTTAAAGAGGCTTGATACAACCTACAAATGTCAGTTCATAAGTTCCTTAGAAAATGACTTTTAATAACTATGAGTTTGGTTCTTAGCTGGTTAGGTACAAAAACTCTTATCATATTGTTTATGACTTCATGTTTTATTTTATCTAGAGTGTTGCCTCTCATTATACACCCCGTGTTAATATGAGATCTTGGTTTTATCTTGACTTTAGTTCTAGAAAAACGATTCCAAATCAAAGTTGGTTGGCATGA

mRNA sequence

ATGCCCCTTCGCTGGAATTCTTCAATTCTGTACAATTTCTTCATCCAGTCAAGAACCCAATACCCACTTCTTCTTCATCGCTCATTTCACCTGGTCCGTCAATGTGCAACACCAGAAGCGATTGTTTCAGCTTTACTCATCGCTGTTAACTCTTGCCCTTCCATCTCCAATTGCCGGGAAATTCATGCCCGAGTATTCAAATCTTTGCTTTATAGAGATGGCTTCATTGGGGATCAGCTGGTTACTTGTTATAATAAACTGGGCTATGCTGAAGATGCACTGAAGCTGTTTGATGATATGCCTCATAAAGATTTGGTCTCTTGGAACTCACTGATTTCTGGTTTTTCTCGTTGTCTTCATATGAGCCTCACAGCATTTTATACCATGAAGTTTGAGATGTCAGTTAAACCCAATGAGGTCACAATTCTGTCGATGATATCAGCTTGCAATGGAGCTTTGGATGCAGGGAAGTATATTCATGGTTTTGGAATTAAAGTTGGTGGTACTTTGGAAGTTAAGGTTGCTAATTCTCTCATTAACATGTATGGAAAGTCTGGAGATTTAACATCAGCTTGTAGATTGTTTGAGGCCATTCCAGACCCGGATACAGTATCGTGGAATTCAATCATTGCTGCTCAAGTCACTAATGGCTGTGCACGAGAAGGAATTGATTATTTTAATAAAATGAGAAGGCTTGGAATTGAGCAGGATGAAGGAACTATCCTGGCCCTGCTTCAAGCTTGCCTACATTTGGGTGTAGGAAAATTGGCAGAAAGCATTCATGGTTTAATGTTCTGCACTGAGCATTTGATTAATATGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCCGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTGAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATATAGCTCCATTGAATATGGAAACAAGAACCATCACTTCTTCGTGGGCGATCGATCTCACCCTGAGACGGAGAAGATCTATTCCAAGCTCGAAGAATTGCTCGGAAAAATAAGGAAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAGACGTTGAAGAGGAAGTCAAGGAGGATATGATAAACAAGCATAGCGAGAAGTTAGCCATTGCTTTTGGGCTTTTGGTGAGTAAAGAAGGTGAAGCTTTAATCATAACAAAGAATCTTAGAATTTGTGGAGATTGTCATAGCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCACCATTTCTCTGATGGATTCTGTTCTTGTGCAGATTACTGTCTGATAAAAACGATTCCAAATCAAAGTTGGTTGGCATGA

Coding sequence (CDS)

ATGCCCCTTCGCTGGAATTCTTCAATTCTGTACAATTTCTTCATCCAGTCAAGAACCCAATACCCACTTCTTCTTCATCGCTCATTTCACCTGGTCCGTCAATGTGCAACACCAGAAGCGATTGTTTCAGCTTTACTCATCGCTGTTAACTCTTGCCCTTCCATCTCCAATTGCCGGGAAATTCATGCCCGAGTATTCAAATCTTTGCTTTATAGAGATGGCTTCATTGGGGATCAGCTGGTTACTTGTTATAATAAACTGGGCTATGCTGAAGATGCACTGAAGCTGTTTGATGATATGCCTCATAAAGATTTGGTCTCTTGGAACTCACTGATTTCTGGTTTTTCTCGTTGTCTTCATATGAGCCTCACAGCATTTTATACCATGAAGTTTGAGATGTCAGTTAAACCCAATGAGGTCACAATTCTGTCGATGATATCAGCTTGCAATGGAGCTTTGGATGCAGGGAAGTATATTCATGGTTTTGGAATTAAAGTTGGTGGTACTTTGGAAGTTAAGGTTGCTAATTCTCTCATTAACATGTATGGAAAGTCTGGAGATTTAACATCAGCTTGTAGATTGTTTGAGGCCATTCCAGACCCGGATACAGTATCGTGGAATTCAATCATTGCTGCTCAAGTCACTAATGGCTGTGCACGAGAAGGAATTGATTATTTTAATAAAATGAGAAGGCTTGGAATTGAGCAGGATGAAGGAACTATCCTGGCCCTGCTTCAAGCTTGCCTACATTTGGGTGTAGGAAAATTGGCAGAAAGCATTCATGGTTTAATGTTCTGCACTGAGCATTTGATTAATATGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCCGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTGAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATATAGCTCCATTGAATATGGAAACAAGAACCATCACTTCTTCGTGGGCGATCGATCTCACCCTGAGACGGAGAAGATCTATTCCAAGCTCGAAGAATTGCTCGGAAAAATAAGGAAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAGACGTTGAAGAGGAAGTCAAGGAGGATATGATAAACAAGCATAGCGAGAAGTTAGCCATTGCTTTTGGGCTTTTGGTGAGTAAAGAAGGTGAAGCTTTAATCATAACAAAGAATCTTAGAATTTGTGGAGATTGTCATAGCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCACCATTTCTCTGATGGATTCTGTTCTTGTGCAGATTACTGTCTGATAAAAACGATTCCAAATCAAAGTTGGTTGGCATGA

Protein sequence

MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMSLTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKLAESIHGLMFCTEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPETEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGEALIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCADYCLIKTIPNQSWLA
BLAST of CsGy2G016890 vs. NCBI nr
Match: XP_004143073.2 (PREDICTED: pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucumis sativus] >KGN62278.1 hypothetical protein Csa_2G348160 [Cucumis sativus])

HSP 1 Score: 836.3 bits (2159), Expect = 5.1e-239
Identity = 444/609 (72.91%), Positives = 446/609 (73.23%), Query Frame = 0

Query: 1   MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE 60
           MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE
Sbjct: 19  MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE 78

Query: 61  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH 120
           IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH
Sbjct: 79  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH 138

Query: 121 MSLTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLIN 180
           MSLTAFYTMKFEMSVKPNEVTILSMISAC+GALDAGKYIHGFGIKVGGTLEVKVANSLIN
Sbjct: 139 MSLTAFYTMKFEMSVKPNEVTILSMISACSGALDAGKYIHGFGIKVGGTLEVKVANSLIN 198

Query: 181 MYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT 240
           MYGKSGDLTSACRLFEAIPDP+TVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT
Sbjct: 199 MYGKSGDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT 258

Query: 241 ILALLQACLHLGVGKLAESIHGLMFCT--------------------------------- 300
           ILALLQACLHLGVGKLAESIHGLMFCT                                 
Sbjct: 259 ILALLQACLHLGVGKLAESIHGLMFCTGFGAKITIATALLDTYAKLGRLSASYGVFTEVG 318

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 319 FADRVAWTAMLAGYAAHGLGREAIKLFESMANKGLEPDHVTFTHLLSACSHSGLVNEGKS 378

Query: 361 ------------------------------------------------------------ 420
                                                                       
Sbjct: 379 YFNVMSEVYGIEPRVDHYSCMVDLLGRCGLLNDAYEVIQNMPMEPNAGVWGALLGACRVH 438

Query: 421 ----------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS 447
                     EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS
Sbjct: 439 GNIELGKEVAEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS 498
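
The Identity and Positives percentages in each HSP header are the reported counts divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the figures from the first hit; the function name `hsp_percent` is hypothetical and the rounding to two decimals is an assumption based on how the values are printed.

```python
def hsp_percent(count: int, alignment_length: int) -> float:
    """Percentage of alignment columns that are identical (or positive).

    alignment_length is the full HSP length, gap columns included.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Values from the first NCBI nr hit (XP_004143073.2): 444 identities and
# 446 positives over an alignment of 609 columns.
print(hsp_percent(444, 609))
print(hsp_percent(446, 609))
```

Because the denominator counts gapped columns, a long gapped region (as in the alignment above) lowers the percentage even when the aligned residues match almost perfectly.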

BLAST of CsGy2G016890 vs. NCBI nr
Match: XP_008445305.2 (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mitochondrial-like [Cucumis melo])

HSP 1 Score: 779.6 bits (2012), Expect = 5.7e-222
Identity = 419/610 (68.69%), Positives = 431/610 (70.66%), Query Frame = 0

Query: 1   MPLRWNSSILYNFFIQSRTQYP-LLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCR 60
           +PLR +SSILYNFFIQSRTQYP LLL RSFHL+R CA  EA+VS LLIAV SC SISNCR
Sbjct: 19  IPLRRDSSILYNFFIQSRTQYPLLLLLRSFHLIRPCAASEALVSDLLIAVKSCTSISNCR 78

Query: 61  EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCL 120
           EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDA KLFDDMPHKDLVSWNSLISGFSRCL
Sbjct: 79  EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDAQKLFDDMPHKDLVSWNSLISGFSRCL 138

Query: 121 HMSLTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLI 180
           HM+LTAFYTMKFEMS+KPNEVTILSMISACNGALDAGKYIHGF IKVGGTLEVKVANSLI
Sbjct: 139 HMTLTAFYTMKFEMSIKPNEVTILSMISACNGALDAGKYIHGFAIKVGGTLEVKVANSLI 198

Query: 181 NMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEG 240
           NMYGKSGDLTSACRLFEAIPDP+TVSWNSIIAAQVTNGCAREGI +FNKMRR GIEQDEG
Sbjct: 199 NMYGKSGDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIXFFNKMRRFGIEQDEG 258

Query: 241 TILALLQACLHLGVGKLAESIHGLMFCT-------------------------------- 300
           TILALLQACLHLGVGKLAESIH LMFCT                                
Sbjct: 259 TILALLQACLHLGVGKLAESIHALMFCTGFGAKITIATALLDTYAKLGRLSASCDVFREV 318

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 319 GFADRVAWTAMLAGYAAHGLGREAIKLFESMVNEGLEPDHVTFTHLLSACSHSGLVNEGK 378

Query: 361 ------------------------------------------------------------ 420
                                                                       
Sbjct: 379 SYFNVMSEVYGIEPRVDHYSCMVDLLGRCGLLNDAYEVIRNMPMEPNAGVWGALLGACRV 438

Query: 421 -----------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYS 447
                      EHLIN+EPLDPRNYIMLSN+YSASRSWKDAAK+RALLKERGLKRTPG S
Sbjct: 439 HGNVELGKEVAEHLINLEPLDPRNYIMLSNIYSASRSWKDAAKMRALLKERGLKRTPGCS 498

BLAST of CsGy2G016890 vs. NCBI nr
Match: XP_023546177.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo] >XP_023546178.1 pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 707.2 bits (1824), Expect = 3.6e-200
Identity = 382/607 (62.93%), Positives = 408/607 (67.22%), Query Frame = 0

Query: 3   LRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIH 62
           LR  SSILYNFF +SRTQYPLLL  SFH +RQC   E +VSAL+IAV SC SIS+CR IH
Sbjct: 21  LRQESSILYNFFNRSRTQYPLLLW-SFHSIRQCVAAEGLVSALVIAVKSCTSISSCRGIH 80

Query: 63  ARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMS 122
           ARV KS LYRDGFIGDQLVTCYNKLGYAEDA K+FDDMP +DLVSWNSLI GFSRCLH++
Sbjct: 81  ARVIKSFLYRDGFIGDQLVTCYNKLGYAEDAQKVFDDMPDRDLVSWNSLICGFSRCLHVT 140

Query: 123 LTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLINMY 182
           L AF TMKFEMSVKPNEVTILSMISACNGALD G+YIHGF IK+G +LEVKV NSLINMY
Sbjct: 141 LKAFCTMKFEMSVKPNEVTILSMISACNGALDVGRYIHGFAIKIGVSLEVKVVNSLINMY 200

Query: 183 GKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTIL 242
           GKSGDLTSACRLFEAIP P+ VSWNSIIAA+VTNGCA EG+  FNKMR  GIE DEGTIL
Sbjct: 201 GKSGDLTSACRLFEAIPYPNIVSWNSIIAARVTNGCAGEGVHCFNKMRMFGIEPDEGTIL 260

Query: 243 ALLQACLHLGVGKLAESIHGLMFCT----------------------------------- 302
           ALLQAC+HLGVGKLAESIHGL+FC+                                   
Sbjct: 261 ALLQACVHLGVGKLAESIHGLIFCSGLGAQITIATALLDLYAKLGRLSASYDVFGEVGCA 320

Query: 303 ------------------------------------------------------------ 362
                                                                       
Sbjct: 321 DRVAWTAMLAGYAAHGLGREAIKLFESMAERGLEPDHVTFTHLLSACSHSGLVREGKRYF 380

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 381 NLMSKVYGIEPRIDHYSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGN 440

Query: 423 --------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIE 447
                   EHLI++EPLDPRNYIMLSNMY+A+RSWKDAAKVRALLKERGLKRTPG+SSIE
Sbjct: 441 IELGKEVAEHLIHLEPLDPRNYIMLSNMYAAARSWKDAAKVRALLKERGLKRTPGWSSIE 500

BLAST of CsGy2G016890 vs. NCBI nr
Match: XP_022961715.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita moschata])

HSP 1 Score: 704.5 bits (1817), Expect = 2.3e-199
Identity = 378/614 (61.56%), Positives = 410/614 (66.78%), Query Frame = 0

Query: 8   SILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIHARVFK 67
           SILYNFF QSRTQYPLLL R FH +RQC   EA+VSAL+IAV SC SIS+CR IHARV K
Sbjct: 26  SILYNFFNQSRTQYPLLL-RPFHPIRQCVAAEALVSALVIAVKSCTSISSCRGIHARVIK 85

Query: 68  SLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMSLTAFY 127
           S LYRDGFIGDQLV+CYNKLGYA DA K+FDDMP +DLVSWNSLI GFSRCLH++L AF 
Sbjct: 86  SSLYRDGFIGDQLVSCYNKLGYAVDAQKVFDDMPDRDLVSWNSLICGFSRCLHVTLKAFC 145

Query: 128 TMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGD 187
           TMKFEMSVKPNEVTILSMISACNGALD G+Y+HGF IK+G +LEVKV NSLINMYGKSGD
Sbjct: 146 TMKFEMSVKPNEVTILSMISACNGALDVGRYVHGFAIKIGVSLEVKVVNSLINMYGKSGD 205

Query: 188 LTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQA 247
           LTSACRLFEAIP P+ VSWNSIIAA VTN CA EG+  FNKMR  G+E DEGTILALLQA
Sbjct: 206 LTSACRLFEAIPYPNIVSWNSIIAAHVTNDCAGEGVHCFNKMRMFGMEPDEGTILALLQA 265

Query: 248 CLHLGVGKLAESIHGLMFCT---------------------------------------- 307
           C+HLGVGKLAESIHGL+FC+                                        
Sbjct: 266 CVHLGVGKLAESIHGLIFCSGLGAQITIATALLDLYAKLGRLSASYDVFGEVGCADRVAW 325

Query: 308 ------------------------------------------------------------ 367
                                                                       
Sbjct: 326 TAMLAGYAAHGLGREAIKLFESMAERGLEPDHVTFTHLLSACSHSGLVREGKRYFNLMSE 385

Query: 368 ------------------------------------------------------------ 427
                                                                       
Sbjct: 386 VYGIEPRIDHYSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGNIELGK 445

Query: 428 ---EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKN 459
              EHLI++EPLDPRNYIMLSNMY+A+RSWKDAAKVRALLKERGLKRTPG+SSIEYGNK 
Sbjct: 446 EVAEHLIHLEPLDPRNYIMLSNMYAAARSWKDAAKVRALLKERGLKRTPGWSSIEYGNKI 505

BLAST of CsGy2G016890 vs. NCBI nr
Match: XP_022997515.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita maxima])

HSP 1 Score: 699.5 bits (1804), Expect = 7.5e-198
Identity = 378/607 (62.27%), Positives = 405/607 (66.72%), Query Frame = 0

Query: 3   LRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIH 62
           LR   SILYNFF QSRTQYPLLL R FH +R C   EA+VSAL+IAV SC SIS+CR IH
Sbjct: 21  LRQECSILYNFFNQSRTQYPLLL-RPFHPIRHCVAAEALVSALVIAVKSCTSISSCRGIH 80

Query: 63  ARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMS 122
           ARV KS LYRDGFIGDQLVTCYNKLGYAEDA K+FDDMP +DLVSWNSLI GFSRCLH++
Sbjct: 81  ARVIKSFLYRDGFIGDQLVTCYNKLGYAEDAQKVFDDMPDRDLVSWNSLICGFSRCLHVT 140

Query: 123 LTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLINMY 182
           L AF TMKFEMSVKPNEVTILSMISACNGALD G+YIHGF IK+G +LEVKV NS INMY
Sbjct: 141 LKAFCTMKFEMSVKPNEVTILSMISACNGALDVGRYIHGFAIKIGVSLEVKVVNSFINMY 200

Query: 183 GKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTIL 242
           GKSGDLTSACRLFEAIP P+ VSWNSIIAA+VTNGCA EG+  FNKMR  G+E DEGTIL
Sbjct: 201 GKSGDLTSACRLFEAIPYPNIVSWNSIIAARVTNGCAGEGVHCFNKMRMFGMEPDEGTIL 260

Query: 243 ALLQACLHLGVGKLAESIHGLMFCT----------------------------------- 302
           ALLQAC+HLGVGKLAESIHGL+FC+                                   
Sbjct: 261 ALLQACVHLGVGKLAESIHGLIFCSGLGAQIAIATALLDLYAKLGRLSASYDVFGEVGCA 320

Query: 303 ------------------------------------------------------------ 362
                                                                       
Sbjct: 321 DRVAWTAMLAGYAAHGLGREAIKLFENMAERGLEPDHVTFTHLLSACSHSGLVREGKRYF 380

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 381 NLMSEVYGIEPRIDHYSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGN 440

Query: 423 --------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIE 447
                   EHLI++EPLDPRNYIMLSNMY+A+ SWKDAAKVRALLKERGLKRTPG+SSIE
Sbjct: 441 IELGKEVAEHLIHLEPLDPRNYIMLSNMYAAACSWKDAAKVRALLKERGLKRTPGWSSIE 500

BLAST of CsGy2G016890 vs. TAIR10
Match: AT5G40410.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 397.1 bits (1019), Expect = 1.4e-110
Identity = 238/580 (41.03%), Positives = 301/580 (51.90%), Query Frame = 0

Query: 39  EAIVSALLIAVNSCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFD 98
           +A VS+L+ AV SC SI  CR +H +V KS+ YR GFIGDQLV CY +LG+   A KLFD
Sbjct: 31  DANVSSLIAAVKSCVSIELCRLLHCKVVKSVSYRHGFIGDQLVGCYLRLGHDVCAEKLFD 90

Query: 99  DMPHKDLVSWNSLISGFS------RCLHMSLTAFYTMKFEMSVKPNEVTILSMISAC--N 158
           +MP +DLVSWNSLISG+S      +C  +       M  E+  +PNEVT LSMISAC   
Sbjct: 91  EMPERDLVSWNSLISGYSGRGYLGKCFEV---LSRMMISEVGFRPNEVTFLSMISACVYG 150

Query: 159 GALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSII 218
           G+ + G+ IHG  +K G   EVKV N+ IN YGK+GDLTS+C+LFE +   + VSWN++I
Sbjct: 151 GSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYGKTGDLTSSCKLFEDLSIKNLVSWNTMI 210

Query: 219 AAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKLAESIHGLMF----- 278
              + NG A +G+ YFN  RR+G E D+ T LA+L++C  +GV +LA+ IHGL+      
Sbjct: 211 VIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLAVLRSCEDMGVVRLAQGIHGLIMFGGFS 270

Query: 279 ------------------------------------------------------------ 338
                                                                       
Sbjct: 271 GNKCITTALLDLYSKLGRLEDSSTVFHEITSPDSMAWTAMLAAYATHGFGRDAIKHFELM 330

Query: 339 ------------------------------------------------------------ 398
                                                                       
Sbjct: 331 VHYGISPDHVTFTHLLNACSHSGLVEEGKHYFETMSKRYRIDPRLDHYSCMVDLLGRSGL 390

Query: 399 --------------------------------------CTEHLINMEPLDPRNYIMLSNM 447
                                                   E L  +EP D RNY+MLSN+
Sbjct: 391 LQDAYGLIKEMPMEPSSGVWGALLGACRVYKDTQLGTKAAERLFELEPRDGRNYVMLSNI 450

BLAST of CsGy2G016890 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 295.4 bits (755), Expect = 5.9e-80
Identity = 175/477 (36.69%), Positives = 258/477 (54.09%), Query Frame = 0

Query: 43  SALLIAVNSCP---SISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDD 102
           S ++  V++C    SI   R++H  +       +  I + L+  Y+K G  E A  LF+ 
Sbjct: 267 STMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFER 326

Query: 103 MPHKDLVSWNSLISGFSRCLHMSLTAFYTMKFEMSVK----PNEVTILSMISACN--GAL 162
           +P+KD++SWN+LI G++   HM+L     + F+  ++    PN+VT+LS++ AC   GA+
Sbjct: 327 LPYKDVISWNTLIGGYT---HMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAI 386

Query: 163 DAGKYIHGFGIK--VGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIA 222
           D G++IH +  K   G T    +  SLI+MY K GD+ +A ++F +I      SWN++I 
Sbjct: 387 DIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIF 446

Query: 223 AQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKLAESI----------- 282
               +G A    D F++MR++GI+ D+ T + LL AC H G+  L   I           
Sbjct: 447 GFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMT 506

Query: 283 ---------------------------------HGLMFCT------------------EH 342
                                             G+++C+                  E+
Sbjct: 507 PKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAEN 566

Query: 343 LINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFV 402
           LI +EP +P +Y++LSN+Y+++  W + AK RALL ++G+K+ PG SSIE  +  H F +
Sbjct: 567 LIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFII 626

Query: 403 GDRSHPETEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGL 447
           GD+ HP   +IY  LEE+   + KAG+   T  VLQ++EEE KE  +  HSEKLAIAFGL
Sbjct: 627 GDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGL 686

BLAST of CsGy2G016890 vs. TAIR10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 284.6 bits (727), Expect = 1.0e-76
Identity = 169/452 (37.39%), Positives = 238/452 (52.65%), Query Frame = 0

Query: 61  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSR--C 120
           IH    KS       +   L T Y+KL   E A KLFD+ P K L SWN++ISG+++   
Sbjct: 341 IHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGL 400

Query: 121 LHMSLTAFYTMKFEMSVKPNEVTILSMISACN--GALDAGKYIHGFGIKVGGTLEVKVAN 180
              +++ F  M+ +    PN VTI  ++SAC   GAL  GK++H           + V+ 
Sbjct: 401 TEDAISLFREMQ-KSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVST 460

Query: 181 SLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQ 240
           +LI MY K G +  A RLF+ +   + V+WN++I+    +G  +E ++ F +M   GI  
Sbjct: 461 ALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITP 520

Query: 241 DEGTILALLQACLHLGVGKLAESIHGLMF-------------C-------TEHL------ 300
              T L +L AC H G+ K  + I   M              C         HL      
Sbjct: 521 TPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQF 580

Query: 301 ---INMEP------------------------------LDPRN---YIMLSNMYSASRSW 360
              +++EP                              LDP N   +++LSN++SA R++
Sbjct: 581 IEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNY 640

Query: 361 KDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPETEKIYSKLEELLGKIRKA 420
             AA VR   K+R L + PGY+ IE G   H F  GD+SHP+ ++IY KLE+L GK+R+A
Sbjct: 641 PQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREA 700

Query: 421 GYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGEALIITKNLRICGDCHSTA 447
           GY  +TE  L DVEEE +E M+  HSE+LAIAFGL+ ++ G  + I KNLR+C DCH+  
Sbjct: 701 GYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVT 760

BLAST of CsGy2G016890 vs. TAIR10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 273.1 bits (697), Expect = 3.2e-73
Identity = 157/468 (33.55%), Positives = 237/468 (50.64%), Query Frame = 0

Query: 45  LLIAVNSCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKD 104
           L+ AV +   +     IH+ V +S      ++ + L+  Y   G    A K+FD MP KD
Sbjct: 127 LIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKD 186

Query: 105 LVSWNSLISGFS-RCLHMSLTAFYTMKFEMSVKPNEVTILSMISACN--GALDAGKYIHG 164
           LV+WNS+I+GF+         A YT      +KP+  TI+S++SAC   GAL  GK +H 
Sbjct: 187 LVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHV 246

Query: 165 FGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCARE 224
           + IKVG T  +  +N L+++Y + G +  A  LF+ + D ++VSW S+I     NG  +E
Sbjct: 247 YMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKE 306

Query: 225 GIDYFNKMRRL-GIEQDEGTILALLQACLHLGVGKL------------------------ 284
            I+ F  M    G+   E T + +L AC                                
Sbjct: 307 AIELFKYMESTEGLLPCEITFVGILYACSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366

Query: 285 ----------------------------------AESIHG----LMFCTEHLINMEPLDP 344
                                             A ++HG      F    ++ +EP   
Sbjct: 367 XXXXXXXXXXXXXXXXXXXXPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHS 426

Query: 345 RNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPETE 404
            +Y++LSNMY++ + W D  K+R  +   G+K+ PG+S +E GN+ H F +GD+SHP+++
Sbjct: 427 GDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSD 486

Query: 405 KIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGEAL 447
            IY+KL+E+ G++R  GY  +   V  DVEEE KE+ +  HSEK+AIAF L+ + E   +
Sbjct: 487 AIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPI 546

BLAST of CsGy2G016890 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.4 bits (690), Expect = 2.0e-72
Identity = 158/470 (33.62%), Positives = 247/470 (52.55%), Query Frame = 0

Query: 54  SISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLIS 113
           + S    IH  V K  L RD F+ + L+  Y++LG  + A+++F  M  +DLV+WN++I+
Sbjct: 420 AFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMIT 479

Query: 114 G--FSRCLHMSLTAFYTMK----------FEMSVKPNEVTILSMISACN--GALDAGKYI 173
           G  FS     +L   + M+            +S+KPN +T+++++ +C    AL  GK I
Sbjct: 480 GYVFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEI 539

Query: 174 HGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCA 233
           H + IK     +V V ++L++MY K G L  + ++F+ IP  + ++WN II A   +G  
Sbjct: 540 HAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNG 599

Query: 234 REGIDYFNKMRRLGIEQDEGTILALLQACLHLGV-------------------------- 293
           +E ID    M   G++ +E T +++  AC H G+                          
Sbjct: 600 QEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYAC 659

Query: 294 --------GKLAES-------------------------IHGLM----FCTEHLINMEPL 353
                   G++ E+                         IH  +       ++LI +EP 
Sbjct: 660 VVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPN 719

Query: 354 DPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPE 413
              +Y++L+N+YS++  W  A +VR  +KE+G+++ PG S IE+G++ H F  GD SHP+
Sbjct: 720 VASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQ 779

Query: 414 TEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGE 447
           +EK+   LE L  ++RK GY   T  VL +VEE+ KE ++  HSEKLAIAFG+L +  G 
Sbjct: 780 SEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGT 839

BLAST of CsGy2G016890 vs. Swiss-Prot
Match: sp|Q9FND6|PP411_ARATH (Pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H15 PE=2 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 2.6e-109
Identity = 238/580 (41.03%), Positives = 301/580 (51.90%), Query Frame = 0

Query: 39  EAIVSALLIAVNSCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFD 98
           +A VS+L+ AV SC SI  CR +H +V KS+ YR GFIGDQLV CY +LG+   A KLFD
Sbjct: 31  DANVSSLIAAVKSCVSIELCRLLHCKVVKSVSYRHGFIGDQLVGCYLRLGHDVCAEKLFD 90

Query: 99  DMPHKDLVSWNSLISGFS------RCLHMSLTAFYTMKFEMSVKPNEVTILSMISAC--N 158
           +MP +DLVSWNSLISG+S      +C  +       M  E+  +PNEVT LSMISAC   
Sbjct: 91  EMPERDLVSWNSLISGYSGRGYLGKCFEV---LSRMMISEVGFRPNEVTFLSMISACVYG 150

Query: 159 GALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSII 218
           G+ + G+ IHG  +K G   EVKV N+ IN YGK+GDLTS+C+LFE +   + VSWN++I
Sbjct: 151 GSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYGKTGDLTSSCKLFEDLSIKNLVSWNTMI 210

Query: 219 AAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKLAESIHGLMF----- 278
              + NG A +G+ YFN  RR+G E D+ T LA+L++C  +GV +LA+ IHGL+      
Sbjct: 211 VIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLAVLRSCEDMGVVRLAQGIHGLIMFGGFS 270

Query: 279 ------------------------------------------------------------ 338
                                                                       
Sbjct: 271 GNKCITTALLDLYSKLGRLEDSSTVFHEITSPDSMAWTAMLAAYATHGFGRDAIKHFELM 330

Query: 339 ------------------------------------------------------------ 398
                                                                       
Sbjct: 331 VHYGISPDHVTFTHLLNACSHSGLVEEGKHYFETMSKRYRIDPRLDHYSCMVDLLGRSGL 390

Query: 399 --------------------------------------CTEHLINMEPLDPRNYIMLSNM 447
                                                   E L  +EP D RNY+MLSN+
Sbjct: 391 LQDAYGLIKEMPMEPSSGVWGALLGACRVYKDTQLGTKAAERLFELEPRDGRNYVMLSNI 450

BLAST of CsGy2G016890 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 1.1e-78
Identity = 175/477 (36.69%), Positives = 258/477 (54.09%), Query Frame = 0

Query: 43  SALLIAVNSCP---SISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDD 102
           S ++  V++C    SI   R++H  +       +  I + L+  Y+K G  E A  LF+ 
Sbjct: 267 STMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFER 326

Query: 103 MPHKDLVSWNSLISGFSRCLHMSLTAFYTMKFEMSVK----PNEVTILSMISACN--GAL 162
           +P+KD++SWN+LI G++   HM+L     + F+  ++    PN+VT+LS++ AC   GA+
Sbjct: 327 LPYKDVISWNTLIGGYT---HMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAI 386

Query: 163 DAGKYIHGFGIK--VGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIA 222
           D G++IH +  K   G T    +  SLI+MY K GD+ +A ++F +I      SWN++I 
Sbjct: 387 DIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIF 446

Query: 223 AQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKLAESI----------- 282
               +G A    D F++MR++GI+ D+ T + LL AC H G+  L   I           
Sbjct: 447 GFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMT 506

Query: 283 ---------------------------------HGLMFCT------------------EH 342
                                             G+++C+                  E+
Sbjct: 507 PKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAEN 566

Query: 343 LINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFV 402
           LI +EP +P +Y++LSN+Y+++  W + AK RALL ++G+K+ PG SSIE  +  H F +
Sbjct: 567 LIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFII 626

Query: 403 GDRSHPETEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGL 447
           GD+ HP   +IY  LEE+   + KAG+   T  VLQ++EEE KE  +  HSEKLAIAFGL
Sbjct: 627 GDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGL 686

BLAST of CsGy2G016890 vs. Swiss-Prot
Match: sp|Q9SUH6|PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 1.9e-75
Identity = 169/452 (37.39%), Positives = 238/452 (52.65%), Query Frame = 0

Query: 61  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSR--C 120
           IH    KS       +   L T Y+KL   E A KLFD+ P K L SWN++ISG+++   
Sbjct: 341 IHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGL 400

Query: 121 LHMSLTAFYTMKFEMSVKPNEVTILSMISACN--GALDAGKYIHGFGIKVGGTLEVKVAN 180
              +++ F  M+ +    PN VTI  ++SAC   GAL  GK++H           + V+ 
Sbjct: 401 TEDAISLFREMQ-KSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVST 460

Query: 181 SLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQ 240
           +LI MY K G +  A RLF+ +   + V+WN++I+    +G  +E ++ F +M   GI  
Sbjct: 461 ALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITP 520

Query: 241 DEGTILALLQACLHLGVGKLAESIHGLMF-------------C-------TEHL------ 300
              T L +L AC H G+ K  + I   M              C         HL      
Sbjct: 521 TPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQF 580

Query: 301 ---INMEP------------------------------LDPRN---YIMLSNMYSASRSW 360
              +++EP                              LDP N   +++LSN++SA R++
Sbjct: 581 IEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNY 640

Query: 361 KDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPETEKIYSKLEELLGKIRKA 420
             AA VR   K+R L + PGY+ IE G   H F  GD+SHP+ ++IY KLE+L GK+R+A
Sbjct: 641 PQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREA 700

Query: 421 GYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGEALIITKNLRICGDCHSTA 447
           GY  +TE  L DVEEE +E M+  HSE+LAIAFGL+ ++ G  + I KNLR+C DCH+  
Sbjct: 701 GYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVT 760

BLAST of CsGy2G016890 vs. Swiss-Prot
Match: sp|A8MQA3|PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 273.1 bits (697), Expect = 5.7e-72
Identity = 157/468 (33.55%), Positives = 237/468 (50.64%), Query Frame = 0

Query: 45  LLIAVNSCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKD 104
           L+ AV +   +     IH+ V +S      ++ + L+  Y   G    A K+FD MP KD
Sbjct: 127 LIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKD 186

Query: 105 LVSWNSLISGFS-RCLHMSLTAFYTMKFEMSVKPNEVTILSMISACN--GALDAGKYIHG 164
           LV+WNS+I+GF+         A YT      +KP+  TI+S++SAC   GAL  GK +H 
Sbjct: 187 LVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHV 246

Query: 165 FGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCARE 224
           + IKVG T  +  +N L+++Y + G +  A  LF+ + D ++VSW S+I     NG  +E
Sbjct: 247 YMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKE 306

Query: 225 GIDYFNKMRRL-GIEQDEGTILALLQACLHLGVGKL------------------------ 284
            I+ F  M    G+   E T + +L AC                                
Sbjct: 307 AIELFKYMESTEGLLPCEITFVGILYACSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366

Query: 285 ----------------------------------AESIHG----LMFCTEHLINMEPLDP 344
                                             A ++HG      F    ++ +EP   
Sbjct: 367 XXXXXXXXXXXXXXXXXXXXPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHS 426

Query: 345 RNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPETE 404
            +Y++LSNMY++ + W D  K+R  +   G+K+ PG+S +E GN+ H F +GD+SHP+++
Sbjct: 427 GDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSD 486

Query: 405 KIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGEAL 447
            IY+KL+E+ G++R  GY  +   V  DVEEE KE+ +  HSEK+AIAF L+ + E   +
Sbjct: 487 AIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPI 546

BLAST of CsGy2G016890 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 270.4 bits (690), Expect = 3.7e-71
Identity = 158/470 (33.62%), Positives = 247/470 (52.55%), Query Frame = 0

Query: 54  SISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLIS 113
           + S    IH  V K  L RD F+ + L+  Y++LG  + A+++F  M  +DLV+WN++I+
Sbjct: 420 AFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMIT 479

Query: 114 G--FSRCLHMSLTAFYTMK----------FEMSVKPNEVTILSMISACN--GALDAGKYI 173
           G  FS     +L   + M+            +S+KPN +T+++++ +C    AL  GK I
Sbjct: 480 GYVFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEI 539

Query: 174 HGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCA 233
           H + IK     +V V ++L++MY K G L  + ++F+ IP  + ++WN II A   +G  
Sbjct: 540 HAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNG 599

Query: 234 REGIDYFNKMRRLGIEQDEGTILALLQACLHLGV-------------------------- 293
           +E ID    M   G++ +E T +++  AC H G+                          
Sbjct: 600 QEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYAC 659

Query: 294 --------GKLAES-------------------------IHGLM----FCTEHLINMEPL 353
                   G++ E+                         IH  +       ++LI +EP 
Sbjct: 660 VVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPN 719

Query: 354 DPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRSHPE 413
              +Y++L+N+YS++  W  A +VR  +KE+G+++ PG S IE+G++ H F  GD SHP+
Sbjct: 720 VASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQ 779

Query: 414 TEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSKEGE 447
           +EK+   LE L  ++RK GY   T  VL +VEE+ KE ++  HSEKLAIAFG+L +  G 
Sbjct: 780 SEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGT 839

BLAST of CsGy2G016890 vs. TrEMBL
Match: tr|A0A0A0LKE6|A0A0A0LKE6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348160 PE=4 SV=1)

HSP 1 Score: 836.3 bits (2159), Expect = 3.4e-239
Identity = 444/609 (72.91%), Positives = 446/609 (73.23%), Query Frame = 0

Query: 1   MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE 60
           MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE
Sbjct: 19  MPLRWNSSILYNFFIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCRE 78

Query: 61  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH 120
           IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH
Sbjct: 79  IHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLH 138

Query: 121 MSLTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLIN 180
           MSLTAFYTMKFEMSVKPNEVTILSMISAC+GALDAGKYIHGFGIKVGGTLEVKVANSLIN
Sbjct: 139 MSLTAFYTMKFEMSVKPNEVTILSMISACSGALDAGKYIHGFGIKVGGTLEVKVANSLIN 198

Query: 181 MYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT 240
           MYGKSGDLTSACRLFEAIPDP+TVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT
Sbjct: 199 MYGKSGDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGT 258

Query: 241 ILALLQACLHLGVGKLAESIHGLMFCT--------------------------------- 300
           ILALLQACLHLGVGKLAESIHGLMFCT                                 
Sbjct: 259 ILALLQACLHLGVGKLAESIHGLMFCTGFGAKITIATALLDTYAKLGRLSASYGVFTEVG 318

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 319 FADRVAWTAMLAGYAAHGLGREAIKLFESMANKGLEPDHVTFTHLLSACSHSGLVNEGKS 378

Query: 361 ------------------------------------------------------------ 420
                                                                       
Sbjct: 379 YFNVMSEVYGIEPRVDHYSCMVDLLGRCGLLNDAYEVIQNMPMEPNAGVWGALLGACRVH 438

Query: 421 ----------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS 447
                     EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS
Sbjct: 439 GNIELGKEVAEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSS 498

BLAST of CsGy2G016890 vs. TrEMBL
Match: tr|A0A1S3BBW7|A0A1S3BBW7_CUCME (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103488375 PE=4 SV=1)

HSP 1 Score: 779.6 bits (2012), Expect = 3.8e-222
Identity = 419/610 (68.69%), Positives = 431/610 (70.66%), Query Frame = 0

Query: 1   MPLRWNSSILYNFFIQSRTQYP-LLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCR 60
           +PLR +SSILYNFFIQSRTQYP LLL RSFHL+R CA  EA+VS LLIAV SC SISNCR
Sbjct: 19  IPLRRDSSILYNFFIQSRTQYPLLLLLRSFHLIRPCAASEALVSDLLIAVKSCTSISNCR 78

Query: 61  EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCL 120
           EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDA KLFDDMPHKDLVSWNSLISGFSRCL
Sbjct: 79  EIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDAQKLFDDMPHKDLVSWNSLISGFSRCL 138

Query: 121 HMSLTAFYTMKFEMSVKPNEVTILSMISACNGALDAGKYIHGFGIKVGGTLEVKVANSLI 180
           HM+LTAFYTMKFEMS+KPNEVTILSMISACNGALDAGKYIHGF IKVGGTLEVKVANSLI
Sbjct: 139 HMTLTAFYTMKFEMSIKPNEVTILSMISACNGALDAGKYIHGFAIKVGGTLEVKVANSLI 198

Query: 181 NMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEG 240
           NMYGKSGDLTSACRLFEAIPDP+TVSWNSIIAAQVTNGCAREGI +FNKMRR GIEQDEG
Sbjct: 199 NMYGKSGDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIXFFNKMRRFGIEQDEG 258

Query: 241 TILALLQACLHLGVGKLAESIHGLMFCT-------------------------------- 300
           TILALLQACLHLGVGKLAESIH LMFCT                                
Sbjct: 259 TILALLQACLHLGVGKLAESIHALMFCTGFGAKITIATALLDTYAKLGRLSASCDVFREV 318

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 319 GFADRVAWTAMLAGYAAHGLGREAIKLFESMVNEGLEPDHVTFTHLLSACSHSGLVNEGK 378

Query: 361 ------------------------------------------------------------ 420
                                                                       
Sbjct: 379 SYFNVMSEVYGIEPRVDHYSCMVDLLGRCGLLNDAYEVIRNMPMEPNAGVWGALLGACRV 438

Query: 421 -----------EHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYS 447
                      EHLIN+EPLDPRNYIMLSN+YSASRSWKDAAK+RALLKERGLKRTPG S
Sbjct: 439 HGNVELGKEVAEHLINLEPLDPRNYIMLSNIYSASRSWKDAAKMRALLKERGLKRTPGCS 498

BLAST of CsGy2G016890 vs. TrEMBL
Match: tr|A0A251MQR1|A0A251MQR1_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G006400 PE=4 SV=1)

HSP 1 Score: 478.4 bits (1230), Expect = 1.8e-131
Identity = 272/600 (45.33%), Positives = 336/600 (56.00%), Query Frame = 0

Query: 14  FIQSRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIHARVFKSLLYRD 73
           F Q R    LL  +S         P++++S L+ AV+SC SIS  R IH+ V KS  Y D
Sbjct: 44  FTQKRFHNALLSPQSSVQFPSHPNPDSLLSYLISAVSSCSSISYSRAIHSCVIKSFNYTD 103

Query: 74  GFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSR--CLHMSLTAFYTMKF 133
           GFIGDQLV+CY +LG A+DA  LFD+MP+KDL+SWNSLISGFSR   +   L AF+ MKF
Sbjct: 104 GFIGDQLVSCYTRLGRADDARNLFDEMPNKDLISWNSLISGFSRRGYVDKCLDAFFRMKF 163

Query: 134 EMSVKPNEVTILSMISAC--NGALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLT 193
           EM ++P+EVT++S+ SAC   GA+D GKYIHGF +K+G   EVK+ NSLIN+YGKSG L 
Sbjct: 164 EMGIEPDEVTLISITSACASRGAVDEGKYIHGFALKLGVLWEVKLVNSLINLYGKSGYLD 223

Query: 194 SACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACL 253
           + CRL E +P  + VSWN +I +   NG A +G+ YFN MRR GI  D+GT+L+LL+AC 
Sbjct: 224 AVCRLVETMPVGNIVSWNLMIVSHAQNGSAADGVGYFNLMRRAGINPDDGTVLSLLEACE 283

Query: 254 HLGVGKLAESIHGLMF-------------------------------------------- 313
           +LG+ KLAE +HGL+                                             
Sbjct: 284 NLGLQKLAEGVHGLITKCGLYANATVATGLLDLYAKLGRLNYSLKVFGEVNNPDKVAWTA 343

Query: 314 ------------------------------------------------------------ 373
                                                                       
Sbjct: 344 MLAGNAVHGNGREAMELFEGMVKVGVEPDHVTFTHLLSACSHSGLVKEGKNYFDIMSQVY 403

Query: 374 -----------------------------------------------------------C 433
                                                                       
Sbjct: 404 GIEPRLDHYSCMVDLLGRSGLLNDAYELIKRMPLKPNSAVWGALFGACRVYGNIELGKEV 463

Query: 434 TEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHH 447
            E L +++P D RNYIMLSNMYSA+  W+DA+KVRAL+KE+GL R PG S IE+GNK H 
Sbjct: 464 AERLFSLDPSDSRNYIMLSNMYSAAGLWRDASKVRALMKEKGLIRNPGCSFIEHGNKIHR 523

BLAST of CsGy2G016890 vs. TrEMBL
Match: tr|A0A2I4DY34|A0A2I4DY34_9ROSI (pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Juglans regia OX=51240 GN=LOC108984521 PE=4 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 2.9e-129
Identity = 276/600 (46.00%), Positives = 328/600 (54.67%), Query Frame = 0

Query: 21  YPLLLHRSFHLV-------RQCATPEAIVSALLIAVNSCPSISNCREIHARVFKSLLYRD 80
           Y LL  R  H         ++   PE +VS+L++A+NSC S+S+CR IHARV KS+ Y D
Sbjct: 23  YKLLGRRKTHTALFPQVFSQRYTIPEPLVSSLILAINSCSSVSHCRAIHARVVKSVNYSD 82

Query: 81  GFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSR--CLHMSLTAFYTMKF 140
           GFIGDQLV+ Y K G  EDA KLFD++P KD+ SWNSLIS FS+   +   + A + MK 
Sbjct: 83  GFIGDQLVSNYIKFGGTEDAQKLFDEIPSKDVASWNSLISRFSQKGFVGKCMCALFRMKL 142

Query: 141 EMSVKPNEVTILSMISACN--GALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLT 200
            M +KPNEVT+LS+I AC   GALD GKYIHGF +K+G  LE+KVANS+INMYGK G L 
Sbjct: 143 VMGMKPNEVTLLSVIPACTDVGALDVGKYIHGFALKMGMLLEIKVANSIINMYGKCGYLD 202

Query: 201 SACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACL 260
           +AC+LFEA+P  + VSWNS+I     NG   EG+  FN MR  G+E DEGT++ALLQAC 
Sbjct: 203 AACKLFEAMPIRNIVSWNSMIMVYNQNGFPEEGVGCFNLMRLAGVEPDEGTVVALLQACE 262

Query: 261 HLGVGKLAESIHGLMF-------------------------------------------- 320
            LGVGKLAE IHG++F                                            
Sbjct: 263 DLGVGKLAEGIHGVIFSRCLNANERIATALLNLYAKLGRLNASHKVFGEMINPDRVAWTA 322

Query: 321 ------------------------------------------------------------ 380
                                                                       
Sbjct: 323 MLAGYAVHGCGRKAIELFESMVKEGLVPDHVTFTHLLNACSHSGFVKEGKYYFKIMSEVY 382

Query: 381 -----------------------------------------------------------C 440
                                                                       
Sbjct: 383 RIEARMDHYSCMVDLLGRSGLLSNAYDLIIQMPMEPNSGVWGALLSACRIYGNIELGKEA 442

Query: 441 TEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHH 447
            E LI ++P DPRNYIMLSN+Y A   WKDA+KVR L+K+RG+ R PG S IE GNK H 
Sbjct: 443 AERLIALDPSDPRNYIMLSNIYCAGHLWKDASKVRTLMKDRGVIRNPGCSIIEQGNKIHR 502

BLAST of CsGy2G016890 vs. TrEMBL
Match: tr|M5VWG5|M5VWG5_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa021080mg PE=4 SV=1)

HSP 1 Score: 467.2 bits (1201), Expect = 4.2e-128
Identity = 261/563 (46.36%), Positives = 318/563 (56.48%), Query Frame = 0

Query: 51  SCPSISNCREIHARVFKSLLYRDGFIGDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNS 110
           SC SIS  R IH+ V KS  Y DGFIGDQLV+CY +LG A+DA  LFD+MP+KDL+SWNS
Sbjct: 1   SCSSISYSRAIHSCVIKSFNYTDGFIGDQLVSCYTRLGRADDARNLFDEMPNKDLISWNS 60

Query: 111 LISGFSR--CLHMSLTAFYTMKFEMSVKPNEVTILSMISAC--NGALDAGKYIHGFGIKV 170
           LISGFSR   +   L AF+ MKFEM ++P+EVT++S+ SAC   GA+D GKYIHGF +K+
Sbjct: 61  LISGFSRRGYVDKCLDAFFRMKFEMGIEPDEVTLISITSACASRGAVDEGKYIHGFALKL 120

Query: 171 GGTLEVKVANSLINMYGKSGDLTSACRLFEAIPDPDTVSWNSIIAAQVTNGCAREGIDYF 230
           G   EVK+ NSLIN+YGKSG L + CRL E +P  + VSWN +I +   NG A +G+ YF
Sbjct: 121 GVLWEVKLVNSLINLYGKSGYLDAVCRLVETMPVGNIVSWNLMIVSHAQNGSAADGVGYF 180

Query: 231 NKMRRLGIEQDEGTILALLQACLHLGVGKLAESIHGLMF--------------------- 290
           N MRR GI  D+GT+L+LL+AC +LG+ KLAE +HGL+                      
Sbjct: 181 NLMRRAGINPDDGTVLSLLEACENLGLQKLAEGVHGLITKCGLYANATVATGLLDLYAKL 240

Query: 291 ------------------------------------------------------------ 350
                                                                       
Sbjct: 241 GRLNYSLKVFGEVNNPDKVAWTAMLAGNAVHGNGREAMELFEGMVKVGVEPDHVTFTHLL 300

Query: 351 ------------------------------------------------------------ 410
                                                                       
Sbjct: 301 SACSHSGLVKEGKNYFDIMSQVYGIEPRLDHYSCMVDLLGRSGLLNDAYELIKRMPLKPN 360

Query: 411 ----------------------CTEHLINMEPLDPRNYIMLSNMYSASRSWKDAAKVRAL 447
                                   E L +++P D RNYIMLSNMYSA+  W+DA+KVRAL
Sbjct: 361 SAVWGALFGACRVYGNIELGKEVAERLFSLDPSDSRNYIMLSNMYSAAGLWRDASKVRAL 420
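
Each HSP header in the alignments above gives a bit score, a raw score, an E-value, and identity and positive counts over the alignment length. The sketch below is one way to read such a header and recompute the percentages from the counts. The helper name `parse_hsp_header` and the regular expressions are illustrative assumptions, not part of any BLAST tool. The header text is copied from the sp|A8MQA3|PP330_ARATH hit.

```python
import re

# Header lines of one HSP, copied from the Swiss-Prot hit sp|A8MQA3|PP330_ARATH.
HSP_HEADER = (
    "HSP 1 Score: 273.1 bits (697), Expect = 5.7e-72\n"
    "Identity = 157/468 (33.55%), Positives = 237/468 (50.64%), Query Frame = 0"
)


def parse_hsp_header(text):
    """Return scores, E-value and recomputed percentages (hypothetical helper)."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect = (\S+)", text)
    ident = re.search(r"Identity = (\d+)/(\d+)", text)
    # "Posi?tives" also matches the "Postives" misspelling seen in some renderings.
    posit = re.search(r"Posi?tives = (\d+)/(\d+)", text)
    if not (score and expect and ident and posit):
        raise ValueError("not a recognisable HSP header")

    def pct(match):
        # Percentage = matching columns / alignment length, rounded to two decimals.
        return round(100.0 * int(match.group(1)) / int(match.group(2)), 2)

    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "alignment_length": int(ident.group(2)),
        "identity_pct": pct(ident),
        "positives_pct": pct(posit),
    }


hsp = parse_hsp_header(HSP_HEADER)
print(hsp["identity_pct"], hsp["positives_pct"])
```

The recomputed percentages agree with the values printed in the header. Both are taken over the alignment length (468 columns here, gaps included), not over the 447-residue query.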

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004143073.2  5.1e-239  72.91         PREDICTED: pentatricopeptide repeat-containing protein At5g40410, mitochondrial ...
XP_008445305.2  5.7e-222  68.69         PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g...
XP_023546177.1  3.6e-200  62.93         pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X1 ...
XP_022961715.1  2.3e-199  61.56         pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita ...
XP_022997515.1  7.5e-198  62.27         pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita ...
Match Name   E-value   Identity (%)  Description
AT5G40410.1  1.4e-110  41.03         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1  5.9e-80   36.69         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G30700.1  1.0e-76   37.39         Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1  3.2e-73   33.55         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1  2.0e-72   33.62         Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name             E-value   Identity (%)  Description
sp|Q9FND6|PP411_ARATH  2.6e-109  41.03         Pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Arabidop...
sp|Q9LN01|PPR21_ARATH  1.1e-78   36.69         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q9SUH6|PP341_ARATH  1.9e-75   37.39         Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX...
sp|A8MQA3|PP330_ARATH  5.7e-72   33.55         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
sp|Q7Y211|PP285_ARATH  3.7e-71   33.62         Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
Match Name                      E-value   Identity (%)  Description
tr|A0A0A0LKE6|A0A0A0LKE6_CUCSA  3.4e-239  72.91         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348160 PE=4 SV=1
tr|A0A1S3BBW7|A0A1S3BBW7_CUCME  3.8e-222  68.69         LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mito...
tr|A0A251MQR1|A0A251MQR1_PRUPE  1.8e-131  45.33         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G006400 PE=4 SV=1
tr|A0A2I4DY34|A0A2I4DY34_9ROSI  2.9e-129  46.00         pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Juglans ...
tr|M5VWG5|M5VWG5_PRUPE          4.2e-128  46.36         Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa021080m...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR032867  DYW_dom
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy2G016890.1  CsGy2G016890.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 204..237; e-value: 2.3E-4; score: 19.1
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 106..118; e-value: 0.46; score: 10.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 176..196; e-value: 0.01; score: 15.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 80..103; e-value: 0.015; score: 15.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 204..234; e-value: 2.4E-4; score: 21.0
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 73..107; score: 7.815
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 171..201; score: 6.939
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 202..236; score: 9.547
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 277..311; score: 7.278
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 154..391; e-value: 1.8E-18; score: 69.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 34..153; e-value: 7.7E-13; score: 50.1
IPR032867  DYW domain                                         PFAM     PF14432           DYW_deaminase        coord: 313..436; e-value: 1.1E-37; score: 128.7
None       No IPR available                                   PANTHER  PTHR24015:SF669   SUBFAMILY NOT NAMED  coord: 267..369
None       No IPR available                                   PANTHER  PTHR24015:SF669   SUBFAMILY NOT NAMED  coord: 13..266
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 267..369
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 13..266
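
The `coord: start..end` values in the InterPro table above can be turned into span lengths with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, which the page does not state explicitly. The helper name `span_length` is hypothetical, and the coordinate strings are copied from the table.

```python
import re

# (source term, coordinate string) pairs copied from the InterPro table above.
HITS = [
    ("PF14432", "coord: 313..436"),
    ("G3DSA:1.25.40.10", "coord: 154..391"),
    ("G3DSA:1.25.40.10", "coord: 34..153"),
    ("PS51375", "coord: 277..311"),
]


def span_length(coord):
    """Length in residues of a 'coord: start..end' span, assuming inclusive ends."""
    m = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", coord.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


lengths = [(term, span_length(coord)) for term, coord in HITS]
for term, n in lengths:
    print(term, n)
```

Under the inclusive-coordinate assumption, the DYW_deaminase hit (313..436) covers 124 residues and the PROSITE PPR hit at 277..311 covers 35.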