Cucsat.G2170 (gene) Cucumber (B10) v3

Overview
NameCucsat.G2170
Typegene
OrganismCucumis sativus L. var. sativus cv B10 (Cucumber (B10) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationctg1002: 147999 .. 156167 (+)
RNA-Seq ExpressionCucsat.G2170
SyntenyCucsat.G2170
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AGTTAAATATAACAGTTGACAACTATTTAGACCTCAAAATAACGTAGAGAGCTAAACTAATTCGTCTTCCTTAACGCGGAAGACGAATTATTATTTTATAGTCAAACTGTAAATACCCAATGAAAGAATTTCAAGAAAAATTTCATGTTCTTCCTTTGATTCTGCGATTGGGGAAATTGGAAAAACTTTGAAGTTTACATTTTTAGATTTCTAGGTCACGAGTTGTTGAAGAATCACATAAGAGGAAGCTTGTGAGTTGAGCTTTCCGTTTATTTATTTTGGTCAAACTTTTTTCTTTCCTACTGAACCCATGGCTGATTGTTAAATTTCGTTGTTCAAGTTTCCAGAATTTATAAAGGTATGCATTTTTTATCTATTCATTATTTTAATCTGGACATTGATTTAATACAATGTTGGATGCTTTTAAGTCATTTGAATTTCAAATTCAACCTCACTGATTTTGGTTTAGAGTTTAAATGGAAGTAATGTTATTGGTTTTGATGAAGGGTTTATTATTGAATTTGTTTGTATTATCTATTGGTTATTTCATTTTGTTATTGTTCTGTTTGAGTGCTGTACTACTAGAATATGAATGGACAATCTTCAATGAAGTATTTGGGGATTATAATGTGTAATTTGTAGAAGATTTGAATGAACTATCTTGAATGAACGGTTTAGTTAAGTTACCAGCCTTCTAATTGTCTGTATTGACAGAGGATTTCAGTTCCTTGATTGTTTATTTTAATGTCCTACCAGGTGTCTAATCTATTCAACCGGGCCAATCGATAGAAGTTTTATAATTAGCCACAGACTTTACTTACTCCTTCTAGTTTGCCATACGATCTTAAAATGGTGCCTCTTTCTGATCTTTTCCCATCCTTTGATCACTGTGCTCGTCTCTTTTCAAAATGCATTCAACACAAACATTTAAGGGTTGGCATGTCCTTGCATTCCCACCTTATCAAAACTGCACTTTCATTTGACCTCTTCCTTGCAAATCGTCTTATTGACATGTATTCCAAATGTAATTCTATGGAAAATGCACAGAAGGCATTTGATGATTTGCCCATTAGAAATATTCATTCGTGGAATACCATTCTTGCGTCCTACTCTCGTGCTGGATTTTTTAGTCAAGCTCGTAAAGTCTTTGATGAAATGCCTCATCCAAATATTGTTAGCTACAATACCTTGATTTCTAGCTTTACTCACCATGGGCTGTATGTAGAATCAATGAATATCTTTCGACAAATGCAACAAGATTTCGATCTTTTAGCCTTGGACGAGATTACTCTTGTGAGTATAGCAGGTACTTGTGCCTGTTTGGGTGCTTTGGAGTTCTTGCGTCAGGTTCATGGAGCAGCTATTGTCATTGGATTAGAGTTTAATATGATTGTTTGTAATGCTATAGTTGATGCTTATGGTAAATGTGGCGATCCAGATGCGTCATATTCTATTTTCAGTAGAATGAAGGAGAGAGATGTTGTTACCTGGACCTCCATGGTTGTAGCTTATAATCAGACATCCAGGTTAGATGATGCTTTTCGAGTTTTCAGTTGTATGCCAGTAAAAAATGTTCATACTTGGACTGCATTGATTAATGCTTTAGTGAAGAACAAGTATAGCAATGAGGCCCTGGATTTGTTTCAACAAATGCTGGAGGAAAAAACTTCTCCAAATGCTTTCACATTTGTAGGCGTTTTAAGTGCCTGTGCTGATCTTGCTTTGATAGCAAAAGGCAAAGAGATTCATGGACTCATAATCAGAAGGAGCAGTGAACTTAATTTTCCAAACGTATATGTATGTAATGCTTTAATTGATCTGTACAGCAAGAGTGGTGACGTGAAATCAGCTAGGATGTTGTTTAACTTGATTCTTGAAAAGGATGTAGTATCGTGGAATTCATTAATAACTGGGTTTGCACAAAACGGGCTTGGAAGGGAAGCACTTCTTGCCTTCCGAAAGATGACAGAAGTAGGGATAAGGCCTAATAAAGTGACGTTTCTTGCTGTGCTGTCTGCCTGTTCCCATACTGGTTTGTCATCTGAAGGATTATGTATTCTGGAGTTAATGGAGAAGTTTTATGATATTGAGCCTAGTTTAGAACATTATGCAGTCATGATCGATATGTTTGGAAGAGAAAACAGACTTGCTGAAGCATTGGATTTAATATCCAGAGCACCCAATGGATCAAAACACGTTGGGATATGGGGTGCAGTTCTGGGGGCTTGTCGTATACATGAAAATTTAGACCTGGCTATAAGAGCTGCAGAAACTTTGTTTGAGATGGAGCCAGATAATGCTGGAAGATATGTAATGTTATCTAATGTATTTGCTGCAGCAAGTCGATGGATGGATGCCCATAATGTGAGAAAACTTATGGAGGAAAGAGGTTTCAAGAAGGAAGTAGCATATAGCTGCATAGAAATAAGAAATATAAGGCATAAGTTTGTGGCAAGAGATAATTCCCATAGTCAGATGGGTGAGATATATGAGTTAATGTTTATACTACTAGAGCATATGAACATTATTGGTTACATGGCTCTTGATGATGGTATTTACTTTTATGATGGATACAGTACTTGAACTTTGGGCATGATCTATTTGAATCCCGTGTCTCACCATTGCAAATATATTGTAAAAGTTAGCAATTTCATTGAATGAATTTGAGAATATGAAGCGGAACGATTAAGCTACAAGGCTGAGAAGGACGGATTGCTATCAAGTAGGAGATCTTGATTTTATTTTATTTATTTGTTTTAGTATTATTATTGTTTTTGTTTGTGGTAATATACATCTAGTTTTAGTCGGTAATGCATGAGAATATTTATCATTCAACATTTCAGTGGCTAAATGAATCAATTATAGGTCATCATCATCGTGCTTGTAGTATTTTATTTTCTTTCCCCACCATATTTGTGTGATACCATTACAAATAATCCTAGAAAAAATTGTGAAAATTTGAACTTGTACAGGGCTGGACACGACTGGGCACACCAAGGTTAAGAAAAAAATAGATTTTTCTTAATTCTTATAGACTGAATTTTTGTAAATTGATCAATCTTAAAGTATGAGGACGCTATCTTCTTTCTTTAGTCATAGCCAACTAATTCCATTTATATCTTGTAAGATTTGAAATGTCGTCAATTGGCTAAGCTCTTTATCTTTTTGGTGAAGTACTGTAACAAAAACTTACACGTTTTATTGATAAAGAAGTATAATATCGATATCAATATATAGACATATGTTTTATCATAAAAAAAGTGCACGCATCCTACTTTTTTAGAAATTGACCTGTCATCATGTTTGTTTTGTGTTGCTTTGGTGTTTTCATTTTTTTTTAATTTTCTTTATGTATTGGTGACTCTGATTCTTACATCCTATTATATTATCCAATAGGAAGATTGTCTTTGATTAAGAGATATCTTTCATGGAATGTGGCTCACTAGTGATTTTCGTCTTGCTGCTTTTCTTTCTGTCTTCATGACTTTGATCGTTTCTTCCTTTTCTGTAACTTGCTAGGAAGACTGCTGTACTGGAGGTAAAAGAGCTGATGAAGTTCTGTTCTTAATTTAGACGGAAAAGTTTGTAGGTGTGAGGGCTCTAAGAGTTAGTTTGCAGAGATGAAATTCAGAAATAATTTCCCTGTAAGAGGGCATCATCTTTTCTTGGTCGTAGTTGCTCTCACATTCACTGTCTTGGTGTTGTGGGCTTGGGAGAATCCTTTCCTTACCGCTTCTCAGTCAGTTCAAGCATGGTATAGAAATTCTTATGCAGGTATGCTTGTCAATGCTTGTTTATAACTCTAGAACTGGTCTACTATTAAATTAGACTAGTTTTTGGTATTCTTCGTTGTTCTCTTCTTGTTGGAACTATTAGATTTTTGAGTTCAGTAGTACATTTCTTTTTTATGACAAAGACAATCTGATTGGAGAAAAGTGAAGTTTGGGAGGTTAAAGGAAGTAAGCTAAATATGGCCTTTATACACAGTTTTTGGTTTCTCCTTTAAAATTTTGAAATTTGCTCTCAAAGGTATTGACTTTTTAGTCAAGTTACTATTTTCAATAACGTGGTTGTAGTTGAGTAGAGAAACTATGACGTAACATTGACAACCTTATATTCTATGAATGGAAAATATTTGAATGTTAAGTTTTTATCTCATTTCATCCAAATTAGAAAATCCCGTTGGCCTTACCTAAATAGAAGCTTACTTGATGGACAACTTCAGTGTACATGCTCCTTCAGTTTCATGCAACTCTCCCCATTCACTGTTGGTGCAGCCTTTTCTCTTGACTCATGCTACCAATCCCCTTTCTCGCAAACCTATATTATTCAGCCTTAATGCTCTTAGAAACCTACCCCATGAGCATACGGTTCTCCTTTCATTTCTTTTAGTAATTGGGTCCGTGGGTTTTCTTCTCCAAAATAGTTGGTTGGACTTTTTTTTGGATGTCATAGTTGGTTGGGTTTTTAATATAAGAGAGTTGGTAAATTTTTAATATAAAAAACTGCCAATATTTTTACAATGCTGCATCACTGTTGCTATATCAATTTCAACCACATTTATTGAATACGGATGTTTTTGAAAAATTTGGAACTTTGAGGACAAAATTGAAACTGTCAAATTGGCAAAAGAAGAAATGGTTCATAAATGTAATGAATTTAGTTAAATGCTGAAACATATTCAAATGCAAATTTAATGTATTTTGCCTTCTCTTGGTATATTTGTACCTCGCTGATGGTTTTTATTTGCTTAGGGTTCGTCGTAGGTTCAACAAAAAGTTCTGTATTACCTAACACAGTAAGGGAGAACGTTGAAAAAACATATTCAAATTCAAGTATAAAAGAAGAGATAATACAAGATGATGCAAATTCAGAAATTACACCCACAGATTCTGCATTCCAAATAGTTCTTGAGAGGAGCAAGAGTAATCAGAATAGTAAGTACTGAGCCTAAACTAGTGGTGCTGTTTTCTAATCACTAGTTTGCATGTTATTTTCTCTTTCAGAATTGTTCCCTACGGTTGGGTAACCGTTAGGAGTTTCATGGTGGCGTAATCCTTGCTTGTGATTGAAGCTTATTTCTTATTGATTTTTGTTCATGTCTCAAAAGAGAAGTTTTACGATTTCTCTCATTGTAGAAGGGAAAATTCTAAATCAGCTATTTAACTCTTCGTTTTATTGGCACATGATTTGTTTGTTAGTTAACTAACGGTCTAAGTAGTTTTTGGAACTATTTTAGAAACCTTTATCAAACCTCAAGGACTAAAATGTTCATTTTGAAACTTCATGGACCAAAGAGACATCCACTTTAAATCTCAAAGACCAAAAGTGTATTTTGCCATTTTATTTTATTTTTAGATTTCATAGTTGGTTGGATTTTTTATATAAGAGGGTTTGGTAATTATTTAATATAAAAAAATGTCAATTCAAAGAAAAACATCACCATTGAAATATCAATTTTCAACCACATTTATTGAATATGGATGTTCTTGAAAATTTTGACACTTTGAGGACAAAATTAAAACTGTTAAAATCAGCAATATAAAAGATGGTTCATAAATGTAATGAATTTAGAAAAATGCTGAAACATAAATGCAAATTTAGTGTATTTTGCCTGTTTTTGCTATTTGTGTACCTTGCTGATGTTCTTTATTTGTTTGGGGTTCATGGTAGGTTCCACAAACAGTTCTGTATTGCCTAACACAATAAAGGAGAATTCTGGAAAAACATATTCAAATTCAAGTACAAAGGAAAAGACAGTAAAAGATGATGCAAACTCAGAAGTTAAACTTACAGATTCTGCATCTACAATAATTTTTAACAGGAGCAAGAGTAATCAGAATAGTAAGTACTCAGGCCAAAATCGTGGTGCTGTTTTCTAATTACATACTTAGCATATTTATTTTCTCTTCGCAAAATTGTTCCCTTCAATTATGAAACCGTTGGGAGTTTCATGGGGGCGTGATCCCTGCTTACTGTTTATGATTGGAGCTTCTTTCTTATTGACTTTTATTGATGTCCCAAAAGAGAAATTATTTTACAATCTCATTGTAGAAGGGAGAATTCTAAAATGAAGGATACCTAGAGATGTTTTGAATATTTTGATATCTTGAAACTCATGATAATTCAAAATGTGCTTTATCAGTCGTTTACTTTGTTCGATGCTATGTTCTTGATTAAAGATCTAGTCGTGGTAAAAAAATTTGACCGGTATCCTTTTCCTAGTGTTGTTACTCTTTACATCAGAGCCCTCTTATAGCAATAAAGAGAAAGGAAACCATATGGATTGTTGTTGGTGAAATTTTCAATTATTTAGACCATATGGATTATGGCTTTATAGAACAAATTTTCCTCATCTAAGGTCTATGTTTTCCTCACGAAACAAAGCTTCGATTGTTAAATATTGAATTATATTGATATCAGGCATTTATCATGAAGGCCATGACATGCATATTAATCCAATATTTAATTTTATCTATGAAATTAGCTTATTCAGTGCACATTAAATCGGTTTGTTTGTTATATTGACTGATTTAACTTTTCAGCCTGTAGCTATGGAAATGGAGGATGGGTCCTTGACAATAGTCGACCGCTATACTCTGGCTTTGGATGTAAGAGATGGTTATCAGCAATGTGGTCATGTAGACTGACCCAACGGACAGATTTTTCCTATGAAAAATATCGTTGGGTTCCCAAAGATTGTGAATTGCCAGCATTTGAGCGGTCTGCATTCCTGAAAAGGTAATGACTTCCTTCTATATTTTTGCATATAGGTTTAGCAAGTCTGATAATATAAAACTGTCTTGTTTTATTTAGTGTATTGGTTTTAAAATTTTCAACCAAACAATTTCCACCATACGATGAATTCTGAAAGTATAGGTATCTATCTTATGGTCCTCAACTCTCTGCCAAGTGAACTATTTAATGGATTGTGATTTACTAAGGTTACTTTCTGCTATTCTCGACACTCTCTGAATGATCTGAATTTGTTCCATTCAGAATGCAGGACAAAACCATCGCATTCATTGGGGATTCATTAGGAAGGCAGCAATTTCAGTCTTTGATGTGTATGGTCACTGGTGGGGAAGAAAGGCCTGAGGTTCAAGATGTAGGAAAGGAATATGGTCTTGTCAAAGCCAAGGGTGCAATTCGTCCTGATGGCTGGGCATATCGTTTCTCAAATACCAATACTACCATTTTATACTATTGGTCATCTAGCCTCAGCGATTTATTGCCTTTGAACACATCAGACCCAGCCACCGATGTAGCAATGCATCTTGACCGTCCGCCAGCATTTCTGAGAAAATTCCTTCATCTATTTGATGTGTTGGTTCTCAATACAGGACATCATTGGAACAGGGGAAAAATGAGACAAAATAGATGGGTAATGTACACTGATGGAGTTCGTAGTGAACTCGGGAACTTAAAAGAAATAGGCATAGCTAAAAATTTTACGGTGCACAGTATCGTGAAATGGCTCAATTCACAACTCCCTTCCCATCCTCGACTCAAGGTTTTTTTCAGGACCTTATCACCTCGGCATTTCCGCAATGGGGAATGGAATTCTGGAGGTAGCTGTGACAACACAAGACCATTATCTGGAGGAAGCAAAGTAGAGCAGAATGGATCAAGTGATACAGTTGTTGAGAATGCTGTAAGAGGTACACAAGTAAAGATATTGGATATAACTGCTCTTTCATATCTAAGAGACGAAGCTCACAAATCCAATTACAGTATCAAAGGAACATCGAGCGGTAGCGATTGCTTGCATTGGTGTCTCCCTGGTATCCCGGATACGTGGAACGAGATTCTTATTGCACAAATATAGATTCTTTTGTCTTGAAGATTTTAGATTTTCTCTCTTGTGTAATCCTGGAAGACTCATCTTTCTAATTTTTGCTGCCTAGAGCTGTACTAATTGGTTCAGGGGAAGATGATCACTGATGAGATGGCGTTGGTTGCTACTGAATTAAATTATATGGTGGGTATTGCCTACGTGTCATTTTTCATTTAAAAAGAGTCAATTTATACTTGTAAACTGTATAAATTACATGTATCAATTTAAACTATAATAAGTATATTTCATTTAACAAACTGGTGGATCAGCAAAATGGATGATTAGAATTAATGCATTGACATTTTTTATAATTGTACTTTTATCTAAAGGTTCAAACTGATTCGAAAATACAGGCGAGTAAGGGTTTCAAATGCTTTTTGATA

Coding sequence (CDS)

ATGGTGCCTCTTTCTGATCTTTTCCCATCCTTTGATCACTGTGCTCGTCTCTTTTCAAAATGCATTCAACACAAACATTTAAGGGTTGGCATGTCCTTGCATTCCCACCTTATCAAAACTGCACTTTCATTTGACCTCTTCCTTGCAAATCGTCTTATTGACATGTATTCCAAATGTAATTCTATGGAAAATGCACAGAAGGCATTTGATGATTTGCCCATTAGAAATATTCATTCGTGGAATACCATTCTTGCGTCCTACTCTCGTGCTGGATTTTTTAGTCAAGCTCGTAAAGTCTTTGATGAAATGCCTCATCCAAATATTGTTAGCTACAATACCTTGATTTCTAGCTTTACTCACCATGGGCTGTATGTAGAATCAATGAATATCTTTCGACAAATGCAACAAGATTTCGATCTTTTAGCCTTGGACGAGATTACTCTTGTGAGTATAGCAGGTACTTGTGCCTGTTTGGGTGCTTTGGAGTTCTTGCGTCAGGTTCATGGAGCAGCTATTGTCATTGGATTAGAGTTTAATATGATTGTTTGTAATGCTATAGTTGATGCTTATGGTAAATGTGGCGATCCAGATGCGTCATATTCTATTTTCAGTAGAATGAAGGAGAGAGATGTTGTTACCTGGACCTCCATGGTTGTAGCTTATAATCAGACATCCAGGTTAGATGATGCTTTTCGAGTTTTCAGTTGTATGCCAGTAAAAAATGTTCATACTTGGACTGCATTGATTAATGCTTTAGTGAAGAACAAGTATAGCAATGAGGCCCTGGATTTGTTTCAACAAATGCTGGAGGAAAAAACTTCTCCAAATGCTTTCACATTTGTAGGCGTTTTAAGTGCCTGTGCTGATCTTGCTTTGATAGCAAAAGGCAAAGAGATTCATGGACTCATAATCAGAAGGAGCAGTGAACTTAATTTTCCAAACGTATATGTATGTAATGCTTTAATTGATCTGTACAGCAAGAGTGGTGACGTGAAATCAGCTAGGATGTTGTTTAACTTGATTCTTGAAAAGGATGTAGTATCGTGGAATTCATTAATAACTGGGTTTGCACAAAACGGGCTTGGAAGGGAAGCACTTCTTGCCTTCCGAAAGATGACAGAAGTAGGGATAAGGCCTAATAAAGTGACGTTTCTTGCTGTGCTGTCTGCCTGTTCCCATACTGGTTTGTCATCTGAAGGATTATGTATTCTGGAGTTAATGGAGAAGTTTTATGATATTGAGCCTAGTTTAGAACATTATGCAGTCATGATCGATATGTTTGGAAGAGAAAACAGACTTGCTGAAGCATTGGATTTAATATCCAGAGCACCCAATGGATCAAAACACGTTGGGATATGGGGTGCAGTTCTGGGGGCTTGTCGTATACATGAAAATTTAGACCTGGCTATAAGAGCTGCAGAAACTTTGTTTGAGATGGAGCCAGATAATGCTGGAAGATATGTAATGTTATCTAATGTATTTGCTGCAGCAAGTCGATGGATGGATGCCCATAATGTGAGAAAACTTATGGAGGAAAGAGGTTTCAAGAAGGAAGTAGCATATAGCTGCATAGAAATAAGAAATATAAGGCATAAGTTTGTGGCAAGAGATAATTCCCATAGTCAGATGGGTGAGATATATGAGTTAATGTTTATACTACTAGAGCATATGAACATTATTGGTTACATGGCTCTTGATGATGGTATTTACTTTTATGATGGATACAGTACTTGA

Protein sequence

MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST
Homology
BLAST of Cucsat.G2170 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 2.0e-98
Identity = 196/546 (35.90%), Postives = 308/546 (56.41%), Query Frame = 0

Query: 11  FDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSF-DLFLANRLIDMYSKCNSMENAQKAF 70
           FD  A L  +C   K L+ G  +H HL  T     +  L+N LI MY KC    +A K F
Sbjct: 46  FDLLASLLQQCGDTKSLKQGKWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVF 105

Query: 71  DDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMN 130
           D + +RN++SWN +++ Y ++G   +AR VFD MP  ++VS+NT++  +   G   E++ 
Sbjct: 106 DQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALW 165

Query: 131 IFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDA 190
            +++ ++    +  +E +   +   C     L+  RQ HG  +V G   N+++  +I+DA
Sbjct: 166 FYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDA 225

Query: 191 YGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALI 250
           Y KCG  +++   F  M  +D+  WT+++  Y +   ++ A ++F  MP KN  +WTALI
Sbjct: 226 YAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALI 285

Query: 251 NALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSE 310
              V+    N ALDLF++M+     P  FTF   L A A +A +  GKEIHG +IR +  
Sbjct: 286 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 345

Query: 311 LNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEK-DVVSWNSLITGFAQNGLGREALLA 370
              PN  V ++LID+YSKSG ++++  +F +  +K D V WN++I+  AQ+GLG +AL  
Sbjct: 346 ---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRM 405

Query: 371 FRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFG 430
              M +  ++PN+ T + +L+ACSH+GL  EGL   E M   + I P  EHYA +ID+ G
Sbjct: 406 LDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLG 465

Query: 431 RENRLAEALDLISRAP-NGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRY 490
           R     E +  I   P    KH  IW A+LG CRIH N +L  +AA+ L +++P+++  Y
Sbjct: 466 RAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKAADELIKLDPESSAPY 525

Query: 491 VMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIY 550
           ++LS+++A   +W     +R +M++R   KE A S IEI      F   D SH+   +  
Sbjct: 526 ILLSSIYADHGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKVEAFTVSDGSHAHARK-E 583

Query: 551 ELMFIL 554
           E+ FIL
Sbjct: 586 EIYFIL 583

BLAST of Cucsat.G2170 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 355.9 bits (912), Expect = 8.5e-97
Identity = 209/649 (32.20%), Postives = 324/649 (49.92%), Query Frame = 0

Query: 15  ARLFSKCIQHKHLRVGMS-LHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLP 74
           A+L   CI+ K   + +  +H+ +IK+  S ++F+ NRLID YSKC S+E+ ++ FD +P
Sbjct: 23  AKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 82

Query: 75  IRNIHSWNTILAS----------------------------------------------- 134
            RNI++WN+++                                                 
Sbjct: 83  QRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAM 142

Query: 135 ------------------------------------------------------YSRAGF 194
                                                                 YS+ G 
Sbjct: 143 MHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN 202

Query: 195 FSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIA 254
            + A++VFDEM   N+VS+N+LI+ F  +G  VE++++F+ M +    +  DE+TL S+ 
Sbjct: 203 VNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE--SRVEPDEVTLASVI 262

Query: 255 GTCACLGALEFLRQVHGAAIVIG-LEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDV 314
             CA L A++  ++VHG  +    L  ++I+ NA VD Y KC     +  IF  M  R+V
Sbjct: 263 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 322

Query: 315 VTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEE 374
           +  TSM+  Y   +    A  +F+ M  +NV +W ALI    +N  + EAL LF  +  E
Sbjct: 323 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 382

Query: 375 KTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNF---PNVYVCNALIDLYSKS 434
              P  ++F  +L ACADLA +  G + H  +++   +       +++V N+LID+Y K 
Sbjct: 383 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKC 442

Query: 435 GDVKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVL 494
           G V+   ++F  ++E+D VSWN++I GFAQNG G EAL  FR+M E G +P+ +T + VL
Sbjct: 443 GCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 502

Query: 495 SACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRAPNGSK 554
           SAC H G   EG      M + + + P  +HY  M+D+ GR   L EA  +I   P    
Sbjct: 503 SACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPD 562

Query: 555 HVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRK 558
            V IWG++L AC++H N+ L    AE L E+EP N+G YV+LSN++A   +W D  NVRK
Sbjct: 563 SV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRK 622

BLAST of Cucsat.G2170 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 2.1e-95
Identity = 186/540 (34.44%), Postives = 316/540 (58.52%), Query Frame = 0

Query: 27  LRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLPIRNIHSWNTILAS 86
           +  G  +HS ++K  L  ++ ++N L++MY+KC     A+  FD + +R+I SWN ++A 
Sbjct: 162 METGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIAL 221

Query: 87  YSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEI 146
           + + G    A   F++M   +IV++N++IS F   G  + +++IF +M +D  LL+ D  
Sbjct: 222 HMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD-SLLSPDRF 281

Query: 147 TLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDAYGKCGDPDASYSIFSRM 206
           TL S+   CA L  L   +Q+H   +  G + + IV NA++  Y +CG  + +  +  + 
Sbjct: 282 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 341

Query: 207 KERD--VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDL 266
             +D  +  +T+++  Y +   ++ A  +F  +  ++V  WTA+I    ++    EA++L
Sbjct: 342 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 401

Query: 267 FQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNFPNVYVCNALIDL 326
           F+ M+     PN++T   +LS  + LA ++ GK+IHG  + +S E+   +V V NALI +
Sbjct: 402 FRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAV-KSGEIY--SVSVSNALITM 461

Query: 327 YSKSGDVKSARMLFNLI-LEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVT 386
           Y+K+G++ SA   F+LI  E+D VSW S+I   AQ+G   EAL  F  M   G+RP+ +T
Sbjct: 462 YAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHIT 521

Query: 387 FLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRA 446
           ++ V SAC+H GL ++G    ++M+    I P+L HYA M+D+FGR   L EA + I + 
Sbjct: 522 YVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKM 581

Query: 447 PNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDA 506
           P     V  WG++L ACR+H+N+DL   AAE L  +EP+N+G Y  L+N+++A  +W +A
Sbjct: 582 PI-EPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEA 641

Query: 507 HNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIIGYM 564
             +RK M++   KKE  +S IE+++  H F   D +H +  EIY  M  + + +  +GY+
Sbjct: 642 AKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 696

BLAST of Cucsat.G2170 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 5.7e-93
Identity = 201/628 (32.01%), Postives = 326/628 (51.91%), Query Frame = 0

Query: 6   DLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENA 65
           D+ P   +   L   C     LRVG  +H  L+K+  S DLF    L +MY+KC  +  A
Sbjct: 130 DVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEA 189

Query: 66  QKAFDDLPIRNIHSWNTILASYS------------------------------------- 125
           +K FD +P R++ SWNTI+A YS                                     
Sbjct: 190 RKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSAL 249

Query: 126 --------------RAGFFS-------------------QARKVFDEMPHPNIVSYNTLI 185
                         R+GF S                    AR++FD M   N+VS+N++I
Sbjct: 250 RLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMI 309

Query: 186 SSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIG 245
            ++  +    E+M IF++M  +   +   +++++     CA LG LE  R +H  ++ +G
Sbjct: 310 DAYVQNENPKEAMLIFQKMLDEG--VKPTDVSVMGALHACADLGDLERGRFIHKLSVELG 369

Query: 246 LEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFS 305
           L+ N+ V N+++  Y KC + D + S+F +++ R +V+W +M++ + Q  R         
Sbjct: 370 LDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR--------- 429

Query: 306 CMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAK 365
             P+                    +AL+ F QM      P+ FT+V V++A A+L++   
Sbjct: 430 --PI--------------------DALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 489

Query: 366 GKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITG 425
            K IHG+++R   +    NV+V  AL+D+Y+K G +  AR++F+++ E+ V +WN++I G
Sbjct: 490 AKWIHGVVMRSCLD---KNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDG 549

Query: 426 FAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEP 485
           +  +G G+ AL  F +M +  I+PN VTFL+V+SACSH+GL   GL    +M++ Y IE 
Sbjct: 550 YGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIEL 609

Query: 486 SLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAET 545
           S++HY  M+D+ GR  RL EA D I + P     V ++GA+LGAC+IH+N++ A +AAE 
Sbjct: 610 SMDHYGAMVDLLGRAGRLNEAWDFIMQMP-VKPAVNVYGAMLGACQIHKNVNFAEKAAER 669

Query: 546 LFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVA 564
           LFE+ PD+ G +V+L+N++ AAS W     VR  M  +G +K    S +EI+N  H F +
Sbjct: 670 LFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFS 720

BLAST of Cucsat.G2170 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 7.5e-93
Identity = 190/563 (33.75%), Postives = 305/563 (54.17%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           ++P S  FP       +   C + K  + G  +H H++K     DL++   LI MY +  
Sbjct: 130 LLPNSYTFPF------VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNG 189

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
            +E+A K FD  P R++ S+  ++  Y+  G+   A+K+FDE+P  ++VS+N +IS +  
Sbjct: 190 RLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAE 249

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
            G Y E++ +F+ M +    +  DE T+V++   CA  G++E  RQVH      G   N+
Sbjct: 250 TGNYKEALELFKDMMK--TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 309

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
            + NA++D Y KCG+ + +  +F R+  +DV++W +++  Y                   
Sbjct: 310 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTH----------------- 369

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
                         N Y  EAL LFQ+ML    +PN  T + +L ACA L  I  G+ IH
Sbjct: 370 -------------MNLY-KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIH 429

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
             I +R   +   +  +  +LID+Y+K GD+++A  +FN IL K + SWN++I GFA +G
Sbjct: 430 VYIDKRLKGVTNAS-SLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHG 489

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
               +   F +M ++GI+P+ +TF+ +LSACSH+G+   G  I   M + Y + P LEHY
Sbjct: 490 RADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHY 549

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
             MID+ G      EA ++I+        V IW ++L AC++H N++L    AE L ++E
Sbjct: 550 GCMIDLLGHSGLFKEAEEMINMMEMEPDGV-IWCSLLKACKMHGNVELGESFAENLIKIE 609

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           P+N G YV+LSN++A+A RW +    R L+ ++G KK    S IEI ++ H+F+  D  H
Sbjct: 610 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 651

Query: 541 SQMGEIY---ELMFILLEHMNII 561
            +  EIY   E M +LLE    +
Sbjct: 670 PRNREIYGMLEEMEVLLEKAGFV 651

BLAST of Cucsat.G2170 vs. NCBI nr
Match: XP_031745241.1 (pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Cucumis sativus] >KAE8645941.1 hypothetical protein Csa_021389 [Cucumis sativus])

HSP 1 Score: 1156 bits (2990), Expect = 0.0
Identity = 577/577 (100.00%), Postives = 577/577 (100.00%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN
Sbjct: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM
Sbjct: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY
Sbjct: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME
Sbjct: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577
           SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST
Sbjct: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577

BLAST of Cucsat.G2170 vs. NCBI nr
Match: XP_038882958.1 (pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Benincasa hispida])

HSP 1 Score: 1084 bits (2803), Expect = 0.0
Identity = 537/577 (93.07%), Postives = 555/577 (96.19%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVP SDLFPSFDHCARL SKCI+HKHL+VGMSLHSHLIKTALS DLFLANRLIDMYSKCN
Sbjct: 1   MVPFSDLFPSFDHCARLISKCIKHKHLKVGMSLHSHLIKTALSSDLFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFD+LPIRNIHSWN ILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDELPIRNIHSWNIILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVES+NIFRQMQQDFD L LDE TLVSI GTCACLGALE LRQVHGAAIVIGLEFNM
Sbjct: 121 HGLYVESINIFRQMQQDFDHLVLDEFTLVSIVGTCACLGALELLRQVHGAAIVIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALIN L KNKYSNEALDLFQQMLEEK SPN FTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINGLAKNKYSNEALDLFQQMLEEKISPNTFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           G IIRRS++LNFPNVY+CNALIDLYSKSGD+KSAR LF+LILEKDVVSWNSLITGFAQNG
Sbjct: 301 GFIIRRSNDLNFPNVYICNALIDLYSKSGDMKSARTLFDLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREALLAFR+MTEVGIRPNKVTFL +LSACSHTGLSSEGL ILELME  YDI+PSL+HY
Sbjct: 361 LGREALLAFRRMTEVGIRPNKVTFLGLLSACSHTGLSSEGLHILELMETSYDIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKE+AYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKELAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577
           +QMGEI+ELMFILLEHM I G MALDDGIYFYDGYST
Sbjct: 541 NQMGEIHELMFILLEHMKIFGCMALDDGIYFYDGYST 577

BLAST of Cucsat.G2170 vs. NCBI nr
Match: KAA0025198.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK07468.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1055 bits (2727), Expect = 0.0
Identity = 528/547 (96.53%), Postives = 536/547 (97.99%), Query Frame = 0

Query: 31  MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLPIRNIHSWNTILASYSRA 90
           MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDD PIRNIHSWNTILASYSRA
Sbjct: 1   MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDSPIRNIHSWNTILASYSRA 60

Query: 91  GFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVS 150
           G FSQARKVFDEMPHPNIVSYNTLISSFTHHGLY ESMNIFRQMQ+DFDLLALDEITLVS
Sbjct: 61  GSFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYGESMNIFRQMQRDFDLLALDEITLVS 120

Query: 151 IAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERD 210
           I G CACLGALE LRQVHGAAIVIGLEFN+IVCNAIVDAYGKCGDPDASYSIFSRMKERD
Sbjct: 121 IVGACACLGALELLRQVHGAAIVIGLEFNLIVCNAIVDAYGKCGDPDASYSIFSRMKERD 180

Query: 211 VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE 270
           VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE
Sbjct: 181 VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE 240

Query: 271 EKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGD 330
           EK SPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSS+LNFPNVYVCNALIDLYSKSGD
Sbjct: 241 EKNSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSDLNFPNVYVCNALIDLYSKSGD 300

Query: 331 VKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSA 390
           +KSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAF+KMTEVGIRPNKVTFL VLSA
Sbjct: 301 MKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFQKMTEVGIRPNKVTFLGVLSA 360

Query: 391 CSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHV 450
           CSHTGLSSEGL ILELMEK YDI+PSLEHYAVMIDMFGREN+L+EALDLISRAPNGSKHV
Sbjct: 361 CSHTGLSSEGLYILELMEKSYDIKPSLEHYAVMIDMFGRENKLSEALDLISRAPNGSKHV 420

Query: 451 GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM 510
           GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM
Sbjct: 421 GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM 480

Query: 511 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIIGYMALDDGIY 570
           EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNI GYMALDDGIY
Sbjct: 481 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIFGYMALDDGIY 540

Query: 571 FYDGYST 577
           FYDGYST
Sbjct: 541 FYDGYST 547

BLAST of Cucsat.G2170 vs. NCBI nr
Match: XP_022977857.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1013 bits (2620), Expect = 0.0
Identity = 497/574 (86.59%), Postives = 534/574 (93.03%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+PLS  FPSFDH A L SKCI+HKHL+VGMSLHSHLIK+ALSFD FLAN LIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANHLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLP +NIHSWNTILASYSRAGF SQAR +FDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVE+MNIF QMQQDFD L LDE T VSI GTCACLGALE LRQ+HGAAI IGLEFNM
Sbjct: 121 HGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQIHGAAIFIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNA+++AYGKCG+P  SYS+FSRM++RDVVTWTSMVVAY QTS+LDDAFRVF  MPVK
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINA VKNKYSNEALDLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           G+IIRRSS+LNFPNVY+CNAL+DLYSKSGD+KSAR LFNL+ +KDVVSWNSLITGFAQNG
Sbjct: 301 GIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREAL+A+R+M EVGI+PN+VTFL VLSACSHTGLSSEGL I+E MEK  DI+PSL+HY
Sbjct: 361 LGREALIAYRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMESMEKSNDIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLISRAPN SKH+GIWGAVLGACRIH+NLDLAIRAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDG 574
           SQMGEIYELMFILL+HM   GYM LDDGIYFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKFGYMLLDDGIYFYDG 574

BLAST of Cucsat.G2170 vs. NCBI nr
Match: KAG6604304.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1011 bits (2615), Expect = 0.0
Identity = 497/574 (86.59%), Postives = 534/574 (93.03%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+PLS  FPSFDH A L SKCI+HKHL+VGMSLHSHLIK+ALSFD FLANRLIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLP +NIHSWNTILASYSRAGF SQAR +FDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVE+M+IF QMQQDFD L LDE T VSI GTCACLGALE LRQVHGAAI IGLEFNM
Sbjct: 121 HGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNA+++AYGKCG+P  SYS+FS M++RDVVTWTSMVVAY QTS+LDDAFRVF  MPVK
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSSMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINA VKNKYSNEALDLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
            +IIRRSS+LNFPNVY+CNAL+DLYSKSGD+KSAR LFNL+ +KDVVSWNSLITGFAQNG
Sbjct: 301 AIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREAL+AFR+M EVGI+PN+VTFL VLSACSHTGLSSEGL I+ELM K  DI+PSL+HY
Sbjct: 361 LGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMAKSNDIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLISRAPN SKH+GIWGAVLGACRIH+NLDLAIRAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDG 574
           SQMGEIYELMFILL+HM  IGYM LDDG+YFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKIGYMPLDDGVYFYDG 574

BLAST of Cucsat.G2170 vs. ExPASy TrEMBL
Match: A0A0A0KFI0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1)

HSP 1 Score: 1156 bits (2990), Expect = 0.0
Identity = 577/577 (100.00%), Postives = 577/577 (100.00%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN
Sbjct: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM
Sbjct: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY
Sbjct: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME
Sbjct: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577
           SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST
Sbjct: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577

BLAST of Cucsat.G2170 vs. ExPASy TrEMBL
Match: A0A5D3C8H5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G001990 PE=4 SV=1)

HSP 1 Score: 1055 bits (2727), Expect = 0.0
Identity = 528/547 (96.53%), Postives = 536/547 (97.99%), Query Frame = 0

Query: 31  MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLPIRNIHSWNTILASYSRA 90
           MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDD PIRNIHSWNTILASYSRA
Sbjct: 1   MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDSPIRNIHSWNTILASYSRA 60

Query: 91  GFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVS 150
           G FSQARKVFDEMPHPNIVSYNTLISSFTHHGLY ESMNIFRQMQ+DFDLLALDEITLVS
Sbjct: 61  GSFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYGESMNIFRQMQRDFDLLALDEITLVS 120

Query: 151 IAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERD 210
           I G CACLGALE LRQVHGAAIVIGLEFN+IVCNAIVDAYGKCGDPDASYSIFSRMKERD
Sbjct: 121 IVGACACLGALELLRQVHGAAIVIGLEFNLIVCNAIVDAYGKCGDPDASYSIFSRMKERD 180

Query: 211 VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE 270
           VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE
Sbjct: 181 VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE 240

Query: 271 EKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGD 330
           EK SPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSS+LNFPNVYVCNALIDLYSKSGD
Sbjct: 241 EKNSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSDLNFPNVYVCNALIDLYSKSGD 300

Query: 331 VKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSA 390
           +KSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAF+KMTEVGIRPNKVTFL VLSA
Sbjct: 301 MKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFQKMTEVGIRPNKVTFLGVLSA 360

Query: 391 CSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHV 450
           CSHTGLSSEGL ILELMEK YDI+PSLEHYAVMIDMFGREN+L+EALDLISRAPNGSKHV
Sbjct: 361 CSHTGLSSEGLYILELMEKSYDIKPSLEHYAVMIDMFGRENKLSEALDLISRAPNGSKHV 420

Query: 451 GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM 510
           GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM
Sbjct: 421 GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM 480

Query: 511 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIIGYMALDDGIY 570
           EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNI GYMALDDGIY
Sbjct: 481 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIFGYMALDDGIY 540

Query: 571 FYDGYST 577
           FYDGYST
Sbjct: 541 FYDGYST 547

BLAST of Cucsat.G2170 vs. ExPASy TrEMBL
Match: A0A6J1IJL9 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478028 PE=4 SV=1)

HSP 1 Score: 1013 bits (2620), Expect = 0.0
Identity = 497/574 (86.59%), Postives = 534/574 (93.03%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+PLS  FPSFDH A L SKCI+HKHL+VGMSLHSHLIK+ALSFD FLAN LIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANHLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLP +NIHSWNTILASYSRAGF SQAR +FDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVE+MNIF QMQQDFD L LDE T VSI GTCACLGALE LRQ+HGAAI IGLEFNM
Sbjct: 121 HGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQIHGAAIFIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNA+++AYGKCG+P  SYS+FSRM++RDVVTWTSMVVAY QTS+LDDAFRVF  MPVK
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINA VKNKYSNEALDLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           G+IIRRSS+LNFPNVY+CNAL+DLYSKSGD+KSAR LFNL+ +KDVVSWNSLITGFAQNG
Sbjct: 301 GIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREAL+A+R+M EVGI+PN+VTFL VLSACSHTGLSSEGL I+E MEK  DI+PSL+HY
Sbjct: 361 LGREALIAYRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMESMEKSNDIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLISRAPN SKH+GIWGAVLGACRIH+NLDLAIRAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDG 574
           SQMGEIYELMFILL+HM   GYM LDDGIYFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKFGYMLLDDGIYFYDG 574

BLAST of Cucsat.G2170 vs. ExPASy TrEMBL
Match: A0A6J1BT15 (pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charantia OX=3673 GN=LOC111005503 PE=4 SV=1)

HSP 1 Score: 1004 bits (2597), Expect = 0.0
Identity = 493/576 (85.59%), Postives = 533/576 (92.53%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVPL+D+FP+FDHCARL SKCI+HKHL+VGMSLHSHLIKTALS+DLFLANRLIDMYSKCN
Sbjct: 1   MVPLADIFPAFDHCARLISKCIKHKHLKVGMSLHSHLIKTALSYDLFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLPIRN+HSWNTILA Y+R G  SQARK FDEMPHPNI+SYNTLI SFT 
Sbjct: 61  SMENAQKAFDDLPIRNVHSWNTILALYTRIGCLSQARKFFDEMPHPNIISYNTLIYSFTR 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVESMNIFR+MQQDFDLL LDE TLVSIAGTCACLGAL  LRQ+HGAAIVIGLEFN+
Sbjct: 121 HGLYVESMNIFRKMQQDFDLLVLDEFTLVSIAGTCACLGALALLRQIHGAAIVIGLEFNV 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IV NAI+DAYGKCG+PD SYSIFS+M+ERDVVTWTSMVVAY QTSRLDDAFRVFSCMP+K
Sbjct: 181 IVSNAIIDAYGKCGEPDTSYSIFSQMQERDVVTWTSMVVAYAQTSRLDDAFRVFSCMPMK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINA  KNKYSNEALDLF+QMLEEK S N+FTFVGVLSACADLALIAKGK+IH
Sbjct: 241 NVHTWTALINAFAKNKYSNEALDLFEQMLEEKISLNSFTFVGVLSACADLALIAKGKQIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
           GLIIR S  LNF NVY+ NALID+YSKSGD+KSAR LFNL+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRSSCSLNFLNVYIYNALIDMYSKSGDMKSARTLFNLMPEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LG+EAL+AFR+M EVGIRPNKVTFL VLSACSHTGL SEGL +LELMEKF+ I+PSL+HY
Sbjct: 361 LGKEALIAFRRMIEVGIRPNKVTFLGVLSACSHTGLLSEGLYLLELMEKFFGIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLI+RAPN S HVGIWGAVLGACR+HENLDLA+ AAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLIARAPNRSNHVGIWGAVLGACRMHENLDLAMSAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVML+N+FAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRN  HKFVARDNSH
Sbjct: 481 PDNAGRYVMLANIFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNRGHKFVARDNSH 540

Query: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYS 576
           SQMGEIYELMFILL+HM   G M  D+GIYFYDGY 
Sbjct: 541 SQMGEIYELMFILLDHMKNFGCMPFDNGIYFYDGYG 576

BLAST of Cucsat.G2170 vs. ExPASy TrEMBL
Match: A0A6J1EJM2 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111433199 PE=3 SV=1)

HSP 1 Score: 934 bits (2413), Expect = 0.0
Identity = 463/541 (85.58%), Postives = 499/541 (92.24%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+PLS  FPSFDH A L SKCI+HKHL+VGMSLHSHLIK+ALSFD FLANRLIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANRLIDMYSKCN 60

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
           SMENAQKAFDDLP +NIHSWNTILASYSRAGF SQAR +FDEMPHPNIVSYNTLISSFTH
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
           HGLYVE+M+IF QMQQDFD L LDE T VSI GTCACLGALE LRQVHGAAI IGLEFNM
Sbjct: 121 HGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFIGLEFNM 180

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
           IVCNA+++AYGKCG+P  SYS+FSRM++RDVVTWTSMVVAY QTS+LDDAFRVF  MPVK
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
           NVHTWTALINA VKNKYSNEALDLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
            +IIRRSS+LNFPNVY+CNAL+DLYSKSGD+KSAR LFNL+ +KDVVSWNSLITGFAQNG
Sbjct: 301 AIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
           LGREAL+AFR+M EVGI+PN+VTFL VLSACSHTGLSSEGL I+ELMEK  DI+PSL+HY
Sbjct: 361 LGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIKPSLDHY 420

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
           AV+IDMFGR+NRLAEALDLISRAPN SKH+GIWGAVLGACRIH+NLDLAIRAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRN-----IRHKFVA 536
           PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVA S IEIRN     +R+ F A
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKLRNNFPA 540

BLAST of Cucsat.G2170 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 361.3 bits (926), Expect = 1.4e-99
Identity = 196/546 (35.90%), Postives = 308/546 (56.41%), Query Frame = 0

Query: 11  FDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSF-DLFLANRLIDMYSKCNSMENAQKAF 70
           FD  A L  +C   K L+ G  +H HL  T     +  L+N LI MY KC    +A K F
Sbjct: 46  FDLLASLLQQCGDTKSLKQGKWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVF 105

Query: 71  DDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMN 130
           D + +RN++SWN +++ Y ++G   +AR VFD MP  ++VS+NT++  +   G   E++ 
Sbjct: 106 DQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALW 165

Query: 131 IFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDA 190
            +++ ++    +  +E +   +   C     L+  RQ HG  +V G   N+++  +I+DA
Sbjct: 166 FYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDA 225

Query: 191 YGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALI 250
           Y KCG  +++   F  M  +D+  WT+++  Y +   ++ A ++F  MP KN  +WTALI
Sbjct: 226 YAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALI 285

Query: 251 NALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSE 310
              V+    N ALDLF++M+     P  FTF   L A A +A +  GKEIHG +IR +  
Sbjct: 286 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 345

Query: 311 LNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEK-DVVSWNSLITGFAQNGLGREALLA 370
              PN  V ++LID+YSKSG ++++  +F +  +K D V WN++I+  AQ+GLG +AL  
Sbjct: 346 ---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRM 405

Query: 371 FRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFG 430
              M +  ++PN+ T + +L+ACSH+GL  EGL   E M   + I P  EHYA +ID+ G
Sbjct: 406 LDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLG 465

Query: 431 RENRLAEALDLISRAP-NGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRY 490
           R     E +  I   P    KH  IW A+LG CRIH N +L  +AA+ L +++P+++  Y
Sbjct: 466 RAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKAADELIKLDPESSAPY 525

Query: 491 VMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIY 550
           ++LS+++A   +W     +R +M++R   KE A S IEI      F   D SH+   +  
Sbjct: 526 ILLSSIYADHGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKVEAFTVSDGSHAHARK-E 583

Query: 551 ELMFIL 554
           E+ FIL
Sbjct: 586 EIYFIL 583

BLAST of Cucsat.G2170 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 355.9 bits (912), Expect = 6.1e-98
Identity = 209/649 (32.20%), Postives = 324/649 (49.92%), Query Frame = 0

Query: 15  ARLFSKCIQHKHLRVGMS-LHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLP 74
           A+L   CI+ K   + +  +H+ +IK+  S ++F+ NRLID YSKC S+E+ ++ FD +P
Sbjct: 23  AKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 82

Query: 75  IRNIHSWNTILAS----------------------------------------------- 134
            RNI++WN+++                                                 
Sbjct: 83  QRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAM 142

Query: 135 ------------------------------------------------------YSRAGF 194
                                                                 YS+ G 
Sbjct: 143 MHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN 202

Query: 195 FSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIA 254
            + A++VFDEM   N+VS+N+LI+ F  +G  VE++++F+ M +    +  DE+TL S+ 
Sbjct: 203 VNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE--SRVEPDEVTLASVI 262

Query: 255 GTCACLGALEFLRQVHGAAIVIG-LEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDV 314
             CA L A++  ++VHG  +    L  ++I+ NA VD Y KC     +  IF  M  R+V
Sbjct: 263 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 322

Query: 315 VTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEE 374
           +  TSM+  Y   +    A  +F+ M  +NV +W ALI    +N  + EAL LF  +  E
Sbjct: 323 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 382

Query: 375 KTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNF---PNVYVCNALIDLYSKS 434
              P  ++F  +L ACADLA +  G + H  +++   +       +++V N+LID+Y K 
Sbjct: 383 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKC 442

Query: 435 GDVKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVL 494
           G V+   ++F  ++E+D VSWN++I GFAQNG G EAL  FR+M E G +P+ +T + VL
Sbjct: 443 GCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 502

Query: 495 SACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRAPNGSK 554
           SAC H G   EG      M + + + P  +HY  M+D+ GR   L EA  +I   P    
Sbjct: 503 SACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPD 562

Query: 555 HVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRK 558
            V IWG++L AC++H N+ L    AE L E+EP N+G YV+LSN++A   +W D  NVRK
Sbjct: 563 SV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRK 622

BLAST of Cucsat.G2170 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 351.3 bits (900), Expect = 1.5e-96
Identity = 186/540 (34.44%), Postives = 316/540 (58.52%), Query Frame = 0

Query: 27  LRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDLPIRNIHSWNTILAS 86
           +  G  +HS ++K  L  ++ ++N L++MY+KC     A+  FD + +R+I SWN ++A 
Sbjct: 162 METGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIAL 221

Query: 87  YSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYVESMNIFRQMQQDFDLLALDEI 146
           + + G    A   F++M   +IV++N++IS F   G  + +++IF +M +D  LL+ D  
Sbjct: 222 HMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD-SLLSPDRF 281

Query: 147 TLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNMIVCNAIVDAYGKCGDPDASYSIFSRM 206
           TL S+   CA L  L   +Q+H   +  G + + IV NA++  Y +CG  + +  +  + 
Sbjct: 282 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 341

Query: 207 KERD--VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDL 266
             +D  +  +T+++  Y +   ++ A  +F  +  ++V  WTA+I    ++    EA++L
Sbjct: 342 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 401

Query: 267 FQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSELNFPNVYVCNALIDL 326
           F+ M+     PN++T   +LS  + LA ++ GK+IHG  + +S E+   +V V NALI +
Sbjct: 402 FRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAV-KSGEIY--SVSVSNALITM 461

Query: 327 YSKSGDVKSARMLFNLI-LEKDVVSWNSLITGFAQNGLGREALLAFRKMTEVGIRPNKVT 386
           Y+K+G++ SA   F+LI  E+D VSW S+I   AQ+G   EAL  F  M   G+RP+ +T
Sbjct: 462 YAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHIT 521

Query: 387 FLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHYAVMIDMFGRENRLAEALDLISRA 446
           ++ V SAC+H GL ++G    ++M+    I P+L HYA M+D+FGR   L EA + I + 
Sbjct: 522 YVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKM 581

Query: 447 PNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDA 506
           P     V  WG++L ACR+H+N+DL   AAE L  +EP+N+G Y  L+N+++A  +W +A
Sbjct: 582 PI-EPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEA 641

Query: 507 HNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIIGYM 564
             +RK M++   KKE  +S IE+++  H F   D +H +  EIY  M  + + +  +GY+
Sbjct: 642 AKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 696

BLAST of Cucsat.G2170 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 343.2 bits (879), Expect = 4.1e-94
Identity = 201/628 (32.01%), Postives = 326/628 (51.91%), Query Frame = 0

Query: 6   DLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENA 65
           D+ P   +   L   C     LRVG  +H  L+K+  S DLF    L +MY+KC  +  A
Sbjct: 130 DVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEA 189

Query: 66  QKAFDDLPIRNIHSWNTILASYS------------------------------------- 125
           +K FD +P R++ SWNTI+A YS                                     
Sbjct: 190 RKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSAL 249

Query: 126 --------------RAGFFS-------------------QARKVFDEMPHPNIVSYNTLI 185
                         R+GF S                    AR++FD M   N+VS+N++I
Sbjct: 250 RLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMI 309

Query: 186 SSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIG 245
            ++  +    E+M IF++M  +   +   +++++     CA LG LE  R +H  ++ +G
Sbjct: 310 DAYVQNENPKEAMLIFQKMLDEG--VKPTDVSVMGALHACADLGDLERGRFIHKLSVELG 369

Query: 246 LEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFS 305
           L+ N+ V N+++  Y KC + D + S+F +++ R +V+W +M++ + Q  R         
Sbjct: 370 LDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR--------- 429

Query: 306 CMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAK 365
             P+                    +AL+ F QM      P+ FT+V V++A A+L++   
Sbjct: 430 --PI--------------------DALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 489

Query: 366 GKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITG 425
            K IHG+++R   +    NV+V  AL+D+Y+K G +  AR++F+++ E+ V +WN++I G
Sbjct: 490 AKWIHGVVMRSCLD---KNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDG 549

Query: 426 FAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEP 485
           +  +G G+ AL  F +M +  I+PN VTFL+V+SACSH+GL   GL    +M++ Y IE 
Sbjct: 550 YGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIEL 609

Query: 486 SLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAET 545
           S++HY  M+D+ GR  RL EA D I + P     V ++GA+LGAC+IH+N++ A +AAE 
Sbjct: 610 SMDHYGAMVDLLGRAGRLNEAWDFIMQMP-VKPAVNVYGAMLGACQIHKNVNFAEKAAER 669

Query: 546 LFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVA 564
           LFE+ PD+ G +V+L+N++ AAS W     VR  M  +G +K    S +EI+N  H F +
Sbjct: 670 LFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFS 720

BLAST of Cucsat.G2170 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 342.8 bits (878), Expect = 5.3e-94
Identity = 190/563 (33.75%), Postives = 305/563 (54.17%), Query Frame = 0

Query: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           ++P S  FP       +   C + K  + G  +H H++K     DL++   LI MY +  
Sbjct: 130 LLPNSYTFPF------VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNG 189

Query: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120
            +E+A K FD  P R++ S+  ++  Y+  G+   A+K+FDE+P  ++VS+N +IS +  
Sbjct: 190 RLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAE 249

Query: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180
            G Y E++ +F+ M +    +  DE T+V++   CA  G++E  RQVH      G   N+
Sbjct: 250 TGNYKEALELFKDMMK--TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 309

Query: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240
            + NA++D Y KCG+ + +  +F R+  +DV++W +++  Y                   
Sbjct: 310 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTH----------------- 369

Query: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300
                         N Y  EAL LFQ+ML    +PN  T + +L ACA L  I  G+ IH
Sbjct: 370 -------------MNLY-KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIH 429

Query: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360
             I +R   +   +  +  +LID+Y+K GD+++A  +FN IL K + SWN++I GFA +G
Sbjct: 430 VYIDKRLKGVTNAS-SLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHG 489

Query: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420
               +   F +M ++GI+P+ +TF+ +LSACSH+G+   G  I   M + Y + P LEHY
Sbjct: 490 RADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHY 549

Query: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480
             MID+ G      EA ++I+        V IW ++L AC++H N++L    AE L ++E
Sbjct: 550 GCMIDLLGHSGLFKEAEEMINMMEMEPDGV-IWCSLLKACKMHGNVELGESFAENLIKIE 609

Query: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           P+N G YV+LSN++A+A RW +    R L+ ++G KK    S IEI ++ H+F+  D  H
Sbjct: 610 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 651

Query: 541 SQMGEIY---ELMFILLEHMNII 561
            +  EIY   E M +LLE    +
Sbjct: 670 PRNREIYGMLEEMEVLLEKAGFV 651

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SKQ42.0e-9835.90Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX... [more]
Q9SIT78.5e-9732.20Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Q9SHZ82.1e-9534.44Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q3E6Q15.7e-9332.01Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9LN017.5e-9333.75Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_031745241.10.0100.00pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Cucumis sativu... [more]
XP_038882958.10.093.07pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Benincasa hisp... [more]
KAA0025198.10.096.53pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK07468... [more]
XP_022977857.10.086.59pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita... [more]
KAG6604304.10.086.59Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A0A0KFI00.0100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1[more]
A0A5D3C8H50.096.53Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1IJL90.086.59pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbi... [more]
A0A6J1BT150.085.59pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charanti... [more]
A0A6J1EJM20.085.58pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbi... [more]
Match NameE-valueIdentityDescription
AT2G21090.11.4e-9935.90Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G13600.16.1e-9832.20Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G22070.11.5e-9634.44pentatricopeptide (PPR) repeat-containing protein [more]
AT1G11290.14.1e-9432.01Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.15.3e-9433.75Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (B10) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025846PMR5 N-terminal domainPFAMPF14416PMR5Ncoord: 62..115
e-value: 4.2E-14
score: 52.6
IPR026057PC-EsterasePFAMPF13839PC-Esterasecoord: 116..400
e-value: 1.4E-79
score: 267.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..28
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..39
NoneNo IPR availablePANTHERPTHR13533N-ACETYLNEURAMINATE 9-O-ACETYLTRANSFERASEcoord: 25..402
NoneNo IPR availablePANTHERPTHR13533:SF16PROTEIN TRICHOME BIREFRINGENCE-LIKE 14-RELATEDcoord: 25..402

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsat.G2170.T2Cucsat.G2170.T2mRNA
Cucsat.G2170.T1Cucsat.G2170.T1mRNA
Cucsat.G2170.T9Cucsat.G2170.T9mRNA
Cucsat.G2170.T4Cucsat.G2170.T4mRNA
Cucsat.G2170.T6Cucsat.G2170.T6mRNA
Cucsat.G2170.T3Cucsat.G2170.T3mRNA
Cucsat.G2170.T7Cucsat.G2170.T7mRNA
Cucsat.G2170.T5Cucsat.G2170.T5mRNA
Cucsat.G2170.T8Cucsat.G2170.T8mRNA
Cucsat.G2170.T10Cucsat.G2170.T10mRNA
Cucsat.G2170.T12Cucsat.G2170.T12mRNA
Cucsat.G2170.T11Cucsat.G2170.T11mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009834 plant-type secondary cell wall biogenesis
biological_process GO:1990937 xylan acetylation
biological_process GO:0045492 xylan biosynthetic process
biological_process GO:0010411 xyloglucan metabolic process
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016407 acetyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding