CmaCh17G012490 (gene) Cucurbita maxima (Rimu)

NameCmaCh17G012490
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein
LocationCma_Chr17 : 8534551 .. 8535628 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGTTCATCTTCTTCGGCGCCTTGGAAGGGCTGGCATGGTTGATGAAGCGCTTGCTGCGTTTTCCAAATTTGATTCACACACCAAAAAACACAAATGTTCGTAATGTAATCATTGATTTGCTTCTGAAATCTGGGCGACTCGACAGTGCATTGAATGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCTAATCATATTAATTTCACTAAATTTTTGAAGTTAAATGAGTCGGAGGGGAGAGTGAAGGAAGATGAATTCGCTGGGTTGGTCGCCGAATTTGGTGAACACCATGTTGTTCCTAATCCTATTACATTAACGCAATTGATCTCCATGCTTTGTAGGAGTAGAAATACAGATCTTGCTTGGAATGTTTTGGATGATGTGATTATGCGGAATGGTCTTAAGGATGCTGCGCCCTGCAATGCGCTCTTGACGGGATTGGGAAAGAAGAGGGAATTTGAGAAGATGAATCTGCTGATAAGAAAGATGAAAGACATGAACATCCAGCCTAATGTTATTACTTTTGGTATTCTTATCAACCATTTGTGTAATTCAGAAGGATTGATGATGCACTAGAGGTGTTCGACAAAATGAAAGGGGAAAAAAAGAAGGCGAGGGTCGTCGCACCCGACGCAATCACGTACAATACTTTGATCGATGGGCTGTGTAAGGCGGGCGAAATTGAGACAGCCCATGAGCTCTTGAATGAGATGGTCACTGAACAACTTGCACCAAATGTGATCACCATAAATACTTTAGTCGATGGAATGTGCAAGCATGGTAGAATAAGCACTCTGTTACTTACACGGTGTTCATTAATGCGTTTTGCAAAGTCGACGATATCGATAAGGCGATGGAATTTTTGGATGAAATGTCGAAAGCTGGATGTTATCCTGATGCCATTGTCTATTATACTTTGATATGTGGCTTAGCACAAGGTGGAAGGTTGGATGATGCGAGCTTTATTACGTCGAAGTTGAAAGAGGCGGGATTCCGTTTAGATCTCGTATGCTACGATCTTCTTATCAGTGAGTTCGGTAAGAAGAACAAGATAGATATTTTATAA

mRNA sequence

ATGCTGTTCATCTTCTTCGGCGCCTTGGAAGGGCTGGCATGGTTGATGAAGCGCTTGCTGCGTTTTCCAAATTTGATTCACACACCAAAAAACACAAATGTTCGTAATGTAATCATTGATTTGCTTCTGAAATCTGGGCGACTCGACAGTGCATTGAATGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCTAATCATATTAATTTCACTAAATTTTTGAAGTTAAATGAGTCGGAGGGGAGAGTGAAGGAAGATGAATTCGCTGGGTTGGTCGCCGAATTTGGTGAACACCATGTTGTTCCTAATCCTATTACATTAACGCAATTGATCTCCATGCTTTGTAGGAGTAGAAATACAGATCTTGCTTGGAATGTTTTGGATGATGTGATTATGCGGAATGGTCTTAAGGATGCTGCGCCCTGCAATGCGCTCTTGACGGGATTGGGAAAGAAGAGGGAATTTGAGAAGATGAATCTGCTGATAAGAAAGATGAAAGACATGAACATCCAGCCTAATGTTATTACTTTTGGTATTCTTATCAACCATTTAAGGATTGATGATGCACTAGAGGTGTTCGACAAAATGAAAGGGGAAAAAAAGAAGGCGAGGGTCGTCGCACCCGACGCAATCACGTACAATACTTTGATCGATGGGCTGTGTAAGGCGGGCGAAATTGAGACAGCCCATGAGCTCTTGAATGAGATGGTCACTGAACAACTTGCACCAAATAATAAGCACTCTGTTACTTACACGGTGTTCATTAATGCGTTTTGCAAAGTCGACGATATCGATAAGGCGATGGAATTTTTGGATGAAATGTCGAAAGCTGGATGTTATCCTGATGCCATTGTCTATTATACTTTGATATGTGGCTTAGCACAAGGTGGAAGGTTGGATGATGCGAGCTTTATTACGTCGAAGTTGAAAGAGGCGGGATTCCGTTTAGATCTCGTATGCTACGATCTTCTTATCAGTGAGTTCGGTAAGAAGAACAAGATAGATATTTTATAA

Coding sequence (CDS)

ATGCTGTTCATCTTCTTCGGCGCCTTGGAAGGGCTGGCATGGTTGATGAAGCGCTTGCTGCGTTTTCCAAATTTGATTCACACACCAAAAAACACAAATGTTCGTAATGTAATCATTGATTTGCTTCTGAAATCTGGGCGACTCGACAGTGCATTGAATGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCTAATCATATTAATTTCACTAAATTTTTGAAGTTAAATGAGTCGGAGGGGAGAGTGAAGGAAGATGAATTCGCTGGGTTGGTCGCCGAATTTGGTGAACACCATGTTGTTCCTAATCCTATTACATTAACGCAATTGATCTCCATGCTTTGTAGGAGTAGAAATACAGATCTTGCTTGGAATGTTTTGGATGATGTGATTATGCGGAATGGTCTTAAGGATGCTGCGCCCTGCAATGCGCTCTTGACGGGATTGGGAAAGAAGAGGGAATTTGAGAAGATGAATCTGCTGATAAGAAAGATGAAAGACATGAACATCCAGCCTAATGTTATTACTTTTGGTATTCTTATCAACCATTTAAGGATTGATGATGCACTAGAGGTGTTCGACAAAATGAAAGGGGAAAAAAAGAAGGCGAGGGTCGTCGCACCCGACGCAATCACGTACAATACTTTGATCGATGGGCTGTGTAAGGCGGGCGAAATTGAGACAGCCCATGAGCTCTTGAATGAGATGGTCACTGAACAACTTGCACCAAATAATAAGCACTCTGTTACTTACACGGTGTTCATTAATGCGTTTTGCAAAGTCGACGATATCGATAAGGCGATGGAATTTTTGGATGAAATGTCGAAAGCTGGATGTTATCCTGATGCCATTGTCTATTATACTTTGATATGTGGCTTAGCACAAGGTGGAAGGTTGGATGATGCGAGCTTTATTACGTCGAAGTTGAAAGAGGCGGGATTCCGTTTAGATCTCGTATGCTACGATCTTCTTATCAGTGAGTTCGGTAAGAAGAACAAGATAGATATTTTATAA

Protein sequence

MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHLRIDDALEVFDKMKGEKKKARVVAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEFGKKNKIDIL
BLAST of CmaCh17G012490 vs. Swiss-Prot
Match: PP292_ARATH (Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidopsis thaliana GN=At3g61520 PE=2 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.4e-51
Identity = 123/335 (36.72%), Postives = 199/335 (59.40%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG +     + + +L +  L    KN+ VRNV++D+LL++G +D A  VLDEML 
Sbjct: 157 LLIRWFGRM---GMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQ 216

Query: 61  PGSEFRPNHINFTKFLKLNES-EGR-VKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCR 120
             S F PN I  T  + L+E  +GR + E++   L++ F  H V PN + LT+ IS LC+
Sbjct: 217 KESVFPPNRI--TADIVLHEVWKGRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCK 276

Query: 121 SRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVIT 180
           +   + AW++L D++      +A P NALL+ LG+  +  +MN L+ KM ++ I+P+V+T
Sbjct: 277 NARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVT 336

Query: 181 FGILINHL----RIDDALEVFDKMKGEK-KKARVVAPDAITYNTLIDGLCKAGEIETAHE 240
            GILIN L    R+D+ALEVF+KM+G++     V+  D+I +NTLIDGLCK G ++ A E
Sbjct: 337 LGILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEE 396

Query: 241 LLNEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLI 300
           LL  M + E+ APN   +VTY   I+ +C+   ++ A E +  M +    P+ +   T++
Sbjct: 397 LLVRMKLEERCAPN---AVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIV 456

Query: 301 CGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLI 328
            G+ +   L+ A      +++ G + ++V Y  LI
Sbjct: 457 GGMCRHHGLNMAVVFFMDMEKEGVKGNVVTYMTLI 483

BLAST of CmaCh17G012490 vs. Swiss-Prot
Match: PP401_ARATH (Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana GN=At5g28460 PE=2 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 4.0e-51
Identity = 118/333 (35.44%), Postives = 192/333 (57.66%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG +     + + +L +  L    KN+ VRNV++D+LL++G +D A  VLDEML 
Sbjct: 157 LLIRWFGRM---GMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQ 216

Query: 61  PGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSR 120
             S F PN I     L     E  + E++   L++ F  H V PN + LT+ IS LC++ 
Sbjct: 217 KESVFPPNRITADIVLHEVWKERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 276

Query: 121 NTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFG 180
             + AW++L D++      +A P NALL+ LG+  +  +MN L+ KM ++ I+P+V+T G
Sbjct: 277 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 336

Query: 181 ILINHL----RIDDALEVFDKMKGEK-KKARVVAPDAITYNTLIDGLCKAGEIETAHELL 240
           ILIN L    R+D+ALEVF++M+G++     V+  D+I +NTLIDGLCK G ++ A ELL
Sbjct: 337 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 396

Query: 241 NEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICG 300
             M + E+  PN   +VTY   I+ +C+   ++ A E +  M +    P+ +   T++ G
Sbjct: 397 VRMKLEERCVPN---AVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGG 456

Query: 301 LAQGGRLDDASFITSKLKEAGFRLDLVCYDLLI 328
           + +   L+ A      +++ G + ++V Y  LI
Sbjct: 457 MCRHHGLNMAVVFFMDMEKEGVKGNVVTYMTLI 483

BLAST of CmaCh17G012490 vs. Swiss-Prot
Match: PPR37_ARATH (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 7.1e-32
Identity = 91/305 (29.84%), Postives = 154/305 (50.49%), Query Frame = 1

Query: 36  NVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVA 95
           ++IID L K G LD+A N+ +EM + G  F+ + I +T  ++     GR   D+ A L+ 
Sbjct: 251 SIIIDGLCKDGSLDNAFNLFNEMEIKG--FKADIIIYTTLIRGFCYAGRW--DDGAKLLR 310

Query: 96  EFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKR 155
           +  +  + P+ +  + LI    +      A  +  ++I R    D     +L+ G  K+ 
Sbjct: 311 DMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLIDGFCKEN 370

Query: 156 EFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDALEVFDKMKGEKKKARVVAPD 215
           + +K N ++  M      PN+ TF ILIN       IDD LE+F KM       R V  D
Sbjct: 371 QLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMS-----LRGVVAD 430

Query: 216 AITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAME 275
            +TYNTLI G C+ G++E A EL  EMV+ ++ P+    V+Y + ++  C   + +KA+E
Sbjct: 431 TVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPD---IVSYKILLDGLCDNGEPEKALE 490

Query: 276 FLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEFG 335
             +++ K+    D  +Y  +I G+    ++DDA  +   L   G + D+  Y+++I    
Sbjct: 491 IFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLC 543

Query: 336 KKNKI 337
           KK  +
Sbjct: 551 KKGSL 543

BLAST of CmaCh17G012490 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 1.6e-31
Identity = 90/305 (29.51%), Postives = 157/305 (51.48%), Query Frame = 1

Query: 36  NVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKE-DEFAGLV 95
           N +I  L K G +  A+ VLD+M+    +  PN + +   +     E +V+E  E A ++
Sbjct: 334 NSVISGLCKLGEVKEAVEVLDQMIT--RDCSPNTVTYNTLISTLCKENQVEEATELARVL 393

Query: 96  AEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKK 155
              G   ++P+  T   LI  LC +RN  +A  + +++  +    D    N L+  L  K
Sbjct: 394 TSKG---ILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSK 453

Query: 156 REFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDALEVFDKMKGEKKKARVVAP 215
            + ++   ++++M+      +VIT+  LI+      +  +A E+FD+M+        V+ 
Sbjct: 454 GKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEME-----VHGVSR 513

Query: 216 DAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAM 275
           +++TYNTLIDGLCK+  +E A +L+++M+ E   P+     TY   +  FC+  DI KA 
Sbjct: 514 NSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDK---YTYNSLLTHFCRGGDIKKAA 573

Query: 276 EFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEF 335
           + +  M+  GC PD + Y TLI GL + GR++ AS +   ++  G  L    Y+ +I   
Sbjct: 574 DIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGL 625

BLAST of CmaCh17G012490 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.8e-30
Identity = 83/300 (27.67%), Postives = 148/300 (49.33%), Query Frame = 1

Query: 36  NVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVA 95
           NV+I    K+G +++AL+VLD M +      P+ + +   L+     G++K+     ++ 
Sbjct: 176 NVMISGYCKAGEINNALSVLDRMSVS-----PDVVTYNTILRSLCDSGKLKQA--MEVLD 235

Query: 96  EFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKR 155
              +    P+ IT T LI   CR      A  +LD++  R    D    N L+ G+ K+ 
Sbjct: 236 RMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEG 295

Query: 156 EFEKMNLLIRKMKDMNIQPNVITFGILINHLRIDDALEVFDKMKGEKKKARVVAPDAITY 215
             ++    +  M     QPNVIT  I++  +         +K+  +  + +  +P  +T+
Sbjct: 296 RLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLR-KGFSPSVVTF 355

Query: 216 NTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDE 275
           N LI+ LC+ G +  A ++L +M      PN   S++Y   ++ FCK   +D+A+E+L+ 
Sbjct: 356 NILINFLCRKGLLGRAIDILEKMPQHGCQPN---SLSYNPLLHGFCKEKKMDRAIEYLER 415

Query: 276 MSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEFGKKNK 335
           M   GCYPD + Y T++  L + G+++DA  I ++L   G    L+ Y+ +I    K  K
Sbjct: 416 MVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGK 464

BLAST of CmaCh17G012490 vs. TrEMBL
Match: A0A0A0K547_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336570 PE=4 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 2.3e-90
Identity = 184/328 (56.10%), Postives = 232/328 (70.73%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNH----INFTK 78
           L  F  L    KNTNVRN II+LLLKSGR+D+A+NVLDEMLLP SEFRPN     I F  
Sbjct: 26  LAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMNVLDEMLLPESEFRPNDKTAGIVFNN 85

Query: 79  FLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIM 138
            LK++  EGRVKEDE AGLV++FG+H++ P+ I LTQLIS LCRS NT+LAWN+LD+++M
Sbjct: 86  LLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIALTQLISKLCRSGNTNLAWNILDNLMM 145

Query: 139 RNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDD 198
            NGLKDAAPCNALLTGLGK REF KMNLL+RKMKDMNIQP VITFGILINHL    RIDD
Sbjct: 146 LNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKDMNIQPTVITFGILINHLCKFRRIDD 205

Query: 199 ALEVFDKMKGEKKKARV-VAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKH 258
           ALEVF+KMKGEK++ +V VAPD I YNTLIDGLCK G  E A  L+ +M ++Q AP    
Sbjct: 206 ALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCKVGRQEEALCLMGKMRSDQCAPT--- 265

Query: 259 SVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITS 318
           + T+   IN +C+  +I+ A +  +EM  A   P+ I   TL+ G+ +  R+  A     
Sbjct: 266 TATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVITLNTLVDGMCKHNRISTAVEFFR 325

Query: 319 KLKEAGFRLDLVCYDLLISEFGKKNKID 338
            +++ G + + V Y + I+ F   N ++
Sbjct: 326 VMQQKGLKGNNVTYTVFINAFCNVNNMN 350

BLAST of CmaCh17G012490 vs. TrEMBL
Match: D7UCE3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02180 PE=4 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 3.2e-71
Identity = 148/326 (45.40%), Postives = 208/326 (63.80%), Query Frame = 1

Query: 20  LRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPN----HINFTKF 79
           L +  L  + + T++RN++ID+L + GR+D AL++LDEML P +EF PN    HI F+  
Sbjct: 180 LVYNELCPSRRLTHIRNILIDVLFRKGRVDDALHLLDEMLQPKAEFPPNSNTGHIVFSAL 239

Query: 80  LKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMR 139
            K ++    V E+E  GLV++F EH V PN I LTQLIS LCRS  TD AW+VL  ++  
Sbjct: 240 SKRDKVGRAVDEEEIVGLVSKFAEHEVFPNSIWLTQLISRLCRSGRTDRAWDVLHGLMKL 299

Query: 140 NGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDA 199
            G+ +AA CNALLT LG+ REF++MN L+ +MK+M+IQPNV+TFGILINHL    R+D+A
Sbjct: 300 GGVMEAASCNALLTALGRAREFKRMNTLLAEMKEMDIQPNVVTFGILINHLCKFRRVDEA 359

Query: 200 LEVFDKMKGEKKKARVVAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSV 259
           LEVF+KM G +    +V PD ITYNTLIDGLCK G  E    L+  M ++     N  +V
Sbjct: 360 LEVFEKMNGGESNGFLVEPDVITYNTLIDGLCKVGRQEEGLGLVERMRSQPRCMPN--TV 419

Query: 260 TYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKL 319
           TY   I+ +CK   I+ A E  D+M+K G  P+ +   TL+ G+ + GR++ A    +++
Sbjct: 420 TYNCLIDGYCKASMIEAARELFDQMNKDGVPPNVVTLNTLVDGMCKHGRINGAVEFFNEM 479

Query: 320 KEAGFRLDLVCYDLLISEFGKKNKID 338
           +  G + + V Y  LI  F   N I+
Sbjct: 480 QGKGLKGNAVTYTALIRAFCNVNNIE 503

BLAST of CmaCh17G012490 vs. TrEMBL
Match: U5U729_CAMSI (Pentatricopeptide repeat-containing protein OS=Camellia sinensis PE=2 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 4.5e-65
Identity = 146/332 (43.98%), Postives = 204/332 (61.45%), Query Frame = 1

Query: 17  KRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFT--- 76
           K +L F  L    KNT++ N+I+D+LL++GR+D A  V DEML P SE  PN I      
Sbjct: 181 KSVLIFKELDPDLKNTHISNLIVDILLRAGRVDDAFQVFDEMLKPDSESPPNEITVNIAM 240

Query: 77  KFLKLNESEGR-VKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDV 136
             L   +  GR V ++E  GLV++FGE  V P+ + LTQLI+ LCR+  +D AW+V+ +V
Sbjct: 241 AGLLWRDRTGRSVSDEEIIGLVSKFGECGVFPSVVRLTQLITKLCRTGKSDRAWDVIHNV 300

Query: 137 IMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RI 196
           +   G   A  CNALL GLG+++ F+KMN L+ +MK+  IQP++ITFGIL+NHL    R+
Sbjct: 301 MKLGGDVQAPSCNALLAGLGRQQNFQKMNKLLAEMKENGIQPDIITFGILVNHLCKFRRV 360

Query: 197 DDALEVFDKMKGEKKK--ARVVAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQ-LAP 256
           D+ALEVF+KM GE++      V PD I YNTLIDGLCK G  E    LL +M  EQ  AP
Sbjct: 361 DEALEVFEKMSGERESDDGFSVKPDTILYNTLIDGLCKVGRQEQGLGLLEKMKLEQGCAP 420

Query: 257 NNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDAS 316
               + TY   I+ FCK  +I +A E  D+M+K G  P+ I   TL+ G+ + GR++ A 
Sbjct: 421 T---TATYNCLIDGFCKSGEIGRAHELFDQMNKEGVPPNVITLNTLVDGMCKHGRINSAM 480

Query: 317 FITSKLKEAGFRLDLVCYDLLISEFGKKNKID 338
              ++++  G + + V Y  LI+ F   N ID
Sbjct: 481 EFFNEMQGKGLKGNAVTYTALINAFCNANNID 509

BLAST of CmaCh17G012490 vs. TrEMBL
Match: A0A061DUV4_THECC (Pentatricopeptide repeat superfamily protein, putative OS=Theobroma cacao GN=TCM_005260 PE=4 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 2.2e-64
Identity = 153/347 (44.09%), Postives = 207/347 (59.65%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG LE    + + LL F  L  T KNT+VRNV+ID+ L+ GR+D ALNVLDEML 
Sbjct: 162 LLIRYFGRLE---MVDESLLIFNELDPTLKNTHVRNVLIDVSLRDGRVDYALNVLDEMLQ 221

Query: 61  PGSEFRPNHIN----FTKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISML 120
           P SE  PN +     F   +K      ++ E+E   LV +FGEH V P  I LTQLI+ L
Sbjct: 222 PLSEVPPNDVTGDIVFYGLVKRERKGRKLSEEEIIKLVLKFGEHSVFPRTIWLTQLITRL 281

Query: 121 CRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNV 180
           CRS   + AWNVL +++      +AAP NA+LTGLG+  + E+MN+L+ +MK+ +IQPN 
Sbjct: 282 CRSGKINQAWNVLQELLRLRAPLEAAPFNAVLTGLGRSGDVERMNMLLVEMKESDIQPNG 341

Query: 181 ITFGILINHL----RIDDALEVFDKM-KGEKKKARVVAPDAITYNTLIDGLCKAGEIETA 240
           +TFGILIN L    R+D+A+EV ++M +G       V  D ITYNTLIDGLCK G  E  
Sbjct: 342 VTFGILINQLCKSRRVDEAMEVLNRMGEGTGSDDVSVEADIITYNTLIDGLCKVGRQEEG 401

Query: 241 HELLNEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYT 300
             L+  M  T+ LAPN   +VTY   I+ FCKV +I++  E  D M + G  P+ I   T
Sbjct: 402 LRLMERMRCTKGLAPN---TVTYNCLIDGFCKVGEIERGKELYDRMKEEGVSPNVITLNT 461

Query: 301 LICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEFGKKNKID 338
           L+ G+ + GR   A    + ++  G + + V Y  LIS F   N ID
Sbjct: 462 LVDGMCRHGRTSSALEFFNDMQGKGLKGNAVTYTTLISAFCNVNNID 502

BLAST of CmaCh17G012490 vs. TrEMBL
Match: B9I8Q2_POPTR (Pentatricopeptide repeat-containing family protein (Fragment) OS=Populus trichocarpa GN=POPTR_0014s08611g PE=4 SV=2)

HSP 1 Score: 237.7 bits (605), Expect = 2.1e-59
Identity = 134/333 (40.24%), Postives = 198/333 (59.46%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPG--SEFRPNH----INF 78
           L+ F +L  + KNT +RNV + +LL+SGR+  AL V+DEM      S  RPN     I F
Sbjct: 134 LILFNDLDPSVKNTYLRNVWLSILLRSGRVKDALKVIDEMFESNDDSNCRPNDATGDILF 193

Query: 79  TKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDV 138
           +  LK   +E  + EDE   LV +FGEH V+ +   + +LI+ LCR+R T+  W++  ++
Sbjct: 194 SFLLKRERNEELLSEDEIVNLVLKFGEHGVLISSFWMGRLITRLCRNRKTNRGWDLFTEM 253

Query: 139 IMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RI 198
           I    + ++A CN+LLTGL ++  F +MN L+ KM +M+IQPNV+TFGILINH+    R+
Sbjct: 254 IKLGAVLESAACNSLLTGLAREGNFNRMNELMEKMVEMDIQPNVVTFGILINHMCKFRRV 313

Query: 199 DDALEVFDKMKGEKKKARV---VAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQ-LA 258
           DDALEV +KM G K+   +   V PD + YNTLIDGLCK G  +    L+  M +++  A
Sbjct: 314 DDALEVLEKMSGGKESGGISVSVEPDVVIYNTLIDGLCKVGRQQEGLGLMERMRSQKGCA 373

Query: 259 PNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDA 318
           P+   ++TY   I+ FCK  +I+K  E  DEM+K G  P+ +   TL+ G+ + GR+  A
Sbjct: 374 PD---TITYNCLIDGFCKAGEIEKGKELFDEMNKEGVAPNVVTVNTLVGGMCRTGRVSSA 433

Query: 319 SFITSKLKEAGFRLDLVCYDLLISEFGKKNKID 338
                + +  G + D V Y  LI+ F   N  +
Sbjct: 434 VNFFVEAQRRGMKGDAVTYTALINAFCNVNNFE 463

BLAST of CmaCh17G012490 vs. TAIR10
Match: AT3G61520.1 (AT3G61520.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 204.9 bits (520), Expect = 7.8e-53
Identity = 123/335 (36.72%), Postives = 199/335 (59.40%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG +     + + +L +  L    KN+ VRNV++D+LL++G +D A  VLDEML 
Sbjct: 157 LLIRWFGRM---GMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQ 216

Query: 61  PGSEFRPNHINFTKFLKLNES-EGR-VKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCR 120
             S F PN I  T  + L+E  +GR + E++   L++ F  H V PN + LT+ IS LC+
Sbjct: 217 KESVFPPNRI--TADIVLHEVWKGRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCK 276

Query: 121 SRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVIT 180
           +   + AW++L D++      +A P NALL+ LG+  +  +MN L+ KM ++ I+P+V+T
Sbjct: 277 NARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVT 336

Query: 181 FGILINHL----RIDDALEVFDKMKGEK-KKARVVAPDAITYNTLIDGLCKAGEIETAHE 240
            GILIN L    R+D+ALEVF+KM+G++     V+  D+I +NTLIDGLCK G ++ A E
Sbjct: 337 LGILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEE 396

Query: 241 LLNEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLI 300
           LL  M + E+ APN   +VTY   I+ +C+   ++ A E +  M +    P+ +   T++
Sbjct: 397 LLVRMKLEERCAPN---AVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIV 456

Query: 301 CGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLI 328
            G+ +   L+ A      +++ G + ++V Y  LI
Sbjct: 457 GGMCRHHGLNMAVVFFMDMEKEGVKGNVVTYMTLI 483

BLAST of CmaCh17G012490 vs. TAIR10
Match: AT5G28370.1 (AT5G28370.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 203.4 bits (516), Expect = 2.3e-52
Identity = 118/333 (35.44%), Postives = 192/333 (57.66%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG +     + + +L +  L    KN+ VRNV++D+LL++G +D A  VLDEML 
Sbjct: 157 LLIRWFGRM---GMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQ 216

Query: 61  PGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSR 120
             S F PN I     L     E  + E++   L++ F  H V PN + LT+ IS LC++ 
Sbjct: 217 KESVFPPNRITADIVLHEVWKERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 276

Query: 121 NTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFG 180
             + AW++L D++      +A P NALL+ LG+  +  +MN L+ KM ++ I+P+V+T G
Sbjct: 277 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 336

Query: 181 ILINHL----RIDDALEVFDKMKGEK-KKARVVAPDAITYNTLIDGLCKAGEIETAHELL 240
           ILIN L    R+D+ALEVF++M+G++     V+  D+I +NTLIDGLCK G ++ A ELL
Sbjct: 337 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 396

Query: 241 NEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICG 300
             M + E+  PN   +VTY   I+ +C+   ++ A E +  M +    P+ +   T++ G
Sbjct: 397 VRMKLEERCVPN---AVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGG 456

Query: 301 LAQGGRLDDASFITSKLKEAGFRLDLVCYDLLI 328
           + +   L+ A      +++ G + ++V Y  LI
Sbjct: 457 MCRHHGLNMAVVFFMDMEKEGVKGNVVTYMTLI 483

BLAST of CmaCh17G012490 vs. TAIR10
Match: AT5G28460.1 (AT5G28460.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 203.4 bits (516), Expect = 2.3e-52
Identity = 118/333 (35.44%), Postives = 192/333 (57.66%), Query Frame = 1

Query: 1   MLFIFFGALEGLAWLMKRLLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLL 60
           +L  +FG +     + + +L +  L    KN+ VRNV++D+LL++G +D A  VLDEML 
Sbjct: 157 LLIRWFGRM---GMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQ 216

Query: 61  PGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSR 120
             S F PN I     L     E  + E++   L++ F  H V PN + LT+ IS LC++ 
Sbjct: 217 KESVFPPNRITADIVLHEVWKERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 276

Query: 121 NTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFG 180
             + AW++L D++      +A P NALL+ LG+  +  +MN L+ KM ++ I+P+V+T G
Sbjct: 277 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 336

Query: 181 ILINHL----RIDDALEVFDKMKGEK-KKARVVAPDAITYNTLIDGLCKAGEIETAHELL 240
           ILIN L    R+D+ALEVF++M+G++     V+  D+I +NTLIDGLCK G ++ A ELL
Sbjct: 337 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 396

Query: 241 NEM-VTEQLAPNNKHSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICG 300
             M + E+  PN   +VTY   I+ +C+   ++ A E +  M +    P+ +   T++ G
Sbjct: 397 VRMKLEERCVPN---AVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGG 456

Query: 301 LAQGGRLDDASFITSKLKEAGFRLDLVCYDLLI 328
           + +   L+ A      +++ G + ++V Y  LI
Sbjct: 457 MCRHHGLNMAVVFFMDMEKEGVKGNVVTYMTLI 483

BLAST of CmaCh17G012490 vs. TAIR10
Match: AT1G12620.1 (AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 139.4 bits (350), Expect = 4.0e-33
Identity = 91/305 (29.84%), Postives = 154/305 (50.49%), Query Frame = 1

Query: 36  NVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKEDEFAGLVA 95
           ++IID L K G LD+A N+ +EM + G  F+ + I +T  ++     GR   D+ A L+ 
Sbjct: 251 SIIIDGLCKDGSLDNAFNLFNEMEIKG--FKADIIIYTTLIRGFCYAGRW--DDGAKLLR 310

Query: 96  EFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKKR 155
           +  +  + P+ +  + LI    +      A  +  ++I R    D     +L+ G  K+ 
Sbjct: 311 DMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLIDGFCKEN 370

Query: 156 EFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDALEVFDKMKGEKKKARVVAPD 215
           + +K N ++  M      PN+ TF ILIN       IDD LE+F KM       R V  D
Sbjct: 371 QLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMS-----LRGVVAD 430

Query: 216 AITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAME 275
            +TYNTLI G C+ G++E A EL  EMV+ ++ P+    V+Y + ++  C   + +KA+E
Sbjct: 431 TVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPD---IVSYKILLDGLCDNGEPEKALE 490

Query: 276 FLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEFG 335
             +++ K+    D  +Y  +I G+    ++DDA  +   L   G + D+  Y+++I    
Sbjct: 491 IFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLC 543

Query: 336 KKNKI 337
           KK  +
Sbjct: 551 KKGSL 543

BLAST of CmaCh17G012490 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 138.3 bits (347), Expect = 8.9e-33
Identity = 90/305 (29.51%), Postives = 157/305 (51.48%), Query Frame = 1

Query: 36  NVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHINFTKFLKLNESEGRVKE-DEFAGLV 95
           N +I  L K G +  A+ VLD+M+    +  PN + +   +     E +V+E  E A ++
Sbjct: 334 NSVISGLCKLGEVKEAVEVLDQMIT--RDCSPNTVTYNTLISTLCKENQVEEATELARVL 393

Query: 96  AEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMRNGLKDAAPCNALLTGLGKK 155
              G   ++P+  T   LI  LC +RN  +A  + +++  +    D    N L+  L  K
Sbjct: 394 TSKG---ILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSK 453

Query: 156 REFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDALEVFDKMKGEKKKARVVAP 215
            + ++   ++++M+      +VIT+  LI+      +  +A E+FD+M+        V+ 
Sbjct: 454 GKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEME-----VHGVSR 513

Query: 216 DAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSVTYTVFINAFCKVDDIDKAM 275
           +++TYNTLIDGLCK+  +E A +L+++M+ E   P+     TY   +  FC+  DI KA 
Sbjct: 514 NSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDK---YTYNSLLTHFCRGGDIKKAA 573

Query: 276 EFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKLKEAGFRLDLVCYDLLISEF 335
           + +  M+  GC PD + Y TLI GL + GR++ AS +   ++  G  L    Y+ +I   
Sbjct: 574 DIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGL 625

BLAST of CmaCh17G012490 vs. NCBI nr
Match: gi|778726868|ref|XP_011659175.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 340.5 bits (872), Expect = 3.4e-90
Identity = 184/328 (56.10%), Postives = 232/328 (70.73%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNH----INFTK 78
           L  F  L    KNTNVRN II+LLLKSGR+D+A+NVLDEMLLP SEFRPN     I F  
Sbjct: 26  LAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMNVLDEMLLPESEFRPNDKTAGIVFNN 85

Query: 79  FLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIM 138
            LK++  EGRVKEDE AGLV++FG+H++ P+ I LTQLIS LCRS NT+LAWN+LD+++M
Sbjct: 86  LLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIALTQLISKLCRSGNTNLAWNILDNLMM 145

Query: 139 RNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDD 198
            NGLKDAAPCNALLTGLGK REF KMNLL+RKMKDMNIQP VITFGILINHL    RIDD
Sbjct: 146 LNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKDMNIQPTVITFGILINHLCKFRRIDD 205

Query: 199 ALEVFDKMKGEKKKARV-VAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKH 258
           ALEVF+KMKGEK++ +V VAPD I YNTLIDGLCK G  E A  L+ +M ++Q AP    
Sbjct: 206 ALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCKVGRQEEALCLMGKMRSDQCAPT--- 265

Query: 259 SVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITS 318
           + T+   IN +C+  +I+ A +  +EM  A   P+ I   TL+ G+ +  R+  A     
Sbjct: 266 TATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVITLNTLVDGMCKHNRISTAVEFFR 325

Query: 319 KLKEAGFRLDLVCYDLLISEFGKKNKID 338
            +++ G + + V Y + I+ F   N ++
Sbjct: 326 VMQQKGLKGNNVTYTVFINAFCNVNNMN 350

BLAST of CmaCh17G012490 vs. NCBI nr
Match: gi|659092909|ref|XP_008447281.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucumis melo])

HSP 1 Score: 334.0 bits (855), Expect = 3.1e-88
Identity = 183/328 (55.79%), Postives = 227/328 (69.21%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNH----INFTK 78
           L  F  L    KNTNVRN II+LLLKSGR+D+ALNVL EMLLP SEFRPN     I F K
Sbjct: 26  LAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALNVLYEMLLPESEFRPNDKTAGIVFNK 85

Query: 79  FLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIM 138
            LK++ SEGR KEDE AGLV++FG++++ P+ I LTQLIS LCRS NT+LAWN+LD+++M
Sbjct: 86  MLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIALTQLISKLCRSGNTNLAWNILDNMMM 145

Query: 139 RNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDD 198
            NGLKDAAPCNALLTGLGK REF KMNLL+RKMKDMNIQP VITFGILIN+L    RIDD
Sbjct: 146 LNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKDMNIQPTVITFGILINYLCKFRRIDD 205

Query: 199 ALEVFDKMKGEKKKAR-VVAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKH 258
           ALEVF+KMKGEK++A  VVAPD I YNTLIDGLCK G  E    L+  M + Q AP    
Sbjct: 206 ALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCKVGRQEEGLRLMGTMRSGQCAPT--- 265

Query: 259 SVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITS 318
           + TY   IN +C+  +I+ A +   EM      P+ I   TL+ G+ +  R+  A     
Sbjct: 266 TATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVITLNTLVDGMCKHNRISTAVKFFR 325

Query: 319 KLKEAGFRLDLVCYDLLISEFGKKNKID 338
            +++ G + + V Y + I+ F   N ++
Sbjct: 326 DMQQKGLKGNNVTYTVFINAFCNVNNMN 350

BLAST of CmaCh17G012490 vs. NCBI nr
Match: gi|225454300|ref|XP_002275491.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial [Vitis vinifera])

HSP 1 Score: 276.9 bits (707), Expect = 4.6e-71
Identity = 148/326 (45.40%), Postives = 208/326 (63.80%), Query Frame = 1

Query: 20  LRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPN----HINFTKF 79
           L +  L  + + T++RN++ID+L + GR+D AL++LDEML P +EF PN    HI F+  
Sbjct: 180 LVYNELCPSRRLTHIRNILIDVLFRKGRVDDALHLLDEMLQPKAEFPPNSNTGHIVFSAL 239

Query: 80  LKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIMR 139
            K ++    V E+E  GLV++F EH V PN I LTQLIS LCRS  TD AW+VL  ++  
Sbjct: 240 SKRDKVGRAVDEEEIVGLVSKFAEHEVFPNSIWLTQLISRLCRSGRTDRAWDVLHGLMKL 299

Query: 140 NGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDDA 199
            G+ +AA CNALLT LG+ REF++MN L+ +MK+M+IQPNV+TFGILINHL    R+D+A
Sbjct: 300 GGVMEAASCNALLTALGRAREFKRMNTLLAEMKEMDIQPNVVTFGILINHLCKFRRVDEA 359

Query: 200 LEVFDKMKGEKKKARVVAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKHSV 259
           LEVF+KM G +    +V PD ITYNTLIDGLCK G  E    L+  M ++     N  +V
Sbjct: 360 LEVFEKMNGGESNGFLVEPDVITYNTLIDGLCKVGRQEEGLGLVERMRSQPRCMPN--TV 419

Query: 260 TYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITSKL 319
           TY   I+ +CK   I+ A E  D+M+K G  P+ +   TL+ G+ + GR++ A    +++
Sbjct: 420 TYNCLIDGYCKASMIEAARELFDQMNKDGVPPNVVTLNTLVDGMCKHGRINGAVEFFNEM 479

Query: 320 KEAGFRLDLVCYDLLISEFGKKNKID 338
           +  G + + V Y  LI  F   N I+
Sbjct: 480 QGKGLKGNAVTYTALIRAFCNVNNIE 503

BLAST of CmaCh17G012490 vs. NCBI nr
Match: gi|1009162715|ref|XP_015899583.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Ziziphus jujuba])

HSP 1 Score: 262.3 bits (669), Expect = 1.2e-66
Identity = 152/329 (46.20%), Postives = 204/329 (62.01%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHIN----FTK 78
           LL +  L  + KNT VR ++I  LL+ G +D A NVLDEML   +EF PN +     F K
Sbjct: 178 LLVYNELDPSSKNTPVRTLLIGELLRYGCVDDANNVLDEMLELNAEFPPNDVTGNVVFAK 237

Query: 79  FLKLNESEGRVKEDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIM 138
            LK       V ++E  GLV +FG+H V P  I LTQLIS LCR+  TD AW+VLD+V+ 
Sbjct: 238 LLKRERPGRHVSDEEIVGLVFKFGKHGVFPGKIQLTQLISRLCRNGKTDRAWDVLDNVMK 297

Query: 139 RNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDD 198
             GL +AA CNALLTGLG+K +F+++NLL+ +MK+ +I P+V+TFGI+IN+L    R+D+
Sbjct: 298 SGGLVEAASCNALLTGLGRKHDFKRINLLMAEMKERDIHPDVVTFGIVINYLCKSRRVDE 357

Query: 199 ALEVFDKMKGEKKKARV-VAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQ-LAPNNK 258
           ALEVF+KMKGE    R  V  D ITYNTLIDGLCK G  E   +L+  M  E   APN  
Sbjct: 358 ALEVFEKMKGEGDNDRFSVERDVITYNTLIDGLCKVGRQEEGLQLMKRMRLENGCAPN-- 417

Query: 259 HSVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFIT 318
            +VTY   I+ F KV +I++A E  DEM K    P+ I   TL+ G+ + GR+  A    
Sbjct: 418 -TVTYNCLIDGFNKVGEIERARELFDEMKKEKVPPNVITLNTLVDGMCKHGRVSSAVEFF 477

Query: 319 SKLKEAGFRLDLVCYDLLISEFGKKNKID 338
           ++ +  G + ++  Y  LIS F   N I+
Sbjct: 478 NEAQNDGLKANVFTYTNLISAFCNVNNIN 503

BLAST of CmaCh17G012490 vs. NCBI nr
Match: gi|802694924|ref|XP_012083305.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Jatropha curcas])

HSP 1 Score: 261.5 bits (667), Expect = 2.0e-66
Identity = 146/327 (44.65%), Postives = 207/327 (63.30%), Query Frame = 1

Query: 19  LLRFPNLIHTPKNTNVRNVIIDLLLKSGRLDSALNVLDEMLLPGSEFRPNHIN---FTKF 78
           L+ F  L ++ KNT+VRNV+IDLLL++GR++ A  VLDEMLLP  +  PN +       +
Sbjct: 169 LILFNELENSLKNTHVRNVLIDLLLRAGRVEDAFEVLDEMLLPEFDCPPNDVTGGTVFSW 228

Query: 79  LKLNESEGRVK-EDEFAGLVAEFGEHHVVPNPITLTQLISMLCRSRNTDLAWNVLDDVIM 138
           L   E  GR+  ++E   LV + GEH V PN I +TQLI +LCR+ N+D A N+L +++ 
Sbjct: 229 LMKKERLGRLATQEEIIELVLKLGEHGVFPNSILMTQLIVILCRNGNSDKACNLLLELMR 288

Query: 139 RNGLKDAAPCNALLTGLGKKREFEKMNLLIRKMKDMNIQPNVITFGILINHL----RIDD 198
                +AAPCNALLTGLG+ R+ ++MN ++ KMK+MN++PNVITFGILINHL    R+D+
Sbjct: 289 LGAALEAAPCNALLTGLGRDRDSDRMNKVMAKMKEMNVEPNVITFGILINHLCKSRRVDE 348

Query: 199 ALEVFDKMKGEKKKARV-VAPDAITYNTLIDGLCKAGEIETAHELLNEMVTEQLAPNNKH 258
           ALEVF KM G+K+   V V PD + +NTLIDGLCK G  E    LL  M  ++ +  N  
Sbjct: 349 ALEVFQKMNGDKENDGVKVEPDVVIFNTLIDGLCKVGRQEEGLALLGRMKLQKGSCPN-- 408

Query: 259 SVTYTVFINAFCKVDDIDKAMEFLDEMSKAGCYPDAIVYYTLICGLAQGGRLDDASFITS 318
           +VTY   I+ FCKV +I++ +E  DEM   G  P+A    TL+ G+ + GR + A     
Sbjct: 409 TVTYNCLIDGFCKVGEIERGLELFDEMKNEGVVPNASTINTLVDGMCKLGRTNSAIQFFD 468

Query: 319 KLKEAGFRLDLVCYDLLISEFGKKNKI 337
           +++  G + ++  Y  LI+ F   N I
Sbjct: 469 EMQSKGLKGNIYAYTSLINAFCNVNNI 493

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP292_ARATH1.4e-5136.72Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidop... [more]
PP401_ARATH4.0e-5135.44Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana GN... [more]
PPR37_ARATH7.1e-3229.84Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN... [more]
PP281_ARATH1.6e-3129.51Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
PPR28_ARATH1.8e-3027.67Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K547_CUCSA2.3e-9056.10Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336570 PE=4 SV=1[more]
D7UCE3_VITVI3.2e-7145.40Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02180 PE=4 SV=... [more]
U5U729_CAMSI4.5e-6543.98Pentatricopeptide repeat-containing protein OS=Camellia sinensis PE=2 SV=1[more]
A0A061DUV4_THECC2.2e-6444.09Pentatricopeptide repeat superfamily protein, putative OS=Theobroma cacao GN=TCM... [more]
B9I8Q2_POPTR2.1e-5940.24Pentatricopeptide repeat-containing family protein (Fragment) OS=Populus trichoc... [more]
Match NameE-valueIdentityDescription
AT3G61520.17.8e-5336.72 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G28370.12.3e-5235.44 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G28460.12.3e-5235.44 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G12620.14.0e-3329.84 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53700.18.9e-3329.51 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778726868|ref|XP_011659175.1|3.4e-9056.10PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-... [more]
gi|659092909|ref|XP_008447281.1|3.1e-8855.79PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-... [more]
gi|225454300|ref|XP_002275491.1|4.6e-7145.40PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial ... [more]
gi|1009162715|ref|XP_015899583.1|1.2e-6646.20PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-... [more]
gi|802694924|ref|XP_012083305.1|2.0e-6644.65PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh17G012490.1CmaCh17G012490.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 36..59
score: 0.0066coord: 108..133
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 209..239
score: 4.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 142..186
score: 8.5E-9coord: 250..296
score: 4.3
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 213..246
score: 2.1E-10coord: 251..285
score: 4.0E-11coord: 144..176
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 284..318
score: 9.098coord: 140..174
score: 9.547coord: 211..245
score: 13.252coord: 105..139
score: 7.541coord: 31..65
score: 8.473coord: 249..283
score: 12
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 29..85
score: 7.7E-4coord: 214..290
score: 7.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 36..337
score: 2.0