Csa4G242870 (gene) Cucumber (Chinese Long) v2

NameCsa4G242870
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat-containing protein; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
LocationChr4 : 10131737 .. 10133876 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTCTGTGCTTCAAGGAACCGGTCTCCCATTGTCAGCATTTCTTGTACGTGCAGTTCCTTCCCTTCATCTCTGTAAATTCAACAAGCTTCCCTCGTTTTAGAGTTCCAAAATTCCAGACGATGTTTTTCAAGATGGAGGTTTTCTGAACTACTTGAAGACTCCACCACTCCTTTTCGCTCCTAAAAATCTTTCTTTCTAATCGAGATCGTTAAACGAAGAAAAAGTTATGATCGATCGTCTGAGCCTAAATAGTTTCTTCTTGGTTCAACAATTGTGGATTTTGTCAAGTAGTTTAAGGTGATGGGTGGTGGAAGCCATGGCTGAAAACCCTAAAATGGGTGTAAAAAACCCTTCCAAAATTCCCCCCAGTTCTTCGCCACGGTCGCGCAATTCCCCACGCTTCCCTTTGCATCTAGACCTGCCGGACATATCGCCGGCTGCTAAAACCATTTGCGAAGTCCTGGTCAGAGTCTCACGGAATGAAGTAGACGGCGCGCTGTTGGCGACGGGATTGGCTCCGTCGCCGGAGCTTGTCCAAGAAGTTTTGAGGGTTTCTTATAACTCTCCGTCGTCGGCGATCAAGTTCTTCCGATGGGCGAGACAGTTGGCGAAACAGTCGGCATACTCGTGGAATCTGATGATTGATTTGTTGGGCAAAAACGAACTTTTCGAAGAAATGTGGAACGGCATTCGGACGATGAGGCAAGAAAAGATTCTTTCGTTGCCAACTTTTGTGTCGGTTTTTGGGAGTTATTGTTCTGCTGGCAGGTCTAAAGAAGCGAGAATGACCTTTGAAGTGATGGATAGGTACGAAGTTGAGAAGGATGTTGTGGCAGTGAATTCTCTACTGAGTGCAATTTGCTCTGAGGAAAATCAAACATCAGAGGCTTGGGAGTTTTTTGAGAAGCATAAAGAGAAGATCCCTTTGGATGGGGAGTCATTCGCCATTTTATTGGAAGGTTGGGAGAAAGAAGGCAATGTGGAGAAAGCTAAGGTTACATTTGATGAAATGGTGAAAAGAGTCGGCTGGAATCCTGAAAATGTTTCATCTTATGATGCATTTTTGATAACGTTGGTTCGTGGGGGCCGATCTGAAGATGCAATCAAGGTTCTTCTAAAACTGAAGAAGAATCGTTGTTTGCCAGGTTTGAAATTTCTGTCCAATGCTCTTGATAGTCTCATTCAGCAAAATGATGCAAACCATGCGATTCTATTGTGGGATATTGTCGTGGGAAGTGGATTAGTCCCTAACTTGATCGTGTACAATGCCATAATCGGATTGCTTAGTGAGAATAGTAAGATCGATGACTCGTTTCGACTCTTGGATTCCATGGTTTTCCATGGTGCTTTTCCTAACTCCTTAACTTACAACCTGATCTTCAGTTCTTTGATTAAGAATAAGAAAGTTAAGGAAGTTAGTCAATTTTTCAGGGAGATGGTAAAGAATGAATGCCCTCCTACCCCTTCTAGTTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTACATGGATGAGAATCACATCGAACCTATGGATACAAGTGCAAATGCACTGCTCATTGGCCTCTGCAACTTGAATCGGTTAACAGAGGTAAGGAGATTCGCGGACGATATGATTGACCAGCGTATTGATATATTGGAATCAACTATGAAGTTGTTGAAGAATTGTTTCTATCAGCAGAGAGGAAATTTCAGAGAGAATTATGATGGTCTCTTGCGTAGGTGGAGAGCTTCCTCAATTTTGTAATTTTTACCCCCTTCAATCCATTCTTGGTGTTATCACAAGCACAATACATAGCAGCATTAGTGATCCACCTTTGAGGCAAGTATACCTCTTCTACAACAATCATTTTGACCTTAGCAATATCACAGACTTCACCTCATCAGTAGTTCTTTTGTCAACTATTTTGCATGCTATTGCTCCATCTTTGTCTTGTCTTCGTTCTTAAGGTTGGTTGCACTGAAGTTTTTCAGTACAAAACGAGGTAAAAGGTTTCAAACGGCATACCTCGGTGTTACTAACACTCCCATGTGCCACTCCCGCATGCCAATTAAGTTAGGCTCGGTAAGTTCAAATGAAAATATTGTCAAAGTTTTAGATACTTG

mRNA sequence

ATGGCTGAAAACCCTAAAATGGGTGTAAAAAACCCTTCCAAAATTCCCCCCAGTTCTTCGCCACGGTCGCGCAATTCCCCACGCTTCCCTTTGCATCTAGACCTGCCGGACATATCGCCGGCTGCTAAAACCATTTGCGAAGTCCTGGTCAGAGTCTCACGGAATGAAGTAGACGGCGCGCTGTTGGCGACGGGATTGGCTCCGTCGCCGGAGCTTGTCCAAGAAGTTTTGAGGGTTTCTTATAACTCTCCGTCGTCGGCGATCAAGTTCTTCCGATGGGCGAGACAGTTGGCGAAACAGTCGGCATACTCGTGGAATCTGATGATTGATTTGTTGGGCAAAAACGAACTTTTCGAAGAAATGTGGAACGGCATTCGGACGATGAGGCAAGAAAAGATTCTTTCGTTGCCAACTTTTGTGTCGGTTTTTGGGAGTTATTGTTCTGCTGGCAGGTCTAAAGAAGCGAGAATGACCTTTGAAGTGATGGATAGGTACGAAGTTGAGAAGGATGTTGTGGCAGTGAATTCTCTACTGAGTGCAATTTGCTCTGAGGAAAATCAAACATCAGAGGCTTGGGAGTTTTTTGAGAAGCATAAAGAGAAGATCCCTTTGGATGGGGAGTCATTCGCCATTTTATTGGAAGGTTGGGAGAAAGAAGGCAATGTGGAGAAAGCTAAGGTTACATTTGATGAAATGGTGAAAAGAGTCGGCTGGAATCCTGAAAATGTTTCATCTTATGATGCATTTTTGATAACGTTGGTTCGTGGGGGCCGATCTGAAGATGCAATCAAGGTTCTTCTAAAACTGAAGAAGAATCGTTGTTTGCCAGGTTTGAAATTTCTGTCCAATGCTCTTGATAGTCTCATTCAGCAAAATGATGCAAACCATGCGATTCTATTGTGGGATATTGTCGTGGGAAGTGGATTAGTCCCTAACTTGATCGTGTACAATGCCATAATCGGATTGCTTAGTGAGAATAGTAAGATCGATGACTCGTTTCGACTCTTGGATTCCATGGTTTTCCATGGTGCTTTTCCTAACTCCTTAACTTACAACCTGATCTTCAGTTCTTTGATTAAGAATAAGAAAGTTAAGGAAGTTAGTCAATTTTTCAGGGAGATGGTAAAGAATGAATGCCCTCCTACCCCTTCTAGTTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTACATGGATGAGAATCACATCGAACCTATGGATACAAGTGCAAATGCACTGCTCATTGGCCTCTGCAACTTGAATCGGTTAACAGAGGTAAGGAGATTCGCGGACGATATGATTGACCAGCGTATTGATATATTGGAATCAACTATGAAGTTGTTGAAGAATTGTTTCTATCAGCAGAGAGGAAATTTCAGAGAGAATTATGATGGTCTCTTGCGTAGGTGGAGAGCTTCCTCAATTTTGTAA

Coding sequence (CDS)

ATGGCTGAAAACCCTAAAATGGGTGTAAAAAACCCTTCCAAAATTCCCCCCAGTTCTTCGCCACGGTCGCGCAATTCCCCACGCTTCCCTTTGCATCTAGACCTGCCGGACATATCGCCGGCTGCTAAAACCATTTGCGAAGTCCTGGTCAGAGTCTCACGGAATGAAGTAGACGGCGCGCTGTTGGCGACGGGATTGGCTCCGTCGCCGGAGCTTGTCCAAGAAGTTTTGAGGGTTTCTTATAACTCTCCGTCGTCGGCGATCAAGTTCTTCCGATGGGCGAGACAGTTGGCGAAACAGTCGGCATACTCGTGGAATCTGATGATTGATTTGTTGGGCAAAAACGAACTTTTCGAAGAAATGTGGAACGGCATTCGGACGATGAGGCAAGAAAAGATTCTTTCGTTGCCAACTTTTGTGTCGGTTTTTGGGAGTTATTGTTCTGCTGGCAGGTCTAAAGAAGCGAGAATGACCTTTGAAGTGATGGATAGGTACGAAGTTGAGAAGGATGTTGTGGCAGTGAATTCTCTACTGAGTGCAATTTGCTCTGAGGAAAATCAAACATCAGAGGCTTGGGAGTTTTTTGAGAAGCATAAAGAGAAGATCCCTTTGGATGGGGAGTCATTCGCCATTTTATTGGAAGGTTGGGAGAAAGAAGGCAATGTGGAGAAAGCTAAGGTTACATTTGATGAAATGGTGAAAAGAGTCGGCTGGAATCCTGAAAATGTTTCATCTTATGATGCATTTTTGATAACGTTGGTTCGTGGGGGCCGATCTGAAGATGCAATCAAGGTTCTTCTAAAACTGAAGAAGAATCGTTGTTTGCCAGGTTTGAAATTTCTGTCCAATGCTCTTGATAGTCTCATTCAGCAAAATGATGCAAACCATGCGATTCTATTGTGGGATATTGTCGTGGGAAGTGGATTAGTCCCTAACTTGATCGTGTACAATGCCATAATCGGATTGCTTAGTGAGAATAGTAAGATCGATGACTCGTTTCGACTCTTGGATTCCATGGTTTTCCATGGTGCTTTTCCTAACTCCTTAACTTACAACCTGATCTTCAGTTCTTTGATTAAGAATAAGAAAGTTAAGGAAGTTAGTCAATTTTTCAGGGAGATGGTAAAGAATGAATGCCCTCCTACCCCTTCTAGTTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTACATGGATGAGAATCACATCGAACCTATGGATACAAGTGCAAATGCACTGCTCATTGGCCTCTGCAACTTGAATCGGTTAACAGAGGTAAGGAGATTCGCGGACGATATGATTGACCAGCGTATTGATATATTGGAATCAACTATGAAGTTGTTGAAGAATTGTTTCTATCAGCAGAGAGGAAATTTCAGAGAGAATTATGATGGTCTCTTGCGTAGGTGGAGAGCTTCCTCAATTTTGTAA

Protein sequence

MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRRWRASSIL*
BLAST of Csa4G242870 vs. Swiss-Prot
Match: PP129_ARATH (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 197.6 bits (501), Expect = 3.2e-49
Identity = 122/415 (29.40%), Postives = 221/415 (53.25%), Query Frame = 1

Query: 37  DISPAAKTICEVLVRVSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQ 96
           D++  AK I +VL+   +  +D AL  +GL  S E+V++VL    N+     +FF+W+ +
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 97  LA--KQSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKE 156
               + S  ++++MI+   K   ++ MW+ I  MR++K+L++ TF  V   Y  A +  E
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 157 ARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLE 216
           A   F VM++Y++  ++VA N LLSA+C  +N   +A E FE  +++   D ++++ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKN-VRKAQEVFENMRDRFTPDSKTYSILLE 246

Query: 217 GWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRC 276
           GW KE N+ KA+  F EM+   G +P+ + +Y   +  L + GR ++A+ ++  +  + C
Sbjct: 247 GWGKEPNLPKAREVFREMID-AGCHPD-IVTYSIMVDILCKAGRVDEALGIVRSMDPSIC 306

Query: 277 LPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFR 336
            P     S  + +   +N    A+  +  +  SG+  ++ V+N++IG   + +++ + +R
Sbjct: 307 KPTTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYR 366

Query: 337 LLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLF 396
           +L  M   G  PNS + N+I   LI+  +  E    FR+M+K  C P   +    I M  
Sbjct: 367 VLKEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIK-VCEPDADTYTMVIKMFC 426

Query: 397 DGYDPETAIDIWNYMDENHIEPMDTSANALLIGLCNLNRLTEVRRFADDMIDQRI 450
           +  + ETA  +W YM +  + P   + + L+ GLC      +     ++MI+  I
Sbjct: 427 EKKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGI 477

BLAST of Csa4G242870 vs. Swiss-Prot
Match: PP112_ARATH (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.4e-41
Identity = 112/413 (27.12%), Postives = 209/413 (50.61%), Query Frame = 1

Query: 37  DISPAAKTICEVLVRVSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQ 96
           D S  A+ IC++L + + ++V+  L    +  SP L++EVL+   N+   A+  F+WA  
Sbjct: 61  DASQDAERICKILTKFTDSKVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFKWAEN 120

Query: 97  LA--KQSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKE 156
               K +  ++N +I+ LGK + F+ +W+ +  M+ +K+LS  TF  +   Y  A + KE
Sbjct: 121 QKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKVKE 180

Query: 157 ARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEK-IPLDGESFAILL 216
           A   F  M+ +  + +    N +L  + S+     +A + F+K K+K    D +S+ ILL
Sbjct: 181 AIGAFHKMEEFGFKMESSDFNRMLDTL-SKSRNVGDAQKVFDKMKKKRFEPDIKSYTILL 240

Query: 217 EGWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNR 276
           EGW +E N+ +      EM K  G+ P+ V +Y   +    +  + E+AI+   ++++  
Sbjct: 241 EGWGQELNLLRVDEVNREM-KDEGFEPD-VVAYGIIINAHCKAKKYEEAIRFFNEMEQRN 300

Query: 277 CLPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSF 336
           C P      + ++ L  +   N A+  ++    SG       YNA++G    + +++D++
Sbjct: 301 CKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRMEDAY 360

Query: 337 RLLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITML 396
           + +D M   G  PN+ TY++I   LI+ ++ KE  + ++ M    C PT S+    + M 
Sbjct: 361 KTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEPTVSTYEIMVRMF 420

Query: 397 FDGYDPETAIDIWNYMDENHIEPMDTSANALLIGLCNLNRLTEVRRFADDMID 447
            +    + AI IW+ M    + P     ++L+  LC+ N+L E   + ++M+D
Sbjct: 421 CNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYFNEMLD 467

BLAST of Csa4G242870 vs. Swiss-Prot
Match: PPR78_ARATH (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.7e-34
Identity = 117/450 (26.00%), Postives = 215/450 (47.78%), Query Frame = 1

Query: 20  SPRSRNSPRFPLHLDLPDISPAAKTICEVLV--RVSRNEVDGALLATGLAPSPELVQEVL 79
           +P+S++   F   L  P        I  VL   R  +++++  L+A     S  LV++VL
Sbjct: 16  TPKSQSFRIFSTLLHDPPSPDLVNEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVL 75

Query: 80  RVSYNSPSSAIKFFRWARQLAK--QSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKI-- 139
           +   N    A +FF WAR++     S  S+++++++LG ++ F  +W+ +   R+     
Sbjct: 76  KRCKNLGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFE 135

Query: 140 LSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWE 199
           +S   F  VF +Y  A    EA   F  M  + ++  V  ++ LL ++C +++  + A E
Sbjct: 136 ISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKH-VNHAQE 195

Query: 200 FFEKHKE-KIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLIT 259
           FF K K   I    ++++IL+ GW +  +   A+  FDEM++R      ++ +Y+A L  
Sbjct: 196 FFGKAKGFGIVPSAKTYSILVRGWARIRDASGARKVFDEMLERNC--VVDLLAYNALLDA 255

Query: 260 LVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPN 319
           L + G  +   K+  ++      P     +  + +     D + A  + D +    LVPN
Sbjct: 256 LCKSGDVDGGYKMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPN 315

Query: 320 LIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFR 379
           +  +N II  L +N K+DD++ LLD M+  GA P++ TYN I +    + +V   ++   
Sbjct: 316 VYTFNHIIKTLCKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLLS 375

Query: 380 EMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTSANALLIGLC-NL 439
            M + +C P   +    + +L      + A +IW  M E    P   +   ++ GL    
Sbjct: 376 RMDRTKCLPDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIHGLVRKK 435

Query: 440 NRLTEVRRFADDMIDQRIDILESTMKLLKN 462
            +L E  R+ + MID+ I    +T+++L+N
Sbjct: 436 GKLEEACRYFEMMIDEGIPPYSTTVEMLRN 462

BLAST of Csa4G242870 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.4e-28
Identity = 107/427 (25.06%), Postives = 201/427 (47.07%), Query Frame = 1

Query: 45  ICEVLVR--VSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAK--Q 104
           +C+ L +   S  +V   L    +  +  LV +VLR   N  + A  FF WA        
Sbjct: 105 VCDFLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVH 164

Query: 105 SAYSWNLMIDLLGKNELFEEMWNGIRTM---RQEKILSLPTFVSVFGSYCSAGRSKEARM 164
           S +++N M+D+LGK   F+ MW  +  M    + K+++L T   V      +G+  +A  
Sbjct: 165 SGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVD 224

Query: 165 TFEVMDR-YEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGW 224
            F  M++ Y V+ D +A+NSL+ A+  +EN    A E F K  + I  D  +F IL+ G+
Sbjct: 225 AFLEMEKSYGVKTDTIAMNSLMDALV-KENSIEHAHEVFLKLFDTIKPDARTFNILIHGF 284

Query: 225 EKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLP 284
            K    + A+   D ++K   + P+ V +Y +F+    + G      ++L ++++N C P
Sbjct: 285 CKARKFDDARAMMD-LMKVTEFTPD-VVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNP 344

Query: 285 GLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLL 344
            +   +  + SL +      A+ +++ +   G VP+   Y+++I +LS+  +  D+  + 
Sbjct: 345 NVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFKDAAEIF 404

Query: 345 DSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNE---CPPTPSSCAAAITML 404
           + M   G   + L YN + S+ + + + +   +  + M   E   C P   + A  + M 
Sbjct: 405 EDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMC 464

Query: 405 FDGYDPETAIDIWNYMDENHIEPMDTSANALLI-GLCNLNRLTEVRRFADDMIDQRIDIL 460
                 +    + ++M +N +  +D S   LLI GLC   ++ E   F ++ + + +   
Sbjct: 465 CHKKKMKLLGILLHHMVKNDVS-IDVSTYILLIRGLCMSGKVEEACLFFEEAVRKGMVPR 524

BLAST of Csa4G242870 vs. Swiss-Prot
Match: PP117_ARATH (Pentatricopeptide repeat-containing protein At1g73400, mitochondrial OS=Arabidopsis thaliana GN=At1g73400 PE=2 SV=2)

HSP 1 Score: 116.7 bits (291), Expect = 7.1e-25
Identity = 89/409 (21.76%), Postives = 193/409 (47.19%), Query Frame = 1

Query: 55  NEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWA--RQLAKQSAYSWNLMIDLL 114
           ++++ AL  + +  +  +V ++L+       +A +FF WA  ++       ++N MID+L
Sbjct: 110 DDMEKALDESSVDLTTPVVCKILQRLQYEEKTAFRFFTWAGHQEHYSHEPIAYNEMIDIL 169

Query: 115 G----KNELFEEMWNGIRTMRQEK--ILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYE 174
                KN+ F  + + +  M++    ++ +   + +   YC    +   +       R +
Sbjct: 170 SSTKYKNKQFRIVIDMLDYMKRNNKTVVLVDVLLEILRKYCERYLTHVQKFAKRKRIRVK 229

Query: 175 VEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAK 234
            + ++ A N LL A+C +     E      + + ++  D  +F +L  GW +  + +KA 
Sbjct: 230 TQPEINAFNMLLDALC-KCGLVKEGEALLRRMRHRVKPDANTFNVLFFGWCRVRDPKKAM 289

Query: 235 VTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVL-LKLKKNRCL--PGLKFLSN 294
              +EM++  G  PEN + Y A + T  + G  ++A  +    + K   +  P  K  + 
Sbjct: 290 KLLEEMIE-AGHKPENFT-YCAAIDTFCQAGMVDEAADLFDFMITKGSAVSAPTAKTFAL 349

Query: 295 ALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHG 354
            + +L + + A     L   ++ +G +P++  Y  +I  +    K+D++++ LD M   G
Sbjct: 350 MIVALAKNDKAEECFELIGRMISTGCLPDVSTYKDVIEGMCMAEKVDEAYKFLDEMSNKG 409

Query: 355 AFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAI 414
             P+ +TYN     L +N+K  E  + +  MV++ C P+  +    I+M F+  DP+ A 
Sbjct: 410 YPPDIVTYNCFLRVLCENRKTDEALKLYGRMVESRCAPSVQTYNMLISMFFEMDDPDGAF 469

Query: 415 DIWNYMDENH-IEPMDTSANALLIGLCNLNRLTEVRRFADDMIDQRIDI 452
           + W  MD+   ++ ++T   A++ GL + +R  E     ++++++ + +
Sbjct: 470 NTWTEMDKRDCVQDVETYC-AMINGLFDCHRAKEACFLLEEVVNKGLKL 514

BLAST of Csa4G242870 vs. TrEMBL
Match: A0A0R0FP69_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_16G108300 PE=4 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 9.3e-165
Identity = 282/485 (58.14%), Postives = 373/485 (76.91%), Query Frame = 1

Query: 1   MAENP--KMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVD 60
           +AENP     +K+PSK P  + P   N  +FP HLD P++S  A+ +C++L R S  +++
Sbjct: 4   LAENPGRSRDLKHPSKNP--TKPPQPN--QFPSHLDAPNVSSTARALCDILTRSSPQDIE 63

Query: 61  GALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELF 120
            AL ++G+ P  E   EVLR+SYN PSSA+KFFRWA +  K   ++WNLM+DLLGKN+LF
Sbjct: 64  SALSSSGIVPEEECTNEVLRLSYNYPSSAVKFFRWAGRGKKHPVHTWNLMVDLLGKNQLF 123

Query: 121 EEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLL 180
           E MW+ +R+M+QE+ LSL TF SVF SYC+A R  EA M+F+VMDRY V++DVVAVNSLL
Sbjct: 124 EPMWDAVRSMKQEQKLSLSTFASVFQSYCTAARFNEAVMSFDVMDRYGVKQDVVAVNSLL 183

Query: 181 SAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGW 240
           SAICSE+NQTS   EFFE  K K+P DG++FAILLEGWEKEGN  KAK TF +MV  +GW
Sbjct: 184 SAICSEDNQTSFGLEFFEGIKAKVPPDGDTFAILLEGWEKEGNAAKAKTTFGDMVAHIGW 243

Query: 241 NPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAI 300
           N +NV++YDAFL+TL+R G  +D ++ L  +K + C PGLKF + ALD L++QNDA+HA+
Sbjct: 244 NKDNVAAYDAFLMTLLRAGLMDDVVRFLQVMKDHDCFPGLKFFTTALDFLVKQNDADHAV 303

Query: 301 LLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSL 360
            +WD++V   LVPNLI+YNA+IGLL  N+ +D +FRLLD M FHGAFP+SLTYN+IF  L
Sbjct: 304 PVWDVMVSGELVPNLIMYNAMIGLLCNNAAVDHAFRLLDEMAFHGAFPDSLTYNMIFECL 363

Query: 361 IKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMD 420
           +KNKK +E  +FF EMVKNE PPT S+CAAAI MLFD  DPE A +IW+Y+ EN ++P+D
Sbjct: 364 VKNKKARETERFFAEMVKNEWPPTGSNCAAAIAMLFDCDDPEAAHEIWSYVVENRVKPLD 423

Query: 421 TSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLL 480
            SANALLIGLCN++R TEV+RFA+D++D+RI+I +STM +LK+ FY++  + R+ YD L 
Sbjct: 424 ESANALLIGLCNMSRFTEVKRFAEDILDRRINIYQSTMSILKDAFYKEGRSARDRYDSLY 483

Query: 481 RRWRA 484
           RRW+A
Sbjct: 484 RRWKA 484

BLAST of Csa4G242870 vs. TrEMBL
Match: K7MGA0_SOYBN (Uncharacterized protein OS=Glycine max PE=4 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 9.3e-165
Identity = 282/485 (58.14%), Postives = 373/485 (76.91%), Query Frame = 1

Query: 1   MAENP--KMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVD 60
           +AENP     +K+PSK P  + P   N  +FP HLD P++S  A+ +C++L R S  +++
Sbjct: 8   LAENPGRSRDLKHPSKNP--TKPPQPN--QFPSHLDAPNVSSTARALCDILTRSSPQDIE 67

Query: 61  GALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELF 120
            AL ++G+ P  E   EVLR+SYN PSSA+KFFRWA +  K   ++WNLM+DLLGKN+LF
Sbjct: 68  SALSSSGIVPEEECTNEVLRLSYNYPSSAVKFFRWAGRGKKHPVHTWNLMVDLLGKNQLF 127

Query: 121 EEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLL 180
           E MW+ +R+M+QE+ LSL TF SVF SYC+A R  EA M+F+VMDRY V++DVVAVNSLL
Sbjct: 128 EPMWDAVRSMKQEQKLSLSTFASVFQSYCTAARFNEAVMSFDVMDRYGVKQDVVAVNSLL 187

Query: 181 SAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGW 240
           SAICSE+NQTS   EFFE  K K+P DG++FAILLEGWEKEGN  KAK TF +MV  +GW
Sbjct: 188 SAICSEDNQTSFGLEFFEGIKAKVPPDGDTFAILLEGWEKEGNAAKAKTTFGDMVAHIGW 247

Query: 241 NPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAI 300
           N +NV++YDAFL+TL+R G  +D ++ L  +K + C PGLKF + ALD L++QNDA+HA+
Sbjct: 248 NKDNVAAYDAFLMTLLRAGLMDDVVRFLQVMKDHDCFPGLKFFTTALDFLVKQNDADHAV 307

Query: 301 LLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSL 360
            +WD++V   LVPNLI+YNA+IGLL  N+ +D +FRLLD M FHGAFP+SLTYN+IF  L
Sbjct: 308 PVWDVMVSGELVPNLIMYNAMIGLLCNNAAVDHAFRLLDEMAFHGAFPDSLTYNMIFECL 367

Query: 361 IKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMD 420
           +KNKK +E  +FF EMVKNE PPT S+CAAAI MLFD  DPE A +IW+Y+ EN ++P+D
Sbjct: 368 VKNKKARETERFFAEMVKNEWPPTGSNCAAAIAMLFDCDDPEAAHEIWSYVVENRVKPLD 427

Query: 421 TSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLL 480
            SANALLIGLCN++R TEV+RFA+D++D+RI+I +STM +LK+ FY++  + R+ YD L 
Sbjct: 428 ESANALLIGLCNMSRFTEVKRFAEDILDRRINIYQSTMSILKDAFYKEGRSARDRYDSLY 487

Query: 481 RRWRA 484
           RRW+A
Sbjct: 488 RRWKA 488

BLAST of Csa4G242870 vs. TrEMBL
Match: A0A061G6A0_THECC (Tetratricopeptide repeat-like superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_016711 PE=4 SV=1)

HSP 1 Score: 586.3 bits (1510), Expect = 3.5e-164
Identity = 285/488 (58.40%), Postives = 371/488 (76.02%), Query Frame = 1

Query: 1   MAENPK--MGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVD 60
           +AEN    +  K P++   +S P +    RF  HLD PDISP A+ +C++L R S ++V+
Sbjct: 4   VAENHSTTLPTKPPNQPHSNSIPHNLQPQRFRTHLDAPDISPTARILCDLLSRASPHDVE 63

Query: 61  GALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELF 120
            AL  TG+ P+ E++QEVL  SYN PSSAIKFFRWA +  K SAY+WNLM+DLLGKN++F
Sbjct: 64  TALSCTGITPTAEVIQEVLSFSYNQPSSAIKFFRWAGRYIKPSAYAWNLMVDLLGKNQIF 123

Query: 121 EEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLL 180
           E MW+ IR+M+QE +LS+ TFVSVFGSYC+  R  EA M+F+VMD+Y V++DVVAVNSLL
Sbjct: 124 EPMWDAIRSMKQESLLSVATFVSVFGSYCTVHRFSEATMSFDVMDKYGVQQDVVAVNSLL 183

Query: 181 SAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGW 240
           SAIC ++NQ S A EFF+  K+KIP DG++FAILLEGWEKEGNV KAK TF EMV RVGW
Sbjct: 184 SAICRQDNQMSVAIEFFDGIKKKIPPDGDTFAILLEGWEKEGNVAKAKNTFGEMVNRVGW 243

Query: 241 NPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAI 300
           +P   S+YDAFL TLV G ++++A+K L  +K + CLPGL+F SNALD L++QND+ H I
Sbjct: 244 SPMATSAYDAFLTTLVHGAQADEAVKFLQVMKGHNCLPGLRFFSNALDILVKQNDSTHII 303

Query: 301 LLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSL 360
            LWD +VG GLVPNLI+YNA+IGL+  N+ + ++FR LD MVFHGAFP+SLTYN+IF  L
Sbjct: 304 PLWDTMVGGGLVPNLIMYNAVIGLVCNNNDMHNAFRFLDEMVFHGAFPDSLTYNMIFQCL 363

Query: 361 IKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMD 420
           ++NK+V EV +FF EM+KNE PPT S+C  AI ML +  DPE AIDIWNYM EN + P+ 
Sbjct: 364 VRNKRVHEVGKFFVEMIKNEWPPTSSNCVMAIKMLLENDDPEMAIDIWNYMVENCVSPLV 423

Query: 421 TSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLL 480
            SAN LLIGL NL RL+ V RFA++M+D+RI++ ESTM+ LKN F+++    R+ YD L 
Sbjct: 424 ESANELLIGLSNLGRLSWVERFAEEMLDKRINLFESTMEKLKNAFFKEGRTLRDKYDSLS 483

Query: 481 RRWRASSI 487
           RRW+ + +
Sbjct: 484 RRWKVAQM 491

BLAST of Csa4G242870 vs. TrEMBL
Match: A0A0B2QVZ1_GLYSO (Pentatricopeptide repeat-containing protein, mitochondrial OS=Glycine soja GN=glysoja_046966 PE=4 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 3.0e-163
Identity = 281/485 (57.94%), Postives = 372/485 (76.70%), Query Frame = 1

Query: 1   MAENP--KMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVD 60
           +AENP     +K+PSK P  + P   N  +FP HLD P++S  A+ +C++L R S  +++
Sbjct: 4   LAENPGRSRDLKHPSKNP--TKPPQPN--QFPSHLDAPNVSSTARALCDILTRSSPQDIE 63

Query: 61  GALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELF 120
            AL ++G+ P  E   EVLR+SYN PSSA+KFFR A +  K   ++WNLM+DLLGKN+LF
Sbjct: 64  SALSSSGIVPEEECTNEVLRLSYNYPSSAVKFFRLAGRGKKHPVHTWNLMVDLLGKNQLF 123

Query: 121 EEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLL 180
           E MW+ +R+M+QE+ LSL TF SVF SYC+A R  EA M+F+VMDRY V++DVVAVNSLL
Sbjct: 124 EPMWDAVRSMKQEQKLSLSTFASVFQSYCTAARFNEAVMSFDVMDRYGVKQDVVAVNSLL 183

Query: 181 SAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGW 240
           SAICSE+NQTS   EFFE  K K+P DG++FAILLEGWEKEGN  KAK TF +MV  +GW
Sbjct: 184 SAICSEDNQTSFGLEFFEGIKAKVPPDGDTFAILLEGWEKEGNAAKAKTTFGDMVAHIGW 243

Query: 241 NPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAI 300
           N +NV++YDAFL+TL+R G  +D ++ L  +K + C PGLKF + ALD L++QNDA+HA+
Sbjct: 244 NKDNVAAYDAFLMTLLRAGLMDDVVRFLQVMKDHDCFPGLKFFTTALDFLVKQNDADHAV 303

Query: 301 LLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSL 360
            +WD++V   LVPNLI+YNA+IGLL  N+ +D +FRLLD M FHGAFP+SLTYN+IF  L
Sbjct: 304 PVWDVMVSGELVPNLIMYNAMIGLLCNNAAVDHAFRLLDEMAFHGAFPDSLTYNMIFECL 363

Query: 361 IKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMD 420
           +KNKK +E  +FF EMVKNE PPT S+CAAAI MLFD  DPE A +IW+Y+ EN ++P+D
Sbjct: 364 VKNKKARETERFFAEMVKNEWPPTGSNCAAAIAMLFDCDDPEAAHEIWSYVVENRVKPLD 423

Query: 421 TSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLL 480
            SANALLIGLCN++R TEV+RFA+D++D+RI+I +STM +LK+ FY++  + R+ YD L 
Sbjct: 424 ESANALLIGLCNMSRFTEVKRFAEDILDRRINIYQSTMSILKDAFYKEGRSARDRYDSLY 483

Query: 481 RRWRA 484
           RRW+A
Sbjct: 484 RRWKA 484

BLAST of Csa4G242870 vs. TrEMBL
Match: A0A0D2V822_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G184800 PE=4 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.3e-161
Identity = 282/483 (58.39%), Postives = 361/483 (74.74%), Query Frame = 1

Query: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60
           +AEN    +    +  P S P +    RFP H D PDISP  + +C++L R+S ++++ A
Sbjct: 4   IAENHSTKLPRKPQNQPRSIPNNLQPQRFPTHHDAPDISPTVRILCDLLTRISPHDIESA 63

Query: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120
           L +TG+ P+ + +Q+VL  SYN P SAIKFFRWA    K SAY+WNL++DLLGKN+ FE 
Sbjct: 64  LSSTGVIPTSDDIQQVLGFSYNQPLSAIKFFRWAGCFVKPSAYAWNLIVDLLGKNQSFEP 123

Query: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180
           MW+ +R+M+QE +LS  TF SVF SYC A R  EA M+F+VMDRY VE+DVVAVNSLLSA
Sbjct: 124 MWDAMRSMKQEGLLSTTTFGSVFSSYCIAHRFSEATMSFDVMDRYGVEQDVVAVNSLLSA 183

Query: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240
           IC E+NQ S A EFF+K K KIP DG++FAILLEGWEKEGN+ KAK TF EMV RVGW+P
Sbjct: 184 ICHEDNQMSVAIEFFDKIKMKIPPDGDTFAILLEGWEKEGNLAKAKNTFGEMVVRVGWSP 243

Query: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300
           +++S+YDAFL TLVRG + ++A+K L  +KKN CLPGLKF SN LD L++QND+   I L
Sbjct: 244 KHISAYDAFLTTLVRGSQVDEALKFLQVMKKNDCLPGLKFFSNTLDILVKQNDSAQIIPL 303

Query: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360
           WD +VG GLVPNLI+YNA+I +L  N  + D+FR LD M FHGAFP+SLTYN+IF  L++
Sbjct: 304 WDTMVGGGLVPNLIMYNALISVLCNNDDVHDAFRFLDEMTFHGAFPDSLTYNMIFHCLVR 363

Query: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420
           NK V+EV +FF EM KNE PPT S+ AAAI ML +  DPE AI++WN+M ENH+  +D S
Sbjct: 364 NKMVREVGKFFVEMTKNEWPPTSSNYAAAIKMLLENDDPEMAINMWNHMVENHVSTLDES 423

Query: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480
           AN LLIGLCNL RL EV+RF + M+D+RI I +STM+ LKN FY++  +FR+ YD L R 
Sbjct: 424 ANELLIGLCNLGRLVEVKRFVETMLDKRISIYDSTMEKLKNPFYKKGRSFRDKYDSLSRE 483

Query: 481 WRA 484
           W+A
Sbjct: 484 WKA 486

BLAST of Csa4G242870 vs. TAIR10
Match: AT1G77360.1 (AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 197.6 bits (501), Expect = 1.8e-50
Identity = 122/415 (29.40%), Postives = 221/415 (53.25%), Query Frame = 1

Query: 37  DISPAAKTICEVLVRVSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQ 96
           D++  AK I +VL+   +  +D AL  +GL  S E+V++VL    N+     +FF+W+ +
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 97  LA--KQSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKE 156
               + S  ++++MI+   K   ++ MW+ I  MR++K+L++ TF  V   Y  A +  E
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 157 ARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLE 216
           A   F VM++Y++  ++VA N LLSA+C  +N   +A E FE  +++   D ++++ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKN-VRKAQEVFENMRDRFTPDSKTYSILLE 246

Query: 217 GWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRC 276
           GW KE N+ KA+  F EM+   G +P+ + +Y   +  L + GR ++A+ ++  +  + C
Sbjct: 247 GWGKEPNLPKAREVFREMID-AGCHPD-IVTYSIMVDILCKAGRVDEALGIVRSMDPSIC 306

Query: 277 LPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFR 336
            P     S  + +   +N    A+  +  +  SG+  ++ V+N++IG   + +++ + +R
Sbjct: 307 KPTTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYR 366

Query: 337 LLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLF 396
           +L  M   G  PNS + N+I   LI+  +  E    FR+M+K  C P   +    I M  
Sbjct: 367 VLKEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIK-VCEPDADTYTMVIKMFC 426

Query: 397 DGYDPETAIDIWNYMDENHIEPMDTSANALLIGLCNLNRLTEVRRFADDMIDQRI 450
           +  + ETA  +W YM +  + P   + + L+ GLC      +     ++MI+  I
Sbjct: 427 EKKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGI 477

BLAST of Csa4G242870 vs. TAIR10
Match: AT1G71060.1 (AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 172.2 bits (435), Expect = 8.0e-43
Identity = 112/413 (27.12%), Postives = 209/413 (50.61%), Query Frame = 1

Query: 37  DISPAAKTICEVLVRVSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQ 96
           D S  A+ IC++L + + ++V+  L    +  SP L++EVL+   N+   A+  F+WA  
Sbjct: 61  DASQDAERICKILTKFTDSKVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFKWAEN 120

Query: 97  LA--KQSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKE 156
               K +  ++N +I+ LGK + F+ +W+ +  M+ +K+LS  TF  +   Y  A + KE
Sbjct: 121 QKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKVKE 180

Query: 157 ARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEK-IPLDGESFAILL 216
           A   F  M+ +  + +    N +L  + S+     +A + F+K K+K    D +S+ ILL
Sbjct: 181 AIGAFHKMEEFGFKMESSDFNRMLDTL-SKSRNVGDAQKVFDKMKKKRFEPDIKSYTILL 240

Query: 217 EGWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNR 276
           EGW +E N+ +      EM K  G+ P+ V +Y   +    +  + E+AI+   ++++  
Sbjct: 241 EGWGQELNLLRVDEVNREM-KDEGFEPD-VVAYGIIINAHCKAKKYEEAIRFFNEMEQRN 300

Query: 277 CLPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSF 336
           C P      + ++ L  +   N A+  ++    SG       YNA++G    + +++D++
Sbjct: 301 CKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRMEDAY 360

Query: 337 RLLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITML 396
           + +D M   G  PN+ TY++I   LI+ ++ KE  + ++ M    C PT S+    + M 
Sbjct: 361 KTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEPTVSTYEIMVRMF 420

Query: 397 FDGYDPETAIDIWNYMDENHIEPMDTSANALLIGLCNLNRLTEVRRFADDMID 447
            +    + AI IW+ M    + P     ++L+  LC+ N+L E   + ++M+D
Sbjct: 421 CNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYFNEMLD 467

BLAST of Csa4G242870 vs. TAIR10
Match: AT1G52640.1 (AT1G52640.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 148.7 bits (374), Expect = 9.5e-36
Identity = 117/450 (26.00%), Postives = 215/450 (47.78%), Query Frame = 1

Query: 20  SPRSRNSPRFPLHLDLPDISPAAKTICEVLV--RVSRNEVDGALLATGLAPSPELVQEVL 79
           +P+S++   F   L  P        I  VL   R  +++++  L+A     S  LV++VL
Sbjct: 16  TPKSQSFRIFSTLLHDPPSPDLVNEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVL 75

Query: 80  RVSYNSPSSAIKFFRWARQLAK--QSAYSWNLMIDLLGKNELFEEMWNGIRTMRQEKI-- 139
           +   N    A +FF WAR++     S  S+++++++LG ++ F  +W+ +   R+     
Sbjct: 76  KRCKNLGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFE 135

Query: 140 LSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSAICSEENQTSEAWE 199
           +S   F  VF +Y  A    EA   F  M  + ++  V  ++ LL ++C +++  + A E
Sbjct: 136 ISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKH-VNHAQE 195

Query: 200 FFEKHKE-KIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLIT 259
           FF K K   I    ++++IL+ GW +  +   A+  FDEM++R      ++ +Y+A L  
Sbjct: 196 FFGKAKGFGIVPSAKTYSILVRGWARIRDASGARKVFDEMLERNC--VVDLLAYNALLDA 255

Query: 260 LVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPN 319
           L + G  +   K+  ++      P     +  + +     D + A  + D +    LVPN
Sbjct: 256 LCKSGDVDGGYKMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPN 315

Query: 320 LIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFR 379
           +  +N II  L +N K+DD++ LLD M+  GA P++ TYN I +    + +V   ++   
Sbjct: 316 VYTFNHIIKTLCKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLLS 375

Query: 380 EMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTSANALLIGLC-NL 439
            M + +C P   +    + +L      + A +IW  M E    P   +   ++ GL    
Sbjct: 376 RMDRTKCLPDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIHGLVRKK 435

Query: 440 NRLTEVRRFADDMIDQRIDILESTMKLLKN 462
            +L E  R+ + MID+ I    +T+++L+N
Sbjct: 436 GKLEEACRYFEMMIDEGIPPYSTTVEMLRN 462

BLAST of Csa4G242870 vs. TAIR10
Match: AT3G22670.1 (AT3G22670.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 129.0 bits (323), Expect = 7.8e-30
Identity = 107/427 (25.06%), Postives = 201/427 (47.07%), Query Frame = 1

Query: 45  ICEVLVR--VSRNEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAK--Q 104
           +C+ L +   S  +V   L    +  +  LV +VLR   N  + A  FF WA        
Sbjct: 105 VCDFLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVH 164

Query: 105 SAYSWNLMIDLLGKNELFEEMWNGIRTM---RQEKILSLPTFVSVFGSYCSAGRSKEARM 164
           S +++N M+D+LGK   F+ MW  +  M    + K+++L T   V      +G+  +A  
Sbjct: 165 SGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVD 224

Query: 165 TFEVMDR-YEVEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGW 224
            F  M++ Y V+ D +A+NSL+ A+  +EN    A E F K  + I  D  +F IL+ G+
Sbjct: 225 AFLEMEKSYGVKTDTIAMNSLMDALV-KENSIEHAHEVFLKLFDTIKPDARTFNILIHGF 284

Query: 225 EKEGNVEKAKVTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLP 284
            K    + A+   D ++K   + P+ V +Y +F+    + G      ++L ++++N C P
Sbjct: 285 CKARKFDDARAMMD-LMKVTEFTPD-VVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNP 344

Query: 285 GLKFLSNALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLL 344
            +   +  + SL +      A+ +++ +   G VP+   Y+++I +LS+  +  D+  + 
Sbjct: 345 NVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFKDAAEIF 404

Query: 345 DSMVFHGAFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNE---CPPTPSSCAAAITML 404
           + M   G   + L YN + S+ + + + +   +  + M   E   C P   + A  + M 
Sbjct: 405 EDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMC 464

Query: 405 FDGYDPETAIDIWNYMDENHIEPMDTSANALLI-GLCNLNRLTEVRRFADDMIDQRIDIL 460
                 +    + ++M +N +  +D S   LLI GLC   ++ E   F ++ + + +   
Sbjct: 465 CHKKKMKLLGILLHHMVKNDVS-IDVSTYILLIRGLCMSGKVEEACLFFEEAVRKGMVPR 524

BLAST of Csa4G242870 vs. TAIR10
Match: AT1G73400.1 (AT1G73400.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 116.7 bits (291), Expect = 4.0e-26
Identity = 89/409 (21.76%), Postives = 193/409 (47.19%), Query Frame = 1

Query: 55  NEVDGALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWA--RQLAKQSAYSWNLMIDLL 114
           ++++ AL  + +  +  +V ++L+       +A +FF WA  ++       ++N MID+L
Sbjct: 110 DDMEKALDESSVDLTTPVVCKILQRLQYEEKTAFRFFTWAGHQEHYSHEPIAYNEMIDIL 169

Query: 115 G----KNELFEEMWNGIRTMRQEK--ILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYE 174
                KN+ F  + + +  M++    ++ +   + +   YC    +   +       R +
Sbjct: 170 SSTKYKNKQFRIVIDMLDYMKRNNKTVVLVDVLLEILRKYCERYLTHVQKFAKRKRIRVK 229

Query: 175 VEKDVVAVNSLLSAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAK 234
            + ++ A N LL A+C +     E      + + ++  D  +F +L  GW +  + +KA 
Sbjct: 230 TQPEINAFNMLLDALC-KCGLVKEGEALLRRMRHRVKPDANTFNVLFFGWCRVRDPKKAM 289

Query: 235 VTFDEMVKRVGWNPENVSSYDAFLITLVRGGRSEDAIKVL-LKLKKNRCL--PGLKFLSN 294
              +EM++  G  PEN + Y A + T  + G  ++A  +    + K   +  P  K  + 
Sbjct: 290 KLLEEMIE-AGHKPENFT-YCAAIDTFCQAGMVDEAADLFDFMITKGSAVSAPTAKTFAL 349

Query: 295 ALDSLIQQNDANHAILLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHG 354
            + +L + + A     L   ++ +G +P++  Y  +I  +    K+D++++ LD M   G
Sbjct: 350 MIVALAKNDKAEECFELIGRMISTGCLPDVSTYKDVIEGMCMAEKVDEAYKFLDEMSNKG 409

Query: 355 AFPNSLTYNLIFSSLIKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAI 414
             P+ +TYN     L +N+K  E  + +  MV++ C P+  +    I+M F+  DP+ A 
Sbjct: 410 YPPDIVTYNCFLRVLCENRKTDEALKLYGRMVESRCAPSVQTYNMLISMFFEMDDPDGAF 469

Query: 415 DIWNYMDENH-IEPMDTSANALLIGLCNLNRLTEVRRFADDMIDQRIDI 452
           + W  MD+   ++ ++T   A++ GL + +R  E     ++++++ + +
Sbjct: 470 NTWTEMDKRDCVQDVETYC-AMINGLFDCHRAKEACFLLEEVVNKGLKL 514

BLAST of Csa4G242870 vs. NCBI nr
Match: gi|449466215|ref|XP_004150822.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like isoform X1 [Cucumis sativus])

HSP 1 Score: 973.4 bits (2515), Expect = 1.5e-280
Identity = 487/487 (100.00%), Postives = 487/487 (100.00%), Query Frame = 1

Query: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60
           MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA
Sbjct: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60

Query: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120
           LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE
Sbjct: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180
           MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180

Query: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240
           ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP
Sbjct: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240

Query: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300
           ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360
           WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360

Query: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420
           NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420

Query: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480
           ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR
Sbjct: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480

Query: 481 WRASSIL 488
           WRASSIL
Sbjct: 481 WRASSIL 487

BLAST of Csa4G242870 vs. NCBI nr
Match: gi|659087034|ref|XP_008444241.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 927.5 bits (2396), Expect = 9.3e-267
Identity = 460/487 (94.46%), Postives = 475/487 (97.54%), Query Frame = 1

Query: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60
           MAENPKMGV+NPSKIPPSS+PRS NSPRFP HLDLPDISPAAKTICEVL++V RNEVD A
Sbjct: 1   MAENPKMGVRNPSKIPPSSAPRSPNSPRFPSHLDLPDISPAAKTICEVLIKVPRNEVDAA 60

Query: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120
           L ATGLAPSPELVQEVLRVSYN PSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE
Sbjct: 61  LSATGLAPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180
           MWNGIRTMRQEKILSLPTFVSVFGSYCSAGR KEA M+FEVMDRYEVEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRFKEATMSFEVMDRYEVEKDVVAVNSLLSA 180

Query: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240
           ICSEENQTS+AWEFFEKHKEKIPLDG+SFAILLEGWEKEGNVEKA+VTFDEMVKR+GWNP
Sbjct: 181 ICSEENQTSKAWEFFEKHKEKIPLDGDSFAILLEGWEKEGNVEKAEVTFDEMVKRIGWNP 240

Query: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300
           ENVSSYDAFLITLVRGGRSEDAIKVLL+LKKNRCLPGLKFLSNALDSLIQQNDANHAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLELKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360
           WDIVVGSGLVPNLIVYNAIIGLLSENSKIDD+FRLLDSMVFHGAFPNS+TYNLIFS LIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDTFRLLDSMVFHGAFPNSVTYNLIFSCLIK 360

Query: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420
           NKKVKEVSQFFREMVKNECPPTPS+CAAAITMLFDGYDPETAIDIWNYMDENHIEPMD S
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDAS 420

Query: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480
           ANALLIGLCNLNRLTEVR FADDMIDQRI+ILESTMKLLKNCFY QRG+FRENYDGLLRR
Sbjct: 421 ANALLIGLCNLNRLTEVRSFADDMIDQRIEILESTMKLLKNCFYLQRGSFRENYDGLLRR 480

Query: 481 WRASSIL 488
           WRASSIL
Sbjct: 481 WRASSIL 487

BLAST of Csa4G242870 vs. NCBI nr
Match: gi|778692662|ref|XP_011653504.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like isoform X2 [Cucumis sativus])

HSP 1 Score: 898.3 bits (2320), Expect = 6.0e-258
Identity = 457/487 (93.84%), Postives = 457/487 (93.84%), Query Frame = 1

Query: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60
           MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA
Sbjct: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60

Query: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120
           LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE
Sbjct: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180
           MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180

Query: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240
           ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP
Sbjct: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240

Query: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300
           ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360
           WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360

Query: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420
           NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420

Query: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480
           ANALLIGLCNLNRLTE                              RGNFRENYDGLLRR
Sbjct: 421 ANALLIGLCNLNRLTE------------------------------RGNFRENYDGLLRR 457

Query: 481 WRASSIL 488
           WRASSIL
Sbjct: 481 WRASSIL 457

BLAST of Csa4G242870 vs. NCBI nr
Match: gi|659087036|ref|XP_008444242.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 859.0 bits (2218), Expect = 4.1e-246
Identity = 433/487 (88.91%), Postives = 447/487 (91.79%), Query Frame = 1

Query: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60
           MAENPKMGV+NPSKIPPSS+PRS NSPRFP HLDLPDISPAAKTICEVL++V RNEVD A
Sbjct: 1   MAENPKMGVRNPSKIPPSSAPRSPNSPRFPSHLDLPDISPAAKTICEVLIKVPRNEVDAA 60

Query: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120
           L ATGLAPSPELVQEVLRVSYN PSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE
Sbjct: 61  LSATGLAPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180
           MWNGIRTMRQEKILSLPTFVSVFGSYCSAGR KEA M+FEVMDRYEVEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRFKEATMSFEVMDRYEVEKDVVAVNSLLSA 180

Query: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240
           ICSEENQTS+AWEFFEKHKEKIPLDG+SFAILLEGWEKEGNVEKA+VTFDEMVKR+GWNP
Sbjct: 181 ICSEENQTSKAWEFFEKHKEKIPLDGDSFAILLEGWEKEGNVEKAEVTFDEMVKRIGWNP 240

Query: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300
           ENVSSYDAFLITLVRGGRSEDAIKVLL+LKKNRCLPGLKFLSNALDSLIQQNDANHAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLELKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360
           WDIVVGSGLVPNLIVYNAIIGLLSENSKIDD+FRLLDSMVFHGAFPNS+TYNLIFS LIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDTFRLLDSMVFHGAFPNSVTYNLIFSCLIK 360

Query: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420
           NKKVKEVSQFFREMVKNECPPTPS+CAAAITMLFDGYDPETAIDIWNYMDENHIEPMD S
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDAS 420

Query: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480
           ANALLIGLCNLNRLTE                              RG+FRENYDGLLRR
Sbjct: 421 ANALLIGLCNLNRLTE------------------------------RGSFRENYDGLLRR 457

Query: 481 WRASSIL 488
           WRASSIL
Sbjct: 481 WRASSIL 457

BLAST of Csa4G242870 vs. NCBI nr
Match: gi|947058350|gb|KRH07756.1| (hypothetical protein GLYMA_16G108300 [Glycine max])

HSP 1 Score: 588.2 bits (1515), Expect = 1.3e-164
Identity = 282/485 (58.14%), Postives = 373/485 (76.91%), Query Frame = 1

Query: 1   MAENP--KMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVD 60
           +AENP     +K+PSK P  + P   N  +FP HLD P++S  A+ +C++L R S  +++
Sbjct: 4   LAENPGRSRDLKHPSKNP--TKPPQPN--QFPSHLDAPNVSSTARALCDILTRSSPQDIE 63

Query: 61  GALLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELF 120
            AL ++G+ P  E   EVLR+SYN PSSA+KFFRWA +  K   ++WNLM+DLLGKN+LF
Sbjct: 64  SALSSSGIVPEEECTNEVLRLSYNYPSSAVKFFRWAGRGKKHPVHTWNLMVDLLGKNQLF 123

Query: 121 EEMWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLL 180
           E MW+ +R+M+QE+ LSL TF SVF SYC+A R  EA M+F+VMDRY V++DVVAVNSLL
Sbjct: 124 EPMWDAVRSMKQEQKLSLSTFASVFQSYCTAARFNEAVMSFDVMDRYGVKQDVVAVNSLL 183

Query: 181 SAICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGW 240
           SAICSE+NQTS   EFFE  K K+P DG++FAILLEGWEKEGN  KAK TF +MV  +GW
Sbjct: 184 SAICSEDNQTSFGLEFFEGIKAKVPPDGDTFAILLEGWEKEGNAAKAKTTFGDMVAHIGW 243

Query: 241 NPENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAI 300
           N +NV++YDAFL+TL+R G  +D ++ L  +K + C PGLKF + ALD L++QNDA+HA+
Sbjct: 244 NKDNVAAYDAFLMTLLRAGLMDDVVRFLQVMKDHDCFPGLKFFTTALDFLVKQNDADHAV 303

Query: 301 LLWDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSL 360
            +WD++V   LVPNLI+YNA+IGLL  N+ +D +FRLLD M FHGAFP+SLTYN+IF  L
Sbjct: 304 PVWDVMVSGELVPNLIMYNAMIGLLCNNAAVDHAFRLLDEMAFHGAFPDSLTYNMIFECL 363

Query: 361 IKNKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMD 420
           +KNKK +E  +FF EMVKNE PPT S+CAAAI MLFD  DPE A +IW+Y+ EN ++P+D
Sbjct: 364 VKNKKARETERFFAEMVKNEWPPTGSNCAAAIAMLFDCDDPEAAHEIWSYVVENRVKPLD 423

Query: 421 TSANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLL 480
            SANALLIGLCN++R TEV+RFA+D++D+RI+I +STM +LK+ FY++  + R+ YD L 
Sbjct: 424 ESANALLIGLCNMSRFTEVKRFAEDILDRRINIYQSTMSILKDAFYKEGRSARDRYDSLY 483

Query: 481 RRWRA 484
           RRW+A
Sbjct: 484 RRWKA 484

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP129_ARATH3.2e-4929.40Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop... [more]
PP112_ARATH1.4e-4127.12Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop... [more]
PPR78_ARATH1.7e-3426.00Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop... [more]
PP248_ARATH1.4e-2825.06Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop... [more]
PP117_ARATH7.1e-2521.76Pentatricopeptide repeat-containing protein At1g73400, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0R0FP69_SOYBN9.3e-16558.14Uncharacterized protein OS=Glycine max GN=GLYMA_16G108300 PE=4 SV=1[more]
K7MGA0_SOYBN9.3e-16558.14Uncharacterized protein OS=Glycine max PE=4 SV=1[more]
A0A061G6A0_THECC3.5e-16458.40Tetratricopeptide repeat-like superfamily protein isoform 1 OS=Theobroma cacao G... [more]
A0A0B2QVZ1_GLYSO3.0e-16357.94Pentatricopeptide repeat-containing protein, mitochondrial OS=Glycine soja GN=gl... [more]
A0A0D2V822_GOSRA1.3e-16158.39Uncharacterized protein OS=Gossypium raimondii GN=B456_012G184800 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G77360.11.8e-5029.40 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G71060.18.0e-4327.12 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G52640.19.5e-3626.00 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G22670.17.8e-3025.06 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G73400.14.0e-2621.76 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449466215|ref|XP_004150822.1|1.5e-280100.00PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-... [more]
gi|659087034|ref|XP_008444241.1|9.3e-26794.46PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-... [more]
gi|778692662|ref|XP_011653504.1|6.0e-25893.84PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-... [more]
gi|659087036|ref|XP_008444242.1|4.1e-24688.91PREDICTED: pentatricopeptide repeat-containing protein At1g77360, mitochondrial-... [more]
gi|947058350|gb|KRH07756.1|1.3e-16458.14hypothetical protein GLYMA_16G108300 [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU117188cucumber EST collection version 3.0transcribed_cluster
CU147907cucumber EST collection version 3.0transcribed_cluster
CU170559cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa4G242870.1Csa4G242870.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU147907CU147907transcribed_cluster
CU170559CU170559transcribed_cluster
CU117188CU117188transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 209..235
score: 0.079coord: 103..131
score: 0.033coord: 138..164
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 311..359
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 208..240
score: 0.0013coord: 314..347
score: 0.0012coord: 350..382
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 277..311
score: 5.426coord: 101..135
score: 8.068coord: 205..235
score: 8.364coord: 347..381
score: 11.159coord: 170..201
score: 7.848coord: 312..346
score: 9.493coord: 417..451
score: 6.38coord: 138..169
score: 5.777coord: 382..416
score: 6.763coord: 242..276
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 87..269
score: 5.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 186..266
score: 2.15E-6coord: 82..125
score: 2.1
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..48
score: 9.8E-208coord: 68..467
score: 9.8E
NoneNo IPR availablePANTHERPTHR24015:SF821SUBFAMILY NOT NAMEDcoord: 1..48
score: 9.8E-208coord: 68..467
score: 9.8E