Lsi04G021540 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G021540
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr04 : 28620931 .. 28624431 (-)
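The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical and used only for illustration.

```python
import re

# Pattern for strings such as "chr04 : 28620931 .. 28624431 (-)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("chr04 : 28620931 .. 28624431 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the feature spans 3501 bases on the minus strand of chr04.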
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAATTTGATTTTCTTTAATTTTTTAATACACTCGTGATATTATTACGGGAGGAGGGACGTTGAGCATAAGCAAGTGACGAGACCGGAATTTCGTGCGGCACTACCGGAATAGTTCGTCGGAAGATTGTCGCCGTGTTGTAGTCATCTATCTCTTTTTCTCAGTATAAACCCTAAACCCTAATCCCTAATGCCCATTCAGAAACCTCGCCTTTTGTTTTACCGAGTCCTCCACGCCATCGAACATAATCATCGTTCTACTTTTCGAGTTTCAGCTCTGCAATTTTCCTCTTTGTCTGCTCATTCATGGCTGTCCACGCCGGGAAAGCCCCTCATAAAGTGGCCTTCACTGCCCGACCAGTCCGGCAATCCATTGCAATCAAATTCTGCCGTGATTTCAAACCCTAACCCTGCCATTGACCTGAAATTCGAAGCAAGTTTTGAGCCGAATGACTTGTCTACCATTTCTGGCATCCTTTCCGATCGCGGTGTTCGGTCCGGTGCAGCGCTTGAGGATGCTTTGGACCGGACTGGAATCGTGCCGAGTTCGAGTTTGTTGGAAGCTGTGTTCAATCATTTTGATTCGTCTCCCAAGTTCTTGCACTCGCTGTTTCTCTGGGCTGAGAACAAACCTGGATTTTGTTCCTCTAAAGCATTGTTCAACTGTCTGATCAACGTGCTTGCGAAATCGAGGGAGTTTGATTCTGCTTGGTCTTTGATTATTCGTCGTCTCCGTGGAGATGAAGTGTCCTCTTTGGTTTCGGTCGACGTGTTCGTGATATTGATCAGGCGGTACGCTCGTGCTGGTATATTTTTAACGAAACTACAATGCCAGTACTTCCGTATGGATTTGATTGATTTTCCATTTACTTTCAAAATTCATATTTGGAAATCACCTTCTATCGCCATTGCAATTTAAGGGTTATATTCATCTTGATATTGTCATTCTCTACTGTTGAGTGCGCTTGATCGGAACTTGAAACCGAACAGAGAAATTCTTTAACTCGTTTTCTTGTTAGAGATAATGGTTTGATCTATATCTTTGTTTTTGTTCTTACGCGATGATAATTTCTTCAGGCATGCTTCAACCTGCAATTCGAACTTTTGAATTTGCTTGCAACCTAGAAACAATTTCAGGGACCAACTCGGAGGGTCTGTTTGAGATTTTACTGGATTCACTCTGCAAAGAAGGCCATGTCAGGGTAGCTTCTGAGTATTTCAATAGGAAAAGGGAAATGGATTCATGTTTTGAACCATCAATTCGAGCATATAATATACTTCTAAATGGATGGTTTCGATCAAGAAAACTCAAACATGCTGAGAGGCTCTGGTTTGAGATGAAGAAGAACAAAATATCTCCCACTGTTGTTACGTATGGCACCCTTGTAGAAGGCTACTGTCGAATGCGTCGTGTTGAAAGAGCCATTGAATTGGTGGATGAGATGAGGGGAGAAGGTATTGAGCCAAATGCAATCATTTATAACCCAATAGTTGATGCATTTGGGGAAGCTGGGAGGTTTAAGGAGGCACTGGGAATGATGGAGCGGTTCATGGTTTTGGAGCAAGGCCCTACTATCTCAACGTACAATTCTCTTATCAAGGGCTATTGCAAGGCACGTGATCTTTCTGGGGCTAGCAAGATCCTTAAGTCGATGATAGGTCGAGGATTCACTCCGACCCCGACCACCTACAACTACTTCTTCAGATTCTTCTCGAAGTATGGTAAAATTGAGGAGGGTATGAACCTTTACAATAAAATGATTGAATCAGGATATGCGCCAGATAAACTTACATATCACCTGCTGCTGAAGATGCTATGTGAAGAGGAGAGATTAAACCTTGCAGCTCAAGTTTGCAATGAAATGAAGGCTAGAGGGTTTGACATGGACTTAGCTACAAGCACCATGTTAGTGCATTTGCTTTGCAAGATGCATAAATTTGAAGAAGCTTTTGCAGAATTTGAGCACATGATTCATCGAGGAATTGTTCCCCAATATCTTA
CTTTCTGCAGATTACATGATGAATTCATGAAACGTGGATTGACAAAAATGGCATCTAAGCTGCAGGAAATGATGTCATCGGTTCCTCATTCAGAGAAGTTACCTGATACATATAATCAAACTCCTGATTCTATCCGTGCTAGAAGAACATCTATTATGCGTAAAGCTGAGGCAATGTCTGAAATGTTGAAGGTCTGTAAAGACCCTAGAGAGCTGGTCAAACGCAGAAGTCCATCTGAAAATGCCGTATTTAGTGCAAATAGGCTGATAGATGATATCAAGAAGAAAGCCAACCCTGTGACTGGTTGAACCCATCTTCAATTTTGCTGTTGCTGGTAAATAACTCCTTATCAGCTGAATACAAAATGACTTGCGCTACATAATTTCTCAATAGGAACCTTTGGAGAAACTCCAACAAATGGATCAGTTTCACAGGCATGTGCAGTTATTTGGTGAGGGGATATACACACTTTGTGCGGATTGAGTGTTTTTAAGGTGAACGGGGAGCATTCGTATATCAAATGAAGAGCTCGCAAGAATTCAGTTGGACTCGGCATTGTTTTATGTTTGAATGTAGAACAGAACGTAAGTCCATTTGATTTAATCTCTTCTTCCAAATTATTATTATTAGGCCTATTTGAAACACAACGGAGAGTTGTCAAAGCAAGGACCAAGTTGAAGGTTGGCGATCAACTTTAGCTGTTCAACAACATCAGTGTAATCCAAGACCCCTACTCTGATTTTCCCATTTCATTTTGCTCAATCTCTGTATTCTATTATATGTGCAAAAGCTTTTTTCTCACCAGGGATCAGAAACCATTTGCTGAACTTCAACTTTGTAGTTAGAGAGAAAAACAAGGATGGGAGTGCTGAATTCAAGGACATGACGTATGGATGTTGCTGAAAAGCAAGGTTTTTCAACAGAATGATGATGAAACGCACAATTGTTGGGAAGTGCTTTTGCAACCGAACTCAGTTGAATTGCATCATTGTCCATTAACAATGTCACGGGGAGCTTTATGAATGCAGGTGAATTTATTTGAGGAGGCTGCCACAATGTTGCTGACAATTTTGGACCGTTGGATGTAAGAAGAAGAAAATACTAAATACTTAAAATTTGTATCTTGTGGTTATTAGGCATCAACAGATTTGTGAATCTACATCCAACCGTAATAAAAAAGTAGCCCTATTGTTAAGAGCGAGGACGTTTCTGCAAGGACGGCTAGCTAGTGGAATCCACTCCACCTCCTCGTAAGATCATCTTTTTAAGTTCTTTGCCACTCAAGTATTACACAGCAACTAATTTGATTGGGGTGCAAAATCACAATTTACCAACTTTTTAAAAGGGGTTAAATTCTAAGTTTGGTTGTATAGATAATTATTAGCTTTAATAATCTTGAATGGAGCATGCTTTGCTTTTGATTATATAATTGTTTTGTTTAACCATTGTTCTTATCGTATTGTTTTAAAATATCACATCGCATCTATGGAGTGGAAATTTG

mRNA sequence

GAAAATTTGATTTTCTTTAATTTTTTAATACACTCGTGATATTATTACGGGAGGAGGGACGTTGAGCATAAGCAAGTGACGAGACCGGAATTTCGTGCGGCACTACCGGAATAGTTCGTCGGAAGATTGTCGCCGTGTTGTAGTCATCTATCTCTTTTTCTCAGTATAAACCCTAAACCCTAATCCCTAATGCCCATTCAGAAACCTCGCCTTTTGTTTTACCGAGTCCTCCACGCCATCGAACATAATCATCGTTCTACTTTTCGAGTTTCAGCTCTGCAATTTTCCTCTTTGTCTGCTCATTCATGGCTGTCCACGCCGGGAAAGCCCCTCATAAAGTGGCCTTCACTGCCCGACCAGTCCGGCAATCCATTGCAATCAAATTCTGCCGTGATTTCAAACCCTAACCCTGCCATTGACCTGAAATTCGAAGCAAGTTTTGAGCCGAATGACTTGTCTACCATTTCTGGCATCCTTTCCGATCGCGGTGTTCGGTCCGGTGCAGCGCTTGAGGATGCTTTGGACCGGACTGGAATCGTGCCGAGTTCGAGTTTGTTGGAAGCTGTGTTCAATCATTTTGATTCGTCTCCCAAGTTCTTGCACTCGCTGTTTCTCTGGGCTGAGAACAAACCTGGATTTTGTTCCTCTAAAGCATTGTTCAACTGTCTGATCAACGTGCTTGCGAAATCGAGGGAGTTTGATTCTGCTTGGTCTTTGATTATTCGTCGTCTCCGTGGAGATGAAGTGTCCTCTTTGGTTTCGGTCGACGTGTTCGTGATATTGATCAGGCGGTACGCTCGTGCTGGCATGCTTCAACCTGCAATTCGAACTTTTGAATTTGCTTGCAACCTAGAAACAATTTCAGGGACCAACTCGGAGGGTCTGTTTGAGATTTTACTGGATTCACTCTGCAAAGAAGGCCATGTCAGGGTAGCTTCTGAGTATTTCAATAGGAAAAGGGAAATGGATTCATGTTTTGAACCATCAATTCGAGCATATAATATACTTCTAAATGGATGGTTTCGATCAAGAAAACTCAAACATGCTGAGAGGCTCTGGTTTGAGATGAAGAAGAACAAAATATCTCCCACTGTTGTTACGTATGGCACCCTTGTAGAAGGCTACTGTCGAATGCGTCGTGTTGAAAGAGCCATTGAATTGGTGGATGAGATGAGGGGAGAAGGTATTGAGCCAAATGCAATCATTTATAACCCAATAGTTGATGCATTTGGGGAAGCTGGGAGGTTTAAGGAGGCACTGGGAATGATGGAGCGGTTCATGGTTTTGGAGCAAGGCCCTACTATCTCAACGTACAATTCTCTTATCAAGGGCTATTGCAAGGCACGTGATCTTTCTGGGGCTAGCAAGATCCTTAAGTCGATGATAGGTCGAGGATTCACTCCGACCCCGACCACCTACAACTACTTCTTCAGATTCTTCTCGAAGTATGGTAAAATTGAGGAGGGTATGAACCTTTACAATAAAATGATTGAATCAGGATATGCGCCAGATAAACTTACATATCACCTGCTGCTGAAGATGCTATGTGAAGAGGAGAGATTAAACCTTGCAGCTCAAGTTTGCAATGAAATGAAGGCTAGAGGGTTTGACATGGACTTAGCTACAAGCACCATGTTAGTGCATTTGCTTTGCAAGATGCATAAATTTGAAGAAGCTTTTGCAGAATTTGAGCACATGATTCATCGAGGAATTGTTCCCCAATATCTTACTTTCTGCAGATTACATGATGAATTCATGAAACGTGGATTGACAAAAATGGCATCTAAGCTGCAGGAAATGATGTCATCGGTTCCTCATTCAGAGAAGTTACCTGATACATATAATCAAACTCCTGATTCTATCCGTGCTAGAAGAACATCTATTATGCGTAAAGCTGAGGCAATGTCTGAAATGTTGAAGGTCTGTAAAGACCCTAGAGAGCTGGTCAAACGCAGAAGTCCATCTGAAAATGCCGTATTTAGTGCAAATAGGCTGATAGA
TGATATCAAGAAGAAAGCCAACCCTGTGACTGGTTGAACCCATCTTCAATTTTGCTGTTGCTGGTAAATAACTCCTTATCAGCTGAATACAAAATGACTTGCGCTACATAATTTCTCAATAGGAACCTTTGGAGAAACTCCAACAAATGGATCAGTTTCACAGGCATGTGCAGTTATTTGGTGAGGGGATATACACACTTTGTGCGGATTGAGTGTTTTTAAGGTGAACGGGGAGCATTCGTATATCAAATGAAGAGCTCGCAAGAATTCAGTTGGACTCGGCATTGTTTTATGTTTGAATGTAGAACAGAACGTAAGTCCATTTGATTTAATCTCTTCTTCCAAATTATTATTATTAGGCCTATTTGAAACACAACGGAGAGTTGTCAAAGCAAGGACCAAGTTGAAGGTTGGCGATCAACTTTAGCTGTTCAACAACATCAGTGGATCAGAAACCATTTGCTGAACTTCAACTTTGTAGTTAGAGAGAAAAACAAGGATGGGAGTGCTGAATTCAAGGACATGACGTATGGATGTTGCTGAAAAGCAAGGTTTTTCAACAGAATGATGATGAAACGCACAATTGTTGGGAAGTGCTTTTGCAACCGAACTCAGTTGAATTGCATCATTGTCCATTAACAATGTCACGGGGAGCTTTATGAATGCAGGCATCAACAGATTTGTGAATCTACATCCAACCGTAATAAAAAAGTAGCCCTATTGTTAAGAGCGAGGACGTTTCTGCAAGGACGGCTAGCTAGTGGAATCCACTCCACCTCCTCGTAAGATCATCTTTTTAAGTTCTTTGCCACTCAAGTATTACACAGCAACTAATTTGATTGGGGTGCAAAATCACAATTTACCAACTTTTTAAAAGGGGTTAAATTCTAAGTTTGGTTGTATAGATAATTATTAGCTTTAATAATCTTGAATGGAGCATGCTTTGCTTTTGATTATATAATTGTTTTGTTTAACCATTGTTCTTATCGTATTGTTTTAAAATATCACATCGCATCTATGGAGTGGAAATTTG

Coding sequence (CDS)

ATGCCCATTCAGAAACCTCGCCTTTTGTTTTACCGAGTCCTCCACGCCATCGAACATAATCATCGTTCTACTTTTCGAGTTTCAGCTCTGCAATTTTCCTCTTTGTCTGCTCATTCATGGCTGTCCACGCCGGGAAAGCCCCTCATAAAGTGGCCTTCACTGCCCGACCAGTCCGGCAATCCATTGCAATCAAATTCTGCCGTGATTTCAAACCCTAACCCTGCCATTGACCTGAAATTCGAAGCAAGTTTTGAGCCGAATGACTTGTCTACCATTTCTGGCATCCTTTCCGATCGCGGTGTTCGGTCCGGTGCAGCGCTTGAGGATGCTTTGGACCGGACTGGAATCGTGCCGAGTTCGAGTTTGTTGGAAGCTGTGTTCAATCATTTTGATTCGTCTCCCAAGTTCTTGCACTCGCTGTTTCTCTGGGCTGAGAACAAACCTGGATTTTGTTCCTCTAAAGCATTGTTCAACTGTCTGATCAACGTGCTTGCGAAATCGAGGGAGTTTGATTCTGCTTGGTCTTTGATTATTCGTCGTCTCCGTGGAGATGAAGTGTCCTCTTTGGTTTCGGTCGACGTGTTCGTGATATTGATCAGGCGGTACGCTCGTGCTGGCATGCTTCAACCTGCAATTCGAACTTTTGAATTTGCTTGCAACCTAGAAACAATTTCAGGGACCAACTCGGAGGGTCTGTTTGAGATTTTACTGGATTCACTCTGCAAAGAAGGCCATGTCAGGGTAGCTTCTGAGTATTTCAATAGGAAAAGGGAAATGGATTCATGTTTTGAACCATCAATTCGAGCATATAATATACTTCTAAATGGATGGTTTCGATCAAGAAAACTCAAACATGCTGAGAGGCTCTGGTTTGAGATGAAGAAGAACAAAATATCTCCCACTGTTGTTACGTATGGCACCCTTGTAGAAGGCTACTGTCGAATGCGTCGTGTTGAAAGAGCCATTGAATTGGTGGATGAGATGAGGGGAGAAGGTATTGAGCCAAATGCAATCATTTATAACCCAATAGTTGATGCATTTGGGGAAGCTGGGAGGTTTAAGGAGGCACTGGGAATGATGGAGCGGTTCATGGTTTTGGAGCAAGGCCCTACTATCTCAACGTACAATTCTCTTATCAAGGGCTATTGCAAGGCACGTGATCTTTCTGGGGCTAGCAAGATCCTTAAGTCGATGATAGGTCGAGGATTCACTCCGACCCCGACCACCTACAACTACTTCTTCAGATTCTTCTCGAAGTATGGTAAAATTGAGGAGGGTATGAACCTTTACAATAAAATGATTGAATCAGGATATGCGCCAGATAAACTTACATATCACCTGCTGCTGAAGATGCTATGTGAAGAGGAGAGATTAAACCTTGCAGCTCAAGTTTGCAATGAAATGAAGGCTAGAGGGTTTGACATGGACTTAGCTACAAGCACCATGTTAGTGCATTTGCTTTGCAAGATGCATAAATTTGAAGAAGCTTTTGCAGAATTTGAGCACATGATTCATCGAGGAATTGTTCCCCAATATCTTACTTTCTGCAGATTACATGATGAATTCATGAAACGTGGATTGACAAAAATGGCATCTAAGCTGCAGGAAATGATGTCATCGGTTCCTCATTCAGAGAAGTTACCTGATACATATAATCAAACTCCTGATTCTATCCGTGCTAGAAGAACATCTATTATGCGTAAAGCTGAGGCAATGTCTGAAATGTTGAAGGTCTGTAAAGACCCTAGAGAGCTGGTCAAACGCAGAAGTCCATCTGAAAATGCCGTATTTAGTGCAAATAGGCTGATAGATGATATCAAGAAGAAAGCCAACCCTGTGACTGGTTGA

Protein sequence

MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGNPLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSANRLIDDIKKKANPVTG
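The protein sequence above should follow from the coding sequence by codon-wise translation. The sketch below checks this on the first and last few codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and no external library is used.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGCCCATTCAGAAACCTCGC"))  # MPIQKPR
print(translate("CCTGTGACTGGTTGA"))        # PVTG
```

Both fragments agree with the start (MPIQKPR) and end (PVTG) of the listed protein sequence; passing the full CDS string to `translate` would extend the same check to every residue.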
BLAST of Lsi04G021540 vs. Swiss-Prot
Match: PP375_ARATH (Pentatricopeptide repeat-containing protein At5g11310, mitochondrial OS=Arabidopsis thaliana GN=At5g11310 PE=2 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 1.2e-173
Identity = 321/604 (53.15%), Positives = 420/604 (69.54%), Query Frame = 1

Query: 10  FYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGNPLQSNSAVI 69
           F R L    + HR+ F    L  S  S+      P    I+ P++PD +  P Q N+   
Sbjct: 8   FRRNLLLNPNPHRNFFLHRLLSSSRRSSPLIPVEPLIQRIQSPAVPDSTCTPPQQNTV-- 67

Query: 70  SNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNH 129
                             DLSTIS +L +  V  G++LE ALD TGI PS  L+ A+F+ 
Sbjct: 68  ---------------SKTDLSTISNLLENTDVVPGSSLESALDETGIEPSVELVHALFDR 127

Query: 130 FDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSL 189
             SSP  LHS+F WAE KPGF  S +LF+ ++N L K+REF+ AWSL+  R+R DE S+L
Sbjct: 128 LSSSPMLLHSVFKWAEMKPGFTLSPSLFDSVVNSLCKAREFEIAWSLVFDRVRSDEGSNL 187

Query: 190 VSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDSLCKEGHVRV 249
           VS D F++LIRRYARAGM+Q AIR FEFA + E +  + +E  L E+LLD+LCKEGHVR 
Sbjct: 188 VSADTFIVLIRRYARAGMVQQAIRAFEFARSYEPVCKSATELRLLEVLLDALCKEGHVRE 247

Query: 250 ASEYFNR-KREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGT 309
           AS Y  R    MDS + PS+R +NILLNGWFRSRKLK AE+LW EMK   + PTVVTYGT
Sbjct: 248 ASMYLERIGGTMDSNWVPSVRIFNILLNGWFRSRKLKQAEKLWEEMKAMNVKPTVVTYGT 307

Query: 310 LVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLE 369
           L+EGYCRMRRV+ A+E+++EM+   +E N +++NPI+D  GEAGR  EALGMMERF V E
Sbjct: 308 LIEGYCRMRRVQIAMEVLEEMKMAEMEINFMVFNPIIDGLGEAGRLSEALGMMERFFVCE 367

Query: 370 QGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGM 429
            GPTI TYNSL+K +CKA DL GASKILK M+ RG  PT TTYN+FF++FSK+ K EEGM
Sbjct: 368 SGPTIVTYNSLVKNFCKAGDLPGASKILKMMMTRGVDPTTTTYNHFFKYFSKHNKTEEGM 427

Query: 430 NLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLL 489
           NLY K+IE+G++PD+LTYHL+LKMLCE+ +L+LA QV  EMK RG D DL T+TML+HLL
Sbjct: 428 NLYFKLIEAGHSPDRLTYHLILKMLCEDGKLSLAMQVNKEMKNRGIDPDLLTTTMLIHLL 487

Query: 490 CKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKL 549
           C++   EEAF EF++ + RGI+PQY+TF  + +    +G++ MA +L  +MSS+PHS+KL
Sbjct: 488 CRLEMLEEAFEEFDNAVRRGIIPQYITFKMIDNGLRSKGMSDMAKRLSSLMSSLPHSKKL 547

Query: 550 PDTYNQTPDS--IRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSANRLIDD 609
           P+TY +  D+   + RR SI+ +AEAMS++LK C++PR+LVK R   + AV     LIDD
Sbjct: 548 PNTYREAVDAPPDKDRRKSILHRAEAMSDVLKGCRNPRKLVKMRGSHKKAVGEDINLIDD 594
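The identity and positives percentages reported for an HSP are simple ratios of matching columns to alignment length. The sketch below recomputes them from the summary line of the hit above as a sanity check; the function name `check_hsp_line` and the regular expression are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; the "Pos\w*" part
# also tolerates variant spellings of the second label.
HSP_LINE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def check_hsp_line(line):
    """Return recomputed and reported (identity, positives) percentages."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    computed = (
        round(100.0 * int(id_n) / int(id_len), 2),
        round(100.0 * int(pos_n) / int(pos_len), 2),
    )
    reported = (float(id_pct), float(pos_pct))
    return computed, reported


computed, reported = check_hsp_line(
    "Identity = 321/604 (53.15%), Positives = 420/604 (69.54%), Query Frame = 1"
)
print(computed, reported)
```

For this hit, 321 identical and 420 similar positions over 604 aligned columns give 53.15% and 69.54%, matching the reported values.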

BLAST of Lsi04G021540 vs. Swiss-Prot
Match: PP150_ARATH (Pentatricopeptide repeat-containing protein At2g13420, mitochondrial OS=Arabidopsis thaliana GN=At2g13420 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.0e-53
Identity = 136/443 (30.70%), Positives = 219/443 (49.44%), Query Frame = 1

Query: 107 LEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAK 166
           +E +L   GI  + +L+         + K   S F +  + P   ++   FN +I++L +
Sbjct: 58  MESSLQLNGISLTPNLIHQTLLRLRHNSKIALSFFQYLRSLPSPSTTPTSFNLIIDILGR 117

Query: 167 SREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFA-CNLETIS 226
            R+FD    LI+     D+ S     + F+IL++R   AG+ + A+R F+ A C LE   
Sbjct: 118 VRQFDVVRQLIVEM---DQTSP----ETFLILVKRLIAAGLTRQAVRAFDDAPCFLENRR 177

Query: 227 GTNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKH 286
               E  F  LLD+LCK G+ ++A   FN ++E    F    + Y IL+ GW + R++  
Sbjct: 178 FRLVE--FGFLLDTLCKYGYTKMAVGVFNERKEE---FGSDEKVYTILIAGWCKLRRIDM 237

Query: 287 AERLWFEMKKNKISPTVVTYGTLVEGYCRM----------RRVERAIELVDEMRGEGIEP 346
           AE+   EM ++ I P VVTY  L+ G CR           R V  A ++ DEMR  GIEP
Sbjct: 238 AEKFLVEMIESGIEPNVVTYNVLLNGICRTASLHPEERFERNVRNAEKVFDEMRQRGIEP 297

Query: 347 NAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKIL 406
           +   ++ ++  +  A + +  L  M+        PTI TY S++K  C    L  A ++L
Sbjct: 298 DVTSFSIVLHMYSRAHKAELTLDKMKLMKAKGISPTIETYTSVVKCLCSCGRLEEAEELL 357

Query: 407 KSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEE 466
           ++M+  G +P+  TYN FF+ +         MNLY KM      P   TY++LL      
Sbjct: 358 ETMVESGISPSSATYNCFFKEYKGRKDANGAMNLYRKMKNGLCKPSTQTYNVLLGTFINL 417

Query: 467 ERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTF 526
            ++    ++ +++KA     DL + T LVH LC   K++EA   F  MI RG +PQ LTF
Sbjct: 418 GKMETVKEIWDDLKASETGPDLDSYTSLVHGLCSKEKWKEACGYFVEMIERGFLPQKLTF 477

Query: 527 CRLHDEFMKRGLTKMASKLQEMM 539
             L+   ++    +   +L++ +
Sbjct: 478 ETLYKGLIQSNKMRTWRRLKKKL 488

BLAST of Lsi04G021540 vs. Swiss-Prot
Match: PP129_ARATH (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 202.2 bits (513), Expect = 1.6e-50
Identity = 129/436 (29.59%), Positives = 218/436 (50.00%), Query Frame = 1

Query: 107 LEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAK 166
           L+ ALD++G+  S  ++E V N F ++    +  F W+E +  +  S   ++ +I   AK
Sbjct: 87  LDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEKQRHYEHSVRAYHMMIESTAK 146

Query: 167 SREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISG 226
            R++   W LI    +      +++V+ F I++R+YARA  +  AI  F     +E    
Sbjct: 147 IRQYKLMWDLINAMRK----KKMLNVETFCIVMRKYARAQKVDEAIYAFNV---MEKYDL 206

Query: 227 TNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHA 286
             +   F  LL +LCK  +VR A E F     M   F P  + Y+ILL GW +   L  A
Sbjct: 207 PPNLVAFNGLLSALCKSKNVRKAQEVF---ENMRDRFTPDSKTYSILLEGWGKEPNLPKA 266

Query: 287 ERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDA 346
             ++ EM      P +VTY  +V+  C+  RV+ A+ +V  M     +P   IY+ +V  
Sbjct: 267 REVFREMIDAGCHPDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKPTTFIYSVLVHT 326

Query: 347 FGEAGRFKEALGMMERFMVLEQG---PTISTYNSLIKGYCKARDLSGASKILKSMIGRGF 406
           +G   R +EA   ++ F+ +E+      ++ +NSLI  +CKA  +    ++LK M  +G 
Sbjct: 327 YGTENRLEEA---VDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGV 386

Query: 407 TPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQ 466
           TP   + N   R   + G+ +E  +++ KMI+    PD  TY +++KM CE++ +  A +
Sbjct: 387 TPNSKSCNIILRHLIERGEKDEAFDVFRKMIKV-CEPDADTYTMVIKMFCEKKEMETADK 446

Query: 467 VCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFM 526
           V   M+ +G    + T ++L++ LC+    ++A    E MI  GI P  +TF RL    +
Sbjct: 447 VWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLI 506

Query: 527 KRGLTKMASKLQEMMS 540
           K     +   L E M+
Sbjct: 507 KEEREDVLKFLNEKMN 508

BLAST of Lsi04G021540 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 8.1e-50
Identity = 123/434 (28.34%), Positives = 213/434 (49.08%), Query Frame = 1

Query: 111 LDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREF 170
           L +  +V + SL+  V   F +     +  F+WA ++ G+  S   +N +++VL K R F
Sbjct: 123 LSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVHSGHTYNAMVDVLGKCRNF 182

Query: 171 DSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE 230
           D  W L+   +  +E S LV++D    ++RR A++G    A+  F     +E   G  ++
Sbjct: 183 DLMWELV-NEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVDAF---LEMEKSYGVKTD 242

Query: 231 GL-FEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERL 290
            +    L+D+L KE  +  A E F +   +    +P  R +NIL++G+ ++RK   A  +
Sbjct: 243 TIAMNSLMDALVKENSIEHAHEVFLK---LFDTIKPDARTFNILIHGFCKARKFDDARAM 302

Query: 291 WFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGE 350
              MK  + +P VVTY + VE YC+     R  E+++EMR  G  PN + Y  ++ + G+
Sbjct: 303 MDLMKVTEFTPDVVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNPNVVTYTIVMHSLGK 362

Query: 351 AGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTT 410
           + +  EALG+ E+       P    Y+SLI    K      A++I + M  +G       
Sbjct: 363 SKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFKDAAEIFEDMTNQGVRRDVLV 422

Query: 411 YNYFFRFFSKYGKIEEGMNLYNKMIE---SGYAPDKLTYHLLLKMLCEEERLNLAAQVCN 470
           YN        + + E  + L  +M +      +P+  TY  LLKM C ++++ L   + +
Sbjct: 423 YNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMCCHKKKMKLLGILLH 482

Query: 471 EMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRG 530
            M      +D++T  +L+  LC   K EEA   FE  + +G+VP+  T   L DE  K+ 
Sbjct: 483 HMVKNDVSIDVSTYILLIRGLCMSGKVEEACLFFEEAVRKGMVPRDSTCKMLVDELEKKN 542

Query: 531 LTKMASKLQEMMSS 541
           + +   K+Q ++ S
Sbjct: 543 MAEAKLKIQSLVQS 549

BLAST of Lsi04G021540 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 8.1e-50
Identity = 123/436 (28.21%), Positives = 215/436 (49.31%), Query Frame = 1

Query: 107 LEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAK 166
           LE AL+ +G+     L+E V N    +    +  F+WA  +P +C S  ++  ++ +L+K
Sbjct: 100 LELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSK 159

Query: 167 SREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISG 226
            R+F + W LI    +  E   L+  ++FV+L++R+A A M++ AI   +    +     
Sbjct: 160 MRQFGAVWGLIEEMRK--ENPQLIEPELFVVLVQRFASADMVKKAIEVLD---EMPKFGF 219

Query: 227 TNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHA 286
              E +F  LLD+LCK G V+ A++ F    +M   F  ++R +  LL GW R  K+  A
Sbjct: 220 EPDEYVFGCLLDALCKHGSVKDAAKLFE---DMRMRFPVNLRYFTSLLYGWCRVGKMMEA 279

Query: 287 ERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDA 346
           + +  +M +    P +V Y  L+ GY    ++  A +L+ +MR  G EPNA  Y  ++ A
Sbjct: 280 KYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQA 339

Query: 347 FGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPT 406
             +  R +EA+ +       E    + TY +L+ G+CK   +     +L  MI +G  P+
Sbjct: 340 LCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPS 399

Query: 407 PTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCN 466
             TY +      K    EE + L  KM +  Y PD   Y++++++ C+   +  A ++ N
Sbjct: 400 ELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWN 459

Query: 467 EMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGI--VPQYLTFCRLHDEFMK 526
           EM+  G    + T  ++++ L       EA   F+ M+ RG+  V QY T   L +  +K
Sbjct: 460 EMEENGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLK 519

Query: 527 RGLTKMASKLQEMMSS 541
               +MA  +   ++S
Sbjct: 520 DKKLEMAKDVWSCITS 527

BLAST of Lsi04G021540 vs. TrEMBL
Match: A0A0A0LEE0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G878910 PE=4 SV=1)

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 555/614 (90.39%), Positives = 576/614 (93.81%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MPI KP +L YR+LH+I+HNH S+FR SALQFSSLS HSWLSTPGKPL+KWPSLPDQ  N
Sbjct: 1   MPIHKPLILIYRILHSIQHNHPSSFRFSALQFSSLSPHSWLSTPGKPLVKWPSLPDQPAN 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
           PL SNSAVISNPN AID+KFEAS+ PNDLSTIS ILSDR VR GAALEDALDRTGIVPSS
Sbjct: 61  PLPSNSAVISNPNSAIDVKFEASYSPNDLSTISSILSDRSVRPGAALEDALDRTGIVPSS 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
           SLLEAVF+HFDSSPKFLHSLFLWA  K GF  S ALFN LINVLAKSREFDSAWSLI  R
Sbjct: 121 SLLEAVFDHFDSSPKFLHSLFLWAAKKSGFRPSAALFNRLINVLAKSREFDSAWSLITSR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSEGLFEILLDSL 240
           LRG E S LVSV+VFVILIRRYARAGM+QPAIRT+EFACNLETISGT SEGLFEILLDSL
Sbjct: 181 LRGGEESFLVSVEVFVILIRRYARAGMVQPAIRTYEFACNLETISGTGSEGLFEILLDSL 240

Query: 241 CKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISP 300
           CKEGHVRVASEYFNRKREM S FEPSIRAYNIL+NGWFRSRKLKHA+RLWFEMKKNKISP
Sbjct: 241 CKEGHVRVASEYFNRKREMGSSFEPSIRAYNILINGWFRSRKLKHAQRLWFEMKKNKISP 300

Query: 301 TVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMM 360
           TVVTYGTL+EGYCRMR VE AIELVDEMR EGIEPNAI+YNPIVDA GEAGRFKEALGMM
Sbjct: 301 TVVTYGTLIEGYCRMRSVEIAIELVDEMRREGIEPNAIVYNPIVDALGEAGRFKEALGMM 360

Query: 361 ERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKY 420
           ERFMVLEQGPTISTYNSL+KGYCKA DLSGASKILK MIGRGFTPTPTTYNYFFRFFSKY
Sbjct: 361 ERFMVLEQGPTISTYNSLVKGYCKAGDLSGASKILKMMIGRGFTPTPTTYNYFFRFFSKY 420

Query: 421 GKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATS 480
           GKIEE M+LYNKMIESGYAPDKLTYHLLLKMLCEEERLNLA QVCNEMKARGFDMDLATS
Sbjct: 421 GKIEESMSLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAVQVCNEMKARGFDMDLATS 480

Query: 481 TMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS 540
           TML+HLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS
Sbjct: 481 TMLMHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS 540

Query: 541 VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSAN 600
           VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSE+AVFSAN
Sbjct: 541 VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSEDAVFSAN 600

Query: 601 RLIDDIKKKANPVT 615
           +LIDDIKKKANP T
Sbjct: 601 KLIDDIKKKANPGT 614

BLAST of Lsi04G021540 vs. TrEMBL
Match: M5XNZ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003040mg PE=4 SV=1)

HSP 1 Score: 783.5 bits (2022), Expect = 1.9e-223
Identity = 399/612 (65.20%), Positives = 470/612 (76.80%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MP  K R      L   + N     R  +L+F   S  SWLS  G P+IKWPS PD   +
Sbjct: 1   MPSHKTRHFLSLALLLFKPNPNLNLRALSLRF--FSNQSWLSVRGNPIIKWPSPPDIPCS 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
               N A   NPNP        +F  ND STI+ +L+D  +  G++L+ ALDRTGI P  
Sbjct: 61  LPHPNPAPNPNPNPN---SSGPNFSQNDFSTIANVLADPSISPGSSLQSALDRTGIEPGP 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
            LL+AVF+HFDSSPK LH+LFLWAE +PGF SS  LF C+INVLAKSREF+SAWSLI+ R
Sbjct: 121 CLLQAVFDHFDSSPKLLHTLFLWAEKRPGFRSSATLFGCMINVLAKSREFESAWSLILNR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDS 240
           + GDE   LVSVD FVI+IRRY+RAGM Q AIRTFEFA NL++   + SE  LFE+LLDS
Sbjct: 181 IGGDEEPGLVSVDTFVIMIRRYSRAGMSQSAIRTFEFASNLDSFLNSESEMSLFEVLLDS 240

Query: 241 LCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKIS 300
           LCKEG VRVASEYF+ KR++   + PS+R YNILLNGWFRSRKLK AERLW EMK++ + 
Sbjct: 241 LCKEGLVRVASEYFDMKRKLHPDWIPSVRVYNILLNGWFRSRKLKRAERLWAEMKRDNVK 300

Query: 301 PTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGM 360
           P+VVTYGTL+EGYCRMRR E AIELV EMR EGIEPNAI+YN I+DA GEAG+FKEALGM
Sbjct: 301 PSVVTYGTLIEGYCRMRRAEIAIELVSEMRSEGIEPNAIVYNAIIDALGEAGKFKEALGM 360

Query: 361 MERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSK 420
           ME F+VLE GPTISTYNSL KG+CKA DL GASKILK MI +G  PTPTTYNYFFR+FSK
Sbjct: 361 MEHFLVLESGPTISTYNSLAKGFCKAGDLVGASKILKMMISKGCVPTPTTYNYFFRYFSK 420

Query: 421 YGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLAT 480
           +GKIEEGMNLY KMIESGY PD+LT+HLLLKMLC+E RL LA QV  EM++RG DMDLAT
Sbjct: 421 FGKIEEGMNLYTKMIESGYTPDRLTFHLLLKMLCDEGRLGLAVQVSKEMRSRGLDMDLAT 480

Query: 481 STMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMS 540
           STML+HLLC +HKF+EAFAEFE MI RG+VPQYLTF R++ E  K+G+T+MA K+  MMS
Sbjct: 481 STMLIHLLCNVHKFKEAFAEFEDMIRRGLVPQYLTFQRMNVELRKQGMTEMAHKMCNMMS 540

Query: 541 SVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSA 600
           SVPHS  LP+TY +  D+  ARR SI++KAEAMS++LK C DPRELVK RS  EN V  A
Sbjct: 541 SVPHSTNLPNTYVRERDASHARRKSIIQKAEAMSDLLKTCSDPRELVKYRSLPENVVSRA 600

Query: 601 NRLIDDIKKKAN 612
           N+L++DIK+KAN
Sbjct: 601 NQLVEDIKRKAN 607

BLAST of Lsi04G021540 vs. TrEMBL
Match: W9SQG5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007369 PE=4 SV=1)

HSP 1 Score: 761.5 bits (1965), Expect = 7.7e-217
Identity = 386/578 (66.78%), Positives = 455/578 (78.72%), Query Frame = 1

Query: 33  SSLSAHSWLSTPGKPLIKWPSLPDQSGNPLQSNSAVISNPNPAIDLKFEASFEPNDLSTI 92
           SS S  SWLS PGKPLI+WP  P    NP    +    +PNP       A F  N+ + I
Sbjct: 43  SSASGLSWLSVPGKPLIRWPHEPCSVPNPQPDPNP---SPNPG------AEFSQNEFAAI 102

Query: 93  SGILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCS 152
           S +L++  +  G +L  ALDRTGI PS SLL+AVF+HFDSSPK L+SLFLWAE +PG+ S
Sbjct: 103 SEVLTNPNISGGFSLHTALDRTGIEPSPSLLQAVFDHFDSSPKLLYSLFLWAEKQPGYRS 162

Query: 153 SKALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAI 212
           S +LF  +INVLAKSREFDSAWSLI+ R+  +E   LV  D FVI+IRRYAR GM Q A+
Sbjct: 163 SASLFASVINVLAKSREFDSAWSLILHRIGKEEEPRLVCEDTFVIMIRRYAREGMPQSAV 222

Query: 213 RTFEFACNLETISGTNSE-GLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYN 272
           RTFEFA N   I    SE  LF ILLD+LCKEGHVR AS+YFN K+++D  + PSIRAYN
Sbjct: 223 RTFEFASNSVPICSYISEISLFGILLDALCKEGHVRAASDYFNEKKKLDPSWIPSIRAYN 282

Query: 273 ILLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGE 332
           ILLNGWFRSRKLK AERLW EMK++ +  TVVTYGTLVEGYCRMRR E A+ELV EMR E
Sbjct: 283 ILLNGWFRSRKLKRAERLWMEMKRDNVRSTVVTYGTLVEGYCRMRRAEIAVELVKEMRTE 342

Query: 333 GIEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGA 392
           GIEPNAI+YNPI+DA GEAGRFKEALGMMERF+VLE GPTISTYNSL+KG+CKA +L+GA
Sbjct: 343 GIEPNAIVYNPIIDALGEAGRFKEALGMMERFLVLESGPTISTYNSLVKGFCKAGNLAGA 402

Query: 393 SKILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKM 452
           SKI+K MIGRG  PTPTTYNYFF++FSK+GKIEEGMNLY KMI SG++PD+LTYHLLLKM
Sbjct: 403 SKIIKMMIGRGIIPTPTTYNYFFKYFSKFGKIEEGMNLYTKMIGSGHSPDRLTYHLLLKM 462

Query: 453 LCEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQ 512
           LCEE +L+LA QV  EM++RGFDMDLATSTML+HL C M +FEEA+ EF  MI RGIVPQ
Sbjct: 463 LCEEGKLDLAVQVGKEMRSRGFDMDLATSTMLIHLFCNMRRFEEAYLEFGDMIRRGIVPQ 522

Query: 513 YLTFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKLPDTYNQTPDSIRARRTSIMRKAEA 572
           YLT+ R+ DE  KRG+T+M SKL+++MSSVPHS KLP+TY +  D+   RR S+MRKAEA
Sbjct: 523 YLTYHRMKDELKKRGMTEMVSKLRDLMSSVPHSTKLPNTYTRDGDASSDRRNSVMRKAEA 582

Query: 573 MSEMLKVCKDPRELVKRRSPSENAVFSANRLIDDIKKK 610
           +S+MLK CK+ RELV  R P ENAV  ANRLI+DI+KK
Sbjct: 583 ISDMLKTCKESRELVNYRGPFENAVSLANRLIEDIQKK 611

BLAST of Lsi04G021540 vs. TrEMBL
Match: B9H9Q3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s26360g PE=4 SV=1)

HSP 1 Score: 755.7 bits (1950), Expect = 4.2e-215
Identity = 385/579 (66.49%), Positives = 451/579 (77.89%), Query Frame = 1

Query: 36  SAHSWLSTPGKPLIKWPSLPDQSGNPL--QSNSAVISNPNPAIDLKFEASFEPNDLSTIS 95
           SA SWL+  G PLIKWP  P+ + +P   Q NS+  SN NP        ++  ND  T+ 
Sbjct: 35  SAESWLAVQGNPLIKWPHNPNLAPSPSADQQNSSPTSNSNP--------NYHQNDFFTLC 94

Query: 96  GILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSS 155
            IL D  ++ G +L  ALDRTGI P   L+++VF+HFDSSPK LHS+FLWAE KPGF SS
Sbjct: 95  NILKDPKIQLGPSLRTALDRTGIEPELGLIQSVFDHFDSSPKLLHSVFLWAEKKPGFQSS 154

Query: 156 KALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIR 215
            ALFN ++N L K+REF SAW L++ R+ G+E   LVS D F ILIRRY RAGM + AIR
Sbjct: 155 AALFNSMVNFLGKAREFGSAWCLLLDRIGGNEGGDLVSSDTFAILIRRYTRAGMSEAAIR 214

Query: 216 TFEFACNLETISGTNS-EGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNI 275
           TFE+A +L+ I  + +   LFEILLDSLCKEGHVRVA++YF+RK E D C+ PS+R YNI
Sbjct: 215 TFEYASSLDLIHNSEAGTSLFEILLDSLCKEGHVRVATDYFDRKVEKDPCWVPSVRIYNI 274

Query: 276 LLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEG 335
           LLNGWFRSRKLKHAERLW EMKK  + P+VVTYGTLVEGY RMRRVERAIELVDEM+ EG
Sbjct: 275 LLNGWFRSRKLKHAERLWLEMKKKNVKPSVVTYGTLVEGYSRMRRVERAIELVDEMKREG 334

Query: 336 IEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGAS 395
           I+ NAI+YNPI+DA  EAGRFKE LGMME F + E+GPTISTYNSL+KGYCKA DL GAS
Sbjct: 335 IKSNAIVYNPIIDALAEAGRFKEVLGMMEHFFLCEEGPTISTYNSLVKGYCKAGDLVGAS 394

Query: 396 KILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKML 455
           KILK MI R   PTPTTYNYFFR FSK  KIEEGMNLY KMIESGY PD+LTYHLLLKML
Sbjct: 395 KILKMMISREVFPTPTTYNYFFRHFSKCRKIEEGMNLYTKMIESGYTPDRLTYHLLLKML 454

Query: 456 CEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQY 515
           CEEERL+LA Q+  EM+ARG DMDLATSTM  HLLCKM +FEEAFAEFE M+ RGIVPQY
Sbjct: 455 CEEERLDLAVQISKEMRARGCDMDLATSTMFTHLLCKMQRFEEAFAEFEDMLRRGIVPQY 514

Query: 516 LTFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKLPDTYNQTPDSIR-ARRTSIMRKAEA 575
           LTF RL+DEF K+GLT++A +L ++MSSV HS+ LP+TYN   D+ R ARR SI++KA  
Sbjct: 515 LTFHRLNDEFRKQGLTELARRLCKLMSSVSHSKNLPNTYNVDRDASRHARRKSILQKAGV 574

Query: 576 MSEMLKVCKDPRELVKRRSPSENAVFSANRLIDDIKKKA 611
           MSE+LK C DPRELVK RS S+N   SAN+LI+DIKK+A
Sbjct: 575 MSEILKTCNDPRELVKHRSSSQNPESSANQLIEDIKKRA 605

BLAST of Lsi04G021540 vs. TrEMBL
Match: A0A061GPP0_THECC (Pentatricopeptide repeat superfamily protein isoform 2 (Fragment) OS=Theobroma cacao GN=TCM_038312 PE=4 SV=1)

HSP 1 Score: 740.3 bits (1910), Expect = 1.8e-210
Identity = 384/577 (66.55%), Positives = 447/577 (77.47%), Query Frame = 1

Query: 36  SAHSWLSTPGKPLIKWPSLPDQSGNPLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGI 95
           S  SWLS    PLIKWP            +S+  + P+P  +  F  S    + S IS +
Sbjct: 15  SDQSWLSKKRNPLIKWPP----------PSSSPCNQPHPIPNRTFSQS----NFSIISNL 74

Query: 96  LSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKA 155
           L +  + SG++LE ALD+T I P   LL+A+F  FDSSPK LH LFLWAE KPGF SS  
Sbjct: 75  LKNSTITSGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKKPGFKSSAT 134

Query: 156 LFNCLINVLAKSREFDSAWSLIIRRLR-GDEVSSLVSVDVFVILIRRYARAGMLQPAIRT 215
           LF+ ++NVL K+R F+ AWSL++ R+  G E S+LVSV+ FVILIRRYARAGM QPAIRT
Sbjct: 135 LFDSMVNVLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRT 194

Query: 216 FEFACNLETISGTNSE-GLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNIL 275
           FEFA +LE I  ++ E  LFEI+LDSLCKEGHVRV SEY  RKRE D  + PSI+ YNIL
Sbjct: 195 FEFAKSLEQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNIL 254

Query: 276 LNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGI 335
           LNGWFRSRKLKHAERLW +MKK  + P+VVTYGTLVEGYC MRRVERAI+LVDEM+G GI
Sbjct: 255 LNGWFRSRKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGI 314

Query: 336 EPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASK 395
           EPNA +YNPI+DA GEAGR KEALGMMER  + E GP IS Y+SL+KGYCKARDL GASK
Sbjct: 315 EPNAKVYNPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASK 374

Query: 396 ILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLC 455
           ILK MI RGF PTPTTYNYFFR+FS++ KIEE MNLY KMIESG+ PD+LTYHLLLKML 
Sbjct: 375 ILKMMISRGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLF 434

Query: 456 EEERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYL 515
           EEERL+LA Q+  EM+ARG+D DLATSTML+HLLCKMH+FE+AF EFE MI RG+ PQYL
Sbjct: 435 EEERLDLAVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYL 494

Query: 516 TFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMS 575
           TF R++DE  KRG+T MASKL +MMSSV  S+KLP+TY    DS RARRTSIMRKAEAMS
Sbjct: 495 TFQRMNDELKKRGMTDMASKLCDMMSSVRSSKKLPNTYGGDEDSSRARRTSIMRKAEAMS 554

Query: 576 EMLKVCKDPRELVKRRSPSENAVFSANRLIDDIKKKA 611
           +MLK CKDPRE VK R+ SENAV SA RLI+ IK+ A
Sbjct: 555 DMLKTCKDPREFVKHRTLSENAVSSAGRLIEIIKEGA 577
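
The percentages in an HSP summary line follow directly from the match counts and the aligned length. The sketch below is illustrative only and not part of the database output: it parses the summary line of the HSP above and recomputes both percentages. The helper names (`parse_hsp_fractions`, `recomputed_percent`) are hypothetical, and it assumes the reported values are rounded to two decimals.

```python
import re

# Summary line taken from the HSP above. The label before the second
# fraction is matched loosely, so minor spelling variants still parse.
HSP_LINE = "Identity = 384/577 (66.55%), Positives = 447/577 (77.47%), Query Frame = 1"

# label = count/length (percent%)
FRACTION = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_fractions(line):
    """Return {name: (matches, aligned_length, reported_percent)}."""
    parsed = {}
    for label, matches, length, percent in FRACTION.findall(line):
        name = "identity" if label == "Identity" else "positives"
        parsed[name] = (int(matches), int(length), float(percent))
    return parsed


def recomputed_percent(matches, length):
    """Percentage of aligned columns, rounded to two decimals (assumed)."""
    return round(100.0 * matches / length, 2)


fractions = parse_hsp_fractions(HSP_LINE)
for name, (matches, length, reported) in fractions.items():
    print(name, recomputed_percent(matches, length), reported)
```

Both fractions share the denominator 577, which is the number of alignment columns in the HSP (gaps included), not the length of either sequence.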

BLAST of Lsi04G021540 vs. TAIR10
Match: AT5G11310.1 (AT5G11310.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 611.3 bits (1575), Expect = 6.5e-175
Identity = 321/604 (53.15%), Positives = 420/604 (69.54%), Query Frame = 1

Query: 10  FYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGNPLQSNSAVI 69
           F R L    + HR+ F    L  S  S+      P    I+ P++PD +  P Q N+   
Sbjct: 8   FRRNLLLNPNPHRNFFLHRLLSSSRRSSPLIPVEPLIQRIQSPAVPDSTCTPPQQNTV-- 67

Query: 70  SNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNH 129
                             DLSTIS +L +  V  G++LE ALD TGI PS  L+ A+F+ 
Sbjct: 68  ---------------SKTDLSTISNLLENTDVVPGSSLESALDETGIEPSVELVHALFDR 127

Query: 130 FDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSL 189
             SSP  LHS+F WAE KPGF  S +LF+ ++N L K+REF+ AWSL+  R+R DE S+L
Sbjct: 128 LSSSPMLLHSVFKWAEMKPGFTLSPSLFDSVVNSLCKAREFEIAWSLVFDRVRSDEGSNL 187

Query: 190 VSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDSLCKEGHVRV 249
           VS D F++LIRRYARAGM+Q AIR FEFA + E +  + +E  L E+LLD+LCKEGHVR 
Sbjct: 188 VSADTFIVLIRRYARAGMVQQAIRAFEFARSYEPVCKSATELRLLEVLLDALCKEGHVRE 247

Query: 250 ASEYFNR-KREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGT 309
           AS Y  R    MDS + PS+R +NILLNGWFRSRKLK AE+LW EMK   + PTVVTYGT
Sbjct: 248 ASMYLERIGGTMDSNWVPSVRIFNILLNGWFRSRKLKQAEKLWEEMKAMNVKPTVVTYGT 307

Query: 310 LVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLE 369
           L+EGYCRMRRV+ A+E+++EM+   +E N +++NPI+D  GEAGR  EALGMMERF V E
Sbjct: 308 LIEGYCRMRRVQIAMEVLEEMKMAEMEINFMVFNPIIDGLGEAGRLSEALGMMERFFVCE 367

Query: 370 QGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGM 429
            GPTI TYNSL+K +CKA DL GASKILK M+ RG  PT TTYN+FF++FSK+ K EEGM
Sbjct: 368 SGPTIVTYNSLVKNFCKAGDLPGASKILKMMMTRGVDPTTTTYNHFFKYFSKHNKTEEGM 427

Query: 430 NLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLL 489
           NLY K+IE+G++PD+LTYHL+LKMLCE+ +L+LA QV  EMK RG D DL T+TML+HLL
Sbjct: 428 NLYFKLIEAGHSPDRLTYHLILKMLCEDGKLSLAMQVNKEMKNRGIDPDLLTTTMLIHLL 487

Query: 490 CKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKL 549
           C++   EEAF EF++ + RGI+PQY+TF  + +    +G++ MA +L  +MSS+PHS+KL
Sbjct: 488 CRLEMLEEAFEEFDNAVRRGIIPQYITFKMIDNGLRSKGMSDMAKRLSSLMSSLPHSKKL 547

Query: 550 PDTYNQTPDS--IRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSANRLIDD 609
           P+TY +  D+   + RR SI+ +AEAMS++LK C++PR+LVK R   + AV     LIDD
Sbjct: 548 PNTYREAVDAPPDKDRRKSILHRAEAMSDVLKGCRNPRKLVKMRGSHKKAVGEDINLIDD 594

BLAST of Lsi04G021540 vs. TAIR10
Match: AT1G77360.1 (AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 202.2 bits (513), Expect = 9.2e-52
Identity = 129/436 (29.59%), Positives = 218/436 (50.00%), Query Frame = 1

Query: 107 LEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAK 166
           L+ ALD++G+  S  ++E V N F ++    +  F W+E +  +  S   ++ +I   AK
Sbjct: 87  LDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEKQRHYEHSVRAYHMMIESTAK 146

Query: 167 SREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISG 226
            R++   W LI    +      +++V+ F I++R+YARA  +  AI  F     +E    
Sbjct: 147 IRQYKLMWDLINAMRK----KKMLNVETFCIVMRKYARAQKVDEAIYAFNV---MEKYDL 206

Query: 227 TNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHA 286
             +   F  LL +LCK  +VR A E F     M   F P  + Y+ILL GW +   L  A
Sbjct: 207 PPNLVAFNGLLSALCKSKNVRKAQEVF---ENMRDRFTPDSKTYSILLEGWGKEPNLPKA 266

Query: 287 ERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDA 346
             ++ EM      P +VTY  +V+  C+  RV+ A+ +V  M     +P   IY+ +V  
Sbjct: 267 REVFREMIDAGCHPDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKPTTFIYSVLVHT 326

Query: 347 FGEAGRFKEALGMMERFMVLEQG---PTISTYNSLIKGYCKARDLSGASKILKSMIGRGF 406
           +G   R +EA   ++ F+ +E+      ++ +NSLI  +CKA  +    ++LK M  +G 
Sbjct: 327 YGTENRLEEA---VDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGV 386

Query: 407 TPTPTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQ 466
           TP   + N   R   + G+ +E  +++ KMI+    PD  TY +++KM CE++ +  A +
Sbjct: 387 TPNSKSCNIILRHLIERGEKDEAFDVFRKMIKV-CEPDADTYTMVIKMFCEKKEMETADK 446

Query: 467 VCNEMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFM 526
           V   M+ +G    + T ++L++ LC+    ++A    E MI  GI P  +TF RL    +
Sbjct: 447 VWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLI 506

Query: 527 KRGLTKMASKLQEMMS 540
           K     +   L E M+
Sbjct: 507 KEEREDVLKFLNEKMN 508

BLAST of Lsi04G021540 vs. TAIR10
Match: AT3G22670.1 (AT3G22670.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 199.9 bits (507), Expect = 4.5e-51
Identity = 123/434 (28.34%), Positives = 213/434 (49.08%), Query Frame = 1

Query: 111 LDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREF 170
           L +  +V + SL+  V   F +     +  F+WA ++ G+  S   +N +++VL K R F
Sbjct: 123 LSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVHSGHTYNAMVDVLGKCRNF 182

Query: 171 DSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE 230
           D  W L+   +  +E S LV++D    ++RR A++G    A+  F     +E   G  ++
Sbjct: 183 DLMWELV-NEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVDAF---LEMEKSYGVKTD 242

Query: 231 GL-FEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERL 290
            +    L+D+L KE  +  A E F +   +    +P  R +NIL++G+ ++RK   A  +
Sbjct: 243 TIAMNSLMDALVKENSIEHAHEVFLK---LFDTIKPDARTFNILIHGFCKARKFDDARAM 302

Query: 291 WFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGE 350
              MK  + +P VVTY + VE YC+     R  E+++EMR  G  PN + Y  ++ + G+
Sbjct: 303 MDLMKVTEFTPDVVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNPNVVTYTIVMHSLGK 362

Query: 351 AGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTT 410
           + +  EALG+ E+       P    Y+SLI    K      A++I + M  +G       
Sbjct: 363 SKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFKDAAEIFEDMTNQGVRRDVLV 422

Query: 411 YNYFFRFFSKYGKIEEGMNLYNKMIE---SGYAPDKLTYHLLLKMLCEEERLNLAAQVCN 470
           YN        + + E  + L  +M +      +P+  TY  LLKM C ++++ L   + +
Sbjct: 423 YNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMCCHKKKMKLLGILLH 482

Query: 471 EMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRG 530
            M      +D++T  +L+  LC   K EEA   FE  + +G+VP+  T   L DE  K+ 
Sbjct: 483 HMVKNDVSIDVSTYILLIRGLCMSGKVEEACLFFEEAVRKGMVPRDSTCKMLVDELEKKN 542

Query: 531 LTKMASKLQEMMSS 541
           + +   K+Q ++ S
Sbjct: 543 MAEAKLKIQSLVQS 549

BLAST of Lsi04G021540 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 199.9 bits (507), Expect = 4.5e-51
Identity = 123/436 (28.21%), Positives = 215/436 (49.31%), Query Frame = 1

Query: 107 LEDALDRTGIVPSSSLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAK 166
           LE AL+ +G+     L+E V N    +    +  F+WA  +P +C S  ++  ++ +L+K
Sbjct: 100 LELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSK 159

Query: 167 SREFDSAWSLIIRRLRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISG 226
            R+F + W LI    +  E   L+  ++FV+L++R+A A M++ AI   +    +     
Sbjct: 160 MRQFGAVWGLIEEMRK--ENPQLIEPELFVVLVQRFASADMVKKAIEVLD---EMPKFGF 219

Query: 227 TNSEGLFEILLDSLCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHA 286
              E +F  LLD+LCK G V+ A++ F    +M   F  ++R +  LL GW R  K+  A
Sbjct: 220 EPDEYVFGCLLDALCKHGSVKDAAKLFE---DMRMRFPVNLRYFTSLLYGWCRVGKMMEA 279

Query: 287 ERLWFEMKKNKISPTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDA 346
           + +  +M +    P +V Y  L+ GY    ++  A +L+ +MR  G EPNA  Y  ++ A
Sbjct: 280 KYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQA 339

Query: 347 FGEAGRFKEALGMMERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPT 406
             +  R +EA+ +       E    + TY +L+ G+CK   +     +L  MI +G  P+
Sbjct: 340 LCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPS 399

Query: 407 PTTYNYFFRFFSKYGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCN 466
             TY +      K    EE + L  KM +  Y PD   Y++++++ C+   +  A ++ N
Sbjct: 400 ELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWN 459

Query: 467 EMKARGFDMDLATSTMLVHLLCKMHKFEEAFAEFEHMIHRGI--VPQYLTFCRLHDEFMK 526
           EM+  G    + T  ++++ L       EA   F+ M+ RG+  V QY T   L +  +K
Sbjct: 460 EMEENGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLK 519

Query: 527 RGLTKMASKLQEMMSS 541
               +MA  +   ++S
Sbjct: 520 DKKLEMAKDVWSCITS 527

BLAST of Lsi04G021540 vs. TAIR10
Match: AT1G71060.1 (AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 197.2 bits (500), Expect = 2.9e-50
Identity = 129/467 (27.62%), Positives = 221/467 (47.32%), Query Frame = 1

Query: 75  AIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSSSLLEAVFNHFDSSP 134
           +++ +  A+    D   I  IL+     + + +E  L+   +  S +L+E V     ++ 
Sbjct: 52  SVETQVSANDASQDAERICKILTKF---TDSKVETLLNEASVKLSPALIEEVLKKLSNAG 111

Query: 135 KFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRRLRGDEVSSLVSVDV 194
               S+F WAEN+ GF  + + +N LI  L K ++F   WSL+       +   L+S + 
Sbjct: 112 VLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDM----KAKKLLSKET 171

Query: 195 FVILIRRYARAGMLQPAIRTFEFACNLETISGTNSEGLFEILLDSLCKEGHVRVASEYFN 254
           F ++ RRYARA  ++ AI  F     +E          F  +LD+L K  +V  A + F+
Sbjct: 172 FALISRRYARARKVKEAIGAFH---KMEEFGFKMESSDFNRMLDTLSKSRNVGDAQKVFD 231

Query: 255 RKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISPTVVTYGTLVEGYCR 314
           + ++    FEP I++Y ILL GW +   L   + +  EMK     P VV YG ++  +C+
Sbjct: 232 KMKKKR--FEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYGIIINAHCK 291

Query: 315 MRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMMERFMVLEQGPTIST 374
            ++ E AI   +EM     +P+  I+  +++  G   +  +AL   ER           T
Sbjct: 292 AKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPT 351

Query: 375 YNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKYGKIEEGMNLYNKMI 434
           YN+L+  YC ++ +  A K +  M  +G  P   TY+       +  + +E   +Y  M 
Sbjct: 352 YNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM- 411

Query: 435 ESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATSTMLVHLLCKMHKFE 494
                P   TY ++++M C +ERL++A ++ +EMK +G    +   + L+  LC  +K +
Sbjct: 412 --SCEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLD 471

Query: 495 EAFAEFEHMIHRGIVPQYLTFCRLH----DEFMKRGLTKMASKLQEM 538
           EA   F  M+  GI P    F RL     DE  K  +T +  K+  +
Sbjct: 472 EACEYFNEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRL 503

BLAST of Lsi04G021540 vs. NCBI nr
Match: gi|449437378|ref|XP_004136469.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial [Cucumis sativus])

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 555/614 (90.39%), Positives = 576/614 (93.81%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MPI KP +L YR+LH+I+HNH S+FR SALQFSSLS HSWLSTPGKPL+KWPSLPDQ  N
Sbjct: 1   MPIHKPLILIYRILHSIQHNHPSSFRFSALQFSSLSPHSWLSTPGKPLVKWPSLPDQPAN 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
           PL SNSAVISNPN AID+KFEAS+ PNDLSTIS ILSDR VR GAALEDALDRTGIVPSS
Sbjct: 61  PLPSNSAVISNPNSAIDVKFEASYSPNDLSTISSILSDRSVRPGAALEDALDRTGIVPSS 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
           SLLEAVF+HFDSSPKFLHSLFLWA  K GF  S ALFN LINVLAKSREFDSAWSLI  R
Sbjct: 121 SLLEAVFDHFDSSPKFLHSLFLWAAKKSGFRPSAALFNRLINVLAKSREFDSAWSLITSR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSEGLFEILLDSL 240
           LRG E S LVSV+VFVILIRRYARAGM+QPAIRT+EFACNLETISGT SEGLFEILLDSL
Sbjct: 181 LRGGEESFLVSVEVFVILIRRYARAGMVQPAIRTYEFACNLETISGTGSEGLFEILLDSL 240

Query: 241 CKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISP 300
           CKEGHVRVASEYFNRKREM S FEPSIRAYNIL+NGWFRSRKLKHA+RLWFEMKKNKISP
Sbjct: 241 CKEGHVRVASEYFNRKREMGSSFEPSIRAYNILINGWFRSRKLKHAQRLWFEMKKNKISP 300

Query: 301 TVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMM 360
           TVVTYGTL+EGYCRMR VE AIELVDEMR EGIEPNAI+YNPIVDA GEAGRFKEALGMM
Sbjct: 301 TVVTYGTLIEGYCRMRSVEIAIELVDEMRREGIEPNAIVYNPIVDALGEAGRFKEALGMM 360

Query: 361 ERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKY 420
           ERFMVLEQGPTISTYNSL+KGYCKA DLSGASKILK MIGRGFTPTPTTYNYFFRFFSKY
Sbjct: 361 ERFMVLEQGPTISTYNSLVKGYCKAGDLSGASKILKMMIGRGFTPTPTTYNYFFRFFSKY 420

Query: 421 GKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATS 480
           GKIEE M+LYNKMIESGYAPDKLTYHLLLKMLCEEERLNLA QVCNEMKARGFDMDLATS
Sbjct: 421 GKIEESMSLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAVQVCNEMKARGFDMDLATS 480

Query: 481 TMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS 540
           TML+HLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS
Sbjct: 481 TMLMHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS 540

Query: 541 VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSAN 600
           VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSE+AVFSAN
Sbjct: 541 VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSEDAVFSAN 600

Query: 601 RLIDDIKKKANPVT 615
           +LIDDIKKKANP T
Sbjct: 601 KLIDDIKKKANPGT 614

BLAST of Lsi04G021540 vs. NCBI nr
Match: gi|659132770|ref|XP_008466375.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial [Cucumis melo])

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 557/614 (90.72%), Positives = 571/614 (93.00%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MPI KP LL YR+L+AI+HNH S+FR SALQFSSLSAHSWLS PGKPLIKWPSLPDQ  N
Sbjct: 1   MPIHKPSLLIYRILYAIQHNHPSSFRFSALQFSSLSAHSWLSMPGKPLIKWPSLPDQPAN 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
            L SNSAVISNPN AIDLKFEAS+ PNDLSTIS ILSDR VR GAALEDALDRTGIVPSS
Sbjct: 61  TLPSNSAVISNPNSAIDLKFEASYAPNDLSTISSILSDRTVRPGAALEDALDRTGIVPSS 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
           SLLEAVF+HFDSSPKFLHSLFLWA  K GF  S ALFN LINVLAKSREFDSAWSLI  R
Sbjct: 121 SLLEAVFDHFDSSPKFLHSLFLWAAKKSGFRPSAALFNRLINVLAKSREFDSAWSLITSR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSEGLFEILLDSL 240
           L G E S LVSV+V VILIRRYARAGMLQPAIRT+EFACNLETISGT SEGLFEILLDSL
Sbjct: 181 LLGGEESFLVSVEVLVILIRRYARAGMLQPAIRTYEFACNLETISGTGSEGLFEILLDSL 240

Query: 241 CKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKISP 300
           CKEGHVRVASEYFNRKREM S FEPSIRAYNILLNGWFRSRKLKHA+RLWFEMKKN ISP
Sbjct: 241 CKEGHVRVASEYFNRKREMGSSFEPSIRAYNILLNGWFRSRKLKHAQRLWFEMKKNNISP 300

Query: 301 TVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGMM 360
           TVVTYGTL+EGYCRMR VE AIELVDEMR EGIEPNAIIYNPIVDA GEAGRFKEALGMM
Sbjct: 301 TVVTYGTLIEGYCRMRSVEIAIELVDEMRREGIEPNAIIYNPIVDALGEAGRFKEALGMM 360

Query: 361 ERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSKY 420
           ERFMVLEQGPTISTYNSLIKGYCKA DLSGASKILK MIGRGFTPTPTTYNYFFRFFSKY
Sbjct: 361 ERFMVLEQGPTISTYNSLIKGYCKAGDLSGASKILKMMIGRGFTPTPTTYNYFFRFFSKY 420

Query: 421 GKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLATS 480
           GKIEE MNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLA QVCNEMKARGFDMDLATS
Sbjct: 421 GKIEESMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAVQVCNEMKARGFDMDLATS 480

Query: 481 TMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSS 540
           TML+HLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEF+KRGLTKMASKLQEMMSS
Sbjct: 481 TMLMHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFLKRGLTKMASKLQEMMSS 540

Query: 541 VPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSAN 600
           VPHSEKLPDTYNQTPDS+RARRTSIMRKAEAMSEMLKVC DPRELVKRRSPSENAVFSAN
Sbjct: 541 VPHSEKLPDTYNQTPDSVRARRTSIMRKAEAMSEMLKVCNDPRELVKRRSPSENAVFSAN 600

Query: 601 RLIDDIKKKANPVT 615
           +LIDDIKKKANP T
Sbjct: 601 KLIDDIKKKANPGT 614

BLAST of Lsi04G021540 vs. NCBI nr
Match: gi|645226047|ref|XP_008219858.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial [Prunus mume])

HSP 1 Score: 783.5 bits (2022), Expect = 2.7e-223
Identity = 400/612 (65.36%), Positives = 469/612 (76.63%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MP  K        L   + N     R  +L+F   S  SWLS  G P+IKWPS PD   +
Sbjct: 1   MPSHKTLHFLSLALLLFKPNPNPNLRALSLRF--FSDQSWLSVRGNPIIKWPSPPDVPCS 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
               N A   NPNP I      +F  ND  TI+ +L+D  +  G++L+ ALDRTGI P  
Sbjct: 61  LPHPNPA--PNPNPNIS---GPNFSQNDFFTIANLLADPSISPGSSLQSALDRTGIEPGP 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
            LL+AVF+HFDSSPK LH+LFLWA+ +PGF SS  LF C+INVLAKSREF+SAWSLI+ R
Sbjct: 121 CLLQAVFDHFDSSPKLLHTLFLWAQKRPGFRSSATLFGCMINVLAKSREFESAWSLILNR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDS 240
           +  DE   LVSVD FVI+IRRY+RAGM Q AIRTFEFA NL+    + SE  LFE+LLDS
Sbjct: 181 IGADEEPGLVSVDTFVIMIRRYSRAGMSQSAIRTFEFASNLDLFLNSESEMSLFEVLLDS 240

Query: 241 LCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKIS 300
            CKEG VRVASEYF+ KR++   + PS+R YNILLNGWFRSRKLK AERLW EMK++ + 
Sbjct: 241 FCKEGLVRVASEYFDMKRKLHPDWIPSVRVYNILLNGWFRSRKLKWAERLWVEMKRDNVK 300

Query: 301 PTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGM 360
           P+VVTYGTLVEGYCRMRR E AIELV EMR EGIEPNAI+YNPI+DA GEAG+FKEALGM
Sbjct: 301 PSVVTYGTLVEGYCRMRRAETAIELVSEMRSEGIEPNAIVYNPIIDALGEAGKFKEALGM 360

Query: 361 MERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSK 420
           ME F++LE GPTISTYNSL+KG+CKA DL GASKILK MI RG  PTPTTYNYFFR+FSK
Sbjct: 361 MEHFLILESGPTISTYNSLVKGFCKAGDLVGASKILKMMISRGCVPTPTTYNYFFRYFSK 420

Query: 421 YGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLAT 480
           +GKIEEGMNLY KMIESGY PD+LT+HLLLKMLC+E RL+LA QV  EM++RG DMDLAT
Sbjct: 421 FGKIEEGMNLYTKMIESGYTPDRLTFHLLLKMLCDEGRLDLAVQVSKEMRSRGLDMDLAT 480

Query: 481 STMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMS 540
           STML+HLLC MHKF+EAFAEFE MI RG+VPQYLTF R++DE  K+G+T+MA KL  MMS
Sbjct: 481 STMLIHLLCNMHKFKEAFAEFEDMIRRGLVPQYLTFQRMNDELRKQGMTEMAHKLCNMMS 540

Query: 541 SVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSA 600
           SVPHS  LP+TY +  D+  ARR SI+ KAEAMS++LK C DPRELVK RS  EN V  A
Sbjct: 541 SVPHSTNLPNTYVRERDASHARRKSIILKAEAMSDLLKTCSDPRELVKYRSSPENVVSRA 600

Query: 601 NRLIDDIKKKAN 612
           N+L++DIK+KAN
Sbjct: 601 NQLVEDIKRKAN 605

BLAST of Lsi04G021540 vs. NCBI nr
Match: gi|596127548|ref|XP_007222034.1| (hypothetical protein PRUPE_ppa003040mg [Prunus persica])

HSP 1 Score: 783.5 bits (2022), Expect = 2.7e-223
Identity = 399/612 (65.20%), Positives = 470/612 (76.80%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MP  K R      L   + N     R  +L+F   S  SWLS  G P+IKWPS PD   +
Sbjct: 1   MPSHKTRHFLSLALLLFKPNPNLNLRALSLRF--FSNQSWLSVRGNPIIKWPSPPDIPCS 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
               N A   NPNP        +F  ND STI+ +L+D  +  G++L+ ALDRTGI P  
Sbjct: 61  LPHPNPAPNPNPNPN---SSGPNFSQNDFSTIANVLADPSISPGSSLQSALDRTGIEPGP 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
            LL+AVF+HFDSSPK LH+LFLWAE +PGF SS  LF C+INVLAKSREF+SAWSLI+ R
Sbjct: 121 CLLQAVFDHFDSSPKLLHTLFLWAEKRPGFRSSATLFGCMINVLAKSREFESAWSLILNR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDS 240
           + GDE   LVSVD FVI+IRRY+RAGM Q AIRTFEFA NL++   + SE  LFE+LLDS
Sbjct: 181 IGGDEEPGLVSVDTFVIMIRRYSRAGMSQSAIRTFEFASNLDSFLNSESEMSLFEVLLDS 240

Query: 241 LCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKIS 300
           LCKEG VRVASEYF+ KR++   + PS+R YNILLNGWFRSRKLK AERLW EMK++ + 
Sbjct: 241 LCKEGLVRVASEYFDMKRKLHPDWIPSVRVYNILLNGWFRSRKLKRAERLWAEMKRDNVK 300

Query: 301 PTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGM 360
           P+VVTYGTL+EGYCRMRR E AIELV EMR EGIEPNAI+YN I+DA GEAG+FKEALGM
Sbjct: 301 PSVVTYGTLIEGYCRMRRAEIAIELVSEMRSEGIEPNAIVYNAIIDALGEAGKFKEALGM 360

Query: 361 MERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSK 420
           ME F+VLE GPTISTYNSL KG+CKA DL GASKILK MI +G  PTPTTYNYFFR+FSK
Sbjct: 361 MEHFLVLESGPTISTYNSLAKGFCKAGDLVGASKILKMMISKGCVPTPTTYNYFFRYFSK 420

Query: 421 YGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLAT 480
           +GKIEEGMNLY KMIESGY PD+LT+HLLLKMLC+E RL LA QV  EM++RG DMDLAT
Sbjct: 421 FGKIEEGMNLYTKMIESGYTPDRLTFHLLLKMLCDEGRLGLAVQVSKEMRSRGLDMDLAT 480

Query: 481 STMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMS 540
           STML+HLLC +HKF+EAFAEFE MI RG+VPQYLTF R++ E  K+G+T+MA K+  MMS
Sbjct: 481 STMLIHLLCNVHKFKEAFAEFEDMIRRGLVPQYLTFQRMNVELRKQGMTEMAHKMCNMMS 540

Query: 541 SVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSA 600
           SVPHS  LP+TY +  D+  ARR SI++KAEAMS++LK C DPRELVK RS  EN V  A
Sbjct: 541 SVPHSTNLPNTYVRERDASHARRKSIIQKAEAMSDLLKTCSDPRELVKYRSLPENVVSRA 600

Query: 601 NRLIDDIKKKAN 612
           N+L++DIK+KAN
Sbjct: 601 NQLVEDIKRKAN 607

BLAST of Lsi04G021540 vs. NCBI nr
Match: gi|694395195|ref|XP_009372943.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 775.0 bits (2000), Expect = 9.7e-221
Identity = 398/612 (65.03%), Positives = 467/612 (76.31%), Query Frame = 1

Query: 1   MPIQKPRLLFYRVLHAIEHNHRSTFRVSALQFSSLSAHSWLSTPGKPLIKWPSLPDQSGN 60
           MP   PR  F   L   + N        + +F   S  SWLS  G P+IKWPSLPD   +
Sbjct: 1   MPSYNPRQFFSLTLFLFKPNPNLHLPSRSRRF--FSNQSWLSVRGNPIIKWPSLPDVPCS 60

Query: 61  PLQSNSAVISNPNPAIDLKFEASFEPNDLSTISGILSDRGVRSGAALEDALDRTGIVPSS 120
               N +  +N NP+       SF P D S I+ +L+D  +  G++LE ALDRTGI P  
Sbjct: 61  LPHPNPSPNTNHNPSSG----PSFSPMDFSIIANLLTDPTISPGSSLESALDRTGIEPGP 120

Query: 121 SLLEAVFNHFDSSPKFLHSLFLWAENKPGFCSSKALFNCLINVLAKSREFDSAWSLIIRR 180
            L++AVF+HFDSSPK LH+LFLWAE +PGF SS  LF  +INVLAKSREFDSAWSLI+ R
Sbjct: 121 GLVQAVFDHFDSSPKLLHTLFLWAEKQPGFRSSAKLFGSMINVLAKSREFDSAWSLILNR 180

Query: 181 LRGDEVSSLVSVDVFVILIRRYARAGMLQPAIRTFEFACNLETISGTNSE-GLFEILLDS 240
           + GDE   LVS D FVI+IRRY RAGM + AIRTFEFA NL++   + +E  LFE+LLDS
Sbjct: 181 VGGDEGPGLVSADTFVIMIRRYTRAGMPESAIRTFEFASNLDSFLNSEAEMSLFEVLLDS 240

Query: 241 LCKEGHVRVASEYFNRKREMDSCFEPSIRAYNILLNGWFRSRKLKHAERLWFEMKKNKIS 300
           L KEG VRVASEYF+RKR++   + PS+R YNILLNGWFRSRKLK AERLW EM++  + 
Sbjct: 241 LSKEGLVRVASEYFDRKRKLHPNWIPSVRVYNILLNGWFRSRKLKQAERLWVEMRRENVK 300

Query: 301 PTVVTYGTLVEGYCRMRRVERAIELVDEMRGEGIEPNAIIYNPIVDAFGEAGRFKEALGM 360
           P+VVTYGTLVEGYCRMRR E AIELV EMR EG+EPNAI+YNPI+DA GEAGR KEALGM
Sbjct: 301 PSVVTYGTLVEGYCRMRRAEIAIELVSEMRREGVEPNAIVYNPIIDALGEAGRLKEALGM 360

Query: 361 MERFMVLEQGPTISTYNSLIKGYCKARDLSGASKILKSMIGRGFTPTPTTYNYFFRFFSK 420
           MERF+VLE GPTISTYNSL+KGYCKA DL GASKILK+MI RG  PTPTTYNYFFR+FSK
Sbjct: 361 MERFLVLESGPTISTYNSLVKGYCKAGDLVGASKILKTMISRGTAPTPTTYNYFFRYFSK 420

Query: 421 YGKIEEGMNLYNKMIESGYAPDKLTYHLLLKMLCEEERLNLAAQVCNEMKARGFDMDLAT 480
           +GKIEEGMNLY KMIESGY PD+LT+ LLLKMLCEEERL+LA QV  EM++RG DMDLAT
Sbjct: 421 HGKIEEGMNLYTKMIESGYTPDRLTFQLLLKMLCEEERLDLAIQVSKEMRSRGLDMDLAT 480

Query: 481 STMLVHLLCKMHKFEEAFAEFEHMIHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMS 540
           STML+HLLC MHK +EAF EFE MI RG+VPQY+TF R+ D   K+G+ +MA KL +MMS
Sbjct: 481 STMLIHLLCNMHKLKEAFEEFEDMIRRGLVPQYITFQRMDDVLRKQGMNQMARKLCKMMS 540

Query: 541 SVPHSEKLPDTYNQTPDSIRARRTSIMRKAEAMSEMLKVCKDPRELVKRRSPSENAVFSA 600
           SVPHS  LPDTY +  D+ RARR SI++KAEAMS++LK C DPRELVK R   EN V  A
Sbjct: 541 SVPHSTNLPDTYVRDADASRARRKSIVQKAEAMSDLLKTCSDPRELVKFRRSPENLVSIA 600

Query: 601 NRLIDDIKKKAN 612
           N+LI+DIK+KAN
Sbjct: 601 NQLIEDIKRKAN 606

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
PP375_ARATH   1.2e-173   53.15     Pentatricopeptide repeat-containing protein At5g11310, mitochondrial OS=Arabidop...
PP150_ARATH   2.0e-53    30.70     Pentatricopeptide repeat-containing protein At2g13420, mitochondrial OS=Arabidop...
PP129_ARATH   1.6e-50    29.59     Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop...
PP248_ARATH   8.1e-50    28.34     Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop...
PP447_ARATH   8.1e-50    28.21     Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th...

Match Name         E-value    Identity  Description
A0A0A0LEE0_CUCSA   0.0e+00    90.39     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G878910 PE=4 SV=1
M5XNZ3_PRUPE       1.9e-223   65.20     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003040mg PE=4 SV=1
W9SQG5_9ROSA       7.7e-217   66.78     Uncharacterized protein OS=Morus notabilis GN=L484_007369 PE=4 SV=1
B9H9Q3_POPTR       4.2e-215   66.49     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s26360g PE=4 SV=1
A0A061GPP0_THECC   1.8e-210   66.55     Pentatricopeptide repeat superfamily protein isoform 2 (Fragment) OS=Theobroma c...

Match Name    E-value    Identity  Description
AT5G11310.1   6.5e-175   53.15     Pentatricopeptide repeat (PPR) superfamily protein
AT1G77360.1   9.2e-52    29.59     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22670.1   4.5e-51    28.34     Pentatricopeptide repeat (PPR) superfamily protein
AT5G65820.1   4.5e-51    28.21     Pentatricopeptide repeat (PPR) superfamily protein
AT1G71060.1   2.9e-50    27.62     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                         E-value    Identity  Description
gi|449437378|ref|XP_004136469.1|   0.0e+00    90.39     PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial ...
gi|659132770|ref|XP_008466375.1|   0.0e+00    90.72     PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial ...
gi|645226047|ref|XP_008219858.1|   2.7e-223   65.36     PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial ...
gi|596127548|ref|XP_007222034.1|   2.7e-223   65.20     hypothetical protein PRUPE_ppa003040mg [Prunus persica]
gi|694395195|ref|XP_009372943.1|   9.7e-221   65.03     PREDICTED: pentatricopeptide repeat-containing protein At5g11310, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0050789 regulation of biological process
biological_process GO:0009845 seed germination
biological_process GO:0009788 negative regulation of abscisic acid-activated signaling pathway
biological_process GO:0010029 regulation of seed germination
cellular_component GO:0005575 cellular_component
cellular_component GO:0044424 intracellular part
cellular_component GO:0005829 cytosol
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi04G021540.1  Lsi04G021540.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 235..258  score: 1.1
    coord: 157..177  score:
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 332..361  score: 1.
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 265..314  score: 3.9E-13
    coord: 370..417  score: 1.6
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 464..516  score: 0.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 409..441  score: 2.9E-6
    coord: 444..476  score: 4.4E-4
    coord: 303..337  score: 3.5E-10
    coord: 269..302  score: 2.9E-7
    coord: 481..510  score: 0.0028
    coord: 374..406  score: 5.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 336..370  score: 8.659
    coord: 476..510  score: 10.49
    coord: 301..335  score: 13.406
    coord: 229..265  score: 6.675
    coord: 266..300  score: 10.939
    coord: 406..440  score: 10.961
    coord: 371..405  score: 12.617
    coord: 191..225  score: 5.974
    coord: 153..188  score: 7.695
    coord: 441..475  score: 9
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 199..366  score: 7.7E-7
    coord: 403..504  score: 7.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 44..552  score: 6.3E
None  No IPR available  PANTHER  PTHR24015:SF432  SUBFAMILY NOT NAMED
    coord: 44..552  score: 6.3E
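
The `coord:` entries in the InterPro table give the span of each domain hit on the protein. As a rough illustration (not part of the database output), the sketch below sorts the PROSITE profile PS51375 (PPR) hits listed above and computes each hit's length. It assumes the coordinates are 1-based, inclusive residue positions; the helper names `hit_spans` and `span_length` are hypothetical.

```python
import re

# coord entries of the PS51375 (PPR) rows, in the order they are listed above.
PS51375_HITS = """
coord: 336..370
coord: 476..510
coord: 301..335
coord: 229..265
coord: 266..300
coord: 406..440
coord: 371..405
coord: 191..225
coord: 153..188
coord: 441..475
"""

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def hit_spans(text):
    """Return (start, end) tuples sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


def span_length(span):
    """Length of a span, assuming both end points are inclusive."""
    start, end = span
    return end - start + 1


spans = hit_spans(PS51375_HITS)
lengths = [span_length(span) for span in spans]
for span, length in zip(spans, lengths):
    print(f"{span[0]}..{span[1]}  length {length}")
```

Under that assumption, eight of the ten profile hits are 35 residues long, and the hits from 266 to 510 sit back to back, which fits a tandem array of repeats.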

The following gene(s) are paralogous to this gene:

None