Lsi02G023820 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi02G023820
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat superfamily protein
Location: chr02 : 30498569 .. 30499663 (-)
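
As a quick consistency check on the location above, the span of the feature can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention, not stated on this page); the function name `feature_length` is hypothetical.

```python
# Coordinates copied from the Location field of this record.
# Assumption: positions are 1-based and inclusive at both ends.
start, end, strand = 30498569, 30499663, "-"


def feature_length(start: int, end: int) -> int:
    """Return the length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


length = feature_length(start, end)
# A feature made up only of coding sequence should be a whole number of codons.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)
```

Under that assumption the span is a whole number of codons, which is consistent with the gene, mRNA, and CDS sequences listed for this feature being identical.
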
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACAGAAACTCCATTTCCCCCAACAATTACACCTTCCCTTTCCTTCTCAAATCCTTAGCTGATTTCAACGACCTTGTGAGTGGACTATCTGTTCATACCCACGTTCTGAAATTGGGATATGTTTCTGATGTTTATGTCCAGAATTCTTTGATGGATGTGTATGCTTCGTGTCGGAAAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTTCTTTGATGTCTGATGATGCTTTGATTGCATTTGAAGGGATGCAATATGCAGGTGTGGAGCCTAATCGTGTGACGATGGTGAATGCATTGGCTGCTTGTGCAAACTTTGGTGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAGAAAAGGATGGGAAGTGGATTTGATTTTAGGGACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGTTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAAGGGCTGGCTTTGGCCAAGAGTGGAGAGGAAGCCATTGCTTGGTTTAAGAGAATGGATGAAGGAGGAGTTGAAGCAGATGAAGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCATTCTGGCTTGGTGGACATGGGCAGGCAGATCTTCCAATCGTTGATCGACAGGAGGTTCGGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGTAGATCTCTTGGCTCGTTATGGGTGTATTGAAGAGGCTTTTGTATTGATAAAGGATATGCCTTTTGAAGCCACCAAAGCAATGTGGGGTTCTTTGCTAGCTGGTAGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAAATTGCAGCAAGGAAGCTTGTTGAAATGGAACCAGAAAATGGTGCTTATTATGCTGTGTTATCTAATATTTATGCAGAGATGGAGAAATGGAGTGAGGTTGAGAAAGTGAGAGAGATCATGAAAGAGAAAGGACTGAAGAAGGATTTGGGGTCAAGTTCGGTTGAGCTTCAAGAAGCTGAAAAATGCTTATGA

mRNA sequence

ATGAACAGAAACTCCATTTCCCCCAACAATTACACCTTCCCTTTCCTTCTCAAATCCTTAGCTGATTTCAACGACCTTGTGAGTGGACTATCTGTTCATACCCACGTTCTGAAATTGGGATATGTTTCTGATGTTTATGTCCAGAATTCTTTGATGGATGTGTATGCTTCGTGTCGGAAAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTTCTTTGATGTCTGATGATGCTTTGATTGCATTTGAAGGGATGCAATATGCAGGTGTGGAGCCTAATCGTGTGACGATGGTGAATGCATTGGCTGCTTGTGCAAACTTTGGTGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAGAAAAGGATGGGAAGTGGATTTGATTTTAGGGACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGTTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAAGGGCTGGCTTTGGCCAAGAGTGGAGAGGAAGCCATTGCTTGGTTTAAGAGAATGGATGAAGGAGGAGTTGAAGCAGATGAAGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCATTCTGGCTTGGTGGACATGGGCAGGCAGATCTTCCAATCGTTGATCGACAGGAGGTTCGGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGTAGATCTCTTGGCTCGTTATGGGTGTATTGAAGAGGCTTTTGTATTGATAAAGGATATGCCTTTTGAAGCCACCAAAGCAATGTGGGGTTCTTTGCTAGCTGGTAGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAAATTGCAGCAAGGAAGCTTGTTGAAATGGAACCAGAAAATGGTGCTTATTATGCTGTGTTATCTAATATTTATGCAGAGATGGAGAAATGGAGTGAGGTTGAGAAAGTGAGAGAGATCATGAAAGAGAAAGGACTGAAGAAGGATTTGGGGTCAAGTTCGGTTGAGCTTCAAGAAGCTGAAAAATGCTTATGA

Coding sequence (CDS)

ATGAACAGAAACTCCATTTCCCCCAACAATTACACCTTCCCTTTCCTTCTCAAATCCTTAGCTGATTTCAACGACCTTGTGAGTGGACTATCTGTTCATACCCACGTTCTGAAATTGGGATATGTTTCTGATGTTTATGTCCAGAATTCTTTGATGGATGTGTATGCTTCGTGTCGGAAAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTTCTTTGATGTCTGATGATGCTTTGATTGCATTTGAAGGGATGCAATATGCAGGTGTGGAGCCTAATCGTGTGACGATGGTGAATGCATTGGCTGCTTGTGCAAACTTTGGTGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAGAAAAGGATGGGAAGTGGATTTGATTTTAGGGACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGTTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAAGGGCTGGCTTTGGCCAAGAGTGGAGAGGAAGCCATTGCTTGGTTTAAGAGAATGGATGAAGGAGGAGTTGAAGCAGATGAAGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCATTCTGGCTTGGTGGACATGGGCAGGCAGATCTTCCAATCGTTGATCGACAGGAGGTTCGGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGTAGATCTCTTGGCTCGTTATGGGTGTATTGAAGAGGCTTTTGTATTGATAAAGGATATGCCTTTTGAAGCCACCAAAGCAATGTGGGGTTCTTTGCTAGCTGGTAGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAAATTGCAGCAAGGAAGCTTGTTGAAATGGAACCAGAAAATGGTGCTTATTATGCTGTGTTATCTAATATTTATGCAGAGATGGAGAAATGGAGTGAGGTTGAGAAAGTGAGAGAGATCATGAAAGAGAAAGGACTGAAGAAGGATTTGGGGTCAAGTTCGGTTGAGCTTCAAGAAGCTGAAAAATGCTTATGA

Protein sequence

MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRKMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEAEKCL
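
The protein sequence above can be related back to the coding sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and only the first 36 and last 18 bases of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 and last 18 bases of the CDS listed in this record.
cds_start = "ATGAACAGAAACTCCATTTCCCCCAACAATTACACC"
cds_end = "GCTGAAAAATGCTTATGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues before the stop codon
```

The two fragments translate to the first twelve and last five residues of the protein sequence shown above. Passing the full CDS to `translate` would work the same way.
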
BLAST of Lsi02G023820 vs. Swiss-Prot
Match: PPR75_ARATH (Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana GN=PCMP-E42 PE=2 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 2.4e-73
Identity = 142/359 (39.55%), Positives = 215/359 (59.89%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVS-DVYVQNSLMDVYASCR 60
           M +  ++ N  T   +LK+     D+  G SVH   L+ G V  DV++ +SL+D+Y  C 
Sbjct: 195 MKKTGVAANEMTVVSVLKAAGKVEDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCS 254

Query: 61  KMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALA 120
                +KVFDEMP R+VV+WT LI GY  S   D  ++ FE M  + V PN  T+ + L+
Sbjct: 255 CYDDAQKVFDEMPSRNVVTWTALIAGYVQSRCFDKGMLVFEEMLKSDVAPNEKTLSSVLS 314

Query: 121 ACANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYT 180
           ACA+ GA+  G  +H ++ +   E++   GT+LID+Y KCG ++E ++VF+ + EKNVYT
Sbjct: 315 ACAHVGALHRGRRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYT 374

Query: 181 WNALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLI 240
           W A+I G A      +A   F  M    V  +EVT +AVL AC+H GLV+ GR++F S+ 
Sbjct: 375 WTAMINGFAAHGYARDAFDLFYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSM- 434

Query: 241 DRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEV 300
             RF   P   HY+CMVDL  R G +EEA  LI+ MP E T  +WG+L      H   E+
Sbjct: 435 KGRFNMEPKADHYACMVDLFGRKGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYEL 494

Query: 301 SEIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQ 359
            + AA ++++++P +   Y +L+N+Y+E + W EV +VR+ MK++ + K  G S +E++
Sbjct: 495 GKYAASRVIKLQPSHSGRYTLLANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWIEVK 552
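
The percentages in the HSP summary are the match counts divided by the alignment length. As a small sketch (the helper name `percent` is hypothetical), the figures for the hit above can be reproduced from the two fractions on its Identity line:

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format a match count as a percentage of the alignment length, two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# Counts copied from the PPR75_ARATH hit: 142 identical and 215
# positive-scoring positions over 359 aligned columns.
identity = percent(142, 359)
positives = percent(215, 359)
print(identity, positives)
```

The same calculation applies to every hit in this report. The denominator is the number of aligned columns including gaps, so it can exceed the length of the aligned query segment.
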

BLAST of Lsi02G023820 vs. Swiss-Prot
Match: PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 275.8 bits (704), Expect = 6.8e-73
Identity = 150/390 (38.46%), Positives = 221/390 (56.67%), Query Frame = 1

Query: 8   PNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASC------RKM 67
           P+ +TFPF+LK     +D+  G  +H  V+  G+ S V+V   L+ +Y SC      RKM
Sbjct: 114 PDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARKM 173

Query: 68  GLCKKVFDEMPQRDV---------------------------------VSWTVLIMGYRV 127
                 FDEM  +DV                                 VSWT +I GY  
Sbjct: 174 ------FDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEMMPCWVRNEVSWTCVISGYAK 233

Query: 128 SLMSDDALIAFEGMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLIL 187
           S  + +A+  F+ M    VEP+ VT++  L+ACA+ G++E+G  I  +V  +G    + L
Sbjct: 234 SGRASEAIEVFQRMLMENVEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSL 293

Query: 188 GTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGV 247
             ++IDMY K G I + L VF+ + E+NV TW  +I GLA    G EA+A F RM + GV
Sbjct: 294 NNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGV 353

Query: 248 EADEVTLVAVLCACSHSGLVDMGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEA 307
             ++VT +A+L ACSH G VD+G+++F S+   ++G  P I+HY CM+DLL R G + EA
Sbjct: 354 RPNDVTFIAILSACSHVGWVDLGKRLFNSM-RSKYGIHPNIEHYGCMIDLLGRAGKLREA 413

Query: 308 FVLIKDMPFEATKAMWGSLLAGSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEM 359
             +IK MPF+A  A+WGSLLA S  H  LE+ E A  +L+++EP N   Y +L+N+Y+ +
Sbjct: 414 DEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNL 473

BLAST of Lsi02G023820 vs. Swiss-Prot
Match: PP169_ARATH (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 1.2e-72
Identity = 125/308 (40.58%), Positives = 199/308 (64.61%), Query Frame = 1

Query: 50  SLMDVYASCRKMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEP 109
           +++  YA C  + + +K+FD+M ++DVV W  +I G   +    DAL  F+ MQ +  +P
Sbjct: 328 TMISGYARCGLLDVSRKLFDDMEEKDVVLWNAMIGGSVQAKRGQDALALFQEMQTSNTKP 387

Query: 110 NRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVF 169
           + +TM++ L+AC+  GA+++G+WIH ++++    +++ LGTSL+DMY KCG I E L VF
Sbjct: 388 DEITMIHCLSACSQLGALDVGIWIHRYIEKYSLSLNVALGTSLVDMYAKCGNISEALSVF 447

Query: 170 QAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVD 229
             ++ +N  T+ A+I GLAL      AI++F  M + G+  DE+T + +L AC H G++ 
Sbjct: 448 HGIQTRNSLTYTAIIGGLALHGDASTAISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQ 507

Query: 230 MGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLA 289
            GR  F S +  RF  +P +KHYS MVDLL R G +EEA  L++ MP EA  A+WG+LL 
Sbjct: 508 TGRDYF-SQMKSRFNLNPQLKHYSIMVDLLGRAGLLEEADRLMESMPMEADAAVWGALLF 567

Query: 290 GSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKD 349
           G R HG++E+ E AA+KL+E++P +   Y +L  +Y E   W + ++ R +M E+G++K 
Sbjct: 568 GCRMHGNVELGEKAAKKLLELDPSDSGIYVLLDGMYGEANMWEDAKRARRMMNERGVEKI 627

Query: 350 LGSSSVEL 358
            G SS+E+
Sbjct: 628 PGCSSIEV 634

BLAST of Lsi02G023820 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 1.5e-72
Identity = 143/386 (37.05%), Positives = 221/386 (57.25%), Query Frame = 1

Query: 4   NSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYA------- 63
           +S   N YTFP LLK+ ++ +       +H  + KLGY +DVY  NSL++ YA       
Sbjct: 109 SSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKL 168

Query: 64  ------------------------SCRKMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMS 123
                                      KM +   +F +M +++ +SWT +I GY  + M+
Sbjct: 169 AHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMN 228

Query: 124 DDALIAFEGMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLILGTSL 183
            +AL  F  MQ + VEP+ V++ NAL+ACA  GA+E G WIH ++ +    +D +LG  L
Sbjct: 229 KEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVL 288

Query: 184 IDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGVEADE 243
           IDMY KCG ++E L VF+ +K+K+V  W ALI G A    G EAI+ F  M + G++ + 
Sbjct: 289 IDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNV 348

Query: 244 VTLVAVLCACSHSGLVDMGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLI 303
           +T  AVL ACS++GLV+ G+ IF S+ +R +   P I+HY C+VDLL R G ++EA   I
Sbjct: 349 ITFTAVLTACSYTGLVEEGKLIFYSM-ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFI 408

Query: 304 KDMPFEATKAMWGSLLAGSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEMEKWS 359
           ++MP +    +WG+LL   R H ++E+ E     L+ ++P +G  Y   +NI+A  +KW 
Sbjct: 409 QEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWD 468

BLAST of Lsi02G023820 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 270.4 bits (690), Expect = 2.9e-71
Identity = 131/353 (37.11%), Positives = 223/353 (63.17%), Query Frame = 1

Query: 6   ISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRKMGLCK 65
           + P+ +T+PFL+K++    D+  G ++H+ V++ G+ S +YVQNSL+ +YA+C  +    
Sbjct: 117 VEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAY 176

Query: 66  KVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAACANFG 125
           KVFD+MP++D+V+W  +I G+  +   ++AL  +  M   G++P+  T+V+ L+ACA  G
Sbjct: 177 KVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIG 236

Query: 126 AIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIK 185
           A+ +G  +H ++ + G   +L     L+D+Y +CGR++E   +F  M +KN  +W +LI 
Sbjct: 237 ALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIV 296

Query: 186 GLALAKSGEEAIAWFKRMDE-GGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLIDRRFG 245
           GLA+   G+EAI  FK M+   G+   E+T V +L ACSH G+V  G + F+ + +  + 
Sbjct: 297 GLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMRE-EYK 356

Query: 246 FSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVSEIAA 305
             P I+H+ CMVDLLAR G +++A+  IK MP +    +W +LL     HG  +++E A 
Sbjct: 357 IEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFAR 416

Query: 306 RKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVEL 358
            +++++EP +   Y +LSN+YA  ++WS+V+K+R+ M   G+KK  G S VE+
Sbjct: 417 IQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEV 468

BLAST of Lsi02G023820 vs. TrEMBL
Match: A0A0A0KAU3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G395830 PE=4 SV=1)

HSP 1 Score: 619.0 bits (1595), Expect = 3.7e-174
Identity = 306/362 (84.53%), Positives = 326/362 (90.06%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           MNRNSISPNNYTFPF+LKSLADF DLV G SVHTHV+K G+ SD+YVQN+LMDVYASC K
Sbjct: 101 MNRNSISPNNYTFPFVLKSLADFKDLVGGQSVHTHVVKSGHASDLYVQNTLMDVYASCGK 160

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           MGLCKKVFDEM   DVVSWT+LIMGYRVS M DDALI FE MQYAGV+PNRVT+VNALAA
Sbjct: 161 MGLCKKVFDEMLHTDVVSWTILIMGYRVSFMLDDALIVFEQMQYAGVDPNRVTIVNALAA 220

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+FGAIEMGVWIHEFVK K WEVD++LGT+LIDMYGKCGRIKE L VFQAMKEKNVYTW
Sbjct: 221 CASFGAIEMGVWIHEFVKTKRWEVDVVLGTALIDMYGKCGRIKEALAVFQAMKEKNVYTW 280

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           N  I GLA AK GEEAIAWFKRMDE GVEAD+VTLVAVL ACSHSGLV+ GRQIF SLI 
Sbjct: 281 NVFINGLASAKCGEEAIAWFKRMDEEGVEADDVTLVAVLSACSHSGLVNSGRQIFWSLIH 340

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            RFGFSPGIKHYSCMVD+LAR GCIEEA V+IKDMPFEAT++MWGSLL GSRAHGSLEVS
Sbjct: 341 GRFGFSPGIKHYSCMVDILARNGCIEEACVMIKDMPFEATRSMWGSLLTGSRAHGSLEVS 400

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           EIAAR+LVEMEPENG YY VLSNIYAEM KWSEVEKVREIMKE+GLKKDLGSSSVELQE 
Sbjct: 401 EIAARRLVEMEPENGGYYVVLSNIYAEMGKWSEVEKVREIMKERGLKKDLGSSSVELQEV 460

Query: 361 EK 363
            K
Sbjct: 461 GK 462

BLAST of Lsi02G023820 vs. TrEMBL
Match: W9QLW6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_009077 PE=4 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 6.5e-147
Identity = 248/361 (68.70%), Positives = 306/361 (84.76%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M + ++ PNNYTFPFLLKSL+D  +L     VHTHV+KLG++ D+YV+NSL+DVYASC  
Sbjct: 1   MQKTNVLPNNYTFPFLLKSLSDSRELKHAQCVHTHVVKLGHLGDIYVRNSLLDVYASCGH 60

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           +G C+KVFDEMP RDVVSWTVLIMGYR     +DAL+ FE MQYAGV PNRVTMVNALAA
Sbjct: 61  VGSCRKVFDEMPYRDVVSWTVLIMGYRNCGRYEDALVVFERMQYAGVVPNRVTMVNALAA 120

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CAN GA+EMGVWIH+FV+R+G E+D+ LGTSLIDMYGKCGRI+EGL VF++MKE+N +TW
Sbjct: 121 CANSGALEMGVWIHDFVRREGLELDVRLGTSLIDMYGKCGRIEEGLAVFRSMKERNSFTW 180

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           NA+I+GLALAKSG+EA+ WF RM++ G  ADEVTLVAVLCACSHSG+VD+GRQIF +L+D
Sbjct: 181 NAVIQGLALAKSGKEAVWWFNRMEQEGFRADEVTLVAVLCACSHSGIVDVGRQIFDALVD 240

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++GFSP +KHY+CMVD+L R GC+EEAF  +K MP E TK +WGSLLAG RAHG+L++S
Sbjct: 241 GKYGFSPSVKHYACMVDILTRAGCLEEAFKCVKVMPHEPTKTIWGSLLAGGRAHGNLDLS 300

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E AA KL+E+EPEN AY+ +LSN+YAEM +W +VEKVR +MK++GLKKDLG SSVEL+  
Sbjct: 301 EFAAWKLIELEPENAAYHVMLSNMYAEMGRWDDVEKVRGMMKDRGLKKDLGCSSVELEPN 360

Query: 361 E 362
           E
Sbjct: 361 E 361

BLAST of Lsi02G023820 vs. TrEMBL
Match: V4U0E7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020732mg PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 9.4e-146
Identity = 248/361 (68.70%), Positives = 301/361 (83.38%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M RN + PNNYTFPF+LKS +DF +   G ++HTHV+K G+++D+YVQNSL+++YASC  
Sbjct: 1   MLRNYVLPNNYTFPFVLKSSSDFMEFKQGQAIHTHVVKFGHLNDIYVQNSLLNLYASCHD 60

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           M  C+KVFDEM QRDVVSWTVLIMGYR   M DDALI+FE MQYAGVEPNRVTMVNALAA
Sbjct: 61  MDQCRKVFDEMTQRDVVSWTVLIMGYRSVKMYDDALISFEQMQYAGVEPNRVTMVNALAA 120

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+FGA EMGVWIH+ ++RKGW+VD+ILGT+LIDMYGKCGRI++GL VFQ MKEKNV+TW
Sbjct: 121 CASFGAAEMGVWIHDSIRRKGWKVDVILGTALIDMYGKCGRIEQGLNVFQGMKEKNVFTW 180

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           NA+IKGLALAKSG EA+ WF +M + G + DEVTLV+VLCAC HSGLVD+G++IF S+I 
Sbjct: 181 NAVIKGLALAKSGHEAVLWFNKMKQEGFKPDEVTLVSVLCACGHSGLVDVGQEIFSSMIH 240

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++GFSPG  HY+CM+DL AR G IE AF LI +MPFE TK MWGSLLAG R   S E+S
Sbjct: 241 GKYGFSPGKNHYACMIDLFARAGYIENAFKLINEMPFEPTKTMWGSLLAGCRDQRSFELS 300

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E  A+KL+E+EP+N AYY +LSN+YAEM +WS+VE+VRE MK +GLKKDLGSS VEL+  
Sbjct: 301 EFVAKKLLELEPDNSAYYIMLSNLYAEMGRWSDVERVRESMKGRGLKKDLGSSYVELEPQ 360

Query: 361 E 362
           E
Sbjct: 361 E 361

BLAST of Lsi02G023820 vs. TrEMBL
Match: A0A061DRG0_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_004596 PE=4 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 8.8e-144
Identity = 242/362 (66.85%), Positives = 302/362 (83.43%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M   S+ PNNYTFPFLLKSL+DF+ L+ G  V THV+KLG+  D+Y+QNSLM++YAS  +
Sbjct: 85  MRNASVMPNNYTFPFLLKSLSDFHQLLKGQMVQTHVIKLGHSHDIYIQNSLMNLYASSGE 144

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           MGLC++VFDE+P++DVVSWTVLI GYR     DDALIAFE MQYAGV PNRVTMVNALAA
Sbjct: 145 MGLCRQVFDEIPEKDVVSWTVLITGYRNDKKYDDALIAFEQMQYAGVVPNRVTMVNALAA 204

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           C +FGA EMGVWIH+F+ +KGWE+DLILGT+LIDMYGKCGRI+EGL VF  MKEKN +TW
Sbjct: 205 CGSFGATEMGVWIHDFITKKGWELDLILGTALIDMYGKCGRIEEGLRVFHNMKEKNNFTW 264

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           NA+I GLALAK+GE+A+ WF RM++ G + D+VTLV VLCACS SGLVD GR+IF  L++
Sbjct: 265 NAVINGLALAKNGEQAVWWFYRMEQEGFKVDDVTLVGVLCACSLSGLVDTGRKIFSFLVE 324

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            R+GF PG+KHY+C++DLL R G +++AF  I+DMPFE T+++WGSLLAG R HG+LE+S
Sbjct: 325 GRYGFLPGVKHYACIIDLLTRAGFLDDAFRFIQDMPFEPTRSIWGSLLAGCRTHGNLELS 384

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E AA+KLVE+EP N AYY VLSN+YA+M +W + EKVR +MKE+GLKKDLG SSV+L+  
Sbjct: 385 EFAAKKLVELEPANSAYYVVLSNLYADMGRWDDAEKVRALMKERGLKKDLGCSSVDLEPQ 444

Query: 361 EK 363
           E+
Sbjct: 445 EQ 446

BLAST of Lsi02G023820 vs. TrEMBL
Match: A0A067K793_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13896 PE=4 SV=1)

HSP 1 Score: 513.8 bits (1322), Expect = 1.7e-142
Identity = 243/356 (68.26%), Positives = 297/356 (83.43%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           MN NSI PNNYTFPFLLKSL+DF D   G  VHTHV+KLG ++D+YVQNSL++VYASC +
Sbjct: 103 MNNNSIPPNNYTFPFLLKSLSDFKDFNQGQCVHTHVIKLGQLNDIYVQNSLLNVYASCGR 162

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           + LC+K+FDEMP+RDVVSWTVLIMGYR +L   DALIAFE MQY GV PNRVT+VN L A
Sbjct: 163 VALCRKLFDEMPERDVVSWTVLIMGYRDALNYADALIAFEQMQYEGVVPNRVTIVNVLGA 222

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+ GAIEMGVWIH+F+++ GWE+D+ILGTSLIDMY KCGRI EGL VF++MKEKN++TW
Sbjct: 223 CASLGAIEMGVWIHDFIRKNGWELDVILGTSLIDMYVKCGRIHEGLNVFKSMKEKNIFTW 282

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           N++I GLA AK G+EA+ +FKRM++ GV  DEVTLV VL ACSHSGLVD G+ IF SL+ 
Sbjct: 283 NSVINGLAFAKCGKEAVLFFKRMEQEGVNLDEVTLVNVLSACSHSGLVDTGQHIFSSLVV 342

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++GFSP  KHY+CMVDL AR G ++ AF +IK++PFE TK+MWGSLL G R H +LE+S
Sbjct: 343 GKYGFSPNAKHYACMVDLFARAGHLDYAFKIIKEIPFEPTKSMWGSLLTGCRVHRNLELS 402

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVE 357
           E AARKLVE+EP N A+Y VLSN+Y+EM +WS+  ++RE+MKEKGLKKD GSSSVE
Sbjct: 403 EFAARKLVELEPGNSAHYVVLSNLYSEMGRWSDAAEIRELMKEKGLKKDSGSSSVE 458

BLAST of Lsi02G023820 vs. TAIR10
Match: AT1G50270.1 (AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 277.3 bits (708), Expect = 1.3e-74
Identity = 142/359 (39.55%), Positives = 215/359 (59.89%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVS-DVYVQNSLMDVYASCR 60
           M +  ++ N  T   +LK+     D+  G SVH   L+ G V  DV++ +SL+D+Y  C 
Sbjct: 195 MKKTGVAANEMTVVSVLKAAGKVEDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCS 254

Query: 61  KMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALA 120
                +KVFDEMP R+VV+WT LI GY  S   D  ++ FE M  + V PN  T+ + L+
Sbjct: 255 CYDDAQKVFDEMPSRNVVTWTALIAGYVQSRCFDKGMLVFEEMLKSDVAPNEKTLSSVLS 314

Query: 121 ACANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYT 180
           ACA+ GA+  G  +H ++ +   E++   GT+LID+Y KCG ++E ++VF+ + EKNVYT
Sbjct: 315 ACAHVGALHRGRRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYT 374

Query: 181 WNALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLI 240
           W A+I G A      +A   F  M    V  +EVT +AVL AC+H GLV+ GR++F S+ 
Sbjct: 375 WTAMINGFAAHGYARDAFDLFYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSM- 434

Query: 241 DRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEV 300
             RF   P   HY+CMVDL  R G +EEA  LI+ MP E T  +WG+L      H   E+
Sbjct: 435 KGRFNMEPKADHYACMVDLFGRKGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYEL 494

Query: 301 SEIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQ 359
            + AA ++++++P +   Y +L+N+Y+E + W EV +VR+ MK++ + K  G S +E++
Sbjct: 495 GKYAASRVIKLQPSHSGRYTLLANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWIEVK 552

BLAST of Lsi02G023820 vs. TAIR10
Match: AT5G56310.1 (AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 275.8 bits (704), Expect = 3.9e-74
Identity = 150/390 (38.46%), Positives = 221/390 (56.67%), Query Frame = 1

Query: 8   PNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASC------RKM 67
           P+ +TFPF+LK     +D+  G  +H  V+  G+ S V+V   L+ +Y SC      RKM
Sbjct: 114 PDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARKM 173

Query: 68  GLCKKVFDEMPQRDV---------------------------------VSWTVLIMGYRV 127
                 FDEM  +DV                                 VSWT +I GY  
Sbjct: 174 ------FDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEMMPCWVRNEVSWTCVISGYAK 233

Query: 128 SLMSDDALIAFEGMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLIL 187
           S  + +A+  F+ M    VEP+ VT++  L+ACA+ G++E+G  I  +V  +G    + L
Sbjct: 234 SGRASEAIEVFQRMLMENVEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSL 293

Query: 188 GTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGV 247
             ++IDMY K G I + L VF+ + E+NV TW  +I GLA    G EA+A F RM + GV
Sbjct: 294 NNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGV 353

Query: 248 EADEVTLVAVLCACSHSGLVDMGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEA 307
             ++VT +A+L ACSH G VD+G+++F S+   ++G  P I+HY CM+DLL R G + EA
Sbjct: 354 RPNDVTFIAILSACSHVGWVDLGKRLFNSM-RSKYGIHPNIEHYGCMIDLLGRAGKLREA 413

Query: 308 FVLIKDMPFEATKAMWGSLLAGSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEM 359
             +IK MPF+A  A+WGSLLA S  H  LE+ E A  +L+++EP N   Y +L+N+Y+ +
Sbjct: 414 DEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNL 473

BLAST of Lsi02G023820 vs. TAIR10
Match: AT2G22410.1 (AT2G22410.1 SLOW GROWTH 1)

HSP 1 Score: 275.0 bits (702), Expect = 6.6e-74
Identity = 125/308 (40.58%), Positives = 199/308 (64.61%), Query Frame = 1

Query: 50  SLMDVYASCRKMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEP 109
           +++  YA C  + + +K+FD+M ++DVV W  +I G   +    DAL  F+ MQ +  +P
Sbjct: 328 TMISGYARCGLLDVSRKLFDDMEEKDVVLWNAMIGGSVQAKRGQDALALFQEMQTSNTKP 387

Query: 110 NRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVF 169
           + +TM++ L+AC+  GA+++G+WIH ++++    +++ LGTSL+DMY KCG I E L VF
Sbjct: 388 DEITMIHCLSACSQLGALDVGIWIHRYIEKYSLSLNVALGTSLVDMYAKCGNISEALSVF 447

Query: 170 QAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVD 229
             ++ +N  T+ A+I GLAL      AI++F  M + G+  DE+T + +L AC H G++ 
Sbjct: 448 HGIQTRNSLTYTAIIGGLALHGDASTAISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQ 507

Query: 230 MGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLA 289
            GR  F S +  RF  +P +KHYS MVDLL R G +EEA  L++ MP EA  A+WG+LL 
Sbjct: 508 TGRDYF-SQMKSRFNLNPQLKHYSIMVDLLGRAGLLEEADRLMESMPMEADAAVWGALLF 567

Query: 290 GSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKD 349
           G R HG++E+ E AA+KL+E++P +   Y +L  +Y E   W + ++ R +M E+G++K 
Sbjct: 568 GCRMHGNVELGEKAAKKLLELDPSDSGIYVLLDGMYGEANMWEDAKRARRMMNERGVEKI 627

Query: 350 LGSSSVEL 358
            G SS+E+
Sbjct: 628 PGCSSIEV 634

BLAST of Lsi02G023820 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 274.6 bits (701), Expect = 8.6e-74
Identity = 143/386 (37.05%), Positives = 221/386 (57.25%), Query Frame = 1

Query: 4   NSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYA------- 63
           +S   N YTFP LLK+ ++ +       +H  + KLGY +DVY  NSL++ YA       
Sbjct: 109 SSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKL 168

Query: 64  ------------------------SCRKMGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMS 123
                                      KM +   +F +M +++ +SWT +I GY  + M+
Sbjct: 169 AHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMN 228

Query: 124 DDALIAFEGMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKRKGWEVDLILGTSL 183
            +AL  F  MQ + VEP+ V++ NAL+ACA  GA+E G WIH ++ +    +D +LG  L
Sbjct: 229 KEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVL 288

Query: 184 IDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEGGVEADE 243
           IDMY KCG ++E L VF+ +K+K+V  W ALI G A    G EAI+ F  M + G++ + 
Sbjct: 289 IDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNV 348

Query: 244 VTLVAVLCACSHSGLVDMGRQIFQSLIDRRFGFSPGIKHYSCMVDLLARYGCIEEAFVLI 303
           +T  AVL ACS++GLV+ G+ IF S+ +R +   P I+HY C+VDLL R G ++EA   I
Sbjct: 349 ITFTAVLTACSYTGLVEEGKLIFYSM-ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFI 408

Query: 304 KDMPFEATKAMWGSLLAGSRAHGSLEVSEIAARKLVEMEPENGAYYAVLSNIYAEMEKWS 359
           ++MP +    +WG+LL   R H ++E+ E     L+ ++P +G  Y   +NI+A  +KW 
Sbjct: 409 QEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWD 468

BLAST of Lsi02G023820 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.4 bits (690), Expect = 1.6e-72
Identity = 131/353 (37.11%), Positives = 223/353 (63.17%), Query Frame = 1

Query: 6   ISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRKMGLCK 65
           + P+ +T+PFL+K++    D+  G ++H+ V++ G+ S +YVQNSL+ +YA+C  +    
Sbjct: 117 VEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAY 176

Query: 66  KVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAACANFG 125
           KVFD+MP++D+V+W  +I G+  +   ++AL  +  M   G++P+  T+V+ L+ACA  G
Sbjct: 177 KVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIG 236

Query: 126 AIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIK 185
           A+ +G  +H ++ + G   +L     L+D+Y +CGR++E   +F  M +KN  +W +LI 
Sbjct: 237 ALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIV 296

Query: 186 GLALAKSGEEAIAWFKRMDE-GGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLIDRRFG 245
           GLA+   G+EAI  FK M+   G+   E+T V +L ACSH G+V  G + F+ + +  + 
Sbjct: 297 GLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMRE-EYK 356

Query: 246 FSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVSEIAA 305
             P I+H+ CMVDLLAR G +++A+  IK MP +    +W +LL     HG  +++E A 
Sbjct: 357 IEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFAR 416

Query: 306 RKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVEL 358
            +++++EP +   Y +LSN+YA  ++WS+V+K+R+ M   G+KK  G S VE+
Sbjct: 417 IQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEV 468

BLAST of Lsi02G023820 vs. NCBI nr
Match: gi|659101438|ref|XP_008451605.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like [Cucumis melo])

HSP 1 Score: 644.0 bits (1660), Expect = 1.5e-181
Identity = 319/360 (88.61%), Positives = 337/360 (93.61%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           MNRNSISPNNYTFPF+LKSLADF DLVSG SVHTHV+KLG+ SD+YVQN+LMDVYASC K
Sbjct: 101 MNRNSISPNNYTFPFVLKSLADFKDLVSGQSVHTHVVKLGHDSDLYVQNTLMDVYASCGK 160

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           MGLCKKVFDEM QRDVVSWT+LIMGYRVSLM DDALI FE MQYAGVEPNRVT+VNALAA
Sbjct: 161 MGLCKKVFDEMLQRDVVSWTILIMGYRVSLMLDDALIVFEQMQYAGVEPNRVTIVNALAA 220

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+FGAIEMGVWIHEFVK K WEVD++LGT+LIDMYGKCGRIKE L VFQAMKEKNVYTW
Sbjct: 221 CASFGAIEMGVWIHEFVKTKRWEVDVVLGTALIDMYGKCGRIKEALAVFQAMKEKNVYTW 280

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           N LI GLALAKSGEEAIAWFKRMDE GVEAD+VTLVAVLCACSHSGLV+ GRQIF+SLI 
Sbjct: 281 NVLINGLALAKSGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVNSGRQIFRSLIH 340

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            RFGFSP IKHYSCMVD+LAR GCIEEAFV+IKDMPFEATKAMWGSLL GSRAHG+LEVS
Sbjct: 341 GRFGFSPEIKHYSCMVDILARNGCIEEAFVMIKDMPFEATKAMWGSLLTGSRAHGNLEVS 400

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           EIAARKLVEMEPENGAYY VLSNIYAEM KWSEVEKVREIMKE+GLKKDLGSSSVELQEA
Sbjct: 401 EIAARKLVEMEPENGAYYVVLSNIYAEMGKWSEVEKVREIMKERGLKKDLGSSSVELQEA 460

BLAST of Lsi02G023820 vs. NCBI nr
Match: gi|449436789|ref|XP_004136175.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like [Cucumis sativus])

HSP 1 Score: 619.0 bits (1595), Expect = 5.3e-174
Identity = 306/362 (84.53%), Positives = 326/362 (90.06%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           MNRNSISPNNYTFPF+LKSLADF DLV G SVHTHV+K G+ SD+YVQN+LMDVYASC K
Sbjct: 101 MNRNSISPNNYTFPFVLKSLADFKDLVGGQSVHTHVVKSGHASDLYVQNTLMDVYASCGK 160

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           MGLCKKVFDEM   DVVSWT+LIMGYRVS M DDALI FE MQYAGV+PNRVT+VNALAA
Sbjct: 161 MGLCKKVFDEMLHTDVVSWTILIMGYRVSFMLDDALIVFEQMQYAGVDPNRVTIVNALAA 220

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+FGAIEMGVWIHEFVK K WEVD++LGT+LIDMYGKCGRIKE L VFQAMKEKNVYTW
Sbjct: 221 CASFGAIEMGVWIHEFVKTKRWEVDVVLGTALIDMYGKCGRIKEALAVFQAMKEKNVYTW 280

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           N  I GLA AK GEEAIAWFKRMDE GVEAD+VTLVAVL ACSHSGLV+ GRQIF SLI 
Sbjct: 281 NVFINGLASAKCGEEAIAWFKRMDEEGVEADDVTLVAVLSACSHSGLVNSGRQIFWSLIH 340

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            RFGFSPGIKHYSCMVD+LAR GCIEEA V+IKDMPFEAT++MWGSLL GSRAHGSLEVS
Sbjct: 341 GRFGFSPGIKHYSCMVDILARNGCIEEACVMIKDMPFEATRSMWGSLLTGSRAHGSLEVS 400

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           EIAAR+LVEMEPENG YY VLSNIYAEM KWSEVEKVREIMKE+GLKKDLGSSSVELQE 
Sbjct: 401 EIAARRLVEMEPENGGYYVVLSNIYAEMGKWSEVEKVREIMKERGLKKDLGSSSVELQEV 460

Query: 361 EK 363
            K
Sbjct: 461 GK 462

BLAST of Lsi02G023820 vs. NCBI nr
Match: gi|225437286|ref|XP_002266871.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vinifera])

HSP 1 Score: 540.4 bits (1391), Expect = 2.4e-150
Identity = 256/362 (70.72%), Positives = 306/362 (84.53%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M+ NSI PNN+TFPFLLKSLADF  L  G  +HTHV+KLG   D+YVQNSL++VYASC  
Sbjct: 100 MHSNSILPNNFTFPFLLKSLADFKGLSEGQCIHTHVVKLGQFDDIYVQNSLLNVYASCGD 159

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           MGLC +VFDEMP RDVVSWTVLI GYR +   DDALIAFE MQYAGV PN VTMVNAL+A
Sbjct: 160 MGLCMRVFDEMPHRDVVSWTVLITGYRSAERYDDALIAFEQMQYAGVVPNHVTMVNALSA 219

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CA+FGA+EMGVWIHEF++R GWE D+ILGTSLIDMYGKCGRI+EGLVVF++MKEKNV+TW
Sbjct: 220 CADFGALEMGVWIHEFIRRSGWEFDVILGTSLIDMYGKCGRIEEGLVVFRSMKEKNVFTW 279

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           N+LIKGLALA+SG EA+ WF RM++ G++ADEVTL+AVLCACSHSG+V MGRQIF SL++
Sbjct: 280 NSLIKGLALARSGAEAVWWFYRMEQEGIKADEVTLIAVLCACSHSGMVQMGRQIFGSLMN 339

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++ F PG+KHY+C++DLLAR G ++EA  ++  MPFE  K MWG+ LAG RAHG LE+S
Sbjct: 340 GKYEFFPGVKHYACVIDLLARAGILQEAMEVMTRMPFEPNKVMWGAFLAGCRAHGDLELS 399

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E AARKLVE+EP NGAYY +LSNIYAEM +WS+VEKVR +MKE GL KDLG SS+EL+  
Sbjct: 400 EFAARKLVELEPGNGAYYVLLSNIYAEMGRWSDVEKVRRLMKEGGLTKDLGCSSIELEPQ 459

Query: 361 EK 363
           E+
Sbjct: 460 ER 461
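
The identity and positives percentages in an HSP summary line are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic follows. The helper names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool, and the pattern assumes the summary-line layout shown on this page (it also tolerates the "Postives" spelling).

```python
import re

# Assumed layout of the HSP summary line as it appears on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


def parse_hsp_stats(line):
    """Extract (matches, alignment length, reported %) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


# Summary line of the Vitis vinifera hit (XP_002266871.1) above.
line = "Identity = 256/362 (70.72%), Positives = 306/362 (84.53%), Query Frame = 1"
stats = parse_hsp_stats(line)

for name, (count, total, reported) in stats.items():
    # Recompute the percentage and compare it with the reported value.
    print(name, count, total, reported, percent(count, total))
```

The same check applies to the two Prunus mume hits (252/362 and 250/362 identical positions).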

BLAST of Lsi02G023820 vs. NCBI nr
Match: gi|645258752|ref|XP_008235032.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like [Prunus mume])

HSP 1 Score: 538.9 bits (1387), Expect = 6.9e-150
Identity = 252/362 (69.61%), Positives = 312/362 (86.19%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M+++SI PNN+TFPFLLKSLAD +D   G  +HTHVLKLG++ D+YVQNSL++VYASC +
Sbjct: 107 MHKSSILPNNFTFPFLLKSLADSHDFKQGQCLHTHVLKLGHLYDIYVQNSLLNVYASCGR 166

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           M  C++VFDEMPQRDVVSWTVLIMGYR S   DDALI+FE MQYAGV PN VTMVNALAA
Sbjct: 167 MEFCRQVFDEMPQRDVVSWTVLIMGYRNSENYDDALISFEQMQYAGVVPNHVTMVNALAA 226

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CANFGA+EMGVWIH+F++R  WE+D+ILGTSLIDMYGKCGRI+EGL VF +MKEKN ++W
Sbjct: 227 CANFGALEMGVWIHDFIRRSDWELDVILGTSLIDMYGKCGRIEEGLAVFNSMKEKNTFSW 286

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           NALIKGLALAK+G+E + WFKRM++ G+  DEVTLV+VL ACSHSGLVD+GRQIF+SL D
Sbjct: 287 NALIKGLALAKNGKETVGWFKRMEQEGIRVDEVTLVSVLNACSHSGLVDIGRQIFRSLSD 346

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++GF PG+KHY+CM+DLLAR G +E+A   +++MP+E TKA+WGSLLAG + HG+LE+S
Sbjct: 347 GKYGFLPGVKHYACMIDLLARSGYLEDALKCLREMPYEPTKAIWGSLLAGGKTHGNLELS 406

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E AARKLVE+EP N  YY +LSNIYAEM +W++VEKVR +MK+K LKKDLGSSSVE + +
Sbjct: 407 EFAARKLVELEPGNSTYYVLLSNIYAEMGRWNDVEKVRGMMKQKDLKKDLGSSSVEFEPS 466

Query: 361 EK 363
           ++
Sbjct: 467 DQ 468

BLAST of Lsi02G023820 vs. NCBI nr
Match: gi|645258815|ref|XP_008235062.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like, partial [Prunus mume])

HSP 1 Score: 529.6 bits (1363), Expect = 4.2e-147
Identity = 250/362 (69.06%), Positives = 309/362 (85.36%), Query Frame = 1

Query: 1   MNRNSISPNNYTFPFLLKSLADFNDLVSGLSVHTHVLKLGYVSDVYVQNSLMDVYASCRK 60
           M+++SI PNN+TFPFLLKSLAD +D   G  +HTHVLKLG++ D+YVQNSL++VYASC +
Sbjct: 51  MHKSSILPNNFTFPFLLKSLADSHDFKQGQCLHTHVLKLGHLYDIYVQNSLLNVYASCGR 110

Query: 61  MGLCKKVFDEMPQRDVVSWTVLIMGYRVSLMSDDALIAFEGMQYAGVEPNRVTMVNALAA 120
           M  C++VFDEMPQRDVVSWTVLIMGYR S   DDALI+FE MQYAGV PN VTMVNALAA
Sbjct: 111 MEFCRQVFDEMPQRDVVSWTVLIMGYRNSENYDDALISFEQMQYAGVVPNHVTMVNALAA 170

Query: 121 CANFGAIEMGVWIHEFVKRKGWEVDLILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTW 180
           CANFGA+EMGVWIH+F++R  WE+D+ILGTSLIDMYGKCGRI+EGL VF +MKEKN +TW
Sbjct: 171 CANFGALEMGVWIHDFIRRSDWELDVILGTSLIDMYGKCGRIEEGLAVFNSMKEKNTFTW 230

Query: 181 NALIKGLALAKSGEEAIAWFKRMDEGGVEADEVTLVAVLCACSHSGLVDMGRQIFQSLID 240
           NALIKGLALAK+G+E + WFKRM++ G+  DEVTLV+VL ACSHSGLVD+GRQIF+SL D
Sbjct: 231 NALIKGLALAKNGKETVGWFKRMEQEGIRVDEVTLVSVLNACSHSGLVDIGRQIFRSLSD 290

Query: 241 RRFGFSPGIKHYSCMVDLLARYGCIEEAFVLIKDMPFEATKAMWGSLLAGSRAHGSLEVS 300
            ++GF PG+KHY+CM+DLLAR G +E+A   +++MP+E TKA+WGSLLAG + HG+LE+S
Sbjct: 291 GKYGFLPGVKHYACMIDLLARSGYLEDALKCLREMPYEPTKAIWGSLLAGGKTHGNLELS 350

Query: 301 EIAARKLVEMEPENGAYYAVLSNIYAEMEKWSEVEKVREIMKEKGLKKDLGSSSVELQEA 360
           E AARKLVE+EP N AYY +LSNIYAEM +W++VEKVR +MK+    KDLG SSVE + +
Sbjct: 351 EFAARKLVELEPGNSAYYVLLSNIYAEMGRWNDVEKVRGMMKQ----KDLGCSSVEFEPS 408

Query: 361 EK 363
           ++
Sbjct: 411 DQ 408

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PPR75_ARATH  2.4e-73  39.55     Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana GN...
PP433_ARATH  6.8e-73  38.46     Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN...
PP169_ARATH  1.2e-72  40.58     Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
PP449_ARATH  1.5e-72  37.05     Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN...
PP330_ARATH  2.9e-71  37.11     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity  Description
A0A0A0KAU3_CUCSA  3.7e-174  84.53     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G395830 PE=4 SV=1
W9QLW6_9ROSA      6.5e-147  68.70     Uncharacterized protein OS=Morus notabilis GN=L484_009077 PE=4 SV=1
V4U0E7_9ROSI      9.4e-146  68.70     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020732mg PE=4 SV=1
A0A061DRG0_THECC  8.8e-144  66.85     Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_004596 PE...
A0A067K793_JATCU  1.7e-142  68.26     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13896 PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G50270.1  1.3e-74  39.55     Pentatricopeptide repeat (PPR) superfamily protein
AT5G56310.1  3.9e-74  38.46     Pentatricopeptide repeat (PPR) superfamily protein
AT2G22410.1  6.6e-74  40.58     SLOW GROWTH 1
AT5G66520.1  8.6e-74  37.05     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1  1.6e-72  37.11     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|659101438|ref|XP_008451605.1|  1.5e-181  88.61     PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like [Cucumis m...
gi|449436789|ref|XP_004136175.1|  5.3e-174  84.53     PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like [Cucumis s...
gi|225437286|ref|XP_002266871.1|  2.4e-150  70.72     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vin...
gi|645258752|ref|XP_008235032.1|  6.9e-150  69.61     PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like [Prunus mu...
gi|645258815|ref|XP_008235062.1|  4.2e-147  69.06     PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like, partial [...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi02G023820.1  Lsi02G023820.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 49..75, score: 0.0025
    coord: 77..107, score: 0.036
    coord: 251..275, score: 0
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 175..222, score: 2.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 178..212, score: 1.1E-6
    coord: 150..177, score: 2.7E-4
    coord: 47..77, score: 0.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 211..245, score: 7.443
    coord: 9..43, score: 5.963
    coord: 176..210, score: 10.676
    coord: 44..74, score: 7.772
    coord: 75..109, score: 8.78
    coord: 248..282, score: 7.103
    coord: 314..348, score: 7.947
    coord: 110..144, score: 6.358
    coord: 145..175, score:
IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 146..332, score: 1.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 1..355, score: 1.5E
None  No IPR available  PANTHER  PTHR24015:SF799  SUBFAMILY NOT NAMED
    coord: 1..355, score: 1.5E
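
The `coord:` entries above give start and end positions on the protein. As a rough illustration, the Python sketch below pulls the PS51375 (PPR profile) coordinates out of the text, orders them along the sequence, and computes each span's length. The helper names `parse_coords` and `span_lengths` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Matches entries such as "coord: 211..245".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) pairs found in the text, sorted by start position."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))


def span_lengths(spans):
    """Length of each span, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in spans]


# PS51375 hits copied from the InterPro table above (scores omitted).
ps51375 = (
    "coord: 211..245 coord: 9..43 coord: 176..210 coord: 44..74 "
    "coord: 75..109 coord: 248..282 coord: 314..348 coord: 110..144 "
    "coord: 145..175"
)

spans = parse_coords(ps51375)
lengths = span_lengths(spans)

for (start, end), length in zip(spans, lengths):
    print(start, end, length)
```

Under that assumption, seven of the nine profile hits span 35 residues and two span 31.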

The following gene(s) are paralogous to this gene:

None