Lsi11G008490 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi11G008490
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: R3H domain-containing protein 2
Location: chr11 : 10710465 .. 10715167 (-)
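
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation) and that the trailing sign is the strand; `Location`, `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name
    start: int   # leftmost coordinate
    end: int     # rightmost coordinate
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr11 : 10710465 .. 10715167 (-)'."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return loc.end - loc.start + 1


loc = parse_location("chr11 : 10710465 .. 10715167 (-)")
print(loc.seqid, loc.strand, span_length(loc))  # chr11 - 4703
```

Under that assumption the gene spans 4703 bp on the minus strand of chr11.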
The following sequences are available for this feature:

Gene sequence (with intron)

GCATAATCCTACATTGTACTCTTTTCATTTACATTTTCGATTTTTTCTCTTCTCCTTATGGATGATTATCTGACTCCCCAGGTGGAGGAGTTGGCCTTTCTTGTTAAGGACAACCTCCCCAGCAAGCATCTTATTCTATCCATGGAAGAAACCTTCATAAACTTTCTTCATGATGAAACCAGGTAATGTGTTGGATGATGGTTAGATGATGTTAGTTATTGAATTGGATAATGTTTTTGTGTTTTATTCCACCCAATGGTGATAGGAACAAGTGGAAGGTAATGAGCTCAAACAACAGTAGCTATCTTTCTAGATTTATTTTTTATTTTTTGGATAAGAAATAATTATATTGTTGGTATCAAATTATTACAAAAGTAATGGGAAGAACCCCGCTCTAAAAGAGTTACAAAAGATCTTCCCAGAACATAAACTATAATCATGAAGGAGAGGAGTGCATTTACACCAAGTATTTGTTTTTTTAAAAAGTAATTGTCAAATAAACAACAGCATTAAAAAACCATGTTAGAATAGCTATCTATTTTAGAATGTTAAAATATCCTAATGTGCTTTCTAAAGTCAAATGGTGTAGGATTAGGTGGTTGTCTAGTGAGATTAGTTGAATGGACATCAATCAAGGCTGAAACCTTATTTAGGAAACATCGAGACAATGCTGATAGTTTTAAAAAAGTTTCAACTTACGTCTAATGAGGAAAATGGGCTGCCCTTGCAATGCATTTTTCTAAGAACATATTGGAGCAGAAAGGTGGTCAATTTTTGTTATCTCCCAATAGGTGTTCAGCTAGCATAATCTTGAGGATAGTAACCACTCTTTATTCCTCTGTTTAGTGCATCAGTTCTTAACTCACGTTATTGTTCTTGCCCATTGGAAAATATCTTTAGATGTACATTGTTCTTTACCCATGGCTTGTATGGTGATCACTAACTTTTAATAATTTATTTGAATAGCAGTTCAGATGGAATCTTGGAGTTGGAACCAATGGATTCATACAACCGTCTTCTTTTGCATCGCCTTGCTGATATTTTTGGGTACTTTTTCCCCCTTCCACGCTTATTATTTGTTCTAGTTCATTGTAAGGTTGAATTTTTTATTTTCTTGTCAAACTTCATGCCAACCTTTCTAACTTCGTACTTCTTCCACTCCTCTGTCAGATTTGCCCACATATCAGTTGGTGAAGGTGCTAATAGACACTTGGTTTTGGAGCGATGCCCAGAGTCATCGATGTATGTGACCAATTCCAGATCTGACTTCCTTTTTGTTTAATATTTTGATAATGTAAATTGTTATCGATAGAACAATTTATATGGTCATCATTTGGTGCTGCAGTTCGCTTCTCCATGTATGCGCACCTGCCAATTTATTATCCATTTTCTTATTATTAAAAAAATTAGTTAATGAATTATTTTTTAGCATTTTATCTGATTTGATAATTTACAGAGAGAATGCATTCTTGGATCTACCATAAGAATTTTAACAATTCCTGGGTTTTTTTGTTGTTTTCTTATCGTTTTAAGGAAAGTATGAAAGGACACGTTGAGAAATTGTATAAAAACAGCACCCTCATTTTTCCTTTCTTATAAATGTCTTCTCGGTAGAAAAAAGTTTTATTTTGGTTCGGAATAAGGCTTTTGCTAAGAAGCACAGACACGGATACAAGACACCGATACGACACGGACACGGCGACACGCCATATTTTAAATATCTAGGACACGACACGGCAAGGACACGTTTATTAAAATATACATTTTTAAAAATATATATCATTTTCATACCAAAATATGATGCATTTATATGCTTAAAAGACTTACCTTGATGTATTTCACACTCAAAAGTTATTATTATTGTCATATATGTGTCTTTTTAGTCTACTCAACAAGTGTTCAATGCATATCTAACACATTTGTTGCGCTAACAAATGTCTGATGCGTGTCCAACAAGTGTCGAAGTGTCGAAGTGTCGGAGTGTCCGACACGTGTCGGAC
ACGGACACGCTAGCCAAACTAAAGTGTCTGTGCTTCTTAGGGCTTTTGAGATGTATTGTTATTTATACATTGAAATAATTGAATAAGCCTCTCCCTCATTCTTTTTTCCTCGAAGTCTTCTCCACAGAAAGAAGTTCTACTTTAGTTTGGAGTTAGGCTTTAGATCTCTGTTGTTATTCACACTTTAAAAATTTTTATTCAATCCAATGAACTCTGTCCTTTACTCCTTTTGATTGGTTCCACTAGAAATCCATCCTTGTTTGAAATTTAGTCTATACAGGCGGAACTGCAATTTGAGCTTTTTCTCTAGCCATCCTTGTATTCAAGTACAGACAGCATACCACCTTCTCTCTCCCTTGCTTTTGTATTCCAAGCTCTCAACTGCTTCCAGCTGCCCTCATTCCAAATTCTTCTTGTTTTCCTTCCATACAAGCTAATTAGAAAAACTTCCACACGTCCACTCAGAGGGCAAAGAGTTGGACTACGTAGATTAGCTATATGAAAAGATGGAATTCTTGTAATTCTTATCAAACAAGTTCTATTTATGACAAAAACCTTGGGGACGTCATTTGTTTCTTCATCATATTTCATGTATTGCTTAGCTATGAATATATTATGTGATCCAATCTTTCTTTCATCTTTTCAAAGAATGGGAAGAAACCAAATCTCATGATGTGGCTAAAAGTGAAACAAGGCCTCTTATATTTACAAACTGTGGAACCATAGGGGCATGGAAGGAAGGGGTGATACTAATAAAGGTCATTTGTTCATTCACTTTTTTGTGTATTGCAGACCGTCCATTCTTGTGAGTGATATTCTGTGGGAGTATGATGAACCTCAAATGTCAACGATACCACACCAACTATTAAGGAGAAAGGAAAATTCTTCTGGTATACTTTCCTTATTATTTCGTTACAGCATAATCAGTGCCTGCACTTTTTTCCAATTCATGCTATTGATTACAATGATCATCAGAAAGTTTACATGTGCACTTATAATGGACTTGAGGCCATTTGAAGATTGTAGAGGAGATACTATGAAGGTGTAATATTAATTTCATAATTTGTGGAAGTGCCCTGGCCCTACTGTGTTCTTTCTCTCTCTTCATTTTTAATGGCATCAGATGGATAAGAGTAGAATAAAAGGTCTTCTGAGTAGGATTTGATAGGGATTCTGGATCTGAAGTTTGAAGTTTAGTGTGAAAAGGACCTCTGGAATAGAATGAGTATGCAATTATCTCTTTTTAGGGGGAAAAAAAGGAAAATGAAAAAATGCTGTTCCAATTTGCATATATTTGAGGAAGTTTGTTTAATGCCAATTCATCATATTGAAAGTTCTCATTTGTTATGTTGAATGCCTATTCATTTTACTTGCAGATTTACTTTCAAAAAATTTATAAGAAACTGAGTTGCCTATACTTGATGTTCTATTTCAGCGAGTTCAACAAAAGCATCTCCTCAACGTTCACTTGAAGAGAGAGAAGCATCTTATCTGGCTGTCCGTGAGCGCATTTTCATGATGCACATGGGAGAAGACAGCGAACCCGTGAAGCCAAAGCCACGCTGTAATCCTGTGGTTGCACGCCGCATGATTGCACATGCACTGGGCCAAAGAATTAATTCATTTCCTGAGGATACTACTTGTCATTTCAAAGAGCAAGGAGGTATAGTGAACAATGCCTACATCCAAGCAAAAGATTCGAAGTTGCCTGATTCTACTGTGGAAGCTGTTAACAAAACCATTTCACGGTCAGATCAATGTGTGAACTCGAAGAATGAAGTGGATAAGAATTGCAATCCAGACGTGTCATTGGCAAAGGGAAGTAATGCTGTCAAAATGAAACCTGACAAGAGTTCTCCGAAGGCAAGTCGTGTCGACAATGAGTACTTGAAGAGAGAACATTTAGGAGCTGCGAAGAGGATGTTTTCACAGGCTTTAGGCAAGCACTGCCGAAAGAATGATTCTCTTCAAACTCGTGGGGAATCAGATTAAAATGGAAT
CAGGATTCAAAGATATCTGCAGTAGCTCTTGTTCTCTGGCATGATCTTTTGTCCTATTCACAATTTGTTGTGTTAGGATTGCTGATGCCAACTAAAAGATGAATGGGACTGTGTGATGACAAATTTGTCTACGTCGCATTGAACATGGTCGTCATCGGCGACGTGATCTGGGTGGATGAAACACCTATGGCTGCTAGTTTATGTTGGCAATTCAGTTGGCAAATGTGAAATCTAACATTGCATTCTCTAGCTTATATATTTCTTGATGTGCTCAGAATTCGATCTGGTCATTTGAGAAGCGCTGTTGAGGAGATTGATCTACGTGGTTCAAGCAGAGTCTTGAGAAAAAGGTTGTCGAAATGAATTGGGATTGTATCTGCTCACTGATGAGATGCGCATAGCGGTTTGTGTCGGGTTAAAGTAAAGGTTTCTAACTTAAATTCGTGTTAGTACTTGATTGTTATATCCTATCCTCTCTATTTTCTGCTGATGGCAAATAGCATCTCGTGGATCGTAATGAATGATTATTTACTGGACTCGACGATGTCGGGCAATACACTATGTTTTATGTGAAACTGTACTAATTTTAAGCTTTATTGTTCCCTTTGAAATCTTTGAACTATTGGCAAATACTTTCTTTTGAACGAAGGCCAAAGTGTTCGTGGATTTCAAGTATCTTTATTTTCTTATGGGGCTATAGAGC

mRNA sequence

GCATAATCCTACATTGTACTCTTTTCATTTACATTTTCGATTTTTTCTCTTCTCCTTATGGATGATTATCTGACTCCCCAGGTGGAGGAGTTGGCCTTTCTTGTTAAGGACAACCTCCCCAGCAAGCATCTTATTCTATCCATGGAAGAAACCTTCATAAACTTTCTTCATGATGAAACCAGTTCAGATGGAATCTTGGAGTTGGAACCAATGGATTCATACAACCGTCTTCTTTTGCATCGCCTTGCTGATATTTTTGGATTTGCCCACATATCAGTTGGTGAAGGTGCTAATAGACACTTGGTTTTGGAGCGATGCCCAGAGTCATCGATACCGTCCATTCTTGTGAGTGATATTCTGTGGGAGTATGATGAACCTCAAATGTCAACGATACCACACCAACTATTAAGGAGAAAGGAAAATTCTTCTGCGAGTTCAACAAAAGCATCTCCTCAACGTTCACTTGAAGAGAGAGAAGCATCTTATCTGGCTGTCCGTGAGCGCATTTTCATGATGCACATGGGAGAAGACAGCGAACCCGTGAAGCCAAAGCCACGCTGTAATCCTGTGGTTGCACGCCGCATGATTGCACATGCACTGGGCCAAAGAATTAATTCATTTCCTGAGGATACTACTTGTCATTTCAAAGAGCAAGGAGGTATAGTGAACAATGCCTACATCCAAGCAAAAGATTCGAAGTTGCCTGATTCTACTGTGGAAGCTGTTAACAAAACCATTTCACGGTCAGATCAATGTGTGAACTCGAAGAATGAAGTGGATAAGAATTGCAATCCAGACGTGTCATTGGCAAAGGGAAGTAATGCTGTCAAAATGAAACCTGACAAGAGTTCTCCGAAGGCAAGTCGTGTCGACAATGAGTACTTGAAGAGAGAACATTTAGGAGCTGCGAAGAGGATGTTTTCACAGGCTTTAGGCAAGCACTGCCGAAAGAATGATTCTCTTCAAACTCGTGGGGAATCAGATTAAAATGGAATCAGGATTCAAAGATATCTGCAGTAGCTCTTGTTCTCTGGCATGATCTTTTGTCCTATTCACAATTTGTTGTGTTAGGATTGCTGATGCCAACTAAAAGATGAATGGGACTGTGTGATGACAAATTTGTCTACGTCGCATTGAACATGGTCGTCATCGGCGACGTGATCTGGGTGGATGAAACACCTATGGCTGCTAGTTTATGTTGGCAATTCAGTTGGCAAATGTGAAATCTAACATTGCATTCTCTAGCTTATATATTTCTTGATGTGCTCAGAATTCGATCTGGTCATTTGAGAAGCGCTGTTGAGGAGATTGATCTACGTGGTTCAAGCAGAGTCTTGAGAAAAAGGTTGTCGAAATGAATTGGGATTGTATCTGCTCACTGATGAGATGCGCATAGCGGTTTGTGTCGGGTTAAAGTAAAGGTTTCTAACTTAAATTCGTGTTAGTACTTGATTGTTATATCCTATCCTCTCTATTTTCTGCTGATGGCAAATAGCATCTCGTGGATCGTAATGAATGATTATTTACTGGACTCGACGATGTCGGGCAATACACTATGTTTTATGTGAAACTGTACTAATTTTAAGCTTTATTGTTCCCTTTGAAATCTTTGAACTATTGGCAAATACTTTCTTTTGAACGAAGGCCAAAGTGTTCGTGGATTTCAAGTATCTTTATTTTCTTATGGGGCTATAGAGC

Coding sequence (CDS)

ATGGATGATTATCTGACTCCCCAGGTGGAGGAGTTGGCCTTTCTTGTTAAGGACAACCTCCCCAGCAAGCATCTTATTCTATCCATGGAAGAAACCTTCATAAACTTTCTTCATGATGAAACCAGTTCAGATGGAATCTTGGAGTTGGAACCAATGGATTCATACAACCGTCTTCTTTTGCATCGCCTTGCTGATATTTTTGGATTTGCCCACATATCAGTTGGTGAAGGTGCTAATAGACACTTGGTTTTGGAGCGATGCCCAGAGTCATCGATACCGTCCATTCTTGTGAGTGATATTCTGTGGGAGTATGATGAACCTCAAATGTCAACGATACCACACCAACTATTAAGGAGAAAGGAAAATTCTTCTGCGAGTTCAACAAAAGCATCTCCTCAACGTTCACTTGAAGAGAGAGAAGCATCTTATCTGGCTGTCCGTGAGCGCATTTTCATGATGCACATGGGAGAAGACAGCGAACCCGTGAAGCCAAAGCCACGCTGTAATCCTGTGGTTGCACGCCGCATGATTGCACATGCACTGGGCCAAAGAATTAATTCATTTCCTGAGGATACTACTTGTCATTTCAAAGAGCAAGGAGGTATAGTGAACAATGCCTACATCCAAGCAAAAGATTCGAAGTTGCCTGATTCTACTGTGGAAGCTGTTAACAAAACCATTTCACGGTCAGATCAATGTGTGAACTCGAAGAATGAAGTGGATAAGAATTGCAATCCAGACGTGTCATTGGCAAAGGGAAGTAATGCTGTCAAAATGAAACCTGACAAGAGTTCTCCGAAGGCAAGTCGTGTCGACAATGAGTACTTGAAGAGAGAACATTTAGGAGCTGCGAAGAGGATGTTTTCACAGGCTTTAGGCAAGCACTGCCGAAAGAATGATTCTCTTCAAACTCGTGGGGAATCAGATTAA

Protein sequence

MDDYLTPQVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFGFAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSTKASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINSFPEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPDVSLAKGSNAVKMKPDKSSPKASRVDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTRGESD
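
The coding sequence and the protein sequence above can be checked against each other with a short translation routine. This is a sketch only, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGGATGATTATCTGACTCCCCAGGTGGAG"  # first 10 codons of the CDS
cds_end = "CGTGGGGAATCAGATTAA"                # last 6 codons, including the stop
print(translate(cds_start))  # MDDYLTPQVE
print(translate(cds_end))    # RGESD
```

Both fragments agree with the start (MDDYLTPQVE) and the end (RGESD) of the protein sequence listed above.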
BLAST of Lsi11G008490 vs. TrEMBL
Match: A0A0A0LH15_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074070 PE=4 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 1.7e-148
Identity = 266/301 (88.37%), Positives = 284/301 (94.35%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLPSKHLILSMEETFINFLH+ETSSDGILEL+PMDSYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHNETSSDGILELKPMDSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
             H+SVGEG NRHLVLER PESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST
Sbjct: 69  LGHVSVGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128

Query: 129 KASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINSF 188
           K+SPQRSLEEREA+YLAVRERIFM H+GED+EP+KPKPRC+P VARRMIAHALGQR+NS 
Sbjct: 129 KSSPQRSLEEREAAYLAVRERIFMTHVGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSL 188

Query: 189 PEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPDV 248
            EDT CH KEQGG+ NNAYIQA+DSKLPDSTVEA+NKTISRSDQCVN KNE+DKNCNPDV
Sbjct: 189 SEDTNCHQKEQGGVTNNAYIQARDSKLPDSTVEAINKTISRSDQCVNLKNELDKNCNPDV 248

Query: 249 SLAKGSNAVKMKPDKSSPKASRVDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTRGES 308
           SLA+GS A KMKP KS PKAS VDNE+LKREHLGAAKRMFSQALGKHCRKN+SLQTRGE+
Sbjct: 249 SLARGSTAAKMKPAKSYPKASHVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRGEA 308

Query: 309 D 310
           D
Sbjct: 309 D 309
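
The percentages on an HSP summary line follow directly from the counts that precede them. The sketch below parses such a line and recomputes both values; `HspStats`, `parse_hsp_stats` and `percent` are hypothetical names, and the pattern is written to accept either spelling of the "Positives" label.

```python
import re
from typing import NamedTuple


class HspStats(NamedTuple):
    identical: int       # identical residue pairs
    positives: int       # identical or conservatively substituted pairs
    aligned_length: int  # denominator reported on the summary line


_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> HspStats:
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, len_a, positives, len_b = map(int, m.groups())
    if len_a != len_b:
        raise ValueError("identity and positives use different denominators")
    return HspStats(identical, positives, len_a)


def percent(count: int, total: int) -> float:
    return round(100.0 * count / total, 2)


line = "Identity = 266/301 (88.37%), Positives = 284/301 (94.35%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(percent(stats.identical, stats.aligned_length))  # 88.37
print(percent(stats.positives, stats.aligned_length))  # 94.35
```

The same function applies to the summary line of every hit listed on this page.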

BLAST of Lsi11G008490 vs. TrEMBL
Match: A0A061EY00_THECC (Single-stranded nucleic acid binding R3H protein, putative isoform 1 OS=Theobroma cacao GN=TCM_025347 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.1e-73
Identity = 172/313 (54.95%), Positives = 205/313 (65.50%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLP KHL+LSMEE F+NFL D+TSSDGILELEPM+SYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPCKHLVLSMEEAFVNFLQDDTSSDGILELEPMNSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
           FAH SVGEG +RHLVLERCPE+SIPSILVSDILW+ DEPQ  T    LL R+E  + + T
Sbjct: 69  FAHESVGEGDDRHLVLERCPETSIPSILVSDILWQCDEPQSLTTSRHLLVREEAPAVAKT 128

Query: 129 K-ASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINS 188
           +  S + SLE REA+YLA RERIF M + E  EPVK KPR  PVVARRMIAHALGQ+INS
Sbjct: 129 ELPSFELSLEAREAAYLAARERIFSMDVEEVREPVKEKPRTVPVVARRMIAHALGQKINS 188

Query: 189 FPEDTTCH-FKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCV---NSKNEVDKN 248
             ++     FK+  G  N   I  KD       V+   +T +  D      N+ ++ + N
Sbjct: 189 SSQNVNARDFKDHEGQPNELNIHDKDE------VDNNLRTATYQDTVFVPGNAFSKANSN 248

Query: 249 CNPDVSLAKGSNAVKMKPDKSSP--------KASRVDNEYLKREHLGAAKRMFSQALGKH 308
            +   +   G   V  KP +  P          +RV+ EY K EHLGAAKRMF+ ALG  
Sbjct: 249 AHKHNASVVGKRNVSDKPAQKGPSDVRIPGRSRNRVNKEYSKEEHLGAAKRMFAHALGLR 308

BLAST of Lsi11G008490 vs. TrEMBL
Match: A0A061EYU8_THECC (Single-stranded nucleic acid binding R3H protein, putative isoform 2 OS=Theobroma cacao GN=TCM_025347 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.1e-73
Identity = 172/313 (54.95%), Positives = 205/313 (65.50%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLP KHL+LSMEE F+NFL D+TSSDGILELEPM+SYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPCKHLVLSMEEAFVNFLQDDTSSDGILELEPMNSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
           FAH SVGEG +RHLVLERCPE+SIPSILVSDILW+ DEPQ  T    LL R+E  + + T
Sbjct: 69  FAHESVGEGDDRHLVLERCPETSIPSILVSDILWQCDEPQSLTTSRHLLVREEAPAVAKT 128

Query: 129 K-ASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINS 188
           +  S + SLE REA+YLA RERIF M + E  EPVK KPR  PVVARRMIAHALGQ+INS
Sbjct: 129 ELPSFELSLEAREAAYLAARERIFSMDVEEVREPVKEKPRTVPVVARRMIAHALGQKINS 188

Query: 189 FPEDTTCH-FKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCV---NSKNEVDKN 248
             ++     FK+  G  N   I  KD       V+   +T +  D      N+ ++ + N
Sbjct: 189 SSQNVNARDFKDHEGQPNELNIHDKDE------VDNNLRTATYQDTVFVPGNAFSKANSN 248

Query: 249 CNPDVSLAKGSNAVKMKPDKSSP--------KASRVDNEYLKREHLGAAKRMFSQALGKH 308
            +   +   G   V  KP +  P          +RV+ EY K EHLGAAKRMF+ ALG  
Sbjct: 249 AHKHNASVVGKRNVSDKPAQKGPSDVRIPGRSRNRVNKEYSKEEHLGAAKRMFAHALGLR 308

BLAST of Lsi11G008490 vs. TrEMBL
Match: W9S6Y8_9ROSA (R3H domain-containing protein 2 OS=Morus notabilis GN=L484_015106 PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 5.3e-73
Identity = 164/293 (55.97%), Positives = 202/293 (68.94%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLP KHL+LS+EE  +NFL  +TSS GILELEPM+SYNRLL+HRLADIFG
Sbjct: 9   VEELAFLVKDNLPCKHLVLSVEEALVNFLETDTSSCGILELEPMNSYNRLLMHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSAS-- 128
           F+H SVGEG +RHL+LERCPE+S+PSILVSDIL +YDEPQ  T  HQLLRR + S  S  
Sbjct: 69  FSHESVGEGDDRHLILERCPETSVPSILVSDILGQYDEPQSPTTSHQLLRRTDASPVSLV 128

Query: 129 STKASP--QRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQR 188
           S   SP    SLEEREA+YLA RERIF + +G + EPV+PKPR  PVVARRMIAHALGQR
Sbjct: 129 SKTQSPLVPHSLEEREAAYLAARERIFSLDLGGEKEPVRPKPRSVPVVARRMIAHALGQR 188

Query: 189 INSFPEDTTCHFKEQG-GIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKN 248
           I S   DTT        G  +   I+ KD    DS+++   + +  S +  +  ++  KN
Sbjct: 189 ITSCNHDTTHKDSSSSEGQTHELNIEGKDKTESDSSLQGYQEIVHLSGKSTSLYSKERKN 248

Query: 249 CN-PDVSLAKGSNAVKMKPDKSSPKASR--VDNEYLKREHLGAAKRMFSQALG 294
            +    S    + A +   +K S   +R  ++ E+ KREHLGAAKRMF+ ALG
Sbjct: 249 YHGTGASSPSDTKAPQKLSEKISTSIARNGMNREHYKREHLGAAKRMFAHALG 301

BLAST of Lsi11G008490 vs. TrEMBL
Match: A0A067DWA9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020450mg PE=4 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 6.9e-73
Identity = 162/299 (54.18%), Positives = 201/299 (67.22%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLPSKHL+LSMEE  +NFL D+ S+DG+LELEPMDSYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPSKHLVLSMEEALVNFLQDDNSADGVLELEPMDSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
           FAH SVGEG +RHL+LERC E+SIPSILVSDILW+Y EPQ  T  HQ+LRRKE       
Sbjct: 69  FAHESVGEGVDRHLILERCSETSIPSILVSDILWQYGEPQSLTTSHQILRRKEAPPVLKI 128

Query: 129 KA-SPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINS 188
           ++ S + SL EREA+YLA RERIF M     +EPV+ KPR  P+VA+RMIAHALGQ++ S
Sbjct: 129 QSPSGEHSLAEREAAYLAARERIFSMDARTVAEPVRQKPRSVPIVAQRMIAHALGQKMKS 188

Query: 189 FPEDTTCHFK-EQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDK---- 248
             +DT    +     + N   I+  D   P++++EA  +T     Q VNS ++ +K    
Sbjct: 189 GDQDTAVRDEITNEKLANEINIRESDKAGPNTSLEAYQETTLLPGQNVNSLDKSNKKKGK 248

Query: 249 --------NCNPDVSLAKGSNAVKMKPDKSSPKASRVDNEYLKREHLGAAKRMFSQALG 294
                      P  +  K SN V +  +  S  +S  + EYLK E LGAAKR+F+ ALG
Sbjct: 249 STVSSQSDRIAPQETSEKSSNGVSISRNGRSTNSS--NKEYLKEERLGAAKRLFAHALG 305

BLAST of Lsi11G008490 vs. NCBI nr
Match: gi|449470259|ref|XP_004152835.1| (PREDICTED: R3H domain-containing protein 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 533.5 bits (1373), Expect = 2.5e-148
Identity = 266/301 (88.37%), Positives = 284/301 (94.35%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLPSKHLILSMEETFINFLH+ETSSDGILEL+PMDSYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHNETSSDGILELKPMDSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
             H+SVGEG NRHLVLER PESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST
Sbjct: 69  LGHVSVGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128

Query: 129 KASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINSF 188
           K+SPQRSLEEREA+YLAVRERIFM H+GED+EP+KPKPRC+P VARRMIAHALGQR+NS 
Sbjct: 129 KSSPQRSLEEREAAYLAVRERIFMTHVGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSL 188

Query: 189 PEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPDV 248
            EDT CH KEQGG+ NNAYIQA+DSKLPDSTVEA+NKTISRSDQCVN KNE+DKNCNPDV
Sbjct: 189 SEDTNCHQKEQGGVTNNAYIQARDSKLPDSTVEAINKTISRSDQCVNLKNELDKNCNPDV 248

Query: 249 SLAKGSNAVKMKPDKSSPKASRVDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTRGES 308
           SLA+GS A KMKP KS PKAS VDNE+LKREHLGAAKRMFSQALGKHCRKN+SLQTRGE+
Sbjct: 249 SLARGSTAAKMKPAKSYPKASHVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRGEA 308

Query: 309 D 310
           D
Sbjct: 309 D 309

BLAST of Lsi11G008490 vs. NCBI nr
Match: gi|778667822|ref|XP_011648991.1| (PREDICTED: R3H domain-containing protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 528.9 bits (1361), Expect = 6.1e-147
Identity = 266/302 (88.08%), Positives = 284/302 (94.04%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSS-DGILELEPMDSYNRLLLHRLADIF 68
           VEELAFLVKDNLPSKHLILSMEETFINFLH+ETSS DGILEL+PMDSYNRLLLHRLADIF
Sbjct: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHNETSSSDGILELKPMDSYNRLLLHRLADIF 68

Query: 69  GFAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 128
           G  H+SVGEG NRHLVLER PESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS
Sbjct: 69  GLGHVSVGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 128

Query: 129 TKASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINS 188
           TK+SPQRSLEEREA+YLAVRERIFM H+GED+EP+KPKPRC+P VARRMIAHALGQR+NS
Sbjct: 129 TKSSPQRSLEEREAAYLAVRERIFMTHVGEDNEPLKPKPRCDPAVARRMIAHALGQRVNS 188

Query: 189 FPEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPD 248
             EDT CH KEQGG+ NNAYIQA+DSKLPDSTVEA+NKTISRSDQCVN KNE+DKNCNPD
Sbjct: 189 LSEDTNCHQKEQGGVTNNAYIQARDSKLPDSTVEAINKTISRSDQCVNLKNELDKNCNPD 248

Query: 249 VSLAKGSNAVKMKPDKSSPKASRVDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTRGE 308
           VSLA+GS A KMKP KS PKAS VDNE+LKREHLGAAKRMFSQALGKHCRKN+SLQTRGE
Sbjct: 249 VSLARGSTAAKMKPAKSYPKASHVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRGE 308

Query: 309 SD 310
           +D
Sbjct: 309 AD 310

BLAST of Lsi11G008490 vs. NCBI nr
Match: gi|659082547|ref|XP_008441899.1| (PREDICTED: uncharacterized protein LOC103485903 isoform X2 [Cucumis melo])

HSP 1 Score: 510.8 bits (1314), Expect = 1.7e-141
Identity = 261/303 (86.14%), Positives = 281/303 (92.74%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG 68
           VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILEL+PMDSYNRLLLHRLADIFG
Sbjct: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFG 68

Query: 69  FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST 128
             H+S GEG NRHLVLER PESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 
Sbjct: 69  LGHVSAGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSA 128

Query: 129 KASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINSF 188
           K+SPQRSLEERE +YLAVRERIFM H+GED+EP+KPKPRC+P VARRMIAHALGQR+NSF
Sbjct: 129 KSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSF 188

Query: 189 PEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPDV 248
           PEDT CH K Q G+ NNAYIQA+DSKLP+STVEA+NKTIS+SDQC+N KNE DKNCNP+V
Sbjct: 189 PEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNV 248

Query: 249 SLAKGSNAVKMKPDKSSPKASR-VDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTR-G 308
           SLA+GS A KMK DKSSPKAS  VDNE+LKREHLGAAKRMFSQALGKHCRKN+SLQTR G
Sbjct: 249 SLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCG 308

Query: 309 ESD 310
           E+D
Sbjct: 309 EAD 310

BLAST of Lsi11G008490 vs. NCBI nr
Match: gi|659082545|ref|XP_008441898.1| (PREDICTED: uncharacterized protein LOC103485903 isoform X1 [Cucumis melo])

HSP 1 Score: 506.1 bits (1302), Expect = 4.2e-140
Identity = 261/304 (85.86%), Positives = 281/304 (92.43%), Query Frame = 1

Query: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSS-DGILELEPMDSYNRLLLHRLADIF 68
           VEELAFLVKDNLPSKHLILSMEETFINFLHDETSS DGILEL+PMDSYNRLLLHRLADIF
Sbjct: 9   VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSSDGILELKPMDSYNRLLLHRLADIF 68

Query: 69  GFAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 128
           G  H+S GEG NRHLVLER PESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS
Sbjct: 69  GLGHVSAGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 128

Query: 129 TKASPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRINS 188
            K+SPQRSLEERE +YLAVRERIFM H+GED+EP+KPKPRC+P VARRMIAHALGQR+NS
Sbjct: 129 AKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNS 188

Query: 189 FPEDTTCHFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCNPD 248
           FPEDT CH K Q G+ NNAYIQA+DSKLP+STVEA+NKTIS+SDQC+N KNE DKNCNP+
Sbjct: 189 FPEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPN 248

Query: 249 VSLAKGSNAVKMKPDKSSPKASR-VDNEYLKREHLGAAKRMFSQALGKHCRKNDSLQTR- 308
           VSLA+GS A KMK DKSSPKAS  VDNE+LKREHLGAAKRMFSQALGKHCRKN+SLQTR 
Sbjct: 249 VSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRC 308

Query: 309 GESD 310
           GE+D
Sbjct: 309 GEAD 311

BLAST of Lsi11G008490 vs. NCBI nr
Match: gi|1009118127|ref|XP_015875695.1| (PREDICTED: uncharacterized protein LOC107412430 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 296.6 bits (758), Expect = 5.1e-77
Identity = 173/315 (54.92%), Positives = 209/315 (66.35%), Query Frame = 1

Query: 8   QVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIF 67
           +VEELA LVKDNLP KHL+LS+EE  INFL  +TSSDG+LELEPM+SYNRLLLHRLADIF
Sbjct: 30  KVEELASLVKDNLPCKHLVLSVEEVLINFLQTDTSSDGVLELEPMNSYNRLLLHRLADIF 89

Query: 68  GFAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS 127
           GF+H SVGEG +RHL+LERCPE+SIPSILVSDILW+YD+PQ S + HQLL+R E S    
Sbjct: 90  GFSHESVGEGDDRHLILERCPETSIPSILVSDILWQYDDPQSSAMTHQLLKRTETSPVLK 149

Query: 128 TKA-SPQRSLEEREASYLAVRERIFMMHMGEDSEPVKPKPRCNPVVARRMIAHALGQRIN 187
           TK+ S Q SLEEREA+YLA R+RIF M +GE+ EPVK KPR  PVVARRMIAHALGQRI+
Sbjct: 150 TKSPSVQHSLEEREAAYLAARQRIFSMDLGEEKEPVKQKPRSVPVVARRMIAHALGQRIS 209

Query: 188 SFPEDTTC-HFKEQGGIVNNAYIQAKDSKLPDSTVEAVNKTISRSDQCVNSKNEVDKNCN 247
           S  +DT    F   G   +    Q KD    + T +   +++S S     S +EV KN  
Sbjct: 210 SINQDTALKDFSGSGEETDEMKAQDKDKIDLNLTQKTFQESLSLSGTNTKSYDEVKKN-- 269

Query: 248 PDVSLAKGSNAVKMKPDKSSPKAS-------------RVDNEYLKREHLGAAKRMFSQAL 307
            + S    S +    P K + K S              V+ ++ KR+HLGAAKRMF+ AL
Sbjct: 270 -EQSTCASSQSEWKAPQKQAEKVSSSVSISRSGRDRNNVNEDHHKRQHLGAAKRMFAHAL 329

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0LH15_CUCSA  1.7e-148  88.37         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074070 PE=4 SV=1
A0A061EY00_THECC  1.1e-73   54.95         Single-stranded nucleic acid binding R3H protein, putative isoform 1 OS=Theobrom...
A0A061EYU8_THECC  1.1e-73   54.95         Single-stranded nucleic acid binding R3H protein, putative isoform 2 OS=Theobrom...
W9S6Y8_9ROSA      5.3e-73   55.97         R3H domain-containing protein 2 OS=Morus notabilis GN=L484_015106 PE=4 SV=1
A0A067DWA9_CITSI  6.9e-73   54.18         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020450mg PE=4 SV=1

Match Name                         E-value   Identity (%)  Description
gi|449470259|ref|XP_004152835.1|   2.5e-148  88.37         PREDICTED: R3H domain-containing protein 1 isoform X2 [Cucumis sativus]
gi|778667822|ref|XP_011648991.1|   6.1e-147  88.08         PREDICTED: R3H domain-containing protein 1 isoform X1 [Cucumis sativus]
gi|659082547|ref|XP_008441899.1|   1.7e-141  86.14         PREDICTED: uncharacterized protein LOC103485903 isoform X2 [Cucumis melo]
gi|659082545|ref|XP_008441898.1|   4.2e-140  85.86         PREDICTED: uncharacterized protein LOC103485903 isoform X1 [Cucumis melo]
gi|1009118127|ref|XP_015875695.1|  5.1e-77   54.92         PREDICTED: uncharacterized protein LOC107412430 isoform X1 [Ziziphus jujuba]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
Vocabulary: INTERPRO
Term        Definition
IPR024771   SUZ
IPR001374   R3H_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi11G008490.1  Lsi11G008490.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term         Source Description   Alignment
IPR001374  R3H domain        GENE3D   G3DSA:3.30.1370.50  -                    coord: 26..94; score: 5.7
IPR001374  R3H domain        PFAM     PF01424             R3H                  coord: 26..85; score: 5.1
IPR001374  R3H domain        SMART    SM00393             R3H_4                coord: 10..87; score: 0.
IPR001374  R3H domain        PROFILE  PS51061             R3H                  coord: 23..88; score: 16
IPR001374  R3H domain        unknown  SSF82708            R3H domain           coord: 11..121; score: 7.06
IPR024771  SUZ domain        PFAM     PF12752             SUZ                  coord: 117..152; score: 2.
IPR024771  SUZ domain        PROFILE  PS51673             SUZ                  coord: 91..155; score: 9
None       No IPR available  PANTHER  PTHR15672           CAMP-REGULATED PHOSPHOPROTEIN 21 RELATED R3H DOMAIN CONTAINING PROTEIN  coord: 9..243; score: 5.1E
None       No IPR available  PANTHER  PTHR15672:SF16      SUBFAMILY NOT NAMED  coord: 9..243; score: 5.1E
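
The `coord` values in the annotation table can be used to cut the matching region out of the protein sequence. The following is a rough sketch, assuming the coordinates are 1-based, inclusive residue positions; `parse_coord` and `extract_region` are hypothetical helpers, and only the N-terminal 128 residues of the protein are included to keep the example short.

```python
import re
from typing import Tuple

# N-terminal 128 residues of the Lsi11G008490 protein sequence.
PROTEIN_N_TERM = (
    "MDDYLTPQ"
    "VEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELEPMDSYNRLLLHRLADIFG"
    "FAHISVGEGANRHLVLERCPESSIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASST"
)


def parse_coord(text: str) -> Tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 26..85'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(m.group(1)), int(m.group(2))


def extract_region(protein: str, start: int, end: int) -> str:
    # Assumption: positions are 1-based and inclusive.
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]


start, end = parse_coord("coord: 26..85")  # Pfam PF01424 (R3H) row
r3h = extract_region(PROTEIN_N_TERM, start, end)
print(len(r3h), r3h)
```

Regions that extend past residue 128, such as the SUZ domain rows, need the full protein sequence as input.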