Lsi07G012000.1 (mRNA) Bottle gourd (USVL1VR-Ls)

Name: Lsi07G012000.1
Type: mRNA
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: VQ motif protein
Location: chr07 : 17495852 .. 17496814 (+)
Sequence length: 963
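
The sequence length follows from the location if the coordinates are read as 1-based and inclusive, which is an assumption here (it is the usual convention for GFF-style annotations, but this page does not state it). A minimal sketch of that arithmetic, using a hypothetical helper name `feature_length`:

```python
import re


def feature_length(location: str) -> int:
    """Return the span of a 'chrom : start .. end (strand)' location string.

    Assumes 1-based, inclusive coordinates (not stated on the page).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no 'start .. end' pair found in {location!r}")
    start, end = (int(group) for group in match.groups())
    # Inclusive coordinates: both endpoints count, hence the +1.
    return end - start + 1


print(feature_length("chr07 : 17495852 .. 17496814 (+)"))  # 963
```

Under that assumption the computed span agrees with the reported sequence length of 963.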
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAGGAATAGGCAGAATGAGAATTTGGGTGTGAACAAAATGGGGAAGAATATTAGGAAGAGTCCAATACACCAGCCAAATTTTGGTAATAATGCTGCTAGGCCTCAACCCCAGCCACAAATTTACAACATAAGTAAGAATGATTTTAGGAATATTGTTCAGCAGCTTACAGGCTCACCATCTCAGGATCATCAGCCTCCTCCTAGACCTCCACAAAATCCACCAAAACCCCAAAGTATGCGGTTGCAGAGAATAAGACCTCCCCCGTTAACACCGATTAATCGACCGAATATGCCTGCTCCTATCCCTGCTCCTGTTCCTGTGCCTCCACCACAGGCTCTAGTCAATAACAATGTGCCTAGGCCTGCACAATTTGCTCAGCCACCTCCAAGAAAGTTGCCACCAATGGTACCAGGAGGAGACTCGCATTGGCCGAACCCTGCTGCTGAGTCTCCTATTTCGGCGTATATGCGTTACCTTCAAAATTCGATGATGAATCCATCTCCAGTAGCAAACCAGGCTCAATTTGTACCACAACCTCAGATTCCTGGTCAAATGCATCCACCTCATGCACCTCCATCTGGTTTATTGCCTAATCCTAATCAACCTGTTCCTGCTCTTCCATCTCCTAGATTAAATGGTCCTCCACCCCCTATGCCAAACTTGCCTTCACCGCACTGGAACGGTCCCGCCCTTTTACCTTCCCCAACTTCCCAGTTTCTATTGCCTTCTCCTACTGGTTATTATAATTTGTTGTCCCCAAAATCACCGTATCCGTTACTCTCACCAGGGATCCAGTTTTCTCCGCCGCTGACTCCTAATTTCGCATTTCCATCCATGCCTCCTCAATCAGGGATCTTAGGTCCAGGGCCTCATCCACCACCTTCTCCAGGGGTTATGTTCCCGTTATCTCCTTCAGGGTTTTTTCCCATCTTGAGTCCAAGATGGAGAGATCAATAA

mRNA sequence

ATGGATAGGAATAGGCAGAATGAGAATTTGGGTGTGAACAAAATGGGGAAGAATATTAGGAAGAGTCCAATACACCAGCCAAATTTTGGTAATAATGCTGCTAGGCCTCAACCCCAGCCACAAATTTACAACATAAGTAAGAATGATTTTAGGAATATTGTTCAGCAGCTTACAGGCTCACCATCTCAGGATCATCAGCCTCCTCCTAGACCTCCACAAAATCCACCAAAACCCCAAAGTATGCGGTTGCAGAGAATAAGACCTCCCCCGTTAACACCGATTAATCGACCGAATATGCCTGCTCCTATCCCTGCTCCTGTTCCTGTGCCTCCACCACAGGCTCTAGTCAATAACAATGTGCCTAGGCCTGCACAATTTGCTCAGCCACCTCCAAGAAAGTTGCCACCAATGGTACCAGGAGGAGACTCGCATTGGCCGAACCCTGCTGCTGAGTCTCCTATTTCGGCGTATATGCGTTACCTTCAAAATTCGATGATGAATCCATCTCCAGTAGCAAACCAGGCTCAATTTGTACCACAACCTCAGATTCCTGGTCAAATGCATCCACCTCATGCACCTCCATCTGGTTTATTGCCTAATCCTAATCAACCTGTTCCTGCTCTTCCATCTCCTAGATTAAATGGTCCTCCACCCCCTATGCCAAACTTGCCTTCACCGCACTGGAACGGTCCCGCCCTTTTACCTTCCCCAACTTCCCAGTTTCTATTGCCTTCTCCTACTGGTTATTATAATTTGTTGTCCCCAAAATCACCGTATCCGTTACTCTCACCAGGGATCCAGTTTTCTCCGCCGCTGACTCCTAATTTCGCATTTCCATCCATGCCTCCTCAATCAGGGATCTTAGGTCCAGGGCCTCATCCACCACCTTCTCCAGGGGTTATGTTCCCGTTATCTCCTTCAGGGTTTTTTCCCATCTTGAGTCCAAGATGGAGAGATCAATAA

Coding sequence (CDS)

ATGGATAGGAATAGGCAGAATGAGAATTTGGGTGTGAACAAAATGGGGAAGAATATTAGGAAGAGTCCAATACACCAGCCAAATTTTGGTAATAATGCTGCTAGGCCTCAACCCCAGCCACAAATTTACAACATAAGTAAGAATGATTTTAGGAATATTGTTCAGCAGCTTACAGGCTCACCATCTCAGGATCATCAGCCTCCTCCTAGACCTCCACAAAATCCACCAAAACCCCAAAGTATGCGGTTGCAGAGAATAAGACCTCCCCCGTTAACACCGATTAATCGACCGAATATGCCTGCTCCTATCCCTGCTCCTGTTCCTGTGCCTCCACCACAGGCTCTAGTCAATAACAATGTGCCTAGGCCTGCACAATTTGCTCAGCCACCTCCAAGAAAGTTGCCACCAATGGTACCAGGAGGAGACTCGCATTGGCCGAACCCTGCTGCTGAGTCTCCTATTTCGGCGTATATGCGTTACCTTCAAAATTCGATGATGAATCCATCTCCAGTAGCAAACCAGGCTCAATTTGTACCACAACCTCAGATTCCTGGTCAAATGCATCCACCTCATGCACCTCCATCTGGTTTATTGCCTAATCCTAATCAACCTGTTCCTGCTCTTCCATCTCCTAGATTAAATGGTCCTCCACCCCCTATGCCAAACTTGCCTTCACCGCACTGGAACGGTCCCGCCCTTTTACCTTCCCCAACTTCCCAGTTTCTATTGCCTTCTCCTACTGGTTATTATAATTTGTTGTCCCCAAAATCACCGTATCCGTTACTCTCACCAGGGATCCAGTTTTCTCCGCCGCTGACTCCTAATTTCGCATTTCCATCCATGCCTCCTCAATCAGGGATCTTAGGTCCAGGGCCTCATCCACCACCTTCTCCAGGGGTTATGTTCCCGTTATCTCCTTCAGGGTTTTTTCCCATCTTGAGTCCAAGATGGAGAGATCAATAA

Protein sequence

MDRNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPSQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVPQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPSPGVMFPLSPSGFFPILSPRWRDQ
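
The protein sequence can be related back to the CDS by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the codon table is built from the standard code, and only the first 24 and last 18 nucleotides of the CDS are used so the example stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGATAGGAATAGGCAGAATGAG"))  # MDRNRQNE (start of the protein)
print(translate("AGATGGAGAGATCAATAA"))        # RWRDQ (end of the protein, before TAA)
```

Both fragments agree with the listed protein, which begins MDRNRQNE and ends RWRDQ.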
BLAST of Lsi07G012000.1 vs. Swiss-Prot
Match: IKU1_ARATH (Protein HAIKU1 OS=Arabidopsis thaliana GN=IKU1 PE=1 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 4.2e-34
Identity = 140/334 (41.92%), Positives = 172/334 (51.50%), Query Frame = 1

Query: 1   MDRNRQNENLGVNKMGKNIRKSPIHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQ 60
           MDR RQN++LGVN++GKNIRKSP+HQ  F    +N A P  Q QPQ+YNISKNDFR+IVQ
Sbjct: 1   MDRPRQNDHLGVNRIGKNIRKSPLHQSTFAASTSNGAAPRLQTQPQVYNISKNDFRSIVQ 60

Query: 61  QLTGSPSQDHQPPPRPPQNPP-KPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQA 120
           QLTGSPS++    PRPPQN   +PQ+ RLQRIRP PLT +NRP +P P  AP    P   
Sbjct: 61  QLTGSPSRESL--PRPPQNNSLRPQNTRLQRIRPSPLTQLNRPAVPLPSMAPPQSHP--- 120

Query: 121 LVNNNVPRPAQFAQPPPRKLP-------PMVPGGDSHWPNPAAESPISAYMRYLQNSMMN 180
                     QFA+ PP + P       PM+   D  W N  AESP+S YMRYLQ+S+ +
Sbjct: 121 ----------QFARQPPHQPPFPQTTQQPMMGHRDQFWSN-TAESPVSEYMRYLQSSLGD 180

Query: 181 PSPVANQAQ--FVPQPQIPGQMHPPHAPPSGLLP--NPNQPVPALPSPRLNGPPPPMPNL 240
             P ANQ Q     +P IPG    P+ P +   P    N+  P +P        P     
Sbjct: 181 SGPNANQMQPGHEQRPYIPGHEQRPYVPGNEQQPYMPGNEQRPYIPGHEQRSYMPAQ--- 240

Query: 241 PSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPP 300
            S   + P   P P  Q ++P P    N+  P  P   L P     P L P+   P   P
Sbjct: 241 -SQSQSQPQPQPQP-QQHMMPGPQPRMNMQGPLQPNQYLPP-----PGLVPS-PVPHNLP 300

Query: 301 QSGILGPGPHPPPSPGVMFPLSPSGFFPILSPRW 318
                 P P  P  P  MF     GF    SPR+
Sbjct: 301 SPRFNAPVPVTPTQPSPMFSQMYGGF---PSPRY 304
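
The percentages in each HSP summary line are the match counts divided by the alignment length (which includes gap columns). A small sketch of that calculation, with `percent` as a hypothetical helper and the counts taken from the HAIKU1 hit above:

```python
def percent(matches: int, aligned: int) -> str:
    """Format matches / aligned columns as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}"


# Counts from the HSP against IKU1_ARATH: 140 identical and 172 positive
# columns out of 334 aligned columns.
print(percent(140, 334))  # 41.92
print(percent(172, 334))  # 51.50
```

The same calculation applies to the other hits, for example 99/266 for VQ9_ARATH.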

BLAST of Lsi07G012000.1 vs. Swiss-Prot
Match: VQ9_ARATH (VQ motif-containing protein 9 OS=Arabidopsis thaliana GN=VQ9 PE=1 SV=1)

HSP 1 Score: 83.2 bits (204), Expect = 5.7e-15
Identity = 99/266 (37.22%), Positives = 130/266 (48.87%), Query Frame = 1

Query: 37  QPQPQIYNISKNDFRNIVQQLTGSPSQDH-QPPPRPPQNPPKP-QSMRLQRIRPPPLTPI 96
           Q QP +YNI+KNDFR++VQ+LTGSP+ +    PP+ P + PKP QS RL RIRPPPL  +
Sbjct: 77  QHQPPVYNINKNDFRDVVQKLTGSPAHERISAPPQQPIHHPKPQQSSRLHRIRPPPL--V 136

Query: 97  NRPNMPAPIPAPVPVPPPQALVNNNVPRPAQFAQP--PPRKLPPMVPGGDSHWPNPAAES 156
           +  N P  +     +P     +N N        +P  P   LPP+ P       + AAES
Sbjct: 137 HVINRPPGLLNDALIPQGSHHMNQNWTGVGFNLRPTAPLSPLPPLPP------VHAAAES 196

Query: 157 PISAYMRYLQNSMMNPSPVANQAQFVPQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPR 216
           P+S+YMRYLQNSM   +  +N+ +F                 SGL P      P     +
Sbjct: 197 PVSSYMRYLQNSMF--AIDSNRKEF-----------------SGLSPLAPLVSPRWYQQQ 256

Query: 217 LNGPPPPMPNLPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPL 276
            N PP    + P PH   P+   S T    +P+P  +    SPKSPY LLSP I  SP  
Sbjct: 257 ENAPPSQHNSFPPPHPPPPSSAVSQTVPTSIPAPPLFGCSSSPKSPYGLLSPSILLSPS- 306

Query: 277 TPNFAFPSMPPQSGILGPGPHPPPSP 299
           +    FP        + P   P PSP
Sbjct: 317 SGQLGFP--------VSPTTVPLPSP 306

BLAST of Lsi07G012000.1 vs. Swiss-Prot
Match: PERK2_ARATH (Proline-rich receptor-like protein kinase PERK2 OS=Arabidopsis thaliana GN=PERK2 PE=2 SV=3)

HSP 1 Score: 54.3 bits (129), Expect = 2.8e-06
Identity = 87/250 (34.80%), Positives = 106/250 (42.40%), Query Frame = 1

Query: 59  GSPSQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNN 118
           G+PS    PPP+P   PP PQ        P P+TP   P  P  +P  +P PPP   +  
Sbjct: 9   GTPS----PPPQPLPIPPPPQ--------PLPVTP---PPPPTALPPALPPPPPPTALPP 68

Query: 119 NVPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFV 178
            +P P     PPP  +PP+ P   S  P P   SP+             PSP        
Sbjct: 69  ALPPP-----PPPTTVPPIPPSTPSP-PPPLTPSPLP------------PSPTTPSPPLT 128

Query: 179 PQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPT 238
           P P  P     P  PP+     P  P P  PSP    PPPP P++PSP      L PSP 
Sbjct: 129 PSPTTPSPPLTPSPPPAITPSPPLTPSPLPPSPTTPSPPPPSPSIPSP-----PLTPSPP 188

Query: 239 SQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNF-AFPSMPPQSGILGPGPHPPPS 298
                 SP      L P SP P  SP    +PP +P   + P+ PP+ G L P P   PS
Sbjct: 189 PS----SP------LRPSSPPPP-SPATPSTPPRSPPPPSTPTPPPRVGSLSPPPPASPS 208

Query: 299 PGVMFPLSPS 308
            G   P +PS
Sbjct: 249 GG-RSPSTPS 208

BLAST of Lsi07G012000.1 vs. TrEMBL
Match: A0A0A0L576_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G011640 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 1.4e-164
Identity = 299/323 (92.57%), Positives = 309/323 (95.67%), Query Frame = 1

Query: 1   MDRNRQNENLGVNKMGKNIRKSPIHQPNFGNN-AARPQPQPQIYNISKNDFRNIVQQLTG 60
           MDRNRQNENLGVNK+GKNIRKSPIHQPNFGNN AARPQPQPQIYNISKNDFRNIVQQLTG
Sbjct: 1   MDRNRQNENLGVNKLGKNIRKSPIHQPNFGNNNAARPQPQPQIYNISKNDFRNIVQQLTG 60

Query: 61  SPSQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNN 120
           SPSQD+QPPPRPPQNPPK QSMRLQRIRPPPLTPINRPN+PAPIPAPVPVPPPQALVNNN
Sbjct: 61  SPSQDNQPPPRPPQNPPKSQSMRLQRIRPPPLTPINRPNIPAPIPAPVPVPPPQALVNNN 120

Query: 121 VPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVP 180
           VPRP QFAQPPPR+LPP+  GGDSHWPNPAAESPISAYMRYLQNSMMNPSPV NQAQF+P
Sbjct: 121 VPRPPQFAQPPPRQLPPVAMGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVGNQAQFIP 180

Query: 181 QPQIPGQMHPPHAPPSGLL--PNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSP 240
           Q Q+PGQMHPPHAPP GLL  PNPN PVPALPSPRLNGPPPP+PN PSPHWNGPALLPSP
Sbjct: 181 QSQVPGQMHPPHAPPPGLLPNPNPNPPVPALPSPRLNGPPPPIPNFPSPHWNGPALLPSP 240

Query: 241 TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPS 300
           TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQF+PPLTPNFAFPSM PQSGILGPGPHPPPS
Sbjct: 241 TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFTPPLTPNFAFPSM-PQSGILGPGPHPPPS 300

Query: 301 PGVMFPLSPSGFFPILSPRWRDQ 321
           PGV+FPLSPSG FPILSPRWRDQ
Sbjct: 301 PGVLFPLSPSGIFPILSPRWRDQ 322

BLAST of Lsi07G012000.1 vs. TrEMBL
Match: B9RN55_RICCO (LRX2, putative OS=Ricinus communis GN=RCOM_1343860 PE=4 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 5.1e-111
Identity = 231/322 (71.74%), Positives = 254/322 (78.88%), Query Frame = 1

Query: 5   RQNENLGVNKMGKNIRKSPIHQPNFGNNAA---RPQPQPQIYNISKNDFRNIVQQLTGSP 64
           +QN+ LGVNK+GKNIRKSP+HQPNF NNA    R QPQPQ+YNISKNDFRNIVQQLTGSP
Sbjct: 8   QQNDPLGVNKLGKNIRKSPLHQPNFANNANNANRQQPQPQVYNISKNDFRNIVQQLTGSP 67

Query: 65  SQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVP 124
           SQ+  P PRPPQNPPKPQSMRLQ+IRPPPLTPINRP++P P+PAP   PPP    NNN  
Sbjct: 68  SQE--PLPRPPQNPPKPQSMRLQKIRPPPLTPINRPHIPPPVPAPAVAPPPPVPFNNNFA 127

Query: 125 RPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQ---FV 184
           RP QF  P P  +PPM PG DS W N  AESPISAYMRYLQNS+M+PSP  NQAQ     
Sbjct: 128 RPGQFGHPSPTMMPPMAPG-DSAWAN-TAESPISAYMRYLQNSIMDPSPRGNQAQPSLQQ 187

Query: 185 PQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPT 244
            Q Q P  +HP   P SGLLPNP+  +PALPSPRLNGP P + NLPSP  NGPALLPSPT
Sbjct: 188 LQAQGPAYIHP-QPPSSGLLPNPH--MPALPSPRLNGPVPHVTNLPSPQMNGPALLPSPT 247

Query: 245 SQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPSP 304
           SQFLLPSPTGY NLLSP+SPYP  SPG+QF PPL  NF F  M  QSGILGPGP PPPSP
Sbjct: 248 SQFLLPSPTGYMNLLSPRSPYPFYSPGVQFPPPLAHNFTFSPM-AQSGILGPGPQPPPSP 307

Query: 305 GVMFPLSPSGFFPILSPRWRDQ 321
           G++FPLSP+GFFP+ SPRWRDQ
Sbjct: 308 GLVFPLSPTGFFPLSSPRWRDQ 321

BLAST of Lsi07G012000.1 vs. TrEMBL
Match: A0A061GAC9_THECC (VQ motif-containing protein isoform 1 OS=Theobroma cacao GN=TCM_015718 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 8.6e-111
Identity = 235/340 (69.12%), Positives = 260/340 (76.47%), Query Frame = 1

Query: 3   RNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPS 62
           +NR N++LGVNK+GKNI+KSP+HQPNF NNAAR QPQPQ+YNISKNDFRNIVQQLTGSPS
Sbjct: 5   KNRHNDHLGVNKIGKNIKKSPLHQPNFANNAARQQPQPQVYNISKNDFRNIVQQLTGSPS 64

Query: 63  QDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVP-------PPQAL 122
           QD  P PRPPQNPPKPQSMRLQRIRPPPLTPINRP++P P+P PVP P       PP A 
Sbjct: 65  QD--PLPRPPQNPPKPQSMRLQRIRPPPLTPINRPHIPPPVPVPVPAPAHVPALVPPPAP 124

Query: 123 VNNNVPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQA 182
            NN++ RP  +  P P  L PM+PG D  W N  AESPISAYMRYLQ S+++PSPV NQ 
Sbjct: 125 YNNSLVRPGHYGPPSPAMLHPMMPG-DVIWGN-TAESPISAYMRYLQTSLIDPSPVGNQV 184

Query: 183 QFVPQPQIPGQMHPPHAPPS-GLLPNPNQPV--------------PALPSPRLNGPPPPM 242
           Q    P +PGQ  P   PPS GLLPNP  PV              P +PSPR+ GP P M
Sbjct: 185 QPQLYPPVPGQ--PQALPPSSGLLPNPPMPVLPSPRGVNGPVPPMPNIPSPRMKGPVPSM 244

Query: 243 PNLPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPS 302
           PNLPSP  NGP LLPSPTSQFLLPSPTGY NLLSP+SPYPLLSPG+QF PP+TPNFAF  
Sbjct: 245 PNLPSPRMNGPPLLPSPTSQFLLPSPTGYMNLLSPRSPYPLLSPGVQF-PPMTPNFAFSP 304

Query: 303 MPPQSGILGPGPHPPPSPGVMFPLSPSGFFPILSPRWRDQ 321
           M  QSGILGPGP PPPSPG++FPLSPSGFFP  SPRWRDQ
Sbjct: 305 M-GQSGILGPGPQPPPSPGLVFPLSPSGFFPFPSPRWRDQ 336

BLAST of Lsi07G012000.1 vs. TrEMBL
Match: M5WA02_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009019mg PE=4 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 1.6e-109
Identity = 234/318 (73.58%), Positives = 255/318 (80.19%), Query Frame = 1

Query: 4   NRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPSQ 63
           NR N++LGVNK+GKNIRKSP+HQPNF NNAAR QPQPQ+YNISKNDFRNIVQQLTGSPSQ
Sbjct: 6   NRHNDHLGVNKIGKNIRKSPLHQPNFANNAARQQPQPQVYNISKNDFRNIVQQLTGSPSQ 65

Query: 64  DHQPPPRPPQNPPKPQSMRLQRIRPPPLTPIN-RPNMPAPIPAPVPVPPPQALVNNNVPR 123
           +  P PRPPQNPPKPQSMRLQRIRPPPLTPIN RP +P P  AP P  PP    NNN  R
Sbjct: 66  E--PLPRPPQNPPKPQSMRLQRIRPPPLTPINNRPVIPPP--APHPSAPPLVPYNNNFMR 125

Query: 124 PAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVPQPQ 183
             QF QP P  +PP  P GDS W N  AESPISAYMRYLQ+SM++P+P  NQAQ  PQPQ
Sbjct: 126 -TQFGQPSPTPMPPF-PHGDSMWAN-TAESPISAYMRYLQSSMLDPTPRGNQAQ--PQPQ 185

Query: 184 IPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPTSQFL 243
            PGQ     AP +GLLPNP+  +PA P PR+NGP PP PNLP P  NGPALLPSPTSQFL
Sbjct: 186 GPGQSQS-QAPSTGLLPNPS--MPAHPPPRMNGPVPPAPNLPHPPVNGPALLPSPTSQFL 245

Query: 244 LPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPSPGVMF 303
           LPSPTG+ NLLSP+SPYPLLSPG+QF PPLTPNF F  M  QSGILGPGP PPPSPG +F
Sbjct: 246 LPSPTGFMNLLSPRSPYPLLSPGMQFPPPLTPNFQFSPM-AQSGILGPGPQPPPSPGYLF 305

Query: 304 PLSPSGFFPILSPRWRDQ 321
           PLSPSGFFPI SPRWR+Q
Sbjct: 306 PLSPSGFFPISSPRWREQ 310

BLAST of Lsi07G012000.1 vs. TrEMBL
Match: A0A0B2PB10_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_024244 PE=4 SV=1)

HSP 1 Score: 395.2 bits (1014), Expect = 7.6e-107
Identity = 230/329 (69.91%), Positives = 250/329 (75.99%), Query Frame = 1

Query: 3   RNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPS 62
           +NR N++LGVNK+GKNIRKSP+HQPNF NNAAR QPQPQ+YNISKNDFR+IVQQLTGSPS
Sbjct: 5   KNRHNDSLGVNKLGKNIRKSPLHQPNFANNAARQQPQPQVYNISKNDFRDIVQQLTGSPS 64

Query: 63  QDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVPR 122
           QD  PPPRPP NPPKPQSMRLQ+IRPPPLTPINRP MP P+  P+P  PP    NN +PR
Sbjct: 65  QD--PPPRPPHNPPKPQSMRLQKIRPPPLTPINRPRMPPPM--PMPAAPPSVPYNNAIPR 124

Query: 123 PAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQ----FV 182
           PAQF QPP        P GD  W N   ESPISAYMRYLQNS+M+P    NQ Q      
Sbjct: 125 PAQFGQPP-------TPPGDI-WSN-TTESPISAYMRYLQNSIMDPGQRGNQVQPQPHPY 184

Query: 183 PQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPM-------PNLPSPHWNG- 242
           PQPQ+PG + P   P S LLPNP  P+P  PS R NGP PPM       P+LPSP  NG 
Sbjct: 185 PQPQVPGNVQPHPPPLSALLPNP--PIPMYPSLRFNGPVPPMNATNPPVPSLPSPQANGP 244

Query: 243 PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGP 302
           P LLPSPTSQFLLPSPTGY NLLSP+SPYPLLSPGIQF  PLTPNF F SM  Q GILGP
Sbjct: 245 PPLLPSPTSQFLLPSPTGYMNLLSPRSPYPLLSPGIQFPSPLTPNFPFSSM-AQPGILGP 304

Query: 303 GPHPPPSPGVMFPLSPSGFFPILSPRWRD 320
           GP PPPSPG+MFPLSPSGFFPI SPRWR+
Sbjct: 305 GPQPPPSPGLMFPLSPSGFFPISSPRWRE 317

BLAST of Lsi07G012000.1 vs. TAIR10
Match: AT1G32610.1 (AT1G32610.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 171.0 bits (432), Expect = 1.2e-42
Identity = 145/324 (44.75%), Positives = 180/324 (55.56%), Query Frame = 1

Query: 8   ENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPSQDHQP 67
           + LGVNK+GKNI+KSP+             PQPQ Y++S NDF +IVQQLT SPS++  P
Sbjct: 9   DQLGVNKIGKNIKKSPL-------------PQPQGYSMSNNDFTSIVQQLTDSPSRESLP 68

Query: 68  PPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVPRPAQFA 127
            P P +N  KPQ    Q+IRP     INRP +P P+ A               P     A
Sbjct: 69  QPLP-RNLLKPQ----QKIRPVGQIQINRPCVPPPVMAQ--------------PTHEFVA 128

Query: 128 QPPPRKLP----PMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQ----FVP 187
           +PP   LP    P++  GD    N  AES +S YMRY Q+S+ +  P  NQ Q       
Sbjct: 129 RPPMHPLPHGSQPIISHGDQFGSN-TAESSVSVYMRYRQSSLGDSGPNENQMQPSHDNQQ 188

Query: 188 QPQIPGQM--HPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPN--LPSPHWNGPALLP 247
           QPQ+ GQ   H  H+P      +  +  P LP+P+ +GPP  M N  LPSP +NG  +LP
Sbjct: 189 QPQVEGQAQSHNHHSPRFN---DSARNTPILPTPKFDGPPQQMHNNSLPSPRFNGRGILP 248

Query: 248 SPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTP-NFAFPSMPPQSGILGPGPHP 307
           +PTSQ+   SPT Y NLLSP+SP PLLS G+Q+ PPLTP N+ F SM  Q GILGPG  P
Sbjct: 249 TPTSQYRPQSPTAYRNLLSPRSPSPLLSTGVQYPPPLTPRNYTFSSM-DQPGILGPGTIP 289

Query: 308 PPSPGVMFPLSPSGFFPILSPRWR 319
            P        SP G  PI S RWR
Sbjct: 309 LPH------ASPFGVIPISSQRWR 289

BLAST of Lsi07G012000.1 vs. TAIR10
Match: AT2G35230.1 (AT2G35230.1 VQ motif-containing protein)

HSP 1 Score: 146.7 bits (369), Expect = 2.4e-35
Identity = 140/334 (41.92%), Positives = 172/334 (51.50%), Query Frame = 1

Query: 1   MDRNRQNENLGVNKMGKNIRKSPIHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQ 60
           MDR RQN++LGVN++GKNIRKSP+HQ  F    +N A P  Q QPQ+YNISKNDFR+IVQ
Sbjct: 1   MDRPRQNDHLGVNRIGKNIRKSPLHQSTFAASTSNGAAPRLQTQPQVYNISKNDFRSIVQ 60

Query: 61  QLTGSPSQDHQPPPRPPQNPP-KPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQA 120
           QLTGSPS++    PRPPQN   +PQ+ RLQRIRP PLT +NRP +P P  AP    P   
Sbjct: 61  QLTGSPSRESL--PRPPQNNSLRPQNTRLQRIRPSPLTQLNRPAVPLPSMAPPQSHP--- 120

Query: 121 LVNNNVPRPAQFAQPPPRKLP-------PMVPGGDSHWPNPAAESPISAYMRYLQNSMMN 180
                     QFA+ PP + P       PM+   D  W N  AESP+S YMRYLQ+S+ +
Sbjct: 121 ----------QFARQPPHQPPFPQTTQQPMMGHRDQFWSN-TAESPVSEYMRYLQSSLGD 180

Query: 181 PSPVANQAQ--FVPQPQIPGQMHPPHAPPSGLLP--NPNQPVPALPSPRLNGPPPPMPNL 240
             P ANQ Q     +P IPG    P+ P +   P    N+  P +P        P     
Sbjct: 181 SGPNANQMQPGHEQRPYIPGHEQRPYVPGNEQQPYMPGNEQRPYIPGHEQRSYMPAQ--- 240

Query: 241 PSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPP 300
            S   + P   P P  Q ++P P    N+  P  P   L P     P L P+   P   P
Sbjct: 241 -SQSQSQPQPQPQP-QQHMMPGPQPRMNMQGPLQPNQYLPP-----PGLVPS-PVPHNLP 300

Query: 301 QSGILGPGPHPPPSPGVMFPLSPSGFFPILSPRW 318
                 P P  P  P  MF     GF    SPR+
Sbjct: 301 SPRFNAPVPVTPTQPSPMFSQMYGGF---PSPRY 304

BLAST of Lsi07G012000.1 vs. TAIR10
Match: AT5G46780.1 (AT5G46780.1 VQ motif-containing protein)

HSP 1 Score: 112.5 bits (280), Expect = 5.0e-25
Identity = 121/321 (37.69%), Positives = 143/321 (44.55%), Query Frame = 1

Query: 2   DRNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP 61
           + +  + +LGVNKMGKNIRK P +Q N   N     PQ  +YNI+K DFR+IVQQLTG  
Sbjct: 13  NNDHHHHHLGVNKMGKNIRKDPPNQQNQQQN-----PQALVYNINKTDFRSIVQQLTGLG 72

Query: 62  SQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVP 121
           S     PP+   N PKP + RL ++RP PLT +N P  P P P PV   P   + +  V 
Sbjct: 73  STSSVNPPQT--NHPKPPNSRLVKVRPAPLTQLNHPPPPPPPPPPVQSVP---IASEPVQ 132

Query: 122 RPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVPQP 181
              QF+  P                   AESPISAYMRYL  S    SPV N+ Q  PQ 
Sbjct: 133 PVNQFSSNP-------------------AESPISAYMRYLIES----SPVGNRVQ--PQN 192

Query: 182 QIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPTSQF 241
           Q                 NP QP   L      GP              P    SP SQF
Sbjct: 193 Q-----------------NPVQPSTGLFQSHQTGP-------------NPMSFQSPASQF 237

Query: 242 LLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPH--PPPSPG 301
            L SP        P+SP+PL SP   FSP                 LG      PPPSPG
Sbjct: 253 AL-SP-------QPRSPFPLFSPNFAFSP---------------RFLGGSNESLPPPSPG 237

Query: 302 VMFPLSPSGFFPILSPRWRDQ 321
                    FFP+LSP W++Q
Sbjct: 313 F--------FFPLLSPLWKNQ 237

BLAST of Lsi07G012000.1 vs. TAIR10
Match: AT1G78310.1 (AT1G78310.1 VQ motif-containing protein)

HSP 1 Score: 83.2 bits (204), Expect = 3.2e-16
Identity = 99/266 (37.22%), Positives = 130/266 (48.87%), Query Frame = 1

Query: 37  QPQPQIYNISKNDFRNIVQQLTGSPSQDH-QPPPRPPQNPPKP-QSMRLQRIRPPPLTPI 96
           Q QP +YNI+KNDFR++VQ+LTGSP+ +    PP+ P + PKP QS RL RIRPPPL  +
Sbjct: 77  QHQPPVYNINKNDFRDVVQKLTGSPAHERISAPPQQPIHHPKPQQSSRLHRIRPPPL--V 136

Query: 97  NRPNMPAPIPAPVPVPPPQALVNNNVPRPAQFAQP--PPRKLPPMVPGGDSHWPNPAAES 156
           +  N P  +     +P     +N N        +P  P   LPP+ P       + AAES
Sbjct: 137 HVINRPPGLLNDALIPQGSHHMNQNWTGVGFNLRPTAPLSPLPPLPP------VHAAAES 196

Query: 157 PISAYMRYLQNSMMNPSPVANQAQFVPQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPR 216
           P+S+YMRYLQNSM   +  +N+ +F                 SGL P      P     +
Sbjct: 197 PVSSYMRYLQNSMF--AIDSNRKEF-----------------SGLSPLAPLVSPRWYQQQ 256

Query: 217 LNGPPPPMPNLPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPL 276
            N PP    + P PH   P+   S T    +P+P  +    SPKSPY LLSP I  SP  
Sbjct: 257 ENAPPSQHNSFPPPHPPPPSSAVSQTVPTSIPAPPLFGCSSSPKSPYGLLSPSILLSPS- 306

Query: 277 TPNFAFPSMPPQSGILGPGPHPPPSP 299
           +    FP        + P   P PSP
Sbjct: 317 SGQLGFP--------VSPTTVPLPSP 306

BLAST of Lsi07G012000.1 vs. NCBI nr
Match: gi|659095634|ref|XP_008448685.1| (PREDICTED: leucine-rich repeat extensin-like protein 3 [Cucumis melo])

HSP 1 Score: 594.3 bits (1531), Expect = 1.2e-166
Identity = 301/324 (92.90%), Positives = 311/324 (95.99%), Query Frame = 1

Query: 1   MDRNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGS 60
           MDRNRQNENLGVNK+GKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGS
Sbjct: 1   MDRNRQNENLGVNKLGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGS 60

Query: 61  PSQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNV 120
           PSQD+QPPPRPPQNPPK QSMRLQRIRPPPLTPINRPN+PAPIPAPVPVPPPQA+VNNNV
Sbjct: 61  PSQDNQPPPRPPQNPPKSQSMRLQRIRPPPLTPINRPNIPAPIPAPVPVPPPQAVVNNNV 120

Query: 121 PRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVPQ 180
           PRP QFAQPPPR+LPPM  GGDSHWPNPAAESPISAYMRYLQNSMMNPSPV NQAQFVPQ
Sbjct: 121 PRPPQFAQPPPRQLPPMALGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVGNQAQFVPQ 180

Query: 181 PQIPGQMHPPHAPPSGLL----PNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPS 240
           PQIPGQ+HPPHAPPSGLL    PNPN PVPALPSPRLNGPPPP+PN PSPHWNGPALLPS
Sbjct: 181 PQIPGQIHPPHAPPSGLLPNPNPNPNPPVPALPSPRLNGPPPPIPNFPSPHWNGPALLPS 240

Query: 241 PTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPP 300
           PTSQFLLPSPTGYYN+LSPKSPYPLLSPGIQF+PPLTPNFAFPSM PQSGILGPGPHPPP
Sbjct: 241 PTSQFLLPSPTGYYNMLSPKSPYPLLSPGIQFTPPLTPNFAFPSM-PQSGILGPGPHPPP 300

Query: 301 SPGVMFPLSPSGFFPILSPRWRDQ 321
           SPGV+FPLSPSG FPILSPRWRDQ
Sbjct: 301 SPGVLFPLSPSGIFPILSPRWRDQ 323

BLAST of Lsi07G012000.1 vs. NCBI nr
Match: gi|449462017|ref|XP_004148738.1| (PREDICTED: protein HAIKU1-like [Cucumis sativus])

HSP 1 Score: 587.0 bits (1512), Expect = 1.9e-164
Identity = 299/323 (92.57%), Positives = 309/323 (95.67%), Query Frame = 1

Query: 1   MDRNRQNENLGVNKMGKNIRKSPIHQPNFGNN-AARPQPQPQIYNISKNDFRNIVQQLTG 60
           MDRNRQNENLGVNK+GKNIRKSPIHQPNFGNN AARPQPQPQIYNISKNDFRNIVQQLTG
Sbjct: 1   MDRNRQNENLGVNKLGKNIRKSPIHQPNFGNNNAARPQPQPQIYNISKNDFRNIVQQLTG 60

Query: 61  SPSQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNN 120
           SPSQD+QPPPRPPQNPPK QSMRLQRIRPPPLTPINRPN+PAPIPAPVPVPPPQALVNNN
Sbjct: 61  SPSQDNQPPPRPPQNPPKSQSMRLQRIRPPPLTPINRPNIPAPIPAPVPVPPPQALVNNN 120

Query: 121 VPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVP 180
           VPRP QFAQPPPR+LPP+  GGDSHWPNPAAESPISAYMRYLQNSMMNPSPV NQAQF+P
Sbjct: 121 VPRPPQFAQPPPRQLPPVAMGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVGNQAQFIP 180

Query: 181 QPQIPGQMHPPHAPPSGLL--PNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSP 240
           Q Q+PGQMHPPHAPP GLL  PNPN PVPALPSPRLNGPPPP+PN PSPHWNGPALLPSP
Sbjct: 181 QSQVPGQMHPPHAPPPGLLPNPNPNPPVPALPSPRLNGPPPPIPNFPSPHWNGPALLPSP 240

Query: 241 TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPS 300
           TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQF+PPLTPNFAFPSM PQSGILGPGPHPPPS
Sbjct: 241 TSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFTPPLTPNFAFPSM-PQSGILGPGPHPPPS 300

Query: 301 PGVMFPLSPSGFFPILSPRWRDQ 321
           PGV+FPLSPSG FPILSPRWRDQ
Sbjct: 301 PGVLFPLSPSGIFPILSPRWRDQ 322

BLAST of Lsi07G012000.1 vs. NCBI nr
Match: gi|255548275|ref|XP_002515194.1| (PREDICTED: protein HAIKU1 [Ricinus communis])

HSP 1 Score: 409.1 bits (1050), Expect = 7.3e-111
Identity = 231/322 (71.74%), Positives = 254/322 (78.88%), Query Frame = 1

Query: 5   RQNENLGVNKMGKNIRKSPIHQPNFGNNAA---RPQPQPQIYNISKNDFRNIVQQLTGSP 64
           +QN+ LGVNK+GKNIRKSP+HQPNF NNA    R QPQPQ+YNISKNDFRNIVQQLTGSP
Sbjct: 8   QQNDPLGVNKLGKNIRKSPLHQPNFANNANNANRQQPQPQVYNISKNDFRNIVQQLTGSP 67

Query: 65  SQDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVPPPQALVNNNVP 124
           SQ+  P PRPPQNPPKPQSMRLQ+IRPPPLTPINRP++P P+PAP   PPP    NNN  
Sbjct: 68  SQE--PLPRPPQNPPKPQSMRLQKIRPPPLTPINRPHIPPPVPAPAVAPPPPVPFNNNFA 127

Query: 125 RPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQ---FV 184
           RP QF  P P  +PPM PG DS W N  AESPISAYMRYLQNS+M+PSP  NQAQ     
Sbjct: 128 RPGQFGHPSPTMMPPMAPG-DSAWAN-TAESPISAYMRYLQNSIMDPSPRGNQAQPSLQQ 187

Query: 185 PQPQIPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPT 244
            Q Q P  +HP   P SGLLPNP+  +PALPSPRLNGP P + NLPSP  NGPALLPSPT
Sbjct: 188 LQAQGPAYIHP-QPPSSGLLPNPH--MPALPSPRLNGPVPHVTNLPSPQMNGPALLPSPT 247

Query: 245 SQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPSP 304
           SQFLLPSPTGY NLLSP+SPYP  SPG+QF PPL  NF F  M  QSGILGPGP PPPSP
Sbjct: 248 SQFLLPSPTGYMNLLSPRSPYPFYSPGVQFPPPLAHNFTFSPM-AQSGILGPGPQPPPSP 307

Query: 305 GVMFPLSPSGFFPILSPRWRDQ 321
           G++FPLSP+GFFP+ SPRWRDQ
Sbjct: 308 GLVFPLSPTGFFPLSSPRWRDQ 321

BLAST of Lsi07G012000.1 vs. NCBI nr
Match: gi|590675613|ref|XP_007039497.1| (VQ motif-containing protein isoform 1 [Theobroma cacao])

HSP 1 Score: 408.3 bits (1048), Expect = 1.2e-110
Identity = 235/340 (69.12%), Positives = 260/340 (76.47%), Query Frame = 1

Query: 3   RNRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPS 62
           +NR N++LGVNK+GKNI+KSP+HQPNF NNAAR QPQPQ+YNISKNDFRNIVQQLTGSPS
Sbjct: 5   KNRHNDHLGVNKIGKNIKKSPLHQPNFANNAARQQPQPQVYNISKNDFRNIVQQLTGSPS 64

Query: 63  QDHQPPPRPPQNPPKPQSMRLQRIRPPPLTPINRPNMPAPIPAPVPVP-------PPQAL 122
           QD  P PRPPQNPPKPQSMRLQRIRPPPLTPINRP++P P+P PVP P       PP A 
Sbjct: 65  QD--PLPRPPQNPPKPQSMRLQRIRPPPLTPINRPHIPPPVPVPVPAPAHVPALVPPPAP 124

Query: 123 VNNNVPRPAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQA 182
            NN++ RP  +  P P  L PM+PG D  W N  AESPISAYMRYLQ S+++PSPV NQ 
Sbjct: 125 YNNSLVRPGHYGPPSPAMLHPMMPG-DVIWGN-TAESPISAYMRYLQTSLIDPSPVGNQV 184

Query: 183 QFVPQPQIPGQMHPPHAPPS-GLLPNPNQPV--------------PALPSPRLNGPPPPM 242
           Q    P +PGQ  P   PPS GLLPNP  PV              P +PSPR+ GP P M
Sbjct: 185 QPQLYPPVPGQ--PQALPPSSGLLPNPPMPVLPSPRGVNGPVPPMPNIPSPRMKGPVPSM 244

Query: 243 PNLPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPS 302
           PNLPSP  NGP LLPSPTSQFLLPSPTGY NLLSP+SPYPLLSPG+QF PP+TPNFAF  
Sbjct: 245 PNLPSPRMNGPPLLPSPTSQFLLPSPTGYMNLLSPRSPYPLLSPGVQF-PPMTPNFAFSP 304

Query: 303 MPPQSGILGPGPHPPPSPGVMFPLSPSGFFPILSPRWRDQ 321
           M  QSGILGPGP PPPSPG++FPLSPSGFFP  SPRWRDQ
Sbjct: 305 M-GQSGILGPGPQPPPSPGLVFPLSPSGFFPFPSPRWRDQ 336

BLAST of Lsi07G012000.1 vs. NCBI nr
Match: gi|645265665|ref|XP_008238256.1| (PREDICTED: vegetative cell wall protein gp1 [Prunus mume])

HSP 1 Score: 406.8 bits (1044), Expect = 3.6e-110
Identity = 236/318 (74.21%), Positives = 255/318 (80.19%), Query Frame = 1

Query: 4   NRQNENLGVNKMGKNIRKSPIHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPSQ 63
           NR N++LGVNK+GKNIRKSP+HQPNF NNAAR QPQPQ+YNISKNDFRNIVQQLTGSPSQ
Sbjct: 6   NRHNDHLGVNKIGKNIRKSPLHQPNFANNAARQQPQPQVYNISKNDFRNIVQQLTGSPSQ 65

Query: 64  DHQPPPRPPQNPPKPQSMRLQRIRPPPLTPIN-RPNMPAPIPAPVPVPPPQALVNNNVPR 123
           +  P PRPPQNPPKPQSMRLQRIRPPPLTPIN RP +P P  AP P  PP    NNN  R
Sbjct: 66  E--PLPRPPQNPPKPQSMRLQRIRPPPLTPINNRPVIPPP--APHPSAPPPVPYNNNFMR 125

Query: 124 PAQFAQPPPRKLPPMVPGGDSHWPNPAAESPISAYMRYLQNSMMNPSPVANQAQFVPQPQ 183
           P QF QP P  +PP  P GDS W N  AESPISAYMRYLQ+SM++P+P  NQAQ  PQPQ
Sbjct: 126 P-QFGQPSPTPMPPF-PHGDSMWAN-TAESPISAYMRYLQSSMLDPTPRGNQAQ--PQPQ 185

Query: 184 IPGQMHPPHAPPSGLLPNPNQPVPALPSPRLNGPPPPMPNLPSPHWNGPALLPSPTSQFL 243
            PGQ     AP +GLLPNP+  +PA P PR+NGP PP PNLP P  N PALLPSPTSQFL
Sbjct: 186 GPGQSQS-QAPSTGLLPNPS--MPAHPPPRMNGPVPPAPNLPHPPVNAPALLPSPTSQFL 245

Query: 244 LPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPSMPPQSGILGPGPHPPPSPGVMF 303
           LPSPTGY NLLSP+SPYPLLSPG+QF PPLTPNF F  M  QSGILGPGP PPPSPG +F
Sbjct: 246 LPSPTGYMNLLSPRSPYPLLSPGMQFPPPLTPNFQFSPM-AQSGILGPGPQPPPSPGYLF 305

Query: 304 PLSPSGFFPILSPRWRDQ 321
           PLSPSGFFPI SPRWRDQ
Sbjct: 306 PLSPSGFFPISSPRWRDQ 310

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value    Identity (%)    Description
IKU1_ARATH    4.2e-34    41.92    Protein HAIKU1 OS=Arabidopsis thaliana GN=IKU1 PE=1 SV=1
VQ9_ARATH    5.7e-15    37.22    VQ motif-containing protein 9 OS=Arabidopsis thaliana GN=VQ9 PE=1 SV=1
PERK2_ARATH    2.8e-06    34.80    Proline-rich receptor-like protein kinase PERK2 OS=Arabidopsis thaliana GN=PERK2...

TrEMBL
Match Name    E-value    Identity (%)    Description
A0A0A0L576_CUCSA    1.4e-164    92.57    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G011640 PE=4 SV=1
B9RN55_RICCO    5.1e-111    71.74    LRX2, putative OS=Ricinus communis GN=RCOM_1343860 PE=4 SV=1
A0A061GAC9_THECC    8.6e-111    69.12    VQ motif-containing protein isoform 1 OS=Theobroma cacao GN=TCM_015718 PE=4 SV=1
M5WA02_PRUPE    1.6e-109    73.58    Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009019mg PE=4 SV=1
A0A0B2PB10_GLYSO    7.6e-107    69.91    Uncharacterized protein OS=Glycine soja GN=glysoja_024244 PE=4 SV=1

TAIR10
Match Name    E-value    Identity (%)    Description
AT1G32610.1    1.2e-42    44.75    hydroxyproline-rich glycoprotein family protein
AT2G35230.1    2.4e-35    41.92    VQ motif-containing protein
AT5G46780.1    5.0e-25    37.69    VQ motif-containing protein
AT1G78310.1    3.2e-16    37.22    VQ motif-containing protein

NCBI nr
Match Name    E-value    Identity (%)    Description
gi|659095634|ref|XP_008448685.1|    1.2e-166    92.90    PREDICTED: leucine-rich repeat extensin-like protein 3 [Cucumis melo]
gi|449462017|ref|XP_004148738.1|    1.9e-164    92.57    PREDICTED: protein HAIKU1-like [Cucumis sativus]
gi|255548275|ref|XP_002515194.1|    7.3e-111    71.74    PREDICTED: protein HAIKU1 [Ricinus communis]
gi|590675613|ref|XP_007039497.1|    1.2e-110    69.12    VQ motif-containing protein isoform 1 [Theobroma cacao]
gi|645265665|ref|XP_008238256.1|    3.6e-110    74.21    PREDICTED: vegetative cell wall protein gp1 [Prunus mume]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term    Definition
IPR008889    VQ
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name    Type
Lsi07G012000    Lsi07G012000    gene


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name    Type
Lsi07G012000.1.CDS.1    Lsi07G012000.1.CDS.1    CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name    Type
Lsi07G012000.1    Lsi07G012000.1-protein    polypeptide


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR008889    VQ    PFAM    PF05678    VQ    coord: 38..63, score: 2.
None    No IPR available    PANTHER    PTHR33783    FAMILY NOT NAMED    coord: 212..320, score: 2.8E-103; coord: 2..184, score: 2.8E
None    No IPR available    PANTHER    PTHR33783:SF2    SUBFAMILY NOT NAMED    coord: 2..184, score: 2.8E-103; coord: 212..320, score: 2.8E