Lsi10G002440.1 (mRNA) Bottle gourd (USVL1VR-Ls)

NameLsi10G002440.1
TypemRNA
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionHydroxyproline-rich glycoprotein
Locationchr10 : 4053986 .. 4057006 (-)
Sequence length1251
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGAATCTGAAAAACGAAGGGAGAGACTGAGAGCAATGCGAATGGAAGCTGCTCAGGCCGATGTAGCTAATTATGTCGAAACTTCTCTGCCTAATCATCTTTCTAATCCATTGGTCGAGTCCTCAACTACCACGGTAGGGCAATTAGCACCATGTACAGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTACTAGCAAGAAGAAAGGGAAGATTGAGAATCAGCCTGTGTCTGATAATTACGTTCCTTACTACCCCAATACTTCTTCAGCAACTCATCTTCCACCAAAATTTTCAGGTAATTTTATGTTTTCTTTAATAACTAATGCATGCAAATATATGAGGGATCATGTAATGGGAATGGGATATGAAGTTACTTGTTTGTTGATGGATATTTATTTTAAAAATGTCCAATTGCTTATAGAGGGGTCAAATATGGCATAATGTGTCGGATAAATTTTTTCCCCCTCCATAGAAACATGTTACAGATAAAATAACTTTACTAACAATGAAGTGGTGTAATCTGTAAACAAAAACTAGTGAAAGATGCTCCAACTTGAGGTTGTAATATGAGCTAAGTCACCGTAACTGTCCATGGCTTTGTTCTTTTGTTGTTATTCTCTGGTTCCTTTCTAATCAAAGCAAGCCACAAAACAACTCTGACTGCACACTTCCATATAATTTTAGCTTTTCTTTTTTATATGGTTTGGAAGGCAAAAAGGTAAGTAAAATGATCCATACATATAGGATTAGCCGTGGTAGTAATGACAATGAAGGCATGTATTGAGTTGCAGTAATTTTGCATGGCATGTTGACAAATTCCAAGGGGTGGGACACTATCGCTTGAGCGAGTGTGAGTTTGTTGAAATGCTGTGTTGCTTTTGCAGATAGTACATTTTTTCACTAGGTAGGGTTATTTGATTAAACTCTACCATTCAACACCCAACACTAACTTACTAGGAGCTTAAGTGTCAACATTTTTAAAGCTTTTAGGTTTAGAAATGGTTGTGAAGATGCTATTAGGAATCAGAAGTGTTTTTGGTTATGAACAATTAATTAAGGAGCCTGCATCACGGAGCAGCTCAGGAACAAACATGAGTCCTCAAGATTATGGCTTTTGGTATGCCTCTACTACTGTGATAAAGACCAGCTATTCTTTCAAAACCCCAAGAAACAACGAGACATTTGTAATTTTACCTGCATGTTTCAATCAAATAACATAAAAAGAAATCTACGGGCTATTTGTTTCAATGGAGACACCCTATAACCTTATTTAGAAGAAAGCAGTCCTGTTGTATCCAATACATGTACAGCTGTACTCCCTACTTGAAGTGTTGAAGCAAGTTCCCAAAAATAAGGTGGCTTGGCTCCACTCATGTTGGACATTCCCGTAGAAAATTTTGCATTGCTTTCTTAATCAAAGCCGTTGGATTGGAAAGGATTCTAAAAGATATATTGTACTTGTGAGAATTAATGATAAAAGGTAGTCTTTAGGGGGCTTTACAGTTCTTAGAGATACATTACTTGTGAGAAACTTAATGATAAAAGTTAATCTCTCTTCTTTACAGTCCTTGTTTGTTTGATTAGCTAATATTAAATGTCTAGAATATTAAAGATTTGCAATCAGGTATGTTTCAATTAGTGTCTCGGAACAAATGTTTGTGTTTGTTTCGCAAGCTTTCCTTAGGGGAATTAAATTTGGTTATAATACAAATGCACATCACTCTCTCTTATGCAAAATATGCTCGTGTAATTTGATTTTAGTTCCTAATTCCTCCCTATGGTTTGATGCCGTACGATCGTGGAAGCACTATGTTTTAAACAATAAGTGACAGGGTATTATAGTCTTGTTTAAAGTAGGAGGGGTAATATTTCCTTCGTATCGGGCTTTATGTGATGTTATTGGTGGAATTGAGAAGGATCCCTTCGGGTGTTCAGTTATTATTTTTGTTAATGCTTGTTATGTTGGATTTTATTCCACATCCCGGGTGTGGTCTTTCATCCAGCATTTTGTAATTTGTAGCTAGGGAATCACCTTTTCAAATCTAAATATACTATTGTGTTGTAGGATTGAGAAACCCTGAAATGTCTCCCTCTTCTACTCATCAATTCCATCAATATTCACCTGATCAGAGAACGTTCTATGCACGAGGTTTTAGTGGATCTGGTGGCCATGGTAGACCAGGAATGCCCAGACCTTTTCCTATGGATCAAGGAGCTCCTCATATGTGGCGTGGACCGAGAAGGCCATTTGTCAACCAGTTCCCTAGCCATCCACCGTGGGAGATGAGCTCCCCCAGCCATGTCTCTGGACCAAGAGGTAACTCGTACACCAATCCTACTCAAGACAGGGCTAATTACCATAGTTCAAGCCCTAGTCCAGGGTACCAAGGCAGTTTTAGTCCAGGTGGAGACAGCCATGGACATCATAACAATATGACCCCTAGTCCAAGATTTGGCTCTGGACGAGGTACTGGTTCTCATGGTCGTCATTCTTCATTTGACAAATCACCTGGACCAGAACAATTTTACAATGCCTCCATGCTTGAAGATCCTTGGAAGGTTCTGCAACCTTGTATTTGGACGACAATTCCTCCATTGAGTAATTCTGCAAAACCTTTGGAATCTTGGATTTCCAAGTTTGGTACAAAGAAAGCAAGAGTTTCAGATTCTTCTTCTGGCAGGTCAAGCTCTCAACCTAGCCTCGCCGAGTACCTGGCTGCCTCTTTCAAGGAAGCAGTCGAGGATGCACCAAGTGTGTGAAAGTGACATTTTATCCAGGCATTTCATTTTTACTGTAATACTCAAATTCAGTATTCTCTAAATTCCAGCCTAATTTTCCTCTATAGACTGGTTTTATGTAGTAAATCTACAGTTAATGGATATATTTTATCAATCTTGTTTAGATTATGTACATAATTTAGCTTCTTCCTTTTAACATTTGTTGTCAAATGCTCTTCTTTGTTAAGCATTTAATAGGATTGAGTTCCTCATGGTTGAACCACT

mRNA sequence

ATGGAAGAATCTGAAAAACGAAGGGAGAGACTGAGAGCAATGCGAATGGAAGCTGCTCAGGCCGATGTAGCTAATTATGTCGAAACTTCTCTGCCTAATCATCTTTCTAATCCATTGGTCGAGTCCTCAACTACCACGGTAGGGCAATTAGCACCATGTACAGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTACTAGCAAGAAGAAAGGGAAGATTGAGAATCAGCCTGTGTCTGATAATTACGTTCCTTACTACCCCAATACTTCTTCAGCAACTCATCTTCCACCAAAATTTTCAGGATTGAGAAACCCTGAAATGTCTCCCTCTTCTACTCATCAATTCCATCAATATTCACCTGATCAGAGAACGTTCTATGCACGAGGTTTTAGTGGATCTGGTGGCCATGGTAGACCAGGAATGCCCAGACCTTTTCCTATGGATCAAGGAGCTCCTCATATGTGGCGTGGACCGAGAAGGCCATTTGTCAACCAGTTCCCTAGCCATCCACCGTGGGAGATGAGCTCCCCCAGCCATGTCTCTGGACCAAGAGGTAACTCGTACACCAATCCTACTCAAGACAGGGCTAATTACCATAGTTCAAGCCCTAGTCCAGGGTACCAAGGCAGTTTTAGTCCAGGTGGAGACAGCCATGGACATCATAACAATATGACCCCTAGTCCAAGATTTGGCTCTGGACGAGGTACTGGTTCTCATGGTCGTCATTCTTCATTTGACAAATCACCTGGACCAGAACAATTTTACAATGCCTCCATGCTTGAAGATCCTTGGAAGGTTCTGCAACCTTGTATTTGGACGACAATTCCTCCATTGAGTAATTCTGCAAAACCTTTGGAATCTTGGATTTCCAAGTTTGGTACAAAGAAAGCAAGAGTTTCAGATTCTTCTTCTGGCAGGTCAAGCTCTCAACCTAGCCTCGCCGAGTACCTGGCTGCCTCTTTCAAGGAAGCAGTCGAGGATGCACCAAGTGTGTGAAAGTGACATTTTATCCAGGCATTTCATTTTTACTGTAATACTCAAATTCAGTATTCTCTAAATTCCAGCCTAATTTTCCTCTATAGACTGGTTTTATGTAGTAAATCTACAGTTAATGGATATATTTTATCAATCTTGTTTAGATTATGTACATAATTTAGCTTCTTCCTTTTAACATTTGTTGTCAAATGCTCTTCTTTGTTAAGCATTTAATAGGATTGAGTTCCTCATGGTTGAACCACT

Coding sequence (CDS)

ATGGAAGAATCTGAAAAACGAAGGGAGAGACTGAGAGCAATGCGAATGGAAGCTGCTCAGGCCGATGTAGCTAATTATGTCGAAACTTCTCTGCCTAATCATCTTTCTAATCCATTGGTCGAGTCCTCAACTACCACGGTAGGGCAATTAGCACCATGTACAGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTACTAGCAAGAAGAAAGGGAAGATTGAGAATCAGCCTGTGTCTGATAATTACGTTCCTTACTACCCCAATACTTCTTCAGCAACTCATCTTCCACCAAAATTTTCAGGATTGAGAAACCCTGAAATGTCTCCCTCTTCTACTCATCAATTCCATCAATATTCACCTGATCAGAGAACGTTCTATGCACGAGGTTTTAGTGGATCTGGTGGCCATGGTAGACCAGGAATGCCCAGACCTTTTCCTATGGATCAAGGAGCTCCTCATATGTGGCGTGGACCGAGAAGGCCATTTGTCAACCAGTTCCCTAGCCATCCACCGTGGGAGATGAGCTCCCCCAGCCATGTCTCTGGACCAAGAGGTAACTCGTACACCAATCCTACTCAAGACAGGGCTAATTACCATAGTTCAAGCCCTAGTCCAGGGTACCAAGGCAGTTTTAGTCCAGGTGGAGACAGCCATGGACATCATAACAATATGACCCCTAGTCCAAGATTTGGCTCTGGACGAGGTACTGGTTCTCATGGTCGTCATTCTTCATTTGACAAATCACCTGGACCAGAACAATTTTACAATGCCTCCATGCTTGAAGATCCTTGGAAGGTTCTGCAACCTTGTATTTGGACGACAATTCCTCCATTGAGTAATTCTGCAAAACCTTTGGAATCTTGGATTTCCAAGTTTGGTACAAAGAAAGCAAGAGTTTCAGATTCTTCTTCTGGCAGGTCAAGCTCTCAACCTAGCCTCGCCGAGTACCTGGCTGCCTCTTTCAAGGAAGCAGTCGAGGATGCACCAAGTGTGTGA

Protein sequence

MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDYYTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFHQYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSSPSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGTGSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKKARVSDSSSGRSSSQPSLAEYLAASFKEAVEDAPSV
BLAST of Lsi10G002440.1 vs. TrEMBL
Match: Q6E437_CUCME (ACT11D09.5 OS=Cucumis melo GN=ACT11D0.5 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 3.5e-155
Identity = 280/327 (85.63%), Postives = 294/327 (89.91%), Query Frame = 1

Query: 9   ERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDYYTNPMAAF 68
           ERLRAMRMEAAQADV NY+ETSLPNHLSNPLVESS T VGQLAPCTAPRFDYYTNPMAAF
Sbjct: 242 ERLRAMRMEAAQADVVNYIETSLPNHLSNPLVESSATMVGQLAPCTAPRFDYYTNPMAAF 301

Query: 69  STSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFHQYSPDQRT 128
           STSKKKGKIENQPVSD +VPY+ NTSS T+LPP F GLRNPEMSPSSTHQFHQYSPDQRT
Sbjct: 302 STSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQFHQYSPDQRT 361

Query: 129 FYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSSPSHVSGPR 188
           FYARG S +GGHG PGMPRP+ ++QG PHMWRGPRRPFVNQFP+HPP EM+S SHVSGPR
Sbjct: 362 FYARGDSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQFPTHPPREMNSSSHVSGPR 421

Query: 189 GNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGTGSHGRHSS 248
           GNSYTNPTQDRA Y SSSP+PG+ GS SPG  SHGHH NMTPSPRFG GRGTG HGRHS 
Sbjct: 422 GNSYTNPTQDRAKYRSSSPNPGFHGSLSPGRGSHGHHGNMTPSPRFGYGRGTGFHGRHSL 481

Query: 249 FDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKKARVSDSSS 308
            DKS GPEQFYN SMLEDPWKVLQPCIWTTI   SNSAKP ESWISKFGTKKARVSDSSS
Sbjct: 482 LDKS-GPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSAKPSESWISKFGTKKARVSDSSS 541

Query: 309 GRSSS-QPSLAEYLAASFKEAVEDAPS 335
           GRSSS QPSLAEYLAASFKEA+EDAP+
Sbjct: 542 GRSSSQQPSLAEYLAASFKEAIEDAPN 567

BLAST of Lsi10G002440.1 vs. TrEMBL
Match: A0A0A0LQW8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G435470 PE=4 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 2.8e-136
Identity = 260/335 (77.61%), Postives = 277/335 (82.69%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDY 60
           MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESS T VGQLAPCTAPRFDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60

Query: 61  YTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFH 120
           YTNPMAAFSTSKKKGKIENQPVSDN+VPY+ NTSS T+ PP F G               
Sbjct: 61  YTNPMAAFSTSKKKGKIENQPVSDNFVPYHHNTSSTTYFPPTFPG--------------- 120

Query: 121 QYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSS 180
                         S +GGHGRPGMPRP+ ++QG  HMWRGPR PFVNQFP+ PP EM+S
Sbjct: 121 -------------DSEAGGHGRPGMPRPYAVNQGDLHMWRGPRGPFVNQFPTQPPREMNS 180

Query: 181 PSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGT 240
           PSHVSGPRGN YTNPTQ+RANY SSSP+PG++GSFSPG  S+GHH NMTPSPRFG GR T
Sbjct: 181 PSHVSGPRGNPYTNPTQNRANYRSSSPNPGFRGSFSPGRGSYGHHGNMTPSPRFGYGRAT 240

Query: 241 GSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKK 300
           GSHGRHSS DKS GPEQFYN SMLEDPWKVLQPCIWTTI PLSNSAKP E WISKFGTKK
Sbjct: 241 GSHGRHSSSDKS-GPEQFYNISMLEDPWKVLQPCIWTTIAPLSNSAKPSEYWISKFGTKK 300

Query: 301 ARVSDSSSGRSSS-QPSLAEYLAASFKEAVEDAPS 335
           ARVSDSSS RSSS QPSLAEYLAASFKEA+E+AP+
Sbjct: 301 ARVSDSSSSRSSSQQPSLAEYLAASFKEAIEEAPN 306

BLAST of Lsi10G002440.1 vs. TrEMBL
Match: E0CRX4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g07660 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 9.8e-49
Identity = 154/348 (44.25%), Postives = 199/348 (57.18%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETS-LPNHLSNPLVESSTTTVGQLAPCTAPRFD 60
           MEESEKRRERL+AMRMEAAQ  V++ V+TS +P +LSNPLVE S T   Q   C  PRFD
Sbjct: 1   MEESEKRRERLKAMRMEAAQTKVSDTVDTSAMPGYLSNPLVEGSATLPVQEDSCVTPRFD 60

Query: 61  YYTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFS---GLRNPEMSPSST 120
           +YT+PM+AFS++K++ K+ NQ   D   P   +  +AT      S   G RN EM+PS  
Sbjct: 61  FYTDPMSAFSSNKRRSKVGNQIQQDYLTPSSNSGYTATMARMSSSLSAGPRNCEMTPSPN 120

Query: 121 HQFH-QYSPDQRTFYARGFSGSGGHGRPG--MPRPFPMDQGAPHMWRGPRRPFVNQFPSH 180
             F   +SP Q    A+G   S G  R    M  PFP  QG P +W G         PS+
Sbjct: 121 PPFQPNFSPGQGINQAQGLYHSSGPYRSPIEMASPFPAHQGTPGVWNGSNGMPRYGVPSN 180

Query: 181 PPWEMSSPSHVSGPRGNSYTNPTQDRANYHSSSPSP--GYQGSFSPG---GDSHGHHNNM 240
            P   + PS    P G+      + R ++ ++SPSP  G  GS SP    G S    N+M
Sbjct: 181 SPRGGNFPSPGFRPVGSPSFRSGRGRGHWFNNSPSPVSGRGGSSSPNSGRGRSGWFGNSM 240

Query: 241 TPSPRFGSGRGTGSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWT---TIPPLSNS 300
           +P    G GRG G H   S+ D+   PE FYN SM+EDPWK L+P IW+    +  + N+
Sbjct: 241 SPGSGRGRGRGLGFHAHVSAQDR---PELFYNKSMVEDPWKFLKPVIWSREKALGKMGNA 300

Query: 301 AKPLESWISK-FGTKKARVSDSSSGRSSSQPSLAEYLAASFKEAVEDA 333
           +   +SW+ K    KK RVS++++  SSSQ SLAEYLAASF EAV DA
Sbjct: 301 SDSPKSWLPKSINMKKTRVSEATN-ESSSQQSLAEYLAASFNEAVNDA 344

BLAST of Lsi10G002440.1 vs. TrEMBL
Match: W9SMI8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000896 PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.4e-47
Identity = 155/351 (44.16%), Postives = 200/351 (56.98%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVA--NYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRF 60
           MEESEKRRERLRAMR EAA   V   N    ++P +LSNPLVE+S             RF
Sbjct: 1   MEESEKRRERLRAMRHEAAAQSVNSDNNEAPAMPCYLSNPLVETSAAAPPPEQSHGTSRF 60

Query: 61  DYYTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSAT-HLPPKFSGLRNPEMSPSSTH 120
           D+YT+PMAAFS +K++    + P+S ++V    N+ S     P  FSG R   MSP+  H
Sbjct: 61  DFYTDPMAAFSANKRRNNTSD-PISSHHVTPPANSGSPMLRSPSPFSGPRYAGMSPA--H 120

Query: 121 QFHQ-YSPDQRTFYARGFSGS--GGHGRPGMPRPFPMDQGA--PHMWRGPRRPFVNQFPS 180
           QF   YSP+ R +  +GF        G  GM RPF M QG   P +  G    + N FPS
Sbjct: 121 QFQSNYSPNPRMYQPQGFGHDPISQSGELGMSRPFNMHQGNMDPSIGPGSAAGYYN-FPS 180

Query: 181 HPPWEMSSPSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPG-----GDSHGHHNN 240
           + P     PS   GP G S+ N  Q RA++H+ SP+PG     SP      G    H  +
Sbjct: 181 NQPRGSRFPSPRIGPTG-SFFNAGQGRAHWHNHSPNPGLGRGGSPSPSLGRGGGRWHGGS 240

Query: 241 MTPSPRFGSGRGTGSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTI-PPLSNSA 300
            +P      GRG GS GRH + D+  GPE+FY+ SM+ED WK L+P +W  +   LS+ +
Sbjct: 241 TSPGSGRRGGRGPGSAGRHFTMDRQLGPERFYDESMIEDAWKFLEPVVWREVDASLSSLS 300

Query: 301 KP--LESWISK-FGTKKARVSDSSSGRSSSQPSLAEYLAASFKEAVEDAPS 335
            P   +SWI++  G KKA+VSDS+S +S SQPSLAEYLAASF EA +D  S
Sbjct: 301 TPDSSKSWITRSLGAKKAKVSDSTS-KSGSQPSLAEYLAASFDEANKDESS 345

BLAST of Lsi10G002440.1 vs. TrEMBL
Match: A0A061FEH0_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_034520 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 5.6e-44
Identity = 149/372 (40.05%), Postives = 200/372 (53.76%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVET-SLPNHLSNPLVESSTTTVGQLAPCTAPRFD 60
           M+ESEKR+ERL+AMR+EAAQ++V N V T S+P HLSNPL E+S+T   Q   C+ PRFD
Sbjct: 1   MDESEKRKERLKAMRLEAAQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFD 60

Query: 61  YYTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSS--ATHLPPKFSGLRNPEMSPSSTH 120
           YYT+PMAAFS +KK+GK +NQ   + + P  P TS      + P   G RN +M+P   H
Sbjct: 61  YYTDPMAAFSANKKRGKADNQSTQNYFTP--PTTSGWPVARVSPSHPGPRNYDMNPPVRH 120

Query: 121 QFHQYSPDQRTFYARG-FSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPW 180
              QYS DQR ++ +G  S    H  P    P  M  G    W G  + F N + S    
Sbjct: 121 MQSQYSLDQRMYHQQGPHSNFAAHRSPITRSPSHMHHGNSDAWNG-SQAFGNYYSSASD- 180

Query: 181 EMSSPSHVSG-PRGNSYTNP---TQDRANYHSSSPSPGYQ--------------GSF--- 240
              SP  + G P  +  T P       A+ +S+SP+PG+               G++   
Sbjct: 181 --GSPGGMFGTPLMHPGTTPRFWNPSNASRYSNSPTPGFSPADIPYGRGRPQQFGNYPLP 240

Query: 241 SPG-----------GDSHGHHNNMTPSPRFGSGRGTGSHGRHSSFDKSPGPEQFYNASML 300
           SPG           G   G+  ++T       GRG G HG  S+ ++  GPE FY+ SML
Sbjct: 241 SPGHGGSLGLSSGRGRGRGYGGSITHGIGRSGGRGLGFHGHSSASNRMMGPESFYDESML 300

Query: 301 EDPWKVLQPCIW----TTIPPLSNSAKPLESWISK-FGTKKARVSDSSSGRSSSQPSLAE 332
           EDPW+ L+P +W      +  LSN      SW  K    KK +VS++S+ + +SQ SLAE
Sbjct: 301 EDPWQHLKPVLWRRREAGMDSLSNPDSS-NSWFPKSISAKKVKVSEASN-KFNSQLSLAE 360

BLAST of Lsi10G002440.1 vs. TAIR10
Match: AT4G24500.1 (AT4G24500.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 123.6 bits (309), Expect = 2.3e-28
Identity = 123/348 (35.34%), Postives = 168/348 (48.28%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQA---DVANYVETSLPN-HLSNPLVESSTTTVGQLAPCTAP 60
           ME+SEKR++ L+AMRMEAA     D     ETS+   HLSNPL E+S     Q       
Sbjct: 1   MEDSEKRKQMLKAMRMEAAAQNDDDATTGTETSMSTGHLSNPLAETSNH---QQDSFETQ 60

Query: 61  RFDYYTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSST 120
           RFDYYT+PMAA+S+ KK    + Q +S       P+   ++ +PP+F     P + P S 
Sbjct: 61  RFDYYTDPMAAYSSFKKNKTPKQQYISS------PSHQGSSPVPPQFP----PSVPPGSL 120

Query: 121 HQFHQYSPDQRTFYARGFSGSGGHGRP-GMPRPFPMDQGAPHMWRGP-RRPFVNQFPSHP 180
              +Q   +   F+A        H  P GM    P  +G P  W    R P VN   S P
Sbjct: 121 CSEYQAQTNHGGFHA-------AHYEPRGMAHLSPSHRGPPAGWNNNFRPPPVNH--SGP 180

Query: 181 PWEMSSPSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRF 240
           P  +  P   S    N   N    R +Y+++ P       FS  G  + +    T  P  
Sbjct: 181 PQWVPRPFPFSQEMPNMGNNRFGGRGSYNNTPPQ------FSNYGRQNANWGGNT-YPNS 240

Query: 241 GSGRGTGSHGRHSSFDKS-------PGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAK 300
           G GR  G  G ++SF +        PG E+FY+ SM EDPWK L+P +W      S+S+ 
Sbjct: 241 GRGRSRG-RGMNTSFGRDGGRRPMEPGAERFYSNSMAEDPWKHLKPVLWKNCSDASSSSS 300

Query: 301 PLESWISK-FGTKKARVSDSSSGRSSSQPSLAEYLAASFKEAVEDAPS 335
             ++W+ K    KK+  S+++   SS+Q SLAEYLAAS   A  D  S
Sbjct: 301 TGQAWLPKSIAPKKSVTSEATHKTSSNQQSLAEYLAASLDGATCDESS 318

BLAST of Lsi10G002440.1 vs. NCBI nr
Match: gi|659118496|ref|XP_008459151.1| (PREDICTED: uncharacterized protein LOC103498353 isoform X1 [Cucumis melo])

HSP 1 Score: 568.5 bits (1464), Expect = 7.5e-159
Identity = 288/335 (85.97%), Postives = 302/335 (90.15%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDY 60
           MEESEKRRERLRAMRMEAAQADV NY+ETSLPNHLSNPLVESS T VGQLAPCTAPRFDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVVNYIETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60

Query: 61  YTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFH 120
           YTNPMAAFSTSKKKGKIENQPVSD +VPY+ NTSS T+LPP F GLRNPEMSPSSTHQFH
Sbjct: 61  YTNPMAAFSTSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQFH 120

Query: 121 QYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSS 180
           QYSPDQRTFYARG S +GGHG PGMPRP+ ++QG PHMWRGPRRPFVNQFP+HPP EM+S
Sbjct: 121 QYSPDQRTFYARGDSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQFPTHPPREMNS 180

Query: 181 PSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGT 240
            SHVSGPRGNSYTNPTQDRA Y SSSP+PG+ GS SPG  SHGHH NMTPSPRFG GRGT
Sbjct: 181 SSHVSGPRGNSYTNPTQDRAKYRSSSPNPGFHGSLSPGRGSHGHHGNMTPSPRFGYGRGT 240

Query: 241 GSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKK 300
           G HGRHS  DKS GPEQFYN SMLEDPWKVLQPCIWTTI   SNSAKP ESWISKFGTKK
Sbjct: 241 GFHGRHSLLDKS-GPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSAKPSESWISKFGTKK 300

Query: 301 ARVSDSSSGRSSS-QPSLAEYLAASFKEAVEDAPS 335
           ARVSDSSSGRSSS QPSLAEYLAASFKEA+EDAP+
Sbjct: 301 ARVSDSSSGRSSSQQPSLAEYLAASFKEAIEDAPN 334

BLAST of Lsi10G002440.1 vs. NCBI nr
Match: gi|46095228|gb|AAS80151.1| (ACT11D09.5 [Cucumis melo])

HSP 1 Score: 555.8 bits (1431), Expect = 5.0e-155
Identity = 280/327 (85.63%), Postives = 294/327 (89.91%), Query Frame = 1

Query: 9   ERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDYYTNPMAAF 68
           ERLRAMRMEAAQADV NY+ETSLPNHLSNPLVESS T VGQLAPCTAPRFDYYTNPMAAF
Sbjct: 242 ERLRAMRMEAAQADVVNYIETSLPNHLSNPLVESSATMVGQLAPCTAPRFDYYTNPMAAF 301

Query: 69  STSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFHQYSPDQRT 128
           STSKKKGKIENQPVSD +VPY+ NTSS T+LPP F GLRNPEMSPSSTHQFHQYSPDQRT
Sbjct: 302 STSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQFHQYSPDQRT 361

Query: 129 FYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSSPSHVSGPR 188
           FYARG S +GGHG PGMPRP+ ++QG PHMWRGPRRPFVNQFP+HPP EM+S SHVSGPR
Sbjct: 362 FYARGDSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQFPTHPPREMNSSSHVSGPR 421

Query: 189 GNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGTGSHGRHSS 248
           GNSYTNPTQDRA Y SSSP+PG+ GS SPG  SHGHH NMTPSPRFG GRGTG HGRHS 
Sbjct: 422 GNSYTNPTQDRAKYRSSSPNPGFHGSLSPGRGSHGHHGNMTPSPRFGYGRGTGFHGRHSL 481

Query: 249 FDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKKARVSDSSS 308
            DKS GPEQFYN SMLEDPWKVLQPCIWTTI   SNSAKP ESWISKFGTKKARVSDSSS
Sbjct: 482 LDKS-GPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSAKPSESWISKFGTKKARVSDSSS 541

Query: 309 GRSSS-QPSLAEYLAASFKEAVEDAPS 335
           GRSSS QPSLAEYLAASFKEA+EDAP+
Sbjct: 542 GRSSSQQPSLAEYLAASFKEAIEDAPN 567

BLAST of Lsi10G002440.1 vs. NCBI nr
Match: gi|659082736|ref|XP_008442005.1| (PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo])

HSP 1 Score: 551.2 bits (1419), Expect = 1.2e-153
Identity = 280/335 (83.58%), Postives = 298/335 (88.96%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDY 60
           MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESS T +GQLAPCT PRFDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMMGQLAPCTTPRFDY 60

Query: 61  YTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFH 120
           YTNPMAAFSTSKKKGKIENQ VSDN+VPY+ NTSS     P F GLRNPEMS +STHQFH
Sbjct: 61  YTNPMAAFSTSKKKGKIENQLVSDNFVPYHHNTSS-----PTFPGLRNPEMSSASTHQFH 120

Query: 121 QYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSS 180
           Q SPD+R FYARG S +GGHG PGMPRP+ +DQG PHMWRG +RPFVNQ+P+HPP EM+S
Sbjct: 121 QCSPDRRMFYARGDSEAGGHGSPGMPRPYAVDQGDPHMWRGSKRPFVNQYPTHPPREMNS 180

Query: 181 PSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGT 240
           PSHVS PRGNSYTNPTQDRANY SSSP+PG+ GSFSPG  SHGHH NMTPSPRFG GRGT
Sbjct: 181 PSHVSRPRGNSYTNPTQDRANYRSSSPNPGFLGSFSPGRGSHGHHGNMTPSPRFGYGRGT 240

Query: 241 GSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWIS-KFGTK 300
           GSHGRHSS DKSPGPEQFYN SMLEDPWKVLQPCIWTTI P SNS +P ESWIS KFGTK
Sbjct: 241 GSHGRHSSLDKSPGPEQFYNVSMLEDPWKVLQPCIWTTIAPSSNSTEPSESWISTKFGTK 300

Query: 301 KARVSDSSSGRSSSQPSLAEYLAASFKEAVEDAPS 335
           KARVSDSSSGRS+SQPSLAEYLAASFKEA+ED P+
Sbjct: 301 KARVSDSSSGRSNSQPSLAEYLAASFKEAIEDVPN 330

BLAST of Lsi10G002440.1 vs. NCBI nr
Match: gi|659118500|ref|XP_008459154.1| (PREDICTED: uncharacterized protein LOC103498353 isoform X2 [Cucumis melo])

HSP 1 Score: 495.4 bits (1274), Expect = 8.1e-137
Identity = 260/335 (77.61%), Postives = 274/335 (81.79%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDY 60
           MEESEKRRERLRAMRMEAAQADV NY+ETSLPNHLSNPLVESS T VGQLAPCTAPRFDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVVNYIETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60

Query: 61  YTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFH 120
           YTNPMAAFSTSKKKGKIENQPVSD +VPY+ NTSS T+LPP F G               
Sbjct: 61  YTNPMAAFSTSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPG--------------- 120

Query: 121 QYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSS 180
                         S +GGHG PGMPRP+ ++QG PHMWRGPRRPFVNQFP+HPP EM+S
Sbjct: 121 -------------DSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQFPTHPPREMNS 180

Query: 181 PSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGT 240
            SHVSGPRGNSYTNPTQDRA Y SSSP+PG+ GS SPG  SHGHH NMTPSPRFG GRGT
Sbjct: 181 SSHVSGPRGNSYTNPTQDRAKYRSSSPNPGFHGSLSPGRGSHGHHGNMTPSPRFGYGRGT 240

Query: 241 GSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKK 300
           G HGRHS  DKS GPEQFYN SMLEDPWKVLQPCIWTTI   SNSAKP ESWISKFGTKK
Sbjct: 241 GFHGRHSLLDKS-GPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSAKPSESWISKFGTKK 300

Query: 301 ARVSDSSSGRSSS-QPSLAEYLAASFKEAVEDAPS 335
           ARVSDSSSGRSSS QPSLAEYLAASFKEA+EDAP+
Sbjct: 301 ARVSDSSSGRSSSQQPSLAEYLAASFKEAIEDAPN 306

BLAST of Lsi10G002440.1 vs. NCBI nr
Match: gi|449460730|ref|XP_004148098.1| (PREDICTED: uncharacterized protein LOC101221481 [Cucumis sativus])

HSP 1 Score: 493.0 bits (1268), Expect = 4.0e-136
Identity = 260/335 (77.61%), Postives = 277/335 (82.69%), Query Frame = 1

Query: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSTTTVGQLAPCTAPRFDY 60
           MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESS T VGQLAPCTAPRFDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60

Query: 61  YTNPMAAFSTSKKKGKIENQPVSDNYVPYYPNTSSATHLPPKFSGLRNPEMSPSSTHQFH 120
           YTNPMAAFSTSKKKGKIENQPVSDN+VPY+ NTSS T+ PP F G               
Sbjct: 61  YTNPMAAFSTSKKKGKIENQPVSDNFVPYHHNTSSTTYFPPTFPG--------------- 120

Query: 121 QYSPDQRTFYARGFSGSGGHGRPGMPRPFPMDQGAPHMWRGPRRPFVNQFPSHPPWEMSS 180
                         S +GGHGRPGMPRP+ ++QG  HMWRGPR PFVNQFP+ PP EM+S
Sbjct: 121 -------------DSEAGGHGRPGMPRPYAVNQGDLHMWRGPRGPFVNQFPTQPPREMNS 180

Query: 181 PSHVSGPRGNSYTNPTQDRANYHSSSPSPGYQGSFSPGGDSHGHHNNMTPSPRFGSGRGT 240
           PSHVSGPRGN YTNPTQ+RANY SSSP+PG++GSFSPG  S+GHH NMTPSPRFG GR T
Sbjct: 181 PSHVSGPRGNPYTNPTQNRANYRSSSPNPGFRGSFSPGRGSYGHHGNMTPSPRFGYGRAT 240

Query: 241 GSHGRHSSFDKSPGPEQFYNASMLEDPWKVLQPCIWTTIPPLSNSAKPLESWISKFGTKK 300
           GSHGRHSS DKS GPEQFYN SMLEDPWKVLQPCIWTTI PLSNSAKP E WISKFGTKK
Sbjct: 241 GSHGRHSSSDKS-GPEQFYNISMLEDPWKVLQPCIWTTIAPLSNSAKPSEYWISKFGTKK 300

Query: 301 ARVSDSSSGRSSS-QPSLAEYLAASFKEAVEDAPS 335
           ARVSDSSS RSSS QPSLAEYLAASFKEA+E+AP+
Sbjct: 301 ARVSDSSSSRSSSQQPSLAEYLAASFKEAIEEAPN 306

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
Q6E437_CUCME3.5e-15585.63ACT11D09.5 OS=Cucumis melo GN=ACT11D0.5 PE=4 SV=1[more]
A0A0A0LQW8_CUCSA2.8e-13677.61Uncharacterized protein OS=Cucumis sativus GN=Csa_2G435470 PE=4 SV=1[more]
E0CRX4_VITVI9.8e-4944.25Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g07660 PE=4 SV=... [more]
W9SMI8_9ROSA1.4e-4744.16Uncharacterized protein OS=Morus notabilis GN=L484_000896 PE=4 SV=1[more]
A0A061FEH0_THECC5.6e-4440.05Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma... [more]
Match NameE-valueIdentityDescription
AT4G24500.12.3e-2835.34 hydroxyproline-rich glycoprotein family protein[more]
Match NameE-valueIdentityDescription
gi|659118496|ref|XP_008459151.1|7.5e-15985.97PREDICTED: uncharacterized protein LOC103498353 isoform X1 [Cucumis melo][more]
gi|46095228|gb|AAS80151.1|5.0e-15585.63ACT11D09.5 [Cucumis melo][more]
gi|659082736|ref|XP_008442005.1|1.2e-15383.58PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo][more]
gi|659118500|ref|XP_008459154.1|8.1e-13777.61PREDICTED: uncharacterized protein LOC103498353 isoform X2 [Cucumis melo][more]
gi|449460730|ref|XP_004148098.1|4.0e-13677.61PREDICTED: uncharacterized protein LOC101221481 [Cucumis sativus][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0050896 response to stimulus
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Lsi10G002440Lsi10G002440gene


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi10G002440.1.three_prime_UTR.1Lsi10G002440.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi10G002440.1.CDS.2Lsi10G002440.1.CDS.2CDS
Lsi10G002440.1.CDS.1Lsi10G002440.1.CDS.1CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Lsi10G002440.1Lsi10G002440.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 1..21
scor
NoneNo IPR availablePANTHERPTHR36054FAMILY NOT NAMEDcoord: 1..335
score: 1.7