Lsi04G018800 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi04G018800
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1
Locationchr04 : 25934864 .. 25938938 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAGCTTACCGATATAGCACCATCTGAAGGAGTTCCTTCTGAGTCTTTCAAATTGTCGGTCTCAACTCTATCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCCGGTTTAGATTCTGCTCGTCTCTACTTCCACCAGAGGGCTGCATGTTCATCTGCTGAGTTGATGCAAAACAATGATTCACGGGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACATATGATTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACAACCTGCGGGTTTGCCAGACATTTTTGCTCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTACCTAAACTTGCGCAGCTCTCCTTTCACGGAAATACTTGATAATGTCCCCCTAAGAAGCAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCATGGAGAGCATCACCATAAACTAACTGCTCAAGAGGATAGCCAGTTGGCTATGTATGCATCGGACCATGACCATCAAATTGATAAAAGTCTTATTACTCTGGTCAAGTCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGTGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACTGCGGGGTATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTATCATTCAAACTCATGCCTAAATCCATGACTAGCCTCAGTTGTTCAGAAATGAGAGCAGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACGAACTTTTTAACCGCCCAAATTTCCAGAATTTCAGTTTCTCTACATCACAAGATGGTAGGCCAAATATTCTTCTTATAATTTTTAATTTTGGAGCCGATGAATATCATTTATTGAGGTTTCCCCCCTTTTGTTCTGTAACCTCCCTTGGTATGGGGTCGACTCTCCCTCTTTTGTAGTTTAATTACATCAATTAAATTGTTTCTTATAAGAAGAAAAAGTCATTTATTGAGGGTTTATAAATGCAATATTAATTTTCTTTTACATGTTTAGTACTTAGCTGCTTGGTTTGATTGATGTTAGTTGTGCTTCTTTCTACAGTACACACAATGCTTCTGTAGGTTGCTGAAATTCAAATTTCCCTTCCCTTCGATGTTTTACCTAGACGTAATAAAGAGCCATCTGTTTAGATTGCCATTTTGCTATATATTTGGAGAATTTCTCTTAGTGAAAGAGCGTTGTGAGAAAATGATTTATGTTTCAATGTATATTTTTGGGTGAATTATGGATTGGAGGCCATGTATTTGCATTATTTTGATATTTATTATTGCTTGATAGTTTGAATGCTTGCTTGCCCGAGTATTTAGTTGTTCATTTCATCTTTATCCACTTGTTGGGTAGTGTTCCTTCTATAGTAATTTCTTAGCGTTAGTAGCCACCTTAGTGAGAAGCTTTCAATTAACAGGGTGTTGGATCGATGAAATGCTGGTACAGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAGGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGATGGCCTCCTGGAGTGCCGTTCGTTCATCCTCACGATCTACCTAATAAGGCAAAAATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGACGTTGAATTAAGTCTTACTGAACCTGGGCAAGTGGGTCAACAGTCAACCAACTGTAAGTGGATCTCTGTCATATTCATATATGCATGCATCGACTGGTTCTTGTTTTTTACTCCAACAAGTTCCTGCTTTGCAATCAATGTGAATGAAGCTTACTGATTTTGGTTCAATTCAGAAAATCCTTGAGGTATTGTTATAGTCTTATTGTCTTTGGAAAGAGGGAAATGAAATCATAGGTGTGCATCATTATTTTTCCTATTGTGTGAAGGTGGGTAAGGACAGAATTCAAAACTGTAAGCATGTACATTCTTTCCCGTTACATTATTACTTTCCCCTTCCCCCATTTTTATTATTGTCTGCAGAATCATGTTTATTATTATTATTATTATTTATTTATTTCTCTTCCATCCCCTTGAAACTGGCCAGTCTTTTTTAATTTGGTGCTGAAAATTTCCGAGATAATTTTATGAATCATCAAGGAATCAGGAAGATTAACAGTAGTGGTTGATGGAAGGGTAGCTTTATTGCAATGCTGCATTTGCTGTGCCTTAGGGATGCATTTGTGAGTGTAATAGATTTGATATATGATTTGAACTTGTGTCCATGTCCTTAAGTTGTGTCCATGTCTGATGGCCGTGGGGACTGTTGACGGAGATTAGTTTGGTTCATGAGTGAATTGGAGTCTAGAGTCTATATTTCCTAGAAGCTTGAAGATAATTGTGTTGGATCTTTTGAACTCTGTGCTACACATGTTTGACGGACCTTGGAAGCTCTTCTGTAGAACAATTTTACAGGCACATCAATCTTTGCGCATCATTTTTTTTTAGAAACCAAGGATATCATTCAAAACAACCTAAAGGACAAGGGGATAAAGCATCCCCTTCCCAAAGGAAGTGTTACAAAAAGACCTTCCAATCTAAGTTAATGGTGTCTATGGTGTTGTTACAAAACAACTATGTTTAGGCCGTTAAACAACCTATGTTCTGCATATCATGGCTTTTCACCCTAGCATTGAGTCATGGACTGATTTTAGTTAACTTTTTCAAAATTTTAACAAGTTCGTGAGTGTTACCTAGAAGGATACATACTCATAGTTGTGCTGTCGGGAGCTTGAAAGTTTAGTATTGGCTAGGCAATTACGTAGTCCATTTGTATAAAATCATACTACTTCATGTAGATTTGGCACTGTCTTCGTTCAGTTTCTGTTGTTGCACCAGCATCACCAACCCAACTTTTCAGCCCTGCACCTAATACACAGGCTCCCCTTTCTCTCGCTCCATATATTTTTTCCATTTTTGTTGTTGATCTAAGGGCTTCCTGACTTCACTTCATCATATAGTTTTTCTTCCTCCCATTTTTGTTTTTGCCTTTTCTTGCTTGGCTTGCATGTCTGTTTGCTCGATCTGCATCTTCTTTTACTTAAAATGTTTGTTTTAGCTTTATTCTTTTTGGATGTCAATGGCATAATGTTTTTTTTTTTTTTGGTTCCTGTTTCTTCAGGAATCAGTTCATGTTCTGTACATCAGTGGATTTTCAAAGCCTCGTAGCTTGCAACTTCAAAACCAGTTCCATTCATTCATGGCTGAGCCGTTGATCAGCTCATCATGCCTGCGCTCGGATTGTTGTTTCAATACCCAATGCTGACAAATTGTGCCTGTTCGCATCTCCAGTGAAGACTGTACATATGTATTACTTTCTGGGTAAGCCAGTTTCATCGTCATCTCCTCTAACCGTTGGACAGCTTTATCTTATTTTGTTGGAGGACAACTCTTGAAGAAAGGGGTTTGTTAAGCTAGCTAACCAGTGATGGTAATGGAGTGGAAAGAACTATTTCCTTTGAATTCTAGTAGAGATTTTTTTCATGTTAGTCATACCTTCATGTTCCTAGACTAGTCAGTATGATGTGTAAAATTTGGGAAATACGATAGAGAGCAGAAATATACAAACTTGAGGTTCCTGAAGAAATATTCCCTGTTTTTTCCTTTTTCTATTCTTGATGTTGTTCTGCGATCATGACTTGAGCTCTTCACAATACATCTCTTTTGCTTTTAAA

mRNA sequence

ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAGCTTACCGATATAGCACCATCTGAAGGAGTTCCTTCTGAGTCTTTCAAATTGTCGGTCTCAACTCTATCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCCGGTTTAGATTCTGCTCGTCTCTACTTCCACCAGAGGGCTGCATGTTCATCTGCTGAGTTGATGCAAAACAATGATTCACGGGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACATATGATTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACAACCTGCGGGTTTGCCAGACATTTTTGCTCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTACCTAAACTTGCGCAGCTCTCCTTTCACGGAAATACTTGATAATGTCCCCCTAAGAAGCAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCATGGAGAGCATCACCATAAACTAACTGCTCAAGAGGATAGCCAGTTGGCTATGTATGCATCGGACCATGACCATCAAATTGATAAAAGTCTTATTACTCTGGTCAAGTCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGTGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACTGCGGGGTATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTATCATTCAAACTCATGCCTAAATCCATGACTAGCCTCAGTTGTTCAGAAATGAGAGCAGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACGAACTTTTTAACCGCCCAAATTTCCAGAATTTCAGTTTCTCTACATCACAAGATGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAGGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGATGGCCTCCTGGAGTGCCGTTCGTTCATCCTCACGATCTACCTAATAAGGCAAAAATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGACGTTGAATTAAGTCTTACTGAACCTGGGCAAGTGGGTCAACAGTCAACCAACTGAATCAGTTCATGTTCTGTACATCAGTGGATTTTCAAAGCCTCGTAGCTTGCAACTTCAAAACCAGTTCCATTCATTCATGGCTGAGCCGTTGATCAGCTCATCATGCCTGCGCTCGGATTGTTGTTTCAATACCCAATGCTGACAAATTGTGCCTGTTCGCATCTCCAGTGAAGACTGTACATATGTATTACTTTCTGGGTAAGCCAGTTTCATCGTCATCTCCTCTAACCGTTGGACAGCTTTATCTTATTTTGTTGGAGGACAACTCTTGAAGAAAGGGGTTTGTTAAGCTAGCTAACCAGTGATGGTAATGGAGTGGAAAGAACTATTTCCTTTGAATTCTAGTAGAGATTTTTTTCATGTTAGTCATACCTTCATGTTCCTAGACTAGTCAGTATGATGTGTAAAATTTGGGAAATACGATAGAGAGCAGAAATATACAAACTTGAGGTTCCTGAAGAAATATTCCCTGTTTTTTCCTTTTTCTATTCTTGATGTTGTTCTGCGATCATGACTTGAGCTCTTCACAATACATCTCTTTTGCTTTTAAA

Coding sequence (CDS)

ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAGCTTACCGATATAGCACCATCTGAAGGAGTTCCTTCTGAGTCTTTCAAATTGTCGGTCTCAACTCTATCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCCGGTTTAGATTCTGCTCGTCTCTACTTCCACCAGAGGGCTGCATGTTCATCTGCTGAGTTGATGCAAAACAATGATTCACGGGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACATATGATTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACAACCTGCGGGTTTGCCAGACATTTTTGCTCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTACCTAAACTTGCGCAGCTCTCCTTTCACGGAAATACTTGATAATGTCCCCCTAAGAAGCAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCATGGAGAGCATCACCATAAACTAACTGCTCAAGAGGATAGCCAGTTGGCTATGTATGCATCGGACCATGACCATCAAATTGATAAAAGTCTTATTACTCTGGTCAAGTCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGTGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACTGCGGGGTATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTATCATTCAAACTCATGCCTAAATCCATGACTAGCCTCAGTTGTTCAGAAATGAGAGCAGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACGAACTTTTTAACCGCCCAAATTTCCAGAATTTCAGTTTCTCTACATCACAAGATGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAGGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGATGGCCTCCTGGAGTGCCGTTCGTTCATCCTCACGATCTACCTAATAAGGCAAAAATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGACGTTGAATTAAGTCTTACTGAACCTGGGCAAGTGGGTCAACAGTCAACCAACTGA

Protein sequence

MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
BLAST of Lsi04G018800 vs. TrEMBL
Match: A0A061F8X2_THECC (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_032425 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 2.0e-228
Identity = 387/475 (81.47%), Postives = 428/475 (90.11%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++  NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E  P GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY  DH+HQ+DK LI++VKSDKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

BLAST of Lsi04G018800 vs. TrEMBL
Match: A0A061FH49_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_032425 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 2.0e-228
Identity = 387/475 (81.47%), Postives = 428/475 (90.11%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++  NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E  P GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY  DH+HQ+DK LI++VKSDKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

BLAST of Lsi04G018800 vs. TrEMBL
Match: D7SWX7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00600 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 1.3e-227
Identity = 385/475 (81.05%), Postives = 430/475 (90.53%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGN LPSLGRVKL D+   EG+PS+S+KLSVSTLS SLAQYSAAIIQFP+ DGALLRSG
Sbjct: 1   MAGNSLPSLGRVKLCDLIACEGLPSDSYKLSVSTLSQSLAQYSAAIIQFPSSDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSA LYFHQRA+  +A+++ NN+SREWC+TSGYY D Q WQETYD+RPGLTP E ++G+
Sbjct: 61  LDSAHLYFHQRASYPAADMIHNNESREWCKTSGYYADPQQWQETYDFRPGLTPPESNSGL 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E  PAGLPDIF+L GKA+R ILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 EFPPAGLPDIFSLLGKAARDILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
           GRPSF G  HH LT QED QL M+ SDH+HQ+DKSLITLVKSDKAGL ++DF+GRW+LVD
Sbjct: 181 GRPSFQGPQHHNLTTQEDGQLVMF-SDHEHQVDKSLITLVKSDKAGLHVRDFHGRWVLVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+AIVYPGLALYQATAGYV PAL RT+++N+QG+MYGRCSL+FKLMPKSMTSL+C
Sbjct: 241 GDLGPQEAIVYPGLALYQATAGYVGPALHRTEISNMQGNMYGRCSLAFKLMPKSMTSLNC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH T++LFNR NF +F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTEQLFNRNNFPSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NNS  KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNLK+CESHIHTLDSPCA+TR
Sbjct: 361 NNSRCKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNLKECESHIHTLDSPCANTR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD+ELSL EPGQ  Q S N
Sbjct: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLIEPGQASQHSAN 474

BLAST of Lsi04G018800 vs. TrEMBL
Match: A0A0A0KZ26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608060 PE=4 SV=1)

HSP 1 Score: 795.4 bits (2053), Expect = 3.7e-227
Identity = 386/394 (97.97%), Postives = 390/394 (98.98%), Query Frame = 1

Query: 80  MQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELQPAGLPDIFALYGKASR 139
           MQ+NDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMEL PAGLPDIFALYGKASR
Sbjct: 1   MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR 60

Query: 140 IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 199
           IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS
Sbjct: 61  IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 120

Query: 200 QLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 259
           QLAMY SDHD+QIDKSLITL K+DKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA
Sbjct: 121 QLAMYTSDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 180

Query: 260 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 319
           TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP
Sbjct: 181 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 240

Query: 320 VDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 379
           VDDFMQRSHSTD+LFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV
Sbjct: 241 VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 300

Query: 380 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 439
           LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA
Sbjct: 301 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 360

Query: 440 KIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           KIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Sbjct: 361 KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 394

BLAST of Lsi04G018800 vs. TrEMBL
Match: A0A0D2M686_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G036400 PE=4 SV=1)

HSP 1 Score: 789.3 bits (2037), Expect = 2.7e-225
Identity = 381/476 (80.04%), Postives = 427/476 (89.71%), Query Frame = 1

Query: 1   MAGN-GLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRS 60
           MAG+ GLPSLGRVK+TD+ PSEG+PS+S+KLSVSTLS S AQYSAA+IQFPA DGALLRS
Sbjct: 1   MAGDDGLPSLGRVKITDLIPSEGLPSDSYKLSVSTLSQSFAQYSAAVIQFPAGDGALLRS 60

Query: 61  GLDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNG 120
           GLDSA LYF QR A  SA+++  NDSREWC+TSGYY D Q+WQETYDYRPGLTP+EPSN 
Sbjct: 61  GLDSACLYFQQREAYPSADMIHTNDSREWCKTSGYYADPQLWQETYDYRPGLTPIEPSNA 120

Query: 121 MELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCC 180
           MEL P GLPDIF L GKA+R +LDA+S+YLNLRSSPFTEILDNVPLRSRE+SSSVLSVCC
Sbjct: 121 MELPPGGLPDIFGLLGKAARGVLDAMSYYLNLRSSPFTEILDNVPLRSREVSSSVLSVCC 180

Query: 181 YGRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILV 240
           + RPSFHG  HH LT Q+D QL M+  DHDHQ+DKSLI++VKSDKAGL ++DF+GRW LV
Sbjct: 181 HARPSFHGAQHHNLTTQDDGQLMMF-HDHDHQVDKSLISVVKSDKAGLHVRDFHGRWFLV 240

Query: 241 DGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLS 300
           DGDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NNI G+MYGRCSL FKLMPKSMTSLS
Sbjct: 241 DGDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNIPGNMYGRCSLVFKLMPKSMTSLS 300

Query: 301 CSEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRR 360
           CSEMRAAGHGV+ QFQ+PVPVDDFMQRSH TD+LFNR  FQ+FSF T+QDGS+K  MRRR
Sbjct: 301 CSEMRAAGHGVEAQFQIPVPVDDFMQRSHPTDQLFNRNTFQSFSFPTAQDGSMKPLMRRR 360

Query: 361 KNNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCAST 420
           KNN+  KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNLK+CE+HIH LDSPCA+ 
Sbjct: 361 KNNTRCKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNLKECENHIHALDSPCANI 420

Query: 421 RMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           RMEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD++LSLTEPGQ  QQS N
Sbjct: 421 RMEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMDLSLTEPGQASQQSAN 475

BLAST of Lsi04G018800 vs. TAIR10
Match: AT3G12940.1 (AT3G12940.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 716.5 bits (1848), Expect = 1.1e-206
Identity = 351/476 (73.74%), Postives = 408/476 (85.71%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNG+P+LGRVK+ D+ PSEG+PS+S+KL+V+TLS SLAQYSAAIIQFPA DGALLRSG
Sbjct: 2   MAGNGMPTLGRVKVCDLVPSEGLPSDSYKLAVTTLSQSLAQYSAAIIQFPASDGALLRSG 61

Query: 61  LDSARLYFHQRAACSSAE-LMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNG 120
           LDSARLYFHQR +  +   ++  NDS+EWC+TSGYY D Q WQE+Y+YRPGLTP EPSN 
Sbjct: 62  LDSARLYFHQRDSYPATNNMIHTNDSQEWCKTSGYYADPQSWQESYEYRPGLTPTEPSNS 121

Query: 121 MELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCC 180
           ME  PAGLPDIFAL GKA+R++LDAI FYLNLRS PFTEILDNVPLR+ E+SSSVLSVCC
Sbjct: 122 MEFPPAGLPDIFALLGKAARVVLDAIGFYLNLRSCPFTEILDNVPLRNCEVSSSVLSVCC 181

Query: 181 YGRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILV 240
           Y RPSFHG  HH LT  ED QL +Y SDHDHQ+DKSLI+ VKSDKAGL I+D +G+WILV
Sbjct: 182 YARPSFHGAQHHSLT--EDEQLILY-SDHDHQLDKSLISFVKSDKAGLHIRDMHGQWILV 241

Query: 241 DGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLS 300
           D DLGPQ+A+VYPGLALYQATAGYV+PA+ RTD+N++QGS+ GR SL+FKLMPKSMT+LS
Sbjct: 242 DVDLGPQEAVVYPGLALYQATAGYVSPAVHRTDLNSLQGSIEGRFSLAFKLMPKSMTNLS 301

Query: 301 CSEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKM--RRR 360
           CSEMRAAGHGV+ QFQLPV VDDFMQRSHS DELFNR   Q+F    SQDGS+K   +RR
Sbjct: 302 CSEMRAAGHGVEAQFQLPVSVDDFMQRSHSNDELFNRQTLQSFIVPQSQDGSMKQLKKRR 361

Query: 361 KNNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCAST 420
           K++S  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CE++ + ++SPCA+ 
Sbjct: 362 KSDSRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECENNHNVMNSPCANI 421

Query: 421 RMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           R EIGWP GVPFVHPHDLPNKAKIGFLE YEPGW+ +HD+E SL+E  Q  Q  TN
Sbjct: 422 RREIGWPHGVPFVHPHDLPNKAKIGFLETYEPGWSETHDMEFSLSETAQGNQHVTN 474

BLAST of Lsi04G018800 vs. TAIR10
Match: AT3G19895.1 (AT3G19895.1 RING/U-box superfamily protein)

HSP 1 Score: 158.7 bits (400), Expect = 8.9e-39
Identity = 117/336 (34.82%), Postives = 166/336 (49.40%), Query Frame = 1

Query: 4   NGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDS 63
           +G P L RV+L++I P EG PS  +  +V  LS SL +Y+A++I+  + D AL+R GL++
Sbjct: 56  SGTP-LARVRLSEILPYEGAPSPVYAKAVEALSVSLMRYNASVIEIGSEDTALMRCGLEA 115

Query: 64  ARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELQ 123
           ARLYF                     RT    V  +  +    YR G +  +    ++  
Sbjct: 116 ARLYF---------------------RTRSLTVSGKGNRGLSMYRAGRSVED----LDSS 175

Query: 124 PAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRP 183
           P  + +IF   GK +R  L AI+ +L LRS  F  +LD+ PL   E+SSSVL +  Y   
Sbjct: 176 PPCMAEIFRCLGKVARAALSAIARHLRLRSDVFNHMLDDFPLAPNEVSSSVL-LASYAHA 235

Query: 184 SFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDL 243
           S     H        +++         +++K L+TL  SD  G+ + D NGRW   D   
Sbjct: 236 SIQNGKHASGGGNLSAKI---------EVEKGLLTLFCSDGTGIQVCDPNGRWYTADNGC 295

Query: 244 GPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGS-MYGRCSLSFKLMPKSMTSLSCSE 303
           G  D ++  G AL  ATAG    A  RT  +++  +   GR SL+F+LMPKS   L CS 
Sbjct: 296 GVGDLLLITGKALSHATAGLRPAASYRTTTDHLSATDTRGRASLAFRLMPKSNAILDCSP 354

Query: 304 MRAAGHGVDVQFQLPVPVDDFMQR-SHSTDELFNRP 338
           + AAGH V  Q  +PV V  FM       D L N P
Sbjct: 356 IEAAGH-VIPQSYVPVSVSQFMDNLLAENDTLVNPP 354

BLAST of Lsi04G018800 vs. NCBI nr
Match: gi|449453784|ref|XP_004144636.1| (PREDICTED: uncharacterized protein LOC101216737 [Cucumis sativus])

HSP 1 Score: 945.3 bits (2442), Expect = 4.2e-272
Identity = 465/473 (98.31%), Postives = 469/473 (99.15%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQ+NDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM
Sbjct: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           EL PAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQLAMY SDHD+QIDKSLITL K+DKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLAMYTSDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTD+LFNRPNFQNFSFSTSQDGSIKMRRRKNN
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of Lsi04G018800 vs. NCBI nr
Match: gi|659130950|ref|XP_008465436.1| (PREDICTED: uncharacterized protein LOC103503048 isoform X1 [Cucumis melo])

HSP 1 Score: 941.0 bits (2431), Expect = 7.9e-271
Identity = 462/473 (97.67%), Postives = 469/473 (99.15%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVD QMWQETYDYRPGLTPVEPS+GM
Sbjct: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDTQMWQETYDYRPGLTPVEPSSGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           EL PAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQL+MYASDHD+QIDKSLITL KSDKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLSMYASDHDNQIDKSLITLFKSDKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTNLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTD+LFNRPNFQNFSFSTSQDGSIKMRRRKN+
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNS 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCE+HIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCENHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of Lsi04G018800 vs. NCBI nr
Match: gi|659130954|ref|XP_008465438.1| (PREDICTED: uncharacterized protein LOC103503048 isoform X2 [Cucumis melo])

HSP 1 Score: 941.0 bits (2431), Expect = 7.9e-271
Identity = 462/473 (97.67%), Postives = 469/473 (99.15%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVD QMWQETYDYRPGLTPVEPS+GM
Sbjct: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDTQMWQETYDYRPGLTPVEPSSGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           EL PAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQL+MYASDHD+QIDKSLITL KSDKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLSMYASDHDNQIDKSLITLFKSDKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTNLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTD+LFNRPNFQNFSFSTSQDGSIKMRRRKN+
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNS 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCE+HIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCENHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of Lsi04G018800 vs. NCBI nr
Match: gi|590611974|ref|XP_007022254.1| (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 799.7 bits (2064), Expect = 2.8e-228
Identity = 387/475 (81.47%), Postives = 428/475 (90.11%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++  NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E  P GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY  DH+HQ+DK LI++VKSDKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

BLAST of Lsi04G018800 vs. NCBI nr
Match: gi|590611970|ref|XP_007022253.1| (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 799.7 bits (2064), Expect = 2.8e-228
Identity = 387/475 (81.47%), Postives = 428/475 (90.11%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++  NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELQPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E  P GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYASDHDHQIDKSLITLVKSDKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY  DH+HQ+DK LI++VKSDKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDELFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWTA+HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A061F8X2_THECC2.0e-22881.472-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform ... [more]
A0A061FH49_THECC2.0e-22881.472-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=T... [more]
D7SWX7_VITVI1.3e-22781.05Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00600 PE=4 SV=... [more]
A0A0A0KZ26_CUCSA3.7e-22797.97Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608060 PE=4 SV=1[more]
A0A0D2M686_GOSRA2.7e-22580.04Uncharacterized protein OS=Gossypium raimondii GN=B456_002G036400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G12940.11.1e-20673.74 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT3G19895.18.9e-3934.82 RING/U-box superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449453784|ref|XP_004144636.1|4.2e-27298.31PREDICTED: uncharacterized protein LOC101216737 [Cucumis sativus][more]
gi|659130950|ref|XP_008465436.1|7.9e-27197.67PREDICTED: uncharacterized protein LOC103503048 isoform X1 [Cucumis melo][more]
gi|659130954|ref|XP_008465438.1|7.9e-27197.67PREDICTED: uncharacterized protein LOC103503048 isoform X2 [Cucumis melo][more]
gi|590611974|ref|XP_007022254.1|2.8e-22881.472-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 [The... [more]
gi|590611970|ref|XP_007022253.1|2.8e-22881.472-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi04G018800.1Lsi04G018800.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33644FAMILY NOT NAMEDcoord: 1..70
score: 2.2E-227coord: 90..473
score: 2.2E
NoneNo IPR availablePANTHERPTHR33644:SF2SUBFAMILY NOT NAMEDcoord: 1..70
score: 2.2E-227coord: 90..473
score: 2.2E