Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCATTGTTATTGCGAGTATCGAGTCAGGGTCACCAACTCGGCGTAATCTCCGGCAATGGCGACGGCGGGATTAACAGAGGAAGCAAAAGGAAGTCCATTATTAAGCTTAATCGGATTATGCCGAGGTCGTCGGAATTGAGGATGATGTACGAAGAGGGAAATTTGGAGCATCCGAAGTGGTCTGGAAATAGCCTTCTGTCTCGTCTTGTCGGCGGCCTCATTTCTTTCAAACCTTTTTTCTCCATTCTTAAGCTCACCGCCAGACAAGTCATGATCGGGTTTCTCTTTTTTCTTTTTTTCTTTTTTTGAATTAAATTATTTAATCAATTTTTTTTTTTTGTTTAAATTTAATAAATGGAACCTTTTAAACTCATTCCTAAAAGCTCTGGACTAAATTGGTCCATTTAAAATATGAAAGACTAAAAGTACCTATTTCTCAAAATTTAGGGATTAAAAATGTATTTTATTTAGTTTTATTTATTTTCAAAATACTACACTTTTATTTGTAAATTTTGAATTTAATTTTTATTTTTTTATTACAATAGAGGAAGGAGAGGTCCAAAGCACGGACCTCTCGGTTGCTAACATGCTAGTTGAGCTAAGGAAAATGATCTTTCAAAATGTTTTATTTTTTTAACTTTGAGTTTCAAATTTTGTTTTTATTTGATCCCAAAATTTCAAGATTTATACTTTTAACTTTCATTTCTAAACTAAATACTCACTTTTGATTCTTAATTAGTATTGATATTTGTAATTTATTTAAATAATATTCATTCTTTACAAAAATAAAATTAAAATTCAATATTTCTTCTTTTAACCTTCATTTCTACACTAAATACTCATTTTTTTTTTTAATGATAGTGAATCTCATAAAAAAGATATAATACTATTAAACTATTATTTATTCATATCAACATATGAGTGAAATGAGTACATATTATAGTTGCAGTATCGATCTCATCTCATTTTTTAAATGTGGAGGGTAAGTTGCAAATTTTTTAATATATATATATATTTAAACAAAGGAGAAAGTTGGAATTTTATTTTGAAATATTACTTTTTCTTTTATTTTTGTTTATGTGAGTCACTATTTCTACCTGTGATAAAAAAAAAAAAAAAAAAAACTACACTATAGCTTACCTATTTGAGTAATTGAAATGAAAATATTTAAATTACAGAAACTTTATTTAATTATATTACCAAAATTTGTAATTTAAACTACGTTTTTTTTTTTCCCCTCTTATCTTATTGGAACAGTACGGCGGCAAAGAATAATTTTCCTTGGCAAAAAATGAGGTCGGAGATTTTGGAGAGTGAAGAAGTTTACAAGGTGTTCGAGAAAGTTCAAAATCCCTCCATCGTCTACCCTCATTGTAATTCCTCTAATTTCTTCCTTTCGAATTTCATAAATTATATCATGAAAGAAATAATATTACAATTTCTATAAATTTTAAATACATAAATTAATTGCATAAGATCATCATCCACAACACATTTCAGGATTTTGATTAGTTATAGGATGTTTTATCTCATTAATAGATTGATCTCAGAGAAAAAGTATTTTTCAAAAAAATTATTTTTATTTAAATTTTTTTTTATAAGAACTGTTTAAAATAATACATTTCAAAATCTACTTTGAGTGGTTATTAAACACATTCTAATTTTTTTTCCAAAAATGACTAATTTTTTAAATTAATCATTTGAAAATGTATTTCAAACACACCCGTAGTCTTGCATATTTAATTTTCACCAATATCATATTTTTGGTTTTGTAATTAACCATGTACATATTGTCTTTTTTTTTTTTTTTTTTTGACAGATTACCTAAAGCCTTTTCATGCATACGATGAGGGTCATCTCTCATGGCTTGTAAGTACTCACTAATTGGATATTAATTTTAGAGTAATATGGCTTCTATAGCTAGGTTGCAATAATATTTTTTTTTCTAGTTCATCTTTTCATACCAGGAAAAAATTATACAAAATTTGATTGAAAAA
TGAACGTATCACCTTGATGCTACGGAGTCCTTCATGTAATATACGTGATTTGGATTAAGTGATTTAGGTCTGTTTTACTAATATGATTGAAGGCAATTTCTCATTATTATTATGAGATAATTACGATTCTCAAAACTATGACTATTTTTTAAGGGTTGTCTAATGCTCACTCATTACTGAATGGCCATAGGTGATTGTAGAATTTAATTGATTAGAGGTGGTGAACTTGTTAAATAATAAGTCGACTAATTTTACTAGAGGCAATATTCGATGATAATGACATCAAGCTTTTGGCTCAAATTCGTGAGAATGTCTTCTTCATTTATACTTATCGCGATATATTGTCTTGGCATATTCTCTTGCTTAAAAAACTATAGTGGAAAACCGTATTGATCTTCTTCATTTTCCTTATCTTGAGTTGTTGTTTTGTTGATCTGGTATCGAATGAAGTGATTTTGTAATTATGCTTACTTCTAGTTATCCTAAAAAATAAAGTATTAATACATCTTATCGTAGTTGACTAAAATTGACAATAATATATTATTTAGAAAGTCAAAAGATATGTGTCTCTCTTTCAGGCTGCAGCAGAAGTAGAGGCTGCAACAATGTCAATGGTTATTCGAGCAGTACCCAATGCCTCTTGTTTAGATGAAGCAAAGGAAGTTGTATTTGGAAATTGGCTTCGACAAATTGATGAGCATCATATGAAGTATTCAAACAATCCTATTCTAGACATTCTAGACATTGGTTGTTCTATAGGTTTGAGCACAACATATTTGGCTGATAAGTTTCCTTCGGCTAAAACTACTGTAAGTCACTTTTAATTCTCTCTTTCTTATCTTGATACAATTATTAGTTCAAAATTTTAATTGATATAACATTCAAAGTTAGGAGTATTAAATTTACATTTTTCCAGAAAGAATATAATTGATAACAATTAGCTTGTTTGACAAAGTTTGCTTATAGATGGTTATAATGTTTGATTTGATGTTTAGTATGCATTTGGTGTAGGGATTAGATTTATCTCCTTACTTCCTTGCTGTGGCTCAACACAATGAAAACAAAAGAACACATCCAAGAAAGAATCCAATAAGATGGTTACATGAAAATGGGGAACATACTAGCTTTCCTTCAAGATCATTTGACCTACTTTCCATTGCTTATTTGGTAATAAATTCCTCTTCATTTAAGATCTTATTATGTAATCAAATTATGTCCATGGTTCTTCTAGAATGTCTACCTATCTCCACGTAGTAATGATATATTGTCCATATATTTTGGAACCACTCAAAATGTCTCATATCAACTAAGATAATTGTATTTAACTTATATATCATATGGATCTCGCACTCTTTTCAAGGTAGATCTTTGTTTGCACCTAACATTAATCTATAATATTTTAGTTTCACGAATGTCCCCAAACAATAATTGTTAATTTGCTCAAAGAATCATTTCGACTTCTTCGACCGGGTTCCACAATTATCATTATTGACAATGCGGTAATTAAGTTGATAATTATGTATATATGTCTTAGATGTTTTTTTTTTTTTTTTTTGAAAGAAAACCCTCATCTACTATATGGTATATATATAACAAATCTAATTTTACATTTGTAATTTTCATGTGCAGCCCAAATCAAAGATTACTCAGGTATATGGTGTAGTACCTTTCTTTTTGTTTCATAAATTTGACGCTGGCTTTAATAATGAAAATATTTTGCTAAAATCTGTATTAATTTACCATGACATTGTAGGAATTATCTCCAATCATATACACATTACTGAAGAGTGTAGAGCCATATGTAGATGAATACTATCTCACTGATGTAGAAGGAAGAATGAGAGAAGTTGGATTTGTGAATGTGAAATCAAGGCTAACAGACCCAAGACATGTTACATTCACAGCAACCGTTCCACTATGA
mRNA sequence
ATGGCATTGTTATTGCGAGTATCGAGTCAGGGTCACCAACTCGGCGTAATCTCCGGCAATGGCGACGGCGGGATTAACAGAGGAAGCAAAAGGAAGTCCATTATTAAGCTTAATCGGATTATGCCGAGGTCGTCGGAATTGAGGATGATGTACGAAGAGGGAAATTTGGAGCATCCGAAGTGGTCTGGAAATAGCCTTCTGTCTCGTCTTGTCGGCGGCCTCATTTCTTTCAAACCTTTTTTCTCCATTCTTAAGCTCACCGCCAGACAAGTCATGATCGGTACGGCGGCAAAGAATAATTTTCCTTGGCAAAAAATGAGGTCGGAGATTTTGGAGAGTGAAGAAGTTTACAAGGTGTTCGAGAAAGTTCAAAATCCCTCCATCGTCTACCCTCATTATTACCTAAAGCCTTTTCATGCATACGATGAGGGTCATCTCTCATGGCTTGCTGCAGCAGAAGTAGAGGCTGCAACAATGTCAATGGTTATTCGAGCAGTACCCAATGCCTCTTGTTTAGATGAAGCAAAGGAAGTTGTATTTGGAAATTGGCTTCGACAAATTGATGAGCATCATATGAAGTATTCAAACAATCCTATTCTAGACATTCTAGACATTGGTTGTTCTATAGGTTTGAGCACAACATATTTGGCTGATAAGTTTCCTTCGGCTAAAACTACTGGATTAGATTTATCTCCTTACTTCCTTGCTGTGGCTCAACACAATGAAAACAAAAGAACACATCCAAGAAAGAATCCAATAAGATGGTTACATGAAAATGGGGAACATACTAGCTTTCCTTCAAGATCATTTGACCTACTTTCCATTGCTTATTTGTTTCACGAATGTCCCCAAACAATAATTGTTAATTTGCTCAAAGAATCATTTCGACTTCTTCGACCGGGTTCCACAATTATCATTATTGACAATGCGGAATTATCTCCAATCATATACACATTACTGAAGAGTGTAGAGCCATATGTAGATGAATACTATCTCACTGATGTAGAAGGAAGAATGAGAGAAGTTGGATTTGTGAATGTGAAATCAAGGCTAACAGACCCAAGACATGTTACATTCACAGCAACCGTTCCACTATGA
Coding sequence (CDS)
ATGGCATTGTTATTGCGAGTATCGAGTCAGGGTCACCAACTCGGCGTAATCTCCGGCAATGGCGACGGCGGGATTAACAGAGGAAGCAAAAGGAAGTCCATTATTAAGCTTAATCGGATTATGCCGAGGTCGTCGGAATTGAGGATGATGTACGAAGAGGGAAATTTGGAGCATCCGAAGTGGTCTGGAAATAGCCTTCTGTCTCGTCTTGTCGGCGGCCTCATTTCTTTCAAACCTTTTTTCTCCATTCTTAAGCTCACCGCCAGACAAGTCATGATCGGTACGGCGGCAAAGAATAATTTTCCTTGGCAAAAAATGAGGTCGGAGATTTTGGAGAGTGAAGAAGTTTACAAGGTGTTCGAGAAAGTTCAAAATCCCTCCATCGTCTACCCTCATTATTACCTAAAGCCTTTTCATGCATACGATGAGGGTCATCTCTCATGGCTTGCTGCAGCAGAAGTAGAGGCTGCAACAATGTCAATGGTTATTCGAGCAGTACCCAATGCCTCTTGTTTAGATGAAGCAAAGGAAGTTGTATTTGGAAATTGGCTTCGACAAATTGATGAGCATCATATGAAGTATTCAAACAATCCTATTCTAGACATTCTAGACATTGGTTGTTCTATAGGTTTGAGCACAACATATTTGGCTGATAAGTTTCCTTCGGCTAAAACTACTGGATTAGATTTATCTCCTTACTTCCTTGCTGTGGCTCAACACAATGAAAACAAAAGAACACATCCAAGAAAGAATCCAATAAGATGGTTACATGAAAATGGGGAACATACTAGCTTTCCTTCAAGATCATTTGACCTACTTTCCATTGCTTATTTGTTTCACGAATGTCCCCAAACAATAATTGTTAATTTGCTCAAAGAATCATTTCGACTTCTTCGACCGGGTTCCACAATTATCATTATTGACAATGCGGAATTATCTCCAATCATATACACATTACTGAAGAGTGTAGAGCCATATGTAGATGAATACTATCTCACTGATGTAGAAGGAAGAATGAGAGAAGTTGGATTTGTGAATGTGAAATCAAGGCTAACAGACCCAAGACATGTTACATTCACAGCAACCGTTCCACTATGA
Protein sequence
MALLLRVSSQGHQLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVGGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPHYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHMKYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNPIRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNAELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVPL
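The protein sequence above is the CDS read codon by codon up to the stop codon. The following is a minimal Python sketch of that translation, assuming the standard nuclear genetic code (the page does not state which translation table was used). The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest). Assumption: table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
cds_start = "ATGGCATTGTTATTGCGAGTATCGAGTCAGGGTCAC"
print(translate(cds_start))
```

Applied to the first 36 nucleotides of the CDS, the sketch yields the first 12 residues of the listed protein; the final codons of the CDS (GTT CCA CTA TGA) give the C-terminal VPL followed by the stop.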
Homology
BLAST of HG10001711 vs. NCBI nr
Match:
XP_038902342.1 (uncharacterized protein LOC120088975 [Benincasa hispida])
HSP 1 Score: 549.7 bits (1415), Expect = 1.9e-152
Identity = 290/383 (75.72%), Positives = 311/383 (81.20%), Query Frame = 0
Query: 1 MALLLRVSSQGHQLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPK 60
MAL L SQGHQLGVISGN G+KRK IIKL+RIM R SELR +YEEGNLEHPK
Sbjct: 2 MALSLGAPSQGHQLGVISGN-------GNKRKPIIKLSRIMAR-SELRTLYEEGNLEHPK 61
Query: 61 WSGNSLLSRLVGGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVF 120
WSGNSLLSRLVGG+IS KPFF+ILKL ARQ+MI AA NN PWQKMRSEILE EEVYK
Sbjct: 62 WSGNSLLSRLVGGIISVKPFFAILKLAARQLMISKAAGNNIPWQKMRSEILECEEVYKEL 121
Query: 121 EKVQNPSIVYPHYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVF 180
EKV+NP I+YP YYLKPFHAYDEGHLSWLAA EVEAATMSMV+RAVPNASCLDEAKEVVF
Sbjct: 122 EKVRNPCIIYPDYYLKPFHAYDEGHLSWLAATEVEAATMSMVMRAVPNASCLDEAKEVVF 181
Query: 181 GNWLRQIDEHHMKY-SNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQ 240
GNWL++IDEHHMKY SNNPILDILDIGCSIGLSTTYLADKFPSAKT GLDLSPYFLA A+
Sbjct: 182 GNWLQRIDEHHMKYSSNNPILDILDIGCSIGLSTTYLADKFPSAKTIGLDLSPYFLAGAE 241
Query: 241 HNENKRTHPRKNPIRWLHENGEHTSFPSRSFDLLSIAYLFHECP--QTIIVNLLKESFRL 300
HNENKR HPRKNPIRWLHENGEHTSFPSRSF LLSIA+L + I+ N + L
Sbjct: 242 HNENKRAHPRKNPIRWLHENGEHTSFPSRSFYLLSIAHLATSSSSFKIILCNPIMSRILL 301
Query: 301 LRPGSTIIIID---------------NAELSPIIYTLLKSVEPYVDEYYLTDVEGRMREV 360
+ + ++ ELSPIIYTLLKSVEPYVDEY+LTDVEGRMREV
Sbjct: 302 IDKDNKYDVVHTFRHKLSFTNPKSKITQELSPIIYTLLKSVEPYVDEYHLTDVEGRMREV 361
Query: 361 GFVNVKSRLTDPRHVTFTATVPL 366
GFVNVKSRLTDPRHVT TATVPL
Sbjct: 362 GFVNVKSRLTDPRHVTVTATVPL 376
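The percentages in an HSP header are the match counts divided by the alignment length (gaps included), shown to two decimals. A small Python sketch of that arithmetic, using the counts reported for XP_038902342.1 above; the helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts taken from the HSP header for XP_038902342.1.
alignment_length = 383
identical = 290   # identical residues
positives = 311   # identical plus conservative substitutions

print(percent(identical, alignment_length))
print(percent(positives, alignment_length))
```

The same calculation applies to every hit on this page; only the counts and the alignment length change.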
BLAST of HG10001711 vs. NCBI nr
Match:
XP_022149776.1 (uncharacterized protein LOC111018124 [Momordica charantia])
HSP 1 Score: 524.2 bits (1349), Expect = 8.7e-145
Identity = 269/359 (74.93%), Positives = 295/359 (82.17%), Query Frame = 0
Query: 13 QLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVG 72
QL +IS NG R + R I ++ + +++ ++EEG+LEHP WSG + LSRLVG
Sbjct: 9 QLPIISANG----QRRTARPRGITMDSMRSKAT----VFEEGHLEHPNWSGETPLSRLVG 68
Query: 73 GLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPH 132
LISFKP +S+LKL AR+VMI TA KNN PW+KM SEILES +VYK E VQN SIVYP
Sbjct: 69 ALISFKPLYSVLKLAARRVMISTAEKNNIPWRKMTSEILESADVYKELESVQNLSIVYPD 128
Query: 133 YYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHM 192
YYLKPFHAYDEGHLSWLAAAEVE ATMSMV+RAVP AS LDEAKEVVFGNWLR +DEHH
Sbjct: 129 YYLKPFHAYDEGHLSWLAAAEVEPATMSMVMRAVPEASSLDEAKEVVFGNWLRTVDEHHR 188
Query: 193 KYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNP 252
+YS NPIL ILDIGCSIGLST YLADKFP AKTTGLDLSPYFLAVAQ+ E KRT PRKN
Sbjct: 189 QYSGNPILHILDIGCSIGLSTRYLADKFPLAKTTGLDLSPYFLAVAQYMEKKRTAPRKNA 248
Query: 253 IRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA-- 312
IRWLHENGEHTS PS+SFDL+SIAYLFHECPQ IVNLLKESFRLLRPG TI I DNA
Sbjct: 249 IRWLHENGEHTSLPSQSFDLVSIAYLFHECPQVAIVNLLKESFRLLRPGGTIAITDNAPK 308
Query: 313 -----ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVP 365
ELSPII+TLLKS EPY+DEYYLTD+EGRMREVGFVNVKSRLTDPRHVT TATVP
Sbjct: 309 SKITQELSPIIFTLLKSTEPYLDEYYLTDLEGRMREVGFVNVKSRLTDPRHVTVTATVP 359
BLAST of HG10001711 vs. NCBI nr
Match:
XP_038902341.1 (uncharacterized protein LOC120088974 [Benincasa hispida])
HSP 1 Score: 486.5 bits (1251), Expect = 2.0e-133
Identity = 247/361 (68.42%), Positives = 286/361 (79.22%), Query Frame = 0
Query: 12 HQLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLV 71
HQL +ISGNG S+ K + +S ++EEG LE P WSG + LSRLV
Sbjct: 8 HQLALISGNGQQRTRSHSRTKR--GFIKFQASTSSEAAVFEEGQLERPNWSGQTPLSRLV 67
Query: 72 GGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYP 131
G LISFKP +SILKL ARQV+I TA K N PW+ + S+ILES +VYK E VQNPSIVYP
Sbjct: 68 GALISFKPLYSILKLGARQVLISTAEKKNIPWRNLTSDILES-DVYKELESVQNPSIVYP 127
Query: 132 HYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHH 191
YYLKPFHAYDEG+LSWLAAAEV+ ATMSM++RAVPNAS +DEAKE+VFGNWLR+I+EHH
Sbjct: 128 DYYLKPFHAYDEGNLSWLAAAEVQPATMSMIMRAVPNASSVDEAKEIVFGNWLRRIEEHH 187
Query: 192 MKYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKN 251
+KYS NPILDILDIGCSIG T LADKFP+AK TGLDLSPYFLAVAQ+ + K+ PRKN
Sbjct: 188 LKYSGNPILDILDIGCSIGFGTRQLADKFPTAKVTGLDLSPYFLAVAQYMDKKKA-PRKN 247
Query: 252 PIRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA- 311
IRWLH NGE TS PSRSFDLLSI+Y+FHECP IVN+LKESFR+LRPG TI+I D A
Sbjct: 248 AIRWLHGNGEDTSLPSRSFDLLSISYVFHECPHVAIVNILKESFRVLRPGGTIVITDQAS 307
Query: 312 ------ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVP 366
ELSP+++TLLKS EP++DEY+LTD+E +MREVGFVNV SRLTDPRHVT TATVP
Sbjct: 308 KSKVVQELSPVLFTLLKSTEPHLDEYHLTDLEEKMREVGFVNVTSRLTDPRHVTITATVP 364
BLAST of HG10001711 vs. NCBI nr
Match:
XP_008456428.1 (PREDICTED: uncharacterized protein LOC103496371 [Cucumis melo] >KAA0054425.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein [Cucumis melo var. makuwa] >TYK14611.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein [Cucumis melo var. makuwa])
HSP 1 Score: 478.0 bits (1229), Expect = 7.2e-131
Identity = 242/360 (67.22%), Positives = 286/360 (79.44%), Query Frame = 0
Query: 13 QLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVG 72
QL +ISG G+G G +S L ++ +S +YEEG LE P WSG + LSRLVG
Sbjct: 9 QLALISG-GNGQQRGGRISRSNRGLIKVQASTSSEVGVYEEGRLERPNWSGQTPLSRLVG 68
Query: 73 GLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPH 132
LISFKP +SILKL ARQV+I TA K N W+K+ S++LES +VYK + VQNPSIVYP
Sbjct: 69 ALISFKPLYSILKLGARQVLISTAEKKNISWRKLTSDVLES-DVYKELDSVQNPSIVYPD 128
Query: 133 YYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHM 192
YYLKPFHAYDEG+LSWLAAAEV+ ATMSM++RAVP AS +DEAKE+VFGNWLR+I+EHH+
Sbjct: 129 YYLKPFHAYDEGNLSWLAAAEVQPATMSMIMRAVPTASSVDEAKEIVFGNWLRRIEEHHL 188
Query: 193 KYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNP 252
KYS NPILDILDIGCS+G T LADKFP+AK TGLDLSPYFLAVAQ+ + K+T PRKN
Sbjct: 189 KYSGNPILDILDIGCSVGFGTRQLADKFPTAKVTGLDLSPYFLAVAQYMDKKKT-PRKNA 248
Query: 253 IRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA-- 312
IRWLH NGE T PSRSFDLLSI+Y+ HECP IVN+++ESFRLLRPG TI+I D A
Sbjct: 249 IRWLHGNGEDTGLPSRSFDLLSISYVLHECPHVAIVNIIRESFRLLRPGGTIVITDQASK 308
Query: 313 -----ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVPL 366
ELSP+++TLLKS EPY+DEY+LTD+E +MRE+GFVNV SRLTDPRHVT TATVPL
Sbjct: 309 SKVVQELSPVLFTLLKSTEPYLDEYHLTDLEEKMREIGFVNVTSRLTDPRHVTITATVPL 365
BLAST of HG10001711 vs. NCBI nr
Match:
XP_022149773.1 (uncharacterized protein LOC111018123 isoform X1 [Momordica charantia])
HSP 1 Score: 476.1 bits (1224), Expect = 2.7e-130
Identity = 241/362 (66.57%), Positives = 292/362 (80.66%), Query Frame = 0
Query: 12 HQLGVISGNGDGGINR-GSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRL 71
+++ + SGNG I R S ++ +I+ R SE+ ++EEG LE P WSG + LSRL
Sbjct: 8 YRIALTSGNGHRRIGRLRSSKRGLIEFQR--SARSEM-AVFEEGKLERPNWSGETSLSRL 67
Query: 72 VGGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVY 131
VG LISFKP FS+LKL AR+V+I TA K N PW+KM SEIL+S +VYK + VQ+ SIVY
Sbjct: 68 VGALISFKPLFSLLKLGARRVLISTAEKKNIPWRKMTSEILDS-DVYKELDSVQDLSIVY 127
Query: 132 PHYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEH 191
P YYLKPFHAYDEGHLSWLAAAE E ATMSM++RAVP+AS +DEAKE+VFGNWLR I+EH
Sbjct: 128 PDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVPDASSVDEAKEIVFGNWLRTIEEH 187
Query: 192 HMKYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRK 251
H++YS NPILDILDIGCS+GLST LADKFP AK TGLDLSPYFLAVAQ+ + K++ PRK
Sbjct: 188 HLQYSENPILDILDIGCSVGLSTRQLADKFPLAKVTGLDLSPYFLAVAQYMD-KKSAPRK 247
Query: 252 NPIRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA 311
N IRWLH N E++S PSRSFDL+SIA++FHECPQ IV++LKESFRLLRPG ++ D A
Sbjct: 248 NSIRWLHGNAENSSLPSRSFDLVSIAFMFHECPQVAIVSILKESFRLLRPGGEFVVTDQA 307
Query: 312 -------ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATV 366
ELSP+++TLLKS EPY+DEY+LTD+EGRMRE+GFVNV+S+LTDPRHVT TATV
Sbjct: 308 PKSKAVQELSPVLFTLLKSTEPYLDEYHLTDLEGRMREIGFVNVRSKLTDPRHVTVTATV 364
BLAST of HG10001711 vs. ExPASy Swiss-Prot
Match:
P49016 (Demethylmenaquinone methyltransferase OS=Lactococcus lactis subsp. lactis (strain IL1403) OX=272623 GN=menG PE=3 SV=1)
HSP 1 Score: 57.8 bits (138), Expect = 3.0e-07
Identity = 44/138 (31.88%), Positives = 66/138 (47.83%), Query Frame = 0
Query: 200 LDILDIGCSIGLSTTYLADKF-PSAKTTGLDLSPYFLAVAQHNENKRTHPRKNPIRWLHE 259
L ILD+ C G T L++ PS K GLD S L +A + K K I +L
Sbjct: 55 LSILDLCCGTGDWTFDLSESVGPSGKVIGLDFSENMLEIA---KAKLKEEAKKNIEFLQG 114
Query: 260 NGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNAELSPIIY- 319
N F SFD+++I Y P + V LKE FR+L+PG ++ I+ + + IY
Sbjct: 115 NAMALPFEKGSFDVVTIGYGLRNTPDYLTV--LKEIFRVLKPGGRVVCIETSHPTLPIYK 174
Query: 320 ----TLLKSVEPYVDEYY 332
K+V P++ + +
Sbjct: 175 QAFELYFKNVMPFLGKVF 187
BLAST of HG10001711 vs. ExPASy Swiss-Prot
Match:
Q8KF69 (Demethylmenaquinone methyltransferase OS=Chlorobaculum tepidum (strain ATCC 49652 / DSM 12025 / NBRC 103806 / TLS) OX=194439 GN=menG PE=3 SV=1)
HSP 1 Score: 47.4 bits (111), Expect = 4.1e-04
Identity = 40/131 (30.53%), Positives = 59/131 (45.04%), Query Frame = 0
Query: 202 ILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNPIRWLHENGE 261
ILD+ G +A K P AK TG DLSP LA+A ++ +P I +L E
Sbjct: 67 ILDVATGTGDLAASMA-KIPGAKVTGYDLSPEMLAIA-----RKKYPN---IEFLEGFAE 126
Query: 262 HTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIID-----NAELSPII 321
F RSF ++S + + +KE R+L+PG II+ NA + +
Sbjct: 127 KMPFDDRSFHVVSAGFGVRNFED--LAQGMKEFHRVLKPGGCAYIIEPMIPRNAVMKKLY 186
Query: 322 YTLLKSVEPYV 328
K+V P +
Sbjct: 187 LIYFKNVLPKI 186
BLAST of HG10001711 vs. ExPASy TrEMBL
Match:
A0A6J1D9E9 (uncharacterized protein LOC111018124 OS=Momordica charantia OX=3673 GN=LOC111018124 PE=4 SV=1)
HSP 1 Score: 524.2 bits (1349), Expect = 4.2e-145
Identity = 269/359 (74.93%), Positives = 295/359 (82.17%), Query Frame = 0
Query: 13 QLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVG 72
QL +IS NG R + R I ++ + +++ ++EEG+LEHP WSG + LSRLVG
Sbjct: 9 QLPIISANG----QRRTARPRGITMDSMRSKAT----VFEEGHLEHPNWSGETPLSRLVG 68
Query: 73 GLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPH 132
LISFKP +S+LKL AR+VMI TA KNN PW+KM SEILES +VYK E VQN SIVYP
Sbjct: 69 ALISFKPLYSVLKLAARRVMISTAEKNNIPWRKMTSEILESADVYKELESVQNLSIVYPD 128
Query: 133 YYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHM 192
YYLKPFHAYDEGHLSWLAAAEVE ATMSMV+RAVP AS LDEAKEVVFGNWLR +DEHH
Sbjct: 129 YYLKPFHAYDEGHLSWLAAAEVEPATMSMVMRAVPEASSLDEAKEVVFGNWLRTVDEHHR 188
Query: 193 KYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNP 252
+YS NPIL ILDIGCSIGLST YLADKFP AKTTGLDLSPYFLAVAQ+ E KRT PRKN
Sbjct: 189 QYSGNPILHILDIGCSIGLSTRYLADKFPLAKTTGLDLSPYFLAVAQYMEKKRTAPRKNA 248
Query: 253 IRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA-- 312
IRWLHENGEHTS PS+SFDL+SIAYLFHECPQ IVNLLKESFRLLRPG TI I DNA
Sbjct: 249 IRWLHENGEHTSLPSQSFDLVSIAYLFHECPQVAIVNLLKESFRLLRPGGTIAITDNAPK 308
Query: 313 -----ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVP 365
ELSPII+TLLKS EPY+DEYYLTD+EGRMREVGFVNVKSRLTDPRHVT TATVP
Sbjct: 309 SKITQELSPIIFTLLKSTEPYLDEYYLTDLEGRMREVGFVNVKSRLTDPRHVTVTATVP 359
BLAST of HG10001711 vs. ExPASy TrEMBL
Match:
A0A5D3CS29 (S-adenosyl-L-methionine-dependent methyltransferases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold552G001220 PE=4 SV=1)
HSP 1 Score: 478.0 bits (1229), Expect = 3.5e-131
Identity = 242/360 (67.22%), Positives = 286/360 (79.44%), Query Frame = 0
Query: 13 QLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVG 72
QL +ISG G+G G +S L ++ +S +YEEG LE P WSG + LSRLVG
Sbjct: 9 QLALISG-GNGQQRGGRISRSNRGLIKVQASTSSEVGVYEEGRLERPNWSGQTPLSRLVG 68
Query: 73 GLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPH 132
LISFKP +SILKL ARQV+I TA K N W+K+ S++LES +VYK + VQNPSIVYP
Sbjct: 69 ALISFKPLYSILKLGARQVLISTAEKKNISWRKLTSDVLES-DVYKELDSVQNPSIVYPD 128
Query: 133 YYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHM 192
YYLKPFHAYDEG+LSWLAAAEV+ ATMSM++RAVP AS +DEAKE+VFGNWLR+I+EHH+
Sbjct: 129 YYLKPFHAYDEGNLSWLAAAEVQPATMSMIMRAVPTASSVDEAKEIVFGNWLRRIEEHHL 188
Query: 193 KYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNP 252
KYS NPILDILDIGCS+G T LADKFP+AK TGLDLSPYFLAVAQ+ + K+T PRKN
Sbjct: 189 KYSGNPILDILDIGCSVGFGTRQLADKFPTAKVTGLDLSPYFLAVAQYMDKKKT-PRKNA 248
Query: 253 IRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA-- 312
IRWLH NGE T PSRSFDLLSI+Y+ HECP IVN+++ESFRLLRPG TI+I D A
Sbjct: 249 IRWLHGNGEDTGLPSRSFDLLSISYVLHECPHVAIVNIIRESFRLLRPGGTIVITDQASK 308
Query: 313 -----ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVPL 366
ELSP+++TLLKS EPY+DEY+LTD+E +MRE+GFVNV SRLTDPRHVT TATVPL
Sbjct: 309 SKVVQELSPVLFTLLKSTEPYLDEYHLTDLEEKMREIGFVNVTSRLTDPRHVTITATVPL 365
BLAST of HG10001711 vs. ExPASy TrEMBL
Match:
A0A1S3C2T1 (uncharacterized protein LOC103496371 OS=Cucumis melo OX=3656 GN=LOC103496371 PE=4 SV=1)
HSP 1 Score: 478.0 bits (1229), Expect = 3.5e-131
Identity = 242/360 (67.22%), Positives = 286/360 (79.44%), Query Frame = 0
Query: 13 QLGVISGNGDGGINRGSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRLVG 72
QL +ISG G+G G +S L ++ +S +YEEG LE P WSG + LSRLVG
Sbjct: 9 QLALISG-GNGQQRGGRISRSNRGLIKVQASTSSEVGVYEEGRLERPNWSGQTPLSRLVG 68
Query: 73 GLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVYPH 132
LISFKP +SILKL ARQV+I TA K N W+K+ S++LES +VYK + VQNPSIVYP
Sbjct: 69 ALISFKPLYSILKLGARQVLISTAEKKNISWRKLTSDVLES-DVYKELDSVQNPSIVYPD 128
Query: 133 YYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEHHM 192
YYLKPFHAYDEG+LSWLAAAEV+ ATMSM++RAVP AS +DEAKE+VFGNWLR+I+EHH+
Sbjct: 129 YYLKPFHAYDEGNLSWLAAAEVQPATMSMIMRAVPTASSVDEAKEIVFGNWLRRIEEHHL 188
Query: 193 KYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRKNP 252
KYS NPILDILDIGCS+G T LADKFP+AK TGLDLSPYFLAVAQ+ + K+T PRKN
Sbjct: 189 KYSGNPILDILDIGCSVGFGTRQLADKFPTAKVTGLDLSPYFLAVAQYMDKKKT-PRKNA 248
Query: 253 IRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA-- 312
IRWLH NGE T PSRSFDLLSI+Y+ HECP IVN+++ESFRLLRPG TI+I D A
Sbjct: 249 IRWLHGNGEDTGLPSRSFDLLSISYVLHECPHVAIVNIIRESFRLLRPGGTIVITDQASK 308
Query: 313 -----ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATVPL 366
ELSP+++TLLKS EPY+DEY+LTD+E +MRE+GFVNV SRLTDPRHVT TATVPL
Sbjct: 309 SKVVQELSPVLFTLLKSTEPYLDEYHLTDLEEKMREIGFVNVTSRLTDPRHVTITATVPL 365
BLAST of HG10001711 vs. ExPASy TrEMBL
Match:
A0A6J1D7N3 (uncharacterized protein LOC111018123 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018123 PE=4 SV=1)
HSP 1 Score: 476.1 bits (1224), Expect = 1.3e-130
Identity = 241/362 (66.57%), Positives = 292/362 (80.66%), Query Frame = 0
Query: 12 HQLGVISGNGDGGINR-GSKRKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRL 71
+++ + SGNG I R S ++ +I+ R SE+ ++EEG LE P WSG + LSRL
Sbjct: 8 YRIALTSGNGHRRIGRLRSSKRGLIEFQR--SARSEM-AVFEEGKLERPNWSGETSLSRL 67
Query: 72 VGGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVY 131
VG LISFKP FS+LKL AR+V+I TA K N PW+KM SEIL+S +VYK + VQ+ SIVY
Sbjct: 68 VGALISFKPLFSLLKLGARRVLISTAEKKNIPWRKMTSEILDS-DVYKELDSVQDLSIVY 127
Query: 132 PHYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEH 191
P YYLKPFHAYDEGHLSWLAAAE E ATMSM++RAVP+AS +DEAKE+VFGNWLR I+EH
Sbjct: 128 PDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVPDASSVDEAKEIVFGNWLRTIEEH 187
Query: 192 HMKYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRK 251
H++YS NPILDILDIGCS+GLST LADKFP AK TGLDLSPYFLAVAQ+ + K++ PRK
Sbjct: 188 HLQYSENPILDILDIGCSVGLSTRQLADKFPLAKVTGLDLSPYFLAVAQYMD-KKSAPRK 247
Query: 252 NPIRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA 311
N IRWLH N E++S PSRSFDL+SIA++FHECPQ IV++LKESFRLLRPG ++ D A
Sbjct: 248 NSIRWLHGNAENSSLPSRSFDLVSIAFMFHECPQVAIVSILKESFRLLRPGGEFVVTDQA 307
Query: 312 -------ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATV 366
ELSP+++TLLKS EPY+DEY+LTD+EGRMRE+GFVNV+S+LTDPRHVT TATV
Sbjct: 308 PKSKAVQELSPVLFTLLKSTEPYLDEYHLTDLEGRMREIGFVNVRSKLTDPRHVTVTATV 364
BLAST of HG10001711 vs. ExPASy TrEMBL
Match:
A0A6J1FY99 (uncharacterized protein LOC111449047 OS=Cucurbita moschata OX=3662 GN=LOC111449047 PE=4 SV=1)
HSP 1 Score: 474.9 bits (1221), Expect = 2.9e-130
Identity = 247/362 (68.23%), Positives = 284/362 (78.45%), Query Frame = 0
Query: 12 HQLGVISGNGDGGINRGSK-RKSIIKLNRIMPRSSELRMMYEEGNLEHPKWSGNSLLSRL 71
HQL +ISG G R + R+ IK SSE+ ++EEG LE P W+G + LSRL
Sbjct: 8 HQLALISGKGQQKTGRLCRTRRGFIKAQ--ASTSSEV-AVFEEGQLERPNWAGETPLSRL 67
Query: 72 VGGLISFKPFFSILKLTARQVMIGTAAKNNFPWQKMRSEILESEEVYKVFEKVQNPSIVY 131
VG LISFKP +SILKL ARQV I TA K N PW+K+ S+ILES +VYK E VQN SIVY
Sbjct: 68 VGALISFKPVYSILKLGARQVFISTAEKKNIPWRKISSDILES-DVYKELESVQNLSIVY 127
Query: 132 PHYYLKPFHAYDEGHLSWLAAAEVEAATMSMVIRAVPNASCLDEAKEVVFGNWLRQIDEH 191
P YYLKPFHAYDEG+LSWLAAAE E ATMSM++RAVP A+ +DEAK+VVFGNWLR I+EH
Sbjct: 128 PDYYLKPFHAYDEGNLSWLAAAEAEPATMSMIMRAVPTATSVDEAKKVVFGNWLRTIEEH 187
Query: 192 HMKYSNNPILDILDIGCSIGLSTTYLADKFPSAKTTGLDLSPYFLAVAQHNENKRTHPRK 251
H++YS NPILDILDIGCS+G T LADKFP+AK TGLDLSPYFLAVAQ+ + KR PRK
Sbjct: 188 HLQYSKNPILDILDIGCSVGFGTRQLADKFPTAKVTGLDLSPYFLAVAQYMDKKRA-PRK 247
Query: 252 NPIRWLHENGEHTSFPSRSFDLLSIAYLFHECPQTIIVNLLKESFRLLRPGSTIIIIDNA 311
N IRWLH NGE T PSRSFDLLSIAYLFHECPQ IVN+LKESFRLLRPG TI+I D A
Sbjct: 248 NAIRWLHGNGEETGLPSRSFDLLSIAYLFHECPQVAIVNILKESFRLLRPGGTIVITDQA 307
Query: 312 -------ELSPIIYTLLKSVEPYVDEYYLTDVEGRMREVGFVNVKSRLTDPRHVTFTATV 366
ELSP+++TLLKS EP++DEY+LTD+E +M +VGFVNV SRLTDPRHVT TATV
Sbjct: 308 SKSKAVQELSPVLFTLLKSTEPHLDEYHLTDLEEKMSQVGFVNVTSRLTDPRHVTITATV 364
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038902342.1 | 1.9e-152 | 75.72 | uncharacterized protein LOC120088975 [Benincasa hispida]
XP_022149776.1 | 8.7e-145 | 74.93 | uncharacterized protein LOC111018124 [Momordica charantia]
XP_038902341.1 | 2.0e-133 | 68.42 | uncharacterized protein LOC120088974 [Benincasa hispida]
XP_008456428.1 | 7.2e-131 | 67.22 | PREDICTED: uncharacterized protein LOC103496371 [Cucumis melo] >KAA0054425.1 S-a...
XP_022149773.1 | 2.7e-130 | 66.57 | uncharacterized protein LOC111018123 isoform X1 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
P49016 | 3.0e-07 | 31.88 | Demethylmenaquinone methyltransferase OS=Lactococcus lactis subsp. lactis (strai...
Q8KF69 | 4.1e-04 | 30.53 | Demethylmenaquinone methyltransferase OS=Chlorobaculum tepidum (strain ATCC 4965...
Match Name | E-value | Identity (%) | Description
A0A6J1D9E9 | 4.2e-145 | 74.93 | uncharacterized protein LOC111018124 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A5D3CS29 | 3.5e-131 | 67.22 | S-adenosyl-L-methionine-dependent methyltransferases superfamily protein OS=Cucu...
A0A1S3C2T1 | 3.5e-131 | 67.22 | uncharacterized protein LOC103496371 OS=Cucumis melo OX=3656 GN=LOC103496371 PE=...
A0A6J1D7N3 | 1.3e-130 | 66.57 | uncharacterized protein LOC111018123 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1FY99 | 2.9e-130 | 68.23 | uncharacterized protein LOC111449047 OS=Cucurbita moschata OX=3662 GN=LOC1114490...
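The summary rows above are pipe-delimited, so they can be read programmatically and ranked by E-value. A hedged Python sketch follows: the `Hit` type and `parse_row` helper are hypothetical names, and the sample rows are copied from the NCBI nr summary on this page. Only the first four cells of a row are used, so any trailing cells are ignored.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(row: str) -> Hit:
    """Parse one 'Match Name | E-value | Identity | Description' row."""
    cells = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = cells[:4]
    return Hit(name, float(evalue), float(identity), description)


rows = [
    "XP_038902341.1 | 2.0e-133 | 68.42 | uncharacterized protein LOC120088974 [Benincasa hispida]",
    "XP_038902342.1 | 1.9e-152 | 75.72 | uncharacterized protein LOC120088975 [Benincasa hispida]",
    "XP_022149776.1 | 8.7e-145 | 74.93 | uncharacterized protein LOC111018124 [Momordica charantia]",
]

# Smaller E-values indicate more significant matches, so sort ascending.
hits = sorted((parse_row(r) for r in rows), key=lambda h: h.evalue)
for hit in hits:
    print(hit.name, hit.evalue, hit.identity)
```

Sorting by E-value rather than identity keeps the ranking consistent with the order in which the hits are listed on the page.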