Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAGCAAAGAGCAGGAGAATTTTGTAGTCAGTCCATGCAAATGTGCATGCAAGATAGAAAAACAACAAAATCAACAGGATATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGTACCCAACGCATTGGGATGAACTCATGGGAATTGTAGATGGAAGTCAAGCACCTCGCCTTGAGGTAATAATAATGGGCGGCTGCACCTAATCCCAAGCAGCCCTTTTTTATACATAGTAGAAACAATATAAAAATAATCAATCTTTTTTTAAAAAAACTATCACATGTATGACGGAACAATAGATGAGTGCAGTCCAACACCCAAGTTTTTCTCATAATAATAAGAGAAGTACTTAGGTTCTACCTAGGTGTGTGCTACCTAAGCTGCACCTATTTTTGTGCAGCCCAGACAACTATTTAATAAAAAGTTGTCACGTGGCAATTTTTTTTTAACGTGTGTAAAGAAAAGAAGAGCATGGACTAGGTGCAGCCTGACTGCACCTAATCATTGCTCTAATGATAATATAATGCAAATGGTAGAGAGAATAGAGAGTGAAAAATGAGATATTTTGTTTTTGCTAACTTTGGAAAATCAATGTGCAGATTATTCTACTCAACTTCAGGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACGTAAGTTCACAAAAACTAAACTTTATCCTAGTTTTTATCAAAGAGCTCAACATACTGTTAAACCAATTTCATTTTTCTTTTCTTCACTTTTGAAGCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGTAAAGTTCTCATCCCATACCATCATTATTCAGACATTTTCTTTCTTTTTTATTTTTTCTGAAACCATCCATTCATTCTGAAGGAACAATTTAAAGAGCATGGATAGATCTTACATAGGAAATTATAGACTCAAGCTCTAAATAGATGTTGAAATATCGAAAACTGACACATAATATTTGTCCAACTATTTTGATACTGAATGATAAAACATATCTTTTCCAATCTCTTAGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCGGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTGTAAGGCCCTTCATTAATAATGTCTTCTATTAATTATAGATTATGTTTCAGCTTAATTACCTTAGCAAGTTCAAAACATAACAAATAGGGTTGGATTGTCTTTTTCAGAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATGTTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGGTAATAATATCAGGAAAATCCACAAATAAAGCAATGACACCATGCAATTGTATAACTTATAAGGTCGGCCTTCTAATTGAAAGAAAGTTACATTCATTCTTCAGCAATAAAAATTTGTCCTCAAAGCCTTCTCCATCTCCATAACATTGGAATTTGAACTAGAAAAAAGTTTGGATTCTGCCTAGATGCACCTATCTTTGTGCAGCCCAGGCAGTTGGACAATAAAAAGATAAATTATTTTTTATGTTTCTTTTTTTGTGTGTATAAATAGAGAGAAGCATGCGGCTGCACCTAGTGAGGTTGTACCTAATCATTACCCTTTGAACTATTAGCTAAATGAAATCTGCAAGTGATCAGTTTTGAAGTCAAAATAAATTGACACAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTAAAATAAATAAAAAGATGATTTATATTGCCTAGCCTTTTCAAAACTGAATTCGAAAATTTTCTCAATGTCTTCAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAA
mRNA sequence
ATGGAAAGCAAAGAGCAGGAGAATTTTGTAGTCAGTCCATGCAAATGTGCATGCAAGATAGAAAAACAACAAAATCAACAGGATATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGTACCCAACGCATTGGGATGAACTCATGGGAATTGTAGATGGAAGTCAAGCACCTCGCCTTGAGATTATTCTACTCAACTTCAGGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCGGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATGTTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAA
Coding sequence (CDS)
ATGGAAAGCAAAGAGCAGGAGAATTTTGTAGTCAGTCCATGCAAATGTGCATGCAAGATAGAAAAACAACAAAATCAACAGGATATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGTACCCAACGCATTGGGATGAACTCATGGGAATTGTAGATGGAAGTCAAGCACCTCGCCTTGAGATTATTCTACTCAACTTCAGGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCGGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATGTTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAA
Protein sequence
MESKEQENFVVSPCKCACKIEKQQNQQDMENKEVENPEVGSPCKIEKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTDDCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSELKKP
Homology
BLAST of HG10018526 vs. NCBI nr
Match:
XP_038886333.1 (uncharacterized protein LOC120076545 isoform X2 [Benincasa hispida])
HSP 1 Score: 758.1 bits (1956), Expect = 4.0e-215
Identity = 379/412 (91.99%), Postives = 397/412 (96.36%), Query Frame = 0
Query: 1 MESKEQENFVVSPCKCACKIEKQQNQQDMENKEVENPEVGSPCKIEK---QEDMENKELE 60
MESKEQENFVVSPCKCAC+IEKQQ QQDMENKEVEN EVG PC I K Q+DMENKELE
Sbjct: 1 MESKEQENFVVSPCKCACEIEKQQKQQDMENKEVENSEVG-PCGIGKQQNQQDMENKELE 60
Query: 61 MFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKNNK 120
MFEVGPCES YQMGFLIGKRFSDTIRSRLEKDMVLR+QL+PFAQTPESKPLIE+LCKNNK
Sbjct: 61 MFEVGPCESPYQMGFLIGKRFSDTIRSRLEKDMVLREQLVPFAQTPESKPLIESLCKNNK 120
Query: 121 AKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTDDCSDILVVSDSL 180
AKYPTHWDELMGIVDGS+AP LEIILLNFRREILPFIQDEASIVDCTDDCSD+L+VSDSL
Sbjct: 121 AKYPTHWDELMGIVDGSEAPALEIILLNFRREILPFIQDEASIVDCTDDCSDLLIVSDSL 180
Query: 181 ALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP 240
ALAAHNEDASFGLSGYTYLIKGKLQNG CYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP
Sbjct: 181 ALAAHNEDASFGLSGYTYLIKGKLQNGICYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP 240
Query: 241 VNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVETAS 300
VNDEI+PGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVID+QTRRIASVETAS
Sbjct: 241 VNDEIDPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRRIASVETAS 300
Query: 301 KFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVIGDTD 360
K+RLSVHEV ATPFFHANMYSHLQ ++QIIDENSTSR +RADVM+KE+KDDFLSVIGDTD
Sbjct: 301 KYRLSVHEVGATPFFHANMYSHLQTVKQIIDENSTSRKKRADVMSKESKDDFLSVIGDTD 360
Query: 361 NEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSELKKP 410
N+EYPIYMKGPKLYTMCSVLIDLDEE LSIFQGNPKNKEI+HVFSLSELKKP
Sbjct: 361 NKEYPIYMKGPKLYTMCSVLIDLDEEMLSIFQGNPKNKEITHVFSLSELKKP 411
BLAST of HG10018526 vs. NCBI nr
Match:
XP_038886332.1 (uncharacterized protein LOC120076545 isoform X1 [Benincasa hispida])
HSP 1 Score: 744.6 bits (1921), Expect = 4.6e-211
Identity = 379/436 (86.93%), Postives = 397/436 (91.06%), Query Frame = 0
Query: 1 MESKEQENFVVSPCKCACKIEKQQNQQDMENKEVENPEVGSPCKIEK---QEDMENKELE 60
MESKEQENFVVSPCKCAC+IEKQQ QQDMENKEVEN EVG PC I K Q+DMENKELE
Sbjct: 1 MESKEQENFVVSPCKCACEIEKQQKQQDMENKEVENSEVG-PCGIGKQQNQQDMENKELE 60
Query: 61 MFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKNNK 120
MFEVGPCES YQMGFLIGKRFSDTIRSRLEKDMVLR+QL+PFAQTPESKPLIE+LCKNNK
Sbjct: 61 MFEVGPCESPYQMGFLIGKRFSDTIRSRLEKDMVLREQLVPFAQTPESKPLIESLCKNNK 120
Query: 121 AKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTDDCSDILVVSDSL 180
AKYPTHWDELMGIVDGS+AP LEIILLNFRREILPFIQDEASIVDCTDDCSD+L+VSDSL
Sbjct: 121 AKYPTHWDELMGIVDGSEAPALEIILLNFRREILPFIQDEASIVDCTDDCSDLLIVSDSL 180
Query: 181 ALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP 240
ALAAHNEDASFGLSGYTYLIKGKLQNG CYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP
Sbjct: 181 ALAAHNEDASFGLSGYTYLIKGKLQNGICYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRP 240
Query: 241 VNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVETAS 300
VNDEI+PGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVID+QTRRIASVETAS
Sbjct: 241 VNDEIDPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRRIASVETAS 300
Query: 301 KFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVIGDTD 360
K+RLSVHEV ATPFFHANMYSHLQ ++QIIDENSTSR +RADVM+KE+KDDFLSVIGDTD
Sbjct: 301 KYRLSVHEVGATPFFHANMYSHLQTVKQIIDENSTSRKKRADVMSKESKDDFLSVIGDTD 360
Query: 361 NEEYPIYMK------------------------GPKLYTMCSVLIDLDEETLSIFQGNPK 410
N+EYPIYMK GPKLYTMCSVLIDLDEE LSIFQGNPK
Sbjct: 361 NKEYPIYMKGKYKDDLYCLVLSKLNSKIFLMSSGPKLYTMCSVLIDLDEEMLSIFQGNPK 420
BLAST of HG10018526 vs. NCBI nr
Match:
KAG7015933.1 (hypothetical protein SDJN02_21037, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 627.1 bits (1616), Expect = 1.1e-175
Identity = 307/363 (84.57%), Postives = 337/363 (92.84%), Query Frame = 0
Query: 46 EKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPES 105
E++ +E KELE+FEVGPCE AYQMGFLIG RFSDTIR RLE+DMVLR LLPFAQ P++
Sbjct: 6 EQENAVETKELEVFEVGPCECAYQMGFLIGNRFSDTIRRRLEQDMVLRHHLLPFAQAPQT 65
Query: 106 KPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTD 165
KPLIEA+C NNKAKYP+HWDELMGIVDGSQAP LEII+LNFRREIL IQDEASIVDCTD
Sbjct: 66 KPLIEAICNNNKAKYPSHWDELMGIVDGSQAPLLEIIVLNFRREILSCIQDEASIVDCTD 125
Query: 166 DCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNS 225
+CSD+LVVSDSLALAAHNEDA FGLSGYTYLIK +LQNG C+IAY+HAGE+PSRAFSFNS
Sbjct: 126 ECSDVLVVSDSLALAAHNEDALFGLSGYTYLIKARLQNGICFIAYSHAGELPSRAFSFNS 185
Query: 226 NGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 285
NGLAFTMN VRP+NDEI+P AIGRNF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDV
Sbjct: 186 NGLAFTMNAVRPMNDEIDPRAIGRNFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 245
Query: 286 QTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKET 345
QTRRIASVE ASKFR+SVHEV ATPFFHANMYSHLQNI+QIIDENSTSR RRADVM KE+
Sbjct: 246 QTRRIASVEIASKFRVSVHEVGATPFFHANMYSHLQNIKQIIDENSTSRKRRADVMPKES 305
Query: 346 KDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSE 405
KDDFL V+GD +N+ +PIYMKGPKLYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL E
Sbjct: 306 KDDFLCVLGDAENKNFPIYMKGPKLYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQE 365
Query: 406 LKK 409
LK+
Sbjct: 366 LKE 368
BLAST of HG10018526 vs. NCBI nr
Match:
KAG6578351.1 (hypothetical protein SDJN03_22799, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 626.3 bits (1614), Expect = 1.8e-175
Identity = 308/363 (84.85%), Postives = 336/363 (92.56%), Query Frame = 0
Query: 46 EKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPES 105
E++ +E KELEMFEVGPCE AYQMGFLIG RFSDTIR RLE+DMVLR LLPFAQ P++
Sbjct: 6 EQENGVEIKELEMFEVGPCECAYQMGFLIGNRFSDTIRRRLEQDMVLRHHLLPFAQAPQT 65
Query: 106 KPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTD 165
KPLIEA+C NNKAKYP+HWDELMGIVDGSQAP LEII+LNFRREIL IQDEASIVDCTD
Sbjct: 66 KPLIEAICNNNKAKYPSHWDELMGIVDGSQAPLLEIIVLNFRREILSCIQDEASIVDCTD 125
Query: 166 DCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNS 225
+CSD+LVVSDSLALAAHNEDA FGLSGYTYLIK KLQNG C+IAY+HAGE+PSRAFSFNS
Sbjct: 126 ECSDVLVVSDSLALAAHNEDALFGLSGYTYLIKAKLQNGICFIAYSHAGELPSRAFSFNS 185
Query: 226 NGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 285
NGLAFTMN VRP+NDEI+P AIGRNF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDV
Sbjct: 186 NGLAFTMNAVRPMNDEIDPRAIGRNFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 245
Query: 286 QTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKET 345
QTRRIASVE ASKFR+SVHEV ATPFFHANMYSHLQNI+QIIDENSTSR RRADVM KE+
Sbjct: 246 QTRRIASVEIASKFRVSVHEVGATPFFHANMYSHLQNIKQIIDENSTSRKRRADVMPKES 305
Query: 346 KDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSE 405
K DFL V+GD +N+ +PIYMKGPKLYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL E
Sbjct: 306 KHDFLCVLGDAENKNFPIYMKGPKLYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQE 365
Query: 406 LKK 409
LK+
Sbjct: 366 LKE 368
BLAST of HG10018526 vs. NCBI nr
Match:
XP_022938595.1 (uncharacterized protein LOC111444781 isoform X1 [Cucurbita moschata])
HSP 1 Score: 625.5 bits (1612), Expect = 3.1e-175
Identity = 306/363 (84.30%), Postives = 337/363 (92.84%), Query Frame = 0
Query: 46 EKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPES 105
E++ +E KELE+FEVGPCE AYQMGFLIG RFSDTIR RLE+DMVLR LLPFAQ P++
Sbjct: 6 EQENAVETKELEVFEVGPCECAYQMGFLIGNRFSDTIRRRLEQDMVLRHHLLPFAQAPQT 65
Query: 106 KPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTD 165
KPLIEA+C NNKAKYP+HWDELMGIVDGSQAP LEII+LNFRREIL IQDEASIVDCTD
Sbjct: 66 KPLIEAICNNNKAKYPSHWDELMGIVDGSQAPLLEIIVLNFRREILSCIQDEASIVDCTD 125
Query: 166 DCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNS 225
+CSD+LVVSDSLALAAHNEDA FGLSGYTYLIK +LQNG C+IAY+HAGE+PSRAFSFNS
Sbjct: 126 ECSDVLVVSDSLALAAHNEDALFGLSGYTYLIKARLQNGICFIAYSHAGELPSRAFSFNS 185
Query: 226 NGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 285
NGLAFTMN VRP+NDEI+P AIGRNF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDV
Sbjct: 186 NGLAFTMNAVRPMNDEIDPRAIGRNFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 245
Query: 286 QTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKET 345
QTRRIASVE ASKFR+SVHEV ATPFFHANMYSHLQNI+QIID+NSTSR RRADVM KE+
Sbjct: 246 QTRRIASVEIASKFRVSVHEVGATPFFHANMYSHLQNIKQIIDKNSTSRKRRADVMPKES 305
Query: 346 KDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSE 405
KDDFL V+GD +N+ +PIYMKGPKLYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL E
Sbjct: 306 KDDFLCVLGDAENKNFPIYMKGPKLYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQE 365
Query: 406 LKK 409
LK+
Sbjct: 366 LKE 368
BLAST of HG10018526 vs. ExPASy Swiss-Prot
Match:
P21133 (Acyl-coenzyme A:6-aminopenicillanic-acid-acyltransferase 40 kDa form OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) OX=227321 GN=penDE PE=1 SV=1)
HSP 1 Score: 48.9 bits (115), Expect = 1.6e-04
Identity = 76/321 (23.68%), Postives = 132/321 (41.12%), Query Frame = 0
Query: 104 ESKPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDC 163
E + L+ L + K ++P +++E+ GI G++ EI++LN R E +V+
Sbjct: 46 ELEQLLRELEQVMKQRWPRYYEEICGIAKGAEREVSEIVMLNTRTEF------AYGLVEA 105
Query: 164 TDDCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKL-QNGTCYI-AYTHAGEIPSRAF 223
D C+ + + + AL N D F + LI+ + Q G I T AG I
Sbjct: 106 RDGCTTVYCKTPNGALQGQNWD--FFTATKENLIQLTICQPGLPTIKMITEAGIIGK--V 165
Query: 224 SFNSNGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARI-------RSAEI 283
FNS G+A N + + P + + R LESTS +A +I SA I
Sbjct: 166 GFNSAGVAVNYNALH--LHGLRPTGLPSHLALRMALESTSPSEAYEKIVSQGGMAASAFI 225
Query: 284 SLGHNYNVIDVQTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSR 343
+G+ + ++ I+ + + + H L + +S SR
Sbjct: 226 MVGNAHEAYGLEFSPISLCKQVADTNGRIVHTNHCLLNHGPSAQELNPL-----PDSWSR 285
Query: 344 TRRADVMAK---ETKDDFLSVIGDTDNEEYPI-----YMKG-PKLYTMCSVLIDLDEETL 403
R + + TK+ F + D DN YP+ Y +G + T+ +++ D
Sbjct: 286 HGRMEHLLSGFDGTKEAFAKLWEDEDN--YPLSICRAYKEGKSRGSTLFNIVFDHVGRKA 345
Query: 404 SIFQGNPKNKEISHVFSLSEL 407
++ G P N + + V + S L
Sbjct: 346 TVRLGRPNNPDETFVMTFSNL 347
BLAST of HG10018526 vs. ExPASy TrEMBL
Match:
A0A6J1FK77 (uncharacterized protein LOC111444781 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444781 PE=4 SV=1)
HSP 1 Score: 625.5 bits (1612), Expect = 1.5e-175
Identity = 306/363 (84.30%), Postives = 337/363 (92.84%), Query Frame = 0
Query: 46 EKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPES 105
E++ +E KELE+FEVGPCE AYQMGFLIG RFSDTIR RLE+DMVLR LLPFAQ P++
Sbjct: 6 EQENAVETKELEVFEVGPCECAYQMGFLIGNRFSDTIRRRLEQDMVLRHHLLPFAQAPQT 65
Query: 106 KPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTD 165
KPLIEA+C NNKAKYP+HWDELMGIVDGSQAP LEII+LNFRREIL IQDEASIVDCTD
Sbjct: 66 KPLIEAICNNNKAKYPSHWDELMGIVDGSQAPLLEIIVLNFRREILSCIQDEASIVDCTD 125
Query: 166 DCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNS 225
+CSD+LVVSDSLALAAHNEDA FGLSGYTYLIK +LQNG C+IAY+HAGE+PSRAFSFNS
Sbjct: 126 ECSDVLVVSDSLALAAHNEDALFGLSGYTYLIKARLQNGICFIAYSHAGELPSRAFSFNS 185
Query: 226 NGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 285
NGLAFTMN VRP+NDEI+P AIGRNF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDV
Sbjct: 186 NGLAFTMNAVRPMNDEIDPRAIGRNFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 245
Query: 286 QTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKET 345
QTRRIASVE ASKFR+SVHEV ATPFFHANMYSHLQNI+QIID+NSTSR RRADVM KE+
Sbjct: 246 QTRRIASVEIASKFRVSVHEVGATPFFHANMYSHLQNIKQIIDKNSTSRKRRADVMPKES 305
Query: 346 KDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSE 405
KDDFL V+GD +N+ +PIYMKGPKLYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL E
Sbjct: 306 KDDFLCVLGDAENKNFPIYMKGPKLYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQE 365
Query: 406 LKK 409
LK+
Sbjct: 366 LKE 368
BLAST of HG10018526 vs. ExPASy TrEMBL
Match:
A0A6J1K003 (uncharacterized protein LOC111489413 OS=Cucurbita maxima OX=3661 GN=LOC111489413 PE=4 SV=1)
HSP 1 Score: 623.6 bits (1607), Expect = 5.7e-175
Identity = 307/363 (84.57%), Postives = 337/363 (92.84%), Query Frame = 0
Query: 46 EKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPES 105
E++ +E KELEMF+VGPCE AYQMGFLIG RFSDTIR RLE+DMVLR+ LLPFAQ P++
Sbjct: 6 EQENVVEIKELEMFQVGPCECAYQMGFLIGNRFSDTIRRRLEQDMVLRRHLLPFAQAPQT 65
Query: 106 KPLIEALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTD 165
KPLIEA+C NNKAKYP+HWDELMGIVDGSQAP LE+I+LNFRREIL IQDEASIVDCTD
Sbjct: 66 KPLIEAICNNNKAKYPSHWDELMGIVDGSQAPLLEVIVLNFRREILSCIQDEASIVDCTD 125
Query: 166 DCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNS 225
+CSD+LVVSDSLALAAHNEDA FGLSGYTYLIK +LQNG +IAY+HAGE+PSRAFSFNS
Sbjct: 126 ECSDVLVVSDSLALAAHNEDALFGLSGYTYLIKAELQNGIRFIAYSHAGELPSRAFSFNS 185
Query: 226 NGLAFTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 285
NGLAFTMN VRPVNDEI+P AIGRNF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDV
Sbjct: 186 NGLAFTMNAVRPVNDEIDPRAIGRNFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDV 245
Query: 286 QTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKET 345
QTRRIASVE ASKFR+SVHEV ATPFFHANMYSHLQNI+QIIDENSTSR RRADVM KE+
Sbjct: 246 QTRRIASVEIASKFRVSVHEVGATPFFHANMYSHLQNIKQIIDENSTSRKRRADVMPKES 305
Query: 346 KDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSE 405
KDDFL V+GD DN+ +PIYMKGPKLYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL E
Sbjct: 306 KDDFLCVLGDADNKNFPIYMKGPKLYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQE 365
Query: 406 LKK 409
LK+
Sbjct: 366 LKE 368
BLAST of HG10018526 vs. ExPASy TrEMBL
Match:
A0A6J1FDL0 (uncharacterized protein LOC111444781 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444781 PE=4 SV=1)
HSP 1 Score: 592.8 bits (1527), Expect = 1.1e-165
Identity = 290/339 (85.55%), Postives = 317/339 (93.51%), Query Frame = 0
Query: 70 MGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKNNKAKYPTHWDELMG 129
MGFLIG RFSDTIR RLE+DMVLR LLPFAQ P++KPLIEA+C NNKAKYP+HWDELMG
Sbjct: 1 MGFLIGNRFSDTIRRRLEQDMVLRHHLLPFAQAPQTKPLIEAICNNNKAKYPSHWDELMG 60
Query: 130 IVDGSQAPRLEIILLNFRREILPFIQDEASIVDCTDDCSDILVVSDSLALAAHNEDASFG 189
IVDGSQAP LEII+LNFRREIL IQDEASIVDCTD+CSD+LVVSDSLALAAHNEDA FG
Sbjct: 61 IVDGSQAPLLEIIVLNFRREILSCIQDEASIVDCTDECSDVLVVSDSLALAAHNEDALFG 120
Query: 190 LSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRPVNDEIEPGAIGR 249
LSGYTYLIK +LQNG C+IAY+HAGE+PSRAFSFNSNGLAFTMN VRP+NDEI+P AIGR
Sbjct: 121 LSGYTYLIKARLQNGICFIAYSHAGELPSRAFSFNSNGLAFTMNAVRPMNDEIDPRAIGR 180
Query: 250 NFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVETASKFRLSVHEVEAT 309
NF+SRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVE ASKFR+SVHEV AT
Sbjct: 181 NFLSRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASVEIASKFRVSVHEVGAT 240
Query: 310 PFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVIGDTDNEEYPIYMKGPK 369
PFFHANMYSHLQNI+QIID+NSTSR RRADVM KE+KDDFL V+GD +N+ +PIYMKGPK
Sbjct: 241 PFFHANMYSHLQNIKQIIDKNSTSRKRRADVMPKESKDDFLCVLGDAENKNFPIYMKGPK 300
Query: 370 LYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSELKK 409
LYTMCSVLIDLDE+TLSIF+GNPKNKEI+HVFSL ELK+
Sbjct: 301 LYTMCSVLIDLDEQTLSIFRGNPKNKEITHVFSLQELKE 339
BLAST of HG10018526 vs. ExPASy TrEMBL
Match:
A0A6J1BWT2 (uncharacterized protein LOC111006042 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111006042 PE=4 SV=1)
HSP 1 Score: 522.7 bits (1345), Expect = 1.4e-144
Identity = 254/358 (70.95%), Postives = 306/358 (85.47%), Query Frame = 0
Query: 55 ELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCK 114
E+EMFEVGPCESAYQMGF IG+RFSDTI+SRL+ D+VLR QLLPFAQTP+S+PLI+ALC
Sbjct: 2 EMEMFEVGPCESAYQMGFSIGERFSDTIKSRLDNDLVLRDQLLPFAQTPQSQPLIQALCN 61
Query: 115 NNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEA-SIVDCTDDCSDILVV 174
NNK K+P +WDEL+GI +GS P LEIIL+NFR+EILPF+Q E S+VDC+DDCSDILVV
Sbjct: 62 NNKNKFPRYWDELIGIAEGSGVPILEIILINFRKEILPFLQKEVPSVVDCSDDCSDILVV 121
Query: 175 SDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMN 234
SDS+A+AAHNEDA+ L G+TYL+K KLQNG ++AYT+AGE+PS AF FN +GLAFT+N
Sbjct: 122 SDSMAIAAHNEDANVALVGHTYLVKAKLQNGLSFLAYTYAGELPSCAFGFNGHGLAFTLN 181
Query: 235 TVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRRIASV 294
+V P NDE+ GAIGRNFISRDLLE+ S E+AI RIRSAE+S+GH+YN+IDVQTRRI +V
Sbjct: 182 SVPPTNDEVAAGAIGRNFISRDLLETISLENAILRIRSAEVSVGHSYNLIDVQTRRIVNV 241
Query: 295 ETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVI 354
ETAS+FR S E+ ATPFFHANMY+HLQ I Q+ DENSTSR RRADV+ K ++ +FLS++
Sbjct: 242 ETASRFRFSAKEIGATPFFHANMYTHLQ-INQVQDENSTSRQRRADVLPKGSRSEFLSIL 301
Query: 355 GDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS--ELKKP 410
GDTDN++YPIYM GP LYT+C+ LIDLDE+TLSI QGNPK ISHVFSL LKKP
Sbjct: 302 GDTDNKKYPIYMTGPTLYTLCTALIDLDEQTLSIIQGNPKKNVISHVFSLPLVGLKKP 358
BLAST of HG10018526 vs. ExPASy TrEMBL
Match:
A0A6J1JUL7 (acyl-coenzyme A:6-aminopenicillanic-acid-acyltransferase 40 kDa form-like OS=Cucurbita maxima OX=3661 GN=LOC111489044 PE=4 SV=1)
HSP 1 Score: 521.5 bits (1342), Expect = 3.1e-144
Identity = 249/362 (68.78%), Postives = 312/362 (86.19%), Query Frame = 0
Query: 51 MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 110
M K++EMFEVGPCESAYQMGFLIGKRFSDTI+SRL+KD+VLR QLLPFAQ P+S+PLIE
Sbjct: 1 MAGKKVEMFEVGPCESAYQMGFLIGKRFSDTIKSRLDKDLVLRDQLLPFAQAPQSQPLIE 60
Query: 111 ALCKNNKAKYPTHWDELMGIVDGSQAPRLEIILLNFRREILPFIQDEASI-VDCTDDCSD 170
ALC NNK ++PT+WDEL+GI +GS P LEIIL+NFR+EILPF+ E ++ VDC+DDCSD
Sbjct: 61 ALCNNNKTRFPTYWDELVGIAEGSSVPVLEIILINFRKEILPFLHREVALAVDCSDDCSD 120
Query: 171 ILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLA 230
+LVVS+++A+AAHNEDA+ L G+TYL+KGKLQNG ++AYT+AGE+PS AF FN++GLA
Sbjct: 121 LLVVSENMAIAAHNEDANVALVGHTYLVKGKLQNGLSFLAYTYAGELPSCAFGFNNHGLA 180
Query: 231 FTMNTVRPVNDEIEPGAIGRNFISRDLLESTSFEDAIARIRSAEISLGHNYNVIDVQTRR 290
FT+N+V P N+E+ G IGRNFISRDLLESTS E+AI+R+ SAE+S+GH+YN+IDVQTRR
Sbjct: 181 FTLNSVPPTNEEVAAGGIGRNFISRDLLESTSLENAISRVHSAEVSVGHSYNLIDVQTRR 240
Query: 291 IASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDF 350
I +VETAS+FR SV+EV ATPFFHANMY+HLQ + Q+ D NS SR +RAD + K +K+DF
Sbjct: 241 IVNVETASRFRYSVNEVGATPFFHANMYTHLQ-VNQVEDANSRSRQKRADELPKGSKNDF 300
Query: 351 LSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFS--LSELK 410
+SV+GD DN+EYPIYM+GP LYT+C+ +IDLDE+TLSI QGNP+ ISHVFS L EL+
Sbjct: 301 MSVLGDVDNKEYPIYMRGPTLYTLCTAVIDLDEQTLSIIQGNPEKNVISHVFSIPLVELE 360
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038886333.1 | 4.0e-215 | 91.99 | uncharacterized protein LOC120076545 isoform X2 [Benincasa hispida] | [more] |
XP_038886332.1 | 4.6e-211 | 86.93 | uncharacterized protein LOC120076545 isoform X1 [Benincasa hispida] | [more] |
KAG7015933.1 | 1.1e-175 | 84.57 | hypothetical protein SDJN02_21037, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
KAG6578351.1 | 1.8e-175 | 84.85 | hypothetical protein SDJN03_22799, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022938595.1 | 3.1e-175 | 84.30 | uncharacterized protein LOC111444781 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
P21133 | 1.6e-04 | 23.68 | Acyl-coenzyme A:6-aminopenicillanic-acid-acyltransferase 40 kDa form OS=Emericel... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FK77 | 1.5e-175 | 84.30 | uncharacterized protein LOC111444781 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1K003 | 5.7e-175 | 84.57 | uncharacterized protein LOC111489413 OS=Cucurbita maxima OX=3661 GN=LOC111489413... | [more] |
A0A6J1FDL0 | 1.1e-165 | 85.55 | uncharacterized protein LOC111444781 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1BWT2 | 1.4e-144 | 70.95 | uncharacterized protein LOC111006042 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1JUL7 | 3.1e-144 | 68.78 | acyl-coenzyme A:6-aminopenicillanic-acid-acyltransferase 40 kDa form-like OS=Cuc... | [more] |
Match Name | E-value | Identity | Description | |