Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGCCCTACATTTCCATCCACCGCAGTCCACAGACCTTCTTCTCCTTCCTCTCAAATCCACGCCACTGACAAGAAGCTGAAGCAGAAGCATCAGCAGCAGAAGGGTCCTCGCAGCCCTTTCCATGTTCTCAATGGTATTTCCTTTTCCACCGCCTGCAACAGCTCCAGCATCGGCAGTGACGCTTCCTCCGCCTCCACCGAAGCCCCCAGAGGCTGTCTCAGGTTCTTTCTCTCTCATTCCTCTGCTTCTTCTAAAACCCCTGCTAATAAGCTTAAACCATCTTCCAAAACCCCCAAATCTACCTCCAATCTTCGCCCCATTAAGCCTCTCAGATCTAAGCCACTCAAGGAGAATGCTCCCAAACGGGCTGTTAAGAACTACTCTAGGGCTGCAAAACCTAGCTCCACTAAGTTGGACCCATTGAAGAAAAACTCCCCCTGTTTGTATAGATGGCCATCCGGGAAGAAACCCTGTTCATTAGCTACCCACAAATCTAAAATGTTGGCATCTGGTGGTGAGAAATTGGAGAAGCATGGGGCACATAGTGTAGTGAGGATGGTTGATGATGGTAAATGTGAACCTTCTGATCTTAATTTAGTTCCCAGTGATTTCAATTTCACTCCCATGCGTAAAATGGAAAACGGCTCTGGTTGTGATCCAACAGTTGATAAGGTTGTTGCCCTGGAGAATTCAAACACAGATCATAGCAAAACGCCTCCTGTTCAAGCCTCTGTATCACCAGAATTACAGTGTGGTTCAGCACTTATGCCTACAGCTACTCCCGTTTGCTATGGTGCTGGCTATGTCGTTTCTGGGGTCTCTGACAAGAGAAAGTGCAGACCGAGAGGGCTTCTTATTGTTGGAGAGAATAATGCGTCCATTTCTAAAGTCAAGCCTATTCAGATTTTTGAAGAAGATGAAACCATTACAAGGGACACTTCCAATTCCGTCGTTTTAAAGGTCCCTTCACCCATTGAAGCTTCTATGAATTGGCTTTTATCTCCCTGCAATGAGGAGGATGAGGATCACAAAGAGAAGTCTGAAAATGCTTCACCTCAGTGTAAGAACCTCGCTGAATCGATTGCACTTCGTTCTGTTCCTTCACCATCATCTATCGATGCTCTTCCTCCTGACATATACAGTCCTCCAGAGTTTCAAGGTTTCTTGGAGCCATTATCATTTGAGGACACCTCTCCCTCATGTGCTCCTAATTGCTTGGATGTTATTTTAAAGGAGGGAAGAGGACAACAGAGGTATCAAGTCAATAGGGAGAATTCTCCATTCTCAATTGACTCGTTAAGTAGTGATAATGTTATCCGGACACCTCAATCAGACTCAAATTCGGCTCAAAAAGTTTTCCCTCCGTGGTTAACTGCTGACAGTTATTTGAAATGTGATCAGAATTCAGCGCCTGAATTGTTTTCACCAGCAAATCTACCTCGAGACAACCCTAATGCAATAACGAGTATAACAGATTTAAGTTTCCAATTTGATTGTCTGGCCACAATATCCAATTCCATGGATCTTCATCAATTTCAAAAGATTCTTGAAGATCAGGCTTTTATGAATAGCAATTCCTCCTGTGAGGATTTGTTAAAATCCAAGATGAGAGTATCATGGAGGGAAGGGTTAATGAGCCGGATCTACGAGATGGATGAGTTCGATACTTGTCGGTGCTTGTCAGATGAAGAAGAGAATGTTGATTCTTGCAGCATTAGCTTGTCAGATATCCTTAAGACTCCTCTGGAGCATAATGATTGTGAGGCTGATCCTATAGTTTCTAACCGTTCTTGTTCTCCTGGATTATCAGTTGATGAGGAAGCCGAAGAATATGACAAATGTAAAGAAATGTGGTCTCATCAAGTATCTTGTTCTTGTGCGGAATCCATTAGCACTGATGGAGGTGGCTTGATTGCTTCAGGGGACTCAGATTGGAATTTATGCTACAAAAATGGATTGTTTGATTCTTAA
mRNA sequence
ATGAGCCCTACATTTCCATCCACCGCAGTCCACAGACCTTCTTCTCCTTCCTCTCAAATCCACGCCACTGACAAGAAGCTGAAGCAGAAGCATCAGCAGCAGAAGGGTCCTCGCAGCCCTTTCCATGTTCTCAATGGTATTTCCTTTTCCACCGCCTGCAACAGCTCCAGCATCGGCAGTGACGCTTCCTCCGCCTCCACCGAAGCCCCCAGAGGCTGTCTCAGGTTCTTTCTCTCTCATTCCTCTGCTTCTTCTAAAACCCCTGCTAATAAGCTTAAACCATCTTCCAAAACCCCCAAATCTACCTCCAATCTTCGCCCCATTAAGCCTCTCAGATCTAAGCCACTCAAGGAGAATGCTCCCAAACGGGCTGTTAAGAACTACTCTAGGGCTGCAAAACCTAGCTCCACTAAGTTGGACCCATTGAAGAAAAACTCCCCCTGTTTGTATAGATGGCCATCCGGGAAGAAACCCTGTTCATTAGCTACCCACAAATCTAAAATGTTGGCATCTGGTGGTGAGAAATTGGAGAAGCATGGGGCACATAGTGTAGTGAGGATGGTTGATGATGGTAAATGTGAACCTTCTGATCTTAATTTAGTTCCCAGTGATTTCAATTTCACTCCCATGCGTAAAATGGAAAACGGCTCTGGTTGTGATCCAACAGTTGATAAGGTTGTTGCCCTGGAGAATTCAAACACAGATCATAGCAAAACGCCTCCTGTTCAAGCCTCTGTATCACCAGAATTACAGTGTGGTTCAGCACTTATGCCTACAGCTACTCCCGTTTGCTATGGTGCTGGCTATGTCGTTTCTGGGGTCTCTGACAAGAGAAAGTGCAGACCGAGAGGGCTTCTTATTGTTGGAGAGAATAATGCGTCCATTTCTAAAGTCAAGCCTATTCAGATTTTTGAAGAAGATGAAACCATTACAAGGGACACTTCCAATTCCGTCGTTTTAAAGGTCCCTTCACCCATTGAAGCTTCTATGAATTGGCTTTTATCTCCCTGCAATGAGGAGGATGAGGATCACAAAGAGAAGTCTGAAAATGCTTCACCTCAGTGTAAGAACCTCGCTGAATCGATTGCACTTCGTTCTGTTCCTTCACCATCATCTATCGATGCTCTTCCTCCTGACATATACAGTCCTCCAGAGTTTCAAGGTTTCTTGGAGCCATTATCATTTGAGGACACCTCTCCCTCATGTGCTCCTAATTGCTTGGATGTTATTTTAAAGGAGGGAAGAGGACAACAGAGGTATCAAGTCAATAGGGAGAATTCTCCATTCTCAATTGACTCGTTAAGTAGTGATAATGTTATCCGGACACCTCAATCAGACTCAAATTCGGCTCAAAAAGTTTTCCCTCCGTGGTTAACTGCTGACAGTTATTTGAAATGTGATCAGAATTCAGCGCCTGAATTGTTTTCACCAGCAAATCTACCTCGAGACAACCCTAATGCAATAACGAGTATAACAGATTTAAGTTTCCAATTTGATTGTCTGGCCACAATATCCAATTCCATGGATCTTCATCAATTTCAAAAGATTCTTGAAGATCAGGCTTTTATGAATAGCAATTCCTCCTGTGAGGATTTGTTAAAATCCAAGATGAGAGTATCATGGAGGGAAGGGTTAATGAGCCGGATCTACGAGATGGATGAGTTCGATACTTGTCGGTGCTTGTCAGATGAAGAAGAGAATGTTGATTCTTGCAGCATTAGCTTGTCAGATATCCTTAAGACTCCTCTGGAGCATAATGATTGTGAGGCTGATCCTATAGTTTCTAACCGTTCTTGTTCTCCTGGATTATCAGTTGATGAGGAAGCCGAAGAATATGACAAATGTAAAGAAATGTGGTCTCATCAAGTATCTTGTTCTTGTGCGGAATCCATTAGCACTGATGGAGGTGGCTTGATTGCTTCAGGGGACTCAGATTGGAATTTATGCTACAAAAATGGATTGTTTGATTCTTAA
Coding sequence (CDS)
ATGAGCCCTACATTTCCATCCACCGCAGTCCACAGACCTTCTTCTCCTTCCTCTCAAATCCACGCCACTGACAAGAAGCTGAAGCAGAAGCATCAGCAGCAGAAGGGTCCTCGCAGCCCTTTCCATGTTCTCAATGGTATTTCCTTTTCCACCGCCTGCAACAGCTCCAGCATCGGCAGTGACGCTTCCTCCGCCTCCACCGAAGCCCCCAGAGGCTGTCTCAGGTTCTTTCTCTCTCATTCCTCTGCTTCTTCTAAAACCCCTGCTAATAAGCTTAAACCATCTTCCAAAACCCCCAAATCTACCTCCAATCTTCGCCCCATTAAGCCTCTCAGATCTAAGCCACTCAAGGAGAATGCTCCCAAACGGGCTGTTAAGAACTACTCTAGGGCTGCAAAACCTAGCTCCACTAAGTTGGACCCATTGAAGAAAAACTCCCCCTGTTTGTATAGATGGCCATCCGGGAAGAAACCCTGTTCATTAGCTACCCACAAATCTAAAATGTTGGCATCTGGTGGTGAGAAATTGGAGAAGCATGGGGCACATAGTGTAGTGAGGATGGTTGATGATGGTAAATGTGAACCTTCTGATCTTAATTTAGTTCCCAGTGATTTCAATTTCACTCCCATGCGTAAAATGGAAAACGGCTCTGGTTGTGATCCAACAGTTGATAAGGTTGTTGCCCTGGAGAATTCAAACACAGATCATAGCAAAACGCCTCCTGTTCAAGCCTCTGTATCACCAGAATTACAGTGTGGTTCAGCACTTATGCCTACAGCTACTCCCGTTTGCTATGGTGCTGGCTATGTCGTTTCTGGGGTCTCTGACAAGAGAAAGTGCAGACCGAGAGGGCTTCTTATTGTTGGAGAGAATAATGCGTCCATTTCTAAAGTCAAGCCTATTCAGATTTTTGAAGAAGATGAAACCATTACAAGGGACACTTCCAATTCCGTCGTTTTAAAGGTCCCTTCACCCATTGAAGCTTCTATGAATTGGCTTTTATCTCCCTGCAATGAGGAGGATGAGGATCACAAAGAGAAGTCTGAAAATGCTTCACCTCAGTGTAAGAACCTCGCTGAATCGATTGCACTTCGTTCTGTTCCTTCACCATCATCTATCGATGCTCTTCCTCCTGACATATACAGTCCTCCAGAGTTTCAAGGTTTCTTGGAGCCATTATCATTTGAGGACACCTCTCCCTCATGTGCTCCTAATTGCTTGGATGTTATTTTAAAGGAGGGAAGAGGACAACAGAGGTATCAAGTCAATAGGGAGAATTCTCCATTCTCAATTGACTCGTTAAGTAGTGATAATGTTATCCGGACACCTCAATCAGACTCAAATTCGGCTCAAAAAGTTTTCCCTCCGTGGTTAACTGCTGACAGTTATTTGAAATGTGATCAGAATTCAGCGCCTGAATTGTTTTCACCAGCAAATCTACCTCGAGACAACCCTAATGCAATAACGAGTATAACAGATTTAAGTTTCCAATTTGATTGTCTGGCCACAATATCCAATTCCATGGATCTTCATCAATTTCAAAAGATTCTTGAAGATCAGGCTTTTATGAATAGCAATTCCTCCTGTGAGGATTTGTTAAAATCCAAGATGAGAGTATCATGGAGGGAAGGGTTAATGAGCCGGATCTACGAGATGGATGAGTTCGATACTTGTCGGTGCTTGTCAGATGAAGAAGAGAATGTTGATTCTTGCAGCATTAGCTTGTCAGATATCCTTAAGACTCCTCTGGAGCATAATGATTGTGAGGCTGATCCTATAGTTTCTAACCGTTCTTGTTCTCCTGGATTATCAGTTGATGAGGAAGCCGAAGAATATGACAAATGTAAAGAAATGTGGTCTCATCAAGTATCTTGTTCTTGTGCGGAATCCATTAGCACTGATGGAGGTGGCTTGATTGCTTCAGGGGACTCAGATTGGAATTTATGCTACAAAAATGGATTGTTTGATTCTTAA
Protein sequence
MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGSDASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTPPVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKPIQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAESIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRYQVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANLPRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVSWREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCSPGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS
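The protein sequence above should correspond to a standard-code translation of the coding sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the table construction are illustrative and not part of the database. Only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above
print(translate("ATGAGCCCTACATTTCCATCCACCGCAGTC"))  # MSPTFPSTAV
# Last 12 nt of the CDS listed above (ends in the TAA stop codon)
print(translate("TTTGATTCTTAA"))  # FDS
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would confirm that the two records are consistent.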
Homology
BLAST of HG10009900 vs. NCBI nr
Match:
XP_038906910.1 (uncharacterized protein LOC120092781 [Benincasa hispida] >XP_038906911.1 uncharacterized protein LOC120092781 [Benincasa hispida] >XP_038906913.1 uncharacterized protein LOC120092781 [Benincasa hispida])
HSP 1 Score: 1146.0 bits (2963), Expect = 0.0e+00
Identity = 592/663 (89.29%), Positives = 608/663 (91.70%), Query Frame = 0
Query: 1 MSPTFPSTAVHRP--SSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSI 60
MSP FPST +H P SS SSQIHAT+ K K K QQQ+G RSPF+VLNGISFSTACNSSSI
Sbjct: 1 MSPAFPSTTIHTPSSSSSSSQIHATNNKPKLKQQQQQGSRSPFNVLNGISFSTACNSSSI 60
Query: 61 GSDAS--SASTEAPRGCLRFFLSHSSASSKT-PANKLKPSSKTPKSTSNLRPIKPLRSKP 120
SDAS S STEAPRGCLRFFLSHS+ASSKT PANK KPSSK PKSTSNLRPIKPLRSKP
Sbjct: 61 ASDASSTSTSTEAPRGCLRFFLSHSTASSKTPPANKFKPSSKIPKSTSNLRPIKPLRSKP 120
Query: 121 LKENAPKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEK 180
LKENAPKRAVK+YSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSL THKSKMLASGGE+
Sbjct: 121 LKENAPKRAVKHYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLGTHKSKMLASGGEE 180
Query: 181 LEKHGAHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENG-SGCDPTVDKVVALENSNT 240
LEK+G H VVRMVDDGKCEPSDLNLVPSDFNFTPMRKME G SG DPTVDKV ALENSN
Sbjct: 181 LEKNGTHGVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMEYGSSGLDPTVDKVAALENSNI 240
Query: 241 DHSKTPPVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNAS 300
D SKTPPVQASVSPELQCGSA+MPT TP+CYGAGYVVSGVSDKRKCRPRGLLIVG+N AS
Sbjct: 241 DQSKTPPVQASVSPELQCGSAIMPTVTPICYGAGYVVSGVSDKRKCRPRGLLIVGDNTAS 300
Query: 301 ISKVKPIQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQ 360
ISKVKPIQIFEED ITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHK+KS NASPQ
Sbjct: 301 ISKVKPIQIFEEDGNITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKQKSANASPQ 360
Query: 361 CKNLAESIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEG 420
KNLAESIAL SVPSPSSIDAL P I SPPEFQGFLEPLSFE+TS SCAPNCLDVILKEG
Sbjct: 361 SKNLAESIALHSVPSPSSIDALSPYISSPPEFQGFLEPLSFEETSSSCAPNCLDVILKEG 420
Query: 421 RGQQRYQVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPEL 480
RGQQRYQVN ENSPFSIDSLSSDNVIRTP SDS+ AQKVFPPWLTADS KCDQNSA EL
Sbjct: 421 RGQQRYQVNGENSPFSIDSLSSDNVIRTPHSDSSLAQKVFPPWLTADSCGKCDQNSASEL 480
Query: 481 FSPANLPRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLK 540
FS ANLPRD+PNAITSITDLSFQFDCLATI NSMDLHQFQKILEDQAF NSNSSCEDL K
Sbjct: 481 FSRANLPRDSPNAITSITDLSFQFDCLATIPNSMDLHQFQKILEDQAFSNSNSSCEDLFK 540
Query: 541 SKMRVSWREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIV 600
SKMRVSWREGLMSRIYEMDEFDTCRCLSDEEENVDSC LSDILKTPLEHNDCEADPIV
Sbjct: 541 SKMRVSWREGLMSRIYEMDEFDTCRCLSDEEENVDSCRNCLSDILKTPLEHNDCEADPIV 600
Query: 601 SNRSCSPGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGL 658
SN CSPGL VDEEA+EYDKCKEMWSHQV CSCAESISTDGGGLIASGDSDWNLCYKNGL
Sbjct: 601 SNCFCSPGLLVDEEADEYDKCKEMWSHQVPCSCAESISTDGGGLIASGDSDWNLCYKNGL 660
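The Identity value reported for an HSP counts the alignment columns in which the query and subject residues are the same, divided by the total number of aligned columns. The sketch below illustrates that count on a single row pair from the alignment above (query residues 121 to 180); the helper name `count_identities` is hypothetical, and treating a gap character as a non-match is an assumption about how gaps are scored.

```python
def count_identities(query_row: str, sbjct_row: str) -> tuple:
    """Return (identical columns, total columns) for one aligned row pair."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1
        for q, s in zip(query_row, sbjct_row)
        if q == s and q != "-"  # assumption: a gap never counts as an identity
    )
    return identical, len(query_row)


# Query and Sbjct rows for residues 121-180 of the first hit
query = "LKENAPKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEK"
sbjct = "LKENAPKRAVKHYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLGTHKSKMLASGGEE"

identical, columns = count_identities(query, sbjct)
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
# 57/60 identical (95.00%)
```

Summing the per-row counts over every row of an HSP should reproduce the numerator and denominator shown on its Identity line.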
BLAST of HG10009900 vs. NCBI nr
Match:
XP_011649077.1 (uncharacterized protein LOC105434579 [Cucumis sativus] >KGN61440.1 hypothetical protein Csa_005987 [Cucumis sativus])
HSP 1 Score: 1114.0 bits (2880), Expect = 0.0e+00
Identity = 569/657 (86.61%), Positives = 603/657 (91.78%), Query Frame = 0
Query: 1 MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGS 60
MSPTFPST +H+ SS S + +KK KQ+ QQ++GPRSPFHVLN ISF TACN+SSIGS
Sbjct: 1 MSPTFPSTTIHKTSSSSPS--SINKKQKQQQQQEQGPRSPFHVLNAISFPTACNTSSIGS 60
Query: 61 DASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENA 120
DASS STEAPRGCLRFFL HSSASSKTPANKLKPSSKTPKS SN+RPIKPLRSKPLKENA
Sbjct: 61 DASSTSTEAPRGCLRFFLPHSSASSKTPANKLKPSSKTPKSISNVRPIKPLRSKPLKENA 120
Query: 121 PKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHG 180
PK VK +SRAA+P+STKLDPLKKNSPCLYRWPSGKKP SL THKSKMLAS GE+ KHG
Sbjct: 121 PKPPVKLHSRAARPTSTKLDPLKKNSPCLYRWPSGKKPSSLCTHKSKMLASAGEESGKHG 180
Query: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTP 240
AHSVVRMVDDGKCEPSDLNLVP+DFNFTPMRKMENGSG DPTVD VVALENSNTDHSKTP
Sbjct: 181 AHSVVRMVDDGKCEPSDLNLVPNDFNFTPMRKMENGSGFDPTVDNVVALENSNTDHSKTP 240
Query: 241 PVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKP 300
PVQAS+SPELQCGSA+MP TPVCYGAGYVVSG+SDKRKCRPRGLLIVG+N ASISKVKP
Sbjct: 241 PVQASISPELQCGSAIMPAVTPVCYGAGYVVSGISDKRKCRPRGLLIVGDNIASISKVKP 300
Query: 301 IQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 360
IQIFEED +IT+DTSNSVV KVPSPIEASMNWLLSPCNEEDEDHKE S+NAS Q KNLAE
Sbjct: 301 IQIFEEDRSITKDTSNSVVFKVPSPIEASMNWLLSPCNEEDEDHKE-SKNASTQSKNLAE 360
Query: 361 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 420
S+ALRSVPSPSSIDALPPD+YSPPEFQGF+EPLSFEDTSPSCA N L+VIL EGRGQQRY
Sbjct: 361 SVALRSVPSPSSIDALPPDVYSPPEFQGFMEPLSFEDTSPSCARNSLNVILNEGRGQQRY 420
Query: 421 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 480
QVN ENSPFSIDSLSSDNVI+TPQSDSNSAQKVFPPWL+ADS K DQNSA ELF NL
Sbjct: 421 QVNGENSPFSIDSLSSDNVIQTPQSDSNSAQKVFPPWLSADSCEKNDQNSASELF--LNL 480
Query: 481 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 540
PRD+ NAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAF N+NSSCEDLL+SKMRVS
Sbjct: 481 PRDSSNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFRNNNSSCEDLLESKMRVS 540
Query: 541 WREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCS 600
WREGLMSR+YEMDEFDTCRCLSDEEENVDSCSISLSDI+KTPLEH DCE DPIVSN SCS
Sbjct: 541 WREGLMSRLYEMDEFDTCRCLSDEEENVDSCSISLSDIIKTPLEHTDCEVDPIVSNSSCS 600
Query: 601 PGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS 658
PGL V+EEAEEY K KEM SHQV CSCAESISTDGGGLIASGDSDWNLCY+NGLFDS
Sbjct: 601 PGLLVNEEAEEYGKFKEMQSHQVPCSCAESISTDGGGLIASGDSDWNLCYRNGLFDS 652
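The per-HSP summary lines in these reports follow a regular layout, so the counts can be extracted and the percentages rechecked programmatically. The following is a minimal sketch using only Python's standard `re` module; the pattern, the function name `parse_hsp_stats`, and the returned dictionary keys are illustrative assumptions, not an official BLAST parser.

```python
import re

# Matches lines such as:
#   Identity = 569/657 (86.61%), Positives = 603/657 (91.78%), Query Frame = 0
# "Posi?tives" also accepts the spelling variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recompute the percentages from the raw counts as a consistency check
        "computed_identity_pct": round(100 * int(ident) / int(length), 2),
        "computed_positive_pct": round(100 * int(pos) / int(pos_length), 2),
    }


stats = parse_hsp_stats(
    "Identity = 569/657 (86.61%), Positives = 603/657 (91.78%), Query Frame = 0"
)
print(stats["computed_identity_pct"], stats["computed_positive_pct"])
# 86.61 91.78
```

A mismatch between the reported and computed percentages would suggest that a summary line was damaged during extraction.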
BLAST of HG10009900 vs. NCBI nr
Match:
XP_016902071.1 (PREDICTED: uncharacterized protein LOC103496888 [Cucumis melo] >KAA0039544.1 uncharacterized protein E6C27_scaffold744G00030 [Cucumis melo var. makuwa] >TYK01757.1 uncharacterized protein E5676_scaffold775G00970 [Cucumis melo var. makuwa])
HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 559/657 (85.08%), Positives = 593/657 (90.26%), Query Frame = 0
Query: 1 MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGS 60
MSPTFPST +H+ SS S DKK +Q+ QQ++G SPFHVLN ISF TACN+SSIGS
Sbjct: 1 MSPTFPSTTIHKTSSSSPSF--IDKKQQQRQQQEQGQCSPFHVLNAISFPTACNTSSIGS 60
Query: 61 DASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENA 120
DASS STEAPRGCLRFFL HSSASSKTPANKLKPSSKTPKS SN+R IKPLRSKPLKE A
Sbjct: 61 DASSTSTEAPRGCLRFFLPHSSASSKTPANKLKPSSKTPKSISNVRAIKPLRSKPLKEKA 120
Query: 121 PKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHG 180
PK AVK +SRAA+P+STKLDPLKKNSPCLYRWPSGKKP SL THKSKMLAS GE+L HG
Sbjct: 121 PKPAVKLHSRAARPTSTKLDPLKKNSPCLYRWPSGKKPSSLCTHKSKMLASSGEELGNHG 180
Query: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTP 240
AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSG DPTVD VVALENSNTDHSKTP
Sbjct: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGFDPTVDIVVALENSNTDHSKTP 240
Query: 241 PVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKP 300
PVQAS+SPELQCGSA+MP TPVCYGAGYVVSG+SDKRKCRPRGLLIVG+N ASISKVKP
Sbjct: 241 PVQASISPELQCGSAIMPAVTPVCYGAGYVVSGISDKRKCRPRGLLIVGDNIASISKVKP 300
Query: 301 IQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 360
IQIFEED +IT+DTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE S+NAS K+LAE
Sbjct: 301 IQIFEEDGSITKDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE-SKNASTPPKHLAE 360
Query: 361 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 420
SIALRSVPSPSSI+ALPPD+YSPPEFQGFLEPLS EDTS SCA N L+VIL E RGQQRY
Sbjct: 361 SIALRSVPSPSSINALPPDVYSPPEFQGFLEPLSCEDTSTSCARNSLNVILNEARGQQRY 420
Query: 421 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 480
QVN ENSPFS+DSLSSDNVI+TPQSDSNSA+K FPPWL+ADSY K +QNSA ELFS NL
Sbjct: 421 QVNGENSPFSVDSLSSDNVIQTPQSDSNSAKKDFPPWLSADSYEKHNQNSASELFS--NL 480
Query: 481 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 540
PRD+ N ITSITDLSFQFDCLATISNSMDLHQFQKILEDQAF N+NSSCEDLL+SKMRVS
Sbjct: 481 PRDSSNTITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFRNNNSSCEDLLESKMRVS 540
Query: 541 WREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCS 600
WREGLMSR+YEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLE D E DPIVS CS
Sbjct: 541 WREGLMSRLYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLELTDFEVDPIVSTSFCS 600
Query: 601 PGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS 658
PGL V+EEAEEY KCKEM SHQV CSCAESISTDGGGLIASGDSDWNLCY+NGLFDS
Sbjct: 601 PGLLVNEEAEEYGKCKEMRSHQVPCSCAESISTDGGGLIASGDSDWNLCYRNGLFDS 652
BLAST of HG10009900 vs. NCBI nr
Match:
XP_023550090.1 (uncharacterized protein LOC111808388 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 904.8 bits (2337), Expect = 4.3e-259
Identity = 486/650 (74.77%), Positives = 538/650 (82.77%), Query Frame = 0
Query: 14 SSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGSDASSASTEAPRGC 73
SSPSSQIHA +KKLKQ+ QQ +GPR PF VLNGISFS ACN++S+GSDASS ST+APRGC
Sbjct: 9 SSPSSQIHAANKKLKQQQQQHQGPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGC 68
Query: 74 LRFFLSHSSASSK--TPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKRAVKNYSRA 133
LRFFLSHSS+SSK TPANKLK SSK PKSTS++RP+KPLRSKPLKENAPKRA+ + SRA
Sbjct: 69 LRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRA 128
Query: 134 -AKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAHSVVRMVDD 193
A+PSSTKL+PLKKNSP LYRWPSGKKPCSL THK K+LAS GE+LE+HGAHSVVRMVDD
Sbjct: 129 PAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDD 188
Query: 194 -GKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTPPVQASVSPE 253
KCE PSDFNFTP+R++ENGSG DPT DKVVALE SN DH+KTPPVQASVSPE
Sbjct: 189 PNKCE-------PSDFNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPE 248
Query: 254 LQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKPIQIF-EEDE 313
LQCGSALMPT TPVCYGAGYVVSG+SDKRKCRP+G+LIVG+N SIS VKPIQ F EED
Sbjct: 249 LQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPKGVLIVGDNTPSISNVKPIQNFEEEDG 308
Query: 314 TITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAESIALRSVP 373
+ R+TSNSVVLKVPSPIEASMNWLLSPCNEEDEDH++KSENAS
Sbjct: 309 SNKRETSNSVVLKVPSPIEASMNWLLSPCNEEDEDHQDKSENAS---------------- 368
Query: 374 SPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRYQVNRENSP 433
F+GFLEPLSFED SPSCAPNCLDVIL EGRGQ RY+VN ENSP
Sbjct: 369 ---------------SHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSP 428
Query: 434 FSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANLPRDNPNAI 493
FSIDSLSSDNVIRTPQSDS+SA K F PWLTADS K DQ+SA D+ A+
Sbjct: 429 FSIDSLSSDNVIRTPQSDSSSAPKHFLPWLTADSCDKHDQDSA----------SDSHKAM 488
Query: 494 TSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVSWREGLMSR 553
TSITDLSFQFDCLATISNSMDL+QFQK+LEDQAF NSNSSCE+L KS+MRVSWREGLMSR
Sbjct: 489 TSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSKSQMRVSWREGLMSR 548
Query: 554 IYEMDEFDTCRCLS-DEEENVDSCSISLSDIL-KTPLEHNDCEADPIVSNRSCSPGLSVD 613
IYEMDEFD+CRCLS DEEEN D+CSISLSDIL KTPL+HNDCEADPI+ N SCS L V+
Sbjct: 549 IYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVN 605
Query: 614 EEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFD 657
EEAEEY+K S++V+C CAESISTDGGGL+ASGDSDW+LCY+NGLFD
Sbjct: 609 EEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYRNGLFD 605
BLAST of HG10009900 vs. NCBI nr
Match:
XP_022928860.1 (uncharacterized protein LOC111435650 [Cucurbita moschata] >XP_022928861.1 uncharacterized protein LOC111435650 [Cucurbita moschata] >XP_022928862.1 uncharacterized protein LOC111435650 [Cucurbita moschata])
HSP 1 Score: 901.7 bits (2329), Expect = 3.6e-258
Identity = 488/658 (74.16%), Positives = 536/658 (81.46%), Query Frame = 0
Query: 14 SSPSSQIHATDKKLKQKHQQQ--------KGPRSPFHVLNGISFSTACNSSSIGSDASSA 73
SSPSSQIHA +KKLK +HQQQ +GPR PF VLNGISFS ACN++S+GSDASS
Sbjct: 10 SSPSSQIHAANKKLKHQHQQQQQQQQQQHQGPRGPFRVLNGISFSPACNTASVGSDASST 69
Query: 74 STEAPRGCLRFFLSHSSASSK--TPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKR 133
ST+APRGCLRFFLSHSS+SSK TPANKLK SSK PKSTS++RP+KPLRSKPLKENAPKR
Sbjct: 70 STDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLKPLRSKPLKENAPKR 129
Query: 134 AVKNYSRA-AKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAH 193
A+ + SRA AKP STKL+PLKKNSP LYRWPSGKKPCSL HK K+LAS GE+LE+HGAH
Sbjct: 130 ALTHNSRAPAKPFSTKLEPLKKNSPSLYRWPSGKKPCSLGAHKPKILASDGEELERHGAH 189
Query: 194 SVVRMVDD-GKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTPP 253
VVRMVDD KCE PSDF+FTPMR+++NGSG DPT KVVALE SN DH+KTPP
Sbjct: 190 GVVRMVDDANKCE-------PSDFSFTPMREIQNGSGLDPTAGKVVALEASNKDHTKTPP 249
Query: 254 VQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKPI 313
VQASVSPELQCGSALMPT TPVCYGAGYVVSG+SDKRKCRPRG+LIVGEN SIS VKPI
Sbjct: 250 VQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGENTPSISNVKPI 309
Query: 314 QIFEE-DETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 373
Q FEE D + R+TSNSVVLKVPSPIEASMNWLLSPCNEEDEDH++KSENAS
Sbjct: 310 QNFEEGDGSNERETSNSVVLKVPSPIEASMNWLLSPCNEEDEDHQDKSENAS-------- 369
Query: 374 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 433
F+GFLEPLSFED SPSCAPNCLDVIL EGRGQ RY
Sbjct: 370 -----------------------SHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRY 429
Query: 434 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 493
+VN ENSPFSIDSLSSDNVIRTPQSDSNSA K FPPWLTADS K DQNSA
Sbjct: 430 EVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFPPWLTADSCDKHDQNSA--------- 489
Query: 494 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 553
D+ A+TSITDLSFQFDCLATISNSMDL+QFQK+LEDQAF NSNSSCE+L KS+MRVS
Sbjct: 490 -SDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSKSQMRVS 549
Query: 554 WREGLMSRIYEMDEFDTCRCLS-DEEENVDSCSISLSDIL-KTPLEHNDCEADPIVSNRS 613
WREGLMSRIYEMDEFD+CRCLS DEEEN D+CSI+LSDIL KTPL+HNDCEADPI+ N S
Sbjct: 550 WREGLMSRIYEMDEFDSCRCLSDDEEENADTCSINLSDILKKTPLKHNDCEADPIICNSS 609
Query: 614 CSPGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFD 657
CS L V+EEAEEY+K S++V+C CAESISTDGGGL+ASGDSDW+LCYKNGLFD
Sbjct: 610 CSSRLLVNEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD 614
BLAST of HG10009900 vs. ExPASy TrEMBL
Match:
A0A0A0LHN8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G122530 PE=4 SV=1)
HSP 1 Score: 1114.0 bits (2880), Expect = 0.0e+00
Identity = 569/657 (86.61%), Positives = 603/657 (91.78%), Query Frame = 0
Query: 1 MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGS 60
MSPTFPST +H+ SS S + +KK KQ+ QQ++GPRSPFHVLN ISF TACN+SSIGS
Sbjct: 1 MSPTFPSTTIHKTSSSSPS--SINKKQKQQQQQEQGPRSPFHVLNAISFPTACNTSSIGS 60
Query: 61 DASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENA 120
DASS STEAPRGCLRFFL HSSASSKTPANKLKPSSKTPKS SN+RPIKPLRSKPLKENA
Sbjct: 61 DASSTSTEAPRGCLRFFLPHSSASSKTPANKLKPSSKTPKSISNVRPIKPLRSKPLKENA 120
Query: 121 PKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHG 180
PK VK +SRAA+P+STKLDPLKKNSPCLYRWPSGKKP SL THKSKMLAS GE+ KHG
Sbjct: 121 PKPPVKLHSRAARPTSTKLDPLKKNSPCLYRWPSGKKPSSLCTHKSKMLASAGEESGKHG 180
Query: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTP 240
AHSVVRMVDDGKCEPSDLNLVP+DFNFTPMRKMENGSG DPTVD VVALENSNTDHSKTP
Sbjct: 181 AHSVVRMVDDGKCEPSDLNLVPNDFNFTPMRKMENGSGFDPTVDNVVALENSNTDHSKTP 240
Query: 241 PVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKP 300
PVQAS+SPELQCGSA+MP TPVCYGAGYVVSG+SDKRKCRPRGLLIVG+N ASISKVKP
Sbjct: 241 PVQASISPELQCGSAIMPAVTPVCYGAGYVVSGISDKRKCRPRGLLIVGDNIASISKVKP 300
Query: 301 IQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 360
IQIFEED +IT+DTSNSVV KVPSPIEASMNWLLSPCNEEDEDHKE S+NAS Q KNLAE
Sbjct: 301 IQIFEEDRSITKDTSNSVVFKVPSPIEASMNWLLSPCNEEDEDHKE-SKNASTQSKNLAE 360
Query: 361 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 420
S+ALRSVPSPSSIDALPPD+YSPPEFQGF+EPLSFEDTSPSCA N L+VIL EGRGQQRY
Sbjct: 361 SVALRSVPSPSSIDALPPDVYSPPEFQGFMEPLSFEDTSPSCARNSLNVILNEGRGQQRY 420
Query: 421 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 480
QVN ENSPFSIDSLSSDNVI+TPQSDSNSAQKVFPPWL+ADS K DQNSA ELF NL
Sbjct: 421 QVNGENSPFSIDSLSSDNVIQTPQSDSNSAQKVFPPWLSADSCEKNDQNSASELF--LNL 480
Query: 481 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 540
PRD+ NAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAF N+NSSCEDLL+SKMRVS
Sbjct: 481 PRDSSNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFRNNNSSCEDLLESKMRVS 540
Query: 541 WREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCS 600
WREGLMSR+YEMDEFDTCRCLSDEEENVDSCSISLSDI+KTPLEH DCE DPIVSN SCS
Sbjct: 541 WREGLMSRLYEMDEFDTCRCLSDEEENVDSCSISLSDIIKTPLEHTDCEVDPIVSNSSCS 600
Query: 601 PGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS 658
PGL V+EEAEEY K KEM SHQV CSCAESISTDGGGLIASGDSDWNLCY+NGLFDS
Sbjct: 601 PGLLVNEEAEEYGKFKEMQSHQVPCSCAESISTDGGGLIASGDSDWNLCYRNGLFDS 652
BLAST of HG10009900 vs. ExPASy TrEMBL
Match:
A0A5D3BQU9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold775G00970 PE=4 SV=1)
HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 559/657 (85.08%), Positives = 593/657 (90.26%), Query Frame = 0
Query: 1 MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGS 60
MSPTFPST +H+ SS S DKK +Q+ QQ++G SPFHVLN ISF TACN+SSIGS
Sbjct: 1 MSPTFPSTTIHKTSSSSPSF--IDKKQQQRQQQEQGQCSPFHVLNAISFPTACNTSSIGS 60
Query: 61 DASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENA 120
DASS STEAPRGCLRFFL HSSASSKTPANKLKPSSKTPKS SN+R IKPLRSKPLKE A
Sbjct: 61 DASSTSTEAPRGCLRFFLPHSSASSKTPANKLKPSSKTPKSISNVRAIKPLRSKPLKEKA 120
Query: 121 PKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHG 180
PK AVK +SRAA+P+STKLDPLKKNSPCLYRWPSGKKP SL THKSKMLAS GE+L HG
Sbjct: 121 PKPAVKLHSRAARPTSTKLDPLKKNSPCLYRWPSGKKPSSLCTHKSKMLASSGEELGNHG 180
Query: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTP 240
AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSG DPTVD VVALENSNTDHSKTP
Sbjct: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGFDPTVDIVVALENSNTDHSKTP 240
Query: 241 PVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKP 300
PVQAS+SPELQCGSA+MP TPVCYGAGYVVSG+SDKRKCRPRGLLIVG+N ASISKVKP
Sbjct: 241 PVQASISPELQCGSAIMPAVTPVCYGAGYVVSGISDKRKCRPRGLLIVGDNIASISKVKP 300
Query: 301 IQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 360
IQIFEED +IT+DTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE S+NAS K+LAE
Sbjct: 301 IQIFEEDGSITKDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE-SKNASTPPKHLAE 360
Query: 361 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 420
SIALRSVPSPSSI+ALPPD+YSPPEFQGFLEPLS EDTS SCA N L+VIL E RGQQRY
Sbjct: 361 SIALRSVPSPSSINALPPDVYSPPEFQGFLEPLSCEDTSTSCARNSLNVILNEARGQQRY 420
Query: 421 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 480
QVN ENSPFS+DSLSSDNVI+TPQSDSNSA+K FPPWL+ADSY K +QNSA ELFS NL
Sbjct: 421 QVNGENSPFSVDSLSSDNVIQTPQSDSNSAKKDFPPWLSADSYEKHNQNSASELFS--NL 480
Query: 481 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 540
PRD+ N ITSITDLSFQFDCLATISNSMDLHQFQKILEDQAF N+NSSCEDLL+SKMRVS
Sbjct: 481 PRDSSNTITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFRNNNSSCEDLLESKMRVS 540
Query: 541 WREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCS 600
WREGLMSR+YEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLE D E DPIVS CS
Sbjct: 541 WREGLMSRLYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLELTDFEVDPIVSTSFCS 600
Query: 601 PGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS 658
PGL V+EEAEEY KCKEM SHQV CSCAESISTDGGGLIASGDSDWNLCY+NGLFDS
Sbjct: 601 PGLLVNEEAEEYGKCKEMRSHQVPCSCAESISTDGGGLIASGDSDWNLCYRNGLFDS 652
BLAST of HG10009900 vs. ExPASy TrEMBL
Match:
A0A1S4E1G9 (uncharacterized protein LOC103496888 OS=Cucumis melo OX=3656 GN=LOC103496888 PE=4 SV=1)
HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 559/657 (85.08%), Positives = 593/657 (90.26%), Query Frame = 0
Query: 1 MSPTFPSTAVHRPSSPSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGS 60
MSPTFPST +H+ SS S DKK +Q+ QQ++G SPFHVLN ISF TACN+SSIGS
Sbjct: 1 MSPTFPSTTIHKTSSSSPSF--IDKKQQQRQQQEQGQCSPFHVLNAISFPTACNTSSIGS 60
Query: 61 DASSASTEAPRGCLRFFLSHSSASSKTPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENA 120
DASS STEAPRGCLRFFL HSSASSKTPANKLKPSSKTPKS SN+R IKPLRSKPLKE A
Sbjct: 61 DASSTSTEAPRGCLRFFLPHSSASSKTPANKLKPSSKTPKSISNVRAIKPLRSKPLKEKA 120
Query: 121 PKRAVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHG 180
PK AVK +SRAA+P+STKLDPLKKNSPCLYRWPSGKKP SL THKSKMLAS GE+L HG
Sbjct: 121 PKPAVKLHSRAARPTSTKLDPLKKNSPCLYRWPSGKKPSSLCTHKSKMLASSGEELGNHG 180
Query: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTP 240
AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSG DPTVD VVALENSNTDHSKTP
Sbjct: 181 AHSVVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGFDPTVDIVVALENSNTDHSKTP 240
Query: 241 PVQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKP 300
PVQAS+SPELQCGSA+MP TPVCYGAGYVVSG+SDKRKCRPRGLLIVG+N ASISKVKP
Sbjct: 241 PVQASISPELQCGSAIMPAVTPVCYGAGYVVSGISDKRKCRPRGLLIVGDNIASISKVKP 300
Query: 301 IQIFEEDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 360
IQIFEED +IT+DTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE S+NAS K+LAE
Sbjct: 301 IQIFEEDGSITKDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKE-SKNASTPPKHLAE 360
Query: 361 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 420
SIALRSVPSPSSI+ALPPD+YSPPEFQGFLEPLS EDTS SCA N L+VIL E RGQQRY
Sbjct: 361 SIALRSVPSPSSINALPPDVYSPPEFQGFLEPLSCEDTSTSCARNSLNVILNEARGQQRY 420
Query: 421 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 480
QVN ENSPFS+DSLSSDNVI+TPQSDSNSA+K FPPWL+ADSY K +QNSA ELFS NL
Sbjct: 421 QVNGENSPFSVDSLSSDNVIQTPQSDSNSAKKDFPPWLSADSYEKHNQNSASELFS--NL 480
Query: 481 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 540
PRD+ N ITSITDLSFQFDCLATISNSMDLHQFQKILEDQAF N+NSSCEDLL+SKMRVS
Sbjct: 481 PRDSSNTITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFRNNNSSCEDLLESKMRVS 540
Query: 541 WREGLMSRIYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCS 600
WREGLMSR+YEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLE D E DPIVS CS
Sbjct: 541 WREGLMSRLYEMDEFDTCRCLSDEEENVDSCSISLSDILKTPLELTDFEVDPIVSTSFCS 600
Query: 601 PGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFDS 658
PGL V+EEAEEY KCKEM SHQV CSCAESISTDGGGLIASGDSDWNLCY+NGLFDS
Sbjct: 601 PGLLVNEEAEEYGKCKEMRSHQVPCSCAESISTDGGGLIASGDSDWNLCYRNGLFDS 652
BLAST of HG10009900 vs. ExPASy TrEMBL
Match:
A0A6J1ELH1 (uncharacterized protein LOC111435650 OS=Cucurbita moschata OX=3662 GN=LOC111435650 PE=4 SV=1)
HSP 1 Score: 901.7 bits (2329), Expect = 1.8e-258
Identity = 488/658 (74.16%), Positives = 536/658 (81.46%), Query Frame = 0
Query: 14 SSPSSQIHATDKKLKQKHQQQ--------KGPRSPFHVLNGISFSTACNSSSIGSDASSA 73
SSPSSQIHA +KKLK +HQQQ +GPR PF VLNGISFS ACN++S+GSDASS
Sbjct: 10 SSPSSQIHAANKKLKHQHQQQQQQQQQQHQGPRGPFRVLNGISFSPACNTASVGSDASST 69
Query: 74 STEAPRGCLRFFLSHSSASSK--TPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKR 133
ST+APRGCLRFFLSHSS+SSK TPANKLK SSK PKSTS++RP+KPLRSKPLKENAPKR
Sbjct: 70 STDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLKPLRSKPLKENAPKR 129
Query: 134 AVKNYSRA-AKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAH 193
A+ + SRA AKP STKL+PLKKNSP LYRWPSGKKPCSL HK K+LAS GE+LE+HGAH
Sbjct: 130 ALTHNSRAPAKPFSTKLEPLKKNSPSLYRWPSGKKPCSLGAHKPKILASDGEELERHGAH 189
Query: 194 SVVRMVDD-GKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTPP 253
VVRMVDD KCE PSDF+FTPMR+++NGSG DPT KVVALE SN DH+KTPP
Sbjct: 190 GVVRMVDDANKCE-------PSDFSFTPMREIQNGSGLDPTAGKVVALEASNKDHTKTPP 249
Query: 254 VQASVSPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKPI 313
VQASVSPELQCGSALMPT TPVCYGAGYVVSG+SDKRKCRPRG+LIVGEN SIS VKPI
Sbjct: 250 VQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGENTPSISNVKPI 309
Query: 314 QIFEE-DETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAE 373
Q FEE D + R+TSNSVVLKVPSPIEASMNWLLSPCNEEDEDH++KSENAS
Sbjct: 310 QNFEEGDGSNERETSNSVVLKVPSPIEASMNWLLSPCNEEDEDHQDKSENAS-------- 369
Query: 374 SIALRSVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRY 433
F+GFLEPLSFED SPSCAPNCLDVIL EGRGQ RY
Sbjct: 370 -----------------------SHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRY 429
Query: 434 QVNRENSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANL 493
+VN ENSPFSIDSLSSDNVIRTPQSDSNSA K FPPWLTADS K DQNSA
Sbjct: 430 EVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFPPWLTADSCDKHDQNSA--------- 489
Query: 494 PRDNPNAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVS 553
D+ A+TSITDLSFQFDCLATISNSMDL+QFQK+LEDQAF NSNSSCE+L KS+MRVS
Sbjct: 490 -SDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSKSQMRVS 549
Query: 554 WREGLMSRIYEMDEFDTCRCLS-DEEENVDSCSISLSDIL-KTPLEHNDCEADPIVSNRS 613
WREGLMSRIYEMDEFD+CRCLS DEEEN D+CSI+LSDIL KTPL+HNDCEADPI+ N S
Sbjct: 550 WREGLMSRIYEMDEFDSCRCLSDDEEENADTCSINLSDILKKTPLKHNDCEADPIICNSS 609
Query: 614 CSPGLSVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFD 657
CS L V+EEAEEY+K S++V+C CAESISTDGGGL+ASGDSDW+LCYKNGLFD
Sbjct: 610 CSSRLLVNEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD 614
BLAST of HG10009900 vs. ExPASy TrEMBL
Match:
A0A6J1I200 (uncharacterized protein LOC111468888 OS=Cucurbita maxima OX=3661 GN=LOC111468888 PE=4 SV=1)
HSP 1 Score: 899.4 bits (2323), Expect = 8.7e-258
Identity = 488/653 (74.73%), Positives = 537/653 (82.24%), Query Frame = 0
Query: 14 SSPSSQIHATDKKLKQKHQQQ---KGPRSPFHVLNGISFSTACNSSSIGSDASSASTEAP 73
SSPSSQIHA +KKLK + QQQ +GPRSPF VLNGISFS ACN++S+GSDASS ST+AP
Sbjct: 10 SSPSSQIHADNKKLKHQQQQQHHHQGPRSPFRVLNGISFSPACNTASVGSDASSTSTDAP 69
Query: 74 RGCLRFFLSHSSASSK--TPANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKRAVKNY 133
RGCLRFFLSHSS+SSK TPANKLK SSK PKSTS++RP+KPLRSKPLKENAPKRA+ +
Sbjct: 70 RGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHN 129
Query: 134 SRA-AKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAHSVVRM 193
SRA AKPSSTKL+PLKKNSP LYRWPSGKKPCSL THK K+LAS G++LE+ GAHSVVRM
Sbjct: 130 SRAPAKPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGKELERRGAHSVVRM 189
Query: 194 VDD-GKCEPSDLNLVPSDFNFTPMRKMENGSGCDPTVDKVVALENSNTDHSKTPPVQASV 253
VDD KC+ PSDFNFTPMR++ENGSG DPT DKVVALE SN DH+KTPPVQASV
Sbjct: 190 VDDANKCK-------PSDFNFTPMREIENGSGLDPTADKVVALEASNKDHTKTPPVQASV 249
Query: 254 SPELQCGSALMPTATPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASISKVKPIQIF-E 313
SPELQCGSAL+PT TPVCYGAGYVVSG+SDKRKCRPRG+LIVG+N SIS VKPIQ F E
Sbjct: 250 SPELQCGSALVPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEE 309
Query: 314 EDETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASPQCKNLAESIALR 373
ED + R+TSNSVVLKVPSPIEA MNWLLSPCNEEDED ++KSENAS
Sbjct: 310 EDGSNKRETSNSVVLKVPSPIEAYMNWLLSPCNEEDEDLQDKSENAS------------- 369
Query: 374 SVPSPSSIDALPPDIYSPPEFQGFLEPLSFEDTSPSCAPNCLDVILKEGRGQQRYQVNRE 433
F+GFLEPLSFED SPSCAPNCLDVIL EGRGQ RY+VN E
Sbjct: 370 ------------------SHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGE 429
Query: 434 NSPFSIDSLSSDNVIRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPELFSPANLPRDNP 493
NSPFSIDSLSSDNVIRTPQSDSNSA K FPPWLTADS K DQNSA D+
Sbjct: 430 NSPFSIDSLSSDNVIRTPQSDSNSAPKHFPPWLTADSCHKHDQNSA----------SDSH 489
Query: 494 NAITSITDLSFQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVSWREGL 553
A+TSITDLSFQFDCLATISNSMDL+QFQK+LEDQAF NSNSSCE+L KS+MRVSWREGL
Sbjct: 490 KAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSKSQMRVSWREGL 549
Query: 554 MSRIYEMDEFDTCRCLSD-EEENVDSCSISLSDIL-KTPLEHNDCEADPIVSNRSCSPGL 613
MSRIYEMDEFD+CRCLSD EEEN D+CSISLSDIL KTPL+HNDCEADPI+ N SCS L
Sbjct: 550 MSRIYEMDEFDSCRCLSDNEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRL 609
Query: 614 SVDEEAEEYDKCKEMWSHQVSCSCAESISTDGGGLIASGDSDWNLCYKNGLFD 657
V+EEAEEY+K S++V+C CAESISTDGGGL+ASGDSDW+LCYKNGLFD
Sbjct: 610 LVNEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD 609
BLAST of HG10009900 vs. TAIR 10
Match:
AT2G43990.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; Has 1419 Blast hits to 494 proteins in 144 species: Archae - 0; Bacteria - 300; Metazoa - 246; Fungi - 102; Plants - 31; Viruses - 2; Other Eukaryotes - 738 (source: NCBI BLink). )
HSP 1 Score: 198.0 bits (502), Expect = 2.4e-50
Identity = 215/697 (30.85%), Positives = 313/697 (44.91%), Query Frame = 0
Query: 16 PSSQIHATDKKLKQKHQQQKGPRSPFHVLNGISFSTACNSSSIGSDAS---SASTEAPRG 75
P SQ +K ++K ++++ P +P FST + SS S+ S S S EA G
Sbjct: 16 PQSQPQPPEKN-EKKLKRERFPVNPLR-----DFSTRSSGSSSCSNVSASGSTSGEASNG 75
Query: 76 CLRFFLSHSSASSKT---------PANKLKPSSKTPKSTSNLRPIKPLRSKPLKENAPKR 135
C RF LSHS +SS + P ++ +K PKS P+ SKPL P
Sbjct: 76 CHRFLLSHSFSSSSSSSSSSLGVFPRRPVRSVAKNPKSA-------PVVSKPLIRKKPSS 135
Query: 136 AVKNYSRAAKPSSTKLDPLKKNSPCLYRWPSGKKPCSLATHKSKMLASGGEKLEKHGAHS 195
+ K + T+ L K+ C SGK+P T K + ++ L+K + S
Sbjct: 136 LEE---VKLKSTLTEKPNLLKSQRCKTNPVSGKRPTCKITMKPEKVS----VLKKQSSVS 195
Query: 196 VVRMVDDGKCEPSDLNLVPSDFNFTPMRKMENGSGC-----DPTVDKVVALENSNTDHSK 255
+ D + + + + S TP+ K+E GS D NS++ +
Sbjct: 196 RNVKLKDSQ---TTIRVDDSIAQSTPVSKLETGSDLIYRRKSEATDDGRLSSNSSSYQDR 255
Query: 256 TPPVQASVSPELQCGSALMPTA---TPVCYGAGYVVSGVSDKRKCRPRGLLIVGENNASI 315
TPPVQASVSPE+QCGS++ +A + CY AG+++SGVSDKRKC+P+G+L VGEN +
Sbjct: 256 TPPVQASVSPEIQCGSSMNLSASAQSQACYAAGHLLSGVSDKRKCKPKGILTVGENGFEV 315
Query: 316 SKVKPIQIFEE--DETITRDTSNSVVLKVPSPIEASMNWLLSPCNEEDEDHKEKSENASP 375
K K + +E + D S + +P P +AS++WLLSPC+EE E KE S++
Sbjct: 316 GKGKILNDSDEFDEGDFGNDGSYDDISVMPLPADASVHWLLSPCDEEKEHEKEISDDGFS 375
Query: 376 QCKNLAESIALRSVPSP--------------SSIDALPPDIY-----------SPPE--- 435
Q + + E + PSP S P DIY SP E
Sbjct: 376 QFQQIVECVG-HETPSPLSDRSASSDLCNISSGRSLSPMDIYKETTRRISSSLSPNELFR 435
Query: 436 FQGFLEPLSFE------DTSPSCAPNCLDVILKEGRGQQRYQVNRENSPFSIDSLSSDNV 495
F+ F+ S + DTSP+C LD + ++SP S+D+L S+NV
Sbjct: 436 FRRFIHLSSCDGEASAFDTSPTCE---LD--------PSEHLKGDKSSPLSVDTLGSENV 495
Query: 496 IRTPQSDSNSAQKVFPPWLTADSYLKCDQNSAPE----LFSPANLPRDNPNAITSITDLS 555
I+TP+S+S+ A+ K D S E F A L + + S
Sbjct: 496 IQTPESNSSFDNYFGLSCSQAEIQKKHDVGSDLESLTMKFQSAGLSPRIQASSWEPSRSS 555
Query: 556 FQFDCLATISNSMDLHQFQKILEDQAFMNSNSSCEDLLKSKMRVSWREGLMSRIYEMDEF 615
F FD LAT S+S+DL QFQ+ L D++ + + + + + ++ +RV M +
Sbjct: 556 FNFDYLATSSDSIDLSQFQRGLVDRSSCHPHVTLDKVSRTHLRVEQTNSHMPEMKSQQIT 615
Query: 616 DTCRCLSDEEENVDSCSISLSDILKTPLEHNDCEADPIVSNRSCSPGLSVDEEAEEYDKC 653
DT E D + N E A K
Sbjct: 616 DT-------------------------------EFD--IQNHK--------ESAASLGKG 632
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038906910.1 | 0.0e+00 | 89.29 | uncharacterized protein LOC120092781 [Benincasa hispida] >XP_038906911.1 unchara... | [more] |
XP_011649077.1 | 0.0e+00 | 86.61 | uncharacterized protein LOC105434579 [Cucumis sativus] >KGN61440.1 hypothetical ... | [more] |
XP_016902071.1 | 0.0e+00 | 85.08 | PREDICTED: uncharacterized protein LOC103496888 [Cucumis melo] >KAA0039544.1 unc... | [more] |
XP_023550090.1 | 4.3e-259 | 74.77 | uncharacterized protein LOC111808388 [Cucurbita pepo subsp. pepo] | [more] |
XP_022928860.1 | 3.6e-258 | 74.16 | uncharacterized protein LOC111435650 [Cucurbita moschata] >XP_022928861.1 unchar... | [more] |
Match Name | E-value | Identity | Description
A0A0A0LHN8 | 0.0e+00 | 86.61 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G122530 PE=4 SV=1
A0A5D3BQU9 | 0.0e+00 | 85.08 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S4E1G9 | 0.0e+00 | 85.08 | uncharacterized protein LOC103496888 OS=Cucumis melo OX=3656 GN=LOC103496888 PE=...
A0A6J1ELH1 | 1.8e-258 | 74.16 | uncharacterized protein LOC111435650 OS=Cucurbita moschata OX=3662 GN=LOC1114356...
A0A6J1I200 | 8.7e-258 | 74.73 | uncharacterized protein LOC111468888 OS=Cucurbita maxima OX=3661 GN=LOC111468888...
Match Name | E-value | Identity | Description
AT2G43990.1 | 2.4e-50 | 30.85 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
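The pipe-delimited summary rows above can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the names `Hit` and `parse_hit` are hypothetical, the 70% identity cutoff is an arbitrary illustration, and the parser assumes the first four pipe-separated fields are always match name, E-value, identity, and description, ignoring anything after them.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of a BLAST summary table (hypothetical container)."""
    name: str
    evalue: float
    identity: float  # percent identity as printed in the table
    description: str


def parse_hit(row: str) -> Hit:
    """Split a summary row on '|' and keep the first four fields.

    Trailing fields (for example an expand-link column) are ignored.
    """
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return Hit(name, float(evalue), float(identity), description)


# Two rows copied from the tables above.
rows = [
    "XP_023550090.1 | 4.3e-259 | 74.77 | uncharacterized protein LOC111808388 [Cucurbita pepo subsp. pepo]",
    "AT2G43990.1 | 2.4e-50 | 30.85 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...",
]

hits = [parse_hit(r) for r in rows]

# Keep only hits at or above an illustrative 70% identity cutoff.
strong = [h.name for h in hits if h.identity >= 70.0]
```

With these two rows, only the Cucurbita pepo hit passes the cutoff; the Arabidopsis match at 30.85% identity is filtered out despite its significant E-value.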