Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGGTAAGTTACCCTGGCATTTTATAGCAACTAAATATGGCATCCTCGTCATGTTTTCTTAATGTACTGCTAATTTGTAATAACTTCTGTGTTTGGGAGCTGTACTATATTTGTGTCTGTTTTCTATGTTTGTAACTCAAGAGGTTTAGTTTCTTTTCTTCTGGTAGGACAATAAACTTTTGAAGGTTATTAGTTTTTGTTGGATAATTAAGTCTTGCTTTCTTAAAACCTGTTTGTTCTAATGTGATTGATAATTGTTTGTTAATGGTGTCTCTCAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTAAAATATTTGGTTTCCAGAATGGCTCTAATATTTATAAAGTTGTCTCTGAATTTTGGTAATTCAATGACATAGCATTAGCAAATTGTCCCAATGTTATTTTTTTCATTCCTAACTGTAGGAGTCTACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAAGTAGGTTATTATGTTATGTTGATATTCTGTTCTCTGGAGTTTGACTTTCCGTGTAATAAGTTCTGACCTTTTGACTTCTCCAGTCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGGTACTTATTTTTCTTTCCCTCAAGGATAATCCTTTATCCGGTACTTCATATTTATCATTCTCCTTGTTTCTTATGCAATGATGGATAATAAAAAAGAACATTGTACATGGGGCAAAACCTCTCGAATAAAAAGCCATGTACCATGAAAACTGTGGTAGTGCAGTTTATTCTAGTTAGCTAATTTAAATAAAGGGGAAAAGAACTATTGAATTTTAAATGTACTTAAAATGATTAGCCTAATTTTCCCATTTCTTTGCACTATGTCTAGTTGTATATTATTTGACCGTTGGATCTCAGGATGGGCTATTTGTGACCCTTCATCCTTTGGGTTTAAGATAGAAAGACCCCCTCTTGCTTTTGAATTTTTGGTAGAAAGAAGCTAATTAGAAGAGTACTTTATCCATTCGATAGTTGGTATAGTTATAACACAACTTGGATATAATATTTCAGCAAGGACCAAAAAACGGTAAATGAGTGAGCATATGGATCTCTGAAGGCTTTAAAACTTATGTGGATAAAAGGACGCTTTGTTATGGTTTCATATCAACTTCAATTTCACAAACTTTTAAAACATGTCATCTATTTTCTTTTGACTTAGGTTGAATCCTGAAAGGATAAGTGGAGATTTCAACGAGGAAGTTCAAGTAAGAAATATTTTAATTATATTACAGGATATGTATTGTTCTGAACTCCATGCACTGTACTCTTCGTGTAATGTTAAATGATGAAGATCTGTTTATTCCAGGTTCACAGTGGGTTCTTAAGTGCGTATGATTCAGTACGAATGAGAATTATTTCCCTCATTAAAATGGCCATTAATTATAAGTGAGTCTGATAGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATAAAAAAAATTAGAAAAGAATAATCCTGTGGAGGCAGTAATATGAACTGACTCAAAACGGTCCATCCAAGAACTACATTTGTCTTAAAAAACTCTTTGATTTCTCTCTAACCACAATTCCAAAGCAAGGCTTTGACTTCATTGACATATAATAATTGAGACCTCGAAGGTAATGTTACTCCCTTACGAGCTGTAGAATATTGTGTTTGGCAGATGAAGAGAACTCCCATCGCAAATTGAAAATTCTGAATATCTTATACCAGCAAGCCATTGAGAAGGAACAACCAAAAAGTAAGTGATTAAGAGATTCCCCATCTATATAATAAAGACAGCAAATAGAAGGTAGAAGGCATGATGTAGGCATCTTCTGAAGCACTTCCTAAGTATACAGACAACTATTACGTAAAATTTATGTAAGAATATTGAGTAATCTAGGACTCTCTTATTTCCAAATAGCAGTAAACAAGTCTTTAGGCATAATCCATTTGGGGTGAGAGGAACACTAGAATCTTTTGAGGGAAGGAGCTTAATCCTTTTGTTGTCTGGTCTTGGCTTGATTCCATGTTCCTTTGTGGGCTTTAGTTACGAAGCTTTTTTGTAATTATTCTCTTTGCTTTATTTTGCTTTATTGGAGGTCTTTCTTTTAGGTAGCCTCCTCTTTTTTGGGCTGACAGGGTACTAGGAGAGATTTGCAAAACCATAACAAAGTGTGCGGCTTCCTCTTCCTATTATATATCAGTACAAGAGGAGGGGTAAAAAGGTCCTGACAAGGGAAGCAAGTAATTAGTCACGTGGGAGTGAGGGAGTGAAATGAGAATGTTGGTTGTGGGAACCATTGATAGTGATTCTGAAGGGTATTAGTAGGGTTGTTATTCCCTTTCATAGTCTCCATGTTATTTCTCTGTTTTTAGTTTTAGTTCTTCTTGAGGGTGGGAGGAACTGGATTGTACAAACTATTATCATATCACTACATCTTGAGGAAGTTTCCTTACACAAACCCTCTCCATGAACTCCTCCACAAGGACTCCTTCATTTGGAAGGAGGAAGGCACTCGTGCTTTTGAGAAGCTAAAGCAAGTGATGATGTCCTTACGCATACTAGCTCTACTGGATTTTAACCTGCCATTCATAATTGTAACTGTAATTGGTGCTTCCAGAGCCAGGTTGGGAGCTTTGCTATCTCAGAATCATCGCCCCATTGCTTACTTCAGCCATACTCTTTCGGCAAGAGTGAGGGCCAAATCAATTTATGATCGTGAATTAATGGTAGTGGTATTGGCTATTAAAAAATGATTACATCATTTGTTGGGACTGAAGTTTACGGTGATAGCTAATTAGAGAGCTATATAGTTCTTGCTGGAACAAAGAGAAACTGAACCCCAGTATCAATGGCAAGCCCATAAATCCTTGTGATCCAAAAACTTCAACTCTCCACCAAAGTGAAGAAAATAGATAAAGAATAGACACCTTGTGTGACATTTGATGTATTGAATGAGGCATCATTTCATAGAATCAGAAGACTCTCCGTTGCACCATGGGCATCCAATGCAGCCCAAGTAATACCTCTGGAGCTCTAAATAGTCTTGACGAAAAGTTGATCAGCACTTTGAATTCTAGTTTCTTAAAGGATTACAATGGCAACAAATTTTGACAAAGATAAAATCTACCTTTGGTCTTAGGATATCTGCTCTCACCAGCATTTTAGCTTCTCAAAATTAGAAGGGAAGCATCAAACTCAGTTCAAAGTTCAGCATGAAAATTGCAAAAGTGAACGCAATTGAGAACAATTCTTTATACAATATAACGAAGTCATGCGAGAGCTTAAGGGATGACATCAAAGAAATCTTTAATTGCAGACGGGATAACTTTGGGTACTAAGATTGAGAATCAAAATGAGTTTTTAATATTTTGTTTGAAATATCATATAAATATAGATAGAACATGATCATTTGAGTGGGGCACGTTCCCCTCCCCCCAACGTGGATCCCTTTGTATAAGGGTAACCTTTGTAAGAGAGTAAAAAAGACAATAAAGGACAACATTACTAAGATCTTTACAATAATATAACCGTACTAATATAAGCACTATAACAAAAACAAACTAACTAACGAACTCATAACAAACCCAATGGACACACCAAAAAGCTATATTAGCTGCATCACTTCAATTGTTCGATTTTCTTTGAGGTTCATGTGGGAAAACACAATTTTTTCTAATATTCAGAAATCAAGATGGGTTATTGATTCAAAACTCTTTTGTGCACTCTGTTTGACTTATATCCCTTTTCTTGCAGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTTAGTGTTTATTTTCTACAGGAATGAGTGATGCTGGATCCTGAAGATTGAAAATAGGTCATGGCAGGAAAAGGATTATGGTTAAGCAAATAGTGGCCTTTTAAATGAGATATTGAAGAATTAATTTTTTTTACCACTTTCTTTACCCTAATTAAAAAATTCTTAACATATTCCTTCATTTACCTATTTATCGATTATTTCTTATCCAAAAAAAACCTATTTATCGATTATTTTTCTAACTTTACATGTAAGGCATCATTGATCTTAATAATTCACTTTTTTAGAAGAGAAACATACACTCTTGATTATTATGGTTTGAGTAATGATTCTATTTGGTTAGAAATTTAGCCATTAGACTAGCTGTAGTATTCCTATTAGTTCTATGATTATTACCCTTTTTAATTTTTAGTGATTAGACAGCTTGGCTTTTCTCCTCTTTTATCTTTCTTACCAAATACAATATTGCTCTAAGTTTTACAAGAGTGCTCTCTCAATACTGAGTACATGGGAAGAAACTGAACAACATAATGCCAATTGAATGCAACTTTCAGACTTTTTCATTGATGTAGAGTGTTGATATAATATTAAATTTTCCATAACCCATCAAATGCCAAATTTGCTTCTCTTATTGAAACTTGTGGTCTCAAGATGCACGAAATCCCCCATTGATCCAAGAAGGAAATCATGTAAGGGTTTAATTTTCCCATGCTTTCTTCGGTTTTCCAAAGTTACGGAGACATGCCAAGCTTTTCGGCCCTATTTCAGTGATTCTCTTTCAAGCTTCCAAGTGCTCTATATTGACTTGCTATCAAAGTGAATCACTCAGCTCTCTTTCCAGTTGGCTTCTTTGGTGGTTTCAGCTCTGGTTCGAAATTCTCGACTAGTTTTGTCGTTTCTTTTTCCGGCTGTTTTTGTCTCCAGAGGTTCCCAAGTGTTTTGGAGGAGGGATTTTTTTCATGCTTAGTTTGAAATTTAGTGTTCTTTAAGTGGTCAATCTTTACAAGCTTTTCTCTTGTTCTCGCAGCTCTCTTATTTTGTGTTTTTAGAAGTCTCAAAGTTTGGTTTGAACCTTAAGTTCTTTGCAAGAGTGTTCTCTACGTCTTTTCATTCTTGGGGCGCTTCCTTGTCCTTTTTATTTCTTGTTTATGTTCCTTTTTCTCTTCCATATGGAGTTTGTATCTTTGAGCACTAGTCTCTTTTCATTTTTTTGTTCTTTTTTTTTTGGAAAGAAATAAAACTTTTCATTAATGTAATGAAAATACAAGTAAAAATATACTAAAAGCAGTACAACATCACATAACAATACAATACAAAATCAAGCAGAAAATATAAAAGCTCCCCAATTCAGACAAATATCTTGAATGGAAAAATTCACAAAAACTTAGATAAAGAACACTAAGAGGAAGCATTAGGTCTTGCTTCTTTAAAACGTTATGTTCATTGAAGATGCTTATCTAGGAAATTTCTTTTATTTCTTTCGAACTAAATTTTAGTTAATAATATTTTGACTGCATTTATTGGAGAATCTTCTAAACTAACCGCAACAAATAGAGTAATTTGAATTTCAGAAGCGATCAACAAAATTGTGCAACAAACTTTAGTAATGCACATGTTCTCATTCACTGGTTGTGCTCAAAATTATGATCAAAATTCTCAACTTTTGTGTAGTAAACCCAAATGTATTAACACCCACTTAATTACTAACGAGATAAGGTAGTTTTCTTTTTCTCTAGTTCTTGTTTTCATTTTGTGATTCATCATGTATTATTGTAATTTGAGCATTATACTCTTTTCATTATATCAATGTAAAGTTTGTTTTCTTTAAAAAAAACAAAAAAAAAAAAACAAAACATATTTACAGCAAACCATCCCTTCAATCTAGTTCAAATATATCTTCCTTGCCAACTTGTCAATCATTCTATTGAATTGTACTTTTGGAAATTCTTTAGTCGACAGATCAACAGTTTTACCTATACTTAGAACATAAGGATACATGTAAGAATCGTTGATTGGCATAGAGATTAACAATCCTCTAATGCAAGTATAACATGGACAGGGTCCAAGTATCGAATCCCAAAGAAACAAAAGGCAAGATTTCAATTAAACAAATAAAGTCCTCAAAGTTTGTAACCAAAGGGGGATTCAAAACTGTTTCAGTCTTCAGGGGGAATTGCGCCCACCTTGCGCCTGCAATCTGCCTTGTTCAATTAGATCGAGACACAACCTGACATGCGGCCTTCATGCGCCTGCATCTTGCTTTCTCAAGCACAAAGAGTTAATTCGTATGCGATCAAGTTGGGCATTGAATCTACCCTAATCAACACTTAGCAACATAAATCGAATTTAACACATATGCAACCCTCTTTGAGATACAATTTAGGAGTCTGTAAGGATTCCCTTCAAGGAGAATCACTAACTTTACTGATAATAACACACAAAATATAAGCTCCCACCTTTCGAGAGGGTGGTTCTCTCCCAAATGTCTGATAATATAATTCCAAAAGTAAGCATAATCACAAAATACATTTAACAATTCTTAAATAGGCTAACCCTACTGCCAAACTAAGTGGGTGAATTCCCACTTTTACCCCCTCCTAACATAAGTATGTATGAGTGGAGGCCTAACAAAGTCTAAGCAAAAAAAAAAAAGATCCGAGTGATGGATTAAAGAGTTTTCTCCAAGGATTAAAGCTTGATGATTTGGGGGAGTCTTGGCTGCCTATTATAGACCTCCTAGCGCTAGGATTTCTTTCAGATCGGCGTGGAAATCCTTCCTTATCAGTATAGCCGAGAATAAAGGTGATTTGATTTGATTTCATTGAAGCTTCTACAAATCTCGTGATTTTTGCCACCCCCCCCCCCCCCAAAAAAAAAATCTTAAGGGTATTGTGCCTGCAAGGCTAGTGTATTCTCAACACAATTCATGTACTACTCAGCTGCTTCTGTTTTTGTCATCTATCATGATTGAGACCGCAATATGGGCGCAATTTGGACAGTTTGCTGCATACAACACAAACATAAGAATACATTAGAATAAATTTGAATCTATTTACACTTTTTTTGCTTCATTTCACGGCTAAGAATGCATGCTTGTCTTATTTTTTACCTAAATTCGCAATATGATATAGAATATGATGTTAAACACATGCTTTCCAAATAATAAACTGAGTTTTTAAGATTCAATAATGAGAGTTATACAATATGCTATCAGCTGTACAGGAAATTGGGTTTGCGGACTAAATAAATAAATTATTCTACTTTACTGTTTTGAATCTTTTGATTGGCAAAAATTGGAAGTTGATCCTCGTTTCAATATCCTGCAATAAAAAGTCCTGCTTATCCTTTTCTTTTCTTCTTCTTTTTCTTTCTTTCTCTAAATTAGGCACGAGGCAATAACTGTGACCATGTATAATTTTGGATCTCCTAGAGTTGGCAACCGGCAATTTGCAGAAATTTACAACAAGGTATGAAATTCCCTTGTTTCATTTGTAAAATAGAAAATTCTTATCCTCTTAATTTGTTCATGTTGTTGGCTTCCCTCTCCCTCTTCTGAGTGGAAGTCCAATATGTTAAAGAAAATTATGACTTAATTATATTTTCCTTTCTTGTATTTTCCTTTTCTGATATTTTCCTTTGTTGGGTCCATTGTTGGCTATTTACTCAGCAATTCTTTGTATTCTAATTAAGTTATTATTCTAGAATATAATATACAAAATTTACATGGTATCAGAGCCATAAGTCCTAGGGTTTTATACTTTTCGATCTCTGCCTCCATTGACGGTCGGTCGATGGAGGCGGATCCATGAAGTCTCGTCGCCATTACACTACTTCTGACCTGTTCATCCATCTCTGCCTCCGTCACTGATTTTGTTATGTTTTTAGATTTACATTTGTCGGTCTTTGTTCTATTTCGGATTTGCGTTCGTAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA
mRNA sequence
ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAATCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA
Coding sequence (CDS)
ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAATCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA
Protein sequence
MCVLQGNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVVGSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPNEGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDLLSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLVVAFRGTEQSRWKDLITDLMLVPAGDDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLARSLFRFQIGFSLVCFKSYVDLCFQSRNSSWVSVFGPKVLCGYLSPTQKFFVDLCFRLRSLSWSLFLAQICRDLIQKFSSSISLSTSQSQVYHLEFEGC
Homology
BLAST of HG10006944 vs. NCBI nr
Match:
XP_038876505.1 (uncharacterized protein LOC120068939 [Benincasa hispida])
HSP 1 Score: 762.7 bits (1968), Expect = 2.1e-216
Identity = 400/497 (80.48%), Postives = 421/497 (84.71%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GNLHDV+VELEGMGGGGKLL+EIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 226 GNLHDVTVELEGMGGGGKLLMEIKYRTFDEIEDDKRWWRVPFISEFLRSNGFVSALNKVV 285
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQN+ EDIPSY+QTNT+V ITDI PN
Sbjct: 286 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNNAEDIPSYVQTNTKVSITDIKYPN 345
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE +DNTV+SGQ LKEVTQSLL KQFDKQFWTNLADVTNQNIVKKLGLPAPEK
Sbjct: 346 EGKSDEVEINDNTVESGQLLKEVTQSLLTKQFDKQFWTNLADVTNQNIVKKLGLPAPEKF 405
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 406 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 465
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLGALMVLTATISQLNKEA+L+GKKDTK EGSKKEGEKLGSSGDGSLLDNRNSE
Sbjct: 466 LSQTESVLGALMVLTATISQLNKEARLVGKKDTKDEGSKKEGEKLGSSGDGSLLDNRNSE 525
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV
Sbjct: 526 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 585
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 586 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLIK 645
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
D+CAEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G V
Sbjct: 646 MAINYNDECAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAVTVTMYNFGSPRV 705
BLAST of HG10006944 vs. NCBI nr
Match:
XP_011654507.1 (uncharacterized protein LOC101204368 isoform X1 [Cucumis sativus] >KAE8648243.1 hypothetical protein Csa_018335 [Cucumis sativus])
HSP 1 Score: 729.6 bits (1882), Expect = 2.0e-206
Identity = 388/497 (78.07%), Postives = 411/497 (82.70%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED S +QTNTEV ITD N P
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 341
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 401
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 402 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 461
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 521
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 522 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 581
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 582 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 641
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G V
Sbjct: 642 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPRV 701
BLAST of HG10006944 vs. NCBI nr
Match:
XP_031740823.1 (uncharacterized protein LOC101204368 isoform X4 [Cucumis sativus])
HSP 1 Score: 729.6 bits (1882), Expect = 2.0e-206
Identity = 388/497 (78.07%), Postives = 411/497 (82.70%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 83 GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 142
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED S +QTNTEV ITD N P
Sbjct: 143 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 202
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 203 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 262
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 263 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 322
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 323 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 382
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 383 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 442
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 443 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 502
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G V
Sbjct: 503 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPRV 562
BLAST of HG10006944 vs. NCBI nr
Match:
XP_011654508.1 (uncharacterized protein LOC101204368 isoform X3 [Cucumis sativus])
HSP 1 Score: 729.2 bits (1881), Expect = 2.6e-206
Identity = 384/464 (82.76%), Postives = 399/464 (85.99%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED S +QTNTEV ITD N P
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 341
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 401
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 402 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 461
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 521
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 522 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 581
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 582 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 641
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR 428
DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR
Sbjct: 642 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLAR 685
BLAST of HG10006944 vs. NCBI nr
Match:
XP_008460597.1 (PREDICTED: uncharacterized protein LOC103499378 isoform X1 [Cucumis melo])
HSP 1 Score: 727.2 bits (1876), Expect = 9.8e-206
Identity = 391/498 (78.51%), Postives = 409/498 (82.13%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 341
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 401
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 402 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 461
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 521
Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 522 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 581
Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
VVAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 582 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 641
Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G
Sbjct: 642 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 701
BLAST of HG10006944 vs. ExPASy TrEMBL
Match:
A0A1S3CCU0 (uncharacterized protein LOC103499378 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)
HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Postives = 409/498 (82.13%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 341
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 401
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 402 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 461
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 521
Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 522 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 581
Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
VVAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 582 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 641
Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G
Sbjct: 642 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 701
BLAST of HG10006944 vs. ExPASy TrEMBL
Match:
A0A1S3CDA2 (uncharacterized protein LOC103499378 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)
HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Postives = 409/498 (82.13%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 84 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 143
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P
Sbjct: 144 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 203
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 204 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 263
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 264 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 323
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 324 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 383
Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 384 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 443
Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
VVAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 444 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 503
Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G
Sbjct: 504 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 563
BLAST of HG10006944 vs. ExPASy TrEMBL
Match:
A0A1S3CE13 (uncharacterized protein LOC103499378 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)
HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Postives = 409/498 (82.13%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 78 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 137
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P
Sbjct: 138 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 197
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 198 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 257
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 258 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 317
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 318 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 377
Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 378 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 437
Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
VVAFRGTEQSRWKDL TDLMLVPAG
Sbjct: 438 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 497
Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR ++ + G
Sbjct: 498 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 557
BLAST of HG10006944 vs. ExPASy TrEMBL
Match:
A0A6J1EY43 (uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)
HSP 1 Score: 712.6 bits (1838), Expect = 1.2e-201
Identity = 381/497 (76.66%), Postives = 406/497 (81.69%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
G+LHDVSVELEGMGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLRS GF SALNKVV
Sbjct: 78 GDLHDVSVELEGMGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVV 137
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTV V QFVEYAFGKLKSFNDEYQSS +LLSKQ D EDIPSYMQTN EV ITDI+DP
Sbjct: 138 GSDTVSVGQFVEYAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPE 197
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
E + D+ T+DNT ++GQ LKEVTQS+LAKQFDK FWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 198 EDESDDDATNDNTKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKL 257
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLAT KSLD D EQKNI+MVDSTLTDVKKITKDL
Sbjct: 258 KWDGFELLNKIGLEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDL 317
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLG LMVLTATISQLNKE+Q IGKKDT+ EGSKK GEKLGSSGDGSLLDNRNSE
Sbjct: 318 LSQTESVLGGLMVLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSE 377
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EM+ALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF RRRLV
Sbjct: 378 EMRALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLV 437
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLML PAG
Sbjct: 438 VAFRGTEQSRWKDLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIK 497
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
DDCAEPP+KWHVYVTGHSLGGALATLLALEL+SSQLAR ++ + G V
Sbjct: 498 MAINYNDDCAEPPVKWHVYVTGHSLGGALATLLALELTSSQLARHGAINVTMYNFGSPRV 557
BLAST of HG10006944 vs. ExPASy TrEMBL
Match:
A0A6J1EYQ8 (uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)
HSP 1 Score: 712.6 bits (1838), Expect = 1.2e-201
Identity = 381/497 (76.66%), Postives = 406/497 (81.69%), Query Frame = 0
Query: 6 GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
G+LHDVSVELEGMGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLRS GF SALNKVV
Sbjct: 61 GDLHDVSVELEGMGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVV 120
Query: 66 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
GSDTV V QFVEYAFGKLKSFNDEYQSS +LLSKQ D EDIPSYMQTN EV ITDI+DP
Sbjct: 121 GSDTVSVGQFVEYAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPE 180
Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
E + D+ T+DNT ++GQ LKEVTQS+LAKQFDK FWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 181 EDESDDDATNDNTKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKL 240
Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
KWDGFELLNKIGLEARK+AEAGYIESGLAT KSLD D EQKNI+MVDSTLTDVKKITKDL
Sbjct: 241 KWDGFELLNKIGLEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDL 300
Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
LSQTESVLG LMVLTATISQLNKE+Q IGKKDT+ EGSKK GEKLGSSGDGSLLDNRNSE
Sbjct: 301 LSQTESVLGGLMVLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSE 360
Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
EM+ALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF RRRLV
Sbjct: 361 EMRALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLV 420
Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
VAFRGTEQSRWKDL TDLML PAG
Sbjct: 421 VAFRGTEQSRWKDLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIK 480
Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
DDCAEPP+KWHVYVTGHSLGGALATLLALEL+SSQLAR ++ + G V
Sbjct: 481 MAINYNDDCAEPPVKWHVYVTGHSLGGALATLLALELTSSQLARHGAINVTMYNFGSPRV 540
BLAST of HG10006944 vs. TAIR 10
Match:
AT4G13550.1 (triglyceride lipases;triglyceride lipases )
HSP 1 Score: 405.2 bits (1040), Expect = 7.9e-113
Identity = 246/525 (46.86%), Postives = 314/525 (59.81%), Query Frame = 0
Query: 3 VLQGNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS-------K 62
V GNLH V VEL+G+GGGGK+ LEIKY+ F E+E++K+WWR PF+SEFL+ K
Sbjct: 68 VCDGNLHKVLVELDGIGGGGKVQLEIKYKGFGEVEEEKKWWRFPFVSEFLQRNEIKSVLK 127
Query: 63 GFV------SALNKVVGSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSY 122
FV S L +V S+ VP RQFVEYAFG+LKS ND + LL+ N ED
Sbjct: 128 NFVDSEAVESVLKNLVDSEAVPARQFVEYAFGQLKSLNDAPLKNTELLN--NTAEDSEGA 187
Query: 123 MQTNTEVCITDINDPNEGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVT 182
++ N + GK + + D G L++ +S + Q + FW N+ D+
Sbjct: 188 SSEDSSDQHRSTNLSSSGKLSKDKDGDGD-GHGNELEDDNES-GSIQSESNFWDNIPDIV 247
Query: 183 NQNIVKKLGLPAPEKLKWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKN-- 242
QNIV+KLGLP+PEKLKW+G ELL GL++RKTAEAGYIESGLAT + + D E+++
Sbjct: 248 GQNIVQKLGLPSPEKLKWNGTELLENFGLQSRKTAEAGYIESGLATADTREADDEKEDGQ 307
Query: 243 --IRMVDSTLTDVKKITKDLLSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKK 302
I S+L D+K T++LL Q ++V GALMVL A + L+K++ K K S
Sbjct: 308 VAINASKSSLADMKNATQELLKQADNVFGALMVLKAVVPHLSKDSVGSEKVIEKNGSSSV 367
Query: 303 EGEKLGSSGDGSL--------LDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSE 362
+ GSS + D +N+EEMK LF++AESAMEAWAMLAT+LGHPSFIKSE
Sbjct: 368 TDDVSGSSKTEKISGLVNVDGADEKNAEEMKTLFSSAESAMEAWAMLATALGHPSFIKSE 427
Query: 363 FEKLCFLDNESTDTQVAIWRDFMRRRLVVAFRGTEQSRWKDLITDLMLVPAG-------- 422
FEKLCFL+N+ TDTQVAIWRD R+R+V+AFRGTEQ++WKDL TDLMLVPAG
Sbjct: 428 FEKLCFLENDITDTQVAIWRDARRKRVVIAFRGTEQTKWKDLQTDLMLVPAGLNPERIGG 487
Query: 423 ----------------------------------DDCAEPPLKWHVYVTGHSLGGALATL 457
DD E KWHVYVTGHSLGGALATL
Sbjct: 488 DFKQEVQVHSGFLSAYDSVRIRIISLLKMTIGYIDDVTEREDKWHVYVTGHSLGGALATL 547
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038876505.1 | 2.1e-216 | 80.48 | uncharacterized protein LOC120068939 [Benincasa hispida] | [more] |
XP_011654507.1 | 2.0e-206 | 78.07 | uncharacterized protein LOC101204368 isoform X1 [Cucumis sativus] >KAE8648243.1 ... | [more] |
XP_031740823.1 | 2.0e-206 | 78.07 | uncharacterized protein LOC101204368 isoform X4 [Cucumis sativus] | [more] |
XP_011654508.1 | 2.6e-206 | 82.76 | uncharacterized protein LOC101204368 isoform X3 [Cucumis sativus] | [more] |
XP_008460597.1 | 9.8e-206 | 78.51 | PREDICTED: uncharacterized protein LOC103499378 isoform X1 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3CCU0 | 4.7e-206 | 78.51 | uncharacterized protein LOC103499378 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CDA2 | 4.7e-206 | 78.51 | uncharacterized protein LOC103499378 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CE13 | 4.7e-206 | 78.51 | uncharacterized protein LOC103499378 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1EY43 | 1.2e-201 | 76.66 | uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EYQ8 | 1.2e-201 | 76.66 | uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT4G13550.1 | 7.9e-113 | 46.86 | triglyceride lipases;triglyceride lipases | [more] |