HG10006944 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10006944
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: C2 domain-containing protein
Location: Chr07: 23464306 .. 23473458 (+)
RNA-Seq Expression: HG10006944
Synteny: HG10006944
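
The Location field can be turned into a feature length. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the page itself does not state the convention, and `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (Chr07, + strand).
gene_length = feature_length(23464306, 23473458)
print(gene_length)  # 9153
```

Under a 0-based, half-open convention the result would be one base shorter, so the convention is worth confirming before comparing against the sequence length.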
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGGTAAGTTACCCTGGCATTTTATAGCAACTAAATATGGCATCCTCGTCATGTTTTCTTAATGTACTGCTAATTTGTAATAACTTCTGTGTTTGGGAGCTGTACTATATTTGTGTCTGTTTTCTATGTTTGTAACTCAAGAGGTTTAGTTTCTTTTCTTCTGGTAGGACAATAAACTTTTGAAGGTTATTAGTTTTTGTTGGATAATTAAGTCTTGCTTTCTTAAAACCTGTTTGTTCTAATGTGATTGATAATTGTTTGTTAATGGTGTCTCTCAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTAAAATATTTGGTTTCCAGAATGGCTCTAATATTTATAAAGTTGTCTCTGAATTTTGGTAATTCAATGACATAGCATTAGCAAATTGTCCCAATGTTATTTTTTTCATTCCTAACTGTAGGAGTCTACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAAGTAGGTTATTATGTTATGTTGATATTCTGTTCTCTGGAGTTTGACTTTCCGTGTAATAAGTTCTGACCTTTTGACTTCTCCAGTCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGGTACTTATTTTTCTTTCCCTCAAGGATAATCCTTTATCCGGTACTTCATATTTATCATTCTCCTTGTTTCTTATGCAATGATGGATAATAAAAAAGAACATTGTACATGGGGCAAAACCTCTCGAATAAAAAGCCATGTACCATGAAAACTGTGGTAGTGCAGTTTATTCTAGTTAGCTAATTTAAATAAAGGGGAAAAGAACTATTGAATTTTAAATGTACTTAAAATGATTAGCCTAATTTTCCCATTTCTTTGCACTATGTCTAGTTGTATATTATTTGACCGTTGGATCTCAGGATGGGCTATTTGTGACCCTTCATCCTTTGGGTTTAAGATAGAAAGAC
CCCCTCTTGCTTTTGAATTTTTGGTAGAAAGAAGCTAATTAGAAGAGTACTTTATCCATTCGATAGTTGGTATAGTTATAACACAACTTGGATATAATATTTCAGCAAGGACCAAAAAACGGTAAATGAGTGAGCATATGGATCTCTGAAGGCTTTAAAACTTATGTGGATAAAAGGACGCTTTGTTATGGTTTCATATCAACTTCAATTTCACAAACTTTTAAAACATGTCATCTATTTTCTTTTGACTTAGGTTGAATCCTGAAAGGATAAGTGGAGATTTCAACGAGGAAGTTCAAGTAAGAAATATTTTAATTATATTACAGGATATGTATTGTTCTGAACTCCATGCACTGTACTCTTCGTGTAATGTTAAATGATGAAGATCTGTTTATTCCAGGTTCACAGTGGGTTCTTAAGTGCGTATGATTCAGTACGAATGAGAATTATTTCCCTCATTAAAATGGCCATTAATTATAAGTGAGTCTGATAGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATAAAAAAAATTAGAAAAGAATAATCCTGTGGAGGCAGTAATATGAACTGACTCAAAACGGTCCATCCAAGAACTACATTTGTCTTAAAAAACTCTTTGATTTCTCTCTAACCACAATTCCAAAGCAAGGCTTTGACTTCATTGACATATAATAATTGAGACCTCGAAGGTAATGTTACTCCCTTACGAGCTGTAGAATATTGTGTTTGGCAGATGAAGAGAACTCCCATCGCAAATTGAAAATTCTGAATATCTTATACCAGCAAGCCATTGAGAAGGAACAACCAAAAAGTAAGTGATTAAGAGATTCCCCATCTATATAATAAAGACAGCAAATAGAAGGTAGAAGGCATGATGTAGGCATCTTCTGAAGCACTTCCTAAGTATACAGACAACTATTACGTAAAATTTATGTAAGAATATTGAGTAATCTAGGACTCTCTTATTTCCAAATAGCAGTAAACAAGTCTTTAGGCATAATCCATTTGGGGTGAGAGGAACACTAGAATCTTTTGAGGGAAGGAGCTTAATCCTTTTGTTGTCTGGTCTTGGCTTGATTCCATGTTCCTTTGTGGGCTTTAGTTACGAAGCTTTTTTGTAATTATTCTCTTTGCTTTATTTTGCTTTATTGGAGGTCTTTCTTTTAGGTAGCCTCCTCTTTTTTGGGCTGACAGGGTACTAGGAGAGATTTGCAAAACCATAACAAAGTGTGCGGCTTCCTCTTCCTATTATATATCAGTACAAGAGGAGGGGTAAAAAGGTCCTGACAAGGGAAGCAAGTAATTAGTCACGTGGGAGTGAGGGAGTGAAATGAGAATGTTGGTTGTGGGAACCATTGATAGTGATTCTGAAGGGTATTAGTAGGGTTGTTATTCCCTTTCATAGTCTCCATGTTATTTCTCTGTTTTTAGTTTTAGTTCTTCTTGAGGGTGGGAGGAACTGGATTGTACAAACTATTATCATATCACTACATCTTGAGGAAGTTTCCTTACACAAACCCTCTCCATGAACTCCTCCACAAGGACTCCTTCATTTGGAAGGAGGAAGGCACTCGTGCTTTTGAGAAGCTAAAGCAAGTGATGATGTCCTTACGCATACTAGCTCTACTGGATTTTAACCTGCCATTCATAATTGTAACTGTAATTGGTGCTTCCAGAGCCAGGTTGGGAGCTTTGCTATCTCAGAATCATCGCCCCATTGCTTACTTCAGCCATACTCTTTCGGCAAGAGTGAGGGCCAAATCAATTTATGATCGTGAATTAATGGTAGTGGTATTGGCTATTAAAAAATGATTACATCATTTGTTGGGACTGAAGTTTACGGTGATAGCTAATTAGAGAGCTATATAGTTCTTGCTGGAACAAAGAGAAACTGAACCCCAGTATCAATGGCAAGCCCATAAATCCTTGTGATCCAAAAACTTCAACTCTCCACCAAAGTGAAGAAAATAGATAAAGAATAGACACCTTGTGTGAC
ATTTGATGTATTGAATGAGGCATCATTTCATAGAATCAGAAGACTCTCCGTTGCACCATGGGCATCCAATGCAGCCCAAGTAATACCTCTGGAGCTCTAAATAGTCTTGACGAAAAGTTGATCAGCACTTTGAATTCTAGTTTCTTAAAGGATTACAATGGCAACAAATTTTGACAAAGATAAAATCTACCTTTGGTCTTAGGATATCTGCTCTCACCAGCATTTTAGCTTCTCAAAATTAGAAGGGAAGCATCAAACTCAGTTCAAAGTTCAGCATGAAAATTGCAAAAGTGAACGCAATTGAGAACAATTCTTTATACAATATAACGAAGTCATGCGAGAGCTTAAGGGATGACATCAAAGAAATCTTTAATTGCAGACGGGATAACTTTGGGTACTAAGATTGAGAATCAAAATGAGTTTTTAATATTTTGTTTGAAATATCATATAAATATAGATAGAACATGATCATTTGAGTGGGGCACGTTCCCCTCCCCCCAACGTGGATCCCTTTGTATAAGGGTAACCTTTGTAAGAGAGTAAAAAAGACAATAAAGGACAACATTACTAAGATCTTTACAATAATATAACCGTACTAATATAAGCACTATAACAAAAACAAACTAACTAACGAACTCATAACAAACCCAATGGACACACCAAAAAGCTATATTAGCTGCATCACTTCAATTGTTCGATTTTCTTTGAGGTTCATGTGGGAAAACACAATTTTTTCTAATATTCAGAAATCAAGATGGGTTATTGATTCAAAACTCTTTTGTGCACTCTGTTTGACTTATATCCCTTTTCTTGCAGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTTAGTGTTTATTTTCTACAGGAATGAGTGATGCTGGATCCTGAAGATTGAAAATAGGTCATGGCAGGAAAAGGATTATGGTTAAGCAAATAGTGGCCTTTTAAATGAGATATTGAAGAATTAATTTTTTTTACCACTTTCTTTACCCTAATTAAAAAATTCTTAACATATTCCTTCATTTACCTATTTATCGATTATTTCTTATCCAAAAAAAACCTATTTATCGATTATTTTTCTAACTTTACATGTAAGGCATCATTGATCTTAATAATTCACTTTTTTAGAAGAGAAACATACACTCTTGATTATTATGGTTTGAGTAATGATTCTATTTGGTTAGAAATTTAGCCATTAGACTAGCTGTAGTATTCCTATTAGTTCTATGATTATTACCCTTTTTAATTTTTAGTGATTAGACAGCTTGGCTTTTCTCCTCTTTTATCTTTCTTACCAAATACAATATTGCTCTAAGTTTTACAAGAGTGCTCTCTCAATACTGAGTACATGGGAAGAAACTGAACAACATAATGCCAATTGAATGCAACTTTCAGACTTTTTCATTGATGTAGAGTGTTGATATAATATTAAATTTTCCATAACCCATCAAATGCCAAATTTGCTTCTCTTATTGAAACTTGTGGTCTCAAGATGCACGAAATCCCCCATTGATCCAAGAAGGAAATCATGTAAGGGTTTAATTTTCCCATGCTTTCTTCGGTTTTCCAAAGTTACGGAGACATGCCAAGCTTTTCGGCCCTATTTCAGTGATTCTCTTTCAAGCTTCCAAGTGCTCTATATTGACTTGCTATCAAAGTGAATCACTCAGCTCTCTTTCCAGTTGGCTTCTTTGGTGGTTTCAGCTCTGGTTCGAAATTCTCGACTAGTTTTGTCGTTTCTTTTTCCGGCTGTTTTTGTCTCCAGAGGTTCCCAAGTGTTTTGGAGGAGGGATTTTTTTCATGCTTAGTTTGAAATTTAGTGTTCTTTAAGTGGTCAATCTTTACAAGCTTTTCTCTTGTTCTCGCAGCTCTCTTATTTTGTGTTTTTAGAAGTCTCAAAGTTTGGTTTGAACC
TTAAGTTCTTTGCAAGAGTGTTCTCTACGTCTTTTCATTCTTGGGGCGCTTCCTTGTCCTTTTTATTTCTTGTTTATGTTCCTTTTTCTCTTCCATATGGAGTTTGTATCTTTGAGCACTAGTCTCTTTTCATTTTTTTGTTCTTTTTTTTTTGGAAAGAAATAAAACTTTTCATTAATGTAATGAAAATACAAGTAAAAATATACTAAAAGCAGTACAACATCACATAACAATACAATACAAAATCAAGCAGAAAATATAAAAGCTCCCCAATTCAGACAAATATCTTGAATGGAAAAATTCACAAAAACTTAGATAAAGAACACTAAGAGGAAGCATTAGGTCTTGCTTCTTTAAAACGTTATGTTCATTGAAGATGCTTATCTAGGAAATTTCTTTTATTTCTTTCGAACTAAATTTTAGTTAATAATATTTTGACTGCATTTATTGGAGAATCTTCTAAACTAACCGCAACAAATAGAGTAATTTGAATTTCAGAAGCGATCAACAAAATTGTGCAACAAACTTTAGTAATGCACATGTTCTCATTCACTGGTTGTGCTCAAAATTATGATCAAAATTCTCAACTTTTGTGTAGTAAACCCAAATGTATTAACACCCACTTAATTACTAACGAGATAAGGTAGTTTTCTTTTTCTCTAGTTCTTGTTTTCATTTTGTGATTCATCATGTATTATTGTAATTTGAGCATTATACTCTTTTCATTATATCAATGTAAAGTTTGTTTTCTTTAAAAAAAACAAAAAAAAAAAAACAAAACATATTTACAGCAAACCATCCCTTCAATCTAGTTCAAATATATCTTCCTTGCCAACTTGTCAATCATTCTATTGAATTGTACTTTTGGAAATTCTTTAGTCGACAGATCAACAGTTTTACCTATACTTAGAACATAAGGATACATGTAAGAATCGTTGATTGGCATAGAGATTAACAATCCTCTAATGCAAGTATAACATGGACAGGGTCCAAGTATCGAATCCCAAAGAAACAAAAGGCAAGATTTCAATTAAACAAATAAAGTCCTCAAAGTTTGTAACCAAAGGGGGATTCAAAACTGTTTCAGTCTTCAGGGGGAATTGCGCCCACCTTGCGCCTGCAATCTGCCTTGTTCAATTAGATCGAGACACAACCTGACATGCGGCCTTCATGCGCCTGCATCTTGCTTTCTCAAGCACAAAGAGTTAATTCGTATGCGATCAAGTTGGGCATTGAATCTACCCTAATCAACACTTAGCAACATAAATCGAATTTAACACATATGCAACCCTCTTTGAGATACAATTTAGGAGTCTGTAAGGATTCCCTTCAAGGAGAATCACTAACTTTACTGATAATAACACACAAAATATAAGCTCCCACCTTTCGAGAGGGTGGTTCTCTCCCAAATGTCTGATAATATAATTCCAAAAGTAAGCATAATCACAAAATACATTTAACAATTCTTAAATAGGCTAACCCTACTGCCAAACTAAGTGGGTGAATTCCCACTTTTACCCCCTCCTAACATAAGTATGTATGAGTGGAGGCCTAACAAAGTCTAAGCAAAAAAAAAAAAGATCCGAGTGATGGATTAAAGAGTTTTCTCCAAGGATTAAAGCTTGATGATTTGGGGGAGTCTTGGCTGCCTATTATAGACCTCCTAGCGCTAGGATTTCTTTCAGATCGGCGTGGAAATCCTTCCTTATCAGTATAGCCGAGAATAAAGGTGATTTGATTTGATTTCATTGAAGCTTCTACAAATCTCGTGATTTTTGCCACCCCCCCCCCCCCCAAAAAAAAAATCTTAAGGGTATTGTGCCTGCAAGGCTAGTGTATTCTCAACACAATTCATGTACTACTCAGCTGCTTCTGTTTTTGTCATCTATCATGATTGAGACCGCAATATGGGCGCAATTTGGACAGTTTGCTGCATACAACACAAACATAAGAATACATTAGAATAAATTTGAATCTATTTACACTTTTTTTGCTTCATT
TCACGGCTAAGAATGCATGCTTGTCTTATTTTTTACCTAAATTCGCAATATGATATAGAATATGATGTTAAACACATGCTTTCCAAATAATAAACTGAGTTTTTAAGATTCAATAATGAGAGTTATACAATATGCTATCAGCTGTACAGGAAATTGGGTTTGCGGACTAAATAAATAAATTATTCTACTTTACTGTTTTGAATCTTTTGATTGGCAAAAATTGGAAGTTGATCCTCGTTTCAATATCCTGCAATAAAAAGTCCTGCTTATCCTTTTCTTTTCTTCTTCTTTTTCTTTCTTTCTCTAAATTAGGCACGAGGCAATAACTGTGACCATGTATAATTTTGGATCTCCTAGAGTTGGCAACCGGCAATTTGCAGAAATTTACAACAAGGTATGAAATTCCCTTGTTTCATTTGTAAAATAGAAAATTCTTATCCTCTTAATTTGTTCATGTTGTTGGCTTCCCTCTCCCTCTTCTGAGTGGAAGTCCAATATGTTAAAGAAAATTATGACTTAATTATATTTTCCTTTCTTGTATTTTCCTTTTCTGATATTTTCCTTTGTTGGGTCCATTGTTGGCTATTTACTCAGCAATTCTTTGTATTCTAATTAAGTTATTATTCTAGAATATAATATACAAAATTTACATGGTATCAGAGCCATAAGTCCTAGGGTTTTATACTTTTCGATCTCTGCCTCCATTGACGGTCGGTCGATGGAGGCGGATCCATGAAGTCTCGTCGCCATTACACTACTTCTGACCTGTTCATCCATCTCTGCCTCCGTCACTGATTTTGTTATGTTTTTAGATTTACATTTGTCGGTCTTTGTTCTATTTCGGATTTGCGTTCGTAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA

mRNA sequence

ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAATCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA

Coding sequence (CDS)

ATGTGTGTTTTGCAAGGAAATTTGCATGATGTATCAGTGGAGTTAGAAGGGATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAGTATAGGACTTTTGATGAAATTGAAGATGACAAACGATGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAGTAAGGGTTTTGTATCTGCTTTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGAAAGTTAAAGTCATTCAATGATGAGTACCAATCAAGTCATCATTTATTAAGCAAGCAAAACGACATAGAGGATATACCTTCATACATGCAGACGAATACCGAAGTCTGTATAACTGATATAAACGATCCCAATGAGGGAAAATTTGATGAGGTTGAAACAAGTGATAATACTGTGGACAGTGGACAATCGCTGAAAGAAGTGACTCAAAGTCTTTTAGCAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAACCAAAATATTGTCAAGAAGCTAGGTCTTCCTGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAGGCACGAAAGACTGCTGAAGCAGGTTATATCGAATCGGGGCTTGCAACACCCAAAAGTTTGGATACTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACATTAACTGATGTGAAGAAAATAACAAAAGATCTATTAAGTCAAACTGAGTCTGTTTTAGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGGTGAGGGCTCAAAAAAAGAGGGAGAGAAGCTCGGTAGTTCAGGGGATGGATCATTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGCGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAACGAGTCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAGGAGACTAGTTGTTGCCTTCAGGGGCACTGAACAATCAAGATGGAAGGACCTAATAACAGACCTGATGCTAGTCCCTGCAGGTGATGATTGTGCTGAGCCACCACTCAAATGGCATGTTTATGTTACAGGTCACAGTTTGGGTGGTGCATTAGCTACACTTCTTGCTCTTGAACTTTCGTCAAGTCAACTTGCAAGGTCTCTATTTCGTTTTCAAATTGGTTTTTCGTTGGTCTGTTTCAAATCTTATGTAGATCTCTGTTTCCAGTCCAGAAATTCTTCGTGGGTCTCTGTTTTCGGTCCAAAAGTGCTTTGTGGGTATCTGTCTCCAACCCAGAAGTTCTTTGTGGATCTCTGTTTCCGGCTTAGAAGTCTTTCGTGGTCTTTGTTTCTAGCCCAGATATGTCGTGATTTGATTCAGAAGTTTTCGTCATCGATTTCTCTCAGCACTTCACAATCTCAAGTCTACCACCTTGAGTTTGAGGGATGTTAA

Protein sequence

MCVLQGNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVVGSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPNEGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDLLSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLVVAFRGTEQSRWKDLITDLMLVPAGDDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLARSLFRFQIGFSLVCFKSYVDLCFQSRNSSWVSVFGPKVLCGYLSPTQKFFVDLCFRLRSLSWSLFLAQICRDLIQKFSSSISLSTSQSQVYHLEFEGC
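
The relationship between the coding sequence and the protein sequence can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, and only the first ten codons of the CDS are used here to keep the example compact.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# with T, C, A, G at each position so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
cds_prefix = "ATGTGTGTTTTGCAAGGAAATTTGCATGAT"
print(translate(cds_prefix))  # MCVLQGNLHD
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above, provided the annotation uses the standard code.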
Homology
BLAST of HG10006944 vs. NCBI nr
Match: XP_038876505.1 (uncharacterized protein LOC120068939 [Benincasa hispida])

HSP 1 Score: 762.7 bits (1968), Expect = 2.1e-216
Identity = 400/497 (80.48%), Positives = 421/497 (84.71%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GNLHDV+VELEGMGGGGKLL+EIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 226 GNLHDVTVELEGMGGGGKLLMEIKYRTFDEIEDDKRWWRVPFISEFLRSNGFVSALNKVV 285

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQN+ EDIPSY+QTNT+V ITDI  PN
Sbjct: 286 GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNNAEDIPSYVQTNTKVSITDIKYPN 345

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE +DNTV+SGQ LKEVTQSLL KQFDKQFWTNLADVTNQNIVKKLGLPAPEK 
Sbjct: 346 EGKSDEVEINDNTVESGQLLKEVTQSLLTKQFDKQFWTNLADVTNQNIVKKLGLPAPEKF 405

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 406 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 465

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLGALMVLTATISQLNKEA+L+GKKDTK EGSKKEGEKLGSSGDGSLLDNRNSE
Sbjct: 466 LSQTESVLGALMVLTATISQLNKEARLVGKKDTKDEGSKKEGEKLGSSGDGSLLDNRNSE 525

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV
Sbjct: 526 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 585

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLMLVPAG                                    
Sbjct: 586 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLIK 645

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
                 D+CAEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   V
Sbjct: 646 MAINYNDECAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAVTVTMYNFGSPRV 705
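
The percentages in the HSP header of this hit follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, where `percent` is a hypothetical helper and rounding to two decimals is assumed to match the report:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage (2 decimals)."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the XP_038876505.1 HSP header.
print(percent(400, 497))  # identity: 80.48
print(percent(421, 497))  # positives: 84.71
```

The denominator is the alignment length including gap columns, not the query length, which is why long gapped regions lower both values.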

BLAST of HG10006944 vs. NCBI nr
Match: XP_011654507.1 (uncharacterized protein LOC101204368 isoform X1 [Cucumis sativus] >KAE8648243.1 hypothetical protein Csa_018335 [Cucumis sativus])

HSP 1 Score: 729.6 bits (1882), Expect = 2.0e-206
Identity = 388/497 (78.07%), Positives = 411/497 (82.70%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED  S +QTNTEV ITD N P 
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 341

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 401

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 402 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 461

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 521

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 522 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 581

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLMLVPAG                                    
Sbjct: 582 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 641

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
                 DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   V
Sbjct: 642 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPRV 701

BLAST of HG10006944 vs. NCBI nr
Match: XP_031740823.1 (uncharacterized protein LOC101204368 isoform X4 [Cucumis sativus])

HSP 1 Score: 729.6 bits (1882), Expect = 2.0e-206
Identity = 388/497 (78.07%), Positives = 411/497 (82.70%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 83  GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 142

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED  S +QTNTEV ITD N P 
Sbjct: 143 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 202

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 203 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 262

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 263 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 322

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 323 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 382

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 383 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 442

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLMLVPAG                                    
Sbjct: 443 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 502

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
                 DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   V
Sbjct: 503 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPRV 562

BLAST of HG10006944 vs. NCBI nr
Match: XP_011654508.1 (uncharacterized protein LOC101204368 isoform X3 [Cucumis sativus])

HSP 1 Score: 729.2 bits (1881), Expect = 2.6e-206
Identity = 384/464 (82.76%), Positives = 399/464 (85.99%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS+H LL+K+ND ED  S +QTNTEV ITD N P 
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSNHLLLTKRNDEEDTSSNVQTNTEVSITDTNYPI 341

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE S+NTV+SGQSLKEVTQ LLA QFDKQFWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISNNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 401

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIG+EARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+T+DL
Sbjct: 402 KWDGFELLNKIGMEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTRDL 461

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+G SGDGSLLDNRNSE
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKFGEKVGGSGDGSLLDNRNSE 521

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRR+LV
Sbjct: 522 EMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRKLV 581

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLMLVPAG                                    
Sbjct: 582 VAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIISLIK 641

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR 428
                 DD AEPP+KWHVYVTGHSLGGALATLLALELSSSQLAR
Sbjct: 642 KAIYYNDDRAEPPVKWHVYVTGHSLGGALATLLALELSSSQLAR 685

BLAST of HG10006944 vs. NCBI nr
Match: XP_008460597.1 (PREDICTED: uncharacterized protein LOC103499378 isoform X1 [Cucumis melo])

HSP 1 Score: 727.2 bits (1876), Expect = 9.8e-206
Identity = 391/498 (78.51%), Positives = 409/498 (82.13%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P 
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 341

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 401

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 402 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 461

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 521

Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
           EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 522 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 581

Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
           VVAFRGTEQSRWKDL TDLMLVPAG                                   
Sbjct: 582 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 641

Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
                  DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   
Sbjct: 642 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 701

BLAST of HG10006944 vs. ExPASy TrEMBL
Match: A0A1S3CCU0 (uncharacterized protein LOC103499378 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Positives = 409/498 (82.13%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 222 GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 281

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P 
Sbjct: 282 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 341

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 342 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 401

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 402 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 461

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 462 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 521

Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
           EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 522 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 581

Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
           VVAFRGTEQSRWKDL TDLMLVPAG                                   
Sbjct: 582 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 641

Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
                  DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   
Sbjct: 642 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 701

BLAST of HG10006944 vs. ExPASy TrEMBL
Match: A0A1S3CDA2 (uncharacterized protein LOC103499378 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Positives = 409/498 (82.13%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 84  GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 143

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P 
Sbjct: 144 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 203

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 204 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 263

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 264 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 323

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 324 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 383

Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
           EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 384 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 443

Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
           VVAFRGTEQSRWKDL TDLMLVPAG                                   
Sbjct: 444 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 503

Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
                  DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   
Sbjct: 504 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 563

BLAST of HG10006944 vs. ExPASy TrEMBL
Match: A0A1S3CE13 (uncharacterized protein LOC103499378 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499378 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 4.7e-206
Identity = 391/498 (78.51%), Positives = 409/498 (82.13%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           GN H+V+VELEGMGGGGKLLLEIKYR+FDEIEDDKRWWRVPFISEFLRS GFVSALNKVV
Sbjct: 78  GNSHEVTVELEGMGGGGKLLLEIKYRSFDEIEDDKRWWRVPFISEFLRSSGFVSALNKVV 137

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTVPVRQFVEYAFGKLKSFNDEYQS H LL KQND EDI S M+TNTEV ITD N P 
Sbjct: 138 GSDTVPVRQFVEYAFGKLKSFNDEYQSDHLLLRKQNDEEDISSNMRTNTEVSITDTNSPI 197

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           EGK DEVE SDNTV+SGQSLKEVTQ LLA QFDKQFWTNLADVT+QNIVKKLGLPAPEKL
Sbjct: 198 EGKSDEVEISDNTVESGQSLKEVTQGLLAMQFDKQFWTNLADVTSQNIVKKLGLPAPEKL 257

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLATPKSLD DHEQKNIRMVDSTLTDVKK+TKDL
Sbjct: 258 KWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDL 317

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKL-GSSGDGSLLDNRNS 305
           LSQTESVLG LMVLTATISQLNKEAQLIGKKDTK EGSKK GEK+ G SGDGSLLDNRNS
Sbjct: 318 LSQTESVLGGLMVLTATISQLNKEAQLIGKKDTKDEGSKKVGEKVGGGSGDGSLLDNRNS 377

Query: 306 EEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 365
           EEMKALFATAESAMEAWAMLA SLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL
Sbjct: 378 EEMKALFATAESAMEAWAMLAMSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRL 437

Query: 366 VVAFRGTEQSRWKDLITDLMLVPAG----------------------------------- 425
           VVAFRGTEQSRWKDL TDLMLVPAG                                   
Sbjct: 438 VVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLI 497

Query: 426 -------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSL 457
                  DD AE P+KWHVYVTGHSLGGALATLLALELSSSQLAR    ++  +  G   
Sbjct: 498 KKAIYYNDDRAESPVKWHVYVTGHSLGGALATLLALELSSSQLARHEAITVTMYNFGSPR 557

BLAST of HG10006944 vs. ExPASy TrEMBL
Match: A0A6J1EY43 (uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)

HSP 1 Score: 712.6 bits (1838), Expect = 1.2e-201
Identity = 381/497 (76.66%), Positives = 406/497 (81.69%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           G+LHDVSVELEGMGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLRS GF SALNKVV
Sbjct: 78  GDLHDVSVELEGMGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVV 137

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTV V QFVEYAFGKLKSFNDEYQSS +LLSKQ D EDIPSYMQTN EV ITDI+DP 
Sbjct: 138 GSDTVSVGQFVEYAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPE 197

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           E + D+  T+DNT ++GQ LKEVTQS+LAKQFDK FWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 198 EDESDDDATNDNTKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKL 257

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLAT KSLD D EQKNI+MVDSTLTDVKKITKDL
Sbjct: 258 KWDGFELLNKIGLEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDL 317

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLG LMVLTATISQLNKE+Q IGKKDT+ EGSKK GEKLGSSGDGSLLDNRNSE
Sbjct: 318 LSQTESVLGGLMVLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSE 377

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EM+ALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF RRRLV
Sbjct: 378 EMRALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLV 437

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLML PAG                                    
Sbjct: 438 VAFRGTEQSRWKDLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIK 497

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
                 DDCAEPP+KWHVYVTGHSLGGALATLLALEL+SSQLAR    ++  +  G   V
Sbjct: 498 MAINYNDDCAEPPVKWHVYVTGHSLGGALATLLALELTSSQLARHGAINVTMYNFGSPRV 557

BLAST of HG10006944 vs. ExPASy TrEMBL
Match: A0A6J1EYQ8 (uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)

HSP 1 Score: 712.6 bits (1838), Expect = 1.2e-201
Identity = 381/497 (76.66%), Positives = 406/497 (81.69%), Query Frame = 0

Query: 6   GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV 65
           G+LHDVSVELEGMGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLRS GF SALNKVV
Sbjct: 61  GDLHDVSVELEGMGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVV 120

Query: 66  GSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSYMQTNTEVCITDINDPN 125
           GSDTV V QFVEYAFGKLKSFNDEYQSS +LLSKQ D EDIPSYMQTN EV ITDI+DP 
Sbjct: 121 GSDTVSVGQFVEYAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPE 180

Query: 126 EGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKL 185
           E + D+  T+DNT ++GQ LKEVTQS+LAKQFDK FWTNLADVTNQNIVKKLGLPAPEKL
Sbjct: 181 EDESDDDATNDNTKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKL 240

Query: 186 KWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKNIRMVDSTLTDVKKITKDL 245
           KWDGFELLNKIGLEARK+AEAGYIESGLAT KSLD D EQKNI+MVDSTLTDVKKITKDL
Sbjct: 241 KWDGFELLNKIGLEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDL 300

Query: 246 LSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKKEGEKLGSSGDGSLLDNRNSE 305
           LSQTESVLG LMVLTATISQLNKE+Q IGKKDT+ EGSKK GEKLGSSGDGSLLDNRNSE
Sbjct: 301 LSQTESVLGGLMVLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSE 360

Query: 306 EMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLV 365
           EM+ALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF RRRLV
Sbjct: 361 EMRALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLV 420

Query: 366 VAFRGTEQSRWKDLITDLMLVPAG------------------------------------ 425
           VAFRGTEQSRWKDL TDLML PAG                                    
Sbjct: 421 VAFRGTEQSRWKDLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIK 480

Query: 426 ------DDCAEPPLKWHVYVTGHSLGGALATLLALELSSSQLAR----SLFRFQIGFSLV 457
                 DDCAEPP+KWHVYVTGHSLGGALATLLALEL+SSQLAR    ++  +  G   V
Sbjct: 481 MAINYNDDCAEPPVKWHVYVTGHSLGGALATLLALELTSSQLARHGAINVTMYNFGSPRV 540
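
In each alignment block, the middle line encodes the comparison per column: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of tallying one block this way, using the first query row and its middle line copied from the alignment above; `midline_counts` is a hypothetical helper name:

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a BLAST midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# First block of the alignment (query positions 6..65), copied verbatim.
query = "GNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRSKGFVSALNKVV"
midline = "G+LHDVSVELEGMGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLRS GF SALNKVV"

identities, positives = midline_counts(midline)
print(len(query), identities, positives)
```

Summing these per-block counts over all blocks of an HSP should reproduce the totals in its Identity and Positives fields, provided the whitespace in the middle lines has been preserved exactly.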

BLAST of HG10006944 vs. TAIR 10
Match: AT4G13550.1 (triglyceride lipases;triglyceride lipases )

HSP 1 Score: 405.2 bits (1040), Expect = 7.9e-113
Identity = 246/525 (46.86%), Positives = 314/525 (59.81%), Query Frame = 0

Query: 3   VLQGNLHDVSVELEGMGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRS-------K 62
           V  GNLH V VEL+G+GGGGK+ LEIKY+ F E+E++K+WWR PF+SEFL+        K
Sbjct: 68  VCDGNLHKVLVELDGIGGGGKVQLEIKYKGFGEVEEEKKWWRFPFVSEFLQRNEIKSVLK 127

Query: 63  GFV------SALNKVVGSDTVPVRQFVEYAFGKLKSFNDEYQSSHHLLSKQNDIEDIPSY 122
            FV      S L  +V S+ VP RQFVEYAFG+LKS ND    +  LL+  N  ED    
Sbjct: 128 NFVDSEAVESVLKNLVDSEAVPARQFVEYAFGQLKSLNDAPLKNTELLN--NTAEDSEGA 187

Query: 123 MQTNTEVCITDINDPNEGKFDEVETSDNTVDSGQSLKEVTQSLLAKQFDKQFWTNLADVT 182
              ++       N  + GK  + +  D     G  L++  +S  + Q +  FW N+ D+ 
Sbjct: 188 SSEDSSDQHRSTNLSSSGKLSKDKDGDGD-GHGNELEDDNES-GSIQSESNFWDNIPDIV 247

Query: 183 NQNIVKKLGLPAPEKLKWDGFELLNKIGLEARKTAEAGYIESGLATPKSLDTDHEQKN-- 242
            QNIV+KLGLP+PEKLKW+G ELL   GL++RKTAEAGYIESGLAT  + + D E+++  
Sbjct: 248 GQNIVQKLGLPSPEKLKWNGTELLENFGLQSRKTAEAGYIESGLATADTREADDEKEDGQ 307

Query: 243 --IRMVDSTLTDVKKITKDLLSQTESVLGALMVLTATISQLNKEAQLIGKKDTKGEGSKK 302
             I    S+L D+K  T++LL Q ++V GALMVL A +  L+K++    K   K   S  
Sbjct: 308 VAINASKSSLADMKNATQELLKQADNVFGALMVLKAVVPHLSKDSVGSEKVIEKNGSSSV 367

Query: 303 EGEKLGSSGDGSL--------LDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSE 362
             +  GSS    +         D +N+EEMK LF++AESAMEAWAMLAT+LGHPSFIKSE
Sbjct: 368 TDDVSGSSKTEKISGLVNVDGADEKNAEEMKTLFSSAESAMEAWAMLATALGHPSFIKSE 427

Query: 363 FEKLCFLDNESTDTQVAIWRDFMRRRLVVAFRGTEQSRWKDLITDLMLVPAG-------- 422
           FEKLCFL+N+ TDTQVAIWRD  R+R+V+AFRGTEQ++WKDL TDLMLVPAG        
Sbjct: 428 FEKLCFLENDITDTQVAIWRDARRKRVVIAFRGTEQTKWKDLQTDLMLVPAGLNPERIGG 487

Query: 423 ----------------------------------DDCAEPPLKWHVYVTGHSLGGALATL 457
                                             DD  E   KWHVYVTGHSLGGALATL
Sbjct: 488 DFKQEVQVHSGFLSAYDSVRIRIISLLKMTIGYIDDVTEREDKWHVYVTGHSLGGALATL 547

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038876505.1  2.1e-216  80.48     uncharacterized protein LOC120068939 [Benincasa hispida]
XP_011654507.1  2.0e-206  78.07     uncharacterized protein LOC101204368 isoform X1 [Cucumis sativus] >KAE8648243.1 ...
XP_031740823.1  2.0e-206  78.07     uncharacterized protein LOC101204368 isoform X4 [Cucumis sativus]
XP_011654508.1  2.6e-206  82.76     uncharacterized protein LOC101204368 isoform X3 [Cucumis sativus]
XP_008460597.1  9.8e-206  78.51     PREDICTED: uncharacterized protein LOC103499378 isoform X1 [Cucumis melo]

Match Name      E-value   Identity  Description

Match Name      E-value   Identity  Description
A0A1S3CCU0      4.7e-206  78.51     uncharacterized protein LOC103499378 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3CDA2      4.7e-206  78.51     uncharacterized protein LOC103499378 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3CE13      4.7e-206  78.51     uncharacterized protein LOC103499378 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1EY43      1.2e-201  76.66     uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1EYQ8      1.2e-201  76.66     uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value   Identity  Description
AT4G13550.1     7.9e-113  46.86     triglyceride lipases;triglyceride lipases
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description            Source       Source Term   Source Description     Alignment
IPR029058  Alpha/Beta hydrolase fold  GENE3D       3.40.50.1820  alpha/beta hydrolase   coord: 394..435; e-value: 4.8E-8; score: 34.7
IPR029058  Alpha/Beta hydrolase fold  GENE3D       3.40.50.1820  alpha/beta hydrolase   coord: 305..393; e-value: 7.0E-7; score: 30.5
IPR029058  Alpha/Beta hydrolase fold  SUPERFAMILY  53474         alpha/beta-Hydrolases  coord: 334..426
IPR002921  Fungal lipase-like domain  PFAM         PF01764       Lipase_3               coord: 399..426; e-value: 3.0E-6; score: 27.2
None       No IPR available           MOBIDB_LITE  mobidb-lite   disorder_prediction    coord: 274..301
None       No IPR available           MOBIDB_LITE  mobidb-lite   disorder_prediction    coord: 274..291
None       No IPR available           PANTHER      PTHR47759     OS04G0509100 PROTEIN   coord: 390..427; coord: 3..389
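
Several InterPro hits above are reported as separate coordinate segments of the same signature (for example the two GENE3D 3.40.50.1820 matches). A minimal sketch of merging such segments into contiguous spans, assuming 1-based inclusive protein coordinates; `merge_ranges` is an illustrative helper, not an InterProScan function:

```python
def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(ranges):
        # With inclusive coordinates, 305..393 and 394..435 are adjacent,
        # so a start of previous_end + 1 still extends the current span.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


# GENE3D 3.40.50.1820 segments taken from the table above.
gene3d_segments = [(394, 435), (305, 393)]
gene3d_span = merge_ranges(gene3d_segments)

# Number of residues covered by the merged span(s).
gene3d_covered = sum(end - start + 1 for start, end in gene3d_span)
print(gene3d_span, gene3d_covered)
```

Applied to the two PANTHER PTHR47759 segments (3..389 and 390..427), the same helper yields a single span, which suggests the family match extends over most of the aligned protein.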

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10006944.1  HG10006944.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006629      lipid metabolic process