Csa4G338950 (gene) Cucumber (Chinese Long) v2

NameCsa4G338950
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionHepatocyte growth factor-regulated tyrosine kinase substrate, putative; contains IPR008942 (ENTH/VHS)
LocationChr4 : 13977414 .. 13990071 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGCTTTCCTCTATAAGCTTTTTTGCTGAAACGCAACAACGCCCATTCTCTCACCAAAGGACAGAGCGACACACGCATACACTTTTAGATAGAGAGAGAGAGATTCTCATTTCTTTCTCTTCTGTCGTTCAACTCCAAATCTTCATCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCTCCTCTTAATCATTCATCATTACGCCTACCCACAGAGTTTCCAGCCATTTTTCAACCCTTTTTGCCCCAACCCCATTTCTTCTTTTTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTTCACTTCTCTTCTTTTGTCTATTCTGGTAACGTTAACTCTACTTTCTTGCTTTTCACCTGTTCTTCAGTATTGAGTTTTTATTTTGAGATCTGTGTTGCTACTCTTTTCATTTTGATTGAAACTCCTCTTTTTTTCACACATGGGTTTGCACTTTACTGGACTTGAAGTTGAGTATCAGCTGTATCGTGTTTCTTTTACTTTCATTTCATGTGTTTTTGAACTTTTCTTTTTTCTTTTCCCCAGGCACTAAATTCTCTATTATTGTTGTTAATTTGTTTGGTTTTTAAGCTTGGACTTCCTTTTTCCCCCATTACAGAGTGGCTTATGGTTCCCACATCGGAAGGCTTTCTTGATTTCTATCTCTGCTTCTATTGGTGCAGTATGCCTATAAAGTCCAAGATATTGTAGAGCATTTTGGGGTTTTGTTTACCTTGACGATTTGTTACTCCCTTGACAATTCAAATATATAGGAGGTTGCTGATCTCGGTTAGTTATTCTTTTACTCTCTACTTTGTCATTTTAGTGTTTGAGATTTTTCATTTTCTGTATAAAAAGTTATGGTAATGTGTAGTTGTTATTATAAGCATTAGTAAGTGAAAATTTCATTATTTTTACTAAATACGTATTGCACAGCAGAACATAAAGATAAGTCTGCATTTAATTCTTTCTTTAAAAAACTACAAGGCCTTTTACCATTATGCTCTGTTATTATAGATTGTTCACTTTAGTGAGCGATGTTGTTTTTCCATTGAGTTTTTCTTTTGTCATCTGTAATGAATGATAATTAGCTCTGGATTATCTTTTTTCTCTCCATTACCTTGTCAGTTGAGTAGGAACATTGCCTATATTTGGAGTAAGGATGCCAAGACATGTGTAGTATAGTTTCTTCTCTTGCAAGTTTCTGTAGTTATTTACCAATGTTACAAAATATTATAGCATATCTGTGGTAGACCATCTGTGTCACAATTGACACAAATAGTAGTCTATTGCGGTTTATCACATATAGATTGTGATATTTTGTTATATTTGTAAATATTTTGGTTCATTTTACTACATTTCTTTCAATGTTGAAGGTTTCTGTAAGAAGTTATCACTTGACTGCATTACATTTTTTAGTCAATATTTTCTGACTATTGAAATATTTTAGTTGCCCTTTCACTTCTGAATCTGCCTCTTCTATTCTTACGTTTTTTTATCGACATTTTATTTTTTAAAAAAACCAAAATTAGGGGTACTATAAAGATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGTGTGTCTTGTATATTCTCAAAGTATATTTTCCCACTTTTTTTGGTTTGATAAGGAAGAACTCATTTGGATTCTGCCTACAAAATATGTGAGGATAGATTTGGTTTTAAGTATGAGTTTATGACTCCCAAGCCAGACTGCTTATATCAGATGTATGGATGACAATATCTCGTTATGCCTTTATAATATCACATTGCCACAAGTTCTAGTCTTCTTTGAGGAGATGGTCTTATTACTCTTAGTTGTAAGATTTGTGCCGCAGTTTCTTCTTCTATAAAAAATGCATTTTATGAAAATGCCATTTTCCTCAGTATTGCTTTCTTTGTTCTATTTGTGCAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGGTAAGATATTGATTCTAGTTACGCCAATTCTATTATATATTTAATATAAAAGCATGTGCCTCCTTTCATGTTTTATGTACCAAATTCATTCTTTTAGGAAAATGCATGCAATGAATATTAGGCCTTTTATCCCGCATGCTTCCAATTTTATCGGAGGTCCTTGAAGAGATGCCAGTTATCCTGCATCACAGGTGTGAAAGAGCACCAAAGAGCATGTGGATGGTGAGGTGTAATTGGCACTATTGGGAATATACCAATCTTAACATTAAGAATGGAGGAAGAGTGAGATTAGAGGAGAAAGTAAGTTTTTTCTTCTTTCTCCCCTCTCTCCCTTCTCCTACCTTCTTCTTCCCTTTGCATCATCTCATCCCCTCCAATCATCTTTCTCAGTTCTCCAGCGAGTGTCTAGCCTTTTATTTCGATCGCTCATTCAGAACTAGATATGGAAATGGAAAGTTGCAATATAGTTGACTCTCTTCACTGTATTTGGTTATGGACTATTCCAAGATGAAGACGTGGACGGCAACTAAATTCTGCCTTCATCAAATTTCGAACCGAGTTGGTTGCAAGAAGTTCTCTCAGAACTAAGCCAAAAACTAGAAAATCAATTCTTTCTCAAGAAAGAGAAGACCGATAATGGAATAACAGAAGTATCTAAATTCAGAGCCACAAAAGGTTGGATATTGAGATGCACTCTTTGGTCATGTATTGGCGGTAGGTCCTTGGTCATGTATTGGTGGTAGATCCTTCATTCAAGTTCTTTTAGGTGAGGAAAAACAGGGTTGGAAAACTTTTATTCAATTGCTAGGAGGCTTCAAATCGAAACTTGAATACTCGACATGGATTTCATCCCATTCAATGTCATCAAATATCCTTAAGAAGGACGTGAGCGCACGAAAGTCAACATGGGAAAGATATGAAGTCAAGGTGCATCCGAAAAAAAGTCACACATTCCTCTGTTTTTGACCTGGAAAAATAGAGCATGAGTTGTTACTGTATTATAAAGAATCCAAAAGTTCTCAAGACTAATTTCAACAATCTTTGGATTGTAACACGACTTTTTGAGTTTGATAAATGCTGAGCCATAGCTAGCTCACTAAAATTATACTTTGATACTGATGTCATTCTTAAAACTTTGTCTGCGGAATGTGCCTTAATCCTGTTGGATTAGGGTGAGTTGGAAGCTTTTGCTGAGTTCCCATGAACATGGCAAGAATGTGGTCAGTTTCATTTAAATAAATTGAAAAGTGGAACAAGTATCTACATGGTCGTCTGAATGTAATGAAAAGGTTCAATGGGTGGATCTCGATTAAAGATTTACCATTGGGCTGTTGGAGCTGAAAACCTTTTAAGTTACTGGGTCTTACTTGGGCGGATTAGTATCTATAGCAAATGGAACGTTGAATCTTATTAATGTTGTTGAAGCAAAAAATCAAGTTAAAAGAAACTTATGTGGTTTTATGCAATCTACCATAGAAATTTCAGATGAAAGCAAAGGTAGTATTTTTTTGAATTTTGGAGATGGCCCCCCCCTCAAAGTTTAAGGACTAAGGTGCTTTGTTTATTAAAGATTGTTCCAACCTAATTTATTTGACTTGACTAAAGCAAGTTATGATTGATGAAGATTTAGATTCTTCTGTTTTGAATCTAGAATGGACATTTCAAGCTGCCCCGTAGTCCAAAAAATTTTCATGAAATTATTTTGCATCGAAGTAGCTTCAAGGGGAGAACCACACACAAAACCCACTAGAAATTGACAGTAGAGACTCTCCGACATGAGTCTTCCTGCTGACGATGGAAGAATCAAGGAGAAGTGAACAAACTCACGCTTTTCGAATGAGACGGACTACGACAACGCTTCGAACTGAAAGCAGTCCAAGGGATGCCTTTATTCTTGGTGCGAAAAGAAAGTTTCAAAATTTGCCAGTTGTTAGCCCTTTAAAGGACGCCCATAAGTCTTAAGAAGATAAAGACTCAACAGTCATTAATGATTCCTGTTTCCCAATTAATGAACCCTTGCCCAAAGTCTCCTTTGTCCCATTAGCAACCTCTCTTGAAAATAACAACTCTAAGTCTCTCTCATGGGTAGGTGAATGAGGTTGGACTGTGCAAGCCTATTTTGTCTTCCACACCTCACCTTTCACCTTCATCCCTGTTGGATAAAGGGTTATATAAAGTTATATGGAAAACTAGTAGGAAAGTTAATATACTCACCTTTGGGTTGCCGAACTTATGCAAAGAAAACTCCCAAATAGTTGCCTCTTACCTTTGGTTTGCCCTCTGTGCATGAAAGAAGAGGAAGATTTACCACATCTGTCTTTTACATGTTCATATTCAACCAGTTGTTGGGGAACCTGTTCTCTTTATTCAGTGTTGCTTGGGTTTTTGGACATTTGTTTAGCTCAAAAGTTCAATAGGTTCTTTTGGGTCTTTTCTTAAAGAAAGAAAGGGCCGAGACTAATATGGGGAAACATGACTAAAGCCTTGCTTGTGGAATTATGGTTTGAATGAAATCAAAGGGGCTTCTTCAACAACAAAATCTATAATCATCCTTCAGACTTCTTCTAAAGGCCAAAGACCAAGAGGACGTATCTTGGTCCCAATGTGCTGCAACTGATCCAGTGGGCAAAAGAGCAATTCTAAAGAGTCTAGAATATTGCTCTTTAAAGGGAGTGTTACCTACCTAAATATCTGTCCAAAAGCCAATTCTTTGGCCGTTACCAAGATTAAAAGAGGCAAATGAATCAACCGACCTCCAAACTCTAGCAATGCTAACCCAAGGACTCCTTAGGCTATTACCAGACTTCCCTTTAGTGAACCAATCGGAGGGCTTCTTACCATGCTAATTCCTTTCAATTCTTAAGCTTCCAATTACTTTGAAAGACTTGGAGTTTCTTCCAAGATTCGGGTTTACATGAAGCCTGTCTTCTGCTGGGCCGTACCTGACCATGAAAGCAATACTGATATCTCTCTTGTCGAAGATCTCTAGTCACTTCCTTTGCTCCCTTCTGGTATGAAGATTCTCGTTGGATAATACGGCAAACCTACCTCTTGCATTTTCCAACACTTTCAAGATATTGGAAGGAGAGATAACGAGCACACACTAATTTACGTGGAAACCCGAGTACCGGGAGAAAAACCACGATTGTTTGTTGATATTATTTTCTAATGAATAATACAATAGGTACAAGGGAGAATAAATAGAGAATAACAAGAGAATAAAAAAGGAAAAGATATAGGAAATAAGGAAAATATTCCCATAATCTTTCCAAAAACATTCTAAGATTCTAACAAGGAAAATATTGAGAAAGTAAAGGAAGGATTCCAACAAAAATAATGGCAAACTCTAACCTTCTTTGCCAAAAGACCATAGAATTCTTCATGTCTGCAGCGCCCACTAAACAATCCCTAACACAAGCTGAGGTGCTGAAATTCAGCTCCAGTGTACCAAATATCCTTCCATTTCTTTCTTCTATGCAAATCTTCCTAGCCTTCCCCCACAAACCCAACCAAGAATTTCTTCCCATCGATCTTGAGCTTTCCCATCTCTTAGATTTTCTCAAATTTTGAGAGAAAAGTTATCAGAAGTTCTTCGGTTGAAGCTTCCTCTGAATTCAGTGTGTCTCTCAGTGGGTGCATTATTTTTGGGTGTTAGTTCACCTTTTATTTTGTTTCTTAATTTATAGTGGATTTTTGTTAGCTTGATTAAGGTCAATGGTCTCAGTTAGGTTGGGCAGTTTAGATGATTTTTCTCAGTTTTTTTATTCGCATGCATTGATAGATTTCTGTGAGGAGTTTCGCTTTGGTGAATTCATCGTGTTCTTCAAAATATTGCTTGCTTAATTTTTTAGTTTGAGTTCACCCACAGTTGTCTTTTCTTATTTTCTTTAGGCTGTTTCAATTTTTTCTTTAGATCTAATACTTTATCGCAAAAAGATTTGTCTATCTTATTTTTTTATTTTGTGTCTCAAGCATTAGTCTCTTTTTATCATTTCATTGAAAAATTGTTTCGTAGATTTCAATTTTACGAATATATTCATAAAATACTGTTGTCAACGGTTGTTTTTCTAAAGTTAAAAACTAATAAACCATCATGGATTGACCTAAGGGTAAAAAGGGAGATGTAGTCTCAAATAGTTCATGGTTCAATCTACGGTGATCACCTACCATAGGCTTAGGTGGGTTGCCCCTTTTAGCAAAAAGAAAAAAAAAAGGTAGAAAAGTAATAAATTATATTAAAGTTATTCTTACTTCCAAACTAAGTTAAATATTTTGTTATTATTATTCATATTCATATCATGTTTTTCTTTTTATTATTGAAAGATATTGGAGATATCTATTAATTCTTATTATCGATGTTGTGGCCTTTAATTTGTAGATATATTGATATATTGATGAATATTTCAATCTTTAACCTAGTCGGCACTTTATTTTATTCATAATTCAAGATTTCTTGGTTAGGAAAAAAGAGAAGCAAATAAGGTCAAGTACCATTATATATTAAATTTTGATGAAGTTTCTATCTACTAAATTTGTCCCCAGTACAAAGCATTCTTCACATACTAGTCTGCTGCTCTGTTGTTGGACACCACACCATCAATATCTTAATCTTTTGTGCAGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGGGTTCTCCCTATTCTTGTGAAGATAGTGAAGAAAAAGGTTTGAATTTGTTCTATTTATTTACGACAAATAGTTGACTATTAATCTTTCTTTAGAAGTATGGCGCAAACATGGTTATGAATATGGCGGCATGTCATATTATAGATAATAAGAACATGAATACCTCTATTAAAAGTGTTTTTTTTATTAAATTACAAATCTACCTAAAAGCTTGAGCCAATGGGTGACGACAAATTTAATATAATATCTAACCGTCTCCTCTATTTGTGGGTTTGAAATATGGAGAAAGTTCAATAAGTAGAAATAAATTTTAAATGGGGAGGGAATTCATTGCTGGGGTTTGATTCGTTTGAACACAGAACCTCCTTGACTACCTCCTTGATCACCAGCTCTGATTATATCTTAAATCACCCATCTACCCAAAAGCTTATGATGTGTATATGTGTGTTCTTGGAAAGGAAGATACAAGAGACTGGGTTCTTGAAAAATGACTGACTTTTTTATTAATTCTGGAAATATGACAAACTAACAGTATTTATACTAAAAACCTAATAACAGTCAAAGTCAAACCTAATTATAACAGCCAGTTAAAATATAACAAACTAATTACATCAGCTTAAGCTGATAGGTGAAGGGAATTTTAATATAATATGTAACACTTTTAAATGTATTTGATTAATTTTTTATATAACAACAAATTCACACTAGACCTTGGTTTTATTTTATAATTTATGAAGTAATATAAAAAGGGAGGTAATTTCTAACATTTCAAGAAGTTGACGAAACCAATAATATATATATCCAATGATTCAAGTACAGTAAACAAAGATTTTCAAATCCAAATAAAAAAATTGTGAATTGTTTTCCGCTCTTGAGAGTATTCGTAACTTTTAGTATTAGTTTTTTTTGAAATGGAGACAAGTCTCCCAAGAATCACAAAAGAAGATGGCTCAAGAAACGTTTCCAACCGGTTGAACAATTATTGTTTTTCAGCATTTTTGTGGTTTTGATTCTGCGAGGCTGGTTGTTGTCTGTGTTCCCTCTTCAATCATCTCGAAGCTTATTGTGGGCCAGTTGTCAGTTTTCTTTGGGTTCCTTTTGACAATCTGTTTTAATTACTTCATACCTAGTTTACATTATAGATTCCATTTGTTGTTTTGGGTTAGAGGATTTTGATTATCTCGCTGTTATTGCTGAGTTTGATTTGGTTATTTTTTATATCTGTCTTAGTTTGTTTGCTCTCTGTATAATTTTAATATTAGTCTCTTTTCATTATGTGAATTTTCTATATTATTGAATAAATTAGAGTATTTGAGAGAACATAATGACCTTTTACATAGAGGAATAAATAGACCTCAAAGAAACTACAAAAGGAAAATACATAAATGGAAAATACCAAAAAGGATAATAATAATAAGCCAAATCCTAATTTCCATTTAACACATAATATCAGTGAAGAGCTTTGTTTCCTTTTTTCTAAAAAAAGGAATAGAAAATTTTTTGTTAAACAATTCAACATCTTGAAAAAGTTGAAGTTTGAATTTGATTATGTTCTTACGATTCTATGCTTAGAAAATAAGTGAATTTTTTGCGTAAAGTTATTATCTCTCTTGCATTTTGATTTTGGAATAAGATACAAATTTGAACTTTTTCAAGATGTAAGAGAGAGGAGCGACATTTTCTTAAGAGAATCTTATGTTTTTCGTGCAACTTTTTCAACAAGATTCTATGATTCAACAAATTTCAACTTTTTCAAGATGTCCGGAAAATAAATTTCAAATCTTATGTATTTTACAATTTTTTTTTGCAATTTCCTGATTTTCTATTTTTGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGGTAGGACATTGCCCCCATATTTGCTATATATTTCTTTGATTAGAAGTTGTATATTGAAGCAAATGGATATAATCATGTTAAGAGAATTAAATCCTTGTTGTTCAACACTATTTATAGTTTGAAGGAGTGGAAGAATAAAATCATCAGTAAAGTAGATCATAATAATATGTTTTCATGGTTCTAATGAGTTGAATGGCGTTGACTTTGAAGAAATGAGTTCAAGGCATGATGGGCACCATGCATAGGATATAGTAGTATATGAATTACCTGGCAACCAAACGTAATAAGGCTAGATGGCTGCCTTGTGAAAATAGTCAAGGTGTGGACAAGCTGATTAGGACACCCACAGATATCAAAAAGATAAAAGAAAAGTTTGATGTACATTGTTAGTTTCATAAAAAATTTATCATTGATATTGGTTGATGCTTGCTAAATAATAATAGTTCTTATAGGTTCAGAATGAAAGATATTTTGATTGTGTCTAATCTTCAACTTTGTAGCGTTGGGATTATATTTTTTGGGTTCAGTGAAAATGAAGAAAGAGAGGGGAAAGTATTAGATTTGATTCCCACACATTATTGTTTATGTGATTCCTTTTGTTATGTATATTCCCATTCTTCGTTTCTTGTCAGGAAAAGAAAAGAGAGAAGAAACATTGCTGCTTAGAAATTCCCCGAAGAGTTCCATTGAGAAAACCTTGGTACTTCATAAAAGAAAAGATTAGGAACTCCGGACATTGTTTTCTGTTTGTTGTTGTTTACTTGGTTTTGGTTTTTGGATTTTGGATTTTGGAGCACTAATCTCTCTTCATTTCCCTAATGAAACGTTTGGTCATTTTCGAAGAAAGGAACTTGAACATATGTTCCATCTTTTATGATTACATTCAGTAACTGTAGAATCTTTGAAGTCGGAATACTCCAAATGAATTTTTCATTTGTTTCTCAGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGGTATATCTTGTTGAGCTTATTCTTCTGCACTTTTCCGTCTTGTTATCCACTTGTTTTTCCCTTTTTATTTTTGACATACAGAACTCTAAGTTGTTAAAAATCCTAGAAATTTGTCATGAACTATTTATGAAACGAAAAACACATTACTCCCATTATAAGTGTTGATGTTTGTGCAGCTAATGGTTTGCTAAAAGGAACTATTTATGGAATTCTAGTATTATTGATTTATTCCATCATATTTGATTTTTTTTGGTTTTATTTGTTATTATTTTCTAGGATTCTTTCCTTTTATGTTTATATTTTTCTTATTTCATTGAGGCTGTATTCTATATCTAATTTAGGATTTTGTTAATAAGAATAAGAAAGTGAGAAATATTCACATGGTATCAAAGCAACAAACAGAAAAACCCTAACCTTAATTATTGTGCCGCCACTGACCTCTTCAACTCTTGCCGCTGCCGTCATTGATCTACGCTAGTGCTTGCCGTTGCTAATTTCGGATCTGGCCGCCGAAGCCTTTTATTTCTCAGATCTGGTCGTTAAAAGTTTTTTCAAGATCTGGTAAAAAATCTGGCGCTCAACCTTTTTTTTTCTGATCTGGTAGTTGCTGGTTTCCAATTTTTTTTTGAAATGGAGACAAGCCTCTTTAGTAATATTAATAATAATAAGAGAGACTAACGCTCAAAGTACAAGAGAGTTATATAGAGAACAAAAGGGCTAAAACAGATACAAACAATAGCTAAATCAAACTCAACAATAATAACGAGCTAAACAAAACCCTCTATCTGGAAGCACAATTGAAAACAGAAAAGTAAACTAAGTACAAAGTAATTTTTTTTGAAAAGGAGACAAGCTTCTTTATTATTAATAAATTCAAAGTACAAGAGAGCTATACAATGAGAATAATAGGAAAGCCAAGAAATGGGGAGAGAGAGAGAGAGAGAGGATCAGTAGGTGCACTCGGACATCTCAATTAGGTTGACACTCCTATAGCACCCTCATCATATCCAAAATACAAAGAACAAGAACAATACTAAGGTCATGAAAAGACCAAAGTAACAATCAAAACAATGATAGAAACTACCTACGGTAGACAAAAACAAGGCTGAAAATAAAAACATAACGGCAGAAATACACGTTAAAGCCCATCCAAACTACAGAAGCACTAATTCTGAACTGGCAATCGGGGGAAACTACATTGAATCTCATTAGATGAAGGCGGTTCGACTGAGGCAAATGTCCTGTATGGAGTGAACTTTGAATTCTGCATTTGAAGAACACCAAGCTGCTGCGTTTCTTTTAGATATATCTACGATGTCAACCTAATTCCTTTTTTTTTTTATCAAGGAAGATGTGTTGGTTACATTCGAACCAAATCTCAACCAGAAGCGCTTTTGTCAAATTAACCCAAATTAGAAAATGTTTTTTTTAATAAATAAGGCCCCCTCAATAATTGGAGCACATTGGCGCTAAACGAACCATCAAAGACCCAAACTGATTTGAGTATATAAAATATGCTAAACCAACAAGTAGATGAGAAGGGACAGAAAATGAATAAGTGTATTAAGAGTTCACTGTTTTCCAAGCAAAGGAGGCACACTGAAGGCAGCAAACAGCAAACAGCTTCCTTTGCAACTTCTAGGAACAGTTTAGAGGACCAACTGCCATAATCCAAACCAAGACATTTATTCTCCTTGGACTGCTGGTTTTCCAAATTGCTTAAAGTAATTTTTTTCCAAAGGTGAAGAGGAAGAAAGGGGCTTTGAAAGGGACTTAACTGAAAAATGGCCTAAGGATTCTAATGACCAAACTCTCCTATCATCCAAGTCCGTTAGATCTTTCTGAGATATGAGTGTTAAAAGACCTTGGAAATCTTGAATTTCTTCATCTTTGAAATACTAATTGTAATCTAGAATTCAGAGAAGTATATTTCTTATTCATCTTGAGTCAAAGGAATGTTACATATTTATATAGAGAATAAACTAAACCCTAGAGACTATGTACAATTACAATAAAGGGCATATGATATAAATATAAATATATATATATATCATAACACCCCGCCTCAAGCTGGAGCAAATATGTCGATCATGCCCAGCTTGTTGCACAGATAGCTTATCCTTGCTCCATTTAAAGCTTTAGTTAGAATATAGTTCTACTAGTTGTATATGTTTATGATATTGTTATTACTGGAACTATTTCTTTTAGGAGTTTGATTTCTTGCTTGCTAGCAGAAAGATGTTCGTTGGATGATTGGAGAGTTCTTCCATCCCTTTCAAAGAGAAAGGGCACCTTCTTTGCTTTGCAAGGGTTTGTGCTTTATTGTGGGATTTGTGGGGCGGAAGAAATAGTTGAAGTGTTTTAGGGTATGGAAAGGGACCTTAATGGGGTTTGGTCTTTGGTGAAGTTTCATGTTTCTTTGTGGGCTTTGGTTTCAAAGGCTTTTTGTAACTACACTCATGAGGCTGTTTTTTACCTAATTGGAAACTCTTTGTTTAGTTGGGTTTTTATGGGCTTGTATTTTTGTATGTCGTATTTTCTTTTCTTTCCAATGAAAGCAATTGGTTCTCTTAAAAATAGAAAAAGAAAAAAATGAAAAAAACCTTCGAATAAATCCACTTATTGCTCATCACATTTGCCCTTTTTCATGTTATTGTGCTTTCGTAGCCTCTTTTCTCACAAGTTAAATTGTTCTCATTTTAATGTTTTTTATGTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGTATATTCTCTTGCCAGTTTTGACTCTCTCCCTCTCTCTTTATATATACCTATATTTCTCTCTTTATAA

mRNA sequence

ATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGGGTTCTCCCTATTCTTGTGAAGATAGTGAAGAAAAAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGTATATTCTCTTGCCAGTTTTGACTCTCTCCCTCTCTCTTTATATATACCTATATTTCTCTCTTTATAA

Coding sequence (CDS)

ATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGGGTTCTCCCTATTCTTGTGAAGATAGTGAAGAAAAAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGTATATTCTCTTGCCAGTTTTGACTCTCTCCCTCTCTCTTTATATATACCTATATTTCTCTCTTTATAA

Protein sequence

MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL*
BLAST of Csa4G338950 vs. Swiss-Prot
Match: HGS_RAT (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Rattus norvegicus GN=Hgs PE=1 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.2e-14
Identity = 42/135 (31.11%), Postives = 74/135 (54.81%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQYY 124
           ++ N G+ +H +V +   +  L +++K++ ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKELLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

Query: 125 SAYYDLVSAGVQFPQ 140
             Y  +   G  FP+
Sbjct: 131 DTYQIMKVEGHVFPE 144

BLAST of Csa4G338950 vs. Swiss-Prot
Match: HGS_MOUSE (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Mus musculus GN=Hgs PE=1 SV=2)

HSP 1 Score: 81.6 bits (200), Expect = 1.2e-14
Identity = 42/135 (31.11%), Postives = 74/135 (54.81%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQYY 124
           ++ N G+ +H +V +   +  L +++K++ ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKELLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

Query: 125 SAYYDLVSAGVQFPQ 140
             Y  +   G  FP+
Sbjct: 131 DTYQIMKVEGHVFPE 144

BLAST of Csa4G338950 vs. Swiss-Prot
Match: HGS_HUMAN (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Homo sapiens GN=HGS PE=1 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 2.0e-14
Identity = 42/135 (31.11%), Postives = 73/135 (54.07%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQYY 124
           ++ N G+ +H +V +   +  L  ++K++ ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKDLLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

Query: 125 SAYYDLVSAGVQFPQ 140
             Y  +   G  FP+
Sbjct: 131 DTYQIMKVEGHVFPE 144

BLAST of Csa4G338950 vs. Swiss-Prot
Match: HGS_BOVIN (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Bos taurus GN=HGS PE=2 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 2.0e-14
Identity = 42/135 (31.11%), Postives = 73/135 (54.07%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVSSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQYY 124
           ++ N G+ +H +V +   +  L  ++K++ ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKDLLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

Query: 125 SAYYDLVSAGVQFPQ 140
             Y  +   G  FP+
Sbjct: 131 DTYQIMKVEGHVFPE 144

BLAST of Csa4G338950 vs. Swiss-Prot
Match: HSE1_NEUCR (Class E vacuolar protein-sorting machinery protein hse1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=hse1 PE=3 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 4.7e-11
Identity = 47/189 (24.87%), Postives = 80/189 (42.33%), Query Frame = 1

Query: 4   ELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLE 63
           E +N AT E L   DW   +++C+ VA D   AKE + ++ KRL ++NAN QLY + +  
Sbjct: 12  EAINKATDENLTSEDWGAIMEVCDRVATDANGAKEAVNSMIKRLAHRNANVQLYTLEVAN 71

Query: 64  MLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFPQY 123
            L  N G+ +H+++        L+K+   ++     +   L      + +  +       
Sbjct: 72  ALSQNCGKNMHRELSSRAFTDALLKLANDRNTHTQVKAKILERMKEWSDMFKSDSDLGIM 131

Query: 124 YSAYYDLVSAGVQF-PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSES 183
           Y AYY L  +     P   P  +  +   +Q         ++LS QE   +  P   S +
Sbjct: 132 YDAYYRLKQSNPTLQPPSAPQKNVLTDADRQKEEEELQMALQLSLQEEERKKRPAGASGA 191

Query: 184 SIIEKAGNA 192
           +    +G A
Sbjct: 192 TASSSSGGA 200

BLAST of Csa4G338950 vs. TrEMBL
Match: A0A0A0L323_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G338950 PE=4 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 3.0e-121
Identity = 230/230 (100.00%), Postives = 230/230 (100.00%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL 231
           ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL 230

BLAST of Csa4G338950 vs. TrEMBL
Match: W9RQ74_9ROSA (TOM1-like protein 2 OS=Morus notabilis GN=L484_013104 PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 6.5e-84
Identity = 165/209 (78.95%), Postives = 181/209 (86.60%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATS+KL E DW KNI+ICELVA DQRQAK+VIKAIKKRLG+K+ N QLYAVL
Sbjct: 1   MAAELVNCATSDKLPEMDWTKNIEICELVARDQRQAKDVIKAIKKRLGSKHTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVID+G++PILVKIVKKKSDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGIIPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYY+AYY+LVSAGVQFPQRPPAVSS+ PT Q  NN   NG +  S  E  AR  EPQ +
Sbjct: 121 PQYYNAYYELVSAGVQFPQRPPAVSSDHPTPQPNNNNLPNGELASSRHEGFARQAEPQDV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            ESSII+KAGN LEVLKEVLDAVD +HPE
Sbjct: 181 PESSIIQKAGNVLEVLKEVLDAVDSQHPE 209

BLAST of Csa4G338950 vs. TrEMBL
Match: A0A0B0N1D0_GOSAR (Target of Myb 1 OS=Gossypium arboreum GN=F383_15811 PE=4 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 3.2e-83
Identity = 163/209 (77.99%), Postives = 183/209 (87.56%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATS+KLAE DW KNI+ICELVA DQRQAK+V+KAIKKRLG+KN N QLYAVL
Sbjct: 1   MAAELVNSATSDKLAEMDWAKNIEICELVARDQRQAKDVVKAIKKRLGSKNPNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +HK VID+G+LPILVKIVKKKSDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENVHKLVIDTGILPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYYSAYYDLVSAGV+FPQRP A  SN PT Q I + + NG +  + QE VA+  EPQI+
Sbjct: 121 PQYYSAYYDLVSAGVEFPQRPHAAPSNPPTSQPIKSNTLNGELASARQEAVAKEAEPQIV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            ESSII+KA NALEVL+EVLDAVD ++PE
Sbjct: 181 PESSIIQKASNALEVLREVLDAVDAQNPE 209

BLAST of Csa4G338950 vs. TrEMBL
Match: A0A061DN59_THECC (ENTH/VHS/GAT family protein isoform 1 OS=Theobroma cacao GN=TCM_003342 PE=4 SV=1)

HSP 1 Score: 314.7 bits (805), Expect = 9.4e-83
Identity = 161/208 (77.40%), Postives = 180/208 (86.54%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKL E DW KNI+ICELVA DQRQAK+V+KAIKKRLG+KN N QLY+VL
Sbjct: 1   MAAELVNSATSEKLTEMDWTKNIEICELVARDQRQAKDVVKAIKKRLGSKNPNTQLYSVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +HKQVIDSG+LPILVKIVKKKSDLP+RERIFLLLDATQT+LGG+SGKF
Sbjct: 61  LLEMLMNNIGENVHKQVIDSGILPILVKIVKKKSDLPIRERIFLLLDATQTSLGGSSGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVA-RVEPQIL 180
           PQYYSAYYDLVSAGVQFPQRP A  SN PT     N + NG +  +  E +A + EPQI+
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPHATPSNPPTSLPNKNNTLNGELAAARHEAIAQQTEPQIV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHP 208
            ESSII+KA NALEVLKEVLDAVDP++P
Sbjct: 181 PESSIIQKASNALEVLKEVLDAVDPQNP 208

BLAST of Csa4G338950 vs. TrEMBL
Match: A0A0D2SFE0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G068300 PE=4 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 1.6e-82
Identity = 162/209 (77.51%), Postives = 182/209 (87.08%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATSEKLAE DW KNI+ICELVA DQRQAK+V+KAIKKRLG+KN N QLY+VL
Sbjct: 1   MAAELVNFATSEKLAEMDWAKNIEICELVARDQRQAKDVVKAIKKRLGSKNPNTQLYSVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +HK VID+G+LPILVKIVKKKSDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENVHKLVIDTGILPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYYSAYYDLVSAGV+FPQRP A  SN PT Q I + + NG +  + QE VA+  EPQI+
Sbjct: 121 PQYYSAYYDLVSAGVEFPQRPHATPSNPPTSQPIKSNTLNGELASARQEAVAKEAEPQIV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            ESSII+KA NALEVL+EVLDAVD ++PE
Sbjct: 181 PESSIIQKASNALEVLREVLDAVDAQNPE 209

BLAST of Csa4G338950 vs. TAIR10
Match: AT5G63640.1 (AT5G63640.1 ENTH/VHS/GAT family protein)

HSP 1 Score: 272.7 bits (696), Expect = 2.1e-73
Identity = 147/211 (69.67%), Postives = 168/211 (79.62%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLG+KN N QLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVID+GVLP LVKIVKKKSDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV---EPQ 180
           PQYY+AYY+LV+AGV+F QRP A +    T Q +   + N  +  +  E  A     E Q
Sbjct: 121 PQYYTAYYELVNAGVKFTQRPNA-TPVVVTAQAVPRNTLNEQLASARNEGPATTQQRESQ 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            +S SSI++KA  ALE+LKEVLDAVD ++PE
Sbjct: 181 SVSPSSILQKASTALEILKEVLDAVDSQNPE 210

BLAST of Csa4G338950 vs. TAIR10
Match: AT1G21380.1 (AT1G21380.1 Target of Myb protein 1)

HSP 1 Score: 161.4 bits (407), Expect = 6.7e-40
Identity = 88/216 (40.74%), Postives = 128/216 (59.26%), Query Frame = 1

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE++++ ++D  +LP +VKIVKKK DL VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENVA 181
           QYY+AY +L SAG++FP R         PP      P   Q   + ++  I+ S Q +  
Sbjct: 125 QYYNAYNELRSAGIEFPPRTESSVPFFTPP---QTQPIVAQATASDEDAAIQASLQSD-- 184

Query: 182 RVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPE 209
             +   LS    I+ A  +++VL ++L A+DP HPE
Sbjct: 185 --DASALSMEE-IQSAQGSVDVLTDMLGALDPSHPE 212

BLAST of Csa4G338950 vs. TAIR10
Match: AT1G76970.1 (AT1G76970.1 Target of Myb protein 1)

HSP 1 Score: 154.5 bits (389), Expect = 8.2e-38
Identity = 90/210 (42.86%), Postives = 128/210 (60.95%), Query Frame = 1

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE +++ +ID G+L  +VKIVKKK +L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVSAGVQFPQR-PPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 181
           QYY+AY DL SAG++FP R   ++S  +P Q Q     ++  I+ S Q + A       S
Sbjct: 125 QYYNAYNDLRSAGIEFPPRTESSLSFFTPPQTQ---PDEDAAIQASLQGDDA-------S 184

Query: 182 ESSI--IEKAGNALEVLKEVLDAVDPRHPE 209
             S+  I+ A  +++VL ++L A DP +PE
Sbjct: 185 SLSLEEIQSAEGSVDVLMDMLGAHDPGNPE 204

BLAST of Csa4G338950 vs. TAIR10
Match: AT4G32760.2 (AT4G32760.2 ENTH/VHS/GAT family protein)

HSP 1 Score: 147.5 bits (371), Expect = 1.0e-35
Identity = 83/217 (38.25%), Postives = 120/217 (55.30%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+G++N  AQL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H  V + GV+  +V+IVKKK D  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENV 180
           PQYY+ Y +L+ AG  FPQR         PP     +     + N      +     E  
Sbjct: 121 PQYYAGYQELLRAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDV----PEPS 180

Query: 181 ARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPE 209
           A  E   LS S  I+ A   ++VL E+L A++P + E
Sbjct: 181 AEPEFPTLSLSE-IQNAKGIMDVLAEMLSALEPGNKE 212

BLAST of Csa4G338950 vs. TAIR10
Match: AT3G08790.1 (AT3G08790.1 ENTH/VHS/GAT family protein)

HSP 1 Score: 145.2 bits (365), Expect = 5.0e-35
Identity = 77/211 (36.49%), Postives = 123/211 (58.29%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL ++ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH QV +  +L  +VK+ K+K ++ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVSAGVQFPQRP---PAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQ 180
           PQYY+AY +L+ AG+ FPQRP   P+   N P+ +   N S+N      +    +     
Sbjct: 121 PQYYAAYQELLRAGIVFPQRPQITPSSGQNGPSTRYPQN-SRNARQEAIDTSTESEFPTL 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            L+E   I+ A   ++VL E+++A+D  + E
Sbjct: 181 SLTE---IQNARGIMDVLAEMMNAIDGNNKE 207

BLAST of Csa4G338950 vs. NCBI nr
Match: gi|700199333|gb|KGN54491.1| (hypothetical protein Csa_4G338950 [Cucumis sativus])

HSP 1 Score: 442.6 bits (1137), Expect = 4.3e-121
Identity = 230/230 (100.00%), Postives = 230/230 (100.00%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL 231
           ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEVYSLASFDSLPLSLYIPIFLSL 230

BLAST of Csa4G338950 vs. NCBI nr
Match: gi|449449813|ref|XP_004142659.1| (PREDICTED: target of Myb protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 401.7 bits (1031), Expect = 8.4e-109
Identity = 208/208 (100.00%), Postives = 208/208 (100.00%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPE 209
           ESSIIEKAGNALEVLKEVLDAVDPRHPE
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPE 208

BLAST of Csa4G338950 vs. NCBI nr
Match: gi|659069348|ref|XP_008449359.1| (PREDICTED: LOW QUALITY PROTEIN: TOM1-like protein 1 [Cucumis melo])

HSP 1 Score: 390.6 bits (1002), Expect = 1.9e-105
Identity = 201/208 (96.63%), Postives = 204/208 (98.08%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL 
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPE 209
           ESSIIEKAGNALEVLKEVLDAVDP+HPE
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPQHPE 208

BLAST of Csa4G338950 vs. NCBI nr
Match: gi|778694136|ref|XP_011653749.1| (PREDICTED: target of Myb protein 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 360.5 bits (924), Expect = 2.1e-96
Identity = 191/208 (91.83%), Postives = 191/208 (91.83%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPE 209
           ESSIIEKAGNALEVLKEVLDAVDPRHPE
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPE 191

BLAST of Csa4G338950 vs. NCBI nr
Match: gi|703098599|ref|XP_010096423.1| (TOM1-like protein 2 [Morus notabilis])

HSP 1 Score: 318.5 bits (815), Expect = 9.3e-84
Identity = 165/209 (78.95%), Postives = 181/209 (86.60%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATS+KL E DW KNI+ICELVA DQRQAK+VIKAIKKRLG+K+ N QLYAVL
Sbjct: 1   MAAELVNCATSDKLPEMDWTKNIEICELVARDQRQAKDVIKAIKKRLGSKHTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVID+G++PILVKIVKKKSDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGIIPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYY+AYY+LVSAGVQFPQRPPAVSS+ PT Q  NN   NG +  S  E  AR  EPQ +
Sbjct: 121 PQYYNAYYELVSAGVQFPQRPPAVSSDHPTPQPNNNNLPNGELASSRHEGFARQAEPQDV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPE 209
            ESSII+KAGN LEVLKEVLDAVD +HPE
Sbjct: 181 PESSIIQKAGNVLEVLKEVLDAVDSQHPE 209

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HGS_RAT1.2e-1431.11Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Rattus norvegicu... [more]
HGS_MOUSE1.2e-1431.11Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Mus musculus GN=... [more]
HGS_HUMAN2.0e-1431.11Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Homo sapiens GN=... [more]
HGS_BOVIN2.0e-1431.11Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Bos taurus GN=HG... [more]
HSE1_NEUCR4.7e-1124.87Class E vacuolar protein-sorting machinery protein hse1 OS=Neurospora crassa (st... [more]
Match NameE-valueIdentityDescription
A0A0A0L323_CUCSA3.0e-121100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_4G338950 PE=4 SV=1[more]
W9RQ74_9ROSA6.5e-8478.95TOM1-like protein 2 OS=Morus notabilis GN=L484_013104 PE=4 SV=1[more]
A0A0B0N1D0_GOSAR3.2e-8377.99Target of Myb 1 OS=Gossypium arboreum GN=F383_15811 PE=4 SV=1[more]
A0A061DN59_THECC9.4e-8377.40ENTH/VHS/GAT family protein isoform 1 OS=Theobroma cacao GN=TCM_003342 PE=4 SV=1[more]
A0A0D2SFE0_GOSRA1.6e-8277.51Uncharacterized protein OS=Gossypium raimondii GN=B456_007G068300 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G63640.12.1e-7369.67 ENTH/VHS/GAT family protein[more]
AT1G21380.16.7e-4040.74 Target of Myb protein 1[more]
AT1G76970.18.2e-3842.86 Target of Myb protein 1[more]
AT4G32760.21.0e-3538.25 ENTH/VHS/GAT family protein[more]
AT3G08790.15.0e-3536.49 ENTH/VHS/GAT family protein[more]
Match NameE-valueIdentityDescription
gi|700199333|gb|KGN54491.1|4.3e-121100.00hypothetical protein Csa_4G338950 [Cucumis sativus][more]
gi|449449813|ref|XP_004142659.1|8.4e-109100.00PREDICTED: target of Myb protein 1 isoform X1 [Cucumis sativus][more]
gi|659069348|ref|XP_008449359.1|1.9e-10596.63PREDICTED: LOW QUALITY PROTEIN: TOM1-like protein 1 [Cucumis melo][more]
gi|778694136|ref|XP_011653749.1|2.1e-9691.83PREDICTED: target of Myb protein 1 isoform X2 [Cucumis sativus][more]
gi|703098599|ref|XP_010096423.1|9.3e-8478.95TOM1-like protein 2 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002014VHS_dom
IPR008942ENTH_VHS
Vocabulary: Biological Process
TermDefinition
GO:0006886intracellular protein transport
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006886 intracellular protein transport
biological_process GO:0016310 phosphorylation
cellular_component GO:0005622 intracellular
cellular_component GO:0005886 plasma membrane
molecular_function GO:0016301 kinase activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU123183cucumber EST collection version 3.0transcribed_cluster
CU146189cucumber EST collection version 3.0transcribed_cluster
CU163789cucumber EST collection version 3.0transcribed_cluster
CU169367cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa4G338950.1Csa4G338950.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU163789CU163789transcribed_cluster
CU146189CU146189transcribed_cluster
CU123183CU123183transcribed_cluster
CU169367CU169367transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002014VHS domainPFAMPF00790VHScoord: 4..113
score: 8.2
IPR002014VHS domainSMARTSM00288VHS_2coord: 2..134
score: 7.4
IPR002014VHS domainPROFILEPS50179VHScoord: 9..138
score: 32
IPR008942ENTH/VHSGENE3DG3DSA:1.25.40.90coord: 4..139
score: 4.4
IPR008942ENTH/VHSunknownSSF48464ENTH/VHS domaincoord: 4..139
score: 2.62
NoneNo IPR availablePANTHERPTHR13856VHS DOMAIN CONTAINING PROTEIN FAMILYcoord: 1..208
score: 1.8E
NoneNo IPR availablePANTHERPTHR13856:SF81ENTH/VHS/GAT FAMILY PROTEINcoord: 1..208
score: 1.8E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Csa4G338950CSPI04G17030Wild cucumber (PI 183967)cpicuB181
The following gene(s) are paralogous to this gene:

None