HG10011195.1 (mRNA) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10011195.1
Type: mRNA
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Nicastrin
Location: Chr01: 3383127 .. 3402925 (+)
Sequence length: 1872
RNA-Seq Expression: HG10011195.1
Synteny: HG10011195.1
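
The location and sequence length fields can be related with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state its convention) and that the 1872 nt length refers to the spliced transcript rather than the genomic span. The helper name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Overview fields.
genomic_span = span_length(3383127, 3402925)  # bases covered on Chr01
transcript_length = 1872                      # reported sequence length

# Under the stated assumptions, the difference is the total intron length.
intron_total = genomic_span - transcript_length

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(transcript_length, 3)

print(genomic_span, intron_total, codons, remainder)
```

If the database uses a different coordinate convention (for example 0-based, half-open), the span differs by one base.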
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTCAACAGATGAACACTCAATGGAGTCGGTTCCTGATCTTCAAAATTCGATGTACCTAGCGGTTGATGGTTATCCATGCATTCGACTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAGTAGGTTCTTTTGCACTAGATGTTGAAAATCAATTACTTTTTCTAGTGCCAATGTAATCCTCAGTGGAATATGGTGGTATCATCCATGTCTTTTTACTTTGTACCTCAGATCCTGGACGAGAAAAGGTTGTAGTTCCAATGATTAACTTCAAAGATGCTAATGAGATATTGCAGCCATCTGCAATTTTAGTTTCAATGGATGAAATCTCGAGTTTCTTTACTAGGTGGGACTTACATTTTTATATGTACTCCCTTGATTCATACATATTCTCGTTTCCATATAAGGATTATTTTCTAGGCTTGGGTCTAAAACACTTTCCAAACCTCCCGATGAAAAACTAAAGATAAGTGAAGATACAACCTGCTTTTGTAAAGGAAACAAAATTTTTCATTAATGAAGAAAAGAGTACAAGGGAAAATGAAAAATAGACTACAATGGGTAGACTGAGAAAAGAAAGAAAAAGCATATGGATGATAAAGACAAATAAAGCAAATAAAGTTTAAAACTACAAGAAAGCACCCCAATTATAACAAATATATTGGGAAGAGTAACCCTTAAAAGCTTTTGAATGGGAGCATCAATGAGAAGTTTTAATCTTGATAAGCTCATAACCATCTCCATGAATAAAATTTATCCTCGAAGATTCGTTGATTTCGTTCCAGCCAAATGTCTAAAAGAATGGCTTTTGAACATTGATCTATATAAGCTTAGGTGTGATTGGAACAAGTGGAAAACAAAGAATTTGAAATATATTGCCATGAACATCTTCACCAAAAAGCCTATATAGAAATAGAGAACAAAGAAAACCAGCAAATCTCGAGAAAGAACACAGATATGAACTTTATCTTTCAAAGCTGCAAAACACATAGCAAAAGGAAGGAGATAAAGCCAGAAATGGAATCTTCTTTTGGAATTGGAAAAAAAAAAGGTTTGGAGTGGAATAATAACTAAGAAAATCGAAATCTCCTTTGGATCTCTTGTACTAAAATCTATCACCACTTGCGATGACATTCAAATAAAAAATCCTCAATTGCACTTTTCCTTGCATATTTTGTGAACTGTTTCTCCATTGGGATTATATAAATGATCTCTCCTCATTAGAATTGGTAGAGAACTATATACTCAGGTAGAATTCATTTACCTATTTCAGACTACAGGAAGATTTGCATTTTGCAAATAATGTTGGCGGTGTTTTAATCGAACCAGGAACTGAAATACAAAATAGAATGGAAGGTAAAATTCACTTACTTATTTAGGTTCTTTCCCCCCTTCATTTTCTTCTTTCCCGTTCTCTCTCTCTCTGTGTTTTTTTCTTTTTTTATTTTTTTTTTGGTTGTAATTGTGTGGTAGTTCAGGCAGGCACATCAATTACTGACGTGGTCATTTATTTTTGCTTGCAGGATTTTCTCCTGCCCAAAAGTTTCCCCAAGCTAAATTTGCTCCCTATAAAAAAATTGACTATGAATGGAACCCAATTGTATGTTTGAATATCCTTACAGTATTCTCAGATTTTTTAGGAGTGGTCAACTTTTAGAAGCTTTTTACTGCAAAACTCTTATTCTGTTCTCATTTATAAAGCTAGGCATCAATCTACGAATCCTTGTTTGATATCATATCAGCCATATGTTAACTATGACTTATTCTCAGCTTTCCCCTCACTTTATTAAAGGATTCCTCTGTTTAAGATCATTGCACCATCAATCAGGCATGTCAGTTATGGATTTTTGAGGAAACCTAAAAGAGTGGAATTTGCAGAATAGGTTGATCTGAACCTCACTCTTTCTACTGTCTGGCTTAATACCAGCAAAGATAAATGGCTTTCACTTTGGTGGCTCGTGAAATTTGGGTTTTCACCATAAAATCATTAATCAATGT
TCTTTCAACTATACCTCGGGGGATCCTCAGACTTATAGAAGCTGATTTTGGAAGGGGCTCTTTTAGAAGAAGAATAAATTCTTTCATGGGATCAACCTAAATTGAATCAATACAATGGAAACTTTTTTATAAAAAGCACTCTTGAAGCCAACTTGTGCATAGGTTTTTGGGTTGGCAGCAGGCAAGTTAGTATGGTTTTGAGCTTCTTTTTTAGTTTCCTCTTGCTTTCAAAGCTCTATGGCATTCAACTCTCAAGTCTTGGACTTTGCTAGAAGATTTGTAGATTGAAAATTTTTGTTTATAGGGAGTTGTATAAAGGATTAGTCTCCTTTTATCTTTCATCTTGTTCTTGTTGCTCCATTTTCTTTGATGTATTTTCTCTTTTCTTTGTTTGTTTCATTGTACTTTGAGCATTAGACTCATTTCATTAATTCAATGAAAAAATCTTGTTTCCGTTTAAAAAAAAAGCCAACTTGTGCATAGCTCAACTGGTTTAGGGCCTTAGGCATATACTCTCGACCAAAAGGTCAAAAGTTTGAATTAATACCCCTGGATTAGTCTTTCCCCATATTGGTTAAAGACACTTTGATTATGAGACTCAAGACCATCTCTTCATACATGACCGTTTTTCAAAAGACCACCTCTAATTGGTTTTTCAAGCCGCCTTTGACCTAAATGACAGTTGCCCACTCTTTTCAGACCATCCCTTCTAAGAGAAGAAATAGGTGCGTTGGGTCAATATCAGCATGGTTTTCTTTGCATTATTTGGAATGAAAGGAACAACAAATTTTTTTCATGACAAATATAGAACTACTGAAGCTTTGATTGACATTGACAACATTCTTTTTCTAGCTTTATATTGGTGTAAACATGTCTCTCTTTTAACAATTATGGTTTGAGCACTCTAATATTCAATTGGTGAAGTGTTTTGTAACTTCATTTGGTTGCGATGAATGATTCCTTCTAATATAAATTGGGTCTCAATCCATTCTACTTTTGAGTATCTATCTAAGTTTTCTTTGTAATAGGTAGAAATAGAAACATGGCTTTGCTGGCAAGGGGGTTGGGGATTTATCTTCGAGGAGGAAGCTTTATGGCGCAAAGTGGTAGGGAGTATTCATGGAAGTGGTTCACACAATTGGCATACTTTGGGTAATTATGGAAGAAGTCTTCGAAGTCCATGGATCAACATTTCTTGAGCTTGGCTACAAGTAGAATATCTCGCTTCCTTTATACTTGACAATGGAAGAAGGATTTCATTTTGACGGATACTTGGCCGAGAACTTCTTCTTTGAGTTTTCTCTTCCCAAGTTTACTAAAAATTGCTTCTAATCCCAAAGATTCTGTCCTTGATCATTGGGATCCAGTTACTCAATCATGCACAATCACATCTAGAAGATTTCTGAAGGAAGATGAAGTGTTGGAGTTTCAGCGGTTATTGGCATCTTTAGAGGGCCAAAAGGTGGTTGATAGAAATGATTCTAGAAGCTGGAAGTTGGATTCGTCAGGTTCACTTTCAGTTAATTCTCTTTTCAAGCATTTGGTGGCATCTCCTCCCATGACTAGATGCCATGACTAACAAGTTGTATAAAGCTTTGTGGAAATCAAACTGTCCAAGGAGAGTTAACGTTATTATTCGGGTTATGATTTTGGGGCGCTGAATTGCTCAAATACTCTTCAAAGGAGGCTTCCAAGTCACTCACTCTTGCCCTCAATTTTTCCTCTTTGCTTGTTGGAATAATGAAGATTCTCTGCATCTATTCTTGGGTTGTTGTTACTCCATAGAATGTTGGAGGAAGCTTATCTCTTTGTTTAATCTTCAATGGGTCTTGATCGATGATTTCAAGTCCAACGTCATCTAGCTTTTAGTGGGCCCTTCATTAAAAGCCTAGGCTCAAATTTTGTGGGCTAACATGGTTAAAGCTATTCTTTTTGAAATTTGGTTTGAGAGGAATGAACGCATTTTTCAAGATAAGTCACATCCTTAGTTTGATCGTTAT
GAGATTACAAGAGTCAAAGGCTTCTACATGGTGCTCTCTTTCCAAATATTTTGCAGACTATTCTATCCAAGATACTTGTCTAAATTGGAATGCTTTTATCGTTTCTTCTAATATTTGAATCTTTGTTTAGTCTTGTATTACTTGCTTTATATTCGTAAGTTTGAGTTTTATGTTTCGTTTTCTCTTCTGTACTTTGAGTATTAGACTCATTTTATTAATTCAATGAAAAGTCTTGTTTCTGTTTCAAAGTGAAATATCACTGGCCTTCATTTCTTATTTGGCTCGACAATCTGTAGCATCTAACAATTTGCACTTGCAGGGATCGGGAATTATGTGGAATCAATATAACTTTCCTGTTTTCTTAATATCTGAGAGCAGCATTTCATCTATACAGGAAGTAATTCTGATAACTTTCAACTGTTAGATGTTTTAATTACTATTTGAGAGTTTGGCTCAGTTGCTATATGCTTCTCTCTGGAAATCAAATTTTTATTTTGATTTTTCTGATGGTAGCCATAGCCATTAAGCGTTCTTTAATAATGTATGGTTCTTTGTCATTAGAATAAGATCATGACGTGTTTTTATAATTCTTATAAATGGTTAAGGAGAACCTACCTTTACAGATCATGATTCTATGTTTCTTTGTGGGCTTTGGTTATGATGCTTTTTTGTAATTATTTCCTTAGCTTTATTTTGCTTGATTGGAGACCTTTCTTTTAGGTAGCCTCCTCTTTTTTGTGTGCCCTTACATTCTTTCCTTTTTTTTGAAAGTGTGGTTATTCATCACAAAACTGAAACACAATATGTAAACCTCACCAAATAACAACCCAAAGCAGAAAGTAGAAATGTTTTTGAACCAGTGCATGTGACTTAGGGTTGAAATTCATTGATGTTTTGCAATTATGTTTGTATACTGAATTAAAAGAGGACAGCTTTAGTTTATATTTTTCTGAAGCTACAGAACTTACGTATGTATAATCAAGGACAGCATTTTGACATATGCTTGATGTTTCTTATGTAATTGACCATAAGCTTGATCCATTTAAGTGCTTTCATGCTTAAAAGATAATTATGTGAATAACTATGACTAAAATAATTTGTTCTGTTGCATAATCTTACCACTTAATGCAGGCTGCTTCAAAAAATGTGAAGAATAAGAAAAATTACATATCTAATGTTGCTGAATTTGATCTGGTGATGCAGGTACTTAGTTTTACATTTCAGTGATTTCACATCTAAAATGGTTATTTAATCACAGTTTGCAAGGTTTGTGTAATTGATTTTTTATTTTATTTTTATTTTTTTTCTTTCCTCTAGACAACTAAGGCTGGAACTCATAATTCAATGTCTTGTTTGAAAGAAGAAACATGCCTTCCGTTAGGTGGATACAGGTTAATTTTTACTGTTTTAGATACCCCCTTCTCTCTCTCTTTTTTTTTTTTGGGTCTCCACATAGTTTTACAATCTCACTGTGTTTATTTTATTAAAAGAGATGAATATCTTTACAGTGTCTGGTCATCACTTCCTCCAATCAATACTTCTTCCTCTGATCAATCTAAGCCCGTCATTCTCACAGTAGCTTCAATGGATTCTGCCTCTTTCTTCCGTGATAAAAGCATTGGTGCAGACTCCCCCATCTCTGTATGATTTATTTTGAGCAAGTATATCAGTCATAAACCTTACCATCTTTGTTCTGTTACCCGTTAAAGTTGTTTTTGGTACATTTTAGGGCTTGATTGCGTTGCTGGCTGCAGTTGACGCACTTTCCCATGTGGAGGGATTGGATGATCTTCATAAACAGGTTCTTATTGAAGCTTCCTTCTTCTTCTTGTTTTTTTTTTTTTGGGATAAGTTCTTATTGAAGCTGTTTTCTTTGTATCTAATTACACTGTCTTTATGTTAAGAGCTTACTAACTGAAATGGAAATATTTTATAATTTTTTTTCATCTACATAGCTGCATTTTGGACTTATTTAACCTTAACCCTTCCCCTTTAATAGA
GGATCAAAGCCTCAGTTACGACCTGTCTAAATTAATATGGAAATTGAAACAAAAACCATTTTCTTGTTTGAACCATACTTTGTAACTTTTTTGATGAGATTCATAAAAGCACCCTGCGAATATCTCAGTACCTTTCGTGTTTGATCTAGGAACGGTGGAAAACAATGATCACATCTTTCATAATGAAAGTTATGTTTCTTATAGTAAAAAAGGGCCACATCTTTGTATATTTCCTTAGATCATTGGTGCCTTTGACAGTGAAAATTATATTTCTTATAAGAAAAAGGTCACATCTTTGTAAGTTTCTCTAGATCATTGGTGTCAACCTTGTATGTGTGTGTAATGGTCACATTTTAGATAATGAAAGTTATGTTTCTATCCTAAACAACCTTATGTGTGTGTTGCATGACTTGGTAGGTTATTTCCAAGTTGTGTGATGATGGTGGGTTGGGGATAAGAAATCTGGCATGTAAGAAGTTTTCTTCTGGCTATATTCCTTCATTTATTCTCATTAAAAAAAAAGAAAAAAAGAGAAAAGTTCTCTTCTGGCTATGTAAATGCTGCATTTTCCTTTAAACACACAAGATTATTGTGTTGGGTGGTGGGATACACCTACCTTGATGAAAGACACATTTAGGAGTCCTTAGATGCTATCATTCTTTTTCTGCAGTATATTAATTTTGTAATGTTCACTCTGGGGAATGGGTCATCAAACTTTATCCAAATGTTTATAAAATCTAAATGTTTTCTTTAGGAAACATGGATACTCTGGTCCGACATGTCTTGTGATACGTTTTGGACACATGGACATGTTTCCAACGTACAAAAATAATTGTTTGCTTTCATTATTACTATTTAATTAATATTAGACATGTGGATATGTGCCTATGCGAAATGAGAAGTTTTAGAAAAACTCATGGCTGATGGCCTTTTTGTAATATTTGATTACATGGCGTGAAGGATCTTTACCTTTTTTTTTTTTTTTTTTTTTTTGTGATCCCTTCCTTTAGTGGAGGCTTTGATTTTCCTAATGGTTCTTATAGGCATCTATTTAACTGGATTGTTTTCTGGTCCAAATTGGATATGTGCATGTTGGTCATGACCATTCTATATCTTCCAGTATGTGGGCTTTTAATCAATGGATTTTTAATTATCAGTTACGTGATCTACCACTAAAAAATGGGAATTTTATTTGGTCCAACTCTGGCACTCTTGTTTATCTCTATGGGATCGGTTTTTAACTACGGACAGCTATGTCAAAAAGTATGGGGATATGACTGTACAACATCTTGACAGAATCACATCTGGTCATTATCCTTTGGTTCTCTCTACAGGTAATATTCATTGGGGCCTTTGCTTGTTTTGTTTTGAGAATTCCTGGTTATAGGTCCCATCGTTTTGGCCCTTGGTGGAGGATTGGTGGAACAGAAACCCCATATTTGGATGGTTTGGTCACAATTTGATGATGAAGTTGAAAAGATTGAAGATCTTTCTTTGAGCCTGGAGACGAAACCAGGATAACATTGCCACGTAGCTCCCATTACTCGTGTCATAGCTCAGTGATCTAGATAAAGTGGAGGATTGTAGACCGCTTACTAATGAACATTCTCTCTGCCGTTCCCTTAAGGAACAAATTGAATGTTTAATTGCTTAGGAACATATTTATTGGCAGTAGAGGTGTAAGCTTAAATGGCTTCGTGAAGGTAACGAAAATGCTCGTTTTTTCCATCGTATTTTTGCCGCTTGTAAGCGTAAGAGCTCGATCATGAAAATTCTCTCTCGAGATGGTTGTAGTCTTGTAACAGCCGACGATATTGAAATTAAATTTCTTGATTTTTACATGAAACTTTTCACAAAATATGGTAATTAGCGATTCCTATCGACTACTATTGAGTGTTCTACTAGCGCCACAGAGACTGCTAATCTTGAAGGGCCTTTCTCAAAGGATGAGGTACACAGAGCGGTCTCTTCTTCGGGTGTTGGTAAACCTCCTGGTCCTG
ATGGTTATATTGCAGAGTTCTTTAAATCTTTTTGGCCTAGTCTTAAAAGTGAGATTATGAATATGGTTCATGACTTTTTTGACTCTGGTGTTATTAATGCCTCGTTGAACGAGACCTACATTTGCCTTATCCCAAAGAAAATTGAATCTAGACTGGTATCTGATTATTGACTTATTAGTCTTATTTCATGCGCATATAAGATCTTTGTCAGTGTTTTGTCTGACTGTTTGAAGACAATCCTCCCGTGCACCATGGCTGTTAATCAGTTAGCTTTTGTGGCTAATAGACAAATCATGGATGCATCTCTCTTGGCGAATGTGTTGGTTGATGATTGGATTCTAAATGATAAAGAAGGGGTGGTTCTTAAACTGGATCTTGAAAAGGCCTTTGATACTGTTGATTGGAATTTCCAAGATGCTTTATAGATTAAAGGTTTTGGTCTGACTTGGCGGTCGTGGATACATGGTTGTATTTCTAGTGCGAATTTTTCGATTATCATCAATGGCCGGCGTAGGGGGAAGATTACCCCATCTCGTGTCATCAGGCAGGGGCATTCGCTTTCACCATTCTTATTTATTTTGGTTGTTGATTATTTGAGTTGCCTTATGAGTCATAGCTTTTTGTTGGGCCTTCTTGCTACTCATTCTATTGGAAGGTCTTCTCTTACTTTGAATCACTTAAAATTTGCTGATGACACTTTACTTTTTTCCACTACTGATCGTGTTGCTTTACAACATTTCTTTGACGTGGTTCGAATTTTTGAAAAGGCGTCTAGTTTGGCTATTAATTTTTCAGAGTGAAATACCGAGGATTCATATCAGTGATGCTGATTTGACGTGGTTGTTGAGTACCTTTGGCTGTAAGCAGGGTAGAGTTTGGCTTATCTTGGTTTGCTTTTGGGTGGGAACCCACATTCTCCTATTTTTGGCAATCGGTTGTTGAGCGGATACAACATAAATTACACAATTGGAAATATGCCTACATTTCTAAGGGAAGACGTCTAACTCTTATTCATGCTACGCTATCTGGCATACCTACTTATTATCTTTCTCTTTTTCATGCCCCCGAATGATGTCATTAAAACTCTGGAGAAGTTGGTTCGAGATTTTCTTTCGAAAGGATCTCGGGGTGGTGGTGGAATGCATATTGTTAATTGGGAGATTGTTCAGTGGCCTAGATTAATGGGTGGATTGGGTGTTGGCAATTTTCAGTTGCATAATTTTGTTTTGCTGGCAAAGTGGATCTGGCGTTTTCTTCATGAGCCTAATGCATTGTAGCGTGATTTTATTTTGACTAAGTATTACTCCCCCTCTTGTGTCTGGCCTTCTGTTGTTCAGGGTGGTCCATCGATCTCCTTGGTGGTTTATTTCACAGACTGTTGGTTTGATTGCCTCTCGTACTCGGAATTGTCTTGGCAATGGTACTTCTATTTCTTTCTGGAAAGATCCTTGGCTGGATTGTGGTATTCTATGCAATGCTTTTCCTTGATTATTTCATCTTGCGCTTCAACCAGATGTTTCGGTGGCTTATGCCTGGGTTGTAGAATCTGAGGCTTGGGACTTGGATCTTGGCCGTGATCTTAATGATCTGGAGACGGTTGAGTGAGCTTCTTTGTCTCATATTTTATCATCGATCAGGTTGCATCCCATATTTGATTCTTGGTGTTGGACATTGGAATCTTCTTCGTCCTTCTCTGTTAAATCTCTTTTAGATGATTTGGTTGGGGTGACTGAGCCATATGTCTAAGACCTCTACTCAGTTATCTGAAAAGACTTTTATCCAAAGAAGATTAAGCTTTTCCTTTTGAAGCCTAGTCTTGGTGCTGTTAATACTGCAAATTATTTATAGCGTTGTATGCCATGTCTCTTTCGCCATGTTGGTGTATTATGTGTCAGTACGATGGGGAATCTCCCTCTCATCCTTTTGTGCATTGCTCTTTTGCTTCTCGTTTCTGGCACATTGTTTGGAAGCTTTTGGTTGGTCTCTGACAGGATCTAACT
TTATCTTTGATATTTTGGCATCTCTTATGGTGAGTCATCCTTTTGGTGGTACTAAACGATTGATTTGGTTGGCCCTTACGTATGCTTTTTTTGAACCCTTTGGTGCGAACGCAATGGTCGTGTTTTCAATGATTTTTTCTCATCTTTTGGTAGATTTTTGTATTGTCTACTGCTTTTTATTGGTGCAGAGATAAACACCCTTTAATCATTTCGGTTTTTCTTTTTTAGTCTCCAATTGGAAATATTTCTTATAATTGTCTATTGGCTTTTGAGGTTTCCCTATTTCATTATTCAAACCCACTGGTACTAGTTATTATTGTTTGTCCTTTTTCTTGTTTTGTGGGGGTGGGATCTATCCTTAAGAAGCCCATATCCATATGCTTATGTGTCCCTTTAGCATTTATGAGACTTTGTGGGGGGTTTGATGATGTGTGGGATATAATCATCTCAATCCCATATGCAGATGGTCTATAATTTGTTTCATTTTCTATTTGGCTTCATTCAGATTGATTACATAGGGCATGGTGCTTTAAATCTTGTCTGTATCATTTGCTTTTCGTTGTTCTTATCTTGCTAATAGTTATTCTTGATTATACAGCTTGTTTTTGGTGTCTTCACTGGAGAGTCCTGGGGCTACCTTGGTAGTCGGAGATTTTTGCTTGAACTTGATCTACAGTCCGATGCTGTCAGTGGCCTTAACAATAGGTTAATTGATACGGTATACTACTGAACTTGTAAATTGGTGCACTGAATTTTAGTTTTCTTGTACTCATCATACTTTGAGTGTGTTTTCTACTTCTATTGCTCGAGTGTCTTCCTAGTACATGATCACTGGAAAAATGCATTATGGATGAAAAGAATCATCCAAACCCTTTGGCTGCTAATTTGAAACAGTCGAAGGATGATTTCCCCCTTCATCGTGTGTAGTTGTTCATATCGCGTGTATTTGATTTATTATTTATACAATTCTTGTAAGTTTTTGTTTGCTTAATTTGTTATTTTCAATTGGGAAGGTAAATATATTCTTAACTTTTTTTTTCAATTTGTTCTTAGCCAGGGTCCTAGGTATTTCTTGTATGCTTTCAGGACTCAAGTTTTTGTTAGTAGAAACCTCAAGAAGATTATGAGAGATTTTCCTAGTCCTCGTGGGAATTTGGAATGTTAGGTGTTGGTTTCCTAGGCCTTCGAAAGGCTTCTCTTGCCATTTCTTTTCCAACCTTTTGGCAGCCCTTGTTCCCATCGATCTTCTACTCACTATGGAAGGTGTATACTCCAAAGAAGGTCAAATACTTTGGATTGGATTCAGAGACTATTCTTTTGTCTTGAGACCAAAATGGCGTTTTCTCTACATGGAGGCGTTGAGTATCTAGGACATTTGCTGTGGTTGTGCCGGTTTTCTTGCCATATTAGGGATCACATGTTTGAGACTCTAGAGTTGAGCTAGCTTGGTTCATCACAAAGATTGTTCTTGTATGATGGGGAAAGTGTTGCTCCATTCTCCCTTTAGGGGAAAAGGGAAACAGTTTACGAAGGACAGCTTTTTTGCCATCATTTGGGGCTTATGGCTGGGGAGAAATAACAGAATCTTCAGATGGTTTAAGAGGCCTTGGGAGGATGCTTGGTCCTTAGCTAGGCTCAATATCCTGCTTTGGCCTCCCCCTTTGGAGGTAACCTGATGACAATTTCCTTTTAGGATCCTACGGTGGATAAGATTAGGAAGAGGTTGGCTTTGTGGAAGAAAAATTTCTTTTCTAAAGCTGGTAGGCTTACTCTCATTAGATTTGTCTTGGGTGGCATCTCCATTTATTATCTTCCCCCTTTACGAACCCCAAGTGAGGCCGGTAAAAGCATTCAAAAGCTTATGAGATATTTTCTTTGGAAATGGTTGGATGAAGGAAGAGGTTATGAGCTGGGAAGCCTTTGGATGTTGGTTTGGGGGGGGGGGGGGGAGGTTCTGAGTTTGGATATTTGAGGATTAGGAACAAAGCTCTGTTGGCCAA
GTGGCTATGGTGTTTACCCTCTAAACTCGAATCTTTATGGCATAGAATCATTGTGAGTAAGTACGACACTCACCCCTTCGAGTGGTCTTCAAGTGGGGTTAAAGGTATGCGTTAGAATCCCTGGAAGGATATTTCTCTTGAGCTTCCCTCTTTCTCCTATCCAATTAGTTGTGTGGTGGGGGAAGGTAAGGACACACTTTTGGGAGGATCTTTGGGTGGGGGATAGTTTCCTCTTCTGTGTCTTGGCCATGTTTGCCTATGGCGACATGCCCAAACCATCCTTTAACCACCTTATATTTATGTCTTGTAACTAGTTTAGCAAGTTGTTTTCATTTTCATATTGTTTTCCTTGTAAGTGACTCTAGCCATTTTGACGTGCGAGCTTGACATAGTTGAAATTGTGTAGAATGTACTATAGAGGACACCTACTTGTCAACCCTGACCAAGTATTGTTTACTCTTTGCAATCAAAAACCTTCTTGCAATTGACAACCTACAATGCTTTCTTTAATGATGTTTTCAATGCTTTCTCTCTCACATTTTATTTTTTTGAATTGGAAACAAAACGTGAGTGTAGGCTAGACCCTACACTGAAGTTGTTCATGACTTTCGTTTTTTGTACGGCTAACTTGCAATTTCCTCATCTATACTATCTTTCCTCTCTTAAAAATCGTTTGATATGTGATTTCTTGGCGTGGACTGGGAGCTCAGTTTCGTTTTAATTCGGGTTCCGTCGTTCTTTGTTCGACAGGAAAATGACGGAGATGGTCTCTCTTCTGTCGTTGATTGGAGAGGTTACTTTTAGAATCGGGAGGAGATTTTCGCATTTGGAGTCTTGACCCTTAAGAGTGCTTTTCTTGCAAATTTTCGACACTTATTGGACCCTTCTCTTGTTAGCGAGTTGGTGTTTGATATCCTTTGATATCCATGGTTTGATTGTTTTCCCTCTTCGTCAGTTGACAATCGTTCTCTTAATCCTACGACCCATCTTACAAGATTCTGGTTTATGTATTATGGCTATTCCCGCAAGGCCGCAAGGAATAACAACAATAGATGCACGGTAAAACTAAAGAAAGGAATCAGAGAGGTTCGGGACCTAGAGTGCTCTATCTCCTACGAGAAAAGGCAAAAGCCTTGATTAGAGAGGGGTCGTTATCAGCTTAATGATCGTGTTATCGTGGAATGTTAAGGGTCTCGACTCATAGAAAAAGAGAGCTGCCATCAGGTACATTATCACAATGATCCATCCATCCATCGTTATTCTCCAAGAAACAAAAAGCAGAGATATAGATCAACTCTTCATCAAATCGATTTGGAGAGCGAGGGGTATTGATTGGGCTTCTAGGGACGTTGTTTGTTCATTTGGGGGTATTTTGATACTATGGGATGTTTCTTCTTTCTCAGCTTCATCTGGTACTCAAGGTGCATATTCTCTCTCTATTCTTTTCACGTTAGCGGATGGTTACAACTTCTGGGTTTCAGGCATCCATGGTCCTTCCTATGATTATGAAAAGCCTCCTTTCATCCATGAACTCTATGATTTATACTGTCTTGTTGAAGATAATTGGATTTTGGGAGGAGACTTCAATATTATTCGTTGGCCCCCTCAAAATGATGCTTTTGCCAAAGAAACAGTGAAACAATGAACCATTTGTTCATCCACTGCCCGTTTGCTAATACTATTTGGACTGAATTGTTCAGTGCCTTCAATTGGACCACAGTTCTCCCATATGGCTTGTTGGATTGGATCAGAATGACATTACTGAACCATCCCTTCAAAGACACAAAAGCTAATTTATGGACCAGCTTCATTTGCACGGCTTTGTCTAAGATTTGAGAAGAAAGGGATGCTAGAATCTTGAGAAACAACTCAAATTCAATGATGTAATAGTGGCTTCCATTTTTAATGCTCTTTATTGGTGTAAACAGTTACAACCTCTCAAAGAGATCACAATTTAAGTTTTCTTGTTGCAAATTAGAACAATCTTTTGTACTCCCTTA
AATAGGGCCTTTTTTGGATTATTTCATTCATCAATGAAACTGAAAATGAGTATACTTATCACGACCGAAGAAAATTCGTATGGTGCATAGTTTTCAATTGAAATTCTCAGACTCCTGAAAGCATACCTATGGACGGTGCCTTCATTGCCAGGATCGTTTCAATAAATTGGCAATTACATGTGGGTGAGAGAAGTACGTAGGCTTAGACAATAACTTTTCTCCCTCACATGAGATGAAGAAAGTGAGGTTTCACAGATTCCTAGTTGAGGAAAATAAGGATTTTTTTTTTTTTTTTTTTTTTTTTGAAATAGAAACAAAACCTTTTCATTGAATTAATGAAATGAGTCTAATGATCAAAGTACAATAAAACAAACAAAGAAAAGAGAAAATACAACATAAAAAATGGAGAAACAAGAACAAAATGATAGATAAAGTAGACCAATCTTTAATACAACTCCTAATAAACAAAAAATTCCAAACTGCCAAACCTCTAGCAAAGAACAAGACTGGAGAGTTGAAAGCCATAAGCTTTGAAAGTAGAGGAAACTTGAAATGGAGCTCAAAACCAGACTGAGTTGCCCGCTGCCAACCCCAGAAATCTAGCCAGCCAAACATAAGGAGATCGTCAGATTCCAGAAAATAAAATAATGTGATGAAGACTCGAAATACTCTAAAAACTCCACAGCTGAAACTAACCAAAATAAAATCTCCAAGCATAACACTATATGAGCATCCACAATCCAAACAAAACGACCTTACAAGAATCGAAAGGCCTAATCCACAAAAATAGTTCAATTTCTGGACAAAAACCACCAAACTATGCTTTAATTGAAGGAGATTGAGGAGGAATTGCTTCCATCTGAAAACCACATATTTCAGTCAAAGCGGAGAACTTGGCAGGAATTATGTGAGACAATTTTGCTTGAACATTTTCACCCTTCTCAAACAAAGTATCAAAACCATCTGCAAAGCAAACCTCTAGAGTCTCCTCAACAAAATCATCTTGTGCTAGAGAATCCATAATATTTTCACTGCTTAAACTAGCATTTGAAACACCATCAGAATCATAACCAGAAGGAGTGTCTTTAGGAGATGGATGACTATAAGGATTTCATGTAGTCATATTTGAAGTGGTTTAGCATGGCATTTCCTTTCCGTGACATGATGCAGATTTGGCATGATATGAGAAAAGTGATTTTTTTGAGCTAAAGCTGCCTTCATTTATTTATTTTTTGTTCGTTGGTCCCTGTGTTCTCTTCTTTTCTTTTCTTTTTGATAGCTTCGGCCCATTTATTGAAGTTGATAAAAAGTTGAACTCATTGTGCTTTCACTTGTTCAATTTAGGTTTTTGAAATTGGATCTGTTGGAAAGAGCTCCAGTCATGGATTTGGAAAGTTTTTTGCTCACATGACAGAGGTAATGGCTTCTTCCTATAAACTATTTTATTATTATTTTAAGGGCAAATTAACTTTTTGGTCCCTGAGGTTAGAGATGTCCATTTAACCTGCAGGGTTTCCTCGCCCTGCCTTGCCTTTGCACGAAGAATGGGGGAGAGGGTGGGGAGAGATTTCTCCCCATTGAGCAGGGTTTGGGACGTCCCCGACCTTGCCCTGATTCCCCACCTCGTCCCATCTAGTTATATAAATAATACATAAAACAAACCCTAATTTATAATGACTAAAGAAGGAAATTGATGGCTAATTTGTTGGATTCCTTTTCCTTTTTTTTTTTTTTTTGAAATATGTTGTAATGTTTAAGTTGAAAACTATGTTGTTGTATTTGTATATTGAAATTTGAACTTTTAGAAACTAGGGTGATTTAAATATTCTAACTAAATTTTTTCACATGGCATATTACTGTTTTTATTAGGATATTTTTCCTTAAATGGAAATTGGATTGGAAATTATCTACTCAAGCATAGAAACTTCTAAAAATGTATTTGTTTGAGAATAAATTAAAAAAAAAAGCAAATGGAGAACTTGAACCCCACAAAGAATATC
TGTCTGATCCCTAAGGAACTAAACAAGGAATTTCACGAGGATGGGAATTGAAATAGGGGGCGGGACAGGGGAGCTGGCCTTGTCCTTGTCCTTGCCCTTGCCCCACTTTGTAGATATCTCTAGTTAAGGTTTAGGATTTGTGTCAATTTAGTTCCTAAGTTTCAAAATTAAACAATCAAGTCCCTGATGTTTAGAAAATGCTCAATTTTGGTTCCTTTATCAAAGACACCATCAACTCGATGCTGGCATGGCAATGTTTAAAATAAAACAATAAAAACAAGTAGATAATACTAATAATCAAAATTAAAATATAATTAAATAAAACTCTCCTCCACCACCAACACAGAAACCCACGCTGCCGCTGAAGAAACAACCCCCCACCAAACCTCCCCTCCACCACTCCTAACCATCCCCTTCTCCAACTCTACTACAGCCTCTTCTCTATCCCCTTTTCCCCTACTGCTTCTTTAGTCGGCCGCCACCACTTTTTTTCCTTTTTCTTTTCCCCCCAAATTTGGGCTTTTTCTTCTTTTGATTTTTGGAAAATTTCTGACATTTCTAATGTGGTAAATTTTCTTTTTCCACCCAAATAACAAGAAAATTTCTTAATTAAATAGAGAAGAAATTTTCTTGTTATCTGGTCAAAGAAGAAAGCTGAGGGTGGGGTGGTGGTGGGCCATGGCAGGCGGAATAAGAGTAGAGAGAGAAGAAGATGGAGAAGAAGTGGGAGAAGGAGGAGGAGGAGAAAAAGAAGAAGAGAGAGCAGAAGATGGAGGAGAAGAAAAAGAAGGGTGCCGACACCAAAGCTGTGGGAAAGGAAGGGGTGGAAAGAGTAGTTTTTAGACAAATATTTTTTATTATTTAAAAATCAGTATTTTAATTTAAACATTACCATGTCTGCATCGTGTTAATGGAACTATTAATGATGGGACAAAAAATGAGCATTTTCAAAACACTGGAGGATTGTCTGATTTTGAAACTTAGGGACTAAATTGACACAAACCTAGATCTCAGAAACCAAAAAGGTAATTTATGTACCCTTATTGTAAAGTTTCTCTATTCATATATGTTGCTGCAACATTTAAAATCATAAACTGTGTATAGGCATTTTCATATCTCTGGAAACTGCTTCTTAGTTGTGTTCTAAATTTGATCAAACGTACATCCAAACTCTTGAGAGTCGAGGCCATTACCTTTTTTCATATTGGATATTTGATCTATTTTGTTATACTAGTCCGATATGGTGCCCTAGTAATATATTCAACTATTCCGTTGATATTTTGTTCTATAACCTACAATGCACAGTCATGTTGGTTGAAATTTTTTATTTGAACCTATTATTTTAGCAAGATATTGTTTCTGTAGGTTTCATCTTCAAAGAATGAGACATGGAATGCCTTGAAGCTTGCTCAAGAGTCACTTCCATCAGAGAACATAAAAGTCTCACCAGCTAGTACTGCAAATCCAGGGATACCACCATCTTCCTTGATGGCTTTTCTGGCAAAGGTTTTCAACAAACATACCTGATGATACTATTATTTACCCTCTATTGTCCAACCCCTCTGATCTACTTTATTTCTTTTTTGACCAAAACACACACACACACACAAAATTATAAACGAAAGGCACTCAATTTAAGGTTTTAGACCTTGATAATTTGACAAGTTTGTTTGGTCCATGCAGAACCCGCAAATCTCTGGGGTGGTATTAGAAGACTTTGATACTAGCTTTACCAATAAATTTTACCAGAGTCAGCTCGATGATTTACGTTAGTTATTTATGTTCTTTTCTTTTCTTCCTTTCTTATTTGTCATCATCGCAATTATTTATAGCTGGTCTTGCTTTTTCTTATTACTCGTATATTTTTTTCAATCTAGATAATATAAACTCATCAGCTATTGAAGCAGCTGCTTTACTTGTTGCCCGAAGTCTTTACATTCTTGCAACCAACAAAAAGGAATTGAGTAGGTCTGCCCTAACTGCTATCAAAGTGAACACCT
CGTTGGTTGAAGAGCTTATAGGATGTCTTCTGAATTGTGAGCCGGGTCTCTCTTGTGAGTTGGTGAAGAGATATATTTCTCCCTCCAGTGTTTGTCCAAACCATTACGTTGGTGTTATCCTTGATGAACCTTCCTCTACTCCTTATCCTGGTTATGTTCATGACGTTTCAAGATTTGTTTGGAACTTTTTGGCTGACAGAACATCTAATCCTAAAGAGAATACTATCTCAGTCTGTTCACAAAATTGTGATGACAAAAGTGAGGTGTGCATTGGAGCAGAGACTGGAAAGGGAACTTGTGTTATATCAACCACCAGGTACCGTTATATTTACAGGCATTCAACTGCTTTACAAAGGGGTATTGGTGGGCGCATGACATACTTAGATATTGCATCAAAATTTAAGTTAAAAAACATATTTTTCGTAATAAAAAAAAAAACTTCAATTAAATTAAAATAAAGAATATTGAAAACTATAAATGTGACAACTTAAGTATAGCTCGATTGGTTTAAGGTATTTATCTTCTATTAAGAGGTCGGATGTTTGTCACCCTATCTGTTTGACTGCGAAGGTGTCTCGAGGAAGGATTGCCCCTAGAGAAGAGTGTGTTGACGCCTGTAATGTAGAGGTGTAAGTGGGTTGGTCTGTCCTGCCACCAAAATTGATGTCACAATCCAAATGGAAAGTCAAGAGCATTGATTTGATCTTTTTTTTTTTTCTTTTTGGTGGGGGAATAAAGGAGAGCTATGTACTTCCCCCCACTTTACTTTTTTTTTCTCTTTCTGATGAGAAAATGTGTATGCAATTGTCCGACTTTAACTTAAATAGGATACATACAAAATTGGTCCTTTCTTTGTCATTTTTTTCCTTAAATGGGAAATCAACTTTGATTAAAAGGAAAGGGCCGGCCAACCCATAAGAGGGTATTGCCATTTTTTTAATAATTAAAAAAACCCAACTTTTGTGACTTACCCATTGTTGCATTATGGGCTAAAGAATACCTTCGGATGGTCCACTCTTTACATCGCCGTAGTATCTTAGTTTTTACATAGTATTCCATCTTGTTTGTAATTTTTTTTTTATTGATCAATGAAATTTTATTCTAAAAAAAAAAAAAAGTTGTACGTAACTTTTCTTTTTTCCACGGGTGGTCATAATCGAGGAGCTCAGTTTGTTTCTTGGTTTATTGATAATACGTGTGGCTGATTGCTTCCACATCCAATTTGAGCAACAACATGCCATACTCTCAATTTTTTTAGAGTAATTGTTTTAAATGACAAAACTATTAGAAATATTTTCAAATATGACAAAATGTCATTGTCTGATAGATGATAGTAGTCTATCGCAGTCTATCACTGTAGACAGAGTGACATTTTGTTATATTTGTAAATAAGTTAGCTCATTTTGCTATATTTGAAAACAACCATTTTTTTTATTAGTATTTAAGAAAAAAAAGACTTGCATAGTAGTTGCACTGCAAAATGCTCTTTAATGACTTCCCTTCTCTTTCTAATATCGGTAGTTATTTGCTACAGGTTCGTCCCAGCATACTCAACAAGATTGAAGTTCGAATCTGGATATTGGAACGTGCTTCCTCCAAATTCGTCAGACCCGCTTGGGGCTGTCGATCCAGTTTGGACAGAGAGCAATTGGAATACCATAGGACTCCGAATGTATACCATCCAAGCTACTGCTTATGATCGTTTTGTTTTACTTGGAGGCATTACTACCACAATCTTAGCTTATTTTGCAATAGTAGCCGTGCGAAGCTCCATTATAAAGGCCTTGAAGAGAGATTGA

mRNA sequence

ATGCATTCAACAGATGAACACTCAATGGAGTCGGTTCCTGATCTTCAAAATTCGATGTACCTAGCGGTTGATGGTTATCCATGCATTCGACTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAATCCTGGACGAGAAAAGGTTGTAGTTCCAATGATTAACTTCAAAGATGCTAATGAGATATTGCAGCCATCTGCAATTTTAGTTTCAATGGATGAAATCTCGAGTTTCTTTACTAGACTACAGGAAGATTTGCATTTTGCAAATAATGTTGGCGGTGTTTTAATCGAACCAGGAACTGAAATACAAAATAGAATGGAAGGATTTTCTCCTGCCCAAAAGTTTCCCCAAGCTAAATTTGCTCCCTATAAAAAAATTGACTATGAATGGAACCCAATTGCTGCTTCAAAAAATGTGAAGAATAAGAAAAATTACATATCTAATGTTGCTGAATTTGATCTGGTGATGCAGACAACTAAGGCTGGAACTCATAATTCAATGTCTTGTTTGAAAGAAGAAACATGCCTTCCGTTAGGTGGATACAGTGTCTGGTCATCACTTCCTCCAATCAATACTTCTTCCTCTGATCAATCTAAGCCCGTCATTCTCACAGTAGCTTCAATGGATTCTGCCTCTTTCTTCCGTGATAAAAGCATTGGTGCAGACTCCCCCATCTCTGGCTTGATTGCGTTGCTGGCTGCAGTTGACGCACTTTCCCATGTGGAGGGATTGGATGATCTTCATAAACAGCTTGTTTTTGGTGTCTTCACTGGAGAGTCCTGGGGCTACCTTGGTAGTCGGAGATTTTTGCTTGAACTTGATCTACAGTCCGATGCTGTCAGTGGCCTTAACAATAGGTTAATTGATACGGTTTTTGAAATTGGATCTGTTGGAAAGAGCTCCAGTCATGGATTTGGAAAGTTTTTTGCTCACATGACAGAGGTTTCATCTTCAAAGAATGAGACATGGAATGCCTTGAAGCTTGCTCAAGAGTCACTTCCATCAGAGAACATAAAAGTCTCACCAGCTAGTACTGCAAATCCAGGGATACCACCATCTTCCTTGATGGCTTTTCTGGCAAAGAACCCGCAAATCTCTGGGGTGGTATTAGAAGACTTTGATACTAGCTTTACCAATAAATTTTACCAGAGTCAGCTCGATGATTTACATAATATAAACTCATCAGCTATTGAAGCAGCTGCTTTACTTGTTGCCCGAAGTCTTTACATTCTTGCAACCAACAAAAAGGAATTGAGTAGGTCTGCCCTAACTGCTATCAAAGTGAACACCTCGTTGGTTGAAGAGCTTATAGGATGTCTTCTGAATTGTGAGCCGGGTCTCTCTTGTGAGTTGGTGAAGAGATATATTTCTCCCTCCAGTGTTTGTCCAAACCATTACGTTGGTGTTATCCTTGATGAACCTTCCTCTACTCCTTATCCTGGTTATGTTCATGACGTTTCAAGATTTGTTTGGAACTTTTTGGCTGACAGAACATCTAATCCTAAAGAGAATACTATCTCAGTCTGTTCACAAAATTGTGATGACAAAAGTGAGGTGTGCATTGGAGCAGAGACTGGAAAGGGAACTTGTGTTATATCAACCACCAGGTTCGTCCCAGCATACTCAACAAGATTGAAGTTCGAATCTGGATATTGGAACGTGCTTCCTCCAAATTCGTCAGACCCGCTTGGGGCTGTCGATCCAGTTTGGACAGAGAGCAATTGGAATACCATAGGACTCCGAATGTATACCATCCAAGCTACTGCTTATGATCGTTTTGTTTTACTTGGAGGCATTACTACCACAATCTTAGCTTATTTTGCAATAGTAGCCGTGCGAAGCTCCATTATAAAGGCCTTGAAGAGAGATTGA

Coding sequence (CDS)

ATGCATTCAACAGATGAACACTCAATGGAGTCGGTTCCTGATCTTCAAAATTCGATGTACCTAGCGGTTGATGGTTATCCATGCATTCGACTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAATCCTGGACGAGAAAAGGTTGTAGTTCCAATGATTAACTTCAAAGATGCTAATGAGATATTGCAGCCATCTGCAATTTTAGTTTCAATGGATGAAATCTCGAGTTTCTTTACTAGACTACAGGAAGATTTGCATTTTGCAAATAATGTTGGCGGTGTTTTAATCGAACCAGGAACTGAAATACAAAATAGAATGGAAGGATTTTCTCCTGCCCAAAAGTTTCCCCAAGCTAAATTTGCTCCCTATAAAAAAATTGACTATGAATGGAACCCAATTGCTGCTTCAAAAAATGTGAAGAATAAGAAAAATTACATATCTAATGTTGCTGAATTTGATCTGGTGATGCAGACAACTAAGGCTGGAACTCATAATTCAATGTCTTGTTTGAAAGAAGAAACATGCCTTCCGTTAGGTGGATACAGTGTCTGGTCATCACTTCCTCCAATCAATACTTCTTCCTCTGATCAATCTAAGCCCGTCATTCTCACAGTAGCTTCAATGGATTCTGCCTCTTTCTTCCGTGATAAAAGCATTGGTGCAGACTCCCCCATCTCTGGCTTGATTGCGTTGCTGGCTGCAGTTGACGCACTTTCCCATGTGGAGGGATTGGATGATCTTCATAAACAGCTTGTTTTTGGTGTCTTCACTGGAGAGTCCTGGGGCTACCTTGGTAGTCGGAGATTTTTGCTTGAACTTGATCTACAGTCCGATGCTGTCAGTGGCCTTAACAATAGGTTAATTGATACGGTTTTTGAAATTGGATCTGTTGGAAAGAGCTCCAGTCATGGATTTGGAAAGTTTTTTGCTCACATGACAGAGGTTTCATCTTCAAAGAATGAGACATGGAATGCCTTGAAGCTTGCTCAAGAGTCACTTCCATCAGAGAACATAAAAGTCTCACCAGCTAGTACTGCAAATCCAGGGATACCACCATCTTCCTTGATGGCTTTTCTGGCAAAGAACCCGCAAATCTCTGGGGTGGTATTAGAAGACTTTGATACTAGCTTTACCAATAAATTTTACCAGAGTCAGCTCGATGATTTACATAATATAAACTCATCAGCTATTGAAGCAGCTGCTTTACTTGTTGCCCGAAGTCTTTACATTCTTGCAACCAACAAAAAGGAATTGAGTAGGTCTGCCCTAACTGCTATCAAAGTGAACACCTCGTTGGTTGAAGAGCTTATAGGATGTCTTCTGAATTGTGAGCCGGGTCTCTCTTGTGAGTTGGTGAAGAGATATATTTCTCCCTCCAGTGTTTGTCCAAACCATTACGTTGGTGTTATCCTTGATGAACCTTCCTCTACTCCTTATCCTGGTTATGTTCATGACGTTTCAAGATTTGTTTGGAACTTTTTGGCTGACAGAACATCTAATCCTAAAGAGAATACTATCTCAGTCTGTTCACAAAATTGTGATGACAAAAGTGAGGTGTGCATTGGAGCAGAGACTGGAAAGGGAACTTGTGTTATATCAACCACCAGGTTCGTCCCAGCATACTCAACAAGATTGAAGTTCGAATCTGGATATTGGAACGTGCTTCCTCCAAATTCGTCAGACCCGCTTGGGGCTGTCGATCCAGTTTGGACAGAGAGCAATTGGAATACCATAGGACTCCGAATGTATACCATCCAAGCTACTGCTTATGATCGTTTTGTTTTACTTGGAGGCATTACTACCACAATCTTAGCTTATTTTGCAATAGTAGCCGTGCGAAGCTCCATTATAAAGGCCTTGAAGAGAGATTGA

Protein sequence

MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKIDYEWNPIAASKNVKNKKNYISNVAEFDLVMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD
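
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch of that relationship, the snippet below translates the first 30 nucleotides of the CDS with the standard genetic code and compares the result with the start of the listed protein. The function name `translate` and the compact codon-table construction are illustrative choices, not something taken from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
cds_prefix = "ATGCATTCAACAGATGAACACTCAATGGAG"
print(translate(cds_prefix))
```

Applying the same function to the full CDS should reproduce the whole protein sequence, provided the record uses the standard genetic code.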
Homology
BLAST of HG10011195.1 vs. NCBI nr
Match: XP_038896057.1 (nicastrin [Benincasa hispida])

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 591/649 (91.06%), Positives = 609/649 (93.84%), Query Frame = 0

Query: 1   MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANE 60
           + S+DEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+E
Sbjct: 17  LSSSDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADE 76

Query: 61  ILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKF 120
           ILQPSA+LVSMDEISSFFTRLQ+D HFANNVGGVLI+PGTE+QN  +GFSPAQKFPQA F
Sbjct: 77  ILQPSAVLVSMDEISSFFTRLQDDSHFANNVGGVLIKPGTEMQNSTKGFSPAQKFPQASF 136

Query: 121 APYKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDL 180
           APYKKIDYEWNPI                          AASKNVKNKK+YISNVAEFDL
Sbjct: 137 APYKKIDYEWNPIGSGIMWNQYNFPVFLISESSISSIQEAASKNVKNKKDYISNVAEFDL 196

Query: 181 VMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 240
           VMQTTKAGTHNSMSCLKE TCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF
Sbjct: 197 VMQTTKAGTHNSMSCLKEVTCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 256

Query: 241 RDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELD 300
           RDKSIGADSPISGLIALLAAVDALSHV+G DDLHKQLVF VFTGESWGYLGSRRFLLELD
Sbjct: 257 RDKSIGADSPISGLIALLAAVDALSHVDGFDDLHKQLVFVVFTGESWGYLGSRRFLLELD 316

Query: 301 LQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLP 360
           LQSDAVSGLNNRLID+VFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLA+ESLP
Sbjct: 317 LQSDAVSGLNNRLIDSVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLARESLP 376

Query: 361 SENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINS 420
            ENIKVSPAST NPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTN+FYQS LDDLHNINS
Sbjct: 377 LENIKVSPASTTNPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNQFYQSHLDDLHNINS 436

Query: 421 SAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKR 480
           SAIEAAALLVAR+LYILATNKKELS SALTAIKVNTSLVEELIGCLLNC+PGLSCELVKR
Sbjct: 437 SAIEAAALLVARTLYILATNKKELSSSALTAIKVNTSLVEELIGCLLNCDPGLSCELVKR 496

Query: 481 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCD 540
           YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTS PKENT SVCSQNCD
Sbjct: 497 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCD 556

Query: 541 DKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 600
           DKSEVCIGAETGKGTCVISTTR++PAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN
Sbjct: 557 DKSEVCIGAETGKGTCVISTTRYIPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 616

Query: 601 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAV+SSIIKALKRD
Sbjct: 617 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVQSSIIKALKRD 665
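
The percentages in the HSP summary line are simple ratios over the alignment length. The denominator (649 for this hit) counts alignment columns including gaps, so it can exceed the number of residues in the query. A minimal sketch, with the helper name `percent` chosen purely for illustration:

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts from the first hit (XP_038896057.1).
identity = percent(591, 649)   # identical residues
positives = percent(609, 649)  # identical or conservatively substituted residues

print(identity, positives)
```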

BLAST of HG10011195.1 vs. NCBI nr
Match: XP_004140732.1 (nicastrin [Cucumis sativus] >KGN57470.2 hypothetical protein Csa_011284 [Cucumis sativus])

HSP 1 Score: 1130.9 bits (2924), Expect = 0.0e+00
Identity = 575/649 (88.60%), Positives = 594/649 (91.53%), Query Frame = 0

Query: 1   MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANE 60
           + S+DEHSMESVPDLQNSMYLAVD YPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+E
Sbjct: 17  LSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADE 76

Query: 61  ILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKF 120
           ILQPSA+LVSMD ISSFFTRLQ+D HFANNVGGVLIEPGT IQNR EGFSPAQKFPQAKF
Sbjct: 77  ILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFSPAQKFPQAKF 136

Query: 121 APYKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDL 180
           APY+K DYEWNP                           AASKNVK+KK+YISNVAEFDL
Sbjct: 137 APYEKSDYEWNPSGSGIMWNQYNFPVFLISESSISSIQEAASKNVKSKKDYISNVAEFDL 196

Query: 181 VMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 240
           VMQTTKAGTH+SMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF
Sbjct: 197 VMQTTKAGTHSSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 256

Query: 241 RDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELD 300
           RDKSIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELD
Sbjct: 257 RDKSIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFAVFTGESWGYLGSRRFLLELD 316

Query: 301 LQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLP 360
           LQSDAVSGL NRLIDTVFEIGSVGKSS HG G FFAHMTEVSSS NETWNALKLA+ESLP
Sbjct: 317 LQSDAVSGLENRLIDTVFEIGSVGKSSKHGSGNFFAHMTEVSSSNNETWNALKLARESLP 376

Query: 361 SENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINS 420
            ENIKVSPAST NPGIPPSSLMAFLAKNPQ+SGVVLEDFDT FTN+FYQS LDDLHNINS
Sbjct: 377 LENIKVSPASTTNPGIPPSSLMAFLAKNPQVSGVVLEDFDTGFTNQFYQSYLDDLHNINS 436

Query: 421 SAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKR 480
           SAIEAAALLVAR+LYILA NKKELS S LTAIKVNTSLVEELIGCLLNC+PGLSCELVKR
Sbjct: 437 SAIEAAALLVARTLYILAINKKELSSSVLTAIKVNTSLVEELIGCLLNCDPGLSCELVKR 496

Query: 481 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCD 540
           YISPSSVCPNHYVGVILDEPSS PYP YVHDVSRFVWNFLADRTS PKENT SVCSQNCD
Sbjct: 497 YISPSSVCPNHYVGVILDEPSSAPYPDYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCD 556

Query: 541 DKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 600
           DKSEVCIGAETGKGTC ISTTR++PAYSTRLKFESGYW+VLPPNSSD LG VDPVWTESN
Sbjct: 557 DKSEVCIGAETGKGTCAISTTRYIPAYSTRLKFESGYWSVLPPNSSDHLGTVDPVWTESN 616

Query: 601 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           WNTIGLR+YTIQA AYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD
Sbjct: 617 WNTIGLRVYTIQAAAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 665

BLAST of HG10011195.1 vs. NCBI nr
Match: XP_008457115.1 (PREDICTED: nicastrin isoform X1 [Cucumis melo])

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 572/649 (88.14%), Positives = 597/649 (91.99%), Query Frame = 0

Query: 1   MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANE 60
           + S+DE  MESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVV+PMINFKDA+E
Sbjct: 35  LSSSDEQKMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVLPMINFKDADE 94

Query: 61  ILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKF 120
           IL+PSA+LVSMD ISSFFTRLQ+D HFANNVGGVLIEPGT IQNR EGFSPAQKFPQAKF
Sbjct: 95  ILEPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFSPAQKFPQAKF 154

Query: 121 APYKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDL 180
           APYKK DYEWNPI                          AASKNVK+KK+Y+SNVAEFDL
Sbjct: 155 APYKKNDYEWNPIGSGIMWNRYNFPVFLISESSISSIQEAASKNVKSKKDYVSNVAEFDL 214

Query: 181 VMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 240
           VMQTTKAGTH+SMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF
Sbjct: 215 VMQTTKAGTHSSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 274

Query: 241 RDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELD 300
           RDKSIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELD
Sbjct: 275 RDKSIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELD 334

Query: 301 LQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLP 360
           LQSDAVSGL+NRLID VFEIGSVGKSS+HG G FFAHMTEVSSSKNETWNALKLA+ESLP
Sbjct: 335 LQSDAVSGLSNRLIDMVFEIGSVGKSSNHGSGNFFAHMTEVSSSKNETWNALKLARESLP 394

Query: 361 SENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINS 420
            ENIKVSPAST NPGIPPSSLMAFLAKNPQISGVVL+DFDT FTN+FYQS LDDLHNINS
Sbjct: 395 LENIKVSPASTTNPGIPPSSLMAFLAKNPQISGVVLDDFDTGFTNQFYQSHLDDLHNINS 454

Query: 421 SAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKR 480
           SAIEAAALLVAR+LYILA NK ELS S LTAIKVNTSLVEELIGCLLNC+PGLSCELVKR
Sbjct: 455 SAIEAAALLVARTLYILAINKNELSSSVLTAIKVNTSLVEELIGCLLNCDPGLSCELVKR 514

Query: 481 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCD 540
           YI+PSSVCPNHYVGVILDEPSS PYP YVHDVSRFVWNFLADRTS PKENT SVCSQNCD
Sbjct: 515 YITPSSVCPNHYVGVILDEPSSAPYPDYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCD 574

Query: 541 DKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 600
           D+SEVCIGAETGKGTCVISTTR++PAYSTRLKFESGYW+VLPPNSSD LGAVDPVWTESN
Sbjct: 575 DRSEVCIGAETGKGTCVISTTRYIPAYSTRLKFESGYWSVLPPNSSDHLGAVDPVWTESN 634

Query: 601 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           WNTIGLR+YTIQA AYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD
Sbjct: 635 WNTIGLRIYTIQAAAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 683

BLAST of HG10011195.1 vs. NCBI nr
Match: XP_022997757.1 (nicastrin [Cucurbita maxima])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 562/646 (87.00%), Positives = 591/646 (91.49%), Query Frame = 0

Query: 4   TDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQ 63
           +DEHSMESVPDLQNSMYL VDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+EI Q
Sbjct: 20  SDEHSMESVPDLQNSMYLVVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADEIFQ 79

Query: 64  PSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPY 123
           PSAILVSMD+ISSFF RLQ+D +FA+NVGGVLI+PGTEIQ R +GFSPAQKFPQAKFAPY
Sbjct: 80  PSAILVSMDDISSFFARLQDDSNFASNVGGVLIKPGTEIQKRTKGFSPAQKFPQAKFAPY 139

Query: 124 KKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDLVMQ 183
           +KIDYEWNPI                          AASKNVK+KK Y SNVAEFDLVMQ
Sbjct: 140 QKIDYEWNPIGSGVMWNQYNFPVFLISESSISPVQEAASKNVKDKKVYTSNVAEFDLVMQ 199

Query: 184 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDK 243
           TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPIN  S D+SKP+ILTVASMDSASFFRDK
Sbjct: 200 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINI-SLDKSKPIILTVASMDSASFFRDK 259

Query: 244 SIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQS 303
           SIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELDLQS
Sbjct: 260 SIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELDLQS 319

Query: 304 DAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSEN 363
           D+VSGLNN LIDTVFEIGSVGK S+ GFG FFAHMTEVSSSKNETWNALKLAQESLP EN
Sbjct: 320 DSVSGLNNTLIDTVFEIGSVGKKSNDGFGNFFAHMTEVSSSKNETWNALKLAQESLPFEN 379

Query: 364 IKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAI 423
           IKVSPAST NPGIPPSSLMAFLAKN Q+SGVVLEDFDTSFTN+FYQS LDDLHNINSSAI
Sbjct: 380 IKVSPASTTNPGIPPSSLMAFLAKNSQVSGVVLEDFDTSFTNQFYQSHLDDLHNINSSAI 439

Query: 424 EAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYIS 483
           EAAALLVAR+LYILATNKKELS SAL AIK+NTSLVEE+IGCLLNC+PGLSCELVKRYIS
Sbjct: 440 EAAALLVARTLYILATNKKELSSSALNAIKLNTSLVEEIIGCLLNCDPGLSCELVKRYIS 499

Query: 484 PSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKS 543
           P +VCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTS PKENT SVCSQNCDDKS
Sbjct: 500 PINVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCDDKS 559

Query: 544 EVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNT 603
           EVCIGAETGKGTCV+STTR+VPAYSTRL FESG WNVLPPNSSDP+GAVDPVWTESNWNT
Sbjct: 560 EVCIGAETGKGTCVVSTTRYVPAYSTRLMFESGSWNVLPPNSSDPMGAVDPVWTESNWNT 619

Query: 604 IGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           IGLR YT+QATAYDRFVLLGGITTTIL+YFAIVAVR SI+KALK+D
Sbjct: 620 IGLRTYTVQATAYDRFVLLGGITTTILSYFAIVAVRGSIMKALKKD 664

BLAST of HG10011195.1 vs. NCBI nr
Match: XP_023525268.1 (nicastrin [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1108.2 bits (2865), Expect = 0.0e+00
Identity = 562/646 (87.00%), Positives = 593/646 (91.80%), Query Frame = 0

Query: 4   TDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQ 63
           +DEHSMESVPDLQNSMYL VDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+EILQ
Sbjct: 20  SDEHSMESVPDLQNSMYLVVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADEILQ 79

Query: 64  PSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPY 123
           PSAILVSMD+ISSFF RLQ+D +FA+NVGGVLI+PGTEIQ R +GFSPAQKFPQAKFAPY
Sbjct: 80  PSAILVSMDDISSFFARLQDDSNFASNVGGVLIKPGTEIQKRTKGFSPAQKFPQAKFAPY 139

Query: 124 KKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDLVMQ 183
           +KIDYEWNPI                          AASKNVK+KK Y SNVAEFDLVMQ
Sbjct: 140 QKIDYEWNPIGSGVMWNQYNFPVFLISESSISPVQEAASKNVKDKKVYTSNVAEFDLVMQ 199

Query: 184 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDK 243
           TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPIN  SSD+SKP+ILTVASMDSASFFRDK
Sbjct: 200 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINI-SSDKSKPIILTVASMDSASFFRDK 259

Query: 244 SIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQS 303
           SIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELDLQS
Sbjct: 260 SIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELDLQS 319

Query: 304 DAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSEN 363
           ++VSGLNN LIDTVFEIGSVGK S+ GFG FFAHMTEVSSSKNETWNALKLA+ESLP EN
Sbjct: 320 NSVSGLNNTLIDTVFEIGSVGKKSNDGFGNFFAHMTEVSSSKNETWNALKLARESLPFEN 379

Query: 364 IKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAI 423
           IKVSPAST NPGIPPSSLMAFLAKN Q+SGVVLEDFDTSFTN+FYQS LDDLHNINSSAI
Sbjct: 380 IKVSPASTTNPGIPPSSLMAFLAKNSQVSGVVLEDFDTSFTNQFYQSHLDDLHNINSSAI 439

Query: 424 EAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYIS 483
           EAAALLVAR+LYILATNKKELS SAL AIK+NTSLVEELIGCLLNC+PGLSCELVKRYIS
Sbjct: 440 EAAALLVARTLYILATNKKELSSSALNAIKLNTSLVEELIGCLLNCDPGLSCELVKRYIS 499

Query: 484 PSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKS 543
           P++VCPNHYVGVILDEPSSTPYPGYVHDVSRF WNFLADRTS  KENT SVCSQNCDDKS
Sbjct: 500 PTNVCPNHYVGVILDEPSSTPYPGYVHDVSRFAWNFLADRTSIRKENTSSVCSQNCDDKS 559

Query: 544 EVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNT 603
           EVCIGAETGKGTCV+STTR+VPAYSTRL FESG WNVLPPNSSDP+GAVDPVWTESNWNT
Sbjct: 560 EVCIGAETGKGTCVVSTTRYVPAYSTRLMFESGSWNVLPPNSSDPMGAVDPVWTESNWNT 619

Query: 604 IGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           IGLRMYT+QATAYDRFVLLGGITTTIL+YFAIVAVR SI+KALK+D
Sbjct: 620 IGLRMYTVQATAYDRFVLLGGITTTILSYFAIVAVRGSIMKALKKD 664

BLAST of HG10011195.1 vs. ExPASy Swiss-Prot
Match: Q8GUM5 (Nicastrin OS=Arabidopsis thaliana OX=3702 GN=At3g52640/At3g52650 PE=2 SV=1)

HSP 1 Score: 803.5 bits (2074), Expect = 1.7e-231
Identity = 409/644 (63.51%), Positives = 494/644 (76.71%), Query Frame = 0

Query: 8   SMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQPSAI 67
           S+ESVPDLQ  MY+AVDG+PC+RLLNLSGEIGCSNPG  KVV P+I  KD  +++QP  I
Sbjct: 33  SIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNPGINKVVAPIIKLKDVKDLVQPHTI 92

Query: 68  LVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKID 127
           LV+ DE+  FFTR+  DL FA+ +GGVL+E G+  Q +++GFSP ++FPQA+F+PY+ ++
Sbjct: 93  LVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQQKLKGFSPDKRFPQAQFSPYENVE 152

Query: 128 YEWNPIAAS---------------------KNVKNKK-----NYISNVAEFDLVMQTTKA 187
           Y+WN  A+S                       + +KK      Y S+VAEF++VM+TTKA
Sbjct: 153 YKWNSAASSIMWRNYNFPVYLLSESGISAVHEILSKKKMKHGTYTSDVAEFNMVMETTKA 212

Query: 188 GTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGA 247
           GTHNS +CL+E TCLPLGGYSVWSSLPPI+ SSS+  KPV+LTVASMD+ASFFRDKS GA
Sbjct: 213 GTHNSEACLQEGTCLPLGGYSVWSSLPPISVSSSNNRKPVVLTVASMDTASFFRDKSFGA 272

Query: 248 DSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVS 307
           DSPISGL+ALL AVDALS V+G+ +L KQLVF V TGE+WGYLGSRRFL ELDL SDAV+
Sbjct: 273 DSPISGLVALLGAVDALSRVDGISNLKKQLVFLVLTGETWGYLGSRRFLHELDLHSDAVA 332

Query: 308 GLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSENIKVS 367
           GL+N  I+TV EIGSVGK  S G   FFAH T VSS  N T +ALK+AQ+SL S+NIK+ 
Sbjct: 333 GLSNTSIETVLEIGSVGKGLSGGINTFFAHKTRVSSVTNMTLDALKIAQDSLASKNIKIL 392

Query: 368 PASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAIEAAA 427
            A TANPGIPPSSLMAF+ KNPQ S VVLEDFDT+F NKFY S LDDL NINSS++ AAA
Sbjct: 393 SADTANPGIPPSSLMAFMRKNPQTSAVVLEDFDTNFVNKFYHSHLDDLSNINSSSVVAAA 452

Query: 428 LLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYISPSSV 487
            +VAR+LYILA++ K+ S SAL +I VN S VEEL+ CLL CEPGLSC LVK YISP++ 
Sbjct: 453 SVVARTLYILASDNKDTSNSALGSIHVNASFVEELLTCLLACEPGLSCNLVKDYISPTNT 512

Query: 488 CPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQN-CDDKSEVC 547
           CP +Y GVIL EPSS PY GYV DVSRF+WNFLAD+TS  K NT SVCS+  C    EVC
Sbjct: 513 CPGNYAGVILGEPSSKPYLGYVGDVSRFLWNFLADKTSVQKGNTTSVCSKGVCSKTDEVC 572

Query: 548 IGAETGK-GTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNTIG 607
           I AE+ K GTCV+STTR+VPAYSTRLK+  G W +LP NSSD +G VDPVWTESNW+T+ 
Sbjct: 573 IKAESNKEGTCVVSTTRYVPAYSTRLKYNDGAWTILPQNSSDSMGMVDPVWTESNWDTLR 632

Query: 608 LRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           + +YT+Q +AYD  VL+ GIT T LAY  I+A +S I KALK+D
Sbjct: 633 VHVYTVQHSAYDNAVLVAGITVTTLAYIGILAAKSIITKALKQD 676

BLAST of HG10011195.1 vs. ExPASy Swiss-Prot
Match: F0ZBA6 (Nicastrin OS=Dictyostelium purpureum OX=5786 GN=ncstn PE=1 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 1.6e-40
Identity = 157/647 (24.27%), Positives = 280/647 (43.28%), Query Frame = 0

Query: 26  YPCIRLLNLSGEIGCS-----NPGREKVVVPMINFKDANEILQPSAILVSMDEISSFFTR 85
           YPC +++   G+ GCS     N G   ++    ++ +     Q   I+V +D  + F + 
Sbjct: 40  YPCTKIMTSDGQFGCSSKHGGNSGILYLIDDDESYNNYFSYSQQKDIIVVLD-TNYFNST 99

Query: 86  LQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKIDYEWNPIA------ 145
              +LH  + + G+++   T+   +   +SP  ++P   +  Y   + EWNP A      
Sbjct: 100 SVLNLHNKSKIEGIIVLTDTK---KTYPYSPDSRYPNKIYGLYPNSNLEWNPNADGFTYF 159

Query: 146 ---------------ASKNVKNKK---NYISNVAEFDLVMQTTKAGTHNSMSCLKEETCL 205
                          A +NV        Y +  AE D  MQ    G  NS +CL+   C 
Sbjct: 160 SFPFPIFAIDNQTSVAIRNVSKHNRDGQYPAWGAELDSFMQ----GAINSETCLRRGFCE 219

Query: 206 PLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGADSPISGLIALLAAVD 265
           P+GG S+WSS     +S  D+ K +IL +   D+ +FFRD SIGAD      + LL+ + 
Sbjct: 220 PVGGQSIWSSF----SSKIDKEKEIILVMLPFDTTAFFRDLSIGADQSSFATVTLLSVIK 279

Query: 266 ALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLE-LDLQSDAVSGLNNRLID------ 325
           +L+ V+     +K++VF  +  E WGY+GS  F+ + L+ Q    +   ++ ID      
Sbjct: 280 SLAAVD-RSSWNKEVVFAFWNAERWGYVGSEYFINDLLNFQCKTYNSDKSKCIDPPRADL 339

Query: 326 ------------TVFEIGSVGKSS-SHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSE 385
                       T+ E+  +G++      GK+  ++    +  +   + L     S  + 
Sbjct: 340 AFQTQINFTKISTIIELNQIGRAQLDKNLGKYSLYLHTAGTKTSSVTDILDQVASSYENS 399

Query: 386 NIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSA 445
            I   P  T    +PPSS M+FL K  +I  VV+ D D  ++N +Y  + DD  N+  S 
Sbjct: 400 TITFKP--TTQTELPPSSSMSFLKKTNKIPVVVITDHDYKYSNPYYGYEQDDNENVLGST 459

Query: 446 IEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYI 505
           +          +YIL+T    ++      I ++ + +  L  C  +    ++C  +    
Sbjct: 460 LNDI-------VYILSTFIDRIA-GGNNNITIDKNFINILYPCFTS---SITCFNILMKT 519

Query: 506 SPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDK 565
            P +  PN Y  V     ++T  P     + R +++           +T++ C+ + D  
Sbjct: 520 YPLNEVPNFYSSVFGTSLTTTLSPYETKLIHRLLYSI------TQYNSTLTNCTSDNDCP 579

Query: 566 SEVCIGAETGKGTCVISTTRFVPAYSTRLKFES--GYWNVLPPNSSDPLGAVDPVWTESN 622
           S +C       G CV S T    A S    F++    W ++  NSS       P++TESN
Sbjct: 580 SSLCY-----SGQCVSSNTHLHNALSLGFDFDTSKNVWKIV--NSS------YPIFTESN 639

BLAST of HG10011195.1 vs. ExPASy Swiss-Prot
Match: Q54JT7 (Nicastrin OS=Dictyostelium discoideum OX=44689 GN=ncstn PE=3 SV=2)

HSP 1 Score: 161.0 bits (406), Expect = 4.3e-38
Identity = 166/686 (24.20%), Positives = 301/686 (43.88%), Query Frame = 0

Query: 3   STDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNP-GREKVVVPMI----NFKD 62
           STD  S +S   +++ MY +++ YPC R++ L+G+IGCS+  G +  ++ +I    ++ +
Sbjct: 17  STDVISSQS--SIEDKMYTSLNSYPCTRIMTLNGQIGCSSSHGGDSGILYLIDSDESYHN 76

Query: 63  ANEILQPSAILVSMDEISSFFTR-LQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFP 122
                Q   I+V  D  S++F + L  +++    + G L+   T+I  +   +SP  ++P
Sbjct: 77  YFSYNQQKDIIVVFD--SNYFNKTLVLEMYSKKKMNGALVL--TDI-GKTYPYSPEDQYP 136

Query: 123 QAKFAPYKKIDYEWNP------------------------IAASKNVKNKKNYISNVAEF 182
             +F  Y   +  WNP                        I     +     Y +  AE 
Sbjct: 137 IKQFGLYPDSNLNWNPNGDGFTYMNFPFPMFALELKTSIIIRNLSTINRDGKYPAYGAEL 196

Query: 183 DLVMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSAS 242
           D  MQ    G  N+ +CL+   C P+GG S+WSS   +     DQSKP+IL +  +D+ +
Sbjct: 197 DSFMQ----GAINAETCLRRGFCEPVGGQSIWSSFSEV----IDQSKPIILVMLPIDATA 256

Query: 243 FFRDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLE 302
           FFRD + G D     L  LL+ ++ L  V+      K+++F ++  E WGY+GS  F+ +
Sbjct: 257 FFRDLATGTDQSGYALTVLLSMLNTLQGVD-KTKWDKEVIFAMWNSERWGYVGSTNFVND 316

Query: 303 L------DLQSDAVSGLNN-RLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNA 362
           L       L S+  +  ++  ++D  FE   +   + +   +F      V+S K +T N 
Sbjct: 317 LLNFNCTSLDSNNQNSCSSPPMLDLTFE--QIKFENIYAIIEFNQIGRPVNSGK-KTPNK 376

Query: 363 LKL--------------------AQESLPSENIKVSPASTANPGIPPSSLMAFLAKNPQ- 422
           L +                    +Q +   EN  +    T    +PP S M+F+ +  + 
Sbjct: 377 LDIYNLVFHPNGGAGANQLMDVFSQSTQSYENSTIQFQKTTQNELPPCSSMSFIKEINKK 436

Query: 423 -----ISGVVLEDFDTSFTNKFYQSQLDDLHNIN--SSAIEAAALLVARSLYILATNKKE 482
                I  +V+ D D  + N ++  + D+  NIN  +S +     + ++S+ +LA     
Sbjct: 437 SAPNFIGTLVITDHDYQYNNPYFGDEQDNSGNINTTTSTLFDMVQVFSKSIDLLAGGN-- 496

Query: 483 LSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYIS--PSSVCPNHYVGVILDEPS 542
                   +KV+   + E+  CL      ++C  V + +S  P +  PN Y GV    P 
Sbjct: 497 ------GTVKVDDLFIREINVCLTQ---SITCNWVTKLMSTFPYNPIPNFYSGVYGVSPV 556

Query: 543 STPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKSEVCIGAETGKGTCVISTT 602
           +   P      +RF++      T +    T      +CD  S +C+        C+ S T
Sbjct: 557 NHITP----IETRFIFRMATYLTQHRTNATNCTSDNDCDTSSSICVNK-----VCLYSNT 616

Query: 603 RFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNTIGLRMYTIQATAYDRFVL 622
            +  A S    F++   +    N+S       PV+ ESNW+   +R++ + + A + + L
Sbjct: 617 HYHNAISLAFSFDNSKSSWTIVNTS------YPVFVESNWDYTTVRLFQVGSYANEIWFL 657

BLAST of HG10011195.1 vs. ExPASy Swiss-Prot
Match: Q92542 (Nicastrin OS=Homo sapiens OX=9606 GN=NCSTN PE=1 SV=2)

HSP 1 Score: 113.2 bits (282), Expect = 1.0e-23
Identity = 165/684 (24.12%), Positives = 263/684 (38.45%), Query Frame = 0

Query: 27  PCIRLLNLSGEIGC--SNPGREKVVVPMINFKDANEILQ--PSAILVSMDEISSFFTRLQ 86
           PC+RLLN + +IGC  S  G   V+  +   +D   +L   P+   + + E   F   L 
Sbjct: 49  PCVRLLNATHQIGCQSSISGDTGVIHVVEKEEDLQWVLTDGPNPPYMVLLESKHFTRDLM 108

Query: 87  EDLH-FANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKI---------DYEWN- 146
           E L    + + G+ +       +   GFSP+ + P   F  Y            + +WN 
Sbjct: 109 EKLKGRTSRIAGLAV--SLTKPSPASGFSPSVQCPNDGFGVYSNSYGPEFAHCREIQWNS 168

Query: 147 ------------PIAASKNVKNKKNYISNVAEFDLVMQTTKAGTH--------------- 206
                       PI   ++ +N+   I    +   + Q   A T                
Sbjct: 169 LGNGLAYEDFSFPIFLLED-ENETKVIKQCYQDHNLSQNGSAPTFPLCAMQLFSHMHAVI 228

Query: 207 NSMSCLK------------EETCLPLGGYSVWSSLPPINTSSS-DQSKPVILTVASMDSA 266
           ++ +C++            E  C PL  Y+VWS L PINT+ +      V++    +DS 
Sbjct: 229 STATCMRRSSIQSTFSINPEIVCDPLSDYNVWSMLKPINTTGTLKPDDRVVVAATRLDSR 288

Query: 267 SFFRDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLL 326
           SFF + + GA+S ++  +  LAA +AL     +  L + ++F  F GE++ Y+GS R + 
Sbjct: 289 SFFWNVAPGAESAVASFVTQLAAAEALQKAPDVTTLPRNVMFVFFQGETFDYIGSSRMVY 348

Query: 327 ELDLQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQE 386
           +++     V  L N  +D+  E+G V   +S    + + H   VS       N ++    
Sbjct: 349 DMEKGKFPVQ-LEN--VDSFVELGQVALRTSL---ELWMHTDPVSQKNESVRNQVEDLLA 408

Query: 387 SLPSENIKVSPASTANPG----IPPSSLMAFL-AKNPQISGVVLEDFDTSFTNKFYQSQL 446
           +L      V       P     +PPSSL  FL A+N  ISGVVL D   +F NK+YQS  
Sbjct: 409 TLEKSGAGVPAVILRRPNQSQPLPPSSLQRFLRARN--ISGVVLADHSGAFHNKYYQSIY 468

Query: 447 DDLHNINSS-------------------AIEAAALLVARSLYILATNKKELSRSALTAIK 506
           D   NIN S                   A+   A ++ R+LY LA               
Sbjct: 469 DTAENINVSYPEWLSPEEDLNFVTDTAKALADVATVLGRALYELAGGTNFSDTVQADPQT 528

Query: 507 VNTSLVEELIGCLLNCEPGLSCELVKRYISPSSVCPNHYVGVILDEPSSTPYPGYVHDVS 566
           V   L   LI    +    +  + ++ Y+    +   HY+ V    P++T Y        
Sbjct: 529 VTRLLYGFLIKANNSWFQSILRQDLRSYLGDGPL--QHYIAV--SSPTNTTY-------- 588

Query: 567 RFVWNFLADRTSNPKENTISVCSQNCDDKSEV----------------CIGAETGK-GTC 615
             V   LA+ T       +++  + C D S+V                    ET +   C
Sbjct: 589 -VVQYALANLTG----TVVNLTREQCQDPSKVPSENKDLYEYSWVQGPLHSNETDRLPRC 648

BLAST of HG10011195.1 vs. ExPASy Swiss-Prot
Match: P57716 (Nicastrin OS=Mus musculus OX=10090 GN=Ncstn PE=1 SV=3)

HSP 1 Score: 111.7 bits (278), Expect = 3.0e-23
Identity = 131/494 (26.52%), Positives = 208/494 (42.11%), Query Frame = 0

Query: 165 NSMSCLKEETCLPLGGYSVWSSLPPINTS-SSDQSKPVILTVASMDSASFFRDKSIGADS 224
           ++ S   E  C PL  Y+VWS L PINTS   +    V++    +DS SFF + + GA+S
Sbjct: 237 STFSINPEIVCDPLSDYNVWSMLKPINTSVGLEPDVRVVVAATRLDSRSFFWNVAPGAES 296

Query: 225 PISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVSGL 284
            ++  +  LAA +AL     +  L + ++F  F GE++ Y+GS R + +++     V  L
Sbjct: 297 AVASFVTQLAAAEALHKAPDVTTLSRNVMFVFFQGETFDYIGSSRMVYDMENGKFPVR-L 356

Query: 285 NNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSS-KNETWNALKLAQESLPSENIKVSP 344
            N  ID+  E+G V   +S         M++ + S KN+  + L   ++S       V  
Sbjct: 357 EN--IDSFVELGQVALRTSLDLWMHTDPMSQKNESVKNQVEDLLATLEKSGAGVPEVVLR 416

Query: 345 ASTANPGIPPSSLMAFL-AKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNIN-------- 404
               +  +PPSSL  FL A+N  ISGVVL D   SF N++YQS  D   NIN        
Sbjct: 417 RLAQSQALPPSSLQRFLRARN--ISGVVLADHSGSFHNRYYQSIYDTAENINVTYPEWQS 476

Query: 405 -----------SSAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELI-GCLL 464
                      + A+   A ++AR+LY LA      S     +I+ +   V  L+ G L+
Sbjct: 477 PEEDLNFVTDTAKALANVATVLARALYELAGGTNFSS-----SIQADPQTVTRLLYGFLV 536

Query: 465 NCEPGLSCELVKR----YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADR 524
                    ++K     Y+    +   HY+ V    P++T Y      V ++    L  +
Sbjct: 537 RANNSWFQSILKHDLRSYLDDRPL--QHYIAV--SSPTNTTY------VVQYALANLTGK 596

Query: 525 TSNPKENTISVCSQNCDDKSEV------------CIGAETGKGT-----CVISTTRFVPA 584
            +N       +  + C D S+V              G      T     CV ST R   A
Sbjct: 597 ATN-------LTREQCQDPSKVPNESKDLYEYSWVQGPWNSNRTERLPQCVRSTVRLARA 656

Query: 585 YSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNTIGLRMYTIQATAYDRFVLLGGIT 615
            S    FE   W+    ++          W ES W  I  R++ I +   +   L+ G +
Sbjct: 657 LSP--AFELSQWSSTEYST----------WAESRWKDIQARIFLIASKELEFITLIVGFS 691

BLAST of HG10011195.1 vs. ExPASy TrEMBL
Match: A0A1S3C4C7 (Nicastrin OS=Cucumis melo OX=3656 GN=LOC103496866 PE=3 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 572/649 (88.14%), Positives = 597/649 (91.99%), Query Frame = 0

Query: 1   MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANE 60
           + S+DE  MESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVV+PMINFKDA+E
Sbjct: 35  LSSSDEQKMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVLPMINFKDADE 94

Query: 61  ILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKF 120
           IL+PSA+LVSMD ISSFFTRLQ+D HFANNVGGVLIEPGT IQNR EGFSPAQKFPQAKF
Sbjct: 95  ILEPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFSPAQKFPQAKF 154

Query: 121 APYKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDL 180
           APYKK DYEWNPI                          AASKNVK+KK+Y+SNVAEFDL
Sbjct: 155 APYKKNDYEWNPIGSGIMWNRYNFPVFLISESSISSIQEAASKNVKSKKDYVSNVAEFDL 214

Query: 181 VMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 240
           VMQTTKAGTH+SMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF
Sbjct: 215 VMQTTKAGTHSSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 274

Query: 241 RDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELD 300
           RDKSIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELD
Sbjct: 275 RDKSIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELD 334

Query: 301 LQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLP 360
           LQSDAVSGL+NRLID VFEIGSVGKSS+HG G FFAHMTEVSSSKNETWNALKLA+ESLP
Sbjct: 335 LQSDAVSGLSNRLIDMVFEIGSVGKSSNHGSGNFFAHMTEVSSSKNETWNALKLARESLP 394

Query: 361 SENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINS 420
            ENIKVSPAST NPGIPPSSLMAFLAKNPQISGVVL+DFDT FTN+FYQS LDDLHNINS
Sbjct: 395 LENIKVSPASTTNPGIPPSSLMAFLAKNPQISGVVLDDFDTGFTNQFYQSHLDDLHNINS 454

Query: 421 SAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKR 480
           SAIEAAALLVAR+LYILA NK ELS S LTAIKVNTSLVEELIGCLLNC+PGLSCELVKR
Sbjct: 455 SAIEAAALLVARTLYILAINKNELSSSVLTAIKVNTSLVEELIGCLLNCDPGLSCELVKR 514

Query: 481 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCD 540
           YI+PSSVCPNHYVGVILDEPSS PYP YVHDVSRFVWNFLADRTS PKENT SVCSQNCD
Sbjct: 515 YITPSSVCPNHYVGVILDEPSSAPYPDYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCD 574

Query: 541 DKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 600
           D+SEVCIGAETGKGTCVISTTR++PAYSTRLKFESGYW+VLPPNSSD LGAVDPVWTESN
Sbjct: 575 DRSEVCIGAETGKGTCVISTTRYIPAYSTRLKFESGYWSVLPPNSSDHLGAVDPVWTESN 634

Query: 601 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           WNTIGLR+YTIQA AYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD
Sbjct: 635 WNTIGLRIYTIQAAAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 683

BLAST of HG10011195.1 vs. ExPASy TrEMBL
Match: A0A6J1K5Z2 (Nicastrin OS=Cucurbita maxima OX=3661 GN=LOC111492619 PE=3 SV=1)

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 562/646 (87.00%), Positives = 591/646 (91.49%), Query Frame = 0

Query: 4   TDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQ 63
           +DEHSMESVPDLQNSMYL VDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+EI Q
Sbjct: 20  SDEHSMESVPDLQNSMYLVVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADEIFQ 79

Query: 64  PSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPY 123
           PSAILVSMD+ISSFF RLQ+D +FA+NVGGVLI+PGTEIQ R +GFSPAQKFPQAKFAPY
Sbjct: 80  PSAILVSMDDISSFFARLQDDSNFASNVGGVLIKPGTEIQKRTKGFSPAQKFPQAKFAPY 139

Query: 124 KKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDLVMQ 183
           +KIDYEWNPI                          AASKNVK+KK Y SNVAEFDLVMQ
Sbjct: 140 QKIDYEWNPIGSGVMWNQYNFPVFLISESSISPVQEAASKNVKDKKVYTSNVAEFDLVMQ 199

Query: 184 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDK 243
           TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPIN  S D+SKP+ILTVASMDSASFFRDK
Sbjct: 200 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINI-SLDKSKPIILTVASMDSASFFRDK 259

Query: 244 SIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQS 303
           SIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELDLQS
Sbjct: 260 SIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELDLQS 319

Query: 304 DAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSEN 363
           D+VSGLNN LIDTVFEIGSVGK S+ GFG FFAHMTEVSSSKNETWNALKLAQESLP EN
Sbjct: 320 DSVSGLNNTLIDTVFEIGSVGKKSNDGFGNFFAHMTEVSSSKNETWNALKLAQESLPFEN 379

Query: 364 IKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAI 423
           IKVSPAST NPGIPPSSLMAFLAKN Q+SGVVLEDFDTSFTN+FYQS LDDLHNINSSAI
Sbjct: 380 IKVSPASTTNPGIPPSSLMAFLAKNSQVSGVVLEDFDTSFTNQFYQSHLDDLHNINSSAI 439

Query: 424 EAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYIS 483
           EAAALLVAR+LYILATNKKELS SAL AIK+NTSLVEE+IGCLLNC+PGLSCELVKRYIS
Sbjct: 440 EAAALLVARTLYILATNKKELSSSALNAIKLNTSLVEEIIGCLLNCDPGLSCELVKRYIS 499

Query: 484 PSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKS 543
           P +VCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTS PKENT SVCSQNCDDKS
Sbjct: 500 PINVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCDDKS 559

Query: 544 EVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNT 603
           EVCIGAETGKGTCV+STTR+VPAYSTRL FESG WNVLPPNSSDP+GAVDPVWTESNWNT
Sbjct: 560 EVCIGAETGKGTCVVSTTRYVPAYSTRLMFESGSWNVLPPNSSDPMGAVDPVWTESNWNT 619

Query: 604 IGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           IGLR YT+QATAYDRFVLLGGITTTIL+YFAIVAVR SI+KALK+D
Sbjct: 620 IGLRTYTVQATAYDRFVLLGGITTTILSYFAIVAVRGSIMKALKKD 664

BLAST of HG10011195.1 vs. ExPASy TrEMBL
Match: A0A6J1GAI9 (Nicastrin OS=Cucurbita moschata OX=3662 GN=LOC111452408 PE=3 SV=1)

HSP 1 Score: 1103.2 bits (2852), Expect = 0.0e+00
Identity = 559/646 (86.53%), Positives = 592/646 (91.64%), Query Frame = 0

Query: 4   TDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQ 63
           +DEHSMESVPDLQNSMYL VDG+PCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+EILQ
Sbjct: 20  SDEHSMESVPDLQNSMYLVVDGHPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADEILQ 79

Query: 64  PSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPY 123
           PSAILVSMD+ISSFF RLQ+D +FA+NVGGVLI+PGTEIQ R +GFSPAQKFPQAKFAPY
Sbjct: 80  PSAILVSMDDISSFFARLQDDSNFASNVGGVLIKPGTEIQKRTKGFSPAQKFPQAKFAPY 139

Query: 124 KKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDLVMQ 183
           +KIDYEWNPI                          AASKNVK+KK Y SNVAEFDLVMQ
Sbjct: 140 QKIDYEWNPIGSGVMWNQYNFPVFLISESSISPVQEAASKNVKDKKIYTSNVAEFDLVMQ 199

Query: 184 TTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDK 243
           TTKAGT NSMSCLKEETCLPLGGYSVWSSLPPIN  SSD+SKP+ILTVASMDSASFFRDK
Sbjct: 200 TTKAGTRNSMSCLKEETCLPLGGYSVWSSLPPINI-SSDKSKPIILTVASMDSASFFRDK 259

Query: 244 SIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQS 303
           SIGADSPISGLIALLA+VDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELDLQS
Sbjct: 260 SIGADSPISGLIALLASVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELDLQS 319

Query: 304 DAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSEN 363
           D+VSGLNN LIDTVFEIGSVGK S+ GFG FFAHMTEVSSSKNETWNALKLAQESLP EN
Sbjct: 320 DSVSGLNNTLIDTVFEIGSVGKKSNDGFGNFFAHMTEVSSSKNETWNALKLAQESLPFEN 379

Query: 364 IKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAI 423
           IKVSPAST NPGIPPSSLMAFLAKN Q+SGVVLEDFDTSFTN+FYQS LDDLHNINSSAI
Sbjct: 380 IKVSPASTTNPGIPPSSLMAFLAKNSQVSGVVLEDFDTSFTNQFYQSHLDDLHNINSSAI 439

Query: 424 EAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYIS 483
           EAAALLVAR+LYILATNKKELS SAL AIK+NTSLVEELIGCLLNC+PGLSCELVKRYIS
Sbjct: 440 EAAALLVARTLYILATNKKELSSSALNAIKLNTSLVEELIGCLLNCDPGLSCELVKRYIS 499

Query: 484 PSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDKS 543
           P++VCPNHYVGVILDEPSSTPYPGYVHDVSRF WNFLADRTS  KENT SVCSQNCDDKS
Sbjct: 500 PTNVCPNHYVGVILDEPSSTPYPGYVHDVSRFAWNFLADRTSIRKENTSSVCSQNCDDKS 559

Query: 544 EVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNT 603
           EVCIGAETGKGTCV+STTR+VPAYSTRL F+SG WNVLPPN+SDP+GAVDPVWTESNWNT
Sbjct: 560 EVCIGAETGKGTCVVSTTRYVPAYSTRLTFDSGSWNVLPPNASDPMGAVDPVWTESNWNT 619

Query: 604 IGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           IGLRMYT+QATAYDRFVLLGGITTTIL+YFAIVAVR SI+KALK+D
Sbjct: 620 IGLRMYTVQATAYDRFVLLGGITTTILSYFAIVAVRGSIMKALKKD 664

BLAST of HG10011195.1 vs. ExPASy TrEMBL
Match: A0A6J1CNM4 (Nicastrin OS=Momordica charantia OX=3673 GN=LOC111013258 PE=3 SV=1)

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 553/647 (85.47%), Positives = 584/647 (90.26%), Query Frame = 0

Query: 3   STDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEIL 62
           S D HSMESVPDLQNSMYL VDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDA+E+ 
Sbjct: 20  SDDTHSMESVPDLQNSMYLVVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADELS 79

Query: 63  QPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAP 122
           QPSAI+VSMDEISSFF+RL++D +FANNVGGVLIEPGTE+QNR EGFSPA+KFPQA+FAP
Sbjct: 80  QPSAIVVSMDEISSFFSRLKDDSNFANNVGGVLIEPGTEMQNRTEGFSPARKFPQAEFAP 139

Query: 123 YKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDLVM 182
           Y+K+DY+WNPI                          A+SKNVKNKK Y SNVAEFDLVM
Sbjct: 140 YQKVDYDWNPIGSGIMWKQYNFPVFLISETSISSIHEASSKNVKNKKAYTSNVAEFDLVM 199

Query: 183 QTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRD 242
           QTTKAGTHNS+SCLKEETCLPLGGYSVWSSLPPIN  SSDQSKP+ILTVASMDSASFFRD
Sbjct: 200 QTTKAGTHNSVSCLKEETCLPLGGYSVWSSLPPINI-SSDQSKPIILTVASMDSASFFRD 259

Query: 243 KSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQ 302
           KSIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELDLQ
Sbjct: 260 KSIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFAVFTGESWGYLGSRRFLLELDLQ 319

Query: 303 SDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSE 362
           SD VSGLNN+LIDTVFEIGSVGKSSSHG G FFAHMTEVSSSKNETWNALK AQESLP E
Sbjct: 320 SDVVSGLNNKLIDTVFEIGSVGKSSSHGIGNFFAHMTEVSSSKNETWNALKHAQESLPFE 379

Query: 363 NIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSA 422
             K+SPAST NPGIPPSSLMAFL KN  +SGVVLEDFDT FTN+FYQS LDDL+NINSSA
Sbjct: 380 ITKISPASTTNPGIPPSSLMAFLKKNSHVSGVVLEDFDTGFTNQFYQSHLDDLYNINSSA 439

Query: 423 IEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYI 482
           IEAAALLVAR+LYILATNKKEL  SA+T+IKVNTSLVEELIGCLLNC+PGLSCELVKRYI
Sbjct: 440 IEAAALLVARTLYILATNKKELISSAITSIKVNTSLVEELIGCLLNCDPGLSCELVKRYI 499

Query: 483 SPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCDDK 542
           SP+SVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTS  KENT S CS NC+DK
Sbjct: 500 SPTSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSISKENTSSACSPNCNDK 559

Query: 543 SEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWN 602
           SEVCIGAE GKGTCVISTTR+VPAYSTRLK+ESG W VLP NSSDP+GAVDPVWTESNWN
Sbjct: 560 SEVCIGAEIGKGTCVISTTRYVPAYSTRLKYESGSWYVLPSNSSDPMGAVDPVWTESNWN 619

Query: 603 TIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           TIGLRMYTIQ TAYDRFVLLGGITTTILAYFAIVAVR SIIKALKRD
Sbjct: 620 TIGLRMYTIQTTAYDRFVLLGGITTTILAYFAIVAVRGSIIKALKRD 665

BLAST of HG10011195.1 vs. ExPASy TrEMBL
Match: A0A1S3C4S1 (Nicastrin OS=Cucumis melo OX=3656 GN=LOC103496866 PE=3 SV=1)

HSP 1 Score: 1072.0 bits (2771), Expect = 9.3e-310
Identity = 551/649 (84.90%), Positives = 575/649 (88.60%), Query Frame = 0

Query: 1   MHSTDEHSMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANE 60
           + S+DE  MESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVV+PMINFKDA+E
Sbjct: 35  LSSSDEQKMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVLPMINFKDADE 94

Query: 61  ILQPSAILVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKF 120
           IL+PSA+LVSMD ISSFFTRLQ+D HFANNVGGVLIEPGT IQNR EGFSPAQKFPQAKF
Sbjct: 95  ILEPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFSPAQKFPQAKF 154

Query: 121 APYKKIDYEWNPI--------------------------AASKNVKNKKNYISNVAEFDL 180
           APYKK DYEWNPI                          AASKNVK+KK+Y+SNVAEFDL
Sbjct: 155 APYKKNDYEWNPIGSGIMWNRYNFPVFLISESSISSIQEAASKNVKSKKDYVSNVAEFDL 214

Query: 181 VMQTTKAGTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 240
           VMQTTKAGTH+SMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF
Sbjct: 215 VMQTTKAGTHSSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFF 274

Query: 241 RDKSIGADSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELD 300
           RDKSIGADSPISGLIALLAAVDALSHV+GLDDLHKQLVF VFTGESWGYLGSRRFLLELD
Sbjct: 275 RDKSIGADSPISGLIALLAAVDALSHVDGLDDLHKQLVFVVFTGESWGYLGSRRFLLELD 334

Query: 301 LQSDAVSGLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLP 360
           LQSDAVSGL+NRLID                         VSSSKNETWNALKLA+ESLP
Sbjct: 335 LQSDAVSGLSNRLIDM------------------------VSSSKNETWNALKLARESLP 394

Query: 361 SENIKVSPASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINS 420
            ENIKVSPAST NPGIPPSSLMAFLAKNPQISGVVL+DFDT FTN+FYQS LDDLHNINS
Sbjct: 395 LENIKVSPASTTNPGIPPSSLMAFLAKNPQISGVVLDDFDTGFTNQFYQSHLDDLHNINS 454

Query: 421 SAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKR 480
           SAIEAAALLVAR+LYILA NK ELS S LTAIKVNTSLVEELIGCLLNC+PGLSCELVKR
Sbjct: 455 SAIEAAALLVARTLYILAINKNELSSSVLTAIKVNTSLVEELIGCLLNCDPGLSCELVKR 514

Query: 481 YISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQNCD 540
           YI+PSSVCPNHYVGVILDEPSS PYP YVHDVSRFVWNFLADRTS PKENT SVCSQNCD
Sbjct: 515 YITPSSVCPNHYVGVILDEPSSAPYPDYVHDVSRFVWNFLADRTSIPKENTSSVCSQNCD 574

Query: 541 DKSEVCIGAETGKGTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESN 600
           D+SEVCIGAETGKGTCVISTTR++PAYSTRLKFESGYW+VLPPNSSD LGAVDPVWTESN
Sbjct: 575 DRSEVCIGAETGKGTCVISTTRYIPAYSTRLKFESGYWSVLPPNSSDHLGAVDPVWTESN 634

Query: 601 WNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           WNTIGLR+YTIQA AYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD
Sbjct: 635 WNTIGLRIYTIQAAAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 659

BLAST of HG10011195.1 vs. TAIR 10
Match: AT3G52640.1 (Zn-dependent exopeptidases superfamily protein)

HSP 1 Score: 803.5 bits (2074), Expect = 1.2e-232
Identity = 409/644 (63.51%), Positives = 494/644 (76.71%), Query Frame = 0

Query: 8   SMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQPSAI 67
           S+ESVPDLQ  MY+AVDG+PC+RLLNLSGEIGCSNPG  KVV P+I  KD  +++QP  I
Sbjct: 33  SIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNPGINKVVAPIIKLKDVKDLVQPHTI 92

Query: 68  LVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKID 127
           LV+ DE+  FFTR+  DL FA+ +GGVL+E G+  Q +++GFSP ++FPQA+F+PY+ ++
Sbjct: 93  LVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQQKLKGFSPDKRFPQAQFSPYENVE 152

Query: 128 YEWNPIAAS---------------------KNVKNKK-----NYISNVAEFDLVMQTTKA 187
           Y+WN  A+S                       + +KK      Y S+VAEF++VM+TTKA
Sbjct: 153 YKWNSAASSIMWRNYNFPVYLLSESGISAVHEILSKKKMKHGTYTSDVAEFNMVMETTKA 212

Query: 188 GTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGA 247
           GTHNS +CL+E TCLPLGGYSVWSSLPPI+ SSS+  KPV+LTVASMD+ASFFRDKS GA
Sbjct: 213 GTHNSEACLQEGTCLPLGGYSVWSSLPPISVSSSNNRKPVVLTVASMDTASFFRDKSFGA 272

Query: 248 DSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVS 307
           DSPISGL+ALL AVDALS V+G+ +L KQLVF V TGE+WGYLGSRRFL ELDL SDAV+
Sbjct: 273 DSPISGLVALLGAVDALSRVDGISNLKKQLVFLVLTGETWGYLGSRRFLHELDLHSDAVA 332

Query: 308 GLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSENIKVS 367
           GL+N  I+TV EIGSVGK  S G   FFAH T VSS  N T +ALK+AQ+SL S+NIK+ 
Sbjct: 333 GLSNTSIETVLEIGSVGKGLSGGINTFFAHKTRVSSVTNMTLDALKIAQDSLASKNIKIL 392

Query: 368 PASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDLHNINSSAIEAAA 427
            A TANPGIPPSSLMAF+ KNPQ S VVLEDFDT+F NKFY S LDDL NINSS++ AAA
Sbjct: 393 SADTANPGIPPSSLMAFMRKNPQTSAVVLEDFDTNFVNKFYHSHLDDLSNINSSSVVAAA 452

Query: 428 LLVARSLYILATNKKELSRSALTAIKVNTSLVEELIGCLLNCEPGLSCELVKRYISPSSV 487
            +VAR+LYILA++ K+ S SAL +I VN S VEEL+ CLL CEPGLSC LVK YISP++ 
Sbjct: 453 SVVARTLYILASDNKDTSNSALGSIHVNASFVEELLTCLLACEPGLSCNLVKDYISPTNT 512

Query: 488 CPNHYVGVILDEPSSTPYPGYVHDVSRFVWNFLADRTSNPKENTISVCSQN-CDDKSEVC 547
           CP +Y GVIL EPSS PY GYV DVSRF+WNFLAD+TS  K NT SVCS+  C    EVC
Sbjct: 513 CPGNYAGVILGEPSSKPYLGYVGDVSRFLWNFLADKTSVQKGNTTSVCSKGVCSKTDEVC 572

Query: 548 IGAETGK-GTCVISTTRFVPAYSTRLKFESGYWNVLPPNSSDPLGAVDPVWTESNWNTIG 607
           I AE+ K GTCV+STTR+VPAYSTRLK+  G W +LP NSSD +G VDPVWTESNW+T+ 
Sbjct: 573 IKAESNKEGTCVVSTTRYVPAYSTRLKYNDGAWTILPQNSSDSMGMVDPVWTESNWDTLR 632

Query: 608 LRMYTIQATAYDRFVLLGGITTTILAYFAIVAVRSSIIKALKRD 624
           + +YT+Q +AYD  VL+ GIT T LAY  I+A +S I KALK+D
Sbjct: 633 VHVYTVQHSAYDNAVLVAGITVTTLAYIGILAAKSIITKALKQD 676
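
The percentages in the HSP header can be reproduced from the raw counts. The sketch below is only an illustration, assuming each percentage is the count divided by the alignment length and rounded to two decimals; the function name `percent` is hypothetical and not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format a count as a percentage of the alignment length, two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# (identities, positives, alignment length) copied from the three TAIR 10 HSP headers
hsps = {
    "AT3G52640.1": (409, 494, 644),
    "AT3G52640.2": (409, 494, 673),
    "AT3G52640.3": (356, 427, 586),
}

for name, (identities, positives, length) in hsps.items():
    print(name, percent(identities, length), percent(positives, length))
```

The same identity count (409) gives a lower percentage for AT3G52640.2 than for AT3G52640.1 because the alignment is longer (673 versus 644 columns, including gaps).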

BLAST of HG10011195.1 vs. TAIR 10
Match: AT3G52640.2 (Zn-dependent exopeptidases superfamily protein )

HSP 1 Score: 787.7 bits (2033), Expect = 6.7e-228
Identity = 409/673 (60.77%), Positives = 494/673 (73.40%), Query Frame = 0

Query: 8   SMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQPSAI 67
           S+ESVPDLQ  MY+AVDG+PC+RLLNLSGEIGCSNPG  KVV P+I  KD  +++QP  I
Sbjct: 33  SIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNPGINKVVAPIIKLKDVKDLVQPHTI 92

Query: 68  LVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKID 127
           LV+ DE+  FFTR+  DL FA+ +GGVL+E G+  Q +++GFSP ++FPQA+F+PY+ ++
Sbjct: 93  LVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQQKLKGFSPDKRFPQAQFSPYENVE 152

Query: 128 YEWNPIAAS---------------------KNVKNKK-----NYISNVAEFDLVMQTTKA 187
           Y+WN  A+S                       + +KK      Y S+VAEF++VM+TTKA
Sbjct: 153 YKWNSAASSIMWRNYNFPVYLLSESGISAVHEILSKKKMKHGTYTSDVAEFNMVMETTKA 212

Query: 188 GTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGA 247
           GTHNS +CL+E TCLPLGGYSVWSSLPPI+ SSS+  KPV+LTVASMD+ASFFRDKS GA
Sbjct: 213 GTHNSEACLQEGTCLPLGGYSVWSSLPPISVSSSNNRKPVVLTVASMDTASFFRDKSFGA 272

Query: 248 DSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVS 307
           DSPISGL+ALL AVDALS V+G+ +L KQLVF V TGE+WGYLGSRRFL ELDL SDAV+
Sbjct: 273 DSPISGLVALLGAVDALSRVDGISNLKKQLVFLVLTGETWGYLGSRRFLHELDLHSDAVA 332

Query: 308 GLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSENIKVS 367
           GL+N  I+TV EIGSVGK  S G   FFAH T VSS  N T +ALK+AQ+SL S+NIK+ 
Sbjct: 333 GLSNTSIETVLEIGSVGKGLSGGINTFFAHKTRVSSVTNMTLDALKIAQDSLASKNIKIL 392

Query: 368 PASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDL------------ 427
            A TANPGIPPSSLMAF+ KNPQ S VVLEDFDT+F NKFY S LDDL            
Sbjct: 393 SADTANPGIPPSSLMAFMRKNPQTSAVVLEDFDTNFVNKFYHSHLDDLCKKSHSLSFSSF 452

Query: 428 -----------------HNINSSAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSL 487
                             NINSS++ AAA +VAR+LYILA++ K+ S SAL +I VN S 
Sbjct: 453 RSKPHFALLIPFWCCIAANINSSSVVAAASVVARTLYILASDNKDTSNSALGSIHVNASF 512

Query: 488 VEELIGCLLNCEPGLSCELVKRYISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWN 547
           VEEL+ CLL CEPGLSC LVK YISP++ CP +Y GVIL EPSS PY GYV DVSRF+WN
Sbjct: 513 VEELLTCLLACEPGLSCNLVKDYISPTNTCPGNYAGVILGEPSSKPYLGYVGDVSRFLWN 572

Query: 548 FLADRTSNPKENTISVCSQN-CDDKSEVCIGAETGK-GTCVISTTRFVPAYSTRLKFESG 607
           FLAD+TS  K NT SVCS+  C    EVCI AE+ K GTCV+STTR+VPAYSTRLK+  G
Sbjct: 573 FLADKTSVQKGNTTSVCSKGVCSKTDEVCIKAESNKEGTCVVSTTRYVPAYSTRLKYNDG 632

Query: 608 YWNVLPPNSSDPLGAVDPVWTESNWNTIGLRMYTIQATAYDRFVLLGGITTTILAYFAIV 624
            W +LP NSSD +G VDPVWTESNW+T+ + +YT+Q +AYD  VL+ GIT T LAY  I+
Sbjct: 633 AWTILPQNSSDSMGMVDPVWTESNWDTLRVHVYTVQHSAYDNAVLVAGITVTTLAYIGIL 692

BLAST of HG10011195.1 vs. TAIR 10
Match: AT3G52640.3 (Zn-dependent exopeptidases superfamily protein )

HSP 1 Score: 674.9 bits (1740), Expect = 6.4e-194
Identity = 356/586 (60.75%), Positives = 427/586 (72.87%), Query Frame = 0

Query: 8   SMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDANEILQPSAI 67
           S+ESVPDLQ  MY+AVDG+PC+RLLNLSGEIGCSNPG  KVV P+I  KD  +++QP  I
Sbjct: 33  SIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNPGINKVVAPIIKLKDVKDLVQPHTI 92

Query: 68  LVSMDEISSFFTRLQEDLHFANNVGGVLIEPGTEIQNRMEGFSPAQKFPQAKFAPYKKID 127
           LV+ DE+  FFTR+  DL FA+ +GGVL+E G+  Q +++GFSP ++FPQA+F+PY+ ++
Sbjct: 93  LVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQQKLKGFSPDKRFPQAQFSPYENVE 152

Query: 128 YEWNPIAAS---------------------KNVKNKK-----NYISNVAEFDLVMQTTKA 187
           Y+WN  A+S                       + +KK      Y S+VAEF++VM+TTKA
Sbjct: 153 YKWNSAASSIMWRNYNFPVYLLSESGISAVHEILSKKKMKHGTYTSDVAEFNMVMETTKA 212

Query: 188 GTHNSMSCLKEETCLPLGGYSVWSSLPPINTSSSDQSKPVILTVASMDSASFFRDKSIGA 247
           GTHNS +CL+E TCLPLGGYSVWSSLPPI+ SSS+  KPV+LTVASMD+ASFFRDKS GA
Sbjct: 213 GTHNSEACLQEGTCLPLGGYSVWSSLPPISVSSSNNRKPVVLTVASMDTASFFRDKSFGA 272

Query: 248 DSPISGLIALLAAVDALSHVEGLDDLHKQLVFGVFTGESWGYLGSRRFLLELDLQSDAVS 307
           DSPISGL+ALL AVDALS V+G+ +L KQLVF V TGE+WGYLGSRRFL ELDL SDAV+
Sbjct: 273 DSPISGLVALLGAVDALSRVDGISNLKKQLVFLVLTGETWGYLGSRRFLHELDLHSDAVA 332

Query: 308 GLNNRLIDTVFEIGSVGKSSSHGFGKFFAHMTEVSSSKNETWNALKLAQESLPSENIKVS 367
           GL+N  I+TV EIGSVGK  S G   FFAH T VSS  N T +ALK+AQ+SL S+NIK+ 
Sbjct: 333 GLSNTSIETVLEIGSVGKGLSGGINTFFAHKTRVSSVTNMTLDALKIAQDSLASKNIKIL 392

Query: 368 PASTANPGIPPSSLMAFLAKNPQISGVVLEDFDTSFTNKFYQSQLDDL------------ 427
            A TANPGIPPSSLMAF+ KNPQ S VVLEDFDT+F NKFY S LDDL            
Sbjct: 393 SADTANPGIPPSSLMAFMRKNPQTSAVVLEDFDTNFVNKFYHSHLDDLCKKSHSLSFSSF 452

Query: 428 -----------------HNINSSAIEAAALLVARSLYILATNKKELSRSALTAIKVNTSL 487
                             NINSS++ AAA +VAR+LYILA++ K+ S SAL +I VN S 
Sbjct: 453 RSKPHFALLIPFWCCIAANINSSSVVAAASVVARTLYILASDNKDTSNSALGSIHVNASF 512

Query: 488 VEELIGCLLNCEPGLSCELVKRYISPSSVCPNHYVGVILDEPSSTPYPGYVHDVSRFVWN 537
           VEEL+ CLL CEPGLSC LVK YISP++ CP +Y GVIL EPSS PY GYV DVSRF+WN
Sbjct: 513 VEELLTCLLACEPGLSCNLVKDYISPTNTCPGNYAGVILGEPSSKPYLGYVGDVSRFLWN 572

The following BLAST results are available for this feature:

Match Name      E-value   Identity  Description
XP_038896057.1  0.0e+00   91.06     nicastrin [Benincasa hispida]
XP_004140732.1  0.0e+00   88.60     nicastrin [Cucumis sativus] >KGN57470.2 hypothetical protein Csa_011284 [Cucumis...
XP_008457115.1  0.0e+00   88.14     PREDICTED: nicastrin isoform X1 [Cucumis melo]
XP_022997757.1  0.0e+00   87.00     nicastrin [Cucurbita maxima]
XP_023525268.1  0.0e+00   87.00     nicastrin [Cucurbita pepo subsp. pepo]

Match Name  E-value   Identity  Description
Q8GUM5      1.7e-231  63.51     Nicastrin OS=Arabidopsis thaliana OX=3702 GN=At3g52640/At3g52650 PE=2 SV=1
F0ZBA6      1.6e-40   24.27     Nicastrin OS=Dictyostelium purpureum OX=5786 GN=ncstn PE=1 SV=1
Q54JT7      4.3e-38   24.20     Nicastrin OS=Dictyostelium discoideum OX=44689 GN=ncstn PE=3 SV=2
Q92542      1.0e-23   24.12     Nicastrin OS=Homo sapiens OX=9606 GN=NCSTN PE=1 SV=2
P57716      3.0e-23   26.52     Nicastrin OS=Mus musculus OX=10090 GN=Ncstn PE=1 SV=3

Match Name  E-value   Identity  Description
A0A1S3C4C7  0.0e+00   88.14     Nicastrin OS=Cucumis melo OX=3656 GN=LOC103496866 PE=3 SV=1
A0A6J1K5Z2  0.0e+00   87.00     Nicastrin OS=Cucurbita maxima OX=3661 GN=LOC111492619 PE=3 SV=1
A0A6J1GAI9  0.0e+00   86.53     Nicastrin OS=Cucurbita moschata OX=3662 GN=LOC111452408 PE=3 SV=1
A0A6J1CNM4  0.0e+00   85.47     Nicastrin OS=Momordica charantia OX=3673 GN=LOC111013258 PE=3 SV=1
A0A1S3C4S1  9.3e-310  84.90     Nicastrin OS=Cucumis melo OX=3656 GN=LOC103496866 PE=3 SV=1

Match Name   E-value   Identity  Description
AT3G52640.1  1.2e-232  63.51     Zn-dependent exopeptidases superfamily protein
AT3G52640.2  6.7e-228  60.77     Zn-dependent exopeptidases superfamily protein
AT3G52640.3  6.4e-194  60.75     Zn-dependent exopeptidases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description        Source       Source Term  Source Description          Alignment
None       No IPR available       PFAM         PF05450      Nicastrin                   coord: 200..404; e-value: 1.1E-64; score: 218.1
None       No IPR available       GENE3D       3.40.630.10  Zn peptidases               coord: 137..428; e-value: 5.2E-19; score: 70.8
None       No IPR available       SUPERFAMILY  53187        Zn-dependent exopeptidases  coord: 172..411
IPR041084  Nicastrin, small lobe  PFAM         PF18266      Ncstrn_small                coord: 27..135; e-value: 3.6E-23; score: 82.4
IPR008710  Nicastrin              PANTHER      PTHR21092    NICASTRIN                   coord: 135..622; coord: 7..134

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
HG10011195    HG10011195   gene


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name                              Type
HG10011195.1-cds  HG10011195.1-cds-Chr01:3383127..3383250  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3383360..3383474  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3384402..3384484  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3384652..3384728  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3388258..3388329  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3388444..3388517  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3388635..3388767  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3388856..3388927  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3393725..3393844  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3398471..3398542  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3400492..3400632  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3400810..3400894  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3401004..3401442  CDS
HG10011195.1-cds  HG10011195.1-cds-Chr01:3402661..3402925  CDS
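
As a consistency check, the 14 CDS intervals listed above can be summed and compared with the sequence length of 1872 reported for this mRNA. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (GFF-style); the variable and function names are illustrative only.

```python
# (start, end) pairs on Chr01, copied from the CDS unique names listed above
cds_intervals = [
    (3383127, 3383250), (3383360, 3383474), (3384402, 3384484),
    (3384652, 3384728), (3388258, 3388329), (3388444, 3388517),
    (3388635, 3388767), (3388856, 3388927), (3393725, 3393844),
    (3398471, 3398542), (3400492, 3400632), (3400810, 3400894),
    (3401004, 3401442), (3402661, 3402925),
]


def interval_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


total_cds = sum(interval_length(s, e) for s, e in cds_intervals)
print(total_cds)       # total coding length in nucleotides
print(total_cds % 3)   # 0 if the length is a whole number of codons
print(total_cds // 3)  # number of codons
```

Under the inclusive-coordinate assumption the intervals sum to 1872 nt, which agrees with the listed sequence length and corresponds to 624 codons.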


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name           Type
HG10011195.1  HG10011195.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016485      protein processing
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005887      integral component of plasma membrane