CaUC01G012040 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC01G012040
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionNicalin
LocationCiama_Chr01: 22297207 .. 22315099 (-)
RNA-Seq ExpressionCaUC01G012040
SyntenyCaUC01G012040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGTGTTTTGTTAAAAGAAAAAAAAAGAAAAAAAACACTTCCGAAGGGGCACTTACTCTAAGAGTCAGAACATTTAACAAAGCTTGGGGTCAAGTGAGGTGTATGCAGGCGGTGGGCATCTCTCCTCAACGGCGAGAACACCAGAAAGAGCCGCGAGCTGATTCAATCTCCATGTGCGCATCATCCATTTTATTCTAATCAATTCCCTTCCAAATCTCATCCTTCTACTTCTCATATCTGCCAACGCCCTTTCCTCTCCAGTCTCACCTCCAGATCTTGCATCCATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTCTCGAATCCTTCTACCCCGTCGTCGCCCTTGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCCGCCACAGTCGTCGATGTCTACCGTCTCATTCACTACGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCCTCTGCTGCTGATCTCTCTCGCTCCGTTCTCATCATTCCTCTTTCTGAACTCAATATCACCTTTCTCCAAGGTACCATTCTCACTTTCTTAGCTGCTGGATCCTCGTCGTTAATATGTGGAGATTTTAGTAAACCACTTGTGAACTGTATTCAACTGGCTGCGCCTTGAATTACTTTAATAGATACAGTTTTGGATAGTTTCTTACTTTGTGTATTTGTATTCAAATATTCAAAATTTGGGAGGTTTTAATCTTCTTTCTATTACCAATCAGTAGTGTGTTTGTAGACTTGCAGAACTTTGTTAATCTAATTTCTCTTCTATTTGAAGGAAGAAGAATATGATTGTAGGCTTTGTTCTTACTTCTTAACTTTCTACATATCAGAAATCATGCCTGTTTTTCTCAGTGAGAAATTATGTGAACTTTAATATATGTATATTTTATAAGGATTTAGGGTTTACTACTCCCGCTCGTCTTTCACCTGTTGGTAACCGGGCTATCTTTTGGTCCTGTGCAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGATTTTGCTTCCCAGGTTTCTTGGCTCAGATGGCCCGAAAAATGATGACATTAAATGTCCACATAATGGAGAGGGGATGATCAAGGATTTATTGGTTGAACTTGAACGGTTGCTCATACATTCTACTATACCTGTGAGTGAAATGAAAGTAACTTCTGGTGAATCTTCATCATTTCACCAGACATTTTAATAATTAACAGTTATTGAGTTCTCATTTTCTGAATTTTCTATGTTTATTCTTTCTATGTTTATTCTGAATTCTGTTTTATATTCTGTTCCTTCTATACATGGTTGAAGCTTTTAAGCATTTAGCTTCCATTTATATTCATCTAAACTTTATATTCTTTCTACATTAAATAAATGAATGTTATCTACTCTCAGTTTGTGAGTTATGACCACCTTACCTTCTTGTATTCCTTTACTTCAATTTATATTTTCAGTTTCTATAATGTGTTTGCGTTTCTGAGAGTTTCTATGTCGTTCCAGTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTAATGCTGTTTTGGCTGATGTCAAGAACAATGACGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTTAGTTGAATCTATTATGAAATTATGTGCGCTGTAGTGGATTTTATGTGTTTAACGGAAATCACAATAGAGAAAGGAGAGAATGGTTACTTTTGGCATATAAATATATTTGTATAATTTATATAATTTTCTACTATGTTTTATATTTTAATTCTGATTTTGATATGATTATTTAAGGGTTGTTATTATTATTGTTATTTATTATTTATTATTTATTTATTTTTTTTGTTGCAAGCTTACCTTCTATTGCAACAGACAACAACATTAAGTTAGAATGTGTTGAGGGCAAAATCGTGTGCAAGTGTGTTGAGGGCAAAATCGTTTGCAAGTGTGTTGAGGGCAAAATCGTTTGCAAGTGTGTTTGAGGGCAAAATCGTTTGCAAGTGTGTTGAGGGCAATATCTATGAGTCGTTTGAGGTTGGTGAGGATAGGGTTGTCCTTTCCCATCTTCAATTCTGGAATGACACCCTCTTTTTCTGTTCTGGTAAGGAGCCGTCATTCTTGATTTTGAATAATGTTCTTGCCTTCTTTGAAGAGATGTATGGGTTGAGAATTAATAGAGGGAAGTGTTGTGTTATGGGGATTAATTGTGGTGAGGAGAAACTCAATCGTTGGGCCGAAATGGGGGGGGTGATGTCGGCAGTCTTCCCTCTTCTTACCTTGGTCTCCCTCTTGGGAGCATCCCTAACTCGAGTTCCTTATGGGATTTGGTGGTGGATAGAGTTCGGAAGAGGTACGCTTCTTGGAAAAAAGGTTTTCTTTTCTGAAGTTGGTAGGCTCATCTTGATGAAATCTATCTTGAGTGGCATTCATGTCTACTTCTTTTCCCTGTTTAGAGCCTTGTGTGAAGTGTGTAAGAAAATTGAGAGGCTCATGCATGATTTTCTTTGGGAAGGGATTGAAGAAGCTAAGGGCCAGGGATCGCATTTGATTAATTGGGATGTTGAGAGACCAGTCTCCTAGGGGGAGGGGGCTTGGAATTGGGAAGCTTAGGGTGTGAAATAGAGTTTTGTTAGACAAATGCCTGTGGCATTTTTCCTCTGAGCCCAACTCTTTGTGATGCAGGATCATAGCTAGTAAACATAGTCTCCATCCTTTTTAGTGGCTATCTAAGGGAGCTAAGGGCACTTACCAGAATTTGCAGAAAGATATTTTGGAGATGTTAGGAACCAAGATAGTATGGCTACCTTTGTCCCACATTGGTTAGAATGGGACGACCAATTGGCTTGACTCTCCCACCTCAATACCTAACTTTAGGGTGTGGTTCACCAAGGTGCTTAAGTACCTAACAATGGTATCATCAGATCGGGTGTGACCGAGTGAAGTATCGTCGAAGTAGCCGTCGGGACATCGACTTTTCGGGGATAGGGTATGGTTAGGAACCAAGGTAATATGTGTTACCTTTGTCCTACATTGGTTAGAATGGGATGACCAATGTGGTACTTAGGTGGCTTGACTCTCTCACCTCAATAGCTAGCTTTTGGGGTGTGGTTCTCCAAGGTGCTTAAGTACCTAACACTCTCTTCTTTTTTGTTCCTTTGGTTCGTTGTGTGGTGGGGGAGGGGGAGTGCAACATACTTTTGGGAAGATCATTGGGTGGGGGAGAGACCTCTCTCTGCTCTTTTCCCTCGTTTGTTTCACCTGTCGTCTCTCAAAAATCACTTTGTTGGTGGTTTTCTTGTATGGTCTGGGAGCTCTTGTTCTTTTTCTTTTGGGTTCCGTCGTTTTTTTCCCCTTTTTGATAGGGAGGCGAAGGAGGTGACTGCCCTTTAGATTAAGGAGAAGGAATGTTCGTGTTTGGAGCCCCATTCCTTTGGAAGGGTTTTCATGCAAGTCTTTCTTTCCGTGTTTGGTCAATCCCTTTCCTATAGGTGTATCAGTTTTTTCGGTTCTTCGAAGGATTAAGATTCCTAGGAAGGTGAGGTTCTTTACTTGGCAGGTCCTTCATGATTGTGCTAACACATTGGATAGGCTTGCAAGGAAGTTGCCTTCGCTTGTTGGGCCGTTCTGTTGTATTCTTTGTCGGAAGGCGGAGGAAGACCTGGACCATATTCTCTAGTGTTGTAACTTTGTGATATCTGTTTGGGATTTATTCATTCAGACGTTTGGTATGTCGTACGTTCATCATAGAGATGCTAGTGCTATGATCAAGGAGTTCCTCCTCAATCTGCCTTTTGGGGAGAAGGGTTGCTTCTTAAGGACGGTGGGAGTGTGTGTGATTTTATAGGTCTTGTGGGGTGAGTGGAACAATAGTGTGTTCAGGGGGATGGATAGGGATCCTAGGGAGACTTGGTCCCTTGCCTGCTATCATGTGCTATCATGTCTGTTTGTGGGCTTCAATTTCAAAGACCTTGTGTAACTATTCTATACGAATGATCTCTTATAGTTGGAGCCCCTTTGTAACATCTCGAGTTCCAGATAAATTGGAATTAATTATATCAAATTGTTGGGCTTTGATTCCCTTTAATTTTTGGATTTTGGAGCCAATCTAAAACAATTGAGAAGATCATTAATTTGATGAGGATTTAAATTGATTAAATGTTTGGAAGGATTTAATTTATCTCAAACTATTGAATTTTATGTTCCCTTTAATTTTGGTTTTTGAACCAAAATGGGGCAAATTTGAGAAGATAATTGGTTTAGATTAATTGGCAAGAGAATTTGAGATTAGTTGGTATTAGAGCCCAAGATTATAGGTTCTGTAGAGTCACTTACAATGTAAGTCCAGAATGTCCCTATGGCCCATTAACGAAATCTTCGTCATCGCCAAGTATGTTCTTTAAGTAAAGGTTTACAGAATTGTATGCATAGTGTTTGTTTGATTTAATGCATTCGTAGATTATGTTTTCTTTGACTGAGAATTTGAGATGGGGTAGTTATGTATGGATGTTAGGAGATGACATGTTGAGTCAGAGGAGGTAGAGGAGGCATAAACGTTAGAAGAGGGATATGGGACCCTCATGTTCAGGGCTCACAAGTGTAGGACCCTTAGGTTTGAGACCCTCTTGCACCGCAGGATCTCCCTCAACCACTTAACCCTTAGGTCAAGGGAACAAGGAATTGCACCGGGAATTCAGACCCATAATGTACATGTGCCTCTGTACCTCCGTAGACAAAGACTCCACCTTTCTTCTTGGGTGATTATGCTGCAATTGTAGTGTTGATATGATAAGTTGTTACAATAACTGTGATGGGAAAGGTAAATGACTTAGTGGCGAAACAAATAGCTTAGCATGCCCAATCTCAACAAATACCACCTCACCCCAACGATTCCAGTGCCACATGTTCAAACTTAGAATCAAAATTTACAAGACCATCACTAGAAGCAAAACACTTGTGAGATTTTAGGAAGTATAATCCCATGCATTTGATGGGTCATTGAAGGACCCTACCAATGCGAAACTATGGTCGTCTTCGATAGAGACAATTTTCATTATATGAAATGCTTGGATGGCCAGAAGCTTCAGTGTGCTGTTTTCATGCTTACAAATAATGCCAAAATTTGGTGGCAATCAGCGGAAAGGATATGGATACCAGTGGTGGACCAGGAACATGGGATCAATTCAAAGAATGCCTCTACTAAAATATTTCTTGGCTAACCTGAGATACAATAAATGGAGTGCATTTCTAGACTTGAAGCAAGACGCCATAACTTGAGATACAACAAGCATGGTGCCTTGCTTCAAATCTAGAAGATATACAGCTGACTTTTCCTTTTCGACAGTTGATGACTGGGAAGATTGTTGCCTGAAGTGTTGTAGGTCTATCTGAGTCTCGACTTTTCTATCGAGGCCCGGGAGCCTTTTGATTTACCTTCCTTTTCTGACCTGAGGAGGATCCTAGTCCTATGGCCTCAGGTGTCCTTTCTGGAGTGTCTATTCAAGTTGCTATATGGGGAGTGCTTTTGACCGTTAATGACTGGGAAGACTGTTGCTTAAAGTGTTGTATTGGTCTATCTGAGTCCTGACTTTTCTGCTAAGGCTCAGCCTTTTGATCTGCCTTCCTTTTCTAACCCGAGGAGGATCCTAGTCCTAACGACCTCAGGTGTTCAGTCCTTGCCAGGGAAGTCTATTCGAGTTGCCATTAGAGGGTGTCCCTTTTTGTGGGCTTGTATATTTTTGTTTGCGCGTGTATTCTTTCATTTTATTCTTAATAGAAGTTGTTATTTTCATTTTAAAAAAACAACATTAAGTTGGAATGTTAAATGGGTGCTAAAATAAGAAAAAGTTGGAATGTTAAATCATATCATGTTTCTCTTTATCTTTTTTTATCTTTTATTTTCGCTACTACCTCCAAAATTTCCCCTTAATTATCTTTGTCAATTCCCCCTTTTGCATAAAAGATTGGCGATGATTTTTCTATTAACTTCCTTCATTCTTTAGTTCATACTCATGTAAGAGTATTAAATACCAATGTTGGTGAAAATGTTGAGATCTCATTTTTGGTAATTTTGATCTAGGCATCGTTTCATCTCTCAATGAGAATTCTTTTGGATTAGGGCTGCTTTTGTACTTGGGTTTGGGGCTCTTTTTGTTATGGGGTTATTTCTTTATATGCCCTTGAATATCCTTTCAACTCTCAATGAAAATTTGGTTTTTTTATTAAAAAAAGTGTGATTATTATTCATGTCATTTTTTTTGTGAAATTTCTAATATGTCTGGACACTTTTTGATATGGTATTGAAATTTCCTCAATACTAAAAAAAACGTCCATCAACCTATATTTTTTTGATATCAAATGATTCAGTTTCTCCCTCTTGCCTCTTGTATATTGTTCAAAAAAAAAAAAAAAAAAAATCATTCAGTTTCTGATTCTCATTAAAAAGGTACTTGCAACATAAACTGTCATTGTGCCATTTGCAAGTGCTTTTATTTAATAATTTCATATTATCTTGAATCTTTTTTCTCATAAAGGAAACTTGAGAGCTTGAGGATAAGTCTATAATTGCTAATACTCTTCTCTGTATTCTTAAATGTTTGCAGGTACAAGCTTGTTGTTTCGGCAGCAGAACCAAGAAAACTTGTATCTTCCACGATTACAAATATTCAGGTACCTTTTCCAGTTTCCTAGCATTTGAATAGCTAGAGTTATTGCATATAATTATTTATTTGAATACATTTATCTCAACAAACAGCTTCACTAAAACAAGCTAGGATGCACATATGCATTATGCTTCTAAGCAACATGTTAGAGTCCAAGTGATTGTCTTGTCATCATATATATTAGCACATTAGAAAGCTTTTACACAGAAGAACTGGTTGGCTTGTTGATCTTATAGTTATATATATTTGTATAAGTATTTTGGTAAGAAACTAAACTTTCATTGAGCAAAATGAAGGAATATACCAGGGCATACACACACACACACACACACAAAAAAAGGCCAACACAACCCATAAGAATGGGCTCCAATCCGAAAAACTCAAACCAATATCATAATTACAAAAAGGCCCAGTGGCCGATGCCCATAAAGAGGCAATAAACCATGTCCCATCCTATACCTTATCACATGACTTCTCCACCCCTCTAAATAATCTATTGATTATCTCGAGCCAAACACCCCACAAAATAGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTAAACCAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAAAAAAGAAAAGAAAAGGCCACAATACTTTGCAATTCTCTCAAAATGGTGAACTTAGGAGCACCTCCTCAATCCTAGCATAACCGTCCCACTTCTTTTTAGGGCATAAACGAAATGAGTTCAGCCGCCAAGACTTTAGAGTCAAACAAACTGACACCCCACAATAATGCAAGTTCGTTGGGCTCTGATGTAAAGGTTGCACCATTGCAGGAGTAACGCAAAGGAAGAATGTCTCTGAGTACGGTCCACGTTGTTGACTCTTCCATGTAAAACCTGGCATACGAAAAAGAATACCTTCCTAGAGATTGTAACCTTTCAGAGTGTGGAAAAGGCATAGGCAGGTTGTTGGCCTTTGAAACCCTAAGATAACGAGGGGGGGATGATCAAATAGGAGCCACAAAATGCAACTTCTTACTTGAAATATGATATAAACATGGGTACAACTCGCAAAGGAGTTTCTCATTTGCCCACCAATCCTCCCAAAATATACACTGGTTCCTTCCCCAACAGAGCATCTAACCACAAAGAAAAAGATGAGAAGACAAGGGAAATCACCTAAGAAGCACGGGACACGACATGGCACGAACATAGCGATCCGTAAGACACAAACACATTGGAGATACTTTTTTTTTAATACAAAATATGCTATATTGATATATTCCTACTTTTGGATAAGACATGCAAATTTAACATAAAAACAAAATATTCTTCACTATAAGCATATTAAGTTAAGTCAACCAATGATATTTGAAATCTAAAATATAAAAGATTTAGTTGGGTATGACTAGAAGATGAATATGTCATCCTAATACTTTAGGATTGATCAATATTGGCTCTCGTCTCATTTTTGTGTCTAATTCTTTTTAATTGTCTTGCGTTGTTTTATCTTTATTTTTCTAATTGGTGCCAGGGTTGGCTGCCTGGACTAAAATCTGATGGGGATGCTAGTCAACTCCCAACGATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGTATGTGACTTATTTAACACGCTAACAATATCTGCCATATTTTATCTAGATCTTATCTAGATGATGTGGATTATCTCTCTGGTTGTAGGAATTATCCGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTTGAAATTGCAAGGTTATTTTCTCTTCTTTATTCAAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGGCTCACTTCTGGCGGGCCTTATAACTACAATGGGACTCACAAGGTTCTCATCCTTTCTTTGTACATACAATCATAATTATTACTTCTTATCATTTTCGATTATTATTTATTTATTCTATAATTGTTTCTATTTGTTGTCCATTAATGTAAAGTATTGCTTTCCCAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGCATTGACTATGCTATTTGCTTAAATAGTATTGGCTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCGGAAAATGCCTACATTAAGCAAATCTTTGAAGTAAGCAGATTTACTTTTAAAATGTCCTATGTCTTGTGTAAATAATGAAATAGAAAAAACTAAATGTCTTAATACTGTGTTGTTTATAACGTGTTGATTGTGGATGCCCTGCACTCTGTAGGATTTCTCAAATGTTGCCGAGGATTTGGGCTTTAAAGTTGATCTGAAGCACAAGAAGATCAATATTTCAAACCCTCGAGTGAGTTAATACACCTGAATTTTTTTCTTCTTTTTCTATTTCTGACAGAAAATTAATTTTCAGATGAAGAGAATATTATGAAATTCAATTACTGACATTTGAAAATTGATATCGAATCTACAACATTGTATTGAAATATTACTGAAATAGAAAGATGGAAAAAGAAGACAGACAGTTTGAAAGGTTCGTTCTCTCCCCAGGATAAGCCTAATCCCCACCTACCTGACACACATACATCAGTACCTACACAATCTTCCTTTAATAACAACCCAAGGCCATGCCCACAGTATCCTCGTGACAAAATTCTCCCATTCATTTACTTCGTGCTTCTCCACCTTTCTTTCTACGTGTGCATGCAAATCTAATGGTGGTTTAAGCAAGCCTATCACGCGTTGATGATGCACAACAACTATAGATATCTTTCTGACTTTACAATCTCTGAAGAAAAACCCTCTTCTGCCTTCCCATACCTGGTTTCTGGCTTCTGATTGATGCAAAAGTCTTTAAGAAATCAGACTAGTCTTACTTTAAGTTACTAGCTGCAACTAACAGAAATAACAAACAACCAAAAATCAAAATAACAGAGTTAATAATGGTTAAGTAAAGAACGGAAAATAACGAGTACCACATAATGCACACATATATTGGCCACTTAAGAGAATTTCATTCTTAAGTATAGGATTGCAGCTTCATCTCTTAATGATACTCTCCATGTTCTGAGTCACTTGACCTTAGATTCTGAAGTTTTCTTTTTCATCCTTGTCCCACATTTTCACATGTAAAATTTCTATTTTTGTCTTTGTTTTGTATTGTGCACTCCAAATTTTTTTCAAAAGTTATTTACCATTCTTTGTGTTCTTTATTTCATGCTTAAGTCATGGAGGACCAATGGACTATTGCACTCAAATGCTTTTTGTTTCAAAGTCGTTATTTTAATGTTTTGTATACTAAAAAAAGTATTAGAAAAATTGGAAGACTTGGAGATATGTTCATCTAGAAATTCAAGAACCTATGTATATGAATCCACTCTTCTGAGTATCTTTCATGGTCCAACCTTCTTTCTTCGTCAACTTGTTGTAGATATGCCAATAGGTTATCGATTCTTACTTAAGTAAAACCACAAAACCATCTCACTTTGTTAGGATACCTCCTAACAAATTGGCATTAGAGCATGTTCGTATTGGAATATCGTTTCAGCTATGGAATCGGATGCAATATGTGGTAGGCTAAAGAAGATGTTAGACATCTTGAAATCAATGCAACAAACATTTAAGGATTACCAACAAGGATGTGTAGATAGAGAGAGAGAAAGAGAAGAAAGAAGACGAACAGTCATCAAAGCAGAGAAAAAAAACCTTGTTCAGAAACTGTTTGGTGGAGTTCAAAGGGTTGCTGGACGATGTCGATCTTACCTGAGAGTGCCTTCGAAAAAGCGAGGAAAAATCAGCCGACCAAGCAAGAGGGAAGGGAAGGAAGAGGATACTATGGCAAAGACAGAATAAAAAACACCCCGAGAAGGTGAAGGTAAGAATGTTAAAAATCTAAAAGTGGCGAAAGTTAGTGAGATGAGAAAGGGAATTAAGAGGAAAATACCAGCATGGAGGTGATGAGGCAAGCGAAGGAAGAATGAATCGAGTGGTAAAAAGGAGTGAAAATCGAGACAGAGGTTGGAAGAAGAAAAGGAGGAAGAAGAACCTTCCGGCGAAATCGTGGTCAACAAAGCCAAGAGGAGAGCAAGACGAGGAAGTTCATTATCGACATCAATGGCAAGGGTGGCTGGAGATGAATCTGTAGGGGAGGCATCGATACAACCGTCATAAAGAGAAGGTAGGAAAAAGATTGGAGGGGGACAACCATGAAGGGAACATTGAAGAGATCACTACCGGCGGCAAGAATGGGGGTGGCCAAACAAGAGATGTTGGAGAAGGGTTGTTTGATGGGGTGGTGTGCGGATTGAAAAACCACGAGGTACGGACTTAACTGATTTCACACTCTTGAATAGACGAAATGGCAAGCAAAAGAGGTGGACTTTCTGGCGATTAGTGTGATGAAGAGTTGGCCTTGCCCACGAACAGTCATCGAGTTAAGAGGATTCTTGGGGCTCACCGGCTATTATAGGAGGTTTGTTCAGGGATATGGATTGATAGCAGCTCCCTTAACTAAATTACTGCACAAAAATGGGTTTGAATGGACTGCCGAGGCAAGGGAAGCATTTGAGGCGTTGAAGAGAGCGATGATGACGGTGTCGGTTATAGCCTTGCCGGATTTCTCTCAGCCATTTACAATTGAAATTGATGCTTCCGATTACGAATTTGGGGGCAATTTTGACTTAGAATGGAAGACTGATAGCCTATTTTAGTCGAACGTTGTCTCAACGGGCCCAAGCTAAATTGATATATGAGAGGGAACTTATGGTGGTAGTTTTAGCAGTACAGAAATGCAGACACTACTTAATTGCGAACAATTTCACGGTTATTTCAGATCAAAAGGCATTGAAATTTTTGATAGAGCAGAGGGAGGTACAACCTCAATTCGAAAGGTGGTTAATCAAGTTATTAGGGTATGATTTCGAGATCTTATACCAACTGGGGTTGAATAACAAGGTAGCGGATGCGTTGTCACGGGTGAATTTGAATGGGGAGTTACATAGCCTCATCGAACCACCATTGTTGGATACAAAACCTATACAAAGAGAAGTGGAGGAGGATGCTGAGTTACAAGAAATCGTATCCTTATTGAAGAAGGACCTAGAAGGCAAACCAAAATACCAAATGCAGCAAGGGAAACTTTTGTACAAGGGAAGGTTAGTTCTTTCGAAATCTTCATCCTTGATTCCATCCATACTGCATACTTATCATGATTCCGTATTGGGAGGCCACTTGGGTTTCCTAAGAACATACAAGCGAATGATTGGAGAACTATTTTGGCTGGGGATGAAAGCTGAAGTGAAGAGGTATGTAGCAAGCTGCGAAATATGTCAAAGGAGGATTTAGCTATGGATTTTATCGAAGGGCTCCCAAAATCAGGGGGGTACAGTTCTATTATGGTGGTGGTTGATAGGTTTAGTAAATATGCTCACCTCATATTGTTGTCCCATCCTTTTTCGGCAAAGCAAGTGGCTGAGGCAATTGTTTAGGAGGTCATCAAGCACCATGGGATACCGAAATCAATAGTTTCAGATGGGGACAAGATCTTTTTAAGCCATTTCTGGAAGAAACTCTTTGCGGTATGGGGTACTAAGCTAAATCATAGCACGGCCTTCCATCCTCAGACGGATGGTCAAACCGAAAGGGTAAATCGATGTTTAGAAACGTATCTTCGTTGTTTTTGTAACGTGCAACCCTTAAGATGGAGTAAATGGGTGCCCTGGGCCGAATTGTGGTATAACACTACCTTCCATGCGTCAACCAAGACAACTCCATATCAAATTGTTTTTGGTCATCCACCTCCCCCAATTCTGTCATATGGAGAGGAAATCATCAAACGATGCAGTTGAAAGACAATTGATGGCCCGAATGAAGTGTTAAGTGCCTTAAGTGAACATCTAATGGTTGCTCAAAACCGAATGAAAAAGCAGGCTGATAGCAAAAGGAGGGAGTTGGTATTCGAAGTGGGAAATGAAGTTTTCCTAAAATTGCGCCCTTATAGACAGAGGTCTCTAGCAAGAAAAAAATGTGAAATATTGGCTCCCCGATACTATGGACCGTGCAAGGTGATTGAAAGAGTTGGTGAGGTAGCCTATAAACTCGAGTTACCACCGGAGGCAGCCATCCGCAATGTGTTTCACATCTCACAGCTAAAGAAGATGGTGGGACAGAGGAGCCAGAGTTCAAGATCACCCACCTGTTCTGACAGGAATTTGAATTGCAAGTTGTGCCTGAAGATATCGTGGGAGTGCGCTGGAACAAGGAATTGGCAATGAATGAATGGCTGGTGAAATGGAAGGACATGGCAGAAGAAGAAGCTACCTGGGAGTCGAATTACCTGATACAGCAGCAGTTTCCTCATCTCCACCTTGAGGACAAGGTGAATTTTGAGAAGGGGGGTATTGTTAGGCCTCCAATTGTCTATACATATAAGAGAAATGGTAAAAGGGGAAACTCGGGCAATTTAATTGAAAAATGAGGACAAATAACGGAAAAGAATGAAGGTTGTTAGGGGACCACAGAGGTTAGTAAAAGAGGTGCTGCTGGGTTGGAGGAAATCATCTGAGAAATTGAGTAAGCTGCAGGCCCTAATTTGGGAGAGGAGAGAATGTCCAGCTCTCTCTAAAGTGCTTGAGTGACGTTGCTGAATTTCTTTCTTTGTTTTATCTTGTTGCCTATCTTTGTTAGTGACTGGAAGAACTCTTGGTTTTGTCTACTCTTGGTTGATAAATAATATTCTTGAGGAGGAGTTCCATCTCTATTTCTTCTATGTTTGTGTCTTGTTGGGTATTTTACTGTTTGCAGGGGATTCAGATACCTAACAAATAGTTTAGCATTATTAACCTGAGTTGAGATTATTTTTGATATCAGGATTTATAAACAATCATACCAAACCTTCTGTTAAACGGGATCTTTAGTGTCAATGTTTCCAAATGGAACTTGTTCACTGCATGGTTTGTTGAACTTGGACATTGGCTCTGGGTTCCTAATACCTATTTAATTGACAAAAGAGTGAGAATAGAAATGGAAATTTGATGACTAGGGTAACAATATCATACAAGGTTCTTCAATGAAAAATGAAATTTGTAATTGTGATTCTAAATATAGAAATCGTTGTATTAATATGTTTTATTTATATTGTTACAGAAAATTTAGCGGTGGAAAATTCAAATCTTATTGTTTGAGCGTACCTTTATACTAAGATACATTGTTTTGAGGATGACAAATGATACTAAAGTTATGCTTGACTGCTTGTATATATATTCTCCAAATTCAATGGCAAACGTGACTCGAGCATTTAGAATCCTAGACAAGTGTTTTTCAAATGTCATGCTACTTATCCACTTAATAGAAAAAAATTAAATTGGCAATGTGGACTACATCATTTATCTTGTGGCTAGGGTAAACTGGGTCAAACTAAAGTGGGCATGCTTCTTACCTTATAATTAGTTAATAATCCCAACCAACCCTGAATTCTTTTAATTATTGCATCCTATGGTCACTGCTCCAAGAAAAGGGTGTTTCTCGGATAATAGCAACATTTAGTAGGAAAGAGCAGCGCTCTTTTTTTCTTTTTCTTTTTCCTTTTTATGCAGGGGTGGGGGAGGACAAAACAGAAAGAGCCAAAGACTTTTGAAAACTGATCTAATGTACATTGCCATCATCACATTTTTCATCATGATGTGTGCTAATCATTTTTTCAAAAAGGATTGTTCTAGCTTTCCCTTAGCATAGTTTACAAATGATTACTTCTATTTTGTGATTATTATAGCCTACTTAATAAATCCTTTGTATGCTGTAGGTAGCCTGGGAGCACGAACAGTTTTCAAGATTGAGAGTAACTGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTTTTGGAAAGGACTGGAGGTTTGGCTGACAACAGGTTCGATTTTTAAGAAATCGATCAGTTTCCTGGTTCTGTTCCACTTCCATGTTTGCATGTGTGCGAGCATACATATTTATGTGTTTGTGTGTAATTTGTATTCTGACAAGACCAACTTCCAACAGATTGTTTCTGAACGAGAATAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAGTAAGTTCTATCATTGTCTTGCTTAGCCGTAATTTTGAATCAAGTATTTTTATCATATTTATGTGTTTCCTGTTTCAAGTGCTAGTGTTTGTTTTTGGCAATCGAGAATTCATCTCAAGTTCAGTATGTTTAAATATGGCACTTAGTTAATAGCTTCTACTGTAATTTACAAGTGGAGTGTGCACGAATAGACGGCTTCCACCATTGATTTTCAAATTGCAAATGGTTCAGAATGCCCGTGGTATTATTTTATCTTGTTTTCTTGATTTCTCTAATGCACAATTCCTTGCATTTTATGCAGAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAGTCAATCCAACTTATATTCGATCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGATGATCCTTTCATATCAGCATTAAAAAAGGTTTTATATCAAAAAAAAGTTGTGCCAGTTGATACCATCTACTTTCATTTAGCTTCATATAATGTGTTCACCCCAAGATTGATTGGCATTCTGATAGAAAAAAAATAAGATATTGTTCTGAAACGTGTTTATGTTCAACTGGCTCCTTTTCGTCAAAAGATGACTCCTTCCTTCGTATTGGGTTTTATATTAGAGAATAAAATTGTGTCAGTTACCATTCTTGCCCCCTTTCATTTAGCTCAACCTAAAAGTATTCCTTGGAAACATGCATAACGTGCTTAAATATTCTGACTTTTGGAGAAATTGCAACCTTCTGTTTGAAATTAGTTTGTTAAAATCTTGACTATTTTTGTTATGGCAGAACTACTATGGTTAGTGCCGCACTAACCATAGTAGTGCTGATCCATATTTTCCTTGCATGCTTGTGTAATACAAAAGCATGAAACCCATTTATGCAAACACAATATTTTATTGGTCAGTTTGTTTGGCTGCAGTGGCACTCAGGCAAATGCTCAGTGTTTTAAAAAGCCCTCTCGGGTGCACGTCTAGGTTCAAGGTGTAGGTCTGGTGCCTCACCTTTTTAAGGCGAGGCTCACAAAATAAGGTGTGCCTCCGCGCCTTTTGTGAAGCACCAAGGCACCGAGATGGTACAATTATTTTATCAAAAAAAAAGAAAAAAAAGAAGTAATAATTAGTGTCTTTCCTTTAATTAAATTAGAAAAATCAAATTTACCAAGGCTAAATGAAAATTTTCTTGTGTTTGTGGGTCTTCTTTTCATATTTCTACTTTAACTATGCTTCTTTCTTTATGTATTACTTTTATATATAGTGTGTCTCATAGAAAAAAAAGTCTTTTTTTTTTGTGCCTTGTGCTTAATCTTGAGAAGACTATTGTACTTATTACTGAGCTTAAAAAACACTTCGAATGTACCTCTAACTCTCAGCCATCGTGGAATGATATATTTATATTACCTTCAGGAACTGGAGGTCCATACCCATGATGTGAGCTTGCAACATGAAGTATTTGACGGAATGTTCACCTTTTATGATTCAACTGCAGCTAAACTTCACATATACCAGGTACTTCGCGGATCAATTGTGAACTTGGCTCTCTGCTTGCACTTGCATAGTTACATGTAATAGATCCAGTTTTGCTATAATTTTTTCAGCTCTACAATCGGAGCATGCTTCTTCTGAAAACAAAGCTCCATTTTATTTTAGTTTTGTACACTATCATTTTATATATTCGTGTCAGCTAATTCTAGATCTGTGGTAATGTATAATCCTTGGGTGTTATTTCTTGACTTGAATCTAAGCAAAGCTTCACATTTGTTGTTTCAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTATTGGGGTCATATTTAGTTTTACTCTTCTGTTTCCTCGTGATCACAACCAGGGTATGTTGTATTATTTACTTTATCAAATTTAGGAAATGATCTCTCATTATCCCTTAAACATAAATAAAATATATAAATGTCTGTCCTCAAGAAGGGGCTCTTAGGACCCTTGTTATCAAGCACTCGCCACTTAAAATTATTGCGTATAGAGCCTGTTTGGAAATTAGTGGTTTAACACTTTTCAAGCGGAGGAGGAACATAAAAATGCTTTTGGAAAAAGTTTAAAGTACACCAGAGCGTTTTGTGAGAGAACACTTGTTTATTGTTTTTGGCTTGAGTGCTTAATTGTGAATTGTATCAACAAAGTGATATGTTTGTCAAGAATATTTATTTTCAAAGCACTCTTAAGCACACCTTGAGTTTGGGATGACTTGGGAGAATATTATTTTTAAGTAAAATATTTGTATTACTCCTTTGAAACACTCTCTTGAAGTGTTTTTTTCTTCTTTTTTTTTTTTTTAGGGTCTCGATGATCTGATCGGTCTATTTAGACGCCCTCCTTCCCGAAAAGTAAAAACAGCTTGACGACTGCAGTAGATTTTGATGCTTGGATTTGCATCATTGCCACCCGAATTTTTGCCATCTAGATTAGACAGGCCTTATGATGGAGGTCCAAATGTGACAAAGGTGTGAAACCAATCATCAGGGCTCGAGGAACTGAAGGTTTTATGTGGCCATTCCTCGCACGATTTTATTTTTCTTTCTTTGACATCCTCTCTGTGACATAATTAACAGTTACAAGAACTCTCGAGGTGCGATAATTTTTGTTTGGTCTCCATATTTAAATTCCTACTGTTTAACCCTCATTTGTGGCTTCCACTTTTGACAATGATCTTTTCCTTAGAATTTGTTGAGTCTATTATTTCTTGCTCAGTTGATGTTAGTGGCAGGCAATAGATTCTGTTCAACAGTTAAAGAAAATCTTTCTCAGAGAATATGTTTGAGGT

mRNA sequence

TTGTGTTTTGTTAAAAGAAAAAAAAAGAAAAAAAACACTTCCGAAGGGGCACTTACTCTAAGAGTCAGAACATTTAACAAAGCTTGGGGTCAAGTGAGGTGTATGCAGGCGGTGGGCATCTCTCCTCAACGGCGAGAACACCAGAAAGAGCCGCGAGCTGATTCAATCTCCATGTGCGCATCATCCATTTTATTCTAATCAATTCCCTTCCAAATCTCATCCTTCTACTTCTCATATCTGCCAACGCCCTTTCCTCTCCAGTCTCACCTCCAGATCTTGCATCCATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTCTCGAATCCTTCTACCCCGTCGTCGCCCTTGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCCGCCACAGTCGTCGATGTCTACCGTCTCATTCACTACGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCCTCTGCTGCTGATCTCTCTCGCTCCGTTCTCATCATTCCTCTTTCTGAACTCAATATCACCTTTCTCCAAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGATTTTGCTTCCCAGGTTTCTTGGCTCAGATGGCCCGAAAAATGATGACATTAAATGTCCACATAATGGAGAGGGGATGATCAAGGATTTATTGGTTGAACTTGAACGGTTGCTCATACATTCTACTATACCTTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTAATGCTGTTTTGGCTGATGTCAAGAACAATGACGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTACAAGCTTGTTGTTTCGGCAGCAGAACCAAGAAAACTTGTATCTTCCACGATTACAAATATTCAGGGTTGGCTGCCTGGACTAAAATCTGATGGGGATGCTAGTCAACTCCCAACGATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGAATTATCCGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTTGAAATTGCAAGGTTATTTTCTCTTCTTTATTCAAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGGCTCACTTCTGGCGGGCCTTATAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGCATTGACTATGCTATTTGCTTAAATAGTATTGGCTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCGGAAAATGCCTACATTAAGCAAATCTTTGAAGATTTCTCAAATGTTGCCGAGGATTTGGGCTTTAAAGTTGATCTGAAGCACAAGAAGATCAATATTTCAAACCCTCGAGTAGCCTGGGAGCACGAACAGTTTTCAAGATTGAGAGTAACTGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTTTTGGAAAGGACTGGAGGTTTGGCTGACAACAGATTGTTTCTGAACGAGAATAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAGTCAATCCAACTTATATTCGATCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGATGATCCTTTCATATCAGCATTAAAAAAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTATTGGGGTCATATTTAGTTTTACTCTTCTGTTTCCTCGTGATCACAACCAGGGGTCTCGATGATCTGATCGGTCTATTTAGACGCCCTCCTTCCCGAAAAGTAAAAACAGCTTGACGACTGCAGTAGATTTTGATGCTTGGATTTGCATCATTGCCACCCGAATTTTTGCCATCTAGATTAGACAGGCCTTATGATGGAGGTCCAAATGTGACAAAGGTGTGAAACCAATCATCAGGGCTCGAGGAACTGAAGGTTTTATGTGGCCATTCCTCGCACGATTTTATTTTTCTTTCTTTGACATCCTCTCTGTGACATAATTAACAGTTACAAGAACTCTCGAGGTGCGATAATTTTTGTTTGGTCTCCATATTTAAATTCCTACTGTTTAACCCTCATTTGTGGCTTCCACTTTTGACAATGATCTTTTCCTTAGAATTTGTTGAGTCTATTATTTCTTGCTCAGTTGATGTTAGTGGCAGGCAATAGATTCTGTTCAACAGTTAAAGAAAATCTTTCTCAGAGAATATGTTTGAGGT

Coding sequence (CDS)

ATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTCTCGAATCCTTCTACCCCGTCGTCGCCCTTGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCCGCCACAGTCGTCGATGTCTACCGTCTCATTCACTACGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCCTCTGCTGCTGATCTCTCTCGCTCCGTTCTCATCATTCCTCTTTCTGAACTCAATATCACCTTTCTCCAAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGATTTTGCTTCCCAGGTTTCTTGGCTCAGATGGCCCGAAAAATGATGACATTAAATGTCCACATAATGGAGAGGGGATGATCAAGGATTTATTGGTTGAACTTGAACGGTTGCTCATACATTCTACTATACCTTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTAATGCTGTTTTGGCTGATGTCAAGAACAATGACGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTACAAGCTTGTTGTTTCGGCAGCAGAACCAAGAAAACTTGTATCTTCCACGATTACAAATATTCAGGGTTGGCTGCCTGGACTAAAATCTGATGGGGATGCTAGTCAACTCCCAACGATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGAATTATCCGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTTGAAATTGCAAGGTTATTTTCTCTTCTTTATTCAAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGGCTCACTTCTGGCGGGCCTTATAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGCATTGACTATGCTATTTGCTTAAATAGTATTGGCTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCGGAAAATGCCTACATTAAGCAAATCTTTGAAGATTTCTCAAATGTTGCCGAGGATTTGGGCTTTAAAGTTGATCTGAAGCACAAGAAGATCAATATTTCAAACCCTCGAGTAGCCTGGGAGCACGAACAGTTTTCAAGATTGAGAGTAACTGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTTTTGGAAAGGACTGGAGGTTTGGCTGACAACAGATTGTTTCTGAACGAGAATAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAGTCAATCCAACTTATATTCGATCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGATGATCCTTTCATATCAGCATTAAAAAAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTATTGGGGTCATATTTAGTTTTACTCTTCTGTTTCCTCGTGATCACAACCAGGGGTCTCGATGATCTGATCGGTCTATTTAGACGCCCTCCTTCCCGAAAAGTAAAAACAGCTTGA

Protein sequence

MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKKVASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLIGLFRRPPSRKVKTA
Homology
BLAST of CaUC01G012040 vs. NCBI nr
Match: XP_038894420.1 (nicalin-1 isoform X2 [Benincasa hispida])

HSP 1 Score: 996.5 bits (2575), Expect = 8.7e-287
Identity = 511/564 (90.60%), Postives = 523/564 (92.73%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVL+SFYP++ALVFILVACVELCDAATVVDVYRLI YDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPILALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLL+LLP+   SDGP+N
Sbjct: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLVLLPKIFDSDGPEN 120

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           DDIK PHNGEGMIK+LLVELERLLIHSTIPYPVYFASEGEDI+AVLADVKNNDATGQLAT
Sbjct: 121 DDIKSPHNGEGMIKNLLVELERLLIHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT 180

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL
Sbjct: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSG+VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGVVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNE+KIA SIKLVA
Sbjct: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNESKIANSIKLVA 420

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ES+A+HIYRYEGKNIQVFADDSSLAVNPTYIR WLDLLSRTPRVAPFLSKDDPFISALKK
Sbjct: 421 ESIAKHIYRYEGKNIQVFADDSSLAVNPTYIRLWLDLLSRTPRVAPFLSKDDPFISALKK 480

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVGLQHEVFDGMFTFYDSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

BLAST of CaUC01G012040 vs. NCBI nr
Match: XP_023520704.1 (nicalin-1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 981.9 bits (2537), Expect = 2.2e-282
Identity = 499/564 (88.48%), Postives = 520/564 (92.20%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVLESFYP++ALVF+LVAC ELCDAATVVDVYRLIHYDISGVPFGSRAA+L
Sbjct: 1   MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISGVPFGSRAASL 60

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHA+SLHFP AADLSR+V IIPL ELN TF++ECISQ+KRLGGLLILLP+ LGSDGPKN
Sbjct: 61  NHHAASLHFPPAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKN 120

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           DD KCP NG+GMIKDLLVELERLLIH+T+PYPVYFASEGEDINAVLADVK+NDATGQLAT
Sbjct: 121 DDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLAT 180

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDA+QLPTIAIVASYDTFGAAPEL
Sbjct: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDANQLPTIAIVASYDTFGAAPEL 240

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGIVALLEIARLFSLLYSSPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           R+RESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RIRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISNPRVAWEHEQFSRLRVTAATLS +SAAP+LLERTGGLADNRLFLNE+ IAKSIKLVA
Sbjct: 361 NISNPRVAWEHEQFSRLRVTAATLSGISAAPDLLERTGGLADNRLFLNESAIAKSIKLVA 420

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK
Sbjct: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDL+LLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVSLQHEPFDGMFTFYDSTAAKLHIYQVASVTFDLVLLLVLGSYLVLLFCFLV 540

BLAST of CaUC01G012040 vs. NCBI nr
Match: XP_008463652.1 (PREDICTED: nicalin-1 [Cucumis melo])

HSP 1 Score: 978.8 bits (2529), Expect = 1.9e-281
Identity = 505/564 (89.54%), Postives = 519/564 (92.02%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVL+SFYPV+ALVFILVACVELCDAATVVDVYRLI YDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHASSLHFPS ADLSR+VLIIPL EL +TFLQECISQKKRLGGLL+LLPR LGS+  KN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           DDIKC  NGEG+IKDLLVELERLLIHSTIPYPVYFAS+GEDI+AVLADVKNNDATGQLAT
Sbjct: 121 DDIKCT-NGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLAT 180

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL
Sbjct: 181 ATTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           RLRE IDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFL+E+KIAKSIKLVA
Sbjct: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVA 420

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSW+DLLSRTPRVAPFLSKDDPFISALKK
Sbjct: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKK 480

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

BLAST of CaUC01G012040 vs. NCBI nr
Match: KAG6583712.1 (Nicalin-1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 975.7 bits (2521), Expect = 1.6e-280
Identity = 498/567 (87.83%), Postives = 518/567 (91.36%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVLESFYP++ALVF+LVAC ELCDAATVVDVYRLIHYDIS VPFGSRAA+L
Sbjct: 46  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASL 105

Query: 61  NHHASSLHFP---SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDG 120
           NHHA+SLHFP   +AADLSR+V IIPL ELN TF++ECISQ+KRLGGLLILLP+ LGSDG
Sbjct: 106 NHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDG 165

Query: 121 PKNDDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQ 180
           PKNDD KCP NG+GMIKDLLVELERLLIH+T+PYPVYFASEGEDINAVLADVK+NDATGQ
Sbjct: 166 PKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQ 225

Query: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 240
           LATATTGGYKLVVS AEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA
Sbjct: 226 LATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 285

Query: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300
           PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Sbjct: 286 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 345

Query: 301 FDHRLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360
           FDHR+RESIDYAICLNSIGSWDDKLWLHVSKPPEN YIKQIFEDFSNVAEDLGFKVDLKH
Sbjct: 346 FDHRIRESIDYAICLNSIGSWDDKLWLHVSKPPENTYIKQIFEDFSNVAEDLGFKVDLKH 405

Query: 361 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIK 420
           KKINISNPRVAWEHEQFSRLRVTAATLS +SAAPELLERTGGLADNRLFLNE+ IAKSIK
Sbjct: 406 KKINISNPRVAWEHEQFSRLRVTAATLSGISAAPELLERTGGLADNRLFLNESAIAKSIK 465

Query: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480
           LVAESLARHIYRYEGKNIQVFADDSSLA+NPTYIRSWLDLLSRTPRVAPFLSKDDPFISA
Sbjct: 466 LVAESLARHIYRYEGKNIQVFADDSSLAINPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 525

Query: 481 LKK----------------------------------VASVTFDLLLLLVLGSYLVLLFC 531
           LKK                                  VASVTFDL+LLLVLGSYLVLLFC
Sbjct: 526 LKKELEVHTHDVSLQHEPFDGMFTFYDSTAAKLHIYQVASVTFDLVLLLVLGSYLVLLFC 585

BLAST of CaUC01G012040 vs. NCBI nr
Match: XP_022973118.1 (nicalin-1-like [Cucurbita maxima])

HSP 1 Score: 973.4 bits (2515), Expect = 7.9e-280
Identity = 497/567 (87.65%), Postives = 519/567 (91.53%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVLESFYP++ALVF+LVAC ELCDAA VVDVYRLIHYDISGVPFGSRAA+L
Sbjct: 1   MAPRKPREPQVLESFYPLLALVFLLVACTELCDAAAVVDVYRLIHYDISGVPFGSRAASL 60

Query: 61  NHHASSLHFP---SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDG 120
           NHHA+SLHFP   +AADLSR+V IIPL ELN TF++EC+SQ+KRLGGLLILLP+ LGSDG
Sbjct: 61  NHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECLSQRKRLGGLLILLPKILGSDG 120

Query: 121 PKNDDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQ 180
           PKNDD KCP NG+GMIKDLLVELERLLIH+T+PYPVYFASEGEDINAVLADVK+NDATGQ
Sbjct: 121 PKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQ 180

Query: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 240
           LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK DGDASQLPTIAIVASYDTFGA+
Sbjct: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKFDGDASQLPTIAIVASYDTFGAS 240

Query: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300
           PELSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Sbjct: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSSPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300

Query: 301 FDHRLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360
           FDHR+RESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH
Sbjct: 301 FDHRIRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360

Query: 361 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIK 420
           KKINISNPRVAWEHEQFSRLRVTAATLS +SAAPELLERTGGLADNRLFLNE+ IAKSIK
Sbjct: 361 KKINISNPRVAWEHEQFSRLRVTAATLSGISAAPELLERTGGLADNRLFLNESAIAKSIK 420

Query: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480
           LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA
Sbjct: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480

Query: 481 LKK----------------------------------VASVTFDLLLLLVLGSYLVLLFC 531
           LKK                                  VASVTFDL+LLLVLGSYLVLLFC
Sbjct: 481 LKKELEVHTHDVSLQHEPFDGVFTFYDSTAAKLHVYQVASVTFDLVLLLVLGSYLVLLFC 540

BLAST of CaUC01G012040 vs. ExPASy Swiss-Prot
Match: Q6NZ07 (Nicalin-1 OS=Danio rerio OX=7955 GN=ncl1 PE=2 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 5.7e-63
Identity = 191/595 (32.10%), Postives = 303/595 (50.92%), Query Frame = 0

Query: 11  VLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFP 70
           +L+  +P+  ++F+++ C    +AA    VYR+  YD+ G  +GSR A LN  A ++   
Sbjct: 12  MLKVSFPLSLVLFLVLVCPLRAEAAHEFSVYRMQQYDLQGQTYGSRNAILNTEARTV--- 71

Query: 71  SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGE 130
            A  LSR  +++ L++ +    Q+ + Q    G ++I+L                PHN  
Sbjct: 72  EAEVLSRRCVMMRLADFSYEKYQKALRQS--AGAVVIIL----------------PHNMS 131

Query: 131 GMIKDLL---VELERLLIHSTIPYPVYFASEGEDINAVLADVK--------NNDATGQLA 190
            + +D++   +ELE  L+ +    PVYFA E E++ ++    +        ++ A   L 
Sbjct: 132 TLPQDIVQQFMELEPELLATETIVPVYFALEDEELLSIYTQTQISSSSQGSSSAAEVLLH 191

Query: 191 TATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPE 250
           TAT  G+++V S A+ + +    IT+++G L G  S G+   LPTI +VA YD+FG AP 
Sbjct: 192 TATANGFQMVTSGAQSKAVSDWAITSLEGRLTG--SGGE--DLPTIVLVAHYDSFGVAPW 251

Query: 251 LSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SF 310
           LS G+DSNGSG+  LLE+ARLFS LYS  +T   YNLLF L+ GG +NY GT +WL+ + 
Sbjct: 252 LSYGADSNGSGVAILLELARLFSRLYSYKRTHAGYNLLFFLSGGGKFNYQGTKRWLEDNL 311

Query: 311 DHR----LRESIDYAICLNSIGSWDDKLWLHVSKPPEN-----AYIKQIFEDFSNVAEDL 370
           DH     L++++ + +CL+++G+  D L LHVSKPP+        +K++    ++   DL
Sbjct: 312 DHTDASLLQDNVAFVLCLDTLGN-SDNLHLHVSKPPKEGSPQYTLLKELETVVAHQHPDL 371

Query: 371 GFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSEL---------------SAAPELL 430
            F   + HKKIN+++  +AWEHE+F   R+ A TLS L               S +P L 
Sbjct: 372 KFA--MVHKKINLADDTLAWEHERFGIRRLPAFTLSHLESHRSPARHSIMDMRSVSPSL- 431

Query: 431 ERTGGLADNRLFLNENKIAKSIKLVAESLARHIYRYEGK----NIQVFADDSSLAVNPTY 490
               G A     ++  K++++ K++AE+LAR IY    K    ++++F +   + V    
Sbjct: 432 -EGAGEATTGPHVDLGKLSRNTKVIAETLARVIYNLTEKGVTGDLEIFTE--QMQVQEDQ 491

Query: 491 IRSWLDLLSRTPRVAPFLSKDDPFISALK------------------------------- 529
           + S +D L+  PR A  L KD   I+ L+                               
Sbjct: 492 LASLVDWLTAQPRAAQLLDKDSSIINTLEHQLSRYLKDVKRHLVRADKRDPEFVFYDQLK 551

BLAST of CaUC01G012040 vs. ExPASy Swiss-Prot
Match: Q8VCM8 (Nicalin OS=Mus musculus OX=10090 GN=Ncln PE=1 SV=2)

HSP 1 Score: 231.9 bits (590), Expect = 1.7e-59
Identity = 173/545 (31.74%), Postives = 277/545 (50.83%), Query Frame = 0

Query: 20  ALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFPSAADLSRSV 79
           A++ ++   +   DAA    VYR+  YD+ G P+G+R A LN  A ++    A  LSR  
Sbjct: 28  AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTV---DADVLSRRC 87

Query: 80  LIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGEGMIKDLLVE 139
           +++ L + +    Q+ + Q    G ++I+LPR + +          P +    +    +E
Sbjct: 88  VLMRLLDFSYEHYQKALRQS--AGAVVIILPRAMAA---------VPQD----VVRQFME 147

Query: 140 LERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATG--------QLATATTGGYKLVVS 199
           +E  ++      PVYFA E E + ++    +   A+          L TAT  G+++V S
Sbjct: 148 IEPEMLAMETVVPVYFAVEDEALLSIYEQTQAASASQGSASAAEVLLHTATANGFQMVTS 207

Query: 200 AAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGI 259
            A+ + +    IT+++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSGI
Sbjct: 208 GAQSQAVSDWLITSVEGRLTGLGGE----DLPTIVIVAHYDAFGVAPWLSLGADSNGSGI 267

Query: 260 VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR----LRESI 319
             LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S DH     L++++
Sbjct: 268 SVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDSLDHTDSSLLQDNV 327

Query: 320 DYAICLNSIGSWDDKLWLHVSKPP-----ENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 379
            + +CL+++G     L LHVSKPP     ++A+++++    ++   D+ F   + HKKIN
Sbjct: 328 AFVLCLDTVGR-GSHLRLHVSKPPREGTLQHAFLRELETVAAHQFPDVSF--SMVHKKIN 387

Query: 380 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTG---GLADNRLFLNENKIAKSIKL 439
           +++  +AWEHE+F+  R+ A TLS L +      R G    + D R  ++   + ++ ++
Sbjct: 388 LADDVLAWEHERFAIRRLPAFTLSHLES-----HRAGPRSSIMDVRSRVDSKTLTRNTRI 447

Query: 440 VAESLARHIYRYEGK----NIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPF 499
           +AE+L R IY    K    ++ VF +   + V    I S +D L+  PR A  L KD  F
Sbjct: 448 IAEALTRVIYNLTEKGTPPDMPVFTE--QMQVQEEQIDSVMDWLTNQPRAAQLLDKDGTF 507

Query: 500 ISALK-------------------------------------KVASVTFDLLLLLVLGSY 503
           +S L+                                     +V    FDLLL L +G+Y
Sbjct: 508 LSTLEHFLSRYLKDVRQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAIFDLLLALCIGAY 540

BLAST of CaUC01G012040 vs. ExPASy Swiss-Prot
Match: Q5XIA1 (Nicalin OS=Rattus norvegicus OX=10116 GN=Ncln PE=2 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.7e-59
Identity = 175/545 (32.11%), Postives = 273/545 (50.09%), Query Frame = 0

Query: 20  ALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFPSAADLSRSV 79
           A++ ++   +   DAA    VYR+  YD+ G P+G+R A LN  A ++    A  LSR  
Sbjct: 28  AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTV---DADVLSRRC 87

Query: 80  LIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGEGMIKDLLVE 139
           +++ L + +    Q+ + Q    G ++I+LPR + +          P +    +    +E
Sbjct: 88  VLMRLLDFSYEHYQKALRQS--AGAVVIILPRAMAA---------VPQD----VVRQFME 147

Query: 140 LERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATG--------QLATATTGGYKLVVS 199
           +E  ++      PVYFA E E + ++    +   A+          L TAT  G+++V S
Sbjct: 148 IEPEMLAMETVVPVYFAVEDEALLSIYEQTQAASASQGSASAAEVLLHTATANGFQMVTS 207

Query: 200 AAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGI 259
            A+ + +    IT+++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSGI
Sbjct: 208 GAQSQAVSDWLITSVEGRLTGLGGE----DLPTIVIVAHYDAFGVAPWLSLGADSNGSGI 267

Query: 260 VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR----LRESI 319
             LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S DH     L++++
Sbjct: 268 SVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDSLDHTDSSLLQDNV 327

Query: 320 DYAICLNSIGSWDDKLWLHVSKPPENAYIKQIF-EDFSNVA----EDLGFKVDLKHKKIN 379
            + +CL+++G     L LHVSKPP    ++ +F  +   VA     D+ F   + HKKIN
Sbjct: 328 AFVLCLDTVGR-GSHLRLHVSKPPREGTLQHVFLRELEMVAAHQFPDVSF--SMVHKKIN 387

Query: 380 ISNPRVAWEHEQFSRLRVTAATLSELS---AAPELLERTGGLADNRLFLNENKIAKSIKL 439
           +++  +AWEHE+F+  R+ A TLS L    A P        + D R  ++   + ++ ++
Sbjct: 388 LADDVLAWEHERFAIRRLPAFTLSHLENHRAGPR-----SSIMDVRSRVDSKTLTRNTRI 447

Query: 440 VAESLARHIYRYEGK----NIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPF 499
           +AE+L R IY    K    ++ VF +   + V    I S +D L+  PR A  L KD  F
Sbjct: 448 IAEALTRVIYNLTEKGTPPDMPVFTE--QMQVQQEQIDSVMDWLTNQPRAAQLLDKDGTF 507

Query: 500 ISALK-------------------------------------KVASVTFDLLLLLVLGSY 503
           +S L+                                     +V    FDLLL L +G+Y
Sbjct: 508 LSTLEHFLSRYLKDVRQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAIFDLLLALCIGAY 540

BLAST of CaUC01G012040 vs. ExPASy Swiss-Prot
Match: Q969V3 (Nicalin OS=Homo sapiens OX=9606 GN=NCLN PE=1 SV=2)

HSP 1 Score: 228.8 bits (582), Expect = 1.4e-58
Identity = 153/480 (31.87%), Postives = 255/480 (53.12%), Query Frame = 0

Query: 20  ALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFPSAADLSRSV 79
           A++ ++   +   DAA    VYR+  YD+ G P+G+R A LN  A ++   +A  LSR  
Sbjct: 28  AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTM---AAEVLSRRC 87

Query: 80  LIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGEGMIKDLLVE 139
           +++ L + +    Q+ + Q    G ++I+LPR + +          P +    +    +E
Sbjct: 88  VLMRLLDFSYEQYQKALRQS--AGAVVIILPRAMAA---------VPQD----VVRQFME 147

Query: 140 LERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATG--------QLATATTGGYKLVVS 199
           +E  ++      PVYFA E E + ++    +   A+          L TAT  G+++V S
Sbjct: 148 IEPEMLAMETAVPVYFAVEDEALLSIYKQTQAASASQGSASAAEVLLRTATANGFQMVTS 207

Query: 200 AAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGI 259
             + + +    I +++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSG+
Sbjct: 208 GVQSKAVSDWLIASVEGRLTGLGGE----DLPTIVIVAHYDAFGVAPWLSLGADSNGSGV 267

Query: 260 VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR----LRESI 319
             LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ + DH     L++++
Sbjct: 268 SVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDNLDHTDSSLLQDNV 327

Query: 320 DYAICLNSIGSWDDKLWLHVSKPPENAYIKQIF-EDFSNVA--EDLGFKVDLKHKKINIS 379
            + +CL+++G     L LHVSKPP    ++  F  +   VA  +    +  + HK+IN++
Sbjct: 328 AFVLCLDTVGR-GSSLHLHVSKPPREGTLQHAFLRELETVAAHQFPEVRFSMVHKRINLA 387

Query: 380 NPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVAESL 439
              +AWEHE+F+  R+ A TLS L +  +   +   + D R  ++   + ++ +++AE+L
Sbjct: 388 EDVLAWEHERFAIRRLPAFTLSHLESHRD--GQRSSIMDVRSRVDSKTLTRNTRIIAEAL 447

Query: 440 ARHIYRYEGK----NIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALK 480
            R IY    K    ++ VF +   + +    + S +D L+  PR A  + KD  F+S L+
Sbjct: 448 TRVIYNLTEKGTPPDMPVFTE--QMQIQQEQLDSVMDWLTNQPRAAQLVDKDSTFLSTLE 480

BLAST of CaUC01G012040 vs. ExPASy Swiss-Prot
Match: Q5ZJH2 (Nicalin OS=Gallus gallus OX=9031 GN=NCLN PE=2 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.4e-50
Identity = 136/400 (34.00%), Postives = 219/400 (54.75%), Query Frame = 0

Query: 14  SFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFPSAA 73
           SF   V  V +L+      +AA    VYR+  Y++ G P+G+R+A LN  A ++    A 
Sbjct: 21  SFLLFVPAVLLLLGPPPAAEAAHESTVYRMQQYELGGQPYGTRSAVLNTEARTV---EAD 80

Query: 74  DLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGEGMI 133
            LSR  +++ L + +    Q+ + Q    G ++I+LPR + S     D +K         
Sbjct: 81  VLSRRCVMMRLVDFSYEQYQKALRQS--AGAVVIILPRSISS--VPQDVVK--------- 140

Query: 134 KDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATG--------QLATATTGG 193
           + + +E E L + + +  PVYFA E +++ ++    +   A+          L TAT  G
Sbjct: 141 QFMEIEPEMLAMETIV--PVYFAVEDDELLSIYEQTRAASASQGSASAAEVLLHTATANG 200

Query: 194 YKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSD 253
           +++V S A+ + +    I +++G L GL  +     LPT+ IVA YD+FG AP LS G+D
Sbjct: 201 FQMVTSGAQSKAIHDWLIPSVEGRLTGLGGE----DLPTVVIVAHYDSFGVAPWLSHGAD 260

Query: 254 SNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR--- 313
           SNGSG+  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ + DH    
Sbjct: 261 SNGSGVSVLLELARLFSRLYTYRRTHAGYNLLFFASGGGKFNYQGTKRWLEDNLDHTDSS 320

Query: 314 -LRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGF---KVDLKH 373
            L++++ + +CL+++G   + L LHVSKPP+   ++  F     +     F   K  + H
Sbjct: 321 LLQDNVAFVLCLDTLGR-GNSLHLHVSKPPKEGTLQHAFLRELEMVVASQFPEVKFSMVH 380

Query: 374 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERT 398
           KKIN++   +AWEHE+F+  R+ A T+S L +  + L  +
Sbjct: 381 KKINLAEDILAWEHERFAIRRLPAFTISHLESHRDSLRNS 397

BLAST of CaUC01G012040 vs. ExPASy TrEMBL
Match: A0A1S3CJR5 (nicalin-1 OS=Cucumis melo OX=3656 GN=LOC103501746 PE=4 SV=1)

HSP 1 Score: 978.8 bits (2529), Expect = 9.1e-282
Identity = 505/564 (89.54%), Postives = 519/564 (92.02%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVL+SFYPV+ALVFILVACVELCDAATVVDVYRLI YDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHASSLHFPS ADLSR+VLIIPL EL +TFLQECISQKKRLGGLL+LLPR LGS+  KN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           DDIKC  NGEG+IKDLLVELERLLIHSTIPYPVYFAS+GEDI+AVLADVKNNDATGQLAT
Sbjct: 121 DDIKCT-NGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLAT 180

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL
Sbjct: 181 ATTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           RLRE IDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFL+E+KIAKSIKLVA
Sbjct: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVA 420

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSW+DLLSRTPRVAPFLSKDDPFISALKK
Sbjct: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKK 480

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

BLAST of CaUC01G012040 vs. ExPASy TrEMBL
Match: A0A6J1IC52 (nicalin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111471642 PE=4 SV=1)

HSP 1 Score: 973.4 bits (2515), Expect = 3.8e-280
Identity = 497/567 (87.65%), Postives = 519/567 (91.53%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQVLESFYP++ALVF+LVAC ELCDAA VVDVYRLIHYDISGVPFGSRAA+L
Sbjct: 1   MAPRKPREPQVLESFYPLLALVFLLVACTELCDAAAVVDVYRLIHYDISGVPFGSRAASL 60

Query: 61  NHHASSLHFP---SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDG 120
           NHHA+SLHFP   +AADLSR+V IIPL ELN TF++EC+SQ+KRLGGLLILLP+ LGSDG
Sbjct: 61  NHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECLSQRKRLGGLLILLPKILGSDG 120

Query: 121 PKNDDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQ 180
           PKNDD KCP NG+GMIKDLLVELERLLIH+T+PYPVYFASEGEDINAVLADVK+NDATGQ
Sbjct: 121 PKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQ 180

Query: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 240
           LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK DGDASQLPTIAIVASYDTFGA+
Sbjct: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKFDGDASQLPTIAIVASYDTFGAS 240

Query: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300
           PELSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Sbjct: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSSPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300

Query: 301 FDHRLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360
           FDHR+RESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH
Sbjct: 301 FDHRIRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360

Query: 361 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIK 420
           KKINISNPRVAWEHEQFSRLRVTAATLS +SAAPELLERTGGLADNRLFLNE+ IAKSIK
Sbjct: 361 KKINISNPRVAWEHEQFSRLRVTAATLSGISAAPELLERTGGLADNRLFLNESAIAKSIK 420

Query: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480
           LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA
Sbjct: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480

Query: 481 LKK----------------------------------VASVTFDLLLLLVLGSYLVLLFC 531
           LKK                                  VASVTFDL+LLLVLGSYLVLLFC
Sbjct: 481 LKKELEVHTHDVSLQHEPFDGVFTFYDSTAAKLHVYQVASVTFDLVLLLVLGSYLVLLFC 540

BLAST of CaUC01G012040 vs. ExPASy TrEMBL
Match: A0A0A0LYB7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G167840 PE=4 SV=1)

HSP 1 Score: 973.4 bits (2515), Expect = 3.8e-280
Identity = 501/564 (88.83%), Postives = 517/564 (91.67%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRKPREPQV +SFYPV+ALVFILVACVELCDAATVVDVYRLI YDISGVPFGSRAATL
Sbjct: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHASSLHFP+ ADLSR+VLIIPL ELN+TFLQECISQKKRLGGLL+LLPR LGS+  KN
Sbjct: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           DDIKCP NGEG+IK L VELERLL+HSTIPYPVYFASEGEDI+AVLADVKNNDATGQLAT
Sbjct: 208 DDIKCP-NGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT 267

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+L
Sbjct: 268 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDL 327

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 328 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 387

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           RLRE IDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 388 RLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 447

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFL+E+KIAKSIKLVA
Sbjct: 448 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVA 507

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ESLARHIYRYEGKNIQVFADDSSLA+NPT+IRSWLDLLSRTPRVAPFLSKDDPFI+ALKK
Sbjct: 508 ESLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKK 567

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 568 ELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 627

BLAST of CaUC01G012040 vs. ExPASy TrEMBL
Match: A0A6J1EL15 (nicalin-1-like OS=Cucurbita moschata OX=3662 GN=LOC111434272 PE=4 SV=1)

HSP 1 Score: 970.3 bits (2507), Expect = 3.2e-279
Identity = 497/567 (87.65%), Postives = 518/567 (91.36%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MA RKPREPQVLESFYP++ALVF+LVA  ELCDAATVVDVYRLIHYDIS VPFGSRAA+L
Sbjct: 1   MASRKPREPQVLESFYPLLALVFLLVAYTELCDAATVVDVYRLIHYDISAVPFGSRAASL 60

Query: 61  NHHASSLHFP---SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDG 120
           NHHA+SLHFP   +AADLSR+V IIPL ELN TF++ECISQ+KRLGGLLILLP+ LGSDG
Sbjct: 61  NHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDG 120

Query: 121 PKNDDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQ 180
           PKNDD KCP NG+GMIKDLLVELERLLIH+T+PYPVYFASEGEDINAVLADVK+NDATGQ
Sbjct: 121 PKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQ 180

Query: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 240
           LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDA+QLPTIAIVASYDTFGAA
Sbjct: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDANQLPTIAIVASYDTFGAA 240

Query: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300
           PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Sbjct: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300

Query: 301 FDHRLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360
           FDHR+RESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH
Sbjct: 301 FDHRIRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360

Query: 361 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIK 420
           KKINISNPRVAWEHEQFSRLRVTAATLS +SAAPELLERTGGLADNRLFLNE+ IAKSIK
Sbjct: 361 KKINISNPRVAWEHEQFSRLRVTAATLSGISAAPELLERTGGLADNRLFLNESAIAKSIK 420

Query: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480
           LVAESLARHIYRYEGKNIQVFADDSSLA+NPTYIRSWLDLLSRTPRVAPFLSKDDPFISA
Sbjct: 421 LVAESLARHIYRYEGKNIQVFADDSSLAINPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480

Query: 481 LKK----------------------------------VASVTFDLLLLLVLGSYLVLLFC 531
           LKK                                  VASVTFDL+LLLVLGSYLVLLFC
Sbjct: 481 LKKELEVHTHDVSLQHEPFDGMFTFYDSTAAKLHIYQVASVTFDLVLLLVLGSYLVLLFC 540

BLAST of CaUC01G012040 vs. ExPASy TrEMBL
Match: A0A6J1CME5 (Nicalin OS=Momordica charantia OX=3673 GN=LOC111012612 PE=3 SV=1)

HSP 1 Score: 951.8 bits (2459), Expect = 1.2e-273
Identity = 492/564 (87.23%), Postives = 503/564 (89.18%), Query Frame = 0

Query: 1   MAPRKPREPQVLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATL 60
           MAPRK RE +VLESFYPVVALVFILVACVELCDAATVVDVYRLI YDISGVPFGSRAATL
Sbjct: 1   MAPRKAREREVLESFYPVVALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKN 120
           NHHA SLHFP  ADLSR+V+IIPL ELNITF++ECISQKK LGGLL LLP+  GSD  KN
Sbjct: 61  NHHAGSLHFPPGADLSRTVVIIPLCELNITFVKECISQKKXLGGLLFLLPKIFGSDDTKN 120

Query: 121 DDIKCPHNGEGMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLAT 180
           D  KCP+NGEG +K+LL ELERLL+H  IPYPVYFASEGEDI AVLADVK NDATGQLAT
Sbjct: 121 DGTKCPNNGEGTVKNLLGELERLLVHENIPYPVYFASEGEDIGAVLADVKRNDATGQLAT 180

Query: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL
Sbjct: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300
           SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360
           RLRESIDYAICLNSIGSWDDKLWLHVSKPPEN YIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENVYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVA 420
           NISN RVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFLNE+ IAKSIKLVA
Sbjct: 361 NISNLRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLVDNRLFLNESAIAKSIKLVA 420

Query: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK 480
           ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSR PRVAPFLSKDDPFI ALKK
Sbjct: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRMPRVAPFLSKDDPFILALKK 480

Query: 481 ----------------------------------VASVTFDLLLLLVLGSYLVLLFCFLV 531
                                             VASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVSMQHEAFDGMFTFYDSTAAKLHIYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

BLAST of CaUC01G012040 vs. TAIR 10
Match: AT3G44330.1 (INVOLVED IN: protein processing; LOCATED IN: mitochondrion, endoplasmic reticulum, plasma membrane, vacuole; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nicalin (InterPro:IPR016574), EF-Hand 1, calcium-binding site (InterPro:IPR018247), Nicastrin (InterPro:IPR008710); Has 245 Blast hits to 243 proteins in 99 species: Archae - 6; Bacteria - 10; Metazoa - 139; Fungi - 0; Plants - 46; Viruses - 0; Other Eukaryotes - 44 (source: NCBI BLink). )

HSP 1 Score: 755.7 bits (1950), Expect = 2.4e-218
Identity = 388/552 (70.29%), Postives = 446/552 (80.80%), Query Frame = 0

Query: 11  VLESFYPVVALVFILVACVELCDAATVVDVYRLIHYDISGVPFGSRAATLNHHASSLHFP 70
           V ES YP++AL+ ILVACVELCDAATVVDVYRLI YDISGVPFGSR ++LNHHA+SL F 
Sbjct: 15  VFESMYPILALMLILVACVELCDAATVVDVYRLIQYDISGVPFGSRFSSLNHHAASLSFQ 74

Query: 71  SAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLILLPRFLGSDGPKNDDIKCPHNGE 130
             ADLSRSVLI+PL EL+I F+Q+ ISQK+ LGGLLILLP+           +   ++G 
Sbjct: 75  RGADLSRSVLILPLRELDIAFVQDYISQKQSLGGLLILLPQTFRPGNVGGGSLSSENDG- 134

Query: 131 GMIKDLLVELERLLIHSTIPYPVYFASEGEDINAVLADVKNNDATGQLATATTGGYKLVV 190
              + LL +LE+LL+H  IP+PVYFA E E+ +A+LADVK NDA GQ ATATTGGYKLV+
Sbjct: 135 --FRSLLGQLEKLLVHGNIPFPVYFAFENEETDAMLADVKKNDALGQQATATTGGYKLVI 194

Query: 191 SAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSG 250
           S +EPRK+ S TITNIQGWLPGL+++GD+SQLPTIA+VASYDTFGAAP LSVGSDSNGSG
Sbjct: 195 SVSEPRKIASPTITNIQGWLPGLRAEGDSSQLPTIAVVASYDTFGAAPALSVGSDSNGSG 254

Query: 251 IVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRLRESIDYAI 310
           +VALLE+ARLFS+LYSNPKTRG+YNLLF LTSGGPYNY GT KWL+S D R+RESIDYAI
Sbjct: 255 VVALLEVARLFSVLYSNPKTRGKYNLLFALTSGGPYNYEGTQKWLKSLDQRMRESIDYAI 314

Query: 311 CLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWE 370
           CLNS+GSWD +L +HVSKPP+NAYIKQIFE FSNVAEDLGF+V LKHKKINISN RVAWE
Sbjct: 315 CLNSVGSWDSELLIHVSKPPDNAYIKQIFEGFSNVAEDLGFQVALKHKKINISNSRVAWE 374

Query: 371 HEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNENKIAKSIKLVAESLARHIYRY 430
           HEQFSRLRVTAATLSELS  PELLE  G L+D R  +NE+ I K +KLVAESLA+HIY +
Sbjct: 375 HEQFSRLRVTAATLSELSTPPELLENAGSLSDTRQLVNEDAIIKGVKLVAESLAKHIYGH 434

Query: 431 EGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISALKK---------- 490
           +GK+I++FADDSSLAVNP Y+RSWLDLLS+TPRVAPFLSK++P I ALKK          
Sbjct: 435 QGKDIKIFADDSSLAVNPFYVRSWLDLLSQTPRVAPFLSKNEPLIMALKKELEDYTAEVS 494

Query: 491 ------------------------VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLI 529
                                   VASVTFDLLLLLVLGSYL++LF FLVITTRGLDDLI
Sbjct: 495 VQHESLDGSFTFYDSTKASLNIYQVASVTFDLLLLLVLGSYLIVLFSFLVITTRGLDDLI 554

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894420.18.7e-28790.60nicalin-1 isoform X2 [Benincasa hispida][more]
XP_023520704.12.2e-28288.48nicalin-1-like [Cucurbita pepo subsp. pepo][more]
XP_008463652.11.9e-28189.54PREDICTED: nicalin-1 [Cucumis melo][more]
KAG6583712.11.6e-28087.83Nicalin-1, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022973118.17.9e-28087.65nicalin-1-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q6NZ075.7e-6332.10Nicalin-1 OS=Danio rerio OX=7955 GN=ncl1 PE=2 SV=1[more]
Q8VCM81.7e-5931.74Nicalin OS=Mus musculus OX=10090 GN=Ncln PE=1 SV=2[more]
Q5XIA11.7e-5932.11Nicalin OS=Rattus norvegicus OX=10116 GN=Ncln PE=2 SV=1[more]
Q969V31.4e-5831.88Nicalin OS=Homo sapiens OX=9606 GN=NCLN PE=1 SV=2[more]
Q5ZJH21.4e-5034.00Nicalin OS=Gallus gallus OX=9031 GN=NCLN PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3CJR59.1e-28289.54nicalin-1 OS=Cucumis melo OX=3656 GN=LOC103501746 PE=4 SV=1[more]
A0A6J1IC523.8e-28087.65nicalin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111471642 PE=4 SV=1[more]
A0A0A0LYB73.8e-28088.83Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G167840 PE=4 SV=1[more]
A0A6J1EL153.2e-27987.65nicalin-1-like OS=Cucurbita moschata OX=3662 GN=LOC111434272 PE=4 SV=1[more]
A0A6J1CME51.2e-27387.23Nicalin OS=Momordica charantia OX=3673 GN=LOC111012612 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G44330.12.4e-21870.29INVOLVED IN: protein processing; LOCATED IN: mitochondrion, endoplasmic reticulu... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF05450Nicastrincoord: 225..361
e-value: 1.6E-8
score: 34.3
NoneNo IPR availableGENE3D3.40.630.10Zn peptidasescoord: 145..413
e-value: 6.3E-22
score: 80.5
NoneNo IPR availablePANTHERPTHR31826:SF7NICALINcoord: 10..480
coord: 480..529
NoneNo IPR availablePROSITEPS51257PROKAR_LIPOPROTEINcoord: 1..28
score: 7.0
NoneNo IPR availableCDDcd03882M28_nicalin_likecoord: 139..428
e-value: 1.37983E-115
score: 342.039
NoneNo IPR availableSUPERFAMILY53187Zn-dependent exopeptidasescoord: 201..427
IPR016574NicalinPANTHERPTHR31826NICALINcoord: 10..480
coord: 480..529
IPR018247EF-Hand 1, calcium-binding sitePROSITEPS00018EF_HAND_1coord: 245..257

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC01G012040.1CaUC01G012040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009966 regulation of signal transduction
cellular_component GO:0005789 endoplasmic reticulum membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane