CSPI01G14970 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI01G14970
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionNicalin
LocationChr1: 10611123 .. 10621224 (+)
RNA-Seq ExpressionCSPI01G14970
SyntenyCSPI01G14970
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAAAAGAAATAAAAAACTTGACTTGCGAAAGAGGCCCTTACTCAACAAAGGTTGGAAGGTCAAGTGAGGTGTATGCGGGCGGTGGGCATCTCTCCTCAACGGCGAGAAGAGCAGCAAAAGCCGCGAGCCGATTCAATTTCCATGTGCTTATCATCCATTTTATTCTAATCAAAATTCCCTTCCAGTCTCATCCTTTTTCTTCTCATATCTTCAAACGCCATTTCCTCCCTCAGTCTGAAGCTCCAGATCTTGCATCCATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTTTTGATTCCTTCTACCCTGTCCTCGCCTTAGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCTGCCACCGTCGTCGATGTCTACCGTCTCATTCAGTATGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCAACTGGTGCTGATCTCTCTCGTACTGTTCTCATCATTCCGCTTTGTGAGCTCAATATGACTTTTCTCCAAGGTACCACCATTCTTCTCTTTTTTTAACTACACTTCCTACTTTCTTAGCTGCTGGATCGTCGTCGTTAGTATTTGGAGATTTTAGTAAATCAGTTGTGAACTGGATTCAGCTGTTTTGGGGTTGAATTACTCAGGTCATAGTTACAAGTGTAGATAGTTTCTTTTGTGAAGTTTTATCGGGGCGAAATTTCCATGATGAGCTGGATTTGATTATACTGAACTAGTTAAAGAAGTTGTTCATACTTTGAGTATTTGTATTCAAAGGTTTGATCTTCTTTCTATTACCAAACAGTAGTGTGTTTGCAGACTTGAAGAACTTTGTTAATCTAATTTCTTCTCTATTTGAAGGAAGAAGAATATGATTGTAGGCTTTGCTCTTACTTTTTAACTTTTAACGTATCAGACATCCTGTCTATTTTTCTTAGTGAGGAATTATATGACCTATATTTTATAAGCATTGTGGGTTTACTACTTCCCTTCTTTCTTCTGTTGATAACCGGGCTATCTTTTTGGTCTTGTCAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGGTTTTACTTCCCAGGATTCTTGGTTCGGAAAGCCTGAAAAATGATGACATTAAATGTCCAAATGGAGAGGGGGTGATCAAGGGTTTATCGGTTGAACTTGAACGGTTGCTCGTACATTCTACTATACCTGTGAGTGAAATGAAAATAACTTCTGGTGATTCTTCATCACTTCACTAGACATTTTCCATGTTTATTCATTGTGTGCTTATTCTGAATTCTGTTTTATATCATGTTCCTTCTATATATGGTTTGAAACATTTAAGCATTTAACTTTCATTTATACTCATCTGAACTTTGTGGTTCATTCTACATTAAATGAATGCACGTTATCTTTTCTTAGTTTGTGAATTATGACCACCTTCTCGTATTCCTTTACTTCAATTTCTTTTTCAGTTTCTGTAATGTGTTTGCATTTCTGAGAGTTTCTATGTTGTTCCAGTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTGATGCTGTTTTGGCTGATGTCAAGAACAATGATGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTTAGTTGAATATATTATGAAATTTTGGGTGCTTTAGTGGATTTCAATATATTTTTATCAGTTTATGTAGTTTCTACTATGTTTTATATTTTAATTCTGATTTTGATATGATTATTAAGGGTTATTATTATTATTTTGCTGCAAGCATAAACCAAACCTTCTTTTGCAACAGACAACAACATTAAGTTAGAATGTTAAATTATATTATGTTTCTCTTTATCTTTTATTTTCGCTACTGTCTCCAATTTTCCCATTAATTACCTTCATCAGTTCCTACTTTTGCATAAAAGATTGGCAATGTTTATTCTGTTAACTTCTTTTATTGTTTAATTCATACCCAAGGAATTTTTGGTAATTCTGATCTTGGGGTGATGTTAATTAATATTTTGTTTTGAAAAGTCATAGAAGTTGATAAAGTTTCAACACTTCTTCAATAACTTATAGAATTGTTTTGGATTAGGGCTGTCTTTTGTAACTGGGTTTGAGGTCCTTTTTGTTCTTGGGCTTATTTCTTTATATGCCTTTGAATATACTTTCACCTCTCATTGAAAACTTGGTTTTTTTGATTCAATAAACCAAAGTATGATTGTTATTCATTTTCTTTTTTGGTGTAATTTCTAATGTCTTGACACTTTTTGATGTATTGTAATTTACTTGATACTACAAAAACATCCATCAACCTATATTGATTTGATGTCAGATCATTCAGTTTCTCATTAAAAAGGTACTTTCAATGTCAACTGCCATTGTGTCATTTGCAAGTGCTTTTATCAAATAATTTCATATTATCTAGAATCTTTTTTTTCTCATAAAGAAAACTTGAGAGCTTGAGGATAAGTAGACAATTTCTAATATTCTATGTATTCATACAGGTATAAGCTTGTTGTTTCGGCAGCAGAACCAAGGAAACTTGTATCTTCCACTATTACAAATATTCAGGTATCTTTTTCAGTTTCCTTAACATGTGAACGCCTATAGTTGTTCTGTTATTGTATATAATTATTTGAAAGCTTTTACACAGGAAAAGTGGTTGGATTGTTGATCTTATAGTCGTATATATACATGTATATATATTTTGGTAAGAAACCGAACTTTCCTCGAGCTAAATGAAGGAATATACAGGAGTGCACACACGTACACGCACAAAAGAAAAAGCCGAACATGCATGAACTATAAGAATGGGCTCCAATCCAAAAAATTCAAACCAATGTCACAAGTACAAAAAGGCCGTAGTCTTGCTATGCAAATTTTGGCTCTCAATCCCTAGGCCACCAACCACCACCGACCACTACTCCCAGTTAACCATGTGAGATCACCATCCCATAAGAAATTCACATCACTCACTTGTTACATAGTTTCTTGCTGGTTGGATGGTTCTTGTGGTTCGCCTCCGAGTCTAAAACAGAGTACATACGGTTTCCATTCCTTTGAGTGGATGTTGTTGGGTGGGGTCAAAAACTTTTCTAGAAATCTTGGAAAGACATTTGATATGAGCTTCCATCCTTTTTTTATTTGGTCTAGTGTTTTGTGGATGATAGGAGGGAAACATACTTTTGGGAGGGTTAAGTGTGGGGGAGGGGGGATAGAGTATAGACCCCTATTCTCCACATGCCCTTCCTTATTTCACTTGTCCTTGATGAAAGATTGTTTCGGGGCTAATGTTTTGCACTCTTCTAGGAACTTCTCCTCTTTTCCCTTGTCATTCTTTATCAGCATAGGAACGTGAAAGTTGTGGCTTTCTTATCTTTGGTTGGGGAGTTTGAGCTTATGCAAGGGAGGGAGGATGTTGGTTTTAGGAGCTTAACCTCATTGTGGGTTTTTCTTGTAGCTCATTCTTCATTTCTTCCAGTGCTTGTTTGAACCCTTCTTTGACCAATGAGTCTACTTTTTCAATTATGTGGAAGATGAAAATTCCAAGGAATTTTCGATTCTTCGTTTGGCAGGTCATCCTTGGAGGAGTCAACACTTTAGATCAACTTACTAGAAAGATGTCTACTTTGATAAATAAAACCATGTTGTTACATTTTCTGCAGAGGGTGGGGAAAGATCTTTTGGGTGTTTGACTTTTAGTATGCTAGACACAAAGGGTATAGGAAGATGATTGAGTAGTTCTTTTTCCATTTTCCTTCTCAAGAGAGGAGTTTTTTGTGGCAAGGAGTTTGCGTTGTGTTGTGGGTTTTGTGGGGGGAGTGGAATAGAACCTTCAGAAGGGATGAGAGGGACTCAGACCTTAGAAATGTTTGGTCCTTCATTGGATTTTGTCTCTCTTTGGGCTTGAATGACAAAGATTTTTTGCAATTATTTCTTAGGTCTCATTTCACTTGATTGGATCCCATTTTTTAGTTGGCTTTTTTGCTTGGTTTTTTTTTCGGATGACTTTATGTTCTTTCTCAATGGCAGTTCGGTTCTTTATAAAAAAAAATTAAATCTCTATTTGGATAAGTAGGATAATATGTAATTTTATTCAAATATACATTTATTATTCTACTTATACTAACACAGACATTTATAAATCGTTTGCTGACAGATTTTTTGGTCTCATTAGGGAGTTCTGGTTCCTTTTCTTTTGGTCCTCCTTGTTCATTGTTTGATAGAGAAATAGTGGAGGTTTTCTCTCTCCTGTTTTTGCTAAGGGAGTGCGATTTCAGGATTGGGAGAAAAGACTTCTGAGTTTGTAGTTTTTCTTGTAAAGTTTTCTTTTGTGGCTTATTGGACCTTCTCTTGTTAGCGAGCTTGCCTTTCTGCTCTTTGGAGGATAAAAACTCCAAAGAAGGTTAAGTTCTTTACGTGGAAGGTCTTACATGGACGAGTTAACACTCTAAATTGACTCTCGAGGAAGTTTTCCTTGTTAGTTGACTCGTTATGTTGCATTCTTTGTCAGAAGGTGGAGGAAAACTTGGATCATATTCTTTGGACATGTGAGTTTGTGAGATATTTTGAGTTGCTTCTTTCAAACTTTTCGGTTTCGTGCTCTTTATTATTTTTAAGTTCTTGGTTACTTGAGAATGTGATGAATATGAAAATTTGTGGCTTAGCTAGGAGTTCATTACGTTTGGATACAGTACATTTTAAGTTATGTTTTTTTAGTACCCGAGTGTTCTGTAGTTACCATCTTATTGTGTTATTGAGTTGTTGGAGATCCTTTTATAATCCTTTTGAGTCCCTAGCTTTTTTTGGGTCCCCTATTCCCTTATTAGTTATCTGACCCCTCCGTCAAATATATTGGTTAAATTAAATTTAAAATTTCACGTGGCATTCATTTCAAGGTTCTCTCCATTGACTCTTGAGGAGGATTGGAGTAGAGAAGTCAACAAGTCAACCAAGAGGTCGTGTGTCAAGAATAGTGTTGTAAAACTGCAAATTGTGTATTTCCCATATAATTTATCATTTCCCTTGGTGTATTTATGCAGTCTGTGTTCTGTTATTGAGGTGCTGTGATTAGCTTTGTTCTCCCTTCATTTTTGTGTCTAATTCTTTTTAATAGTGTTGCATTGTTTTACCTTTATTTATTTTTTATTTTTTTAATTGGTGTCAGGGTTGGCTTCCTGGACTAAAATCCGATGGGGATGCTAGTCAACTCCCAACAATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGTATGTGACTTGTTTAATGCACTAACAATACCTACCATATTTTATCTATATTTTATATAGACGATGTGGATCATCTCTCTGGTTGTAGGACTTATCTGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTCGAAATTGCAAGATTATTTTCTCTTCTTTATTCCAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGACTCACTTCTGGCGGGCCTTACAACTACAATGGGACTCACAAGGTTTTCATCCTTCTTTAGTACATACAATCATTGTAGTTATTACTTATCATTACAATATATTCTATTTAGTTGAATATCTTCATTTTGATTATTATTTAATTTTTTTCTATAATTGTTTATATTTGTTGTCTATTAATGTAAAGTATTGCTTTCCCAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGGATTGACTATGCTATTTGCTTAAATAGTATTGGTTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCAGAAAATGCCTACATTAAGCAAATCTTTGAAGTAGGCATATGTTCTTTTAAAATGTTCATTGTCATGTAATATTAAAAAAATGTTGATGTCTTAGTACTGTGTTGTTTATTACAACATGCTGATTGTGGGTGGCCTGGACTCTCTGTAGGATTTCTCAAATGTTGCGGAGGATTTGGGCTTTAAAGTTGATTTGAAGCACAAGAAGATTAATATTTCAAACCCTCGAGTGAGTTAAATGCTAATACACCTGAATTTTTTCTATTTTTATTTTTCTTACAAAATTAATTTTCACATGAAGAAATATTATGAAATTCAGTTATTGACATTTGTAAAATGTTATAGAATCTAAAACATTATATTGAAACATTACCGAAATAGAAAAAGAGGAAAAAGGGGAGAGAGTTTGAAAGGTTATTCTCTCCCCAGGACTAGTCTGAATCCCACCTACCTGTTACAGTTACACGCATACATCAGTACCTAGCCACCTACACGATCTTCCTTCAATAACACCTGAGGCTGTCCCATACCCTCATTGTCCTCCTAACAAAATTCTCCCATCCATAAGCTTCGTGCTACCACACCATGCAAATCTAATGGGTTGGCCTAGCGGTAAAAAGGGGGACATAGTCTTAGTATTAACTAAGAGATCATGGGTTGTTTTAATCCATGGTGACCATCTACCTATAAATTAATTTCCTGAGTTTCTTTAACACAAAAACGTTGTAGGGTCAGGTGGGTTGTCCGTGAGATGAGTCGAGGTGTGTATAAGCTGGTACGGACACTCACGGATATAAAAAAAAAACATGACTTTAGCATTTATATTCCTAGTCCCTAGTCCCTTGACAAGTGCATGTTTCAAATGTCATGTTACTTACCCACTTAATAGAAAGCAAAAAAATGTAACAAGAAATTGGCAATGGCCAATGTGGACAGCATCATTTGTCTTGTGGCTTGGATAAGCTGGCTCAAACTAAAGTGGGCATGCTTTTTATCTTATAATTAGTCAATAGTCAAAGCCAACCCTGGATGCTTTTAATAATCGTATCCTGTTGTCATTGCTCCAAGAAAAGGATATTTCTAGGATAATAGCTACATGTACCAGCACAGAGGAGACACTCTTTTTTTCTTTTTCTTTTTTCCTTTTATACAGGGGTGGGAAAGGACAAAGCAGGAGTCAAAGACTTTTGACAACTGACCTAATTTACATTGCCATCATCACATTTTTCATAATGATGTGTGCTATTTATTTAATCTTTGTTTCAAAAGAGTGTTCTAGGTTTCCCTTAGCATAGTATACAACTCTTTTGTTTCTGATTATTATTTATTGTAGCCCCCTTGATAAATCCTTTGTATGCTGTAGGTAGCCTGGGAGCATGAACAGTTTTCAAGATTGAGAGTAACCGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTCTTGGAAAGGACTGGAGGTTTGGGTGACAACAGGTTCTATTTTTAAAAAATTGATCAGTTTATCAGAGTTTCCTACTTCTGTTCCACTTCCATGTTTGCATGTGTGCAAGCCATATTTATTTCTGAGTGATGTGTACTCTGACATGACTAACTTCCAACAGATTGTTTTTGGACGAGAGTAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAGTAAGTTTTATCAGTCTTCTTAGACATAATCTTTAAGCTAGTTCTTATCATATTTATGCGTTTCCTGTTTCCAACTGTGAGTGTTCGTTTTTGGCAAACAAGAATTCATTTCAAGTTCAGTATGTTTAACATGAGATATAGCTAGCTTCTATTGTTATTTATGAGTGGACACGAATACATGACTTACATTATTAATGCACATTTTTCAGATTTTTATATCATTTTTCAGAATTCTCTAGTGCATCTTAACAAAAGAAAAAAAAAAAAAAAAAGGAAAAGAAAAGAAATCTCTAATGCACATTTCTTTGCATTTTATGCAGAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAATCAATCCAACTTTTATTCGGTCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGACGACCCTTTCATAACGGCATTAAAAAAGGTTTTATATTATAAACAAACTCGTGCCAGTTGATACCATCTACTTTCATTTAGCTTCGTGTACTGTGTTGACCCAAAGATTGATTGGCCATCTGATAGAAAAAAATAAGATATTGTTCTGAAATGTATTTATGTTCTACTGGCTCCTTTTCGTTAAAAGGTGACTCCTTCCTTTGTGTATTTTCATTAGAAAAGGTTTTATATCATTGGAATAAAATTGTCTGTTGTTACCATTCTTGCCGCTTCATTTAGCTCAACCTAAACATATTGCTTGGTAACATGCGTAACGTGCTTAAATATTCTGACTTTGGAGAAATTGCAACCTTCTATTTGAAATTAGTTTTTTAAAATCTTGACTATTTTTGATATGGCAGAACTACTACGGTTAGTGCTGATCCATATTTTCCTCTCATCTTTGTCTGATACAAAAGCTTGAAATCTATTTATGAAATGCAATATTATAGTGGTGAATTAGTCTGGCTGGAGTGACTCATGCGAATGCTTGATTCACGTTATCTCACTCTCAGCCATCATGGAATAATATGTTTCTACTATTTTCAGGAACTGGAGGTCCATACCCATGATGTGAGCTTGCAACATGAAGTATTTGAGGGGATATTCACCTTTTATGGTTCAACTGCAGCTAAACTTCACGTATACCAGGTACTTTGCAGATAAATTGTGAACTTGGCTCTCTAGTCGGTTGTTGGTAACGTGTTTTGGTCCTTTGGTGTTTTTTCTTGACTTCAATCTAATAAGCAAAGCTTAATGTTTGTTGTTTCAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTCTTGGGATCTTATTTAGTTTTACTCTTCTGTTTCCTTGTGATCACAACCAGGGTATGGTTATTATTTACTTTTTTAATTTTAGAGAATGATCTCATGACAGAATTCCTTAAACATAAATTAAAAAGATAAATGTCTATGTGTGCATAAAAGGGCTCTTTGGACCTTTTCTATATCAAGCACTCACCACTCAAAATTATAGAGCTTGTTTAGAATATAACTTTCCAAGCGTAGGAATAACATAAATATGGTATTGGAAAAGTTCAAAATACACCAGACTGTTTTGTGGAAGAACACTTGGTTGGTGTTTTTGGCTCGAGTGCTTAATTGTGAATCACTCTAAAACACTAATCTACATTGTTAAAAGAAAGCCGAGGGAAAGGTTTAGGCTCTTCGCCTAGCTTTGTCTCTTTCTTTTCTCGTTGTTGAAACTCTTTTTTTTCTTTCTTGTGAAAGAGTTGATGTAGTGGATAATATCAAAGTGATTGTCAAAAATATTTATTTTAAAAGCACTCTTAAACACATCTTGAGTTTGGAATGACTTGTGAAGATATTGTTTTTAAGTAAAATATTTGTTCATCTCCTTTGGAACACGCTTGAAGTGTTTCGCTTTTTCTTTTTTCTTTTTTAATTATACTCCATACTCACAAAATTTATCTGTTCCATAGGGTCTTGATGATCTGATCAGTTTATTTAGACGTCCTCCTTCCCGAAAAGTAAAAACAGCTTGATGAGTTGAGTAGATTTTGATGCTTGGATTTGCATCATTGCCACCCGAATTTTTGCAATCTAGATTAGACAGGCCTAACGATGGGGTCCAAGGTGTCAAACCAATCATCATCAGAGCTGGAGGAACTGAAGGGGATTTTGATGCTTGAATTCCTCCCAGGATTCTCTGTTTCTTTCTTTTGACATCCTTTAGTTGTTATATCATCTGTGACATAAAAACAGTTACGAGAACTCTCGAGGTGCGATATTTTTTTCTTTGCTCTCCATATTCCTACTGTTTAACCCTCCATTTGTGGCTTCCACTTTTGACAATGATTTTTTTCCCCTTGGAATTTGTTGGGTCTATTATTTCTTTGCTCGGTTGATGTTAGTGAGTCGCAGGCAATAGATTCAGTTCAGCAGTTAAGAAAATCTTTCTCAGAGAATATGTTGAGGTTATTTATTTATTAGCATTGTTATTATTTTACAGTTGTCTCGTGGTCTTTTCCCCCTTGTTTTGTTGATTATTCTGTTTTATCAAATGTTTCAGTGGCATAGAGTTAAGTTACTGCTAAATAATACAAAGATTTCATCCATTGCTACACTCATCTAGAATTTTTCACTTTTATATTA

mRNA sequence

ATGAAAAAAGAAATAAAAAACTTGACTTGCGAAAGAGGCCCTTACTCAACAAAGGTTGGAAGGTCAAGTGAGGTGTATGCGGGCGGTGGGCATCTCTCCTCAACGGCGAGAAGAGCAGCAAAAGCCGCGAGCCGATTCAATTTCCATGTGCTTATCATCCATTTTATTCTAATCAAAATTCCCTTCCAGTCTCATCCTTTTTCTTCTCATATCTTCAAACGCCATTTCCTCCCTCAGTCTGAAGCTCCAGATCTTGCATCCATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTTTTGATTCCTTCTACCCTGTCCTCGCCTTAGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCTGCCACCGTCGTCGATGTCTACCGTCTCATTCAGTATGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCAACTGGTGCTGATCTCTCTCGTACTGTTCTCATCATTCCGCTTTGTGAGCTCAATATGACTTTTCTCCAAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGGTTTTACTTCCCAGGATTCTTGGTTCGGAAAGCCTGAAAAATGATGACATTAAATGTCCAAATGGAGAGGGGGTGATCAAGGGTTTATCGGTTGAACTTGAACGGTTGCTCGTACATTCTACTATACCTTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTGATGCTGTTTTGGCTGATGTCAAGAACAATGATGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTATAAGCTTGTTGTTTCGGCAGCAGAACCAAGGAAACTTGTATCTTCCACTATTACAAATATTCAGGGTTGGCTTCCTGGACTAAAATCCGATGGGGATGCTAGTCAACTCCCAACAATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGACTTATCTGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTCGAAATTGCAAGATTATTTTCTCTTCTTTATTCCAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGACTCACTTCTGGCGGGCCTTACAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGGATTGACTATGCTATTTGCTTAAATAGTATTGGTTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCAGAAAATGCCTACATTAAGCAAATCTTTGAAGATTTCTCAAATGTTGCGGAGGATTTGGGCTTTAAAGTTGATTTGAAGCACAAGAAGATTAATATTTCAAACCCTCGAGTAGCCTGGGAGCATGAACAGTTTTCAAGATTGAGAGTAACCGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTCTTGGAAAGGACTGGAGGTTTGGGTGACAACAGATTGTTTTTGGACGAGAGTAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAATCAATCCAACTTTTATTCGGTCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGACGACCCTTTCATAACGGCATTAAAAAAGGAACTGGAGGTCCATACCCATGATGTGAGCTTGCAACATGAAGTATTTGAGGGGATATTCACCTTTTATGGTTCAACTGCAGCTAAACTTCACGTATACCAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTCTTGGGATCTTATTTAGTTTTACTCTTCTGTTTCCTTGTGATCACAACCAGGGGTCTTGATGATCTGATCAGTTTATTTAGACGTCCTCCTTCCCGAAAAGTAAAAACAGCTTGATGAGTTGAGTAGATTTTGATGCTTGGATTTGCATCATTGCCACCCGAATTTTTGCAATCTAGATTAGACAGGCCTAACGATGGGGTCCAAGGTGTCAAACCAATCATCATCAGAGCTGGAGGAACTGAAGGGGATTTTGATGCTTGAATTCCTCCCAGGATTCTCTGTTTCTTTCTTTTGACATCCTTTAGTTGTTATATCATCTGTGACATAAAAACAGTTACGAGAACTCTCGAGGTGCGATATTTTTTTCTTTGCTCTCCATATTCCTACTGTTTAACCCTCCATTTGTGGCTTCCACTTTTGACAATGATTTTTTTCCCCTTGGAATTTGTTGGGTCTATTATTTCTTTGCTCGGTTGATGTTAGTGAGTCGCAGGCAATAGATTCAGTTCAGCAGTTAAGAAAATCTTTCTCAGAGAATATGTTGAGGTTATTTATTTATTAGCATTGTTATTATTTTACAGTTGTCTCGTGGTCTTTTCCCCCTTGTTTTGTTGATTATTCTGTTTTATCAAATGTTTCAGTGGCATAGAGTTAAGTTACTGCTAAATAATACAAAGATTTCATCCATTGCTACACTCATCTAGAATTTTTCACTTTTATATTA

Coding sequence (CDS)

ATGAAAAAAGAAATAAAAAACTTGACTTGCGAAAGAGGCCCTTACTCAACAAAGGTTGGAAGGTCAAGTGAGGTGTATGCGGGCGGTGGGCATCTCTCCTCAACGGCGAGAAGAGCAGCAAAAGCCGCGAGCCGATTCAATTTCCATGTGCTTATCATCCATTTTATTCTAATCAAAATTCCCTTCCAGTCTCATCCTTTTTCTTCTCATATCTTCAAACGCCATTTCCTCCCTCAGTCTGAAGCTCCAGATCTTGCATCCATGGCTCCTCGTAAACCCCGCGAGCCACAAGTTTTTGATTCCTTCTACCCTGTCCTCGCCTTAGTCTTCATTCTAGTTGCCTGTGTCGAGCTCTGTGACGCTGCCACCGTCGTCGATGTCTACCGTCTCATTCAGTATGATATCTCTGGTGTTCCCTTTGGATCCCGCGCCGCCACACTCAATCACCATGCTTCCTCTCTTCATTTTCCAACTGGTGCTGATCTCTCTCGTACTGTTCTCATCATTCCGCTTTGTGAGCTCAATATGACTTTTCTCCAAGAATGTATATCTCAGAAAAAGCGTCTGGGAGGTCTGTTGGTTTTACTTCCCAGGATTCTTGGTTCGGAAAGCCTGAAAAATGATGACATTAAATGTCCAAATGGAGAGGGGGTGATCAAGGGTTTATCGGTTGAACTTGAACGGTTGCTCGTACATTCTACTATACCTTATCCTGTATATTTTGCTTCAGAAGGTGAAGATATTGATGCTGTTTTGGCTGATGTCAAGAACAATGATGCCACTGGTCAGCTTGCAACTGCAACTACTGGCGGGTATAAGCTTGTTGTTTCGGCAGCAGAACCAAGGAAACTTGTATCTTCCACTATTACAAATATTCAGGGTTGGCTTCCTGGACTAAAATCCGATGGGGATGCTAGTCAACTCCCAACAATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGACTTATCTGTGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTCGAAATTGCAAGATTATTTTCTCTTCTTTATTCCAACCCTAAGACAAGAGGAAGGTACAATCTACTTTTTGGACTCACTTCTGGCGGGCCTTACAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTCTCCGTGAGAGGATTGACTATGCTATTTGCTTAAATAGTATTGGTTCATGGGATGACAAATTATGGCTGCATGTCTCCAAGCCTCCAGAAAATGCCTACATTAAGCAAATCTTTGAAGATTTCTCAAATGTTGCGGAGGATTTGGGCTTTAAAGTTGATTTGAAGCACAAGAAGATTAATATTTCAAACCCTCGAGTAGCCTGGGAGCATGAACAGTTTTCAAGATTGAGAGTAACCGCTGCTACCCTTTCTGAACTCTCTGCTGCTCCTGAGCTCTTGGAAAGGACTGGAGGTTTGGGTGACAACAGATTGTTTTTGGACGAGAGTAAAATTGCCAAGAGTATCAAGTTAGTTGCCGAGAGTCTTGCAAGGCATATTTACAGATACGAAGGAAAGAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAATCAATCCAACTTTTATTCGGTCATGGTTGGATCTTTTATCACGAACGCCTCGAGTTGCTCCATTCCTGTCGAAAGACGACCCTTTCATAACGGCATTAAAAAAGGAACTGGAGGTCCATACCCATGATGTGAGCTTGCAACATGAAGTATTTGAGGGGATATTCACCTTTTATGGTTCAACTGCAGCTAAACTTCACGTATACCAGGTTGCTAGTGTGACATTCGACTTGCTTTTGCTTTTGGTCTTGGGATCTTATTTAGTTTTACTCTTCTGTTTCCTTGTGATCACAACCAGGGGTCTTGATGATCTGATCAGTTTATTTAGACGTCCTCCTTCCCGAAAAGTAAAAACAGCTTGA

Protein sequence

MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKIPFQSHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA*
Homology
BLAST of CSPI01G14970 vs. ExPASy Swiss-Prot
Match: Q6NZ07 (Nicalin-1 OS=Danio rerio OX=7955 GN=ncl1 PE=2 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 1.4e-68
Identity = 193/586 (32.94%), Postives = 308/586 (52.56%), Query Frame = 0

Query: 103 YPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADL 162
           +P+  ++F+++ C    +AA    VYR+ QYD+ G  +GSR A LN  A ++       L
Sbjct: 17  FPLSLVLFLVLVCPLRAEAAHEFSVYRMQQYDLQGQTYGSRNAILNTEARTVEAEV---L 76

Query: 163 SRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGL 222
           SR  +++ L + +    Q+ + Q    G ++++LP      +L  D ++           
Sbjct: 77  SRRCVMMRLADFSYEKYQKALRQS--AGAVVIILPH--NMSTLPQDIVQ----------Q 136

Query: 223 SVELERLLVHSTIPYPVYFASEGEDIDAVLADVK--------NNDATGQLATATTGGYKL 282
            +ELE  L+ +    PVYFA E E++ ++    +        ++ A   L TAT  G+++
Sbjct: 137 FMELEPELLATETIVPVYFALEDEELLSIYTQTQISSSSQGSSSAAEVLLHTATANGFQM 196

Query: 283 VVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNG 342
           V S A+ + +    IT+++G L G  S G+   LPTI +VA YD+FG AP LS G+DSNG
Sbjct: 197 VTSGAQSKAVSDWAITSLEGRLTG--SGGE--DLPTIVLVAHYDSFGVAPWLSYGADSNG 256

Query: 343 SGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR----LR 402
           SG+  LLE+ARLFS LYS  +T   YNLLF L+ GG +NY GT +WL+ + DH     L+
Sbjct: 257 SGVAILLELARLFSRLYSYKRTHAGYNLLFFLSGGGKFNYQGTKRWLEDNLDHTDASLLQ 316

Query: 403 ERIDYAICLNSIGSWDDKLWLHVSKPPEN-----AYIKQIFEDFSNVAEDLGFKVDLKHK 462
           + + + +CL+++G+  D L LHVSKPP+        +K++    ++   DL F   + HK
Sbjct: 317 DNVAFVLCLDTLGN-SDNLHLHVSKPPKEGSPQYTLLKELETVVAHQHPDLKFA--MVHK 376

Query: 463 KINISNPRVAWEHEQFSRLRVTAATLSEL---------------SAAPELLERTGGLGDN 522
           KIN+++  +AWEHE+F   R+ A TLS L               S +P L     G    
Sbjct: 377 KINLADDTLAWEHERFGIRRLPAFTLSHLESHRSPARHSIMDMRSVSPSL--EGAGEATT 436

Query: 523 RLFLDESKIAKSIKLVAESLARHIYRYEGK----NIQVFADDSSLAINPTFIRSWLDLLS 582
              +D  K++++ K++AE+LAR IY    K    ++++F +   + +    + S +D L+
Sbjct: 437 GPHVDLGKLSRNTKVIAETLARVIYNLTEKGVTGDLEIFTE--QMQVQEDQLASLVDWLT 496

Query: 583 RTPRVAPFLSKDDPFITALKKELEVHTHDVS---LQHEVFEGIFTFYGSTAAKLHVYQVA 642
             PR A  L KD   I  L+ +L  +  DV    ++ +  +  F FY      ++ Y+V 
Sbjct: 497 AQPRAAQLLDKDSSIINTLEHQLSRYLKDVKRHLVRADKRDPEFVFYDQLKQTMNAYRVK 556

Query: 643 SVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVK 649
              FDLLL + + SYL +L  +L I   GL  L    RR  + +VK
Sbjct: 557 PAIFDLLLAVCIASYLGVL--YLAIQNFGL--LYGFLRRVTAPRVK 570

BLAST of CSPI01G14970 vs. ExPASy Swiss-Prot
Match: Q8VCM8 (Nicalin OS=Mus musculus OX=10090 GN=Ncln PE=1 SV=2)

HSP 1 Score: 259.6 bits (662), Expect = 9.4e-68
Identity = 186/556 (33.45%), Postives = 299/556 (53.78%), Query Frame = 0

Query: 96  PQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLH 155
           P  F  F P  A++ ++   +   DAA    VYR+ QYD+ G P+G+R A LN  A ++ 
Sbjct: 19  PLGFIVFLP--AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTV- 78

Query: 156 FPTGAD-LSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPN 215
               AD LSR  +++ L + +    Q+ + Q    G ++++LPR +   ++  D ++   
Sbjct: 79  ---DADVLSRRCVLMRLLDFSYEHYQKALRQS--AGAVVIILPRAMA--AVPQDVVR--- 138

Query: 216 GEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATG--------QLAT 275
                + + +E E L + + +  PVYFA E E + ++    +   A+          L T
Sbjct: 139 -----QFMEIEPEMLAMETVV--PVYFAVEDEALLSIYEQTQAASASQGSASAAEVLLHT 198

Query: 276 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDL 335
           AT  G+++V S A+ + +    IT+++G L GL  +     LPTI IVA YD FG AP L
Sbjct: 199 ATANGFQMVTSGAQSQAVSDWLITSVEGRLTGLGGE----DLPTIVIVAHYDAFGVAPWL 258

Query: 336 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFD 395
           S+G+DSNGSGI  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S D
Sbjct: 259 SLGADSNGSGISVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDSLD 318

Query: 396 HR----LRERIDYAICLNSIGSWDDKLWLHVSKPP-----ENAYIKQIFEDFSNVAEDLG 455
           H     L++ + + +CL+++G     L LHVSKPP     ++A+++++    ++   D+ 
Sbjct: 319 HTDSSLLQDNVAFVLCLDTVGR-GSHLRLHVSKPPREGTLQHAFLRELETVAAHQFPDVS 378

Query: 456 FKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTG---GLGDNRLFL 515
           F   + HKKIN+++  +AWEHE+F+  R+ A TLS L +      R G    + D R  +
Sbjct: 379 F--SMVHKKINLADDVLAWEHERFAIRRLPAFTLSHLES-----HRAGPRSSIMDVRSRV 438

Query: 516 DESKIAKSIKLVAESLARHIYRYEGK----NIQVFADDSSLAINPTFIRSWLDLLSRTPR 575
           D   + ++ +++AE+L R IY    K    ++ VF +   + +    I S +D L+  PR
Sbjct: 439 DSKTLTRNTRIIAEALTRVIYNLTEKGTPPDMPVFTE--QMQVQEEQIDSVMDWLTNQPR 498

Query: 576 VAPFLSKDDPFITALKKELEVHTHDVSLQH---EVFEGIFTFYGSTAAKLHVYQVASVTF 623
            A  L KD  F++ L+  L  +  DV   H   +  +  F FY      ++ Y+V    F
Sbjct: 499 AAQLLDKDGTFLSTLEHFLSRYLKDVRQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAIF 540

BLAST of CSPI01G14970 vs. ExPASy Swiss-Prot
Match: Q5XIA1 (Nicalin OS=Rattus norvegicus OX=10116 GN=Ncln PE=2 SV=1)

HSP 1 Score: 259.6 bits (662), Expect = 9.4e-68
Identity = 188/556 (33.81%), Postives = 295/556 (53.06%), Query Frame = 0

Query: 96  PQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLH 155
           P  F  F P  A++ ++   +   DAA    VYR+ QYD+ G P+G+R A LN  A ++ 
Sbjct: 19  PLGFIVFLP--AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTV- 78

Query: 156 FPTGAD-LSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPN 215
               AD LSR  +++ L + +    Q+ + Q    G ++++LPR +   ++  D ++   
Sbjct: 79  ---DADVLSRRCVLMRLLDFSYEHYQKALRQS--AGAVVIILPRAMA--AVPQDVVR--- 138

Query: 216 GEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATG--------QLAT 275
                + + +E E L + + +  PVYFA E E + ++    +   A+          L T
Sbjct: 139 -----QFMEIEPEMLAMETVV--PVYFAVEDEALLSIYEQTQAASASQGSASAAEVLLHT 198

Query: 276 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDL 335
           AT  G+++V S A+ + +    IT+++G L GL  +     LPTI IVA YD FG AP L
Sbjct: 199 ATANGFQMVTSGAQSQAVSDWLITSVEGRLTGLGGE----DLPTIVIVAHYDAFGVAPWL 258

Query: 336 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFD 395
           S+G+DSNGSGI  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S D
Sbjct: 259 SLGADSNGSGISVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDSLD 318

Query: 396 HR----LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIF-EDFSNVA----EDLG 455
           H     L++ + + +CL+++G     L LHVSKPP    ++ +F  +   VA     D+ 
Sbjct: 319 HTDSSLLQDNVAFVLCLDTVGR-GSHLRLHVSKPPREGTLQHVFLRELEMVAAHQFPDVS 378

Query: 456 FKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELS---AAPELLERTGGLGDNRLFL 515
           F   + HKKIN+++  +AWEHE+F+  R+ A TLS L    A P        + D R  +
Sbjct: 379 F--SMVHKKINLADDVLAWEHERFAIRRLPAFTLSHLENHRAGPR-----SSIMDVRSRV 438

Query: 516 DESKIAKSIKLVAESLARHIYRYEGK----NIQVFADDSSLAINPTFIRSWLDLLSRTPR 575
           D   + ++ +++AE+L R IY    K    ++ VF +   + +    I S +D L+  PR
Sbjct: 439 DSKTLTRNTRIIAEALTRVIYNLTEKGTPPDMPVFTE--QMQVQQEQIDSVMDWLTNQPR 498

Query: 576 VAPFLSKDDPFITALKKELEVHTHDVSLQH---EVFEGIFTFYGSTAAKLHVYQVASVTF 623
            A  L KD  F++ L+  L  +  DV   H   +  +  F FY      ++ Y+V    F
Sbjct: 499 AAQLLDKDGTFLSTLEHFLSRYLKDVRQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAIF 540

BLAST of CSPI01G14970 vs. ExPASy Swiss-Prot
Match: Q969V3 (Nicalin OS=Homo sapiens OX=9606 GN=NCLN PE=1 SV=2)

HSP 1 Score: 250.4 bits (638), Expect = 5.7e-65
Identity = 177/553 (32.01%), Postives = 288/553 (52.08%), Query Frame = 0

Query: 96  PQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLH 155
           P  F  F P  A++ ++   +   DAA    VYR+ QYD+ G P+G+R A LN  A ++ 
Sbjct: 19  PLGFIVFLP--AVLLLVAPPLPAADAAHEFTVYRMQQYDLQGQPYGTRNAVLNTEARTM- 78

Query: 156 FPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNG 215
                 LSR  +++ L + +    Q+ + Q    G ++++LPR +   ++  D ++    
Sbjct: 79  --AAEVLSRRCVLMRLLDFSYEQYQKALRQS--AGAVVIILPRAMA--AVPQDVVR---- 138

Query: 216 EGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQ-----------L 275
               + + +E E L + + +  PVYFA E E   A+L+  K   A              L
Sbjct: 139 ----QFMEIEPEMLAMETAV--PVYFAVEDE---ALLSIYKQTQAASASQGSASAAEVLL 198

Query: 276 ATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP 335
            TAT  G+++V S  + + +    I +++G L GL  +     LPTI IVA YD FG AP
Sbjct: 199 RTATANGFQMVTSGVQSKAVSDWLIASVEGRLTGLGGE----DLPTIVIVAHYDAFGVAP 258

Query: 336 DLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-S 395
            LS+G+DSNGSG+  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ +
Sbjct: 259 WLSLGADSNGSGVSVLLELARLFSRLYTYKRTHAAYNLLFFASGGGKFNYQGTKRWLEDN 318

Query: 396 FDHR----LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIF-EDFSNVA--EDLG 455
            DH     L++ + + +CL+++G     L LHVSKPP    ++  F  +   VA  +   
Sbjct: 319 LDHTDSSLLQDNVAFVLCLDTVGR-GSSLHLHVSKPPREGTLQHAFLRELETVAAHQFPE 378

Query: 456 FKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDES 515
            +  + HK+IN++   +AWEHE+F+  R+ A TLS L +  +   +   + D R  +D  
Sbjct: 379 VRFSMVHKRINLAEDVLAWEHERFAIRRLPAFTLSHLESHRD--GQRSSIMDVRSRVDSK 438

Query: 516 KIAKSIKLVAESLARHIYRYEGK----NIQVFADDSSLAINPTFIRSWLDLLSRTPRVAP 575
            + ++ +++AE+L R IY    K    ++ VF +   + I    + S +D L+  PR A 
Sbjct: 439 TLTRNTRIIAEALTRVIYNLTEKGTPPDMPVFTE--QMQIQQEQLDSVMDWLTNQPRAAQ 498

Query: 576 FLSKDDPFITALKKELEVHTHDVSLQH---EVFEGIFTFYGSTAAKLHVYQVASVTFDLL 623
            + KD  F++ L+  L  +  DV   H   +  +  F FY      ++ Y+V    FDLL
Sbjct: 499 LVDKDSTFLSTLEHHLSRYLKDVKQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAVFDLL 540

BLAST of CSPI01G14970 vs. ExPASy Swiss-Prot
Match: Q5ZJH2 (Nicalin OS=Gallus gallus OX=9031 GN=NCLN PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 3.7e-56
Identity = 166/547 (30.35%), Postives = 279/547 (51.01%), Query Frame = 0

Query: 101 SFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGA 160
           SF   +  V +L+      +AA    VYR+ QY++ G P+G+R+A LN  A ++     A
Sbjct: 21  SFLLFVPAVLLLLGPPPAAEAAHESTVYRMQQYELGGQPYGTRSAVLNTEARTVE----A 80

Query: 161 D-LSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVI 220
           D LSR  +++ L + +    Q+ + Q    G ++++LPR +   S+  D +K        
Sbjct: 81  DVLSRRCVMMRLVDFSYEQYQKALRQS--AGAVVIILPRSI--SSVPQDVVK-------- 140

Query: 221 KGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATG--------QLATATTGG 280
           + + +E E L + + +  PVYFA E +++ ++    +   A+          L TAT  G
Sbjct: 141 QFMEIEPEMLAMETIV--PVYFAVEDDELLSIYEQTRAASASQGSASAAEVLLHTATANG 200

Query: 281 YKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLSVGSD 340
           +++V S A+ + +    I +++G L GL  +     LPT+ IVA YD+FG AP LS G+D
Sbjct: 201 FQMVTSGAQSKAIHDWLIPSVEGRLTGLGGE----DLPTVVIVAHYDSFGVAPWLSHGAD 260

Query: 341 SNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR--- 400
           SNGSG+  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ + DH    
Sbjct: 261 SNGSGVSVLLELARLFSRLYTYRRTHAGYNLLFFASGGGKFNYQGTKRWLEDNLDHTDSS 320

Query: 401 -LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGF---KVDLKH 460
            L++ + + +CL+++G   + L LHVSKPP+   ++  F     +     F   K  + H
Sbjct: 321 LLQDNVAFVLCLDTLGR-GNSLHLHVSKPPKEGTLQHAFLRELEMVVASQFPEVKFSMVH 380

Query: 461 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIK 520
           KKIN++   +AWEHE+F+  R+ A T+S L +  + L  +  + D R  +D        K
Sbjct: 381 KKINLAEDILAWEHERFAIRRLPAFTISHLESHRDSLRNS--IMDRRSQVD-------TK 440

Query: 521 LVAESLARHIYRYEGKNIQVFADDSSLA--------INPTFIRSWLD-LLSRTPRVAPFL 580
            + +    H   ++  ++Q   + S+           +P       D L  ++ + A  +
Sbjct: 441 ALTQEYQDHCGGFDEGHLQPNREGSTCRSADLHGSDADPGGAARISDGLAEQSAQAAQLI 500

Query: 581 SKDDPFITALKKELEVHTHDVSLQH---EVFEGIFTFYGSTAAKLHVYQVASVTFDLLLL 619
            KD  F++ L+  +  +  DV   H   +  +  F FY      ++ Y+V    FDLLL 
Sbjct: 501 DKDSTFLSTLEYYMGRYLKDVKQHHVKADKRDPEFVFYDQLKQVMNAYRVKPAIFDLLLA 535

BLAST of CSPI01G14970 vs. ExPASy TrEMBL
Match: A0A0A0LYB7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G167840 PE=4 SV=1)

HSP 1 Score: 1281.9 bits (3316), Expect = 0.0e+00
Identity = 649/650 (99.85%), Postives = 649/650 (99.85%), Query Frame = 0

Query: 1   MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI 60
           MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI
Sbjct: 1   MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI 60

Query: 61  PFQSHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD 120
           PF SHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD
Sbjct: 61  PFHSHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD 120

Query: 121 AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ 180
           AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ
Sbjct: 121 AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ 180

Query: 181 ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY 240
           ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY
Sbjct: 181 ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY 240

Query: 241 FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK 300
           FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK
Sbjct: 241 FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK 300

Query: 301 SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY 360
           SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY
Sbjct: 301 SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY 360

Query: 361 NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY 420
           NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY
Sbjct: 361 NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY 420

Query: 421 IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL 480
           IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL
Sbjct: 421 IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL 480

Query: 481 ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW 540
           ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW
Sbjct: 481 ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW 540

Query: 541 LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ 600
           LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ
Sbjct: 541 LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ 600

Query: 601 VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA 651
           VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA
Sbjct: 601 VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA 650

BLAST of CSPI01G14970 vs. ExPASy TrEMBL
Match: A0A1S3CJR5 (nicalin-1 OS=Cucumis melo OX=3656 GN=LOC103501746 PE=4 SV=1)

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 547/563 (97.16%), Postives = 557/563 (98.93%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV DSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFP+GADLSRTVLIIPLCEL MTFLQECISQKKRLGGLLVLLPRILGSESLKN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 208 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 267
           DDIKC NGEGVIK L VELERLL+HSTIPYPVYFAS+GEDIDAVLADVKNNDATGQLATA
Sbjct: 121 DDIKCTNGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLATA 180

Query: 268 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 327
           TTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+LS
Sbjct: 181 TTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELS 240

Query: 328 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 387
           VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR
Sbjct: 241 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 300

Query: 388 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 447
           LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN
Sbjct: 301 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 360

Query: 448 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 507
           ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE
Sbjct: 361 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 420

Query: 508 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 567
           SLARHIYRYEGKNIQVFADDSSLA+NPT+IRSW+DLLSRTPRVAPFLSKDDPFI+ALKKE
Sbjct: 421 SLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKKE 480

Query: 568 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 627
           LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI
Sbjct: 481 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 540

Query: 628 TTRGLDDLISLFRRPPSRKVKTA 651
           TTRGLDDLI LFRRPPSRKVKTA
Sbjct: 541 TTRGLDDLIGLFRRPPSRKVKTA 563

BLAST of CSPI01G14970 vs. ExPASy TrEMBL
Match: A0A5A7T1R9 (Nicalin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G002170 PE=4 SV=1)

HSP 1 Score: 1055.0 bits (2727), Expect = 1.2e-304
Identity = 530/545 (97.25%), Postives = 540/545 (99.08%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV DSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFP+GADLSRTVLIIPLCEL MTFLQECISQKKRLGGLLVLLPRILGSESLKN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 208 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 267
           DDIKC NGEGVIK L VELERLL+HSTIPYPVYFAS+GEDIDAVLADVKNNDATGQLATA
Sbjct: 121 DDIKCTNGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLATA 180

Query: 268 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 327
           TTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+LS
Sbjct: 181 TTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELS 240

Query: 328 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 387
           VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR
Sbjct: 241 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 300

Query: 388 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 447
           LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN
Sbjct: 301 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 360

Query: 448 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 507
           ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE
Sbjct: 361 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 420

Query: 508 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 567
           SLARHIYRYEGKNIQVFADDSSLA+NPT+IRSW+DLLSRTPRVAPFLSKDDPFI+ALKKE
Sbjct: 421 SLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKKE 480

Query: 568 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 627
           LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI
Sbjct: 481 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 540

Query: 628 TTRGL 633
           TTRGL
Sbjct: 541 TTRGL 545

BLAST of CSPI01G14970 vs. ExPASy TrEMBL
Match: A0A6J1IC52 (nicalin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111471642 PE=4 SV=1)

HSP 1 Score: 1023.8 bits (2646), Expect = 3.0e-295
Identity = 511/567 (90.12%), Postives = 542/567 (95.59%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV +SFYP+LALVF+LVAC ELCDAA VVDVYRLI YDISGVPFGSRAA+L
Sbjct: 1   MAPRKPREPQVLESFYPLLALVFLLVACTELCDAAAVVDVYRLIHYDISGVPFGSRAASL 60

Query: 148 NHHASSLHFP---TGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSES 207
           NHHA+SLHFP     ADLSRTV IIPLCELN TF++EC+SQ+KRLGGLL+LLP+ILGS+ 
Sbjct: 61  NHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECLSQRKRLGGLLILLPKILGSDG 120

Query: 208 LKNDDIKCP-NGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQ 267
            KNDD KCP NG+G+IK L VELERLL+H+T+PYPVYFASEGEDI+AVLADVK+NDATGQ
Sbjct: 121 PKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQ 180

Query: 268 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAA 327
           LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK DGDASQLPTIAIVASYDTFGA+
Sbjct: 181 LATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKFDGDASQLPTIAIVASYDTFGAS 240

Query: 328 PDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 387
           P+LSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Sbjct: 241 PELSVGSDSNGSGIVALLEIARLFSLLYSSPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS 300

Query: 388 FDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 447
           FDHR+RE IDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH
Sbjct: 301 FDHRIRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKH 360

Query: 448 KKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIK 507
           KKINISNPRVAWEHEQFSRLRVTAATLS +SAAPELLERTGGL DNRLFL+ES IAKSIK
Sbjct: 361 KKINISNPRVAWEHEQFSRLRVTAATLSGISAAPELLERTGGLADNRLFLNESAIAKSIK 420

Query: 508 LVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITA 567
           LVAESLARHIYRYEGKNIQVFADDSSLA+NPT+IRSWLDLLSRTPRVAPFLSKDDPFI+A
Sbjct: 421 LVAESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRTPRVAPFLSKDDPFISA 480

Query: 568 LKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFC 627
           LKKELEVHTHDVSLQHE F+G+FTFY STAAKLHVYQVASVTFDL+LLLVLGSYLVLLFC
Sbjct: 481 LKKELEVHTHDVSLQHEPFDGVFTFYDSTAAKLHVYQVASVTFDLVLLLVLGSYLVLLFC 540

Query: 628 FLVITTRGLDDLISLFRRPPSRKVKTA 651
           FLVITTRGLDDLI LFRRPPSRKVKTA
Sbjct: 541 FLVITTRGLDDLIGLFRRPPSRKVKTA 567

BLAST of CSPI01G14970 vs. ExPASy TrEMBL
Match: A0A6J1CME5 (Nicalin OS=Momordica charantia OX=3673 GN=LOC111012612 PE=3 SV=1)

HSP 1 Score: 1021.5 bits (2640), Expect = 1.5e-294
Identity = 515/564 (91.31%), Postives = 533/564 (94.50%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRK RE +V +SFYPV+ALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKAREREVLESFYPVVALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHA SLHFP GADLSRTV+IIPLCELN+TF++ECISQKK LGGLL LLP+I GS+  KN
Sbjct: 61  NHHAGSLHFPPGADLSRTVVIIPLCELNITFVKECISQKKXLGGLLFLLPKIFGSDDTKN 120

Query: 208 DDIKCP-NGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT 267
           D  KCP NGEG +K L  ELERLLVH  IPYPVYFASEGEDI AVLADVK NDATGQLAT
Sbjct: 121 DGTKCPNNGEGTVKNLLGELERLLVHENIPYPVYFASEGEDIGAVLADVKRNDATGQLAT 180

Query: 268 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDL 327
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+L
Sbjct: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 328 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 387
           SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 388 RLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 447
           RLRE IDYAICLNSIGSWDDKLWLHVSKPPEN YIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENVYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 448 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVA 507
           NISN RVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFL+ES IAKSIKLVA
Sbjct: 361 NISNLRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLVDNRLFLNESAIAKSIKLVA 420

Query: 508 ESLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKK 567
           ESLARHIYRYEGKNIQVFADDSSLA+NPT+IRSWLDLLSR PRVAPFLSKDDPFI ALKK
Sbjct: 421 ESLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWLDLLSRMPRVAPFLSKDDPFILALKK 480

Query: 568 ELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 627
           ELEVHTHDVS+QHE F+G+FTFY STAAKLH+YQVASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVSMQHEAFDGMFTFYDSTAAKLHIYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

Query: 628 ITTRGLDDLISLFRRPPSRKVKTA 651
           ITTRGLDDLI LFRRPPSRKVKTA
Sbjct: 541 ITTRGLDDLIGLFRRPPSRKVKTA 564

BLAST of CSPI01G14970 vs. NCBI nr
Match: KGN64956.1 (hypothetical protein Csa_022770 [Cucumis sativus])

HSP 1 Score: 1281.9 bits (3316), Expect = 0.0e+00
Identity = 649/650 (99.85%), Postives = 649/650 (99.85%), Query Frame = 0

Query: 1   MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI 60
           MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI
Sbjct: 1   MKKEIKNLTCERGPYSTKVGRSSEVYAGGGHLSSTARRAAKAASRFNFHVLIIHFILIKI 60

Query: 61  PFQSHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD 120
           PF SHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD
Sbjct: 61  PFHSHPFSSHIFKRHFLPQSEAPDLASMAPRKPREPQVFDSFYPVLALVFILVACVELCD 120

Query: 121 AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ 180
           AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ
Sbjct: 121 AATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFPTGADLSRTVLIIPLCELNMTFLQ 180

Query: 181 ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY 240
           ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY
Sbjct: 181 ECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEGVIKGLSVELERLLVHSTIPYPVY 240

Query: 241 FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK 300
           FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK
Sbjct: 241 FASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLK 300

Query: 301 SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY 360
           SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY
Sbjct: 301 SDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRY 360

Query: 361 NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY 420
           NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY
Sbjct: 361 NLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAICLNSIGSWDDKLWLHVSKPPENAY 420

Query: 421 IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL 480
           IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL
Sbjct: 421 IKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEHEQFSRLRVTAATLSELSAAPELL 480

Query: 481 ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW 540
           ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW
Sbjct: 481 ERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYEGKNIQVFADDSSLAINPTFIRSW 540

Query: 541 LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ 600
           LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ
Sbjct: 541 LDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQ 600

Query: 601 VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA 651
           VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA
Sbjct: 601 VASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLISLFRRPPSRKVKTA 650

BLAST of CSPI01G14970 vs. NCBI nr
Match: XP_004139498.1 (nicalin-1 [Cucumis sativus])

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 563/563 (100.00%), Postives = 563/563 (100.00%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN
Sbjct: 61  NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 208 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 267
           DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA
Sbjct: 121 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 180

Query: 268 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 327
           TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS
Sbjct: 181 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 240

Query: 328 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 387
           VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR
Sbjct: 241 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 300

Query: 388 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 447
           LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN
Sbjct: 301 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 360

Query: 448 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 507
           ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE
Sbjct: 361 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 420

Query: 508 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 567
           SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE
Sbjct: 421 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 480

Query: 568 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 627
           LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI
Sbjct: 481 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 540

Query: 628 TTRGLDDLISLFRRPPSRKVKTA 651
           TTRGLDDLISLFRRPPSRKVKTA
Sbjct: 541 TTRGLDDLISLFRRPPSRKVKTA 563

BLAST of CSPI01G14970 vs. NCBI nr
Match: XP_008463652.1 (PREDICTED: nicalin-1 [Cucumis melo])

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 547/563 (97.16%), Postives = 557/563 (98.93%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV DSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFP+GADLSRTVLIIPLCEL MTFLQECISQKKRLGGLLVLLPRILGSESLKN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 208 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 267
           DDIKC NGEGVIK L VELERLL+HSTIPYPVYFAS+GEDIDAVLADVKNNDATGQLATA
Sbjct: 121 DDIKCTNGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLATA 180

Query: 268 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 327
           TTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+LS
Sbjct: 181 TTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELS 240

Query: 328 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 387
           VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR
Sbjct: 241 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 300

Query: 388 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 447
           LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN
Sbjct: 301 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 360

Query: 448 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 507
           ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE
Sbjct: 361 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 420

Query: 508 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 567
           SLARHIYRYEGKNIQVFADDSSLA+NPT+IRSW+DLLSRTPRVAPFLSKDDPFI+ALKKE
Sbjct: 421 SLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKKE 480

Query: 568 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 627
           LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI
Sbjct: 481 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 540

Query: 628 TTRGLDDLISLFRRPPSRKVKTA 651
           TTRGLDDLI LFRRPPSRKVKTA
Sbjct: 541 TTRGLDDLIGLFRRPPSRKVKTA 563

BLAST of CSPI01G14970 vs. NCBI nr
Match: KAA0035547.1 (nicalin-1 [Cucumis melo var. makuwa] >TYK30987.1 nicalin-1 [Cucumis melo var. makuwa])

HSP 1 Score: 1055.0 bits (2727), Expect = 2.5e-304
Identity = 530/545 (97.25%), Postives = 540/545 (99.08%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV DSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFP+GADLSRTVLIIPLCEL MTFLQECISQKKRLGGLLVLLPRILGSESLKN
Sbjct: 61  NHHASSLHFPSGADLSRTVLIIPLCELKMTFLQECISQKKRLGGLLVLLPRILGSESLKN 120

Query: 208 DDIKCPNGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATA 267
           DDIKC NGEGVIK L VELERLL+HSTIPYPVYFAS+GEDIDAVLADVKNNDATGQLATA
Sbjct: 121 DDIKCTNGEGVIKDLLVELERLLIHSTIPYPVYFASDGEDIDAVLADVKNNDATGQLATA 180

Query: 268 TTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLS 327
           TTGGYKLVVSAAEP+KL+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+LS
Sbjct: 181 TTGGYKLVVSAAEPKKLISSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELS 240

Query: 328 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 387
           VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR
Sbjct: 241 VGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR 300

Query: 388 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 447
           LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN
Sbjct: 301 LRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKIN 360

Query: 448 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 507
           ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE
Sbjct: 361 ISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAE 420

Query: 508 SLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKE 567
           SLARHIYRYEGKNIQVFADDSSLA+NPT+IRSW+DLLSRTPRVAPFLSKDDPFI+ALKKE
Sbjct: 421 SLARHIYRYEGKNIQVFADDSSLAVNPTYIRSWVDLLSRTPRVAPFLSKDDPFISALKKE 480

Query: 568 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 627
           LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI
Sbjct: 481 LEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVI 540

Query: 628 TTRGL 633
           TTRGL
Sbjct: 541 TTRGL 545

BLAST of CSPI01G14970 vs. NCBI nr
Match: XP_038894420.1 (nicalin-1 isoform X2 [Benincasa hispida])

HSP 1 Score: 1048.1 bits (2709), Expect = 3.1e-302
Identity = 527/564 (93.44%), Postives = 546/564 (96.81%), Query Frame = 0

Query: 88  MAPRKPREPQVFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 147
           MAPRKPREPQV DSFYP+LALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL
Sbjct: 1   MAPRKPREPQVLDSFYPILALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATL 60

Query: 148 NHHASSLHFPTGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKN 207
           NHHASSLHFP+ ADLSR+VLIIPL ELN+TFLQECISQKKRLGGLLVLLP+I  S+  +N
Sbjct: 61  NHHASSLHFPSAADLSRSVLIIPLSELNITFLQECISQKKRLGGLLVLLPKIFDSDGPEN 120

Query: 208 DDIKCP-NGEGVIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT 267
           DDIK P NGEG+IK L VELERLL+HSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT
Sbjct: 121 DDIKSPHNGEGMIKNLLVELERLLIHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLAT 180

Query: 268 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDL 327
           ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+L
Sbjct: 181 ATTGGYKLVVSAAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPEL 240

Query: 328 SVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 387
           SVGSDSNGSG+VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
Sbjct: 241 SVGSDSNGSGVVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH 300

Query: 388 RLRERIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 447
           RLRE IDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI
Sbjct: 301 RLRESIDYAICLNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKI 360

Query: 448 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVA 507
           NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGL DNRLFL+ESKIA SIKLVA
Sbjct: 361 NISNPRVAWEHEQFSRLRVTAATLSELSAAPELLERTGGLADNRLFLNESKIANSIKLVA 420

Query: 508 ESLARHIYRYEGKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKK 567
           ES+A+HIYRYEGKNIQVFADDSSLA+NPT+IR WLDLLSRTPRVAPFLSKDDPFI+ALKK
Sbjct: 421 ESIAKHIYRYEGKNIQVFADDSSLAVNPTYIRLWLDLLSRTPRVAPFLSKDDPFISALKK 480

Query: 568 ELEVHTHDVSLQHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 627
           ELEVHTHDV LQHEVF+G+FTFY STAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV
Sbjct: 481 ELEVHTHDVGLQHEVFDGMFTFYDSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLV 540

Query: 628 ITTRGLDDLISLFRRPPSRKVKTA 651
           ITTRGLDDLI LFRRPPSRKVKTA
Sbjct: 541 ITTRGLDDLIGLFRRPPSRKVKTA 564

BLAST of CSPI01G14970 vs. TAIR 10
Match: AT3G44330.1 (INVOLVED IN: protein processing; LOCATED IN: mitochondrion, endoplasmic reticulum, plasma membrane, vacuole; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nicalin (InterPro:IPR016574), EF-Hand 1, calcium-binding site (InterPro:IPR018247), Nicastrin (InterPro:IPR008710); Has 245 Blast hits to 243 proteins in 99 species: Archae - 6; Bacteria - 10; Metazoa - 139; Fungi - 0; Plants - 46; Viruses - 0; Other Eukaryotes - 44 (source: NCBI BLink). )

HSP 1 Score: 813.5 bits (2100), Expect = 1.2e-235
Identity = 407/551 (73.87%), Postives = 474/551 (86.03%), Query Frame = 0

Query: 98  VFDSFYPVLALVFILVACVELCDAATVVDVYRLIQYDISGVPFGSRAATLNHHASSLHFP 157
           VF+S YP+LAL+ ILVACVELCDAATVVDVYRLIQYDISGVPFGSR ++LNHHA+SL F 
Sbjct: 15  VFESMYPILALMLILVACVELCDAATVVDVYRLIQYDISGVPFGSRFSSLNHHAASLSFQ 74

Query: 158 TGADLSRTVLIIPLCELNMTFLQECISQKKRLGGLLVLLPRILGSESLKNDDIKCPNGEG 217
            GADLSR+VLI+PL EL++ F+Q+ ISQK+ LGGLL+LLP+     ++    +   N +G
Sbjct: 75  RGADLSRSVLILPLRELDIAFVQDYISQKQSLGGLLILLPQTFRPGNVGGGSLSSEN-DG 134

Query: 218 VIKGLSVELERLLVHSTIPYPVYFASEGEDIDAVLADVKNNDATGQLATATTGGYKLVVS 277
             + L  +LE+LLVH  IP+PVYFA E E+ DA+LADVK NDA GQ ATATTGGYKLV+S
Sbjct: 135 -FRSLLGQLEKLLVHGNIPFPVYFAFENEETDAMLADVKKNDALGQQATATTGGYKLVIS 194

Query: 278 AAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPDLSVGSDSNGSGI 337
            +EPRK+ S TITNIQGWLPGL+++GD+SQLPTIA+VASYDTFGAAP LSVGSDSNGSG+
Sbjct: 195 VSEPRKIASPTITNIQGWLPGLRAEGDSSQLPTIAVVASYDTFGAAPALSVGSDSNGSGV 254

Query: 338 VALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRLRERIDYAIC 397
           VALLE+ARLFS+LYSNPKTRG+YNLLF LTSGGPYNY GT KWL+S D R+RE IDYAIC
Sbjct: 255 VALLEVARLFSVLYSNPKTRGKYNLLFALTSGGPYNYEGTQKWLKSLDQRMRESIDYAIC 314

Query: 398 LNSIGSWDDKLWLHVSKPPENAYIKQIFEDFSNVAEDLGFKVDLKHKKINISNPRVAWEH 457
           LNS+GSWD +L +HVSKPP+NAYIKQIFE FSNVAEDLGF+V LKHKKINISN RVAWEH
Sbjct: 315 LNSVGSWDSELLIHVSKPPDNAYIKQIFEGFSNVAEDLGFQVALKHKKINISNSRVAWEH 374

Query: 458 EQFSRLRVTAATLSELSAAPELLERTGGLGDNRLFLDESKIAKSIKLVAESLARHIYRYE 517
           EQFSRLRVTAATLSELS  PELLE  G L D R  ++E  I K +KLVAESLA+HIY ++
Sbjct: 375 EQFSRLRVTAATLSELSTPPELLENAGSLSDTRQLVNEDAIIKGVKLVAESLAKHIYGHQ 434

Query: 518 GKNIQVFADDSSLAINPTFIRSWLDLLSRTPRVAPFLSKDDPFITALKKELEVHTHDVSL 577
           GK+I++FADDSSLA+NP ++RSWLDLLS+TPRVAPFLSK++P I ALKKELE +T +VS+
Sbjct: 435 GKDIKIFADDSSLAVNPFYVRSWLDLLSQTPRVAPFLSKNEPLIMALKKELEDYTAEVSV 494

Query: 578 QHEVFEGIFTFYGSTAAKLHVYQVASVTFDLLLLLVLGSYLVLLFCFLVITTRGLDDLIS 637
           QHE  +G FTFY ST A L++YQVASVTFDLLLLLVLGSYL++LF FLVITTRGLDDLIS
Sbjct: 495 QHESLDGSFTFYDSTKASLNIYQVASVTFDLLLLLVLGSYLIVLFSFLVITTRGLDDLIS 554

Query: 638 LFRRPPSRKVK 649
           LFRRPPSRKVK
Sbjct: 555 LFRRPPSRKVK 563

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6NZ071.4e-6832.94Nicalin-1 OS=Danio rerio OX=7955 GN=ncl1 PE=2 SV=1[more]
Q8VCM89.4e-6833.45Nicalin OS=Mus musculus OX=10090 GN=Ncln PE=1 SV=2[more]
Q5XIA19.4e-6833.81Nicalin OS=Rattus norvegicus OX=10116 GN=Ncln PE=2 SV=1[more]
Q969V35.7e-6532.01Nicalin OS=Homo sapiens OX=9606 GN=NCLN PE=1 SV=2[more]
Q5ZJH23.7e-5630.35Nicalin OS=Gallus gallus OX=9031 GN=NCLN PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LYB70.0e+0099.85Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G167840 PE=4 SV=1[more]
A0A1S3CJR50.0e+0097.16nicalin-1 OS=Cucumis melo OX=3656 GN=LOC103501746 PE=4 SV=1[more]
A0A5A7T1R91.2e-30497.25Nicalin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G002170 PE=... [more]
A0A6J1IC523.0e-29590.12nicalin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111471642 PE=4 SV=1[more]
A0A6J1CME51.5e-29491.31Nicalin OS=Momordica charantia OX=3673 GN=LOC111012612 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
KGN64956.10.0e+0099.85hypothetical protein Csa_022770 [Cucumis sativus][more]
XP_004139498.10.0e+00100.00nicalin-1 [Cucumis sativus][more]
XP_008463652.10.0e+0097.16PREDICTED: nicalin-1 [Cucumis melo][more]
KAA0035547.12.5e-30497.25nicalin-1 [Cucumis melo var. makuwa] >TYK30987.1 nicalin-1 [Cucumis melo var. ma... [more]
XP_038894420.13.1e-30293.44nicalin-1 isoform X2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
AT3G44330.11.2e-23573.87INVOLVED IN: protein processing; LOCATED IN: mitochondrion, endoplasmic reticulu... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF05450Nicastrincoord: 311..447
e-value: 2.6E-8
score: 33.6
NoneNo IPR availableGENE3D3.40.630.10Zn peptidasescoord: 233..499
e-value: 4.7E-22
score: 80.9
NoneNo IPR availablePANTHERPTHR31826:SF7NICALINcoord: 97..649
NoneNo IPR availableCDDcd03882M28_nicalin_likecoord: 225..514
e-value: 1.0957E-113
score: 340.884
NoneNo IPR availableSUPERFAMILY53187Zn-dependent exopeptidasescoord: 286..474
IPR016574NicalinPANTHERPTHR31826NICALINcoord: 97..649
IPR018247EF-Hand 1, calcium-binding sitePROSITEPS00018EF_HAND_1coord: 331..343

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G14970.1CSPI01G14970.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009966 regulation of signal transduction
cellular_component GO:0005789 endoplasmic reticulum membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane