Cp4.1LG00g01670 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG00g01670
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein
Location: Cp4.1LG00 : 3950742 .. 3961077 (+)
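
The location field can be turned into a sequence ID, coordinates, strand, and span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names introduced only for this illustration.

```python
import re

# Pattern for a location such as "Cp4.1LG00 : 3950742 .. 3961077 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG00 : 3950742 .. 3961077 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 10336 bases on the plus strand of Cp4.1LG00.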
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGGTAAAGGACTGCGACCACCACCACCACCACGACTGCGAGCGCCGCCGTCTCTACCGCCGAATAGCCTGCGCCATCTTCACTGTCCTCCTCTTGATTAGCCTTGTCATCTTCCTCATCTGGGCTATCCTCCGGCCGTCCAAGCCCCGCCTCATTCTCCAAGACGTCACCGTTTTCGGCCTGAACGTCTCGTCGGTGCCACCGGCTGCCATCTCGATCACCATGCAGGTCACCATCTCCTCCCACAACCCCAACTCCCGCATCGGTGTTTACTACCAAACCATGGATGTTTACGCCGCCTACCGCGGCCAACAGGTCACTCTCCCGACGCTCTTGCCGCCGACTTACCAAGGCCACAATGATGTCACCGTTTGGTCGCCGTTTTTGTACGGCGAAGCTGTGCCGGTGGCGCCCGAGTTTGCTGAAGCTTTGAATGAGGATAATAACGTTGGAGCCATGCTGTTCAATATCAAGATCAATGGACAGGTTTTGTTCGTATAATATATTTTATATAGCTGTAATCAGTATTTTTTTTTTTTTTTTTCTTAACATTAATATATTTTGTCTCGACGAAACAGGTTAGGTGGAAGGTTGGGAGCTGGATCTCAGGCAGGTATCGGTTGAACGCGAATTGTCCGGCGTATATAAAGTTCGGCGATCCAAAGAATGGGATTGCATTCGGACCGGCGATGAAGTTTCGGTTTGTCCAAGGTTGTTATGTCGATATTTAAGGTCGCAAGAGTCCACTTCTTGTATTTTCTTTTCCACGACCCTTATAAGTCTTTGTGATCAACGAGATTTATTTGAATTATAGTTTTATTCTTTAAATCTGACTCCTTTTTATATGTATACACACATAATAGATTGAGGGAACTTTTTGGACCAACCTAAAAGTTCGGGTTGGTTGGATTAGTAGTCTAACCTAACTCGAAATTGTTTTGCAACTCAACCTAATTCTCTATTTTCAGATGGTTGAAAAAAATATATATTTTTTTTAAAAAGTTATCCACCATACATTATCATTCTTGGTTAAAAGTTCATAAACAAACACAAATTCATCCATAATTATTCAAATAAACAATTTTTTTATTTTTTATAATATAGTCCGAGAATTGTAATATTATAACATATTCTAAGAAAATATAAGAATAGATTCTAAAATTAAAAAAAAAATAAAAATAAAAACAAATATAAATATATTTTTATTTCTCGTGAAAAGATTTATTTGTGTACGACAGAGCTGGAGGGTGAGAATATTTACTAAAACCACCTCACTAAATAGTGCGTGAAAGTTCTCTATTTATAATTCATATGGGCTTATGAAAGGAAGCCTTAGAAGCCCAATTCGTCGTATTCCATTGGGCCCGGCCCAAATCGTTTTCATTTAAAATAAAAAGAAAAAGAAAAAAACTCCTCCCTCGACCCTCAGTTTCACGGCGGCTCCCCCTCCTTCGGATCAGGCACCGCCGTTCTGCTCCCTTTTTCATCTCTGGATCTCAGGTATTCTTCTTAATTCCCATCTCTGTCTTTCAATCTTTCTTGTTTTGCTATATTTTGAATCCAATCTTCTGGTTTCATTGCAACCGATTTTGCTCCTCGATGGGTTCATCCGATTTCTCACTCGAATATCAGCTTCAATTTACTTCTGGCCTGTTTGGAATTGCCAAAATCACCTTTGCAATTGCGCAGAGGATGAAATTGATTTTTGAAGTGTTAGTTGGTAGATGATTTTTGATAGGGTGACTGACCTGAAATCAGCTCCAAAATTTACCTGGATTTTATTCAGCTTGCCCATAAATATAAGAACACGATTCTCAGCCCTGAACAATTTCTTTGCTTTCGATTTTTTCCGCTTTCTCTCACCATTGTAGCATCTTCAGAGGACTCTTTCATCTCCGGCCGCTTTCTCTCTCGTTCTGCTGATCATTTCGTTGGTTTTCGCTGGTAGTTGTTAGCTTCATCCTTGTTTGAGTTTTGGACGTGTGGTTTCCACATTG
ATTGTGAACTGTTTGCATTATTCTTAATTTTGAACTAGGTTTTCAATTTGTTTATGTAATTTTGACTGGTGGTTTTGATTTGTGTTTCGTTTTGAACTAGGTTTTTGATTTGTTCCGTAATTGCTTCATTGATTAGGTTGCCAATGGTGTGTTCTTTGCTTTTCTTACTTATGATTGAGTGGTGTATTTCTGGAATTCGACGGCAGTAAGTTTTGTTTCTTTGGTGTGATGATGTGTTTTTGTTTGTGCTTTTCGTGTGGTATTGCATTGTTGGAGTTTGCCTTCATTTGTGAAGCTTTTCTGTTTTCGATTTTGATCGTACTGAAGGGGCTTTTTTTTGTTAATTGTGTATATGAAAATCCTTGCTGGAGCATTGTTATTTGGAAGCTTTTGGCTACCATTATATTTCTAGGGAGCTTTTTTGTTAGTTGGAAGCTTTTGGCTACCATTATATTTCTAGGGGGCTTTTCTTTGGTTGTCATTGTTTTGATTGTGATAAGATCAGTAATCTTCTACCATTTTGTTTCCTTCTCATCCTCCTCTGATCAATAATCTTACTTCTCTTTCTACTTTTGCATCTGTTTTTTTGTGAATTGTTCATTTAGCTATATATATATATATATTATCATAATTCTGGTGTGAATAGTTAATTTAAGTTTAGCAAATATTATATGTAAATCACTTCAAACCAAACATTCAAATGGCTATATGAAAATCACTTTACAAAACCAATTGTAATTAGATCTCTTTTTATAAAATCACTTATACTACAATGAGTCTCAAACATGCCCTTCATGATTAAACCCATTTGAGTTTGAGCATGCTTGTTCTATCCCGTTTGGTTTTGTGTTCAACTTGATTCGGATTTTACACTTCCTTCAGTTCTCATTTCTTTCCATCTTATTGGTTGTTTTCCGTTTCACATGTTCTTCATTTTTGCGTGCAATCCTTTTCTGTTCTTAATCTCTTCGATTTGATTATTCTTTTTCATATCTTTTTGCACTTGTCCCATCGCCCTGGATTTCAATTTCATTTTCTCTGTGCTCTTCAATTTCGATTTTGCAAGTCCAGTCATCATCTCCACCAAGTGTTTTTCTAATTCCACCTGTGTTCTTCTAGTCCCTGAACCCATATCAAACTTCTCTGGCTAACTTGATTGGAACAAAGGATTGAAATAGAAATTGTACTGAAAACTTGATTAACAAAAGAAAAAAGAAAAAAGGGAAAATATTCCAAAGCAACTTAGAGGGACCTCAACCTGAGCAAGTTGAAGCTAAGAGAGAAAAATACACTTTTGATATCTCAGGTTTGAGTTTGGTGCTAAGTTCGGTGCATTTAGGTACCTGAGCTTTCAAACTCAACAATTTAGGCCATGAGTTTTGGATTTAGTTGCATTTTAGTTCTTCCGTCCAAATTTTAATTTTCTTTTTTCTTCTCTCCCTTCTCTTCTCTTTTCTTTCTCCCTCCCCCTCACATCCTCCTCCTTCTTCCATGGCTACCGTAGCCGTGCACCATCATCACAACTAGTTGCCTCTTTCGACAGACCATAATGACAATCCGATAGCCTCTGATTGGTGGTGACAAACATAGCTTGCACACCCATGGCTTCACGACCCACTGATGATAACAATTCCTTAAATTTGTGACTCTAACAACTCCTCCCAATTGACGGTGACTTGACAATACTCACGAATTGATCTGGTAAAGCCGCGGAAACCTGTGAGCTAATCCAGATGTCATATCTGGAACAAACCCAAATCTGGTGCAAGGAGGTGATAGTTGATGGCTTTGTTGTGAGGGGGCAATCACTGACAATGCTAGTGCGAAGAGGCATTGTTTGTCAGGCTTTGCTGCTGCAAAGATTATTCCTTGACAGAGCTGTTGCGAGGACGTACCCTCAACAACGAATGTGCAACTATCACAAGAGAAGAGAGACGAAGATAAAGAGGGGGAGTGTAACAAGGCTAATACACCAGATCCCATCAGAACTCACTCCCAGA
GTGTAACAAGGCTAATACACCAGATCCCATCAGAACTCCCATCAGAACTCTGTTGTTAAGCGTGGGAAGTCCTTGTGTTGGACCCCTTTAAAATGTAATGGACTGATTTGGAAAATATGATAATAATTTCCGCTAGAAATCTTGCGAGATTTTCGACAAGGGTTTAACGCAAGTGGAAATGTTTTCTTAATCCCTTAAAAGCCCTTTTTCATCCATCTATCCCATTACCTTTGTTTTGCTGTTTACGCATGCACCGAGAGCCAAAAAGAGTGAAAGAAACGAGAAGTCGAGAGAGAGATTGAAGGATTTGGAGAAGGGAAATCGAGAAGCTTCGTTTGTGAGATTTTTGGGGTTTAGGGAAGAGGTTTACCGTGGAGTTGTTGTACGAGTGAGAGGCGGACGAACGGTGGCAGAAACGTGCGGTTATGGCGTCAGAATTGTTGGCATAAACGTGCAGTTACGACGTTGGAGTCGTTAGCATCTCCGTCGAAGAAAGGAAAGAGGATATCTAGGGGTTTTCAGCATATTCGGGTGAGGAATAAAGAACATCGGGTCGGCATTCAGATCCGGATATAGCCCACGATCCAAATATAATAACTAACCCAATTGTCACAACCCAACTGTAGCCCACGGTAGCCCATCAGCACATACAACCCTGTCTAACCCCAACACAAACTGCACATACAACCCTGTCTAACCCCAACACAAACTGCCGTGCTATTCTGTAGTCAATGTCCACCAACTCCAAGGCGAAATGATTCAACTTGGAAAACTCTCCGGCGTATGTGGTAACAAAACGCCCCCTCTGACACAACTGGGTGAACTCTTGCTGCTTGCGAAGTTGAACGTCCTCTCGGTATAGTACTCTTCAATAAAAGCACCCTTGAATTTCGCCCACGAGATTGCACCTCCCTCAGGACTAATAATTTCGCGGGTGGACTGCCACCATAGTTCTGCATCATCTTGCAGCATGAATGCTGCACACTCCACCTTCTGATCTTTAGGATAGTTGGTCAACACAAACATAGTCTCTATAGATCTAAGCCACATTTGTGCTACAGTCGGATCATTAGAAGTTCCCTTAAAGGTTCATGGGTCACCCCTCTTGAAATCACGCAAGTACCTTACTTCCTTGGTCGATGATGAGTCATTTGCTTGTTGATTAGCAGTCAGGTTTTGTATAACAGTTTGCAATGCGTCCACCAGAGCAGCTGTACTATCATAATGCATTACTATCATAACGCAATGCATTCACCACAGTTTGCACAGCACATTACATGTCACTGGACACCACAAAAAAGCAAAGGCCCACATGATTTCATTGCAAACATAAATATCAAACGTGCGTCCGTACTATCATAATATAAAGGGAAAGTATCTCGATGCACATGATATACAAACATGCATGAGGTCCCATGATTTACATTTCATGACATAATACTTAATACCTTTCATTACTCTTAGCATTTCATATTATAACTCATTTATTTCATATTATAACTCATTTATGGAACATTTATGCATGAAAACATTTTTCAATTCATTTCATGAAAATAGGGTAATGGATTACAGGCATACGTAGCAAATCTCTGATCAATACAAGAATACATAAATATTGCATAACTTAGACATTTCTATGGTACATAGGGCATACACCATCATACAATATACTAAGCTAACACCTACAGTGAGGTTTGGTTGACCTTGATCCAAGTTACTCACAGTAAGTTAAGTGCTAGAATTTATTTATTTTTGGAAAACCGAATCGTTACAATTAGGCTGATTATTCACCTTTTTTTTTCTTTCCATTTATCTTGTAAACCAAGTCATGAAGGCTACTTTACCGTTTGTTCATTGTTGGCCAATATTCTGACAATTTATTTTCCTCATAGCTGCTTTACATTTTCTGCCCATATTTAGCAGCTGACGAGGATTCTAGCCAATGTTTAAGATTTAGGCCATTGTGTTCATACAACATTCAGGACCTTGAAAAGTATTTT
TATTTTGTTTAATCTTTCCTAGTAGAGAATTGGTGTTTTATGCATATGGAAGAATTTTGTATGTTGTATGTTTTATGCATCAATGAGGTACATTCAAGTCTTCACAGGAACAGTAGAAGACATTAAAAATGAGCAAGTACGGCCTACAACTTAGAGTTAAGCCATCACAGCAGAAACAGCCTACAAGACCGCCCCTTCCTGCTCCTCTTGGATTTCAAGACGACGACGACGACGATGTCGAGAGAGAGATATCCCGACAAGCTTCCAAGAATAAGGCACTTAAGGATGTAAGCTAGCTTATATATGTATGACTGTGTCATTAACACGAATATCATGGTCGTATGCTACCCTTAATTTTATTTTTCTCCCTTGAAATATCAATTTAGATCGAGGAGCAACACAAGAAAGCACTAGCGGAAGATCCGTCGGTTTTCGATTACGATGGAGTCTACGATGAAATGAAAGAAAAGGCTGTTCTACCTCGAGCATATGATCGTGAAGAGCGAAAGGTATGTTTATAATTTGATCCCAAACTTGTGTAAAAATACAGTATCACCCTAATGTAATGCTCTGCCAGTGTGGTGAAATTACTTATTTGCTCACTGCCTTTTCATATTGCAGCCAAAGTATATTCAAAATTTGATGAAGAAGGCAAAAGAGCGAGAGAGAGAACAGGAGGTTATATATGAAAGAAAACTAGCCAAGGAGAGAAGCAAAGATGATCATCTTTATGCTGGTAAGGACAAATTCGTCACTAGTGCCTACAAAAAGAAACTTGCAGAACAAGCTAAGTGGATGGAGGAAGAACGTCTCCGACAACTTCGAGAGGAGAAAGACGATGTATGTTTTAACTTTGCACTCTGTTTCTTTTTCTTTTCGTTTTATAGTTTTATAAGTTATTATACTTTGCTAAATGCCCTCCGTGACAAAATGAGCGAGTATAAGTTATTTGTGACTGTACGAGGAACATTGAACCTAACAAGGGTCCTAATATGCGACCACTAATAAGGGTCCTAATATAATCACAAAATGATTTAGGACAAAATGATTTAGGACCATGTTAGGAACATTGAAACATTCTTTGAAACATTCTTTATAATAGTGCGGAAATTTCTCTCTAGCAGACGAGTTTTAAAAATCTTGAGGGGAAGTCCGGAAGGGAAAGTCTAAAGAGGGCAATATCTGTTAGCGGTGGGCGAGGATCGTTACAGCAATCCATTATCTTGTTCTTGGGTTATCTTAAGAATTGCTTTATGCTTTTGTTAATGTGTGTTTTTCATTGACAGCCAATTATCTGATTGTTCTTTTTGTTCTAATTTGAAGGTTACCAAGAAGAGCGACATTAGCGACTTTTACTTCAACCTTCAAAAGAATATTGCCTACGGTGCCGGTAATGCAGTCGAGCCGAGGGAACCTCCACAGAAGCTTCAGTCAGAGAAGCAGGCTGAAGTCCAGAAATTGAAGAAGCATGAAGAGAGTGTTAATGCCGATATTGTTAATGATCATCAGTCACCGAAATACCCCAGCTCGCCTTCCCAGTCAGCAAAGACAATAGAACAGGATCGTCACCTCGAAGAGCACCTTTCGAATTCAAACAAAACAACACCTTCTGATGTACGAGGTACTAGCCCCCCTCCATTACACAATAAAACTCCAGTTGAGAAACAGCCCGAACATGACCAAGGTGAACAACAACCAAAGCGCGATCATCATAAAAGAAGTGAAGATGCTGTCGCTGCTGCAAAAGAGCGATTTTTGGCTCGAAAAAGAACAAAGGAGGTGTGAAATACTTGACCTGTATGGATTAACCTGAGGACAAATATGTAAATGTTATTGTTCTATGGATGATAGTTGATAGTTATGTTCTTAATGGTCAAATTCATCTTGGATCTCTGGCTTTGACATCACATTTTTGTTTCGTGGTTCATGTGAGAATCCTTGTGTTGAGATAACAAGGAGAGCAACGGATGAAGCCTGATGACGCAGCTGCTCGAAGTTGC
AGCATCTTCGTCGAAAGTAAGTAAACCAAGTTCTCATTTGTTAAGTCATATAGGCATATTTCTTTCTCCACTGTTGTTTTCGGAAGAAACTTTTGGATAGGTATGCCACGTCTGAACGGAATTCTTTTTGGTTAAAGAATCTCTTTCAAGTTGCTCTGGAACCATTAAGAGATCTAGAAGGTCGTTTAGGCCTTGAATTTTGGACTTTCCATAAGTAGTCTGACCTTGCCGAAAGTAGGTCATAGGAGCAAATGAGTGACCTATTTTTGAGTCTCGAAGCAATATTTTCCTTAAATGCATATCCAGTTTATTCCAAACATCTCATTTTTCTCCTGAACCAAATCCTCATTAACCTCGATTTCTTATCTTTTCTTTTCCGAACCTATTCTATTGGCTGACATCCCTTATCTCTTAAAAGCAACAAGAAAGGAGTAGGATGCATGAACCTCCTAACATGTTTTATTGTTGCATTAAATGATGGGCGGTTGGTTGCGTGCAGAGGGAATGTCATTTTGAAACATGTGATTTAGATGTCAAGACATTGGACATCAATTCTTGTCTAGCACGCCAAATGACATAATAATTTATAATAAAGTAGGGAGCAAATCATTTAAAGCAAATCTTTTAATCTTATCATCAGCATATGGAATGATTCTGTTCTTTGTTAGAACCCGAAAACTGAACGAAGCTCATCAACGAACTCATACCATAAACACTCACTCAATTGGTTATTAACATACCTCACCAACTTTGCTTTGATGAGGAATGCTTACACGCATCTTTGTGGCCATAATATGCTCCTCTTGTTCAATAATACCACGTCTCTTATCATGGTCCAAGCTAGAGGTTGATGTTCATATCTATACAAGCCTAGTACCAAACAAGAAAAATTAAATGTATAAATTGATAAGACAATGAAAATTAATCGAGTTTGTTGATGAAAGAAACCATTTTCTGGTTCTGGAGAGTGAGAAAGAGTTGTGTACAACTTTTAAAAATTCAAACTCAATTAATTTCTTATAATTAATGATATAATTTTTTCTAAAAAATGAATTTGATACGTTTCTAGTAGTTTGTTTTTTTCTTTCCGAACATTTTGAATATTTTTTTGTTGTTGCTGTGTTTTATAGTCCTCGAACTTTTAATTATGTAATGGAATTAGTCTTGCACTGGATTCATCGGGCTCGGAGTCCAATGAATCAATAGTTAACCATTTTTTTAGGGATTGCTATCGCAAAGCTAGTCGATGTCTGCTGTAATGGTTGAAAATATCCAAACCTGCTCACCCACCCCACCTCATACGACTAGTTTGGGGGGAGGGGTTGGCTGTTCGCCTTTTGGAGACACCTAAAATCCTCACATTTGAAGGACGTTCTTTTATGAAAATTTCAATTTTTTTAAAATTAATTTGAATTTTACACTATGTCCAATAAATACGTCATTTTTCAGATATTTTAACATTTTGAATATTTTAAATTTTTTTAAAAAGATGCGAAGAACACCATATTTAAATAAGATGAACCCGGATATTGATTTGTGGTATGATTGGTGGTAGGTCACTGAACTTGATATGGTTGACAAGTCATATCATAAACTTGTGAATCAGAAGTCATATTGTATCCATATCAAAGGAGTTAATGAGTTGATTTTCAAAACCAAATTGGATATTCCTACCGGCAAAACTCAAAGGTCAATGTTCAAACTTTGTATTAAATGATAATTTAGACATTGTGAATTGTATACTCCATGTTTGGCCGTCTTTCCTCTCTTATAAATCATATACAGCTTCCAATTGTTGATACAAAGATTTGATCTTTCAATTCTTCCTTCTTCTTCGTTTCTCCGTTTTTCCATTGAAGAACTCTAAATTATGGGTCGAAATTTCAACCATCCAAAGTCAGAACCGATACTGCAAGATTTGTTTTTGGCAGGATCTAGCTCGAAAACTGAGGACGCTACGACATCGGTCATGGTGCTACTGGGATGTACTCAATGCCTTA
TGTACGTCATGTCGTCGAAGTTCGATCTCAAGTGTCCCAAATGTAATAACACTACGTTGCTCGATGTTCACAATTTGGCCAATCAAGCATCCAAGAAAAATCTTGTTGACATGTCAATCGACGTTGCTAAATGAGGAGTTTATCGACATGCCCATTGTAGTTACCTTCTTTACCTTTATGTATACTCTTTTACTTTCGTTTCTTAATTTAGCTTCCTCGTTCGAGCACTTTTTTTTCAATTCATTCTATGTCATGGTTTAAGGAACTATTTGATATTACTAGATATCTTCAAATCATGTCATATCATGTCAATTGATAATAATGAAAGATATATAT

mRNA sequence

ATGACGGACTGCGACCACCACCACCACCACGACTGCGAGCGCCGCCGTCTCTACCGCCGAATAGCCTGCGCCATCTTCACTGTCCTCCTCTTGATTAGCCTTGTCATCTTCCTCATCTGGGCTATCCTCCGGCCGTCCAAGCCCCGCCTCATTCTCCAAGACGTCACCGTTTTCGGCCTGAACGTCTCGTCGGTGCCACCGGCTGCCATCTCGATCACCATGCAGGTCACCATCTCCTCCCACAACCCCAACTCCCGCATCGGTGTTTACTACCAAACCATGGATGTTTACGCCGCCTACCGCGGCCAACAGGTCACTCTCCCGACGCTCTTGCCGCCGACTTACCAAGGCCACAATGATGTCACCGTTTGGTCGCCGTTTTTGTACGGCGAAGCTGTGCCGGTGGCGCCCGAGTTTGCTGAAGCTTTGAATGAGGATAATAACGTTGGAGCCATGCTGTTCAATATCAAGATCAATGGACAGGTTAGGTGGAAGGTTGGGAGCTGGATCTCAGGCAGGTATCGGTTGAACGCGAATTGTCCGGCGTATATAAAGTTCGGCGATCCAAAGAATGGGATTGCATTCGGACCGGCGATGAAGTTTCGAAAAAGAAAAAAACTCCTCCCTCGACCCTCAGTTTCACGGCGGCTCCCCCTCCTTCGGATCAGGCACCGCCGTTCTGCTCCCTTTTTCATCTCTGGATCTCAGAAGACATTAAAAATGAGCAAGTACGGCCTACAACTTAGAGTTAAGCCATCACAGCAGAAACAGCCTACAAGACCGCCCCTTCCTGCTCCTCTTGGATTTCAAGACGACGACGACGACGATGTCGAGAGAGAGATATCCCGACAAGCTTCCAAGAATAAGGCACTTAAGGATATCGAGGAGCAACACAAGAAAGCACTAGCGGAAGATCCGTCGGTTTTCGATTACGATGGAGTCTACGATGAAATGAAAGAAAAGGCTGTTCTACCTCGAGCATATGATCGTGAAGAGCGAAAGCCAAAGTATATTCAAAATTTGATGAAGAAGGCAAAAGAGCGAGAGAGAGAACAGGAGGTTATATATGAAAGAAAACTAGCCAAGGAGAGAAGCAAAGATGATCATCTTTATGCTGGTAAGGACAAATTCGTCACTAGTGCCTACAAAAAGAAACTTGCAGAACAAGCTAAGTGGATGGAGGAAGAACGTCTCCGACAACTTCGAGAGGAGAAAGACGATGTTACCAAGAAGAGCGACATTAGCGACTTTTACTTCAACCTTCAAAAGAATATTGCCTACGGTGCCGGTAATGCAGTCGAGCCGAGGGAACCTCCACAGAAGCTTCAGTCAGAGAAGCAGGCTGAAGTCCAGAAATTGAAGAAGCATGAAGAGAGTGTTAATGCCGATATTGTTAATGATCATCAGTCACCGAAATACCCCAGCTCGCCTTCCCAGTCAGCAAAGACAATAGAACAGGATCGTCACCTCGAAGAGCACCTTTCGAATTCAAACAAAACAACACCTTCTGATGTACGAGGTACTAGCCCCCCTCCATTACACAATAAAACTCCAGTTGAGAAACAGCCCGAACATGACCAAGGTGAACAACAACCAAAGCGCGATCATCATAAAAGAAGTGAAGATGCTGTCGCTGCTGCAAAAGAGCGATTTTTGGCTCGAAAAAGAACAAAGGAGGTCACTGAACTTGATATGGTTGACAAGTCATATCATAAACTTGTGAATCAGAAGTCATATTGTATCCATATCAAAGGAGTTAATGAGTTGATTTTCAAAACCAAATTGGATATTCCTACCGGCAAAACTCAAAGGTCAATGTTCAAACTTTGTATTAAATGATAATTTAGACATTGTGAATTGTATACTCCATGTTTGGCCGTCTTTCCTCTCTTATAAATCATATACAGCTTCCAATTGTTGATACAAAGATTTGATCTTTCAATTCTTCCTTCTTCTTCGTTTCTCCGTTTTTCCATTGAAGAACTCTAAATTATGGGTCG
AAATTTCAACCATCCAAAGTCAGAACCGATACTGCAAGATTTGTTTTTGGCAGGATCTAGCTCGAAAACTGAGGACGCTACGACATCGGTCATGGTGCTACTGGGATGTACTCAATGCCTTATGTACGTCATGTCGTCGAAGTTCGATCTCAAGTGTCCCAAATGTAATAACACTACGTTGCTCGATGTTCACAATTTGGCCAATCAAGCATCCAAGAAAAATCTTGTTGACATGTCAATCGACGTTGCTAAATGAGGAGTTTATCGACATGCCCATTGTAGTTACCTTCTTTACCTTTATGTATACTCTTTTACTTTCGTTTCTTAATTTAGCTTCCTCGTTCGAGCACTTTTTTTTCAATTCATTCTATGTCATGGTTTAAGGAACTATTTGATATTACTAGATATCTTCAAATCATGTCATATCATGTCAATTGATAATAATGAAAGATATATAT

Coding sequence (CDS)

ATGACGGACTGCGACCACCACCACCACCACGACTGCGAGCGCCGCCGTCTCTACCGCCGAATAGCCTGCGCCATCTTCACTGTCCTCCTCTTGATTAGCCTTGTCATCTTCCTCATCTGGGCTATCCTCCGGCCGTCCAAGCCCCGCCTCATTCTCCAAGACGTCACCGTTTTCGGCCTGAACGTCTCGTCGGTGCCACCGGCTGCCATCTCGATCACCATGCAGGTCACCATCTCCTCCCACAACCCCAACTCCCGCATCGGTGTTTACTACCAAACCATGGATGTTTACGCCGCCTACCGCGGCCAACAGGTCACTCTCCCGACGCTCTTGCCGCCGACTTACCAAGGCCACAATGATGTCACCGTTTGGTCGCCGTTTTTGTACGGCGAAGCTGTGCCGGTGGCGCCCGAGTTTGCTGAAGCTTTGAATGAGGATAATAACGTTGGAGCCATGCTGTTCAATATCAAGATCAATGGACAGGTTAGGTGGAAGGTTGGGAGCTGGATCTCAGGCAGGTATCGGTTGAACGCGAATTGTCCGGCGTATATAAAGTTCGGCGATCCAAAGAATGGGATTGCATTCGGACCGGCGATGAAGTTTCGAAAAAGAAAAAAACTCCTCCCTCGACCCTCAGTTTCACGGCGGCTCCCCCTCCTTCGGATCAGGCACCGCCGTTCTGCTCCCTTTTTCATCTCTGGATCTCAGAAGACATTAAAAATGAGCAAGTACGGCCTACAACTTAGAGTTAAGCCATCACAGCAGAAACAGCCTACAAGACCGCCCCTTCCTGCTCCTCTTGGATTTCAAGACGACGACGACGACGATGTCGAGAGAGAGATATCCCGACAAGCTTCCAAGAATAAGGCACTTAAGGATATCGAGGAGCAACACAAGAAAGCACTAGCGGAAGATCCGTCGGTTTTCGATTACGATGGAGTCTACGATGAAATGAAAGAAAAGGCTGTTCTACCTCGAGCATATGATCGTGAAGAGCGAAAGCCAAAGTATATTCAAAATTTGATGAAGAAGGCAAAAGAGCGAGAGAGAGAACAGGAGGTTATATATGAAAGAAAACTAGCCAAGGAGAGAAGCAAAGATGATCATCTTTATGCTGGTAAGGACAAATTCGTCACTAGTGCCTACAAAAAGAAACTTGCAGAACAAGCTAAGTGGATGGAGGAAGAACGTCTCCGACAACTTCGAGAGGAGAAAGACGATGTTACCAAGAAGAGCGACATTAGCGACTTTTACTTCAACCTTCAAAAGAATATTGCCTACGGTGCCGGTAATGCAGTCGAGCCGAGGGAACCTCCACAGAAGCTTCAGTCAGAGAAGCAGGCTGAAGTCCAGAAATTGAAGAAGCATGAAGAGAGTGTTAATGCCGATATTGTTAATGATCATCAGTCACCGAAATACCCCAGCTCGCCTTCCCAGTCAGCAAAGACAATAGAACAGGATCGTCACCTCGAAGAGCACCTTTCGAATTCAAACAAAACAACACCTTCTGATGTACGAGGTACTAGCCCCCCTCCATTACACAATAAAACTCCAGTTGAGAAACAGCCCGAACATGACCAAGGTGAACAACAACCAAAGCGCGATCATCATAAAAGAAGTGAAGATGCTGTCGCTGCTGCAAAAGAGCGATTTTTGGCTCGAAAAAGAACAAAGGAGGTCACTGAACTTGATATGGTTGACAAGTCATATCATAAACTTGTGAATCAGAAGTCATATTGTATCCATATCAAAGGAGTTAATGAGTTGATTTTCAAAACCAAATTGGATATTCCTACCGGCAAAACTCAAAGGTCAATGTTCAAACTTTGTATTAAATGA

Protein sequence

MTDCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNVSSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVTVWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPAYIKFGDPKNGIAFGPAMKFRKRKKLLPRPSVSRRLPLLRIRHRRSAPFFISGSQKTLKMSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKKALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSPSQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHHKRSEDAVAAAKERFLARKRTKEVTELDMVDKSYHKLVNQKSYCIHIKGVNELIFKTKLDIPTGKTQRSMFKLCIK
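
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and only the first 60 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered TCAG x TCAG x TCAG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the coding sequence listed above.
cds_prefix = "ATGACGGACTGCGACCACCACCACCACCACGACTGCGAGCGCCGCCGTCTCTACCGCCGA"
print(translate(cds_prefix))
```

The translated prefix should agree with the first 20 residues of the protein sequence; running the same function over the full CDS is one way to confirm that the two listings are consistent.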
BLAST of Cp4.1LG00g01670 vs. Swiss-Prot
Match: NHL12_ARATH (NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 3.9e-44
Identity = 85/177 (48.02%), Positives = 119/177 (67.23%), Query Frame = 1

Query: 20  RIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNVSSVPPAAISITMQVTIS 79
           RI   I   ++++ + IFL+W IL+P+KPR ILQD TV+  N+S   P  ++   Q+TI+
Sbjct: 20  RICGVIIGFIIIVLITIFLVWIILQPTKPRFILQDATVYAFNLSQ--PNLLTSNFQITIA 79

Query: 80  SHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVTVWSPFLYGEAVPVAPEF 139
           S N NSRIG+YY  + VYA YR QQ+TL T +PPTYQGH +  VWSPF+YG +VP+AP  
Sbjct: 80  SRNRNSRIGIYYDRLHVYATYRNQQITLRTAIPPTYQGHKEDNVWSPFVYGNSVPIAPFN 139

Query: 140 AEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPAYIKFGDPKNGIAFG 197
           A AL ++ N G +   I+ +G+VRWKVG+ I+G+Y L+  C A+I   D   G+  G
Sbjct: 140 AVALGDEQNRGFVTLIIRADGRVRWKVGTLITGKYHLHVRCQAFINLADKAAGVHVG 194

BLAST of Cp4.1LG00g01670 vs. Swiss-Prot
Match: NSRP1_BOVIN (Nuclear speckle splicing regulatory protein 1 OS=Bos taurus GN=NSRP1 PE=2 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 1.1e-22
Identity = 88/228 (38.60%), Positives = 133/228 (58.33%), Query Frame = 1

Query: 243 KYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDD---VEREISRQASKNKALKDIEEQHK 302
           +YGL L  K +QQ  P     P+  G   DDDDD   V   + R+A+K +A+K  + + +
Sbjct: 7   QYGLILP-KKAQQLHPVLQK-PSVFGNDSDDDDDETSVSESLQREAAKKQAMKQTKLEIQ 66

Query: 303 KALAEDPSVFDYDGVYDEMKEKAVL--PRAYDREERKPKYIQNLMKKAKEREREQEVIYE 362
           KALAED +V++YD +YDEM++K     P+    ++RKPKYI NL+K  + R++EQE   E
Sbjct: 67  KALAEDSTVYEYDSIYDEMQKKKEESNPKLLLGKDRKPKYIHNLLKAVEIRKKEQEKRME 126

Query: 363 RKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDF 422
           +K+ +ER  +   +  K+ FVTSAYKKKL E+A+  E ER     E + DVTK+ D+S F
Sbjct: 127 KKIQREREMEKGEFDDKEAFVTSAYKKKLQERAEEEERERRAAALEARLDVTKQRDLSGF 186

Query: 423 YFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEV--QKLKKHEESVNAD 464
           Y +L          AV   E P     E ++E+  +K K + + V+++
Sbjct: 187 YRHL-------LNQAVGEEEVPTCSFREARSEIKEEKSKGYSDEVSSE 225

BLAST of Cp4.1LG00g01670 vs. Swiss-Prot
Match: NSRP1_MOUSE (Nuclear speckle splicing regulatory protein 1 OS=Mus musculus GN=Nsrp1 PE=1 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 1.6e-21
Identity = 103/331 (31.12%), Positives = 163/331 (49.24%), Query Frame = 1

Query: 243 KYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDD---VEREISRQASKNKALKDIEEQHK 302
           +YGL L  K     QP    L  P  F  D DDD   V   + R+A+K +A+K  + + +
Sbjct: 7   QYGLILPKKT----QPLHRVLQKPSVFGSDSDDDETSVSESLQREAAKKQAMKQTKLEIQ 66

Query: 303 KALAEDPSVFDYDGVYDEMKEKAVL--PRAYDREERKPKYIQNLMKKAKEREREQEVIYE 362
           KALAED +V++YD VYDEM++K     P+    ++RKPKYI NL+K  + R++EQE   E
Sbjct: 67  KALAEDSTVYEYDSVYDEMQKKKEENNPKLLPGKDRKPKYIHNLLKAVEIRKKEQEKRME 126

Query: 363 RKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDF 422
           +K+ +ER  ++  +  K+ FVTSAYKKKL E+A+  E E+     E   DVTK+ D+S F
Sbjct: 127 KKIQREREMENGEFDDKEAFVTSAYKKKLEERAEEEEREKRAAALEAHLDVTKQKDLSGF 186

Query: 423 YFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSP 482
           Y +L   +    G    P+   ++ ++  + E  KL+ + +  N++     QS       
Sbjct: 187 YRHL---LNQAVGEEAAPKSSFREARTVIKEE--KLRGYPDETNSESRPPQQSCVLQRGA 246

Query: 483 SQSAKTIEQDRHLEEHLSNSN-------KTTPSDVRGTSPPPLHNKTPVEKQ-------- 542
            ++ +  + DR  ++  S          K+   D   +   P H+K     +        
Sbjct: 247 QEAEENPDADREFDDESSEDGEKRDHKVKSRGEDTGASMKHPKHHKNRAHSRSSSEERGL 306

Query: 543 --PEHDQGEQQPKRDHHKR-SEDAVAAAKER 551
               H +G Q    +H  R S D  +  K+R
Sbjct: 307 GTKHHSRGSQSRGHEHQDRQSRDQESCHKDR 328

BLAST of Cp4.1LG00g01670 vs. Swiss-Prot
Match: NSRP1_RAT (Nuclear speckle splicing regulatory protein 1 OS=Rattus norvegicus GN=Nsrp1 PE=1 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 1.1e-19
Identity = 99/318 (31.13%), Positives = 159/318 (50.00%), Query Frame = 1

Query: 243 KYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDD---VEREISRQASKNKALKDIEEQHK 302
           +YGL L  K     QP    L  P  F +D DDD   V   + R+A+K +A++  + + +
Sbjct: 7   QYGLILPKKT----QPLNRVLQKPSVFGNDSDDDEASVSESLQREAAKKQAMRQTKLEIQ 66

Query: 303 KALAEDPSVFDYDGVYDEMKEKAVL--PRAYDREERKPKYIQNLMKKAKEREREQEVIYE 362
           KALAED +V++YD +YDEM++K     P+    ++RKPKYI NL+K  + R++EQE   E
Sbjct: 67  KALAEDSTVYEYDSIYDEMQKKKEENNPKLLMGKDRKPKYIHNLLKAVEIRKKEQEKRME 126

Query: 363 RKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDF 422
           +K+ +ER  +   +  K+ FVTSAYKKKL E+A+  E E+     E + DVTK+ D+S F
Sbjct: 127 KKIQREREMEKGEFDDKEAFVTSAYKKKLEERAEEEEREKRAAALEARLDVTKQKDLSGF 186

Query: 423 YFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSP 482
           Y +L   +    G    P+   ++ ++  + E  KL+ + +  N++           + P
Sbjct: 187 YRHL---LNQAVGEEAVPKSSFREARTVIKEE--KLRGYPDETNSE-----------NRP 246

Query: 483 SQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHH 542
            Q+          EE     N    SD   +          V+ + E D G       HH
Sbjct: 247 QQNCALQSGVEEAEE-----NPDADSDSEESCDDGERGDHKVKSRGEEDTGASTKYLKHH 299

Query: 543 KRSEDAVAAAKERFLARK 556
           K    + ++++E  L+ K
Sbjct: 307 KNHTHSRSSSEEGGLSTK 299

BLAST of Cp4.1LG00g01670 vs. Swiss-Prot
Match: NSRP1_HUMAN (Nuclear speckle splicing regulatory protein 1 OS=Homo sapiens GN=NSRP1 PE=1 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 2.1e-18
Identity = 103/323 (31.89%), Positives = 157/323 (48.61%), Query Frame = 1

Query: 243 KYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREIS----RQASKNKALKDIEEQH 302
           +YGL L  K  Q      P L  P  F +D DDD E  +S    R+A+K +A+K  + + 
Sbjct: 7   QYGLILPKKTQQ----LHPVLQKPSVFGNDSDDDDETSVSESLQREAAKKQAMKQTKLEI 66

Query: 303 KKALAEDPSVFDYDGVYDEMKEKAVL--PRAYDREERKPKYIQNLMKKAKEREREQEVIY 362
           +KALAED +V++YD +YDEM++K     P+    ++RKPKYI NL+K  + R++EQE   
Sbjct: 67  QKALAEDATVYEYDSIYDEMQKKKEENNPKLLLGKDRKPKYIHNLLKAVEIRKKEQEKRM 126

Query: 363 ERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISD 422
           E+K+ +ER  +   +  K+ FVTSAYKKKL E+A+  E E+     E   DVTK+ D+S 
Sbjct: 127 EKKIQREREMEKGEFDDKEAFVTSAYKKKLQERAEEEEREKRAAALEACLDVTKQKDLSG 186

Query: 423 FYFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVN---------- 482
           FY +L          AV   E P+    E ++ +++ K    S      N          
Sbjct: 187 FYRHL-------LNQAVGEEEVPKCSFREARSGIKEEKSRGFSNEVSSKNRIPQEKCILQ 246

Query: 483 -DHQSPKYPSSPSQSAKTIEQDRHLEEHLSNSNK----TTPSDVRGTSPPPLHNKTPVEK 542
            D +  + P + S        D  +EE   N  +     TP +         H+++P E+
Sbjct: 247 TDVKVEENPDADSDFDAKSSADDEIEETRVNCRREKVIETPENDFKHHRSQNHSRSPSEE 306

BLAST of Cp4.1LG00g01670 vs. TrEMBL
Match: A0A0A0LJE9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G070270 PE=4 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 5.6e-135
Identity = 266/323 (82.35%), Positives = 286/323 (88.54%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK 300
           MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK
Sbjct: 1   MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK 60

Query: 301 ALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERKL 360
           AL EDPSVFDYDGVYDEMKEK V PRAYDREERKPKYIQNLMKKA+EREREQE+IYERKL
Sbjct: 61  ALEEDPSVFDYDGVYDEMKEKVVQPRAYDREERKPKYIQNLMKKAQEREREQEIIYERKL 120

Query: 361 AKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYFN 420
           AKERSKDDHLYAGKDKFVT AYKKKLAEQAKWMEEERLRQLREEK+DVTKKSD+SDFYF+
Sbjct: 121 AKERSKDDHLYAGKDKFVTGAYKKKLAEQAKWMEEERLRQLREEKEDVTKKSDMSDFYFS 180

Query: 421 LQKNIAYGAGNAVEPREPPQKLQ---SEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSP 480
           LQKN+AYGA NA+EP  PP+KLQ    EKQ EV  L+KHEES N D  NDH   +  + P
Sbjct: 181 LQKNVAYGARNAIEPTAPPKKLQLEKKEKQTEVHILEKHEESCNGDTFNDHLL-QNSNLP 240

Query: 481 SQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHH 540
           S S KTIE+  HLEE L  SNKTTPS++ GTSPP LH+KTPV++QP+H+Q EQ PK DHH
Sbjct: 241 SCSEKTIEEHPHLEERLPFSNKTTPSNIEGTSPPTLHDKTPVQEQPKHEQSEQPPKSDHH 300

Query: 541 KRSEDAVAAAKERFLARKRTKEV 561
           KR+EDAVAAAKERFLARKR KEV
Sbjct: 301 KRNEDAVAAAKERFLARKRAKEV 322

BLAST of Cp4.1LG00g01670 vs. TrEMBL
Match: A0A0A0LVW5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G169420 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 6.5e-107
Identity = 189/200 (94.50%), Positives = 195/200 (97.50%), Query Frame = 1

Query: 3   DCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNV 62
           DCDHHHHHDCERRRLYRRIAC IFTV+LLI LVIFLIWAILRPSKPRLILQDVT+ GLNV
Sbjct: 5   DCDHHHHHDCERRRLYRRIACVIFTVVLLIGLVIFLIWAILRPSKPRLILQDVTLLGLNV 64

Query: 63  SSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 122
           SSVPPAAIS TMQ+TISSHNPN+RIGVYYQ MDVYAAYRGQQVTLPTLLPPTYQGHNDVT
Sbjct: 65  SSVPPAAISTTMQITISSHNPNNRIGVYYQVMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 124

Query: 123 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPA 182
           VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIK+NGQVRWKVGSWISGRYRLNANCPA
Sbjct: 125 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKVNGQVRWKVGSWISGRYRLNANCPA 184

Query: 183 YIKFGDPKNGIAFGPAMKFR 203
           YIKFGDPKNGIAFGPAMKF+
Sbjct: 185 YIKFGDPKNGIAFGPAMKFQ 204

BLAST of Cp4.1LG00g01670 vs. TrEMBL
Match: A0A061GV10_THECC (Coiled-coil domain-containing protein 55, putative isoform 1 OS=Theobroma cacao GN=TCM_041289 PE=4 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 4.5e-92
Identity = 212/321 (66.04%), Positives = 238/321 (74.14%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQP-TRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHK 300
           M KYGLQLRV PSQQK+P TRPPLP PLGF+DDDDDDVEREISRQASKNK+LK+IEEQH+
Sbjct: 1   MKKYGLQLRVPPSQQKKPVTRPPLPPPLGFRDDDDDDVEREISRQASKNKSLKEIEEQHR 60

Query: 301 KALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERK 360
           KAL EDPSVFDYDGVYDEMKEK V PR  DREER+ KYI NL+KKA++R+ EQE++YERK
Sbjct: 61  KALEEDPSVFDYDGVYDEMKEKVVRPRVQDREERQSKYIHNLIKKAEQRKWEQEIVYERK 120

Query: 361 LAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYF 420
           L KERSK+DHLYA KDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSD+SDFYF
Sbjct: 121 LVKERSKEDHLYADKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDLSDFYF 180

Query: 421 NLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSPSQ 480
           NL KN+A+GA N V PR P      EK  E +K +K +E    D++   Q     ++P  
Sbjct: 181 NLGKNVAFGA-NEVGPRMP------EKHTESRKPEKKDEK-EIDVIKRVQPLPNSNAPES 240

Query: 481 SAKTIE-QDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHHK 540
           S  T   QD         S  + P  V    P     +T   KQP  DQ    PKRDHHK
Sbjct: 241 SGVTDRTQDETSSRENFESQDSRPITV-DVLPETALQETSSVKQPSVDQ----PKRDHHK 300

Query: 541 RSEDAVAAAKERFLARKRTKE 560
           R EDAVAAA+ERFLARKR KE
Sbjct: 301 RGEDAVAAARERFLARKRAKE 308

BLAST of Cp4.1LG00g01670 vs. TrEMBL
Match: A0A0D2PQG7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G124100 PE=4 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 2.5e-90
Identity = 205/320 (64.06%), Positives = 237/320 (74.06%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQP-TRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHK 300
           M KYGLQLRV PSQQK+P TRPPLP P GFQ+DDDDDVE+EISRQASKNKALK++EEQHK
Sbjct: 1   MKKYGLQLRVPPSQQKKPVTRPPLPPPRGFQEDDDDDVEKEISRQASKNKALKEVEEQHK 60

Query: 301 KALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERK 360
           KAL EDPSVFDYDGVYD MKE+ V PRA D EERKPKYI NLMKKA++R+ EQE++YERK
Sbjct: 61  KALEEDPSVFDYDGVYDAMKEEVVRPRAQDHEERKPKYILNLMKKAEQRKWEQEIVYERK 120

Query: 361 LAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYF 420
           L KERSK+DHLYA KDKFVTSAYK+KLAEQAKWMEEERLRQLREEK+DVTKKSD+SDFYF
Sbjct: 121 LVKERSKEDHLYADKDKFVTSAYKRKLAEQAKWMEEERLRQLREEKEDVTKKSDLSDFYF 180

Query: 421 NLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSPSQ 480
           NL KN+A+GA N  +PR+P          E+++ +K +E    + VN       P S + 
Sbjct: 181 NLGKNVAFGA-NEAKPRKP----------ELREPEKEKEKEVLNRVNPLPDSVSPESSAV 240

Query: 481 SAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHHKR 540
             +T ++    E   S  +K    D   T P     +T  EKQP  D    QPK DHHKR
Sbjct: 241 IDRTRDESSFREIFDSRDSKPITED---TVPDTAVQETTSEKQPSVD----QPKSDHHKR 300

Query: 541 SEDAVAAAKERFLARKRTKE 560
             DAVAAA+ERFLARKR KE
Sbjct: 301 GADAVAAARERFLARKRAKE 302

BLAST of Cp4.1LG00g01670 vs. TrEMBL
Match: A0A0B0NEG8_GOSAR (Nuclear speckle splicing regulatory 1 OS=Gossypium arboreum GN=F383_10067 PE=4 SV=1)

HSP 1 Score: 338.2 bits (866), Expect = 2.1e-89
Identity = 204/320 (63.75%), Positives = 235/320 (73.44%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQP-TRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHK 300
           M KYGLQLRV PSQQK+P TRPPLP P GFQ+DDDDDVE+EISRQASKNKALK++EEQHK
Sbjct: 1   MKKYGLQLRVPPSQQKKPATRPPLPPPRGFQEDDDDDVEKEISRQASKNKALKEVEEQHK 60

Query: 301 KALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERK 360
           KAL EDP VFDYDGVYD MKE+ V PRA DREERKPKYI NLMKKA++R+ EQE++YERK
Sbjct: 61  KALEEDPCVFDYDGVYDAMKEEVVRPRAQDREERKPKYIHNLMKKAEQRKWEQEIVYERK 120

Query: 361 LAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYF 420
           L KERSK+DHLYA KDKFVTSAYK+KLAEQAKWMEEERLRQLREEK+DVTKKSD+SDFYF
Sbjct: 121 LVKERSKEDHLYADKDKFVTSAYKRKLAEQAKWMEEERLRQLREEKEDVTKKSDLSDFYF 180

Query: 421 NLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSPSQ 480
           NL KN+A+GA N  +PREP          E+++ +K +E    + VN       P S + 
Sbjct: 181 NLGKNVAFGA-NEAKPREP----------ELREPEKEKEKEVLNRVNLLPDSIAPESSAV 240

Query: 481 SAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHHKR 540
             +T ++    E   S  +K    D     P     +T  EKQP       QPK DHHKR
Sbjct: 241 IDRTRDESSSQEIFDSRDSKPITED---AVPDTAVQETTSEKQP-----PDQPKSDHHKR 300

Query: 541 SEDAVAAAKERFLARKRTKE 560
             DAVAAA+ERFLARKR KE
Sbjct: 301 GADAVAAARERFLARKRAKE 301

BLAST of Cp4.1LG00g01670 vs. TAIR10
Match: AT2G27285.1 (AT2G27285.1 Coiled-coil domain-containing protein 55 (DUF2040))

HSP 1 Score: 267.3 bits (682), Expect = 2.3e-71
Identity = 169/335 (50.45%), Positives = 226/335 (67.46%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQPT-RPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHK 300
           M KYGLQ+R  PSQ+KQ + RPPL     F +++D DVE+EISRQA+K KA K+IEEQHK
Sbjct: 1   MKKYGLQIRA-PSQKKQSSSRPPLRPASIFDEEEDHDVEKEISRQATKTKAHKEIEEQHK 60

Query: 301 KALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERK 360
           KAL EDPS F YD VYD+MK+KAVLPR  DREERKP+YIQNLMK+A+ RE+E E++YERK
Sbjct: 61  KALEEDPSAFSYDEVYDDMKQKAVLPRMQDREERKPRYIQNLMKQAERREKEHEIVYERK 120

Query: 361 LAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYF 420
           LAKER KD+HL++ K+KFVT AYK+KL EQ KW+ EERLR+LREE+DDVTKK D+SDFYF
Sbjct: 121 LAKEREKDEHLFSDKEKFVTGAYKRKLEEQKKWLAEERLRELREERDDVTKKKDLSDFYF 180

Query: 421 NLQKNIAYGA-------GNAVEPREPPQKLQSEKQA--------EVQKLKKHEESVNADI 480
           N+ KN+A+GA          +E +   +KL+ +++A        EV +++K  +S   ++
Sbjct: 181 NIGKNVAFGAREVEAKEAEKLEEQRKAEKLEEQRKAEKLEELRKEVTRVEKKRKSPEKEV 240

Query: 481 VNDHQSPKYPSSPSQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPE 540
             D  S ++ SS S+S + +E ++ + E    S+ T                    K   
Sbjct: 241 SPD--SGEFGSSRSKSLEPLEAEQAVSEKEMGSDGTEE-----------------RKSSI 300

Query: 541 HDQGEQQPKR-DHHKRSEDAVAAAKERFLARKRTK 559
            +  ++ PK  +  KR EDA+AAAKERFLARK+ K
Sbjct: 301 KEAAKEVPKAINDQKRREDAIAAAKERFLARKKAK 315

BLAST of Cp4.1LG00g01670 vs. TAIR10
Match: AT2G27280.1 (AT2G27280.1 Coiled-coil domain-containing protein 55 (DUF2040))

HSP 1 Score: 238.4 bits (607), Expect = 1.1e-62
Identity = 143/286 (50.00%), Positives = 200/286 (69.93%), Query Frame = 1

Query: 233 SGSQKTLKMSKYGLQLRVKPSQQKQPTRPPL--PAPLGFQDDDDDDVEREISRQASKNKA 292
           S  Q T+K+ KYGLQ+R  PSQ+KQ +  PL   A +  +DD+++DVE+EISRQASK K+
Sbjct: 134 SVKQATVKIKKYGLQIRA-PSQKKQSSSRPLLRTASIFGEDDEENDVEKEISRQASKTKS 193

Query: 293 LKDIEEQHKKALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKERER 352
           LK IE+QHKKA+ EDPS F YD VYD++K +A LPR  DREE K +YIQ++MK+A+ RE+
Sbjct: 194 LKKIEKQHKKAIEEDPSAFAYDEVYDDIKHEAALPRMQDREEHKSRYIQHIMKQAERREK 253

Query: 353 EQEVIYERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTK 412
           E E++YERKLAKER+KD+HLY+ K+KFVT  +K+KL EQ KW+EEERLR+LREE+DDVTK
Sbjct: 254 EHEIVYERKLAKERAKDEHLYSDKEKFVTGPFKRKLEEQKKWLEEERLRELREERDDVTK 313

Query: 413 KSDISDFYFNLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQS 472
           K+D+S+FY N+ KN+A+GA + +E RE  +  +  K   +++L+K E           +S
Sbjct: 314 KNDLSEFYINIGKNVAFGARD-IEAREAGRLKELRKVDRLEELRKEETRKE----KKRKS 373

Query: 473 PKYPSSPS------QSAKTIE-QDRHLEEHLSNSNKTTPSDVRGTS 510
           P+   SP        S K+++ QD  ++E    + K T  D   T+
Sbjct: 374 PEKEVSPDSGDFGLSSKKSVKPQDASIKEEAKETQKATREDAIATA 413
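
The statistics lines that precede each alignment can be read programmatically. The following is a minimal Python sketch, not part of the database itself: the function name parse_hsp and the two regular expressions are hypothetical, and it assumes the line layout shown in the AT2G27280.1 hit above. It recomputes the identity and positives percentages from the raw counts so they can be checked against the printed values.

```python
import re

# Hypothetical patterns for the two statistics lines shown above each alignment.
# "Pos\w*" tolerates spelling variants of the "Positives" label.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w*\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(score_line, identity_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP statistics lines")
    length = int(i["length"])  # alignment length, including gap columns
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identity_pct": round(100 * int(i["ident"]) / length, 2),
        "positives_pct": round(100 * int(i["pos"]) / length, 2),
    }


# Values copied from the AT2G27280.1 hit above.
stats = parse_hsp(
    "HSP 1 Score: 238.4 bits (607), Expect = 1.1e-62",
    "Identity = 143/286 (50.00%), Positives = 200/286 (69.93%), Query Frame = 1",
)
print(stats)
```

Both percentages are computed over the full alignment length (286 columns here), which is why identity and positives share the same denominator.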

BLAST of Cp4.1LG00g01670 vs. TAIR10
Match: AT3G11660.1 (AT3G11660.1 NDR1/HIN1-like 1)

HSP 1 Score: 226.9 bits (577), Expect = 3.5e-59
Identity = 106/196 (54.08%), Positives = 141/196 (71.94%), Query Frame = 1

Query: 1   MTDCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGL 60
           M DC++H H    RR+L RRI  +I  VL +I L I LIWAIL+PSKPR ILQD TV+  
Sbjct: 1   MKDCENHGH---SRRKLIRRIFWSIIFVLFIIFLTILLIWAILQPSKPRFILQDATVYAF 60

Query: 61  NVSSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHND 120
           NVS  PP  ++   Q+T+SS NPN++IG+YY  +DVYA YR QQ+T PT +PPTYQGH D
Sbjct: 61  NVSGNPPNLLTSNFQITLSSRNPNNKIGIYYDRLDVYATYRSQQITFPTSIPPTYQGHKD 120

Query: 121 VTVWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANC 180
           V +WSPF+YG +VP+AP    +L+ D + G +L  I+ +G+VRWKVG++I+G+Y L+  C
Sbjct: 121 VDIWSPFVYGTSVPIAPFNGVSLDTDKDNGVVLLIIRADGRVRWKVGTFITGKYHLHVKC 180

Query: 181 PAYIKFGDPKNGIAFG 197
           PAYI FG+  NG+  G
Sbjct: 181 PAYINFGNKANGVIVG 193

BLAST of Cp4.1LG00g01670 vs. TAIR10
Match: AT3G44220.1 (AT3G44220.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 223.0 bits (567), Expect = 5.0e-58
Identity = 108/202 (53.47%), Positives = 143/202 (70.79%), Query Frame = 1

Query: 1   MTDCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGL 60
           MT+ +  HHHD E  ++ +RI   +   L  +  V+FL+WAIL P  PR +LQD T++  
Sbjct: 1   MTEKECEHHHD-EDEKMRKRIGALVLGFLAAVLFVVFLVWAILHPHGPRFVLQDATIYAF 60

Query: 61  NVSSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHND 120
           NVS   P  ++  +QVT+SS NPN +IG++Y  +D+YA+YR QQVTL TLLP TYQGH D
Sbjct: 61  NVSQ--PNYLTSNLQVTLSSRNPNDKIGIFYDRLDIYASYRNQQVTLATLLPATYQGHLD 120

Query: 121 VTVWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANC 180
           VT+WSPFLYG  VPVAP F+ AL++D   G +L NIKI+G VRWKVG+W+SGRYRL+ NC
Sbjct: 121 VTIWSPFLYGTTVPVAPYFSPALSQDLTAGMVLLNIKIDGWVRWKVGTWVSGRYRLHVNC 180

Query: 181 PAYIKFGDPKNGIAFGPAMKFR 203
           PAYI      +G   GPA+K++
Sbjct: 181 PAYITLAGHFSG--DGPAVKYQ 197

BLAST of Cp4.1LG00g01670 vs. TAIR10
Match: AT5G22200.1 (AT5G22200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 212.2 bits (539), Expect = 8.8e-55
Identity = 107/200 (53.50%), Positives = 139/200 (69.50%), Query Frame = 1

Query: 4   CDHHHHHDCERRRLY-RRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNV 63
           CD H+ ++  R R+  RRIA A   +++ ++ V+FL+WAIL P  PR +LQDVT+   NV
Sbjct: 6   CDQHNGYEERRMRMMMRRIAWACLGLIVAVAFVVFLVWAILHPHGPRFVLQDVTINDFNV 65

Query: 64  SSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 123
           S   P  +S  +QVT+SS NPN +IG++Y  +D+Y  YR Q+VTL  LLP TYQGH +VT
Sbjct: 66  SQ--PNFLSSNLQVTVSSRNPNDKIGIFYDRLDIYVTYRNQEVTLARLLPSTYQGHLEVT 125

Query: 124 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPA 183
           VWSPFL G AVPVAP  + ALNED   G +L NIKI+G VRWKVGSW+SG YRL+ NCPA
Sbjct: 126 VWSPFLIGSAVPVAPYLSSALNEDLFAGLVLLNIKIDGWVRWKVGSWVSGSYRLHVNCPA 185

Query: 184 YIKFGDPKNGIAFGPAMKFR 203
           +I       G   GPA+K++
Sbjct: 186 FITVTGKLTGT--GPAIKYQ 201

BLAST of Cp4.1LG00g01670 vs. NCBI nr
Match: gi|449470301|ref|XP_004152856.1| (PREDICTED: nuclear speckle splicing regulatory protein 1-like [Cucumis sativus])

HSP 1 Score: 489.6 bits (1259), Expect = 8.1e-135
Identity = 266/323 (82.35%), Positives = 286/323 (88.54%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK 300
           MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK
Sbjct: 1   MSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHKK 60

Query: 301 ALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERKL 360
           AL EDPSVFDYDGVYDEMKEK V PRAYDREERKPKYIQNLMKKA+EREREQE+IYERKL
Sbjct: 61  ALEEDPSVFDYDGVYDEMKEKVVQPRAYDREERKPKYIQNLMKKAQEREREQEIIYERKL 120

Query: 361 AKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYFN 420
           AKERSKDDHLYAGKDKFVT AYKKKLAEQAKWMEEERLRQLREEK+DVTKKSD+SDFYF+
Sbjct: 121 AKERSKDDHLYAGKDKFVTGAYKKKLAEQAKWMEEERLRQLREEKEDVTKKSDMSDFYFS 180

Query: 421 LQKNIAYGAGNAVEPREPPQKLQ---SEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSP 480
           LQKN+AYGA NA+EP  PP+KLQ    EKQ EV  L+KHEES N D  NDH   +  + P
Sbjct: 181 LQKNVAYGARNAIEPTAPPKKLQLEKKEKQTEVHILEKHEESCNGDTFNDHLL-QNSNLP 240

Query: 481 SQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHH 540
           S S KTIE+  HLEE L  SNKTTPS++ GTSPP LH+KTPV++QP+H+Q EQ PK DHH
Sbjct: 241 SCSEKTIEEHPHLEERLPFSNKTTPSNIEGTSPPTLHDKTPVQEQPKHEQSEQPPKSDHH 300

Query: 541 KRSEDAVAAAKERFLARKRTKEV 561
           KR+EDAVAAAKERFLARKR KEV
Sbjct: 301 KRNEDAVAAAKERFLARKRAKEV 322

BLAST of Cp4.1LG00g01670 vs. NCBI nr
Match: gi|659069555|ref|XP_008450616.1| (PREDICTED: nuclear speckle splicing regulatory protein 1-like [Cucumis melo])

HSP 1 Score: 488.0 bits (1255), Expect = 2.4e-134
Identity = 267/333 (80.18%), Positives = 289/333 (86.79%), Query Frame = 1

Query: 231 FISGSQKTLKMSKYGLQLRVKPSQQKQPTRPPLPAPLGFQDDDDDDVEREISRQASKNKA 290
           F S   +  KMSKYGLQLRVKPSQQKQP RPPLPAPLGFQDDDDD+VEREISRQASKNKA
Sbjct: 3   FSSEGLEDSKMSKYGLQLRVKPSQQKQPKRPPLPAPLGFQDDDDDNVEREISRQASKNKA 62

Query: 291 LKDIEEQHKKALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKERER 350
           LKDIEEQHKKAL EDPSVFDYDGVYDEMKEKAV PRAYDREERKPKYIQNLMKKA+ERER
Sbjct: 63  LKDIEEQHKKALEEDPSVFDYDGVYDEMKEKAVQPRAYDREERKPKYIQNLMKKAQERER 122

Query: 351 EQEVIYERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTK 410
           EQE+IYERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEK+DVTK
Sbjct: 123 EQEIIYERKLAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKEDVTK 182

Query: 411 KSDISDFYFNLQKNIAYGAGNAVEPREPPQKLQ---SEKQAEVQKLKKHEESVNADIVND 470
           KSD+SDFYF+LQKN+AYGA NA+EPR  P+KLQ    EKQ EV  L+KHEES N D  ND
Sbjct: 183 KSDMSDFYFSLQKNVAYGARNAIEPRATPKKLQLEKKEKQTEVHILEKHEESCNGDTFND 242

Query: 471 HQSPKYPSSPSQSAKTIEQDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQ 530
           HQ         Q++KTIE+  HLEE LS SNKTTPSD+ GTSPP LH+K  V+++P+H+ 
Sbjct: 243 HQ--------LQNSKTIEEHSHLEERLSASNKTTPSDIEGTSPPTLHDKALVQERPKHEL 302

Query: 531 GEQQPKRDHHKRSEDAVAAAKERFLARKRTKEV 561
            EQ PKRDHHKR+EDAVAAAKERFLARKR KEV
Sbjct: 303 SEQPPKRDHHKRNEDAVAAAKERFLARKRAKEV 327

BLAST of Cp4.1LG00g01670 vs. NCBI nr
Match: gi|449443654|ref|XP_004139592.1| (PREDICTED: protein YLS9-like [Cucumis sativus])

HSP 1 Score: 396.4 bits (1017), Expect = 9.3e-107
Identity = 189/200 (94.50%), Positives = 195/200 (97.50%), Query Frame = 1

Query: 3   DCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNV 62
           DCDHHHHHDCERRRLYRRIAC IFTV+LLI LVIFLIWAILRPSKPRLILQDVT+ GLNV
Sbjct: 5   DCDHHHHHDCERRRLYRRIACVIFTVVLLIGLVIFLIWAILRPSKPRLILQDVTLLGLNV 64

Query: 63  SSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 122
           SSVPPAAIS TMQ+TISSHNPN+RIGVYYQ MDVYAAYRGQQVTLPTLLPPTYQGHNDVT
Sbjct: 65  SSVPPAAISTTMQITISSHNPNNRIGVYYQVMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 124

Query: 123 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPA 182
           VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIK+NGQVRWKVGSWISGRYRLNANCPA
Sbjct: 125 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKVNGQVRWKVGSWISGRYRLNANCPA 184

Query: 183 YIKFGDPKNGIAFGPAMKFR 203
           YIKFGDPKNGIAFGPAMKF+
Sbjct: 185 YIKFGDPKNGIAFGPAMKFQ 204

BLAST of Cp4.1LG00g01670 vs. NCBI nr
Match: gi|659127309|ref|XP_008463636.1| (PREDICTED: protein YLS9-like [Cucumis melo])

HSP 1 Score: 393.3 bits (1009), Expect = 7.9e-106
Identity = 187/200 (93.50%), Positives = 194/200 (97.00%), Query Frame = 1

Query: 3   DCDHHHHHDCERRRLYRRIACAIFTVLLLISLVIFLIWAILRPSKPRLILQDVTVFGLNV 62
           DCDHHHHHDCERRRLYRRIAC IF++LLLI LVIFLIWAILRPSKPRLILQDVT+ GLNV
Sbjct: 5   DCDHHHHHDCERRRLYRRIACVIFSLLLLIGLVIFLIWAILRPSKPRLILQDVTLLGLNV 64

Query: 63  SSVPPAAISITMQVTISSHNPNSRIGVYYQTMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 122
           SSVPPAAIS TMQ+TISSHNPN+RIGVYYQ MDVYAAYRGQQVTLPTLLPPTYQGHNDVT
Sbjct: 65  SSVPPAAISTTMQITISSHNPNTRIGVYYQVMDVYAAYRGQQVTLPTLLPPTYQGHNDVT 124

Query: 123 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKINGQVRWKVGSWISGRYRLNANCPA 182
           VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIK+NGQVRWKVGSWISGRYRLNANCPA
Sbjct: 125 VWSPFLYGEAVPVAPEFAEALNEDNNVGAMLFNIKVNGQVRWKVGSWISGRYRLNANCPA 184

Query: 183 YIKFGDPKNGIAFGPAMKFR 203
           YIKFGDPKNGIAFGP MKF+
Sbjct: 185 YIKFGDPKNGIAFGPTMKFQ 204

BLAST of Cp4.1LG00g01670 vs. NCBI nr
Match: gi|590586457|ref|XP_007015711.1| (Coiled-coil domain-containing protein 55, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 347.1 bits (889), Expect = 6.5e-92
Identity = 212/321 (66.04%), Positives = 238/321 (74.14%), Query Frame = 1

Query: 241 MSKYGLQLRVKPSQQKQP-TRPPLPAPLGFQDDDDDDVEREISRQASKNKALKDIEEQHK 300
           M KYGLQLRV PSQQK+P TRPPLP PLGF+DDDDDDVEREISRQASKNK+LK+IEEQH+
Sbjct: 1   MKKYGLQLRVPPSQQKKPVTRPPLPPPLGFRDDDDDDVEREISRQASKNKSLKEIEEQHR 60

Query: 301 KALAEDPSVFDYDGVYDEMKEKAVLPRAYDREERKPKYIQNLMKKAKEREREQEVIYERK 360
           KAL EDPSVFDYDGVYDEMKEK V PR  DREER+ KYI NL+KKA++R+ EQE++YERK
Sbjct: 61  KALEEDPSVFDYDGVYDEMKEKVVRPRVQDREERQSKYIHNLIKKAEQRKWEQEIVYERK 120

Query: 361 LAKERSKDDHLYAGKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDISDFYF 420
           L KERSK+DHLYA KDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSD+SDFYF
Sbjct: 121 LVKERSKEDHLYADKDKFVTSAYKKKLAEQAKWMEEERLRQLREEKDDVTKKSDLSDFYF 180

Query: 421 NLQKNIAYGAGNAVEPREPPQKLQSEKQAEVQKLKKHEESVNADIVNDHQSPKYPSSPSQ 480
           NL KN+A+GA N V PR P      EK  E +K +K +E    D++   Q     ++P  
Sbjct: 181 NLGKNVAFGA-NEVGPRMP------EKHTESRKPEKKDEK-EIDVIKRVQPLPNSNAPES 240

Query: 481 SAKTIE-QDRHLEEHLSNSNKTTPSDVRGTSPPPLHNKTPVEKQPEHDQGEQQPKRDHHK 540
           S  T   QD         S  + P  V    P     +T   KQP  DQ    PKRDHHK
Sbjct: 241 SGVTDRTQDETSSRENFESQDSRPITV-DVLPETALQETSSVKQPSVDQ----PKRDHHK 300

Query: 541 RSEDAVAAAKERFLARKRTKE 560
           R EDAVAAA+ERFLARKR KE
Sbjct: 301 RGEDAVAAARERFLARKRAKE 308

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
NHL12_ARATH  3.9e-44  48.02         NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1
NSRP1_BOVIN  1.1e-22  38.60         Nuclear speckle splicing regulatory protein 1 OS=Bos taurus GN=NSRP1 PE=2 SV=1
NSRP1_MOUSE  1.6e-21  31.12         Nuclear speckle splicing regulatory protein 1 OS=Mus musculus GN=Nsrp1 PE=1 SV=1
NSRP1_RAT    1.1e-19  31.13         Nuclear speckle splicing regulatory protein 1 OS=Rattus norvegicus GN=Nsrp1 PE=1...
NSRP1_HUMAN  2.1e-18  31.89         Nuclear speckle splicing regulatory protein 1 OS=Homo sapiens GN=NSRP1 PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LJE9_CUCSA  5.6e-135  82.35         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G070270 PE=4 SV=1
A0A0A0LVW5_CUCSA  6.5e-107  94.50         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G169420 PE=4 SV=1
A0A061GV10_THECC  4.5e-92   66.04         Coiled-coil domain-containing protein 55, putative isoform 1 OS=Theobroma cacao ...
A0A0D2PQG7_GOSRA  2.5e-90   64.06         Uncharacterized protein OS=Gossypium raimondii GN=B456_001G124100 PE=4 SV=1
A0A0B0NEG8_GOSAR  2.1e-89   63.75         Nuclear speckle splicing regulatory 1 OS=Gossypium arboreum GN=F383_10067 PE=4 S...

Match Name   E-value  Identity (%)  Description
AT2G27285.1  2.3e-71  50.45         Coiled-coil domain-containing protein 55 (DUF2040)
AT2G27280.1  1.1e-62  50.00         Coiled-coil domain-containing protein 55 (DUF2040)
AT3G11660.1  3.5e-59  54.08         NDR1/HIN1-like 1
AT3G44220.1  5.0e-58  53.47         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G22200.1  8.8e-55  53.50         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

Match Name                        E-value   Identity (%)  Description
gi|449470301|ref|XP_004152856.1|  8.1e-135  82.35         PREDICTED: nuclear speckle splicing regulatory protein 1-like [Cucumis sativus]
gi|659069555|ref|XP_008450616.1|  2.4e-134  80.18         PREDICTED: nuclear speckle splicing regulatory protein 1-like [Cucumis melo]
gi|449443654|ref|XP_004139592.1|  9.3e-107  94.50         PREDICTED: protein YLS9-like [Cucumis sativus]
gi|659127309|ref|XP_008463636.1|  7.9e-106  93.50         PREDICTED: protein YLS9-like [Cucumis melo]
gi|590586457|ref|XP_007015711.1|  6.5e-92   66.04         Coiled-coil domain-containing protein 55, putative isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR018612  DUF2040
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG00g01670.1  Cp4.1LG00g01670.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                              Source   Source Term    Source Description                             Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168        LEA_2                                          coord: 77..180; score: 1.0
IPR018612  Domain of unknown function DUF2040           PFAM     PF09745        DUF2040                                        coord: 295..411; score: 7.8
None       No IPR available                             unknown  Coil           Coil                                           coord: 384..411; scor
None       No IPR available                             PANTHER  PTHR30060      INNER MEMBRANE PROTEIN                         coord: 222..566; score: 5.6E
None       No IPR available                             PANTHER  PTHR30060:SF0  COILED-COIL DOMAIN-CONTAINING PROTEIN-RELATED  coord: 222..566; score: 5.6E
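
The coordinate ranges in the InterPro table can be compared directly to see how the predicted features sit relative to one another. The following is an illustrative Python sketch only: the helper names parse_coord, span_length and contains are hypothetical, and it assumes the coordinates are 1-based, inclusive amino-acid positions on the protein.

```python
def parse_coord(text):
    """Turn a string such as 'coord: 77..180' into a (start, end) tuple."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def span_length(span):
    """Length of an inclusive interval (assumes 1-based, inclusive positions)."""
    return span[1] - span[0] + 1


def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Coordinates copied from the InterPro table above.
lea_2 = parse_coord("coord: 77..180")     # PF03168, LEA_2
duf2040 = parse_coord("coord: 295..411")  # PF09745, DUF2040
coil = parse_coord("coord: 384..411")     # Coil prediction
panther = parse_coord("coord: 222..566")  # PTHR30060

print(span_length(lea_2))          # residues covered by the LEA_2 match
print(contains(duf2040, coil))     # does the coil fall inside DUF2040?
print(contains(panther, duf2040))  # does the PANTHER match span DUF2040?
print(contains(panther, lea_2))    # is LEA_2 inside the PANTHER match?
```

Under these assumptions the LEA_2 match lies outside the PANTHER range, while the coil prediction falls within the DUF2040 match.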

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG00g01670  Bottle gourd (USVL1VR-Ls)  cpelsiB010