Cp4.1LG01g02180 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g02180
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: U4/U6.U5 tri-snRNP-associated protein 1
Location: Cp4.1LG01 : 2531729 .. 2541500 (-)
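
The location field follows a `scaffold : start .. end (strand)` pattern. A minimal sketch of parsing it and deriving the span length is shown here; the helper name `parse_location` and the regular expression are illustrative assumptions, not part of the database, and the length assumes 1-based inclusive coordinates.

```python
import re

# Hypothetical helper; the pattern is inferred from the location field of this record.
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its parts."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG01 : 2531729 .. 2541500 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

The `(-)` strand flag means the gene is annotated on the reverse strand of the scaffold, which matters when extracting the sequence from the assembly by coordinates.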
The following sequences are available for this feature:

Gene sequence (with intron)

TACCGGGTCGGTTATTGAATCGGATCAAAGGAAGTATTTGTTTCTTCTCCGCCCTTCTGTGCTGCTTCGAGTTCCTCCGGTCAGAATTGCGCACGCCACAATCAGCCACTGCTGCATCGTCGCTGCAGCCTCTTCTTTTTGCTCTCGACTTCCGATTTCTTCATCATCATCGATTTGTGAGCTTATTTCTCCGAACTTTGTATCTAGCTTCTTCATCAACATTGCCTTCCAGCTTCCAGTTTTGCTGTGGTAACCATTTCCCTCGTTCATCCCTACTTCATTTGTTATTTTGTATGCCTACCGTACTTGTTTCCTACGGAGATTTCCACCGCCTCATAGTATCACCGATTCACTGTTTTTAACGAAAGATTTCGTGGCTTACATTATTCTAGTCTGGAGCTTTTTCCCCCTTTCCTTTTGGAATTCACTTTTTTTTTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTGGGGGGGATGCGGGTGGTGGGTTATGGTTATGGGAATATTATCGTTCATATATCCAGGGTTTAGATAGGCAAAATTAATCATCTAATTTTTTTTATTTGTTCTTAGCACGAGCTTCGTTTTAATTTTGAATTCAGTCTGTATCTCTTCTCGTATTTAACAATTGGAAGTTGGAACTCTTGCTCATTCTTTACTTGCTTGAACTGATGCCTAATGCTTTCCTAAATTTAGCTTTAGTGGTTGACATAGTTTGCTCCAATTATCTATTCTGTATATCATCTTTCCGTATAACTTATTCTTTTTAGGTGTTAACTGAGATTTAGAGGCAAGCTATTAAATAATTGCAAATGGACGCGGATGGGTCATCTGTACCTGAACATGATGAGAGAAATGGTCATGAGGCAAGAGATCGTGGGGAAGGACAGGATGACTTTGGTTATAGTGGAGCAGAAAAGTCAAGCAAGCATCGGAGTGAGGATCATCGGAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGATCGATCTAAGAGACGTAGTGATGATGCATCGAAGGAAAAGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATATTCGTGAAAGGAGGAAGGAAGACAGGGATGAGCATGATAAAGAAAGGACTAGGGAGAAGAAAGTTAAAGACAAAGATTATGACAGAGAGGTTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAGCGGGGAAGGGAGAGAGAATTGGAGAAGGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGAGATAGGGAAAGGGAAAGGGAAAGAGATAGGGATAGGAAGAAGAAGGAGAAGGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACGCAGAGATCAAGAGGAGAAAGAAAGCTATCGGAACATTGATAAGGAAAGAGGAAAAGAGAAAAATTTGGTGGATGATAAGAAAGGAGATCAAAACAAGGAGAAATTACGAGATAAAGAAGGAATTGGCGGCAAAAATGATGAAGAAAGAATTGATTGGATTGCACATGGGGCTAAGGATTATATGCTAGAAAGTGATGGCGAGGATAACAGGGACAGAGGTGTTGATCAAGGGAATGCAGTCCAGCATTTGGGAGGTGAAGAAAATTCTGATGGGTTGAAAGTTGGAGCTCAGTCTTCTTCAGCTATGCTTGAGGAGCGCATTCGGACGTGAGTAGATACTCACCCTTTTGCTTTCTATATATATGTTTTTTTTTTTTGTGTATATGTATCTGAATATGAAGCATGTGCATAGATTAGGTAATAATGAATCCTGAAGTGTGTTCCTGCAAAATTTGATTTTGATACCTTCCGGATTTCACTCAGTTTTTCGTTATTTCTTTATAATCTGATAATAATATTTTCCTTAGGTTGCTTAAACTTATACTCACCTTTTTTTT
TTCTTTTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTGTTAATGTATTATCAATTGTGTGCACGAGTAGGTAATAACAAAAAATGAAGTGGGCTCCTGCAAAATTTGACTGTGATGCCTTCTAGATTTCACTCAGTTATTGTAATCTGATAATAATATTTCCTTGGGTAGATTAAACTTGAATGTATGCAATTGTATTTGATATTTGTTTTATTCTATTTAGAGAAATCTTTACCTTAGCTTGAAAAGTGTTTATGATATAGTTACATTGCTTGTTGATTTGTTTATAAGTAATGTCGATGGCCCAAGGCTTTTACTTCCTTCTAAACTTTTGAGCCGGACCAAAATCATTTATTATTGAAAAGATGACTATAGGTAGTTATTTTAAAAGAAAATAAATGGCAAAGCAAGTGATGTAAATTCAGGCGGGGAAGACAAAACCCTCACCAAGGATGTCTATGAAATACTCAGTCGTGGTTAATCAATAATAGATTGTGAATACCAAAAAGGTCTTGTGATGGGCACACCAAAATACATATTGAGATCAGGCATCTTTCATGATTAAATTCTAGCACATGTCAGCTTGAATCAAATTCTGGAGACAAGGGATCCATAAGTTATGGGTTTTTATGGTTTTGATCGGTTATTACATAACCTCCTATATAAAATAAAAGACCATGACACAACCAAACTTTTGTAATATTGCTGAATACAATTGAATGTATGTTGTACTTTATTTCATATATGATATATCAACTAATAAATCAAACACCACATATGGAAGGTGCATATAACAATATGTATCCATTTATCAAGATTATCATATCACATTAGGCATATATATTGTTTCTGTAAAAGGCATCCTCTTTAAGAGAATGGTTTTTTTTTTTTTTTTTTTTTTTTTGGAGAGAAAAGGATAAACCGTTATATTATCAACACATGGCCATTATTTCCAGTGTCATGTGTTTAAAACGATGGGCGAGTTGGCCATCTATCAAGAAATTAGTTTTCTTAGAGTTTTTCAGTGCCAAATTATGTAGGAGCAGGCAATGGTCATATGAGATTGATTAGTCGATTTCATTCCATTCTTATAGGTACCTACTTAAAAGGAAAGGATTCCATTCAATGATTATGATATTCACTTTATGACTGGTAAAAAAAAAATCAGTCTTGTTGATTATAGTGACTGGAAGACTCTTTTGGTGTAGCTTTCTTTTTCTTTTTTCTTTTGATGGGGGGAGGGTACTCTCTACCCAGGCTCAGGCCCTTAGTTTATGCTTGTTTGTTTTGTAGAATAAATTTCCTTGATCTTTCTTATAAAAAAGAATCCGGCCAAAAAATGTAACTGATTCCATTCATATCCTGTTTCTGTCCCATTTTGATTCCACACCCTATTGTCTCTTGAAACCAATGGGACACTACCTTTGCGTACTTTTTTTTATAATTCCCGCTTGCTCAATTTCACATCACTCTTGTGTTTGCGATGTATGTTGTTTTCCTTGCAAATTCATAGTTTTCTGTCCCCAATCAGTTGGTGAAGTTCAAGAATTCAAGAATTGGATCTCTCATCCTTATCCTTCAAGGACATGCAGTTCTGTTGTTTCATTCTGTCCTGGATACCTCGTCCTTGCATTGATCACTGTGATTGCATGTAGCAAACTGGCCAACATACTATGCTCACTTGGACATAGGGTTGTATGATTATCTTAGATGCTATGATGCTCTGTGGATATTTCTTTCCTTATAAATTTATAATATATGGTTTAGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTCTCTCTCTTTTCTGCTACAAGTGAAAACTTCTGACCTTATGTTGTGACTGGCAATTTGCATGTAAATGTACTTTCTGTTTGACTGATATGTCAATAGATTTAAGAGTTCGTTTTCAATCAAATTCTTGATGCAGCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCAGAGGTTTTAGACTGGGTTAAAAGGAGTCGTAAACTTGAGGAGAAGAAACTTACTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAGGTTAGTTAACTGCATAATCCAGTGTCATTTGCAATTGCAGTCCTTTCTTTCATAACAACAACTTTCATGTGTGATTGCGATCGTGGCATTTTGGAATTTTTCAGGACAATATTGATCAAGGTGCAAGCGATGATGATATTGCAGCTGAAGATATAACTAGTATGGGTTCATACTTTTTGTTTCTTCGTATGAGTTATGATTATATATTTTTTAATTTTTTTTGCACGACTATATTTTGTGGTTACATAGATGTTGGAATAAGAATATAAGGGAATTTTTATCCAGTATTTCACTTAATATGTTTGTACAATAAGGATGATTAGGAAGCAATTAATGTATACAATGGACTGAGACATATGAGACGTCCTTTACTTCTAGCATGGAAGCACAGTGATATGCGACTTTAAAGCAGCAGTTTTTTTATTCCTCCTTTTCTTGAGCTTTTTCTTTTCTTTTTATACTTCTTCCATGTCTGATTAAAACCCAACATTGTTTTAACATTTCGAGGACACATGTATGCTTCTAGACAGCAAGGGAGGATGTCTAGAAGTGAGAGAGATATCACTTGATGTTTCTGTATGCGCAGCTTATTTATTTCTGTATGCAATTTTTAGGTAATTTAGCTGGAGTTAAAGTACTTCATGGCATAGACAAAGTACTAGGAGGTGGTGCAGTTGTCCTAACCCTTAAAGATCAGAATATCTTAGCTGATGGTGACGTTAATGAAGGTAAAATACGTTGGTTCTTTTCCTTTAATTTCAGGGCCGAAGTTCTTAAGTACTTATATATTTTAATGCATCAGATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTAAGTCACTTATTTAAGTGCTCTATGAACTGTTTTACAGGCTGCCTCAGTTCTCATCCATTCTTCTATTATAGGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGTGGACTTCTTTCCTTTTGAGCTTCTGTTTCTGTAATCTTATATAAGGAGCTCTATATCTTATTCATGTTCTTTACTTGAATCATTTCTAAACTTGGAGCATAAGCATTCTCTGAGGACAATTTGTGTTGTAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGGACTATATTTTGTGGTTACATAGATGTTGGAATAAGAATATAAGGGAATTTTTATCCAGTATTTCACTTAATATGTTTGTACAATAAGGATGATTAGGAAGCAATTAATGTATACAATGGACTGAGACATATGAGACGTCCTTTACTTCTAGCATGGAAGCACAGTGATATGCGACTTTAAAGCAGCAGTTTTTTTATTCCTCCTTTTCTTGAGCTTTTTCTTTTCTTTTTATACTTCTTCCATGTCTGATTAAAACCCAACATTGTTTTAACATTTCGAGGACACATGTATGCTTCTAGACAGCAAGGGAGGATGTCTAGAAGTGAGAGAGATATCACTTGATGTTTCTGTATGCGCAGCTTATTTATTTCTGTATGCAA
TTTTTAGGTAATTTAGCTGGAGTTAAAGTACTTCATGGCATAGACAAAGTACTAGGAGGTGGTGCAGTTGTCCTAACCCTTAAAGATCAGAATATCTTAGCTGATGGTGACGTTAATGAAGGTAAAATACGTTGGTTCTTTTCCTTTAATTTCAGGGCCGAAGTTCTTAAGTACTTATATATTTTAATGCATCAGATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTAAGTCACTTATTTAAGTGCTCTATGAACTGTTTTACAGGCTGCCTCAGTTCTCATCCATTCTTCTATTATAGGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGTGGACTTCTTTCCTTTTGAGCTTCTGTTTCTGTAATCTTATATAAGGAGCTCTATATCTTATTCATGTTCTTTACTTGAATCATTTCTAAACTTGGAGCATAAGCATTCTCTGAGGACAATTTGTGTTGTAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGGTAATTGTATTCTACGTTCTACCCTACCCATGAAGATCGGTAATGGTTCAGTATATTATAGTGCTTCTTTTTTTGCTTCCTACTGTGAACTTTTTGCTCTTTGCCAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAAACACTTTGAAGATCTTAATGCATCAGTGAAAGTCTCGCATGATTATTACACTCAAGATGAAATGCTTCGATTTAAGAAGCCCAAGAAAAAGAAATCTCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCAGAAGCAATCTCCTCTGGATTGGGTGTTGGAGATCTTGGTCCTCGAAATGATTCTAGCAGGCAAGCACGAAAAACAGAACAAGAGAGATCTGAAGCAGAAATGCGACAAAATGCATACCAGTCAGCCTATGCTAAAGCAGACGAGGCATCAAGATCTCTACAATTAGTTCAAAGCTCAGTCAGATTAGATGACAATGAAGATACTTTCATTGAAGATGATGATGAAGACCTCTATAAGTCGCTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGCAGCATCGGGACCCGAAGCAGTTGCTCTTCTTGCTACAACAACAATCAGCGGGCAGACAACTGATGATCAAAACACAAAAGCAGGAGAGTTGCAGGAAAATAAGGTTGTTTTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGGTATCATCTTCCTTATTCTCTTTTTGGTGTACCATTTATTGCTATTCACATTGACTGTTCCTATAAGTCCCCAGTCAATCTTGAGCACGGTGTACTTGTTGGTCTTTATTTTCTCTCCCCCCCCTCTCCCGCCTCATTGGGCTTTGTTAGTGGTTCTCGCAGGCTTCCTGTCGACCTTCAAACTGTGTCGATCTTTGGGATTTTCAATGAACACTAACCATCTAGAAGACCTGTTTTTTCGTGTGCTTGCTAAATCTGAACTACAACGTCTATTTGCTTGACTTAAGCCAAACATAAATTCTGGCATGCTATTTTCTGTTCTAAATGGATATGTCATAATTAAAAGCTTTATAATCGCGAATAGAAGGTTTAAAAATGAGATAAATGAGGAAAACCAGGGAGAGTGAACTGGGTTTTTTCTTTTTCTTTTTCTTTCAGAATCTCATAAACCTGAAGAAGAAGATGTCTTTATGGATGACGATGAAGCACCAAAAGAAGAATATCATGAAGATGAGAAGGATAAAGATGGTGGGTGGACTGAGGTCAAAGATACTGCCAAAGAAGAACCCACTCCTGAGGATAATGAGACAATAGCTCCCGATGAAACAATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGTACTGAAAC
TGCTTAAGGATCGTGGGACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAATAGATGAAGATGAACCAAAGGAAGCTAAGTCAAAGGATTCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATTCACATCGAGAGGACTGATGAATTTGGGCGAATTGTAAGTTAGCTTCGACTCTATAATTTACTTAGCCAAGTAGTTCATATCTAGTTTACTTGCACCACATATTCACTTACTGGTTTTTAAATGCCTACTGGTAGTGTAATGAGGAGCTAGGTGCCAAGTTAAAGTAGGATTTCTTTTTTAAATGCGTAAAATTACATTTAGGAAATTGTGTCCGTTTACTTCTCAAATTGCCATACTTAAGTCATGCTTGCTCTTCTTCTTTGTATTTTATTCCAACAATGAAATGCACTCACTACCAGAAGACAAATAATACACAAAGACATTCAAGTTGCGCTGAATATGAGTCTCTCGTCTCTGAATTGTTGGCAGTCTTACATGTTTATGCCAACTGCACAAGGCTTTTGCCTAGAGGTGTCAATTTTGTTCTCTTGTTCCAATGGAGTTTGATTATAATTTCTGGTCTATCTTTTCTATGGGCTGCTTTAGGATGTGATATTACTCCTATGCTTACCATTAAGGAGTTTTCTATTCTCACTTGTGCCTTTCTACTGTACCAGATGACTCCAAAGGAGTCATTTCGCCAACTTTCTCACAAGTTCCATGGCAAGGGACCTGGGAAAATGAAACAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCTGATACACCTTCGTTATCAGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAAACACCTTACCTTGTTCTCAGCGGTCACGTTAAACCTGGGTATGCTCCGTTTTCATTGGAACAAGTATTTTGCCTCTTGTTTTTGCATTTAGCCATTCAGTTTCAAACTGTTTTCTGGTTGTCTTTAATGTTTGTAGCCAAACAAGTGATCCAAGAAGTGGTTTTGCTACCGTCGAAAAGGATCTCCCCGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTGAGTCTTCTTTCCTTATTCTTCCAAGTATTTATTACTCAACGAGATTATAATATCGAGGAAGTTCTTCAAGAATTGGTGGCAGCTATTGATAAGTTAGGAACATATGCTATACGGGAGATTTCTAGTTATATGCGTACCCGAATTTCATTTGTGTGCACTCTAAGCATATACTCAATTCATATTTTCATTTTTGTCTCAATCTAAGTTCAATTTTTAAATCAACTGCCTTATTACCAAACTTCGACTCTCAGCATATAGAATTCGTCATTTCGAGACGCGATCGCTGTTAGTTTATATAAGTTGATCATCTTATTTTCAGGTCGAGCATTTCTTGGGGATAAAGCGAAAAGGTGATCCTTCGAATACAGGCACAAAAAAGCCAAAAATTTGAGATATGTAACTTTTTACCAGAATTAGTACCAAGAAATAGAAACCCATGATTAGTTTAGTTTAGTTTATTTTTCTTCACCAACAAGATGAACTTCGCATATTCAATTTCACATCCTCATCAAATAACAGTGTGCTTGAATTGTTTGTTGTACGATAATCGATAAATGATGAGGGATTACTCTTATAAATTATGGAAATTCATATTACAAAAACACATTATAATATTGGATACCTTAGGTCGTCGCCTTGTTAGA
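
The gene sequence above contains runs of `N`, which usually mark unresolved bases or assembly gaps. A minimal sketch for locating such runs follows; the function name `find_n_runs` and the `min_len` parameter are hypothetical, and the sequence is assumed to have been joined into one string with line breaks removed.

```python
import re


def find_n_runs(seq, min_len=1):
    """Return 0-based, half-open (start, end) spans of consecutive N bases."""
    pattern = re.compile(r"N{%d,}" % min_len, re.IGNORECASE)
    return [m.span() for m in pattern.finditer(seq)]


# Toy input, not taken from the record.
print(find_n_runs("ACGTNNNNACN"))
print(find_n_runs("ACGTNNNNACN", min_len=2))
```

Raising `min_len` filters out isolated ambiguous bases so that only longer gap-like stretches are reported.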

mRNA sequence

TACCGGGTCGGTTATTGAATCGGATCAAAGGAAGTATTTGTTTCTTCTCCGCCCTTCTGTGCTGCTTCGAGTTCCTCCGGTCAGAATTGCGCACGCCACAATCAGCCACTGCTGCATCGTCGCTGCAGCCTCTTCTTTTTGCTCTCGACTTCCGATTTCTTCATCATCATCGATTTGTGAGCTTATTTCTCCGAACTTTGTATCTAGCTTCTTCATCAACATTGCCTTCCAGCTTCCAGTTTTGCTGTGGTGTTAACTGAGATTTAGAGGCAAGCTATTAAATAATTGCAAATGGACGCGGATGGGTCATCTGTACCTGAACATGATGAGAGAAATGGTCATGAGGCAAGAGATCGTGGGGAAGGACAGGATGACTTTGGTTATAGTGGAGCAGAAAAGTCAAGCAAGCATCGGAGTGAGGATCATCGGAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGATCGATCTAAGAGACGTAGTGATGATGCATCGAAGGAAAAGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATATTCGTGAAAGGAGGAAGGAAGACAGGGATGAGCATGATAAAGAAAGGACTAGGGAGAAGAAAGTTAAAGACAAAGATTATGACAGAGAGGTTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAGCGGGGAAGGGAGAGAGAATTGGAGAAGGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGAGATAGGGAAAGGGAAAGGGAAAGAGATAGGGATAGGAAGAAGAAGGAGAAGGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACGCAGAGATCAAGAGGAGAAAGAAAGCTATCGGAACATTGATAAGGAAAGAGGAAAAGAGAAAAATTTGGTGGATGATAAGAAAGGAGATCAAAACAAGGAGAAATTACGAGATAAAGAAGGAATTGGCGGCAAAAATGATGAAGAAAGAATTGATTGGATTGCACATGGGGCTAAGGATTATATGCTAGAAAGTGATGGCGAGGATAACAGGGACAGAGGTGTTGATCAAGGGAATGCAGTCCAGCATTTGGGAGGTGAAGAAAATTCTGATGGGTTGAAAGTTGGAGCTCAGTCTTCTTCAGCTATGCTTGAGGAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCAGAGGTTTTAGACTGGGTTAAAAGGAGTCGTAAACTTGAGGAGAAGAAACTTACTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAGGACAATATTGATCAAGGTGCAAGCGATGATGATATTGCAGCTGAAGATATAACTAATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGGACTATATTTTGTGGTTACATAGATATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAAACACTTTGAAGATCTTAATGCATCAGTGAAAGTCTCGCATGATTATTACACTCAAGATGAAATGCTTCGATTTAAGAAGCC
CAAGAAAAAGAAATCTCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCAGAAGCAATCTCCTCTGGATTGGGTGTTGGAGATCTTGGTCCTCGAAATGATTCTAGCAGGCAAGCACGAAAAACAGAACAAGAGAGATCTGAAGCAGAAATGCGACAAAATGCATACCAGTCAGCCTATGCTAAAGCAGACGAGGCATCAAGATCTCTACAATTAGTTCAAAGCTCAGTCAGATTAGATGACAATGAAGATACTTTCATTGAAGATGATGATGAAGACCTCTATAAGTCGCTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGCAGCATCGGGACCCGAAGCAGTTGCTCTTCTTGCTACAACAACAATCAGCGGGCAGACAACTGATGATCAAAACACAAAAGCAGGAGAGTTGCAGGAAAATAAGGTTGTTTTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAATCTCATAAACCTGAAGAAGAAGATGTCTTTATGGATGACGATGAAGCACCAAAAGAAGAATATCATGAAGATGAGAAGGATAAAGATGGTGGGTGGACTGAGGTCAAAGATACTGCCAAAGAAGAACCCACTCCTGAGGATAATGAGACAATAGCTCCCGATGAAACAATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGTACTGAAACTGCTTAAGGATCGTGGGACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAATAGATGAAGATGAACCAAAGGAAGCTAAGTCAAAGGATTCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATTCACATCGAGAGGACTGATGAATTTGGGCGAATTTGTAATGAGGAGCTAGTCTTACATGTTTATGCCAACTGCACAAGGCTTTTGCCTAGAGGTGTCAATTTTGTTCTCTTGTTCCAATGGAGTTTGATTATAATTTCTGGTCTATCTTTTCTATGGGCTGCTTTAGGATGTGATATTACTCCTATGCTTACCATTAAGGAGTTTTCTATTCTCACTTGTGCCTTTCTACTGTACCAGATGACTCCAAAGGAGTCATTTCGCCAACTTTCTCACAAGTTCCATGGCAAGGGACCTGGGAAAATGAAACAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCTGATACACCTTCGTTATCAGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAAACACCTTACCTTGTTCTCAGCGGTCACGTTAAACCTGGCCAAACAAGTGATCCAAGAAGTGGTTTTGCTACCGTCGAAAAGGATCTCCCCGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTCGAGCATTTCTTGGGGATAAAGCGAAAAGGTGATCCTTCGAATACAGGCACAAAAAAGCCAAAAATTTGAGATATGTAACTTTTTACCAGAATTAGTACCAAGAAATAGAAACCCATGATTAGTTTAGTTTAGTTTATTTTTCTTCACCAACAAGATGAACTTCGCATATTCAATTTCACATCCTCATCAAATAACAGTGTGCTTGAATTGTTTGTTGTACGATAATCGATAAATGATGAGGGATTACTCTTATAAATTATGGAAATTCATATTACAAAAACACATTATAATATTGGATACCTTAGGTCGTCGCCTTGTTAGA
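
Because a spliced mRNA is normally laid out as 5' UTR, then CDS, then 3' UTR, the UTR lengths can be estimated by locating the CDS inside the mRNA string. The sketch below assumes both sequences have been joined into single strings and that the CDS occurs as one contiguous substring; `utr_lengths` is a hypothetical helper name.

```python
def utr_lengths(mrna, cds):
    """Return (five_prime_utr_len, three_prime_utr_len) for a CDS found in an mRNA."""
    mrna = mrna.upper().replace("\n", "")
    cds = cds.upper().replace("\n", "")
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    # Everything before the CDS is 5' UTR; everything after it is 3' UTR.
    return start, len(mrna) - start - len(cds)


# Toy input, not taken from the record.
print(utr_lengths("GGATGAAATGACC", "ATGAAATGA"))
```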

Coding sequence (CDS)

ATGGACGCGGATGGGTCATCTGTACCTGAACATGATGAGAGAAATGGTCATGAGGCAAGAGATCGTGGGGAAGGACAGGATGACTTTGGTTATAGTGGAGCAGAAAAGTCAAGCAAGCATCGGAGTGAGGATCATCGGAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGATCGATCTAAGAGACGTAGTGATGATGCATCGAAGGAAAAGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATATTCGTGAAAGGAGGAAGGAAGACAGGGATGAGCATGATAAAGAAAGGACTAGGGAGAAGAAAGTTAAAGACAAAGATTATGACAGAGAGGTTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAGCGGGGAAGGGAGAGAGAATTGGAGAAGGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGAGATAGGGAAAGGGAAAGGGAAAGAGATAGGGATAGGAAGAAGAAGGAGAAGGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACGCAGAGATCAAGAGGAGAAAGAAAGCTATCGGAACATTGATAAGGAAAGAGGAAAAGAGAAAAATTTGGTGGATGATAAGAAAGGAGATCAAAACAAGGAGAAATTACGAGATAAAGAAGGAATTGGCGGCAAAAATGATGAAGAAAGAATTGATTGGATTGCACATGGGGCTAAGGATTATATGCTAGAAAGTGATGGCGAGGATAACAGGGACAGAGGTGTTGATCAAGGGAATGCAGTCCAGCATTTGGGAGGTGAAGAAAATTCTGATGGGTTGAAAGTTGGAGCTCAGTCTTCTTCAGCTATGCTTGAGGAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCAGAGGTTTTAGACTGGGTTAAAAGGAGTCGTAAACTTGAGGAGAAGAAACTTACTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAGGACAATATTGATCAAGGTGCAAGCGATGATGATATTGCAGCTGAAGATATAACTAATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGGACTATATTTTGTGGTTACATAGATATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGATAAGTTTAATGATGAAAATGCTGGTGAGAAGAAGATGCTGCCACAGTATGATGACCCAGCAGCTGCAGACGAGGGCCTAACTCTAGATGGAACTGGACGCTTTAGTAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAAACACTTTGAAGATCTTAATGCATCAGTGAAAGTCTCGCATGATTATTACACTCAAGATGAAATGCTTCGATTTAAGAAGCCCAAGAAAAAGAAATCTCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCAGAAGCAATCTCCTCTGGATTGGGTGTTGGAGATCTTGGTCCTCGAAATGATTCTAGCAGGCAAGCACGAAAAACAGAACAAGAGAGATCTGAAGCAGAAATGCGACAAAATGCATACCAGTCAGCCTATGCTAAAGCAGACGAGGCATCAAGATCTCTACAATTAGTTCAAAGCTCAGTCAGATTAGATGACAATGAAGATACTTTCATTGAAGATGATGATGAAGACCTCTA
TAAGTCGCTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGCAGCATCGGGACCCGAAGCAGTTGCTCTTCTTGCTACAACAACAATCAGCGGGCAGACAACTGATGATCAAAACACAAAAGCAGGAGAGTTGCAGGAAAATAAGGTTGTTTTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAATCTCATAAACCTGAAGAAGAAGATGTCTTTATGGATGACGATGAAGCACCAAAAGAAGAATATCATGAAGATGAGAAGGATAAAGATGGTGGGTGGACTGAGGTCAAAGATACTGCCAAAGAAGAACCCACTCCTGAGGATAATGAGACAATAGCTCCCGATGAAACAATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGTACTGAAACTGCTTAAGGATCGTGGGACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAATAGATGAAGATGAACCAAAGGAAGCTAAGTCAAAGGATTCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATTCACATCGAGAGGACTGATGAATTTGGGCGAATTTGTAATGAGGAGCTAGTCTTACATGTTTATGCCAACTGCACAAGGCTTTTGCCTAGAGGTGTCAATTTTGTTCTCTTGTTCCAATGGAGTTTGATTATAATTTCTGGTCTATCTTTTCTATGGGCTGCTTTAGGATGTGATATTACTCCTATGCTTACCATTAAGGAGTTTTCTATTCTCACTTGTGCCTTTCTACTGTACCAGATGACTCCAAAGGAGTCATTTCGCCAACTTTCTCACAAGTTCCATGGCAAGGGACCTGGGAAAATGAAACAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCTGATACACCTTCGTTATCAGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAAACACCTTACCTTGTTCTCAGCGGTCACGTTAAACCTGGCCAAACAAGTGATCCAAGAAGTGGTTTTGCTACCGTCGAAAAGGATCTCCCCGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTCGAGCATTTCTTGGGGATAAAGCGAAAAGGTGATCCTTCGAATACAGGCACAAAAAAGCCAAAAATTTGA
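
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. This is a minimal sketch that assumes the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative, and unknown codons are written as `X` by assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the CDS in this record.
print(translate("ATGGACGCGGATGGGTCATCT"))
print(translate("AAAAAGCCAAAAATTTGA"))
```

Translating the full CDS after removing its line breaks and comparing the result with the listed protein sequence is a quick consistency check on the record.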

Protein sequence

MDADGSSVPEHDERNGHEARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVYKEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKEKDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKNDEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGASDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQSSVRLDDNEDTFIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVKDTAKEEPTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRICNEELVLHVYANCTRLLPRGVNFVLLFQWSLIIISGLSFLWAALGCDITPMLTIKEFSILTCAFLLYQMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGDPSNTGTKKPKI
BLAST of Cp4.1LG01g02180 vs. Swiss-Prot
Match: DOT2_ARATH (SART-1 family protein DOT2 OS=Arabidopsis thaliana GN=DOT2 PE=1 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 5.9e-117
Identity = 328/736 (44.57%), Positives = 446/736 (60.60%), Query Frame = 1

Query: 167 ERERERDRDRKKKEK-DKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKK 226
           E E+ + R   ++E+ D + S   E   GR K +D   K+  ++ D+E+ ++K+   DK+
Sbjct: 2   EVEKSKSRHEIREERADYEGSPVREHRDGRRKEKDHRSKDKEKDYDREKIRDKDHRRDKE 61

Query: 227 GDQNKEKLRD----KEGIGGKNDEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHL 286
            ++++++ RD    KE   G++ E   D     ++D + E D E  R+R  D+ N     
Sbjct: 62  KERDRKRSRDEDTEKEISRGRDKEREKD----KSRDRVKEKDKEKERNRHKDREN---ER 121

Query: 287 GGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKE 346
             E+  D             ++R R +KE   KK  E+ +           E  K  E+ 
Sbjct: 122 DNEKEKD-------------KDRAR-VKERASKKSHEDDD-----------ETHKAAERY 181

Query: 347 KALQLSKIFEEQDNIDQGASDDDIAAEDITNMDVLENVEIGEQKQRD----MAYKAAKKK 406
           +      + E  DN+D  +S  + +A D+ N  +L+  E  ++K  D    +++ A  +K
Sbjct: 182 EHSDNRGLNEGGDNVDAASSGKEASALDLQNR-ILKMREERKKKAEDASDALSWVARSRK 241

Query: 407 TGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTG-------RFSNDAEKKLEEDYI 466
               ++K N E    +++   +++    ++G   DG         +  +  EK +E   +
Sbjct: 242 ---IEEKRNAEKQRAQQLSRIFEEQDNLNQGENEDGEDGEHLSGVKVLHGLEKVVEGGAV 301

Query: 467 LW------------LHRYMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKK 526
           +             ++  +D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK
Sbjct: 302 ILTLKDQSVLTDGDVNNEIDMLENVEIGEQKRRNEAYEAAKKKKGIYDDKFNDDPGAEKK 361

Query: 527 MLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDY 586
           MLPQYD+ AA DEG+ LD  GRF+ +AEKKLEELRKR+QG  +   FEDLN+S KVS DY
Sbjct: 362 MLPQYDE-AATDEGIFLDAKGRFTGEAEKKLEELRKRIQG-QTTHTFEDLNSSAKVSSDY 421

Query: 587 YTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPRNDSSRQARKTEQER 646
           ++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA++SGLG  DLG R D  RQA K E+ER
Sbjct: 422 FSQEEMLKFKKPKKKKQLRKKDKLDLSMLEAEAVASGLGAEDLGSRKDGRRQAMKEEKER 481

Query: 647 SEAEMRQNAYQSAYAKADEASRSLQLVQ-SSVRLDDNEDTFIEDDDEDLYKSLERARKLA 706
            E E R NAYQ A AKADEASR L+  Q    + D++E   + DD EDLYKSLE+AR+LA
Sbjct: 482 IEYEKRSNAYQEAIAKADEASRLLRREQVQPFKRDEDESMVLADDAEDLYKSLEKARRLA 541

Query: 707 L-KKQEAASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEES 766
           L KK+EA SGP+AVA L  ++ + QTTDD  T   E QEN VVFTEM +FVWGLQ + + 
Sbjct: 542 LIKKEEAGSGPQAVAHLVASS-TNQTTDDNTTTGDETQENTVVFTEMGDFVWGLQRENDV 601

Query: 767 HKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVKDTAKE-EPTPEDNETIAPDETIHEV 826
            KPE EDVFM++D APK      E+  D G TEV DT  +      D + I PDE IHEV
Sbjct: 602 RKPESEDVFMEEDVAPKAPVEVKEEHPD-GLTEVNDTDMDAAEDSSDTKEITPDENIHEV 661

Query: 827 PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLV 872
            VGKGLS  LKLLKDRGTLKE +EWGGRNMDK+KSKLVGI+D+D  KE+K K+S+     
Sbjct: 662 AVGKGLSGALKLLKDRGTLKEKVEWGGRNMDKKKSKLVGIVDDDGGKESKDKESK----- 692
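
The percentages on each HSP summary line are the identical and positive-scoring columns divided by the alignment length. The sketch below recomputes them from such a line; `hsp_percentages` is a hypothetical helper, and the regular expression accepts either spelling of the positives label.

```python
import re

# Pattern inferred from the HSP summary lines in this report.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity_pct, positives_pct), each rounded to two decimals."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


print(hsp_percentages("Identity = 328/736 (44.57%), Positives = 446/736 (60.60%)"))
```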

BLAST of Cp4.1LG01g02180 vs. Swiss-Prot
Match: SNUT1_HUMAN (U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 1.1e-14
Identity = 43/73 (58.90%), Positives = 56/73 (76.71%), Query Frame = 1

Query: 940  QMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQ 999
            ++TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q  
Sbjct: 719  KLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPLGTVALLQEKQKA 778

Query: 1000 LKTPYLVLSGHVK 1013
             KTPY+VLSG  K
Sbjct: 779  QKTPYIVLSGSGK 791

BLAST of Cp4.1LG01g02180 vs. Swiss-Prot
Match: SNUT1_RAT (U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus GN=Sart1 PE=1 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 1.1e-14
Identity = 43/73 (58.90%), Positives = 56/73 (76.71%), Query Frame = 1

Query: 940  QMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQ 999
            ++TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q  
Sbjct: 725  KLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPLGTVALLQEKQKA 784

Query: 1000 LKTPYLVLSGHVK 1013
             KTPY+VLSG  K
Sbjct: 785  QKTPYIVLSGSGK 797

BLAST of Cp4.1LG01g02180 vs. Swiss-Prot
Match: SNUT1_MOUSE (U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus GN=Sart1 PE=1 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 1.1e-14
Identity = 43/73 (58.90%), Positives = 56/73 (76.71%), Query Frame = 1

Query: 940  QMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQ 999
            ++TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q  
Sbjct: 725  KLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPLGTVALLQEKQKA 784

Query: 1000 LKTPYLVLSGHVK 1013
             KTPY+VLSG  K
Sbjct: 785  QKTPYIVLSGSGK 797

BLAST of Cp4.1LG01g02180 vs. TrEMBL
Match: A0A0A0KXY6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650610 PE=4 SV=1)

HSP 1 Score: 1016.1 bits (2626), Expect = 3.0e-293
Identity = 624/881 (70.83%), Positives = 696/881 (79.00%), Query Frame = 1

Query: 1   MDADGSSVPEHDERNGHEARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60
           MD + SS P  DER+G          DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD
Sbjct: 1   MDWERSSAP--DERSG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60

Query: 61  RDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVY 120
           R+RSKR SDDASKEKEKE KDSERDR+  RE+RKEDRDEH+KER+R K VKDKDYDR++Y
Sbjct: 61  RERSKRSSDDASKEKEKEAKDSERDRIRSREKRKEDRDEHEKERSRGK-VKDKDYDRDIY 120

Query: 121 KEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKE 180
           K+KEYERERDRKDRGKD+ER RERELEKD VRG DKERGKEKDRDR+++R  DRDRKKK+
Sbjct: 121 KDKEYERERDRKDRGKDRERERERELEKDTVRGHDKERGKEKDRDRDKDR--DRDRKKKD 180

Query: 181 KDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIG 240
           KDKDRSNE EREKGR+K RDQE+KESYRNIDK+RGKE+ L DD+K DQNK+KL+DKEGIG
Sbjct: 181 KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKDRGKERILEDDRKTDQNKQKLQDKEGIG 240

Query: 241 GKNDEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAM 300
            KNDEERI  I    KDYMLESDGE+NRDR V+QGN VQHLG EEN DGLKVG+ +SS M
Sbjct: 241 SKNDEERIGRIGDEGKDYMLESDGENNRDRDVNQGNMVQHLGVEENFDGLKVGSHASSTM 300

Query: 301 LEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGA 360
           LEER                     ++  ++   KK TE+ + L   K   + +  ++  
Sbjct: 301 LEER---------------------IRNMKEDRLKKQTEESEVLSWVKRSRKLE--EKKL 360

Query: 361 SDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYD 420
           S+ + A +     +  +N++ G     D+A           +D  N+ +    K+L   D
Sbjct: 361 SEKEKALQLSKIFEEQDNIDQGVSDD-DIAP----------EDTTNNHDLAGVKVLHGVD 420

Query: 421 DPAAA--------DEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRD 480
                        D+ +  DG                   ++  +DVLENVEIGEQKQRD
Sbjct: 421 KVLEGGAVVLTLKDQSILADGN------------------VNEELDVLENVEIGEQKQRD 480

Query: 481 MAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEEL 540
           +AYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEEL
Sbjct: 481 IAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEEL 540

Query: 541 RKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAI 600
           R+RLQGASSVKHFEDLN S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAI
Sbjct: 541 RRRLQGASSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAI 600

Query: 601 SSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQ-SSVRL 660
           S+GLGVGDLG RNDS RQA+K EQE+SEAEMR NAYQSAYAKADEASRSLQLVQ SS RL
Sbjct: 601 SAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSLQLVQNSSARL 660

Query: 661 DDNEDTFIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLATTTISGQTTDDQNTKAG 720
           +DN+D  I DDDED YKSLERARKLALKKQ+AASGP AVALLAT T S Q TDDQ+TKAG
Sbjct: 661 EDNDDALIADDDEDFYKSLERARKLALKKQDAASGPGAVALLATATTSSQATDDQSTKAG 720

Query: 721 ELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVK 780
           ELQENKVVFTEMEEFVWGLQLDE++HKPEE+DVFMDDDE PKEEYHED KDKDGGWTEVK
Sbjct: 721 ELQENKVVFTEMEEFVWGLQLDEDAHKPEEDDVFMDDDEIPKEEYHEDVKDKDGGWTEVK 780

Query: 781 DTAKEEPTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSK 840
           DTA EE TPE+NE +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSK
Sbjct: 781 DTAMEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSK 814

Query: 841 LVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRI 873
           LVGI+DEDEPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRI
Sbjct: 841 LVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRI 814

BLAST of Cp4.1LG01g02180 vs. TrEMBL
Match: D7UD56_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0078g00440 PE=4 SV=1)

HSP 1 Score: 699.1 bits (1803), Expect = 8.2e-198
Identity = 490/895 (54.75%), Positives = 615/895 (68.72%), Query Frame = 1

Query: 1   MDADGSSV-PEHDE----RNGHEARDRGEGQ-DDFGYSGAEKSSKHRSEDHRKSSRGEEK 60
           MD D S   PE  +    R+    RD  +G  DD   +G EKSSKHRS+D RK SR EEK
Sbjct: 1   MDMDWSEPKPERSDELRDRDDSPTRDYHDGAYDDLEENGIEKSSKHRSKD-RKKSRREEK 60

Query: 61  DHRSKDRDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKD 120
           DHR KDR+RSK  + D  KE+EKE KDSE+DRV  RERRKEDRDE +K+R R+K V++KD
Sbjct: 61  DHRGKDRERSK--AGDGLKEREKETKDSEKDRVTSRERRKEDRDEREKDRNRDK-VREKD 120

Query: 121 YDREVYKEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDR 180
           YDRE Y++KE ER++DRKDRGK+KER RERE++K++ RG+DKERGKEK+RDR++ERE++R
Sbjct: 121 YDREKYRDKERERDKDRKDRGKEKEREREREVDKESDRGRDKERGKEKNRDRDKEREKER 180

Query: 181 DRKKKEKDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLR 240
           DR K   D+DR  E E+ K REK R+ ++      IDKE+GKE+    +++ DQ++++ +
Sbjct: 181 DRTK---DRDREKEKEKSKDREKERENDKDRDRDAIDKEKGKERIRDKEREADQDRDRYK 240

Query: 241 DKEGIGGKNDEERIDWIAHGAKDYMLESDGEDNRDRGV-----------DQGNAVQHLGG 300
           D++    KN +E  D    G KD  L+ DG DNRDR V           D   A++H   
Sbjct: 241 DRDKGSRKNRDEGHDRSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDDSRAIEH--- 300

Query: 301 EENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTE-ESEVLDWVKRSRKLEEKKLTEKEK 360
           E+N++G   G QSS+A L+ERI  MKE+R+K+++E  SEVL WV RSRK+EE++  EKEK
Sbjct: 301 EKNAEGAS-GPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRNAEKEK 360

Query: 361 ALQLSKIFEEQDNIDQGASDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYD 420
           ALQLSKIFEEQ                       +N++ GE         +++   G+  
Sbjct: 361 ALQLSKIFEEQ-----------------------DNIDQGESDDEKPTRHSSQDLAGV-- 420

Query: 421 DKFNDENAGEKKMLPQYDDPAAADEG-LTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLE 480
                      K+L   D         LTL      +N     + ED        +D+LE
Sbjct: 421 -----------KVLHGLDKVIEGGAVVLTLKDQDILANG---DINED--------VDMLE 480

Query: 481 NVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRF 540
           NVEIGEQK+RD AYKAAKKKTGIY+DKFNDE   EKK+LPQYDDP   DEGL LD +GRF
Sbjct: 481 NVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVT-DEGLALDASGRF 540

Query: 541 SNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEK 600
           + +AEKKLEELR+RLQG S+   FEDLN   K S DYYT +EML+FKKPKKKKSLRKKEK
Sbjct: 541 TGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKKEK 600

Query: 601 LDIDALEAEAISSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRS 660
           L+IDALEAEA+S+GLGVGDLG RND  RQ+ + EQERSEAEMR +AYQ AYAKADEAS++
Sbjct: 601 LNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEASKA 660

Query: 661 LQLVQS-SVRLDDNEDTFIEDDDEDLYKSLERARKLALKKQE--AASGPEAVALLATTTI 720
           L+L Q+  V+L++NE+    +DDE+L KSL+RARKL L+KQ+  A SGP+A+ALLA+TT 
Sbjct: 661 LRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGPQAIALLASTTT 720

Query: 721 SGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHE 780
           S Q  D+QN  +GE QEN+VVFTEMEEFVWGLQL++E+HKP+ EDVFMD+DEAPK    +
Sbjct: 721 SSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDGEDVFMDEDEAPKAS-DQ 780

Query: 781 DEKDKDGGWTEVKDTAKEE-PTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKES 840
           + KD+ GGWTEVKDT K+E P  E+ E + PD+TIHEV VGKGLS  L+LLK+RGTLKE 
Sbjct: 781 ERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGLSGALQLLKERGTLKEG 818

Query: 841 IEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRI 873
           IEWGGRNMDK+KSKLVGI D                     KEI IERTDEFGRI
Sbjct: 841 IEWGGRNMDKKKSKLVGIYDNTG-----------------TKEIRIERTDEFGRI 818

BLAST of Cp4.1LG01g02180 vs. TrEMBL
Match: A0A061F934_THECC (U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 OS=Theobroma cacao GN=TCM_032152 PE=4 SV=1)

HSP 1 Score: 647.9 bits (1670), Expect = 2.2e-182
Identity = 468/885 (52.88%), Positives = 588/885 (66.44%), Query Frame = 1

Query: 13  ERNGHEARDRGEGQDDFGYSGA-EKSSKHRSEDHRKSSRGEEKDHRSKDR--DRSKRRSD 72
           +R    +R+R +G     YS   E++ KHRS+D +KSS  EEKDHRS+DR  DRSKR +D
Sbjct: 7   DREDDVSRERWDGG---AYSDELEQNDKHRSKDKKKSSLEEEKDHRSRDRERDRSKRSND 66

Query: 73  DASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKV--KDKDYDREVYKEKEYER 132
           +  KE+EK+ KD E+DRV  RERRK+DRDEH K+R+R+ KV  K+KDYDR+ Y+EKE+ER
Sbjct: 67  EILKEREKDFKDLEKDRVSSRERRKDDRDEHGKDRSRDSKVREKEKDYDRDKYREKEHER 126

Query: 133 ER--DRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKEKDKDR 192
           ER  DRKDRGK+K+R R R+ EK        ERGK+K RDR+RE+E++RD+ K+ + KDR
Sbjct: 127 EREKDRKDRGKEKDRERGRDSEK--------ERGKDKGRDRDREKEKERDKAKEREKKDR 186

Query: 193 SNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKNDE 252
             E E EK R++             D+E+GKE++    ++ D  KE+ RD++    KN E
Sbjct: 187 EKEREGEKDRDR-------------DREKGKERSKQKSREADLEKERSRDRDNAIKKNHE 246

Query: 253 ERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEERI 312
           E  +    G+KD  L  D  D+RD+   + NA  + G           AQ+SS+ LEERI
Sbjct: 247 EDYE----GSKDGELALDYGDSRDKDEAELNAGSNAGV----------AQASSSELEERI 306

Query: 313 RTMKEDRLKKQTEE-SEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGASDDD 372
             MKE+RLKK++E  SEVL+WV   RKLEEK+  EKEKALQ SKIFEEQD+  QG ++  
Sbjct: 307 ARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENE-- 366

Query: 373 IAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAA 432
                             E+  R  A+  A  K     DK  D  A    +         
Sbjct: 367 -----------------DEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTL--------- 426

Query: 433 ADEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAYKAAKKKTGI 492
            D+ +  +G           + ED        +D+LENVEIGEQ++RD AYKAAKKKTG+
Sbjct: 427 KDQSILANGD----------INED--------VDMLENVEIGEQRRRDEAYKAAKKKTGV 486

Query: 493 YDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKH 552
           YDDKFNDE   EKK+LPQYD+P A DEG+TLD  GRF+ +AEKKL+ELRKRLQG  +   
Sbjct: 487 YDDKFNDEPGSEKKILPQYDNPVA-DEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNR 546

Query: 553 FEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPR 612
            EDLN + K++ DYYTQ+EML+FKKPKKKK+LRKKEKLDIDALEAEAISSGLG GDLG R
Sbjct: 547 VEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSR 606

Query: 613 NDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQS-SVRLDDNEDTFIEDDD 672
           ND+ RQA + E+ RSEAE R +AYQSAYAKADEAS+SL L Q+  V+ +++E+    DDD
Sbjct: 607 NDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDD 666

Query: 673 EDLYKSLERARKLALKKQE-AASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTE 732
           +DLYKS+ER+RKLA KKQE   SGP+A+AL ATT    QT DDQ T  GE QENK+V TE
Sbjct: 667 DDLYKSIERSRKLAFKKQEDEKSGPQAIALRATTAAISQTADDQTTTTGEAQENKLVITE 726

Query: 733 MEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKD--GGWTEVKDTAKEE-PT 792
           MEEFVWGLQ DEE+HKP+ EDVFMD+DE P    H+ +  ++  GGWTEV D + +E P+
Sbjct: 727 MEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPS 786

Query: 793 PEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDED 852
            ED + I PDETIHEV VGKGLS  LKLLKDRGTLKESIEWGGRNMDK+KSKLVGI+D+D
Sbjct: 787 NEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDD 793

Query: 853 EPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRICNEELVLHVYAN 885
                           D  K+I IERTDEFGRI   +    V ++
Sbjct: 847 REN-------------DRFKDIRIERTDEFGRIITPKEAFRVLSH 793

BLAST of Cp4.1LG01g02180 vs. TrEMBL
Match: A0A061FA02_THECC (U4/U6.U5 tri-snRNP-associated protein 1 isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_032152 PE=4 SV=1)

HSP 1 Score: 647.9 bits (1670), Expect = 2.2e-182
Identity = 468/885 (52.88%), Positives = 588/885 (66.44%), Query Frame = 1

Query: 13  ERNGHEARDRGEGQDDFGYSGA-EKSSKHRSEDHRKSSRGEEKDHRSKDR--DRSKRRSD 72
           +R    +R+R +G     YS   E++ KHRS+D +KSS  EEKDHRS+DR  DRSKR +D
Sbjct: 7   DREDDVSRERWDGG---AYSDELEQNDKHRSKDKKKSSLEEEKDHRSRDRERDRSKRSND 66

Query: 73  DASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKV--KDKDYDREVYKEKEYER 132
           +  KE+EK+ KD E+DRV  RERRK+DRDEH K+R+R+ KV  K+KDYDR+ Y+EKE+ER
Sbjct: 67  EILKEREKDFKDLEKDRVSSRERRKDDRDEHGKDRSRDSKVREKEKDYDRDKYREKEHER 126

Query: 133 ER--DRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKEKDKDR 192
           ER  DRKDRGK+K+R R R+ EK        ERGK+K RDR+RE+E++RD+ K+ + KDR
Sbjct: 127 EREKDRKDRGKEKDRERGRDSEK--------ERGKDKGRDRDREKEKERDKAKEREKKDR 186

Query: 193 SNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKNDE 252
             E E EK R++             D+E+GKE++    ++ D  KE+ RD++    KN E
Sbjct: 187 EKEREGEKDRDR-------------DREKGKERSKQKSREADLEKERSRDRDNAIKKNHE 246

Query: 253 ERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEERI 312
           E  +    G+KD  L  D  D+RD+   + NA  + G           AQ+SS+ LEERI
Sbjct: 247 EDYE----GSKDGELALDYGDSRDKDEAELNAGSNAGV----------AQASSSELEERI 306

Query: 313 RTMKEDRLKKQTEE-SEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGASDDD 372
             MKE+RLKK++E  SEVL+WV   RKLEEK+  EKEKALQ SKIFEEQD+  QG ++  
Sbjct: 307 ARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENE-- 366

Query: 373 IAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAA 432
                             E+  R  A+  A  K     DK  D  A    +         
Sbjct: 367 -----------------DEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTL--------- 426

Query: 433 ADEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAYKAAKKKTGI 492
            D+ +  +G           + ED        +D+LENVEIGEQ++RD AYKAAKKKTG+
Sbjct: 427 KDQSILANGD----------INED--------VDMLENVEIGEQRRRDEAYKAAKKKTGV 486

Query: 493 YDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKH 552
           YDDKFNDE   EKK+LPQYD+P A DEG+TLD  GRF+ +AEKKL+ELRKRLQG  +   
Sbjct: 487 YDDKFNDEPGSEKKILPQYDNPVA-DEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNR 546

Query: 553 FEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPR 612
            EDLN + K++ DYYTQ+EML+FKKPKKKK+LRKKEKLDIDALEAEAISSGLG GDLG R
Sbjct: 547 VEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSR 606

Query: 613 NDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQS-SVRLDDNEDTFIEDDD 672
           ND+ RQA + E+ RSEAE R +AYQSAYAKADEAS+SL L Q+  V+ +++E+    DDD
Sbjct: 607 NDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDD 666

Query: 673 EDLYKSLERARKLALKKQE-AASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTE 732
           +DLYKS+ER+RKLA KKQE   SGP+A+AL ATT    QT DDQ T  GE QENK+V TE
Sbjct: 667 DDLYKSIERSRKLAFKKQEDEKSGPQAIALRATTAAISQTADDQTTTTGEAQENKLVITE 726

Query: 733 MEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKD--GGWTEVKDTAKEE-PT 792
           MEEFVWGLQ DEE+HKP+ EDVFMD+DE P    H+ +  ++  GGWTEV D + +E P+
Sbjct: 727 MEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPS 786

Query: 793 PEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDED 852
            ED + I PDETIHEV VGKGLS  LKLLKDRGTLKESIEWGGRNMDK+KSKLVGI+D+D
Sbjct: 787 NEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDD 793

Query: 853 EPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRICNEELVLHVYAN 885
                           D  K+I IERTDEFGRI   +    V ++
Sbjct: 847 REN-------------DRFKDIRIERTDEFGRIITPKEAFRVLSH 793

BLAST of Cp4.1LG01g02180 vs. TrEMBL
Match: M5Y4E7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000914mg PE=4 SV=1)

HSP 1 Score: 612.8 bits (1579), Expect = 7.7e-172
Identity = 451/909 (49.61%), Positives = 589/909 (64.80%), Query Frame = 1

Query: 10  EHDERNGHEARDRGEG-QDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRSKRRS 69
           E+D  +      R EG  DD   +G +KSS+HRS+D +KSSRGEEKD RSKDR+RS+R S
Sbjct: 4   EYDRDDSPMREHREEGIYDDLDENGTDKSSRHRSKDRKKSSRGEEKDTRSKDRERSRRSS 63

Query: 70  DDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVYKEKEYERE 129
           DD  KE+EKE KDSE+DRV  +ERRK+DRD+  K++ R+ K ++KDYDRE ++E E+ER 
Sbjct: 64  DDFVKEREKESKDSEKDRVSSKERRKDDRDDRYKDKNRDNKAREKDYDRESHRETEHERG 123

Query: 130 RDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKEKDKDRSNE 189
           +DRKDRGK+KER +ERE+EKD+ RG+DKERGKEK +DR++++ER       EK++DR+ E
Sbjct: 124 KDRKDRGKEKEREKEREVEKDSDRGRDKERGKEKIKDRDKDKER-------EKERDRAKE 183

Query: 190 NEREKGREKRRDQEE-KESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKNDEER 249
            EREK REK +D+E+ +E+Y                   D ++E+++DK           
Sbjct: 184 KEREKEREKHKDREKGRENY------------------KDTDRERVKDK----------- 243

Query: 250 IDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEERI-- 309
                +  K+  ++ D + +RDR       V     +EN +  K G +   A L E    
Sbjct: 244 -----YREKEREVDHDKDKSRDR-------VSRRSLDENYEWSKDGGRDDKAKLNEEYTG 303

Query: 310 -RTMKEDRLKKQTEESEVLDWVK-----RSRKLEEKKLTEKEKALQLSKIFEEQDNIDQG 369
            + +K+ ++    E+    + +       + +LEE+ +  KE+ L+  K           
Sbjct: 304 DKDIKQGKVSHNAEDERKAEGLSGGAHLSALELEERIMKTKEERLKKKK----------- 363

Query: 370 ASDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQY 429
                   ED+  +    +     + +R+   + A + + I++++   +N G+ +     
Sbjct: 364 --------EDVPEVLAWVSRSRKLEDKRNAEKQKALQLSKIFEEQ---DNIGQGE---SE 423

Query: 430 DDPAAADEGLTLDGTGRFSNDAEKKLEEDYILW------------LHRYMDVLENVEIGE 489
           D+  A D    L G  +  +  +K +E   ++             ++  +D+LENVEIGE
Sbjct: 424 DEETAQDTTHDLAGV-KVLHGLDKVMEGGAVVLTLKDQNILADGGVNEDIDMLENVEIGE 483

Query: 490 QKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEK 549
           QKQRD AYKAAKKKTGIY DKFND+   EKK+LPQYDDP   DEGLTLD  GRF+ +AEK
Sbjct: 484 QKQRDDAYKAAKKKTGIYVDKFNDDLNTEKKILPQYDDPVP-DEGLTLDERGRFTGEAEK 543

Query: 550 KLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKK--KKSLRKKEKLDID 609
           KLEELRKR+QG  +   FEDLN S  ++ D+YTQ+EML+FKKPKK  KKSLRKKEKLD+D
Sbjct: 544 KLEELRKRIQGVPTNNRFEDLNMSGNITSDFYTQEEMLQFKKPKKGKKKSLRKKEKLDLD 603

Query: 610 ALEAEAISSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLV 669
           ALEAEA+S+GLGV DLG RND+ RQA K EQER EAE R +AYQ AYAKADEAS+SL+L 
Sbjct: 604 ALEAEAVSAGLGVADLGSRNDAKRQANKEEQERLEAERRNSAYQLAYAKADEASKSLRLE 663

Query: 670 QSSVRLDDNEDT-FIEDDDEDLYKSLERARKLALKK--QEAASGPEAVALLATTTISGQT 729
           Q    + + ++T    DDD+DLYKSLERARKLALKK  +E ASGP+A+ALLATTT S QT
Sbjct: 664 QILTVIPEEDETPAFADDDDDLYKSLERARKLALKKKEEETASGPQAIALLATTTASSQT 723

Query: 730 TDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKD 789
            D+Q    GE Q+NKVVFTEMEEFVWGLQLDEESHKPE EDVFM +DE PK   HE+  +
Sbjct: 724 ADNQIPSTGESQDNKVVFTEMEEFVWGLQLDEESHKPESEDVFMQEDEEPKPS-HEERMN 783

Query: 790 KDGGWTEVKDTAKEE-PTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWG 849
           + GGWTEVKD  ++E P  ED E I PDETIHEV VGKGLS VLKLLKDRGTLKE IEWG
Sbjct: 784 EPGGWTEVKDMDEDEKPATEDKEEIVPDETIHEVAVGKGLSGVLKLLKDRGTLKEGIEWG 836

Query: 850 GRNMDKRKSKLVGII-DEDEPKEA--------KSKDSRLS----------SLVDYKKEIH 872
           GRNMDK+KSKL+GI+ D+DEPKE         + KD+R S          S V  +K+IH
Sbjct: 844 GRNMDKKKSKLLGIVDDDDEPKEPHTSRQKKDEHKDTRPSSSSHQKETRPSKVYQEKDIH 836

BLAST of Cp4.1LG01g02180 vs. TAIR10
Match: AT5G16780.1 (AT5G16780.1 SART-1 family)

HSP 1 Score: 423.7 bits (1088), Expect = 3.3e-118
Identity = 328/736 (44.57%), Positives = 446/736 (60.60%), Query Frame = 1

Query: 167 ERERERDRDRKKKEK-DKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKK 226
           E E+ + R   ++E+ D + S   E   GR K +D   K+  ++ D+E+ ++K+   DK+
Sbjct: 2   EVEKSKSRHEIREERADYEGSPVREHRDGRRKEKDHRSKDKEKDYDREKIRDKDHRRDKE 61

Query: 227 GDQNKEKLRD----KEGIGGKNDEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHL 286
            ++++++ RD    KE   G++ E   D     ++D + E D E  R+R  D+ N     
Sbjct: 62  KERDRKRSRDEDTEKEISRGRDKEREKD----KSRDRVKEKDKEKERNRHKDREN---ER 121

Query: 287 GGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKE 346
             E+  D             ++R R +KE   KK  E+ +           E  K  E+ 
Sbjct: 122 DNEKEKD-------------KDRAR-VKERASKKSHEDDD-----------ETHKAAERY 181

Query: 347 KALQLSKIFEEQDNIDQGASDDDIAAEDITNMDVLENVEIGEQKQRD----MAYKAAKKK 406
           +      + E  DN+D  +S  + +A D+ N  +L+  E  ++K  D    +++ A  +K
Sbjct: 182 EHSDNRGLNEGGDNVDAASSGKEASALDLQNR-ILKMREERKKKAEDASDALSWVARSRK 241

Query: 407 TGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTG-------RFSNDAEKKLEEDYI 466
               ++K N E    +++   +++    ++G   DG         +  +  EK +E   +
Sbjct: 242 ---IEEKRNAEKQRAQQLSRIFEEQDNLNQGENEDGEDGEHLSGVKVLHGLEKVVEGGAV 301

Query: 467 LW------------LHRYMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKK 526
           +             ++  +D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK
Sbjct: 302 ILTLKDQSVLTDGDVNNEIDMLENVEIGEQKRRNEAYEAAKKKKGIYDDKFNDDPGAEKK 361

Query: 527 MLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDY 586
           MLPQYD+ AA DEG+ LD  GRF+ +AEKKLEELRKR+QG  +   FEDLN+S KVS DY
Sbjct: 362 MLPQYDE-AATDEGIFLDAKGRFTGEAEKKLEELRKRIQG-QTTHTFEDLNSSAKVSSDY 421

Query: 587 YTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPRNDSSRQARKTEQER 646
           ++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA++SGLG  DLG R D  RQA K E+ER
Sbjct: 422 FSQEEMLKFKKPKKKKQLRKKDKLDLSMLEAEAVASGLGAEDLGSRKDGRRQAMKEEKER 481

Query: 647 SEAEMRQNAYQSAYAKADEASRSLQLVQ-SSVRLDDNEDTFIEDDDEDLYKSLERARKLA 706
            E E R NAYQ A AKADEASR L+  Q    + D++E   + DD EDLYKSLE+AR+LA
Sbjct: 482 IEYEKRSNAYQEAIAKADEASRLLRREQVQPFKRDEDESMVLADDAEDLYKSLEKARRLA 541

Query: 707 L-KKQEAASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEES 766
           L KK+EA SGP+AVA L  ++ + QTTDD  T   E QEN VVFTEM +FVWGLQ + + 
Sbjct: 542 LIKKEEAGSGPQAVAHLVASS-TNQTTDDNTTTGDETQENTVVFTEMGDFVWGLQRENDV 601

Query: 767 HKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVKDTAKE-EPTPEDNETIAPDETIHEV 826
            KPE EDVFM++D APK      E+  D G TEV DT  +      D + I PDE IHEV
Sbjct: 602 RKPESEDVFMEEDVAPKAPVEVKEEHPD-GLTEVNDTDMDAAEDSSDTKEITPDENIHEV 661

Query: 827 PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLV 872
            VGKGLS  LKLLKDRGTLKE +EWGGRNMDK+KSKLVGI+D+D  KE+K K+S+     
Sbjct: 662 AVGKGLSGALKLLKDRGTLKEKVEWGGRNMDKKKSKLVGIVDDDGGKESKDKESK----- 692

BLAST of Cp4.1LG01g02180 vs. TAIR10
Match: AT3G14700.1 (AT3G14700.1 SART-1 family)

HSP 1 Score: 64.7 bits (156), Expect = 3.9e-10
Identity = 37/67 (55.22%), Positives = 50/67 (74.63%), Query Frame = 1

Query: 941  MTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQL 1000
            MT KE++R L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  
Sbjct: 143  MTEKEAYRSLCHGFHGKGPGKKKQEKQRKKHED--KSKQMESSER---SVERIREIHAIS 202

Query: 1001 KTPYLVL 1008
            KTPY+VL
Sbjct: 203  KTPYIVL 204

BLAST of Cp4.1LG01g02180 vs. NCBI nr
Match: gi|778708017|ref|XP_011656108.1| (PREDICTED: SART-1 family protein DOT2 [Cucumis sativus])

HSP 1 Score: 1016.1 bits (2626), Expect = 4.3e-293
Identity = 624/881 (70.83%), Positives = 696/881 (79.00%), Query Frame = 1

Query: 1   MDADGSSVPEHDERNGHEARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60
           MD + SS P  DER+G          DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD
Sbjct: 1   MDWERSSAP--DERSG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60

Query: 61  RDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVY 120
           R+RSKR SDDASKEKEKE KDSERDR+  RE+RKEDRDEH+KER+R K VKDKDYDR++Y
Sbjct: 61  RERSKRSSDDASKEKEKEAKDSERDRIRSREKRKEDRDEHEKERSRGK-VKDKDYDRDIY 120

Query: 121 KEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKE 180
           K+KEYERERDRKDRGKD+ER RERELEKD VRG DKERGKEKDRDR+++R  DRDRKKK+
Sbjct: 121 KDKEYERERDRKDRGKDRERERERELEKDTVRGHDKERGKEKDRDRDKDR--DRDRKKKD 180

Query: 181 KDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIG 240
           KDKDRSNE EREKGR+K RDQE+KESYRNIDK+RGKE+ L DD+K DQNK+KL+DKEGIG
Sbjct: 181 KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKDRGKERILEDDRKTDQNKQKLQDKEGIG 240

Query: 241 GKNDEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAM 300
            KNDEERI  I    KDYMLESDGE+NRDR V+QGN VQHLG EEN DGLKVG+ +SS M
Sbjct: 241 SKNDEERIGRIGDEGKDYMLESDGENNRDRDVNQGNMVQHLGVEENFDGLKVGSHASSTM 300

Query: 301 LEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGA 360
           LEER                     ++  ++   KK TE+ + L   K   + +  ++  
Sbjct: 301 LEER---------------------IRNMKEDRLKKQTEESEVLSWVKRSRKLE--EKKL 360

Query: 361 SDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYD 420
           S+ + A +     +  +N++ G     D+A           +D  N+ +    K+L   D
Sbjct: 361 SEKEKALQLSKIFEEQDNIDQGVSDD-DIAP----------EDTTNNHDLAGVKVLHGVD 420

Query: 421 DPAAA--------DEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRD 480
                        D+ +  DG                   ++  +DVLENVEIGEQKQRD
Sbjct: 421 KVLEGGAVVLTLKDQSILADGN------------------VNEELDVLENVEIGEQKQRD 480

Query: 481 MAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEEL 540
           +AYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEEL
Sbjct: 481 IAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEEL 540

Query: 541 RKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAI 600
           R+RLQGASSVKHFEDLN S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAI
Sbjct: 541 RRRLQGASSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAI 600

Query: 601 SSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQ-SSVRL 660
           S+GLGVGDLG RNDS RQA+K EQE+SEAEMR NAYQSAYAKADEASRSLQLVQ SS RL
Sbjct: 601 SAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSLQLVQNSSARL 660

Query: 661 DDNEDTFIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLATTTISGQTTDDQNTKAG 720
           +DN+D  I DDDED YKSLERARKLALKKQ+AASGP AVALLAT T S Q TDDQ+TKAG
Sbjct: 661 EDNDDALIADDDEDFYKSLERARKLALKKQDAASGPGAVALLATATTSSQATDDQSTKAG 720

Query: 721 ELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVK 780
           ELQENKVVFTEMEEFVWGLQLDE++HKPEE+DVFMDDDE PKEEYHED KDKDGGWTEVK
Sbjct: 721 ELQENKVVFTEMEEFVWGLQLDEDAHKPEEDDVFMDDDEIPKEEYHEDVKDKDGGWTEVK 780

Query: 781 DTAKEEPTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSK 840
           DTA EE TPE+NE +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSK
Sbjct: 781 DTAMEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSK 814

Query: 841 LVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRI 873
           LVGI+DEDEPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRI
Sbjct: 841 LVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRI 814

BLAST of Cp4.1LG01g02180 vs. NCBI nr
Match: gi|659119412|ref|XP_008459643.1| (PREDICTED: trichohyalin [Cucumis melo])

HSP 1 Score: 892.9 bits (2306), Expect = 5.5e-256
Identity = 572/878 (65.15%), Positives = 648/878 (73.80%), Query Frame = 1

Query: 1   MDADGSSVPEHDERNGHEARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60
           MD + SS P  DERNG          DD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD
Sbjct: 1   MDWERSSAP--DERNG----------DDLGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKD 60

Query: 61  RDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVY 120
           R+RSKR SDDASKEKEKEVKDSERDRV  RE+RKEDRDEH+KER R  KVKDKDYDRE+Y
Sbjct: 61  RERSKRSSDDASKEKEKEVKDSERDRVRSREKRKEDRDEHEKERGRGSKVKDKDYDREIY 120

Query: 121 KEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKE 180
           K+KEYERERDRKDRGKD+ER RERELEKDNVRG DKERGKEKDRDR+++R+RDRDRKKK+
Sbjct: 121 KDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDRDKDRDRDRDRKKKD 180

Query: 181 KDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIG 240
           KDKDRSNE EREKGREK RDQE+KES                     +N +K R KE I 
Sbjct: 181 KDKDRSNEIEREKGREKHRDQEDKES--------------------YRNVDKERGKERI- 240

Query: 241 GKNDEERIDWIAHGAKDYMLESDGEDNRDRG--VDQG-NAVQHLGGEENSDGLKVGAQSS 300
              D+ + D      +D        D    G   D+G + +    GE N D         
Sbjct: 241 -LEDDRKTDQTKQKLQDKEGIGSKNDEERTGWIADEGKDYMLESDGENNRD--------- 300

Query: 301 SAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNID 360
                   R + +  + +     E  D +K         L E+ + ++       +D + 
Sbjct: 301 --------RDVNQGNMVQHLGGEENFDGLKVGSHPSSTMLEERIRNMK-------EDRLK 360

Query: 361 QGASDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLP 420
           +   + ++ A  +     LE  ++ E+++     K  +++  I  D  +D+ A E     
Sbjct: 361 KQTEESEVLAW-VKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQDVSDDDIAPENTTNN 420

Query: 421 QYDDPAAADEGL--TLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAY 480
                     G+   L+G        ++ +  D  +  +  +D+LENVEIGEQKQRDMAY
Sbjct: 421 HDLTGVKVLHGVDKVLEGGAVVLTLKDQSILADGDV--NEELDMLENVEIGEQKQRDMAY 480

Query: 481 KAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKR 540
           KAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELR+R
Sbjct: 481 KAAKKKTGIYDDKFNDENDGEKKMLPQYDDPAEADEGLTLDGRGGFNNDAEKKLEELRRR 540

Query: 541 LQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSG 600
           LQG SSVKHFEDLN S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAIS+G
Sbjct: 541 LQGTSSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAG 600

Query: 601 LGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQ-SSVRLDDN 660
           LGVGDLG RNDS RQA+K EQE+SEAEMR NAYQSAYAKADEASRSLQLVQ SS RL+DN
Sbjct: 601 LGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSLQLVQTSSTRLEDN 660

Query: 661 EDTFIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLATTTISGQTTDDQNTKAGELQ 720
           +D  I DDDED YKSLERARKLALKKQ+AASGP A+ALLAT T S Q TDDQNTKAGELQ
Sbjct: 661 DDALIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQ 720

Query: 721 ENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVKDTA 780
           ENKV+FTEMEEFVWGLQLDE++HKPEEEDVFMDDDE PKEEYHED KDKDGGWTEVKDTA
Sbjct: 721 ENKVIFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTA 780

Query: 781 KEEPTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVG 840
           KEE  P++N+ +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVG
Sbjct: 781 KEESIPDENKAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVG 817

Query: 841 IIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRI 873
           I+DEDEPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRI
Sbjct: 841 IVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRI 817

BLAST of Cp4.1LG01g02180 vs. NCBI nr
Match: gi|1009141433|ref|XP_015888191.1| (PREDICTED: SART-1 family protein DOT2 [Ziziphus jujuba])

HSP 1 Score: 721.1 bits (1860), Expect = 2.9e-204
Identity = 496/887 (55.92%), Positives = 617/887 (69.56%), Query Frame = 1

Query: 10  EHDERNGHEARDRGEGQ--DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRSKRR 69
           EH+ER+    R+  +G   DD   +G EK  KHR++D +K SRGEEK+HRSKDR+RSKR 
Sbjct: 11  EHEERDDSPMREPQDGGAFDDLEENGIEKLGKHRNKDRKKGSRGEEKEHRSKDRERSKRS 70

Query: 70  SDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYDREVYKEKEYER 129
            DD  KE+EKE KDSER+RV  RERRK+DRDE D++R+++ KV++KDYDRE Y+EKE ER
Sbjct: 71  GDDLLKEREKEAKDSERERVSSRERRKDDRDERDRDRSKDIKVREKDYDREKYREKERER 130

Query: 130 ERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKK---KEKDKD 189
           E+DRKDRGK+K+R +ERE+EKD+ RG+DK+RGKEK RDR++ERE++RDR K   KEKD+D
Sbjct: 131 EKDRKDRGKEKDREKEREVEKDSDRGRDKDRGKEKSRDRDKEREKERDRDKDREKEKDRD 190

Query: 190 RSNENEREKGREKRRDQEE-KESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKN 249
           +  E ERE+ REK +D+E+ +ESY+  DKE+GKEK    +++ DQ+K+KLRD++    ++
Sbjct: 191 KVKEKERERDREKHKDREKGRESYKEGDKEKGKEKTKEKEREADQDKDKLRDRDS--KRS 250

Query: 250 DEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEE 309
            E+  DW   G K+   + D +D                GE+ ++ L  GA  SS  LEE
Sbjct: 251 SEDDYDWNKDGGKENKSKLDDDD----------------GEQIAEDLAGGAHPSSTHLEE 310

Query: 310 RIRTMKEDRLKKQTEE-SEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGASD 369
           RI  M+E RLKK+TE+ S++L WV RSRKLEEKK+TEKEKALQLSKIFEEQDNI Q  S+
Sbjct: 311 RILRMREGRLKKKTEDVSDILAWVNRSRKLEEKKITEKEKALQLSKIFEEQDNIGQEESE 370

Query: 370 DDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDP 429
           DD               E  +   RD+A    K   GI  DK  D  A    +       
Sbjct: 371 DD--------------EEAAQHNARDLA--GVKVLHGI--DKVLDGGAVVLTL------- 430

Query: 430 AAADEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAYKAAKKKT 489
              D+ +  DG           L ED        +D+LENVEIGEQK+RD AYKAAKKKT
Sbjct: 431 --KDQNILADGD----------LNED--------IDMLENVEIGEQKRRDDAYKAAKKKT 490

Query: 490 GIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSV 549
           GIY DKFND+   EK MLPQYDDPA  DEG+ LD  GRF+ +AEKKLEELRKRLQG    
Sbjct: 491 GIYADKFNDDPNSEKTMLPQYDDPAT-DEGVILDERGRFTGEAEKKLEELRKRLQGVPKN 550

Query: 550 KHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLG 609
             +EDLN   KVS DY+T +EML+FKKPKKKKSLRKK+KLDIDALEAEA+++GLGVGDLG
Sbjct: 551 NRYEDLNLPGKVSSDYFTPEEMLQFKKPKKKKSLRKKDKLDIDALEAEAVNAGLGVGDLG 610

Query: 610 PRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQS-SVRLDDNEDTFIED 669
            RN+S R+A   EQER+EA+ +  AYQ AYAKADEAS++L+L Q+  V+ +++E     D
Sbjct: 611 SRNNSKRKAILEEQERAEADRKNQAYQLAYAKADEASKTLRLEQTLPVKSEEDETPVAGD 670

Query: 670 DDEDLYKSLERARKLALKK-QEAASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVF 729
           +DEDLYKSLERARKLALKK +EA SGPEA+ALLA     GQ  DD   K GE QEN++VF
Sbjct: 671 EDEDLYKSLERARKLALKKKEEAPSGPEAIALLAANNAGGQNADDGAAKTGESQENRLVF 730

Query: 730 TEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKDGGWTEVKDTAKEE-PT 789
           +EMEEFVWGLQL+EE+ KP+ EDVFM++DE PK    E+   + GGWTEVKD  K+E P+
Sbjct: 731 SEMEEFVWGLQLEEEAQKPDGEDVFMEEDEEPKAS-DEEIVVEAGGWTEVKDVEKDENPS 790

Query: 790 PEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDED 849
            +D E I PDETIHE   GKGLS+ LKLLKDRGTLKE  +WGGRNMDK+KSKLVGI+D+D
Sbjct: 791 NDDKEEIVPDETIHEAAFGKGLSNALKLLKDRGTLKEGPDWGGRNMDKKKSKLVGIVDDD 832

Query: 850 EPKE---AKSKDSRLSSLVDYKKE-----------IHIERTDEFGRI 873
           EPKE   A+ K         Y+KE           IHIERTDEFGRI
Sbjct: 851 EPKETNPARYKRDEQRETRGYQKETHPAKVYQEKDIHIERTDEFGRI 832

BLAST of Cp4.1LG01g02180 vs. NCBI nr
Match: gi|731407973|ref|XP_010656678.1| (PREDICTED: SART-1 family protein DOT2 [Vitis vinifera])

HSP 1 Score: 699.1 bits (1803), Expect = 1.2e-197
Identity = 490/895 (54.75%), Positives = 615/895 (68.72%), Query Frame = 1

Query: 1   MDADGSSV-PEHDE----RNGHEARDRGEGQ-DDFGYSGAEKSSKHRSEDHRKSSRGEEK 60
           MD D S   PE  +    R+    RD  +G  DD   +G EKSSKHRS+D RK SR EEK
Sbjct: 1   MDMDWSEPKPERSDELRDRDDSPTRDYHDGAYDDLEENGIEKSSKHRSKD-RKKSRREEK 60

Query: 61  DHRSKDRDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKD 120
           DHR KDR+RSK  + D  KE+EKE KDSE+DRV  RERRKEDRDE +K+R R+K V++KD
Sbjct: 61  DHRGKDRERSK--AGDGLKEREKETKDSEKDRVTSRERRKEDRDEREKDRNRDK-VREKD 120

Query: 121 YDREVYKEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDR 180
           YDRE Y++KE ER++DRKDRGK+KER RERE++K++ RG+DKERGKEK+RDR++ERE++R
Sbjct: 121 YDREKYRDKERERDKDRKDRGKEKEREREREVDKESDRGRDKERGKEKNRDRDKEREKER 180

Query: 181 DRKKKEKDKDRSNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLR 240
           DR K   D+DR  E E+ K REK R+ ++      IDKE+GKE+    +++ DQ++++ +
Sbjct: 181 DRTK---DRDREKEKEKSKDREKERENDKDRDRDAIDKEKGKERIRDKEREADQDRDRYK 240

Query: 241 DKEGIGGKNDEERIDWIAHGAKDYMLESDGEDNRDRGV-----------DQGNAVQHLGG 300
           D++    KN +E  D    G KD  L+ DG DNRDR V           D   A++H   
Sbjct: 241 DRDKGSRKNRDEGHDRSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDDSRAIEH--- 300

Query: 301 EENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTE-ESEVLDWVKRSRKLEEKKLTEKEK 360
           E+N++G   G QSS+A L+ERI  MKE+R+K+++E  SEVL WV RSRK+EE++  EKEK
Sbjct: 301 EKNAEGAS-GPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRNAEKEK 360

Query: 361 ALQLSKIFEEQDNIDQGASDDDIAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYD 420
           ALQLSKIFEEQ                       +N++ GE         +++   G+  
Sbjct: 361 ALQLSKIFEEQ-----------------------DNIDQGESDDEKPTRHSSQDLAGV-- 420

Query: 421 DKFNDENAGEKKMLPQYDDPAAADEG-LTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLE 480
                      K+L   D         LTL      +N     + ED        +D+LE
Sbjct: 421 -----------KVLHGLDKVIEGGAVVLTLKDQDILANG---DINED--------VDMLE 480

Query: 481 NVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRF 540
           NVEIGEQK+RD AYKAAKKKTGIY+DKFNDE   EKK+LPQYDDP   DEGL LD +GRF
Sbjct: 481 NVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVT-DEGLALDASGRF 540

Query: 541 SNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEK 600
           + +AEKKLEELR+RLQG S+   FEDLN   K S DYYT +EML+FKKPKKKKSLRKKEK
Sbjct: 541 TGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKKEK 600

Query: 601 LDIDALEAEAISSGLGVGDLGPRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRS 660
           L+IDALEAEA+S+GLGVGDLG RND  RQ+ + EQERSEAEMR +AYQ AYAKADEAS++
Sbjct: 601 LNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEASKA 660

Query: 661 LQLVQS-SVRLDDNEDTFIEDDDEDLYKSLERARKLALKKQE--AASGPEAVALLATTTI 720
           L+L Q+  V+L++NE+    +DDE+L KSL+RARKL L+KQ+  A SGP+A+ALLA+TT 
Sbjct: 661 LRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGPQAIALLASTTT 720

Query: 721 SGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHE 780
           S Q  D+QN  +GE QEN+VVFTEMEEFVWGLQL++E+HKP+ EDVFMD+DEAPK    +
Sbjct: 721 SSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDGEDVFMDEDEAPKAS-DQ 780

Query: 781 DEKDKDGGWTEVKDTAKEE-PTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKES 840
           + KD+ GGWTEVKDT K+E P  E+ E + PD+TIHEV VGKGLS  L+LLK+RGTLKE 
Sbjct: 781 ERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGLSGALQLLKERGTLKEG 818

Query: 841 IEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRI 873
           IEWGGRNMDK+KSKLVGI D                     KEI IERTDEFGRI
Sbjct: 841 IEWGGRNMDKKKSKLVGIYDNTG-----------------TKEIRIERTDEFGRI 818

BLAST of Cp4.1LG01g02180 vs. NCBI nr
Match: gi|590611180|ref|XP_007022027.1| (U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma cacao])

HSP 1 Score: 647.9 bits (1670), Expect = 3.1e-182
Identity = 468/885 (52.88%), Positives = 588/885 (66.44%), Query Frame = 1

Query: 13  ERNGHEARDRGEGQDDFGYSGA-EKSSKHRSEDHRKSSRGEEKDHRSKDR--DRSKRRSD 72
           +R    +R+R +G     YS   E++ KHRS+D +KSS  EEKDHRS+DR  DRSKR +D
Sbjct: 7   DREDDVSRERWDGG---AYSDELEQNDKHRSKDKKKSSLEEEKDHRSRDRERDRSKRSND 66

Query: 73  DASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKV--KDKDYDREVYKEKEYER 132
           +  KE+EK+ KD E+DRV  RERRK+DRDEH K+R+R+ KV  K+KDYDR+ Y+EKE+ER
Sbjct: 67  EILKEREKDFKDLEKDRVSSRERRKDDRDEHGKDRSRDSKVREKEKDYDRDKYREKEHER 126

Query: 133 ER--DRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDRKKKEKDKDR 192
           ER  DRKDRGK+K+R R R+ EK        ERGK+K RDR+RE+E++RD+ K+ + KDR
Sbjct: 127 EREKDRKDRGKEKDRERGRDSEK--------ERGKDKGRDRDREKEKERDKAKEREKKDR 186

Query: 193 SNENEREKGREKRRDQEEKESYRNIDKERGKEKNLVDDKKGDQNKEKLRDKEGIGGKNDE 252
             E E EK R++             D+E+GKE++    ++ D  KE+ RD++    KN E
Sbjct: 187 EKEREGEKDRDR-------------DREKGKERSKQKSREADLEKERSRDRDNAIKKNHE 246

Query: 253 ERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQHLGGEENSDGLKVGAQSSSAMLEERI 312
           E  +    G+KD  L  D  D+RD+   + NA  + G           AQ+SS+ LEERI
Sbjct: 247 EDYE----GSKDGELALDYGDSRDKDEAELNAGSNAGV----------AQASSSELEERI 306

Query: 313 RTMKEDRLKKQTEE-SEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNIDQGASDDD 372
             MKE+RLKK++E  SEVL+WV   RKLEEK+  EKEKALQ SKIFEEQD+  QG ++  
Sbjct: 307 ARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENE-- 366

Query: 373 IAAEDITNMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENAGEKKMLPQYDDPAA 432
                             E+  R  A+  A  K     DK  D  A    +         
Sbjct: 367 -----------------DEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTL--------- 426

Query: 433 ADEGLTLDGTGRFSNDAEKKLEEDYILWLHRYMDVLENVEIGEQKQRDMAYKAAKKKTGI 492
            D+ +  +G           + ED        +D+LENVEIGEQ++RD AYKAAKKKTG+
Sbjct: 427 KDQSILANGD----------INED--------VDMLENVEIGEQRRRDEAYKAAKKKTGV 486

Query: 493 YDDKFNDENAGEKKMLPQYDDPAAADEGLTLDGTGRFSNDAEKKLEELRKRLQGASSVKH 552
           YDDKFNDE   EKK+LPQYD+P A DEG+TLD  GRF+ +AEKKL+ELRKRLQG  +   
Sbjct: 487 YDDKFNDEPGSEKKILPQYDNPVA-DEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNR 546

Query: 553 FEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKEKLDIDALEAEAISSGLGVGDLGPR 612
            EDLN + K++ DYYTQ+EML+FKKPKKKK+LRKKEKLDIDALEAEAISSGLG GDLG R
Sbjct: 547 VEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSR 606

Query: 613 NDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASRSLQLVQS-SVRLDDNEDTFIEDDD 672
           ND+ RQA + E+ RSEAE R +AYQSAYAKADEAS+SL L Q+  V+ +++E+    DDD
Sbjct: 607 NDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDD 666

Query: 673 EDLYKSLERARKLALKKQE-AASGPEAVALLATTTISGQTTDDQNTKAGELQENKVVFTE 732
           +DLYKS+ER+RKLA KKQE   SGP+A+AL ATT    QT DDQ T  GE QENK+V TE
Sbjct: 667 DDLYKSIERSRKLAFKKQEDEKSGPQAIALRATTAAISQTADDQTTTTGEAQENKLVITE 726

Query: 733 MEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHEDEKDKD--GGWTEVKDTAKEE-PT 792
           MEEFVWGLQ DEE+HKP+ EDVFMD+DE P    H+ +  ++  GGWTEV D + +E P+
Sbjct: 727 MEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPS 786

Query: 793 PEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDED 852
            ED + I PDETIHEV VGKGLS  LKLLKDRGTLKESIEWGGRNMDK+KSKLVGI+D+D
Sbjct: 787 NEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDD 793

Query: 853 EPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRICNEELVLHVYAN 885
                           D  K+I IERTDEFGRI   +    V ++
Sbjct: 847 REN-------------DRFKDIRIERTDEFGRIITPKEAFRVLSH 793

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
DOT2_ARATH	5.9e-117	44.57	SART-1 family protein DOT2 OS=Arabidopsis thaliana GN=DOT2 PE=1 SV=1
SNUT1_HUMAN	1.1e-14	58.90	U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1
SNUT1_RAT	1.1e-14	58.90	U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus GN=Sart1 PE=1 SV=1
SNUT1_MOUSE	1.1e-14	58.90	U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus GN=Sart1 PE=1 SV=1

Match Name	E-value	Identity (%)	Description
A0A0A0KXY6_CUCSA	3.0e-293	70.83	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650610 PE=4 SV=1
D7UD56_VITVI	8.2e-198	54.75	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0078g00440 PE=4 SV=...
A0A061F934_THECC	2.2e-182	52.88	U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 OS=Theobroma cacao GN=TCM_0321...
A0A061FA02_THECC	2.2e-182	52.88	U4/U6.U5 tri-snRNP-associated protein 1 isoform 3 (Fragment) OS=Theobroma cacao ...
M5Y4E7_PRUPE	7.7e-172	49.61	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000914mg PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT5G16780.1	3.3e-118	44.57	SART-1 family
AT3G14700.1	3.9e-10	55.22	SART-1 family

Match Name	E-value	Identity (%)	Description
gi|778708017|ref|XP_011656108.1|	4.3e-293	70.83	PREDICTED: SART-1 family protein DOT2 [Cucumis sativus]
gi|659119412|ref|XP_008459643.1|	5.5e-256	65.15	PREDICTED: trichohyalin [Cucumis melo]
gi|1009141433|ref|XP_015888191.1|	2.9e-204	55.92	PREDICTED: SART-1 family protein DOT2 [Ziziphus jujuba]
gi|731407973|ref|XP_010656678.1|	1.2e-197	54.75	PREDICTED: SART-1 family protein DOT2 [Vitis vinifera]
gi|590611180|ref|XP_007022027.1|	3.1e-182	52.88	U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term	Definition
GO:0000398	mRNA splicing, via spliceosome
Vocabulary: INTERPRO
Term	Definition
IPR005011	SNU66/SART1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000398 mRNA splicing, via spliceosome
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0009908 flower development
biological_process GO:0016458 gene silencing
biological_process GO:0048366 leaf development
biological_process GO:0009933 meristem structural organization
biological_process GO:0050794 regulation of cellular process
biological_process GO:0040029 regulation of gene expression, epigenetic
biological_process GO:0009628 response to abiotic stimulus
biological_process GO:0048364 root development
biological_process GO:0010051 xylem and phloem pattern formation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g02180.1	Cp4.1LG01g02180.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005011	SNU66/SART1 family	PFAM	PF03343	SART-1	coord: 457..871, score: 1.2E-33; coord: 940..974, score: 1.6
None	No IPR available	unknown	Coil	Coil	coord: 961..981, score: -; coord: 72..92, score: -; coord: 604..631, score: -; coord: 522..542, scor
None	No IPR available	PANTHER	PTHR14152	SQUAMOUS CELL CARCINOMA ANTIGEN RECOGNISED BY CYTOTOXIC T LYMPHOCYTES	coord: 317..379, score: 6.8E-142; coord: 50..202, score: 6.8E-142; coord: 423..846, score: 6.8E-142; coord: 940..1050, score: 6.8E
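
The "coord: start..end" entries in the InterPro alignment column give the span of each match on the protein. As a rough illustration, the sketch below extracts those spans and computes their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `COORD_RE` and `span_lengths` are hypothetical helpers introduced here, not part of any database tool.

```python
import re

# Matches "coord: 457..871" style entries (assumed 1-based, inclusive).
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every 'coord: start..end' in text."""
    spans = []
    for match in COORD_RE.finditer(text):
        start, end = int(match.group(1)), int(match.group(2))
        # Inclusive coordinates: a span start..end covers end - start + 1 residues.
        spans.append((start, end, end - start + 1))
    return spans


# The two PFAM PF03343 (SART-1) matches listed for this gene.
pfam_hits = "coord: 457..871 score: 1.2E-33 coord: 940..974 score: 1.6"
for start, end, length in span_lengths(pfam_hits):
    print(f"{start}..{end}: {length} residues")
```

Under that assumption, the larger PF03343 match (457..871) covers 415 residues and the smaller one (940..974) covers 35.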

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG01g02180	CmaCh04G004880	Cucurbita maxima (Rimu)	cmacpeB720
Cp4.1LG01g02180	CmoCh04G005240	Cucurbita moschata (Rifu)	cmocpeB673
Cp4.1LG01g02180	Carg08158	Silver-seed gourd	carcpeB0527
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cp4.1LG01g02180	Cucurbita pepo (Zucchini)	cpecpeB234
Cp4.1LG01g02180	Cucurbita pepo (Zucchini)	cpecpeB261
Cp4.1LG01g02180	Cucurbita pepo (Zucchini)	cpecpeB400
Cp4.1LG01g02180	Cucumber (Gy14) v1	cgycpeB0599
Cp4.1LG01g02180	Cucumber (Gy14) v1	cgycpeB0654
Cp4.1LG01g02180	Cucurbita maxima (Rimu)	cmacpeB136
Cp4.1LG01g02180	Cucurbita maxima (Rimu)	cmacpeB351
Cp4.1LG01g02180	Cucurbita maxima (Rimu)	cmacpeB531
Cp4.1LG01g02180	Cucurbita moschata (Rifu)	cmocpeB114
Cp4.1LG01g02180	Cucurbita moschata (Rifu)	cmocpeB311
Cp4.1LG01g02180	Cucurbita moschata (Rifu)	cmocpeB489
Cp4.1LG01g02180	Wild cucumber (PI 183967)	cpecpiB388
Cp4.1LG01g02180	Wild cucumber (PI 183967)	cpecpiB470
Cp4.1LG01g02180	Cucumber (Chinese Long) v2	cpecuB389
Cp4.1LG01g02180	Cucumber (Chinese Long) v2	cpecuB470
Cp4.1LG01g02180	Bottle gourd (USVL1VR-Ls)	cpelsiB348
Cp4.1LG01g02180	Bottle gourd (USVL1VR-Ls)	cpelsiB369
Cp4.1LG01g02180	Watermelon (Charleston Gray)	cpewcgB378
Cp4.1LG01g02180	Watermelon (Charleston Gray)	cpewcgB392
Cp4.1LG01g02180	Watermelon (Charleston Gray)	cpewcgB394
Cp4.1LG01g02180	Watermelon (97103) v1	cpewmB431
Cp4.1LG01g02180	Watermelon (97103) v1	cpewmB433
Cp4.1LG01g02180	Watermelon (97103) v1	cpewmB453
Cp4.1LG01g02180	Melon (DHL92) v3.5.1	cpemeB394
Cp4.1LG01g02180	Melon (DHL92) v3.5.1	cpemeB411
Cp4.1LG01g02180	Melon (DHL92) v3.5.1	cpemeB412
Cp4.1LG01g02180	Cucumber (Gy14) v2	cgybcpeB362
Cp4.1LG01g02180	Cucumber (Gy14) v2	cgybcpeB942
Cp4.1LG01g02180	Melon (DHL92) v3.6.1	cpemedB457
Cp4.1LG01g02180	Melon (DHL92) v3.6.1	cpemedB481
Cp4.1LG01g02180	Silver-seed gourd	carcpeB0927
Cp4.1LG01g02180	Cucumber (Chinese Long) v3	cpecucB0484
Cp4.1LG01g02180	Cucumber (Chinese Long) v3	cpecucB0582
Cp4.1LG01g02180	Wax gourd	cpewgoB0516
Cp4.1LG01g02180	Wax gourd	cpewgoB0517