Cp4.1LG02g14090 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g14090
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Kanadaptin
Location: Cp4.1LG02 : 12162299 .. 12171970 (+)
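The location field can be split into sequence ID, start, end, and strand. The sketch below is only illustrative: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG02 : 12162299 .. 12171970 (+)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under that assumption the gene spans 9672 positions on the plus strand of Cp4.1LG02.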
The following sequences are available for this feature:

Gene sequence (with intron)

TTAAAAATTAATCAATTTTAATGTTTTGGATCCAAAATATTTTTTATTTTTTATTTTTTATTTTTTAAGATGTCAGATATTTTTGTATCTTAAAAAAAAAAAAAAAAAATACCGAATTTCCCATCCGAGTGTAGAATCTCTTGTTCAGCATAGTAACGACGTCGATTTTCACCACTGGACACAGGGCAACTCCGGCGACGGCGAGTGGTGACTGGTGAGTTAAGCGCCATCTGGCGGTGATCACTAAAGGTTTATATATCATAGAACCACTCGTCCATAGTGTATCTCCCGCTGTCACTTTTAGAACGCCGAGATGACGACTGCAATGGGACCTCCACCGCCTAGAAACCCTTCCTCCGCTTCTCCTATGGATTCTGATGCCGGAACCCTTGAGGGAGATTCAACCTCTTCTTCAACGGAAACGAAGGTCACCATGGGCCCTCCTCTTCCAAAAAATCCCACTCCTCCCGATTCTGACCCCCCTGCCCCGACCGCAACTCAAGAAGATGAATCATCGGTGATTTCGGTCAATTCTGATGCTTCAGAACCCGTTGATAAGGTTCCAGACACTCCTCCATCTGATAAAGCTGTGGAACTGGCTCCGAAGCAACCCCAGAGCGTAGCGGTGCCATACACCATTCCTTCTTGGAGTGGAGCCCCCTCCCATCGTTTCTATTTGGAGGTTCTGAAGGATGGATGCATTATTGATCAATTTGATGTGTAAGTCCGTTGAATGTTTGAAAGAAATTTTTACAACTTCGTATTAATTTCTCAAGATAGATGAAATTCTGGGAACTTCGAGAAACGGCCTGATATATTGTTTTTCTCATACTGGGTCTTGATGAAATCATATCCTACTGTTCTTTTATTTGTCAAATATTGGTCACTGAGTTGTTTATATGGTTTTAGATATGAGAAGGGGGCTTATATGTTTGGACGTGTGGATCTCTGCGATTTTGTTCTGGAGCATCCAACCATTTCTCGGTTTCACGCTGGTAGCCAACTTTTCACTTATACTTACATTAGTTGACATGAACATTGTAGGGTTCTTATCACAGGATTTTCAAGCTTATTAACTTCAATGCAGAACTTATACAGTAGTAAATTTAGGTTCAACAGTTTTTAGCGATTATCATTGACTATTACATGTACGAGATTTAATCATTGTTTCTATGGAACCGTTAAGTTTCCACACACTGTAATTGCATTTCCCTGGGATTACTTGTATTATCATGTAACACCATAGGAACCTTTATCAATTATCATATCTATTATAATATCATTGCATTCTATCGTGAAGGTTCCATACTACTTTACATGTACAGGATTTAATCCTTGTTTCTCTAGCATCATCAGGGTTCTATACATTTAACAATGCATTTCTCAGTGTCATGTCAGAGGGCTCTCCACTCTCCATTAGTTCTAAGCTGCCTTAATTAACAAACAGTAATAATATATAGGTGCTTTGAGCTAAGATCTAAATATCTATTATGTTCAATTGAAAATTGATCTTTCTGGATGTTGCACCTTCCTTTGACTTCTTTTTAAAACGGTTCGCTCTCTCTTTAAAGATAGAAGTTTAGTGCTATTATTTCAGTGGTAGGATAGCAACATCGTGGCTTTCTGTAAGACAAACTTGCAAGTTGTGAATTATGTTATACTTTCCATCTATAGTACATGATTTATTTGTCATCGCTAAATGAGACTTTTTTTTATCATTTCTAGCAACCGTAAGGCATATGATATCATTAAGGCCATTAAGTTGGTTGCTCAAAGGGCGAGCATCCAAATTCCTGTTTCTTAAAATTTAGAATACTAAGCATTGTAGAATGCCTATACAAGAGTCTGAATATCGAAAGAGCTTTTGATGCTCTCCTCAACCCTTGGGCTCTAAGGTTTAAATGGTGAGGCTTCTTTTTATGAAATAGTTTGGGGAAAAATGGATCGATAAGAAAATAATGTTGTTCACATTACCACAAGATATGCTTAAGCGGCAACTA
ATGTCGCACTAAAAAATTTATGCTACAATGGATTCAAGCTACTTCAAGTGGGGCTTCATTTATTCTTGGATGGTTAAAAATTGGGGGAATGATTAAAATCTAGATTTATGATATTTCATTAGTGTATTTTCAATTTTAAGACTTTTTTATATACTTTTAGGTTACATTTATATAATGGCTAATTATGGTCCTAGAGTATGAAAGTTATAACTCTTAGAATGCCTCATGGAAGGTCAATTGTTTCATTTTGAGGCTAGACAAAGCCATGTATGTTGTCTTGTAAAATAGACTTAGAAAATGTAGTATAAAGAGTATTTGTGGTTTCTTTTAGCTTAGAGCTTAGTTTCTTTGTGATTCTACATAGTTTAGGTTACATGCTTAGAATCATTGAAGCTTGACTTGATCAATCTTATTTGCGGAGTGATTCGAATCTCAAACAAGTTTTCTTGCCTTGAGATATTTGATCAACAAGGTAATCCGGATTTTACTTTTCCTTGGGTAATTCCTTATTGTTTTGGAGTTATTGGAAATTTTAGATCTAATGGTCTTGTTCTTCAATTAGGTTGTTAGGTCGAATTCCCAAGAGCTTCAATCATTTGGGATGAAATCATCTTGATTTGAGGGCTTTGACATTTGTGATCTAGGAGTTTGCATATCCATAAGGGTTGTTCTTATATAATTGGTTAGGGTTTTCATAGGACTATGGAATTAAAGAAACAAGAACACTATAAGGGCTTGACCCTTAGTAAGGATATTGACATCATTGTTGATAGGAAGTACTCTCGAGGCTCTCATCTCATATCATTGGTTGTAAAAACTAGATGTACGAGGTGGGTGAAAATTGGGGTGGGGTAAATTGCTTAAATCAAAATAGAAAAAGAAAGATATATCAGGCAATGTACCAACAATATCTTACAACATTAAGACTTCGGAGGTAGATTGAGAAACAGTATTCCCCTTATCATCCTTGAATCTTACAAGGAACATTGAATAGGAAACTAGACATTTTTGCCCTTTTTTCCTGCCACTATATATTATTTTCATCAATTTTTCTTACTTCTCATTTTAAAATTTATATTTAAAATGATTCAAGACAAACTACTATATTATTTAACTCATGAAAACAGTACAATTAAGTTATTTTCCAAGGTGTACCCATTAATTTGTCTACAAAAGAATTATAGCTATTCAAGGATGCTAGGTCTTATATATAGTTGAATGATATATTGACTGTATGTTTTTATTAAAGAATAATTTAAGTGGTCAGAATATGGTATTATTTTACTATATTGTCTTCCTTTTGTAGTTCTCCAGTTTAGAAGTAGTGGAGATGCATACCTTTATGATCTTGGAAGTACTCATGGAACTTTTATTAACAAGAATCAGGTATGGATCTTTCTTAGTTACTTTTTATTCTTGGTAATTGGTATTTGTATCAATCTTTAATCTTCTTCACAGTTTCAACTTCCCTATTAGAAAAACCTGAAAGGTTTATTATTATTGTCTGGCGATGGCTTTTCCTGCTTGTTTTTCCATATTTGATCGATAAAGTGGGATGGATTTTCTTTTATCTTCTGATGAACTTCTCCGTTCTTTTATTTAGGTGAAGAAAAGGATTTTTGTGGACTTGCATGTTGGTGATGTCATTCGATTTGGCCAGTAAGCAACTCTTCATTAGCCATTTTTTTATAGTAAGAAATGTCTGATTATTGGAGTTTTTCATTTTGAGATTCCATGTCGTTTAGTAGTTGTAGTGGGAAAACAAGCATTTCTTCAGTAGAACAGTAGTTGTTTCAAGGATGGTATTAATGTCGAAACCTTTTGTTTGGCAAGAACTTAAATAAAATCTTTCTAGATGGTTCGTGGAAATTTCTACCATCTCCATTGGAGTTATGAGATGAATTGCAGTTCTGATGGAGGTTAACAAGGGTGGTTGGAGTGAGTAGTGGGTGATGTTGGGTGATTTCGAGAATATTGCTGGTAAAGAGGAGGTGAAGGAA
AGTTTTTGGTTAGAAATGGTTTGTTTAAAGGGAGAGGAGAAATATCTGGTACAAATAGTAAAAACAAATTTGGAAGAAACCAACAGCGAAAGGGGTTAATGGGGAAGAAGGAAAGGATAATATAAAGAGAACAATTTTAGAGTGGGAAAATTCAAGCAAAAATGATTGAATTTTAAAGGGAAGAAGAATACAGGGGAAGGTAAAATTAGATATTTAGGGGAAAAGTGAGGGGGACTCGAGGAAAAAGATTGGAGTACAAAGAATAAATAGATTTCTCACGAAGAGATTGGGAGCATACCATAATTGTTACTAGATTTTAATTCCATGATGATTCATTCACCATTGTAGAAGCATTGAGATAGGTAATTTATGAAGTCTGTATGCTCAAACCTTTTATAGCGAATATAGCTTTGCTCAAGGGTCTGCGAAGGAAGACTTGGAGAACCCTTGTAGGTTAGAAGGGTAGTACGTTGTGGGTGCTTTTCCCTTGAGATTCACCTATAAGGTGGATAATTACAAAATTCCTTAGAGCTTGCTCTTCAAGAGCAAACTTAAACGGAAGAATCTCAAACCTTCCTCCAATCTTTTTGATATTCATGGAAAAAAGACAATGTTGTCTTGATTTGTCTGGTTGTATTGTTCGCATTCTTAATTTTTTTTTAAATCTCCTCTTGCTTCAGTTCAGCTTGTGCTTAAACTCTGACTTTTTATGTTCATTTGCAGTTCATCTCGATTGTACGTTTTTCAAGGGCCAAATCATTTGATGCTACCTGTAAGCTTCCTTTAGCATTTTTTTTTTCATTTACCACTAAAAAGGTTCTTGGAACTCCGTTCTTTTGTTTCTAATCTACCTGTATTCATTCAGTCTACTGCTTGTGTCATTGATTATAATGGACAGTTCCTTTCCTTCTAGTCAACTACTATGCCCTTGTTTTCAGTAAATTTCTCATTTCCAGATTTGTTCTCTCTTATGGTAAGGCGTGTTTAAGAATTTGGAATACTTCAAAATTCATGGATGTTGATTTGGAGAGAGTGATTGCTGAATGGTGAATATTAGCTTAGAAGGACCTTAAAAGGACCATCATACTACGCCTTTGCTTTGGGAATCTTCCAGATTTGTTTTCTCTCAAGTAACATGTTAAAAAACTCATGTGGATGGGTCTACTTTAAAATTCATTGATGTAGATATGGAGAGGGTGATTAGCGAATGGTGAATATTAGCTTAGAAGGACCAACACAGTATAAAACCCAAATATGACATGATCACAATGACATGTTGTAGAAAATTAGGGCACGGATACATTGGAAACAAATGTGTTTATTAACGTGTTTATTTATTTTAACACATTTTAGGGTTTCATACTTAATTTCTTTTATTTGGACTTATGTATTTCTCCTCTTTTATTTATATTGCAGAGATTTTTTTTTGGTTTTTTTTAATATATTTTGAAGGATTAAATTTCATTTCTTTGGTGTATTCTTTCTTCATTAAAACTATAGGTCCTCGCAGAAAATAAGTTTCAAATATGCTACCCTTACATACGTAAATTTTTTAAAAAAAAATAATATGCACATGTCCAAAACTAGGATTATAAATAATTTTAACACTCTAGTTTAAGTATTCTAGAGGCGTTGAAGCATTGTTGTATGGCATCTGTTGAATACATATACAAACATGTTAACCAAATTAATGTCCCTACTTCTACAGGATCGGTTCTGTACAACTATATCCCTGAGTATTCAGGAGACAATATGTCCAAAGTTCTAGGATCTTAGCTAATCAAATGCTACAAATTCTTACTCTAGTGTGTTGCCGTGCATCCTTGTTTTAGCTTTTGCATGCAGTTGCTTTCAGTTAAAATCTAGGTTTATTTACAACTTGATCTTGATTTGAATAAATATGTTGTTAGTTGAGTTATTTTGGACTGAACACCAGGAATCAGATTTGACCATGATAAAAAAGGCTAAGATTCGAGAACAGACACTAGATCGAGAAGCTT
CACTTCGACGAGCCCGACAGGAAGCATCTCTCGCTGATGGAATATCTTGGGGCATGGGAGAAGATGCTGTCGAAGAGGCTGAGGTTTGTGCAAAATCAACAAAAAAACTCATTTAGGGTCTTTGTCAGATTTTTCTTAGATTACTAGTTATACTTAAACTTGACCATGTTTCCTTGATTTAGAGTTTTGGTAGCTGAGTACTTTGCTGTGTTCTGATTATTTAGGATGAAGTTGATGAAGTCACATGGCAAACATACAAAGGACAGCTTACAGAAAAGCAGCAAAAAACTCGTGAAAAGGTTTTAAAAAGAACTGAAAAGGTTAGTCAATATCATCCTTTTTGGTTTTTAAATCATTTGAGAAGATCGTTATTTTCTCTACTATACGTTGCAATACTGACTACAAAATTCCAAAATGGAAGCCTTACATGTAAGTTACTATCTTTGAGAATCTTTCTTTGTTTCCCAAAGTTTTTTTTGTTCTTTTTCCATTTCGTTGTAATTTATTTATTTTATTTACTCAATGATTGGCAGACATTTCATCATCACTTTGTCAGTAATTGGTATGGTGATTTCTTAAATAAACGTACCTTTCCCATTAAAAAAAGATTATTCCATCAGTGGCACATATTATAGATGCTCTTAACTTGCTCTATTTCTCATATTTTACTGAGATGTTTACACAAGAATGTCAAGGACTCTTATTTCTTAGCTGTCAAGCTATGTCTAGCAGATGGAAGCTTTGATGTTGAATAAGCACCATAAATCTCTAGAAACTCGTTTTTATGATAAAAAATATTGTTATAATGTTCTTCCTTATCTATTATTCTATATGTTTACTGGTGCACTTATACAAGGGCTACATGCATGTTGGATACAGATTTCTCACATGAAGAAAGAAATTGATGCAATTCGTGCTAAAGACATTTCTCAAGGTGGATTGACGCAAGGGCAGCAAACTCAGATTGCTAGGAATGAACAAAGAATTACTCAGGTGAAAATTTATAGCCCAGAATTAGTACTCTTATTTATTTTATTTTTGCTTTATGCAATGCTTACCATTTTCAGTACTAGGATTTGATTCTTTATTTTTACTTTATCACAGCACCATTTTAATTGTTTCTTCTTCTAAAAATAATTTTGGCAGATCATGGAAGAACTTGAAAACTTGGAAGAGACACTGAATGATAGTATTAGGGAAAGCCTTGGAGCTCGTTCTGGGATTCGATCACTTGGTAAGAAGCAAGGAGGAATGGAAAATGATGAAGAACTTTTAAGGTACGTCCCAAATAAGTGTTGTGGAAGGGAGGAAGGGGGAATCATTTAGTAATTTGTATGCTTCTGTACTATAAATCTAGATATGTTATAATTATATTATTGAGATAGTGAGAATACGATTGACCCAATATGCTAGTATTATGGTGGCCTTTTATTGTATAAGAGAATGCATAGCGACTTCCATGAAATTTATGTTCTGTTTATTGTTATTTTTTCAGTGATGATGATGACTTCTATGACCGCACGAAGAAGCCTTCACATAAAAAAACTGGTGAAAATCAATCAATTGAAACAGCTGATTCTCTTCTTGATAAGAGAGATGCCCTCAATAAAGAAATGGATGAAAAAAAAAGATTGCTTTTGATTGAGGAGAACAAAATGGAATCACATACAGATTTGGACTCTGGCAATGATGCTCTCGATGCTTACATGTCAGGGCTTTCATCTCAGCTAGGTTTGGTTCTCAGACATAAGTATAGATACCTTGGTGTGCTGTTTTACAAGTATTGGTTGTTCCTTTTTAGCTCGAGGGCGGGTGTTACTTTCTTATTTTGTTATTTCCATGCTCATATAGATTATGCATAGTATATAACTACCAACACAGTGTTTGCAGCTAAAATTAATTCTCAGTAGTAAACATTAAAATCATGTTAATATATTACCTTTTAAATTTGAAGGATGTGACACTGAGACCTTAACTATTGTGAGTAACACTTATTGAAA
GCTATTATCTGCAATTTTTATGCTTATGTTCTTGCATAATAGTAGAAAACATTTCTTCTGACTGCTGTTGCTGGTTTGATTATTGTTGGCAGTGCTTGACAAAACCACCAAACTACAGAACGAACTATCGTCTCTTCAGTCAGAACTAGATAGAATTTTGTACCTGTTGAAAATTGCTGATCCATCAGGAGAAGCAGCCAAGAAAAGGGAAACTTCAGCCAAGAAAATTGATTCAAATCTAGAAGCAAAGCCTGAAAATTTTAAAGTCCCTGCATCTGTTAATGGGAAACCACAGAAGGAACTAGTAAAAGACGGTGAATCTAAAGAACAAGTGGTAGATGCCAAACAAAAAATTAAAACCACACAGGAAAGTGTTGAACCTAATGAGTCAGTTACTGAAAAAGTTGTGGATGATACAAAAGATAAAAAGACCATCAGTTACACTGTTGTTAAGCCCCAGTGGCTTGGGGCCATCGAAGAACTGAAATCTGAGGAAACTCAAAAGGATGCTGCACCATTGGATATACAAGAATCTGATGATTTTGTTGACTACAAAGACAGGAAAGACGTTCTTCAGAGTTCTGATAATAAGCCTGCAAATGTGGATTCTGTGATTGAGAGTGCTGCCCCTGGTTTGATTTTGAGAAAACGGAAGCAAGAAGATCAATCTGACGGTAACTTGGATGCCTCTCAACAGTCGACATCATCTTTGGAGGCAGAGAGAGCAGAATTTAAGGCAGAGGATGCTGTGGCTTTGCTGTTAAAGCACCAAAGAGGGTATCATGGATCAGATGAGGAGGAAAATCGACATGAAAGCAAGCGCCCGACAGGTCGAACCAGATCAAAAAAGAATGAGAAGAAGTCCAAGAGGGTACTTGGTCCCGAAAAACCGTCATTTCTAGATACAAAAGCTGATTATGACTCATGGGTACCTCCTGAAGGTAACTTCTTTAGTGAATACAGTGGCCTACTATTAACTTTGAAACTTCGTTCTCTCTAATGTTAAGTTGTCTGTTCACTTCGCAGGACAATCAGGCGATGGAAGGACAACATTAAACGAGCGTTATGGCTACTAATTTCCCCATGTTTCTAACAACATTTTGATCTGAGATCGGCCTCGCCCTTTTTTACTTTTTCAAGAAAGATGCATTGGTCACTGGCCCGTAAATTATTTCATCAATCTTAGCCGCTTTTTCCAGTGTATAAAAACAAAGGTAGTGAGGCTGCTGCTTTATAATTGAAAGCCTTGTGTTGTAATAAAAGAAGATGGAACTCAGGCAAAGATGTTCATGAACTATGGAACTAGCAAGCTTGGAGAAGTTTCTTGTCTATATACCTGGGTAAAGGGAGGGAATTGGAGAGCCTGAAAAATTGCTCTCTTGTTCTAGAAGATTTATTGTAACATTACATACAAGGGTTGTTCTTTTGTCCACACACCTTGAAAGACTTGCAACAACCTTCCAGTAATAGCGCAGCTGATAGTATGGAATAGTAACATTGTTGGTGGTTATTATAGATTTTTCACTATATTCACTCCTTTGTAAGATCTCATGTACCATACTCTTACCCGTGAACATGTTTGAATTATACTTTGAGGCATTCCCCGAGTTACATTGATACTTTGAAATGGTTGTTGGGAGATTGAAGTCGCCTAATATGTTCTGAAATTATA

mRNA sequence

TTAAAAATTAATCAATTTTAATGTTTTGGATCCAAAATATTTTTTATTTTTTATTTTTTATTTTTTAAGATGTCAGATATTTTTGTATCTTAAAAAAAAAAAAAAAAAATACCGAATTTCCCATCCGAGTGTAGAATCTCTTGTTCAGCATAGTAACGACGTCGATTTTCACCACTGGACACAGGGCAACTCCGGCGACGGCGAGTGGTGACTGGTGAGTTAAGCGCCATCTGGCGGTGATCACTAAAGGTTTATATATCATAGAACCACTCGTCCATAGTGTATCTCCCGCTGTCACTTTTAGAACGCCGAGATGACGACTGCAATGGGACCTCCACCGCCTAGAAACCCTTCCTCCGCTTCTCCTATGGATTCTGATGCCGGAACCCTTGAGGGAGATTCAACCTCTTCTTCAACGGAAACGAAGGTCACCATGGGCCCTCCTCTTCCAAAAAATCCCACTCCTCCCGATTCTGACCCCCCTGCCCCGACCGCAACTCAAGAAGATGAATCATCGGTGATTTCGGTCAATTCTGATGCTTCAGAACCCGTTGATAAGGTTCCAGACACTCCTCCATCTGATAAAGCTGTGGAACTGGCTCCGAAGCAACCCCAGAGCGTAGCGGTGCCATACACCATTCCTTCTTGGAGTGGAGCCCCCTCCCATCGTTTCTATTTGGAGGTTCTGAAGGATGGATGCATTATTGATCAATTTGATGTATATGAGAAGGGGGCTTATATGTTTGGACGTGTGGATCTCTGCGATTTTGTTCTGGAGCATCCAACCATTTCTCGGTTTCACGCTGTTCTCCAGTTTAGAAGTAGTGGAGATGCATACCTTTATGATCTTGGAAGTACTCATGGAACTTTTATTAACAAGAATCAGTTCATCTCGATTGTACGTTTTTCAAGGGCCAAATCATTTGATGCTACCTTTGAGTTATTTTGGACTGAACACCAGGAATCAGATTTGACCATGATAAAAAAGGCTAAGATTCGAGAACAGACACTAGATCGAGAAGCTTCACTTCGACGAGCCCGACAGGAAGCATCTCTCGCTGATGGAATATCTTGGGGCATGGGAGAAGATGCTGTCGAAGAGGCTGAGGATGAAGTTGATGAAGTCACATGGCAAACATACAAAGGACAGCTTACAGAAAAGCAGCAAAAAACTCGTGAAAAGGTTTTAAAAAGAACTGAAAAGATTTCTCACATGAAGAAAGAAATTGATGCAATTCGTGCTAAAGACATTTCTCAAGGTGGATTGACGCAAGGGCAGCAAACTCAGATTGCTAGGAATGAACAAAGAATTACTCAGATCATGGAAGAACTTGAAAACTTGGAAGAGACACTGAATGATAGTATTAGGGAAAGCCTTGGAGCTCGTTCTGGGATTCGATCACTTGGTAAGAAGCAAGGAGGAATGGAAAATGATGAAGAACTTTTAAGTGATGATGATGACTTCTATGACCGCACGAAGAAGCCTTCACATAAAAAAACTGGTGAAAATCAATCAATTGAAACAGCTGATTCTCTTCTTGATAAGAGAGATGCCCTCAATAAAGAAATGGATGAAAAAAAAAGATTGCTTTTGATTGAGGAGAACAAAATGGAATCACATACAGATTTGGACTCTGGCAATGATGCTCTCGATGCTTACATGTCAGGGCTTTCATCTCAGCTAGTGCTTGACAAAACCACCAAACTACAGAACGAACTATCGTCTCTTCAGTCAGAACTAGATAGAATTTTGTACCTGTTGAAAATTGCTGATCCATCAGGAGAAGCAGCCAAGAAAAGGGAAACTTCAGCCAAGAAAATTGATTCAAATCTAGAAGCAAAGCCTGAAAATTTTAAAGTCCCTGCATCTGTTAATGGGAAACCACAGAAGGAACTAGTAAAAGACGGTGAATCTAAAGAACAAGTGGTAGATGCCAAACAAAAAATTAAAACCACACAGGAAAGTGTTGAACCTAATGAGTCAGTTACTGAAAAAGTTG
TGGATGATACAAAAGATAAAAAGACCATCAGTTACACTGTTGTTAAGCCCCAGTGGCTTGGGGCCATCGAAGAACTGAAATCTGAGGAAACTCAAAAGGATGCTGCACCATTGGATATACAAGAATCTGATGATTTTGTTGACTACAAAGACAGGAAAGACGTTCTTCAGAGTTCTGATAATAAGCCTGCAAATGTGGATTCTGTGATTGAGAGTGCTGCCCCTGGTTTGATTTTGAGAAAACGGAAGCAAGAAGATCAATCTGACGGTAACTTGGATGCCTCTCAACAGTCGACATCATCTTTGGAGGCAGAGAGAGCAGAATTTAAGGCAGAGGATGCTGTGGCTTTGCTGTTAAAGCACCAAAGAGGGTATCATGGATCAGATGAGGAGGAAAATCGACATGAAAGCAAGCGCCCGACAGGTCGAACCAGATCAAAAAAGAATGAGAAGAAGTCCAAGAGGGTACTTGGTCCCGAAAAACCGTCATTTCTAGATACAAAAGCTGATTATGACTCATGGGTACCTCCTGAAGGACAATCAGGCGATGGAAGGACAACATTAAACGAGCGTTATGGCTACTAATTTCCCCATGTTTCTAACAACATTTTGATCTGAGATCGGCCTCGCCCTTTTTTACTTTTTCAAGAAAGATGCATTGGTCACTGGCCCGTAAATTATTTCATCAATCTTAGCCGCTTTTTCCAGTGTATAAAAACAAAGGTAGTGAGGCTGCTGCTTTATAATTGAAAGCCTTGTGTTGTAATAAAAGAAGATGGAACTCAGGCAAAGATGTTCATGAACTATGGAACTAGCAAGCTTGGAGAAGTTTCTTGTCTATATACCTGGGTAAAGGGAGGGAATTGGAGAGCCTGAAAAATTGCTCTCTTGTTCTAGAAGATTTATTGTAACATTACATACAAGGGTTGTTCTTTTGTCCACACACCTTGAAAGACTTGCAACAACCTTCCAGTAATAGCGCAGCTGATAGTATGGAATAGTAACATTGTTGGTGGTTATTATAGATTTTTCACTATATTCACTCCTTTGTAAGATCTCATGTACCATACTCTTACCCGTGAACATGTTTGAATTATACTTTGAGGCATTCCCCGAGTTACATTGATACTTTGAAATGGTTGTTGGGAGATTGAAGTCGCCTAATATGTTCTGAAATTATA

Coding sequence (CDS)

ATGACGACTGCAATGGGACCTCCACCGCCTAGAAACCCTTCCTCCGCTTCTCCTATGGATTCTGATGCCGGAACCCTTGAGGGAGATTCAACCTCTTCTTCAACGGAAACGAAGGTCACCATGGGCCCTCCTCTTCCAAAAAATCCCACTCCTCCCGATTCTGACCCCCCTGCCCCGACCGCAACTCAAGAAGATGAATCATCGGTGATTTCGGTCAATTCTGATGCTTCAGAACCCGTTGATAAGGTTCCAGACACTCCTCCATCTGATAAAGCTGTGGAACTGGCTCCGAAGCAACCCCAGAGCGTAGCGGTGCCATACACCATTCCTTCTTGGAGTGGAGCCCCCTCCCATCGTTTCTATTTGGAGGTTCTGAAGGATGGATGCATTATTGATCAATTTGATGTATATGAGAAGGGGGCTTATATGTTTGGACGTGTGGATCTCTGCGATTTTGTTCTGGAGCATCCAACCATTTCTCGGTTTCACGCTGTTCTCCAGTTTAGAAGTAGTGGAGATGCATACCTTTATGATCTTGGAAGTACTCATGGAACTTTTATTAACAAGAATCAGTTCATCTCGATTGTACGTTTTTCAAGGGCCAAATCATTTGATGCTACCTTTGAGTTATTTTGGACTGAACACCAGGAATCAGATTTGACCATGATAAAAAAGGCTAAGATTCGAGAACAGACACTAGATCGAGAAGCTTCACTTCGACGAGCCCGACAGGAAGCATCTCTCGCTGATGGAATATCTTGGGGCATGGGAGAAGATGCTGTCGAAGAGGCTGAGGATGAAGTTGATGAAGTCACATGGCAAACATACAAAGGACAGCTTACAGAAAAGCAGCAAAAAACTCGTGAAAAGGTTTTAAAAAGAACTGAAAAGATTTCTCACATGAAGAAAGAAATTGATGCAATTCGTGCTAAAGACATTTCTCAAGGTGGATTGACGCAAGGGCAGCAAACTCAGATTGCTAGGAATGAACAAAGAATTACTCAGATCATGGAAGAACTTGAAAACTTGGAAGAGACACTGAATGATAGTATTAGGGAAAGCCTTGGAGCTCGTTCTGGGATTCGATCACTTGGTAAGAAGCAAGGAGGAATGGAAAATGATGAAGAACTTTTAAGTGATGATGATGACTTCTATGACCGCACGAAGAAGCCTTCACATAAAAAAACTGGTGAAAATCAATCAATTGAAACAGCTGATTCTCTTCTTGATAAGAGAGATGCCCTCAATAAAGAAATGGATGAAAAAAAAAGATTGCTTTTGATTGAGGAGAACAAAATGGAATCACATACAGATTTGGACTCTGGCAATGATGCTCTCGATGCTTACATGTCAGGGCTTTCATCTCAGCTAGTGCTTGACAAAACCACCAAACTACAGAACGAACTATCGTCTCTTCAGTCAGAACTAGATAGAATTTTGTACCTGTTGAAAATTGCTGATCCATCAGGAGAAGCAGCCAAGAAAAGGGAAACTTCAGCCAAGAAAATTGATTCAAATCTAGAAGCAAAGCCTGAAAATTTTAAAGTCCCTGCATCTGTTAATGGGAAACCACAGAAGGAACTAGTAAAAGACGGTGAATCTAAAGAACAAGTGGTAGATGCCAAACAAAAAATTAAAACCACACAGGAAAGTGTTGAACCTAATGAGTCAGTTACTGAAAAAGTTGTGGATGATACAAAAGATAAAAAGACCATCAGTTACACTGTTGTTAAGCCCCAGTGGCTTGGGGCCATCGAAGAACTGAAATCTGAGGAAACTCAAAAGGATGCTGCACCATTGGATATACAAGAATCTGATGATTTTGTTGACTACAAAGACAGGAAAGACGTTCTTCAGAGTTCTGATAATAAGCCTGCAAATGTGGATTCTGTGATTGAGAGTGCTGCCCCTGGTTTGATTTTGAGAAAACGGAAGCAAGAAGATCAATCTGACGGTAACTTGGATGCCTCTCAACAGTCGACATCATCTTTGGAGGCAGA
GAGAGCAGAATTTAAGGCAGAGGATGCTGTGGCTTTGCTGTTAAAGCACCAAAGAGGGTATCATGGATCAGATGAGGAGGAAAATCGACATGAAAGCAAGCGCCCGACAGGTCGAACCAGATCAAAAAAGAATGAGAAGAAGTCCAAGAGGGTACTTGGTCCCGAAAAACCGTCATTTCTAGATACAAAAGCTGATTATGACTCATGGGTACCTCCTGAAGGACAATCAGGCGATGGAAGGACAACATTAAACGAGCGTTATGGCTACTAA

Protein sequence

MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPTATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHRFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDLGSTHGTFINKNQFISIVRFSRAKSFDATFELFWTEHQESDLTMIKKAKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQKTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETLNDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQSIETADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDSGNDALDAYMSGLSSQLVLDKTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASVNGKPQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISYTVVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRKRKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPTGRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY
BLAST of Cp4.1LG02g14090 vs. Swiss-Prot
Match: NADAP_HUMAN (Kanadaptin OS=Homo sapiens GN=SLC4A1AP PE=1 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 7.2e-16
Identity = 201/807 (24.91%), Positives = 322/807 (39.90%), Query Frame = 1

Query: 16  ASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPTATQEDESSVISVNSD 75
           A P+   A +    S+SS+ E     GP   ++    + D P P   Q D     S+  +
Sbjct: 78  ALPVSPAARSKAPASSSSNPEEVQKEGPTALQDSNSGEPDIPPP---QPDCGDFRSLQEE 137

Query: 76  ASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHRFYLEVLKDGCIIDQFD 135
            S P   V            +P  P   A PY  P W G  +  + LE LK G I+    
Sbjct: 138 QSRPPTAVS-----------SPGGPAR-APPYQEPPWGGPATAPYSLETLKGGTILGTRS 197

Query: 136 VYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDA----------YLYDLGSTHGT 195
           +      +FGR+  CD  LEHP++SR+HAVLQ R+SG            YLYDLGSTHGT
Sbjct: 198 LKGTSYCLFGRLSGCDVCLEHPSVSRYHAVLQHRASGPDGECDSNGPGFYLYDLGSTHGT 257

Query: 196 FINKNQF----ISIVRFSRAKSFDATFELFWTEHQESD--------LTMIKKAKIREQTL 255
           F+NK +        V       F  +  LF  +  E D        +T +K+ + ++Q L
Sbjct: 258 FLNKTRIPPRTYCRVHVGHVVRFGGSTRLFILQGPEEDREAESELTVTQLKELRKQQQIL 317

Query: 256 DREASLRRARQEASLAD---------------GISWGMGEDAVEEAEDEVDEVTWQTYKG 315
             +  L     E    D               G +WGMGEDAVE+  +E   V       
Sbjct: 318 LEKKMLGEDSDEEEEMDTSERKINAGSQDDEMGCTWGMGEDAVEDDAEENPIVL------ 377

Query: 316 QLTEKQQKTREKVLKRTEKI--SHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQI 375
              E QQ+     +K  +K       +E + +  +   QG  T   + ++  ++    Q+
Sbjct: 378 ---EFQQEREAFYIKDPKKALQGFFDREGEELEYEFDEQGHSTWLCRVRLPVDDSTGKQL 437

Query: 376 MEELENLEETLNDSIRESLGARSGIRSLGK-KQGGMENDEELLS-DDDDFYDRTKKPSHK 435
           + E  +  +     I+ SL A   + +LG  +Q  +    +  + +D+DFYD        
Sbjct: 438 VAEAIHSGKKKEAMIQCSLEACRILDTLGLLRQEAVSRKRKAKNWEDEDFYDSDD----- 497

Query: 436 KTGENQSIETADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDSGNDALDAYMSGLS 495
                      D+ LD+   +     EKKRL     N+M+    +D   +  ++ ++ L+
Sbjct: 498 -----------DTFLDRTGLI-----EKKRL-----NRMKKAGKIDEKPETFESLVAKLN 557

Query: 496 SQLVLDKTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENF 555
                      + ELS +   L                A  +  S      +L+A     
Sbjct: 558 DA---------ERELSEISERLK---------------ASSQVLSESPSQDSLDAFMSEM 617

Query: 556 KVPASVNGKPQKEL-VKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDK-KTI 615
           K  ++++G  +K+L ++  E +++    K  IK  + +  P    TE      ++K K +
Sbjct: 618 KSGSTLDGVSRKKLHLRTFELRKEQQRLKGLIKIVKPAEIPELKKTETQTTGAENKAKKL 677

Query: 616 SYTVVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVI 675
           +  +      G+  +LK+    K   P   +     +  KD  +V +  + +        
Sbjct: 678 TLPLFGAMKGGSKFKLKTGTVGK-LPPKRPELPPTLMRMKDEPEVEEEEEEEEEEEKEKE 737

Query: 676 ESAAPGLILRKRKQEDQSDGNLDASQQSTSSLEAER-----AEFKAEDAVALLLKHQRGY 735
           E         K+K ED S        +  ++++  R       FK          H+   
Sbjct: 738 EH-------EKKKLEDGSLSRPQPEIEPEAAVQEMRPPTDLTHFKETQT------HENMS 796

Query: 736 HGSDEEENR--HESKRPT----GRTRSK------KNEKKSKRVLGPEK-PSFLDTK---- 757
             S+EE+N+   +  + T    G + SK      + E K K+  GP K P  L +K    
Sbjct: 798 QLSEEEQNKDYQDCSKTTSLCAGPSASKNEYEKSRGELKKKKTPGPGKLPPTLSSKYPED 796
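Each HSP summary line reports identities and positives as counts over the alignment length, followed by the same ratio as a percentage. The sketch below shows one way to parse such a line and recompute the percentages; the regular expression and the helper names `parse_hsp_stats` and `percent` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)".
# "Pos\w*" also tolerates alternate spellings of the second label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity/positive counts and the reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }

def percent(count, total):
    """Recompute a percentage to two decimal places."""
    return round(100.0 * count / total, 2)

stats = parse_hsp_stats(
    "Identity = 201/807 (24.91%), Positives = 322/807 (39.90%), Query Frame = 1"
)
print(percent(stats["identities"], stats["align_len"]))
print(percent(stats["positives"], stats["align_len"]))
```

For the NADAP_HUMAN hit, 201 of 807 aligned positions are identical and 322 of 807 are scored as positive, which reproduces the reported 24.91% and 39.90%.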

BLAST of Cp4.1LG02g14090 vs. Swiss-Prot
Match: YOT2_CAEEL (Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans GN=ZK632.2 PE=3 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 3.9e-06
Identity = 168/725 (23.17%), Positives = 300/725 (41.38%), Query Frame = 1

Query: 47  KNPT-PPDSDPPAPTATQEDESSVISVNSDASEPVDKVPDTP-PSDKAVELAPKQPQSVA 106
           K+P+ PP    PAP + ++  +    ++      +D++      ++K  +++ + P   A
Sbjct: 9   KSPSLPPSHHAPAPMSPEKIRAPAEQMDGPVEGVIDEIETAEVQAEKESKISVQAP---A 68

Query: 107 VPYTIPSWSGA--PSHRFYLEVLKDGCIIDQFDVYEKGAYMF---GRVDL-CDFVLEHPT 166
           + Y +P W+    P+H+F  E+LK+G +I  +D+  +    F   GR+   CD ++EHP+
Sbjct: 69  LHYEVPPWACEPDPAHKFQFEILKEGKLIASYDLSNRKNSTFVVIGRIKPGCDLLMEHPS 128

Query: 167 ISRFHAVLQF------RSSGDAYLYDLGSTHGTFINK-----NQFIS-----IVRF---S 226
           ISR+H +LQ+      ++    ++++LGSTHG+ +NK      Q+I      I +F   +
Sbjct: 129 ISRYHCILQYGNDKMSKTGKGWHIFELGSTHGSRMNKKRLPPKQYIRTRVGFIFQFGEST 188

Query: 227 RAKSFDATFELFWTEHQESDLTMIKKAKIREQTLDREASLRRARQEASLAD--------G 286
           R  +F    E    E   S   M    K+R+   + EA LR A  +  + D        G
Sbjct: 189 RILNFVGPEEDSEPEWDCSPTEM----KLRKHKKELEAKLRAAAAQEMIDDEKREKEEEG 248

Query: 287 ISWGM--GEDAVEEAEDEVDEVTWQTYKGQLTEKQQKTREKVLKR----------TEKIS 346
             WGM  GED       E D    +  +    +  +K  +K  +R           +   
Sbjct: 249 CGWGMDYGEDEKPLTTVETDAHLMEDREAYYNQDPKKALQKFFEREGFDMNFEFSEQGQG 308

Query: 347 HMKKEIDAIRAK---DISQGGLTQGQQTQIARNEQRITQIMEELENLEETLNDSIRESLG 406
           H  K + +I      D      T       ++ + +I   ++    L +T N        
Sbjct: 309 HTHKWVCSIELPVEIDGVDRAFTASATVSTSKKDAQIQCALDACRIL-DTYN-------V 368

Query: 407 ARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQ-----------SIETA 466
            R     L  ++  +E ++    DDD + DRT +   ++    Q             +T 
Sbjct: 369 LRKSNTKLRMQRKTLEANDYYDEDDDLYLDRTGQLEKQREKRKQWAEEGFGHKRTETDTY 428

Query: 467 DSLLDKRDALNKEMDE-KKRLLLIEENKMESHTDLDSGNDALDAYM--------SGLSSQ 526
           +SL  K +   KE+ E +K L  +     +S T +D G D LD Y+        +G  ++
Sbjct: 429 ESLCRKLEESKKEIIECQKHLDELSAGTKKSRT-IDQGGDVLDDYIRQLEKSGGAGDDAK 488

Query: 527 LVLDKTTKLQNELSSLQSELDRILYLLKIADPS-GEAAKKRETSA--------------- 586
             ++K +K + +L +   E  ++  L+KIA P+  +  ++ ET+A               
Sbjct: 489 TKMEK-SKWRQKLMAATHESQKLEKLVKIAKPAVVKGLEQLETTAANDRQAFLKKLMGVR 548

Query: 587 --KKIDSNLEAKP---ENFKVPASVNGKPQKELVKDGESK------EQVVDA---KQKIK 646
             K+ID      P    +  +PA+V     K +  + E K      E+ + A     +IK
Sbjct: 549 ARKEIDQTPSQGPGPSTSATLPATVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIK 608

Query: 647 TTQESVEPNESVTEKVVDDTKDKKTISYTVVK--PQWLGAIEELKSEETQKDAAPLDIQE 670
            +  +V+   SV ++V ++T  K+     V K   QW   +E  K E  +K     + +E
Sbjct: 609 NSLPAVDEPSSVKDEVSEETPQKEAFGSKVQKRVAQWEEELEAEKEELAKKQKLEAE-EE 668

BLAST of Cp4.1LG02g14090 vs. TrEMBL
Match: A0A061ENF0_THECC (SMAD/FHA domain-containing protein OS=Theobroma cacao GN=TCM_019051 PE=4 SV=1)

HSP 1 Score: 747.3 bits (1928), Expect = 1.9e-212
Identity = 461/773 (59.64%), Positives = 550/773 (71.15%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDP-PAP 60
           MTT MGPPPPRNP+ ++  + +   +  +  S  T  K + GPP P  P PP   P P  
Sbjct: 1   MTTTMGPPPPRNPNPSAEPEPEPEPVTQEE-SEPTTAKASTGPPPP--PPPPAKKPNPQN 60

Query: 61  TATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHR 120
              QE ES     NSD SEP            ++E      QS  VPYTIP WSG PSH 
Sbjct: 61  PQDQEKES-----NSD-SEP-----------NSIEKPSNSKQS-PVPYTIPQWSGPPSHH 120

Query: 121 FYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDL 180
           F+LE+LKDGCIIDQF V EKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSG AYLYDL
Sbjct: 121 FFLEILKDGCIIDQFKVNEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGQAYLYDL 180

Query: 181 GSTHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQESDLTMIKKA 240
           GSTHGTFINK+Q             ++RF   SR   F    EL      E DL ++K A
Sbjct: 181 GSTHGTFINKSQVTKRTYVDLNVGDVIRFGHSSRLYIFQGPSELM---PPEKDLKIMKDA 240

Query: 241 KIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQK 300
           KI+E+ LDREASLRRAR EASLADGISWG+GEDA+EEAED+ DE+TWQTYKGQLTEKQ+K
Sbjct: 241 KIQEEMLDREASLRRARAEASLADGISWGIGEDAIEEAEDDADEMTWQTYKGQLTEKQEK 300

Query: 301 TREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEET 360
           T +K++KRTEKI+HMKKEIDAIRAKDI+QGGLTQGQQTQIARNEQRITQIMEELENLEET
Sbjct: 301 THDKIIKRTEKIAHMKKEIDAIRAKDIAQGGLTQGQQTQIARNEQRITQIMEELENLEET 360

Query: 361 LNDSIRESLGARSGIRSLGKKQGGME-NDEELLSDDDDFYDRT-KKPSHKKTGENQSIET 420
           LN+SIRES+GAR+G  S GK++GG E +DE+  SDDD+FYDRT KKP+  K GE QSIET
Sbjct: 361 LNESIRESIGARAGRISHGKRKGGPEDDDEDFSSDDDEFYDRTKKKPTVLKVGETQSIET 420

Query: 421 ADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDS-GNDALDAYMSGLSSQLVLDKTT 480
           ADSLLDKRDA+ KE+++KK LLL EENKM S T L++   DALDAYMSGLSSQLVLD+T 
Sbjct: 421 ADSLLDKRDAIMKEIEDKKELLLSEENKMASETALETEAGDALDAYMSGLSSQLVLDRTV 480

Query: 481 KLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASVNGK 540
           +L+ EL +LQSELDRI YLLKIADP+ EAAKKR+T A+         P+  + PA+V  +
Sbjct: 481 QLEKELFALQSELDRIFYLLKIADPTREAAKKRDTKAQ------APAPDKSRTPAAVKKQ 540

Query: 541 PQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISYTVVKPQWLG 600
           P  E  K   S E      QK      S+E ++   E ++ DT + +   YTV KPQWLG
Sbjct: 541 PPLE-PKISTSTEPANSPMQKEGVADVSMESSKKPEENILSDTAEVRKAIYTVAKPQWLG 600

Query: 601 AIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRK 660
           A+E  + +E+Q++   +   + D FVDYKDRK VL S D+      S IE+ A GLI+RK
Sbjct: 601 AVESKEIKESQQE-VEVKTHKVDQFVDYKDRKKVLGSVDDPLVKGHSGIETTASGLIIRK 660

Query: 661 RKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPT 720
           +KQ ++S+G+  AS QSTSS  +  AE  A++AVALLLKH RGYH  DEE   HE+    
Sbjct: 661 QKQVEKSEGDDKASDQSTSS--STGAEEIAQNAVALLLKHTRGYHAEDEE--LHETPEML 720

Query: 721 GRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
            R + KK EKK KRV+GPEKPSFL++  +Y+SWVPPEGQSGDGRTTLN+RYGY
Sbjct: 721 ARNQLKKKEKKPKRVMGPEKPSFLNSNPEYESWVPPEGQSGDGRTTLNDRYGY 737

BLAST of Cp4.1LG02g14090 vs. TrEMBL
Match: A0A067JNE9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25614 PE=4 SV=1)

HSP 1 Score: 744.6 bits (1921), Expect = 1.2e-211
Identity = 449/784 (57.27%), Positives = 561/784 (71.56%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGT------LEGDSTSSSTETKVTMGPPLPKNPTPPDS 60
           MTTAMGPPPPRNP+  S     A T      L+    SS+T  K+ MGPP P  P P +S
Sbjct: 1   MTTAMGPPPPRNPNPQSSSTGAATTEPEPKILDTPQNSSTTTMKIAMGPPPP--PAPKNS 60

Query: 61  DPPAPTATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSG 120
           D P P   ++ ES+  S+NSD ++  +++                 +  +VPYTIP WSG
Sbjct: 61  DIPEPETVEKTESN--SLNSDTTQLKEQIA----------------KQSSVPYTIPEWSG 120

Query: 121 APSHRFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDA 180
            P H+FYLEVLKDG I+DQ D+ EKGAYMFGRVDLCDFVLEHPT+SRFHAVLQF+ SGDA
Sbjct: 121 PPCHKFYLEVLKDGSIVDQLDICEKGAYMFGRVDLCDFVLEHPTVSRFHAVLQFKRSGDA 180

Query: 181 YLYDLGSTHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQESDLT 240
           YLYD+ STHGTF+NK Q             ++RF   SR   F    EL      E DL 
Sbjct: 181 YLYDINSTHGTFVNKCQVEKRVYVELHVGDVIRFGHSSRLYIFQGPPELM---PPEKDLN 240

Query: 241 MIKKAKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLT 300
           ++++AKIR++ LDREASLRRAR EASLADGI WGMGEDA+EE ED+ DEVTWQTYKGQLT
Sbjct: 241 IVREAKIRQEMLDREASLRRARAEASLADGILWGMGEDAIEEDEDDGDEVTWQTYKGQLT 300

Query: 301 EKQQKTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELE 360
           EKQ+KTR+K++KR EKI+HMKKEIDAIRAKDI+QGGLTQGQQTQIARNEQR+TQI+EELE
Sbjct: 301 EKQEKTRDKIIKRNEKIAHMKKEIDAIRAKDIAQGGLTQGQQTQIARNEQRMTQILEELE 360

Query: 361 NLEETLNDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQS 420
           NLEETLN+SIRES+GAR+G RS G ++G  E+DEEL SDDD+FYDRTKKPS +K   NQS
Sbjct: 361 NLEETLNESIRESIGARAGRRSGGMRKGTAEDDEELSSDDDEFYDRTKKPSMQKASANQS 420

Query: 421 IETADSLLDKRDALNKEMDEKKRLLLIEENKMESHT-DLDSGNDALDAYMSGLSSQLVLD 480
           +ETAD+LLDKRD++ KEM++KK+LLLIE+NK+ S T +     DALDAYMSG+SSQLVLD
Sbjct: 421 VETADTLLDKRDSILKEMEKKKQLLLIEKNKISSETLEETEAGDALDAYMSGVSSQLVLD 480

Query: 481 KTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASV 540
               ++ +LS+LQSELDR+ +LLKIADPSG AAKKR++  ++++S+ + K E   VP++ 
Sbjct: 481 ----MEKKLSALQSELDRVFFLLKIADPSGAAAKKRDSRVEEVNSD-KCKAE---VPSAT 540

Query: 541 NGK-PQKELVKDGESKEQVVDAKQKIKTTQESVEPNES----VTEKVVDDTKDKKTISYT 600
             K P  E  K     E +  +  K KT    V   ES      +K+  +  D K   YT
Sbjct: 541 TKKQPAAEPKKSSGMGEPIAASLMKEKTPDSRVGAKESEKKPEPDKIAINAPDVKPAVYT 600

Query: 601 VVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESA 660
           VVKPQWLGA+ + + +E +++   L+I +SD+FVDYKDR+ +L +SD      DS +ESA
Sbjct: 601 VVKPQWLGAVNDTEMKEIKQEV--LNIDDSDEFVDYKDRQKILINSDGAQGKDDSDLESA 660

Query: 661 APGLILRKRKQEDQ--SDGNLDASQQS-TSSLEAERAEFKAEDAVALLLKHQRGYHGSDE 720
           APGLI+RKRK+ ++   DG    ++QS TSS+E   AE  AEDAVALLLKH+RGYH  DE
Sbjct: 661 APGLIIRKRKETEEPGDDGKKATAEQSITSSME---AELTAEDAVALLLKHKRGYHAEDE 720

Query: 721 EENRHESKRPTGRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNE 757
                  +R  GR++  K+ KK KRVLGPEKPSFL++ +DYDSWVPPEGQSGDGRT+LN+
Sbjct: 721 GGGHQSQER--GRSQHNKDRKKQKRVLGPEKPSFLNSNSDYDSWVPPEGQSGDGRTSLND 746

BLAST of Cp4.1LG02g14090 vs. TrEMBL
Match: A0A0D2QTM4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206200 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 6.0e-211
Identity = 456/788 (57.87%), Positives = 554/788 (70.30%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMG--PPLPKNPTPPDSDPPA 60
           MT  MGPPPPRNP+ ++  +S A        S     K TMG  PPLP NP P  S  P 
Sbjct: 1   MTATMGPPPPRNPNPSTEPESIA-----QEESEPRTAKTTMGPPPPLPINPNP--STEPE 60

Query: 61  PTATQEDE----------SSVISVNSDASEPVD-KVPDTPPSDKAVELAPKQPQSVAVPY 120
             A +E E             + +N +   P+D + P    S+      P  P+  +VPY
Sbjct: 61  SIAPEESELITAKTTMGPPPPLPINPNLQNPLDEEEPSNSKSEPNSTEKPLNPKQSSVPY 120

Query: 121 TIPSWSGAPSHRFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQ 180
           TIP WSG P H F+LEVLKDGCI+D+F V+EKGAYMFGR+DLCDFVLEHPTISRFHAVLQ
Sbjct: 121 TIPPWSGPPCHHFFLEVLKDGCILDRFKVFEKGAYMFGRIDLCDFVLEHPTISRFHAVLQ 180

Query: 181 FRSSGDAYLYDLGSTHGTFINKNQFI----------SIVRF---SRAKSFDATFELFWTE 240
           FRSSG+AYLYDLGSTHGTFINK+Q             ++RF   +R   F    EL    
Sbjct: 181 FRSSGEAYLYDLGSTHGTFINKSQVTKKTYVDLRVGDVIRFGHSTRLYIFQGPSELM--- 240

Query: 241 HQESDLTMIKKAKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQ 300
             E DL +I++AKIRE+ LDREASLRRAR EASL+DGISWGMGEDA+EEAED+ DEVTWQ
Sbjct: 241 PPEKDLKVIREAKIREEMLDREASLRRARAEASLSDGISWGMGEDAIEEAEDDADEVTWQ 300

Query: 301 TYKGQLTEKQQKTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRIT 360
           TYKGQLTEKQ+KTR+K++KRTEKI+HMKKEIDAIRAKDI+QGGLTQGQQTQIARNEQRIT
Sbjct: 301 TYKGQLTEKQEKTRDKIIKRTEKIAHMKKEIDAIRAKDIAQGGLTQGQQTQIARNEQRIT 360

Query: 361 QIMEELENLEETLNDSIRESLGARSGIRSLGKKQGGMENDEE-LLSDDDDFYDRT-KKPS 420
           Q++EELE+LEETLN+SIRES+GAR G  + GK++GG E+DEE + SDDD+FYDRT KKP+
Sbjct: 361 QVLEELESLEETLNESIRESIGARGG-TTRGKRKGGPEDDEEDISSDDDEFYDRTKKKPT 420

Query: 421 HKKTGENQSIETADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDS-GNDALDAYMS 480
            +K GE QSIETADSLLDKRDA+ KE++EKK LLL E+NKM S T L++   DALDAYMS
Sbjct: 421 VQKVGETQSIETADSLLDKRDAITKEIEEKKELLLTEKNKMTSDTGLETEAGDALDAYMS 480

Query: 481 GLSSQLVLDKTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKP 540
           GLSSQLVLD+T +++ ELS+LQSELDRI YLLKIADP+GEAAKKR+  A+         P
Sbjct: 481 GLSSQLVLDRTVQIEKELSALQSELDRIFYLLKIADPTGEAAKKRDMKAQ------VPAP 540

Query: 541 ENFKVP-ASVNGKPQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKK 600
           +  + P A+V  +  KE  K   + E      QK      S+E  +   E VV DT + +
Sbjct: 541 DRPRPPAAAVRKQIAKEPKKISSATEPANSPVQKEGVADVSMESRKKPEENVVSDTSEGE 600

Query: 601 TISYTVVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDS 660
              YTV KPQWLGA+E  + +E+ +    +D  + DDFVDYKDRK VL S+DN      S
Sbjct: 601 KAIYTVAKPQWLGAVENKEIKESNQ-VIVVDTHKVDDFVDYKDRKKVLGSADNPQVKEPS 660

Query: 661 VIESAAPGLILRKRKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGS 720
            IE+ A GLI+R +KQ ++ +     S QST+   +  AE  A++AVALLLKH RGYH  
Sbjct: 661 GIEATASGLIIRTQKQVEKPEAGDKPSDQSTT--PSTGAEEIAQNAVALLLKHTRGYHAD 720

Query: 721 DEEENRHESKRPTGRTRSKKNEKKSKRVLGPEKPSFLDTKAD--YDSWVPPEGQSGDGRT 757
           +EE N  E+   + R +SKK EKK KRVLGPEKPSFLD+  D  Y++WVPPEGQSGDGRT
Sbjct: 721 EEELN--ETPDMSARNQSKKKEKKPKRVLGPEKPSFLDSNPDPEYETWVPPEGQSGDGRT 766

BLAST of Cp4.1LG02g14090 vs. TrEMBL
Match: A0A0D2M173_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206200 PE=4 SV=1)

HSP 1 Score: 739.6 bits (1908), Expect = 3.9e-210
Identity = 455/787 (57.81%), Positives = 553/787 (70.27%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMG--PPLPKNPTPPDSDPPA 60
           MT  MGPPPPRNP+ ++  +S A        S     K TMG  PPLP NP P  S  P 
Sbjct: 1   MTATMGPPPPRNPNPSTEPESIA-----QEESEPRTAKTTMGPPPPLPINPNP--STEPE 60

Query: 61  PTATQEDE----------SSVISVNSDASEPVD-KVPDTPPSDKAVELAPKQPQSVAVPY 120
             A +E E             + +N +   P+D + P    S+      P  P+  +VPY
Sbjct: 61  SIAPEESELITAKTTMGPPPPLPINPNLQNPLDEEEPSNSKSEPNSTEKPLNPKQSSVPY 120

Query: 121 TIPSWSGAPSHRFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQ 180
           TIP WSG P H F+LEVLKDGCI+D+F V+EKGAYMFGR+DLCDFVLEHPTISRFHAVLQ
Sbjct: 121 TIPPWSGPPCHHFFLEVLKDGCILDRFKVFEKGAYMFGRIDLCDFVLEHPTISRFHAVLQ 180

Query: 181 FRSSGDAYLYDLGSTHGTFINKNQFI----------SIVRF---SRAKSFDATFELFWTE 240
           FRSSG+AYLYDLGSTHGTFINK+Q             ++RF   +R   F    EL    
Sbjct: 181 FRSSGEAYLYDLGSTHGTFINKSQVTKKTYVDLRVGDVIRFGHSTRLYIFQGPSELM--- 240

Query: 241 HQESDLTMIKKAKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQ 300
             E DL +I++AKIRE+ LDREASLRRAR EASL+DGISWGMGEDA+EEAED+ DEVTWQ
Sbjct: 241 PPEKDLKVIREAKIREEMLDREASLRRARAEASLSDGISWGMGEDAIEEAEDDADEVTWQ 300

Query: 301 TYKGQLTEKQQKTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRIT 360
           TYKGQLTEKQ+KTR+K++KRTEKI+HMKKEIDAIRAKDI+QGGLTQGQQTQIARNEQRIT
Sbjct: 301 TYKGQLTEKQEKTRDKIIKRTEKIAHMKKEIDAIRAKDIAQGGLTQGQQTQIARNEQRIT 360

Query: 361 QIMEELENLEETLNDSIRESLGARSGIRSLGKKQGGMENDEE-LLSDDDDFYDRT-KKPS 420
           Q++EELE+LEETLN+SIRES+GAR G  + GK++GG E+DEE + SDDD+FYDRT KKP+
Sbjct: 361 QVLEELESLEETLNESIRESIGARGG-TTRGKRKGGPEDDEEDISSDDDEFYDRTKKKPT 420

Query: 421 HKKTGENQSIETADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDS-GNDALDAYMS 480
            +K GE QSIETADSLLDKRDA+ KE++EKK LLL E+NKM S T L++   DALDAYMS
Sbjct: 421 VQKVGETQSIETADSLLDKRDAITKEIEEKKELLLTEKNKMTSDTGLETEAGDALDAYMS 480

Query: 481 GLSSQLVLDKTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKP 540
           GLSSQLVLD+T +++ ELS+LQSELDRI YLLKIADP+GEAAKKR+  A+         P
Sbjct: 481 GLSSQLVLDRTVQIEKELSALQSELDRIFYLLKIADPTGEAAKKRDMKAQ------VPAP 540

Query: 541 ENFKVP-ASVNGKPQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKK 600
           +  + P A+V  +  KE  K   + E      QK      S+E  +   E VV DT + +
Sbjct: 541 DRPRPPAAAVRKQIAKEPKKISSATEPANSPVQKEGVADVSMESRKKPEENVVSDTSEGE 600

Query: 601 TISYTVVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDS 660
              YTV KPQWLGA+E  + +E+ +    +D  + DDFVDYKDRK VL S+DN      S
Sbjct: 601 KAIYTVAKPQWLGAVENKEIKESNQ-VIVVDTHKVDDFVDYKDRKKVLGSADNPQVKEPS 660

Query: 661 VIESAAPGLILRKRKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGS 720
            IE+ A GLI+R +KQ ++ +     S QST+   +  AE  A++AVALLLKH RGYH  
Sbjct: 661 GIEATASGLIIRTQKQVEKPEAGDKPSDQSTT--PSTGAEEIAQNAVALLLKHTRGYHAD 720

Query: 721 DEEENRHESKRPTGRTRSKKNEKKSKRVLGPEKPSFLDTKAD--YDSWVPPEGQSGDGRT 756
           +EE N  E+   + R +SKK EKK KRVLGPEKPSFLD+  D  Y++WVPPEGQSGDGRT
Sbjct: 721 EEELN--ETPDMSARNQSKKKEKKPKRVLGPEKPSFLDSNPDPEYETWVPPEGQSGDGRT 765
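
The two HSP summary lines that head each alignment in these reports carry the bit score, raw score, E-value, and the identity and positive-match ratios. The following is a minimal Python sketch of how those lines could be parsed and the printed percentages re-derived from the ratios. The helper names `parse_hsp` and `percent` are hypothetical and are not part of any BLAST tool; the regular expression accepts both the "Postives" spelling found in some report dumps and the corrected "Positives".

```python
import re

# Patterns for the two summary lines that precede each alignment block.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
HSP_RATIO = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line, ratio_line):
    """Return the numeric fields of one HSP summary as a dict (hypothetical helper)."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    hsp = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": float(m.group(3)),
    }
    # Each ratio is stored as (matches, alignment length, printed percentage).
    for label, num, den, pct in HSP_RATIO.findall(ratio_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(num), int(den), float(pct))
    return hsp


def percent(matches, length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


hsp = parse_hsp(
    "HSP 1 Score: 739.6 bits (1908), Expect = 3.9e-210",
    "Identity = 455/787 (57.81%), Postives = 553/787 (70.27%), Query Frame = 1",
)
identity_pct = percent(hsp["identity"][0], hsp["identity"][1])
positives_pct = percent(hsp["positives"][0], hsp["positives"][1])
```

The denominator here is the alignment length including gap columns (787), not the length of either protein, which is why it differs from one hit to the next.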

BLAST of Cp4.1LG02g14090 vs. TrEMBL
Match: M5WX33_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002013mg PE=4 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 9.5e-209
Identity = 447/773 (57.83%), Positives = 551/773 (71.28%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPT 60
           MTTAM PPP   P + S       +   +++SS+   K  MGPP  KNP+PP       +
Sbjct: 1   MTTAMAPPPDLVPETLS-------SELAETSSSAITMKPPMGPPPAKNPSPPPQSEAPIS 60

Query: 61  ATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHRF 120
             Q   +S I+ +++A+E           D A +    Q Q  AVPYTIP WS AP H+F
Sbjct: 61  EDQPQSNSSINDSTEAAE-----------DNAKQTLKPQSQGFAVPYTIPPWSAAPCHQF 120

Query: 121 YLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDLG 180
            LEVLKDG II+QFDVYEKGAYMFGR+DLCDFVLEHPT+SRFHAVLQF+ SG+AYLYDLG
Sbjct: 121 QLEVLKDGAIINQFDVYEKGAYMFGRIDLCDFVLEHPTVSRFHAVLQFKRSGEAYLYDLG 180

Query: 181 STHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQESDLTMIKKAK 240
           STHGTFINKNQ             ++RF   SR   F    EL      E DL +++ AK
Sbjct: 181 STHGTFINKNQVNKKVYVDLCVGDVIRFGHSSRLYIFQGPSELM---PPEKDLKLLRVAK 240

Query: 241 IREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQKT 300
           +RE  LD+EASL+RAR EASLADGISWGM EDA+EEAE     +TWQTYKGQLTEKQ+KT
Sbjct: 241 MREDILDQEASLQRARLEASLADGISWGMEEDAIEEAE----ALTWQTYKGQLTEKQEKT 300

Query: 301 REKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETL 360
           R       EKI+HMKKEIDAIRAKDISQGGL+QGQQTQIARNEQRI QIMEELENLEETL
Sbjct: 301 R-------EKIAHMKKEIDAIRAKDISQGGLSQGQQTQIARNEQRIAQIMEELENLEETL 360

Query: 361 NDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQSIETADS 420
           N+SIRESLGAR G  S GKK+G  + +EELLSDDD+FYDRTKKPS KK GEN S+ET+D+
Sbjct: 361 NESIRESLGARVGKLSYGKKKGATDEEEELLSDDDEFYDRTKKPSSKKAGENPSVETSDT 420

Query: 421 LLDKRDALNKEMDEKKRLLLIEENKMESH-TDLDSGNDALDAYMSGLSSQLVLDKTTKLQ 480
           LLDKRDA+ KEM+EKK LL IE+NKM S  TD     DALDAYMSGLSSQLVL+KT +LQ
Sbjct: 421 LLDKRDAIMKEMEEKKELLSIEKNKMASKTTDETDAADALDAYMSGLSSQLVLNKTEELQ 480

Query: 481 NELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPA-SVNGKPQ 540
            ELS+LQSELDRI++LLKIADPSGEAAKKR++   K++   E+KP   + PA ++  +P 
Sbjct: 481 KELSALQSELDRIIFLLKIADPSGEAAKKRDS---KVEEVQESKPNKSETPAPAIKKQPP 540

Query: 541 KELVKDGESKEQVVDAKQKIKTTQESVEPN-ESVTEKVVDDTKDKKTISYTVVKPQWLGA 600
            E  +  +  +   D+  K  TT+ S++ + E    ++V D  + K + YTVVKPQWLGA
Sbjct: 541 MEPEESSQPGKPANDSILKEGTTEVSIKSSTELAASEIVTDATEGKNVVYTVVKPQWLGA 600

Query: 601 IEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRKR 660
           +E++K E+  ++AAP +  E+ +FVDYKDRK +L++  +   N++S IE+AAPGLI+RKR
Sbjct: 601 VEDIKMEKGHQEAAPSNQDEAGEFVDYKDRKKILENVSDAKVNMESGIENAAPGLIIRKR 660

Query: 661 KQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEE-ENRHESKRPT 720
           KQ  +S GN   S+Q  +S  +  AEF AEDAVALLLKH+RGY+  D+E ++  E K   
Sbjct: 661 KQVHESKGNDSDSRQQPAS--STGAEFLAEDAVALLLKHKRGYYAPDDETQDVKEGK--- 720

Query: 721 GRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
              +  K++KK KRVLGPEKPSFLDT +D ++WVPPEGQSGDGRT+LN  YGY
Sbjct: 721 ---QLSKDKKKPKRVLGPEKPSFLDTNSD-ETWVPPEGQSGDGRTSLNSHYGY 729

BLAST of Cp4.1LG02g14090 vs. TAIR10
Match: AT5G38840.1 (AT5G38840.1 SMAD/FHA domain-containing protein )

HSP 1 Score: 630.9 bits (1626), Expect = 9.8e-181
Identity = 401/791 (50.70%), Positives = 530/791 (67.00%), Query Frame = 1

Query: 2   TTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPTA 61
           T+AM PPPPRNPS     D +       S S S ET  TM PP P+NP PPD        
Sbjct: 3   TSAMDPPPPRNPSH----DIEPPEPNSTSISQSDETS-TMNPPPPRNPNPPDLKTTEVVV 62

Query: 62  TQE--DESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHR 121
             E  +ES   SV  DA +PV                P+  +   VPYTIP WSG P H+
Sbjct: 63  EPEPIEESKDDSVTVDADKPV---------------RPRTVKQNPVPYTIPEWSGPPCHQ 122

Query: 122 FYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDL 181
           F LEVLK+G I+++ DVY+KGAY+FGR  +CDF LEHP+ISRFHAV+Q++ SG AY++DL
Sbjct: 123 FQLEVLKEGAIVEKLDVYKKGAYLFGRDGICDFALEHPSISRFHAVIQYKRSGAAYIFDL 182

Query: 182 GSTHGTFINKNQ-----FIS-----IVRF---SRAKSFDATFELFWTEHQESDLTMIKKA 241
           GSTHGT +NKN+     F+      ++RF   +R   F    +L      E DL +I++A
Sbjct: 183 GSTHGTTVNKNKVDKKVFVDLNVGDVIRFGGSTRLYIFQGPSDLM---PPEKDLQLIREA 242

Query: 242 KIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQK 301
           K+R +  +REASLRRARQ+AS+ADG+SWGMGEDA+EE ED+V+E+TWQTY G+LT KQ+K
Sbjct: 243 KMRMEMSEREASLRRARQQASMADGVSWGMGEDAIEEEEDDVEEITWQTYSGELTPKQEK 302

Query: 302 TREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEET 361
           T+EKVLKR EKI HMKKE+ AIRAKDISQGGLTQGQQTQIARNEQR  +++EELENLEET
Sbjct: 303 TKEKVLKRLEKIGHMKKEVAAIRAKDISQGGLTQGQQTQIARNEQRTAELLEELENLEET 362

Query: 362 LNDSIRESLGARSGIRSL-GKKQGGMENDEELLSDDDDFYDRT-KKPSHKKTGENQSIET 421
           LNDSIRESLGA++G +   GKK+G +E++E+L SD+DDFYDRT KKPS KK  ENQ++ET
Sbjct: 363 LNDSIRESLGAKTGRKPTHGKKKGIVEDEEDLSSDEDDFYDRTQKKPSTKKGSENQTVET 422

Query: 422 ADSLLDKRDALNKEMDEKKRLLLIEENKMESH--TDLDSGN--DALDAYMSGLSSQLVLD 481
            DSL+DKRD + KE++ K   LL E++KME+   T++ SG+  DALDAYM+GLS+ LV D
Sbjct: 423 VDSLVDKRDNVLKEIEAKNEQLLTEKSKMETENVTEVTSGDSLDALDAYMTGLSTTLVQD 482

Query: 482 KTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKP---ENFKVP 541
           KT ++Q ELS+LQSEL RILYLLKIADP+GE  KKRE  ++++       P   +   +P
Sbjct: 483 KTAQIQQELSTLQSELSRILYLLKIADPTGEEVKKRELKSQELKIKKSETPSVEKKINIP 542

Query: 542 ---ASVNGKPQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISY 601
              A  N   +KE+ KD      +VD++ K             V  K  +  ++KKT  Y
Sbjct: 543 LKQADPNEHKEKEVAKD------LVDSENK-----------PEVENKASETAEEKKTTVY 602

Query: 602 TVVKPQWLG-----AIEELKSEETQKDAAPLD-IQESDDFVDYKDRKDVLQSSDNKPANV 661
              KPQWLG     AI E K+ E    AA  D  +++D FVDYK+RK++  ++    A V
Sbjct: 603 VPSKPQWLGSAANKAIIEEKNPEIV--AATTDSTEDADGFVDYKNRKNIALTAT---AGV 662

Query: 662 DSVIESAAPGLILRKRKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYH 721
           + V      GLI+RKRKQED+S+ + D+ ++        +AE  A+DAVALLLKH  G+H
Sbjct: 663 EVVT-----GLIIRKRKQEDKSEEDDDSKEK--------QAEVMAQDAVALLLKHSVGHH 722

Query: 722 GSDEEEN---RHESKRPTGRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGD 757
            ++E++    + E+ + +G++++KK +K +K+V+GP+KP +LD   DYDSWVPP GQSGD
Sbjct: 723 VNEEDKELSKQEENNQGSGQSKTKKKKKTAKKVVGPDKPEYLDETIDYDSWVPPAGQSGD 735

BLAST of Cp4.1LG02g14090 vs. TAIR10
Match: AT5G47790.1 (AT5G47790.1 SMAD/FHA domain-containing protein )

HSP 1 Score: 65.5 bits (158), Expect = 1.6e-10
Identity = 43/122 (35.25%), Positives = 65/122 (53.28%), Query Frame = 1

Query: 74  SDASEPVDKVPDTPP------SDKAVELAPKQPQSVAVPYTIPSWSGAPSHRFY-LEVLK 133
           S  SEP     + PP      S +A+     Q  +    +  P W+  P    Y LEV+K
Sbjct: 13  SQTSEPFSVSANPPPVVQQHLSPEALSGQKTQIGAGQSNWHPPDWAIEPRAGVYSLEVVK 72

Query: 134 DGCIIDQFDVYEKGAYMFGRV-DLCDFVLEHPTISRFHAVLQFRSSGDAYLYDLGSTHGT 188
           DG I+D+  + ++  ++FGR    CDFVL+H ++SR HA +    +G  ++ DLGS HGT
Sbjct: 73  DGQILDRIHL-DRRRHIFGRQHQTCDFVLDHQSVSRQHAAVVPHKNGSIFVIDLGSAHGT 132

BLAST of Cp4.1LG02g14090 vs. TAIR10
Match: AT3G20550.1 (AT3G20550.1 SMAD/FHA domain-containing protein )

HSP 1 Score: 49.7 bits (117), Expect = 9.3e-06
Identity = 38/142 (26.76%), Positives = 71/142 (50.00%), Query Frame = 1

Query: 65  DESSVISVNSDASEPVDKVPDTPPSDKAVELAPK--QPQSVAVPYTIPSWSGAPSHRFYL 124
           +E SV  + +       K  + P  + + +LA +  + + + + +  P  +  PS R+ L
Sbjct: 139 EEDSVARMRAVEEALAAKKKEEPSFELSGKLAEETNRYRGITLLFNEPPEARKPSERWRL 198

Query: 125 EVLKDGCIIDQ-FDVYEKGAYMFGRVD-LCDFVLEHPTISRFHAVLQFR----------- 184
            V KDG  +++   ++ +  Y+FGR   + D   +HP+ S+ HAV+Q+R           
Sbjct: 199 YVFKDGEPLNEPLCLHRQSCYLFGRERRIADIPTDHPSCSKQHAVIQYREMEKEKPDGMM 258

Query: 185 -SSGDAYLYDLGSTHGTFINKN 191
                 Y+ DLGST+ T+IN++
Sbjct: 259 GKQVKPYIMDLGSTNKTYINES 280

BLAST of Cp4.1LG02g14090 vs. NCBI nr
Match: gi|449438741|ref|XP_004137146.1| (PREDICTED: kanadaptin [Cucumis sativus])

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 625/769 (81.27%), Positives = 676/769 (87.91%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPT 60
           MTT MGPPPPRN S +SPMDSDAG LE DST SST TK  MGPP PK+PT  DSDPPA T
Sbjct: 1   MTTDMGPPPPRNTSPSSPMDSDAGALEEDSTISSTATKAPMGPPPPKSPTSSDSDPPALT 60

Query: 61  ATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHRF 120
           +TQE+ES V S+NSDASE  + V D   SDKAVELA KQPQSV+VPYTIPSWSGAPSHRF
Sbjct: 61  STQENESPVNSMNSDASEHSENVSDGSASDKAVELASKQPQSVSVPYTIPSWSGAPSHRF 120

Query: 121 YLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDLG 180
           YLEVLKDGCIIDQ +VYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRS+GDAYL DLG
Sbjct: 121 YLEVLKDGCIIDQLNVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSNGDAYLCDLG 180

Query: 181 STHGTFINKNQ-----FIS-----IVRF---SRAKSFDATFELFWTEHQESDLTMIKKAK 240
           STHG+FINKNQ     F+      ++RF   SR   F     L   E   SDLT++KKAK
Sbjct: 181 STHGSFINKNQVKKKIFVDLHVGDVIRFGHSSRLYIFQGPNHLMLPE---SDLTVMKKAK 240

Query: 241 IREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQKT 300
           +RE+TLDREASL+RAR+EAS+ADGISWGMGEDAVEEAEDEVDE+TWQTY GQLTEKQQKT
Sbjct: 241 MREETLDREASLQRARREASVADGISWGMGEDAVEEAEDEVDEITWQTYNGQLTEKQQKT 300

Query: 301 REKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETL 360
           REKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETL
Sbjct: 301 REKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETL 360

Query: 361 NDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQSIETADS 420
           NDSIRESLGARSGIRS GKK GGME+DEE+LSDDDDFYDRTKKPS+KK  +NQSIETADS
Sbjct: 361 NDSIRESLGARSGIRSRGKKGGGMEDDEEVLSDDDDFYDRTKKPSNKKADQNQSIETADS 420

Query: 421 LLDKRDALNKEMDEKKRLLLIEENKMESHTDLDSGNDALDAYMSGLSSQLVLDKTTKLQN 480
           LLDKRDA+ KEM+EK+ LLL EENKMES TDLD+G DALDAYMSGLSSQLVLDKTTKLQN
Sbjct: 421 LLDKRDAIKKEMEEKRELLLREENKMESQTDLDTGTDALDAYMSGLSSQLVLDKTTKLQN 480

Query: 481 ELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASVNGKPQKE 540
           ELSSLQ ELDRILYLLKIADPSGEAAKKRE+SAKK DSN+ AKPE F VP SVNGKP K 
Sbjct: 481 ELSSLQPELDRILYLLKIADPSGEAAKKRESSAKKSDSNVGAKPEKFNVPTSVNGKPCKG 540

Query: 541 LVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISYTVVKPQWLGAIEE 600
            +KDG+SKEQV+DAKQ++KT Q+SVEPN+ VTEK+VDD KDKK ISYT  KPQWLGA+EE
Sbjct: 541 PLKDGDSKEQVLDAKQEVKTAQDSVEPNDLVTEKIVDDAKDKKVISYTAAKPQWLGAVEE 600

Query: 601 LKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRKRKQE 660
           +KSEE QK+A PLDIQESDDFVDYKDRK+VLQ+SDNKP  +DSVIESAAPGLILRKRKQE
Sbjct: 601 MKSEEIQKEAVPLDIQESDDFVDYKDRKEVLQNSDNKPTKIDSVIESAAPGLILRKRKQE 660

Query: 661 DQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPTGRTR 720
           D SD  LDASQQST+S E +RA+FKAEDAVALLLKHQRGYHGSDEEE RHESKR TGR +
Sbjct: 661 DLSDSPLDASQQSTASSEVDRAKFKAEDAVALLLKHQRGYHGSDEEEVRHESKRSTGRNK 720

Query: 721 SKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
           SKK+EKK KRVLGPEKPSFLD KADY+SWVPPEGQSGDGRT LNERYGY
Sbjct: 721 SKKDEKKPKRVLGPEKPSFLDAKADYESWVPPEGQSGDGRTALNERYGY 766

BLAST of Cp4.1LG02g14090 vs. NCBI nr
Match: gi|659111073|ref|XP_008455566.1| (PREDICTED: kanadaptin [Cucumis melo])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 628/771 (81.45%), Positives = 675/771 (87.55%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPT 60
           MTT MGPPPPRN  S+SPMDSDA  LE DST SST TK  MG P PK PTPPDSDPPA T
Sbjct: 1   MTTDMGPPPPRNTFSSSPMDSDAVALEEDSTVSSTATKAPMGLPPPKIPTPPDSDPPALT 60

Query: 61  ATQEDESSVISVNSDASEPVDKVPD--TPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSH 120
           +TQE+ES V S+NSDASE  +KV D     SDKAVELA KQPQSV+VPYTIPSWSG PSH
Sbjct: 61  STQENESPVNSINSDASEHTEKVSDGSASASDKAVELASKQPQSVSVPYTIPSWSGVPSH 120

Query: 121 RFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYD 180
           RFYLEVLKDGCI+DQ +VYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRS+GDAYLYD
Sbjct: 121 RFYLEVLKDGCIVDQLNVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSNGDAYLYD 180

Query: 181 LGSTHGTFINKNQ-----FIS-----IVRF---SRAKSFDATFELFWTEHQESDLTMIKK 240
           LGSTHG+FINKNQ     F+      ++RF   SR   F     L   E   +DLT++KK
Sbjct: 181 LGSTHGSFINKNQVKKRVFVDLHVGDVIRFGHSSRLYIFQGPNHLMLPE---ADLTLMKK 240

Query: 241 AKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQ 300
           AK+RE+TL+REASLRRARQEASLADGISWGMGEDAVEE EDEVDEVTWQTY GQLTEKQQ
Sbjct: 241 AKMREETLEREASLRRARQEASLADGISWGMGEDAVEETEDEVDEVTWQTYSGQLTEKQQ 300

Query: 301 KTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEE 360
           KTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEE
Sbjct: 301 KTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEE 360

Query: 361 TLNDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQSIETA 420
           TLNDSIRESLGARSGIRS GKK GGME+DEE+LSDDDDFYDRTKKPS+KK GENQSIETA
Sbjct: 361 TLNDSIRESLGARSGIRSRGKKGGGMEDDEEVLSDDDDFYDRTKKPSNKKAGENQSIETA 420

Query: 421 DSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDSGNDALDAYMSGLSSQLVLDKTTKL 480
           DSLLDKRDA+ KEM+EK+ LLL EENKMES T LD+G DALDAYMSGLSSQLVLDKTTKL
Sbjct: 421 DSLLDKRDAIKKEMEEKRGLLLSEENKMESQTYLDTGTDALDAYMSGLSSQLVLDKTTKL 480

Query: 481 QNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASVNGKPQ 540
           QNELSSLQSELDRILYLLKIADPSGEAAKKRETSA+K DSN+ AKPE F VP+SVNGKP 
Sbjct: 481 QNELSSLQSELDRILYLLKIADPSGEAAKKRETSAQKSDSNVGAKPEKFNVPSSVNGKPC 540

Query: 541 KELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISYTVVKPQWLGAI 600
           K  +KDG+SKEQVVDAKQ++KT Q+SVEPN+SVTEK+VDD KDKKTISYT VKPQWLGA+
Sbjct: 541 KGPLKDGDSKEQVVDAKQEVKTAQDSVEPNDSVTEKIVDDAKDKKTISYTAVKPQWLGAV 600

Query: 601 EELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRKRK 660
           EE+KSEE Q +A PLDIQESDDFVDYKDRK+VLQ+SD KP  +DSVIESAAPGLILRKRK
Sbjct: 601 EEMKSEEIQ-EAVPLDIQESDDFVDYKDRKEVLQNSDIKPTKMDSVIESAAPGLILRKRK 660

Query: 661 QEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPTGR 720
           QED SD   DASQQSTSS E ++AEF AEDAVALLLKHQRGYHGSDEEE RHESK  TGR
Sbjct: 661 QEDLSDSPFDASQQSTSSSEVDKAEFMAEDAVALLLKHQRGYHGSDEEEVRHESKCSTGR 720

Query: 721 TRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
            + KK+EKK KRVLGPEKPSFLDTKADY+SWVPPEGQSGDGRT LNERYGY
Sbjct: 721 NKLKKDEKKPKRVLGPEKPSFLDTKADYESWVPPEGQSGDGRTALNERYGY 767

BLAST of Cp4.1LG02g14090 vs. NCBI nr
Match: gi|645248488|ref|XP_008230320.1| (PREDICTED: kanadaptin [Prunus mume])

HSP 1 Score: 761.5 bits (1965), Expect = 1.4e-216
Identity = 453/772 (58.68%), Positives = 556/772 (72.02%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDPPAPT 60
           MTTAM PPP   P + S   ++       ++SS+   K  MGPP  KNPTPP        
Sbjct: 1   MTTAMAPPPDPVPETLSSEPAE-------TSSSAITMKPPMGPPPAKNPTPPPQSEAPIA 60

Query: 61  ATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHRF 120
             Q   +S I+ +++A+E           D A ++   Q Q  AVPYTIP WS AP H+F
Sbjct: 61  EEQPQSNSSINDSTEAAE-----------DNAKQILKPQSQGFAVPYTIPPWSAAPCHQF 120

Query: 121 YLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDLG 180
            LEVLKDG II+QFDVYEKGAYMFGR+DLCDFVLEHPT+SRFHAVLQF  SG+AYLYDLG
Sbjct: 121 QLEVLKDGAIINQFDVYEKGAYMFGRIDLCDFVLEHPTVSRFHAVLQFTRSGEAYLYDLG 180

Query: 181 STHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQESDLTMIKKAK 240
           STHGTFINKNQ             ++RF   SR   F    EL      E+DL +++ AK
Sbjct: 181 STHGTFINKNQVNKKVYVDLCVGDVIRFGHSSRLYIFQGPSELM---PPENDLKLLRVAK 240

Query: 241 IREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQKT 300
           +RE  LD+EASL+RAR EASLADGISWGM EDA+EEAED+ +EVTWQTYKGQLTEKQ+KT
Sbjct: 241 MREDILDQEASLQRARLEASLADGISWGMEEDAIEEAEDDGEEVTWQTYKGQLTEKQEKT 300

Query: 301 REKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEETL 360
           REKVLKR EKI+HMKKEIDAIRAKDISQGGL+QGQQTQIARNEQRI QIMEELENLEETL
Sbjct: 301 REKVLKRLEKIAHMKKEIDAIRAKDISQGGLSQGQQTQIARNEQRIAQIMEELENLEETL 360

Query: 361 NDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSHKKTGENQSIETADS 420
           N+SIRESLGAR G  S GKK+G  + +EELLSDDD+FYDRTKKPS KK GEN S+ET+D+
Sbjct: 361 NESIRESLGARVGKLSYGKKKGATDEEEELLSDDDEFYDRTKKPSSKKAGENPSVETSDT 420

Query: 421 LLDKRDALNKEMDEKKRLLLIEENKMESH-TDLDSGNDALDAYMSGLSSQLVLDKTTKLQ 480
           LLDKRDA+ KEM+EKK LL IE++KM S  TD     DALDAYMSGLSSQLVL+KT +LQ
Sbjct: 421 LLDKRDAIMKEMEEKKELLSIEKDKMASKTTDETDAADALDAYMSGLSSQLVLNKTEELQ 480

Query: 481 NELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPA-SVNGKPQ 540
            ELS+LQSELDRI++LLKIADPSGEAAKKR++  +++    E+KP   + PA ++  +P 
Sbjct: 481 KELSALQSELDRIIFLLKIADPSGEAAKKRDSKVQEVQ---ESKPNKSETPAPAIKKQPP 540

Query: 541 KELVKDGESKEQVVDAKQKIKTTQESVEPN-ESVTEKVVDDTKDKKTISYTVVKPQWLGA 600
            E  +  +  +   D+  K  TT+ S++ + E    K+V D  + K + Y+VVKPQWLGA
Sbjct: 541 MEPKESSQPGKPANDSILKEGTTEVSIKSSTELAASKIVTDATEGKNVVYSVVKPQWLGA 600

Query: 601 IEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRKR 660
           +E++K E+  ++AAP +  E+ +FVDYKDRK +L++  +   N++S IE+AAPGLI+RK 
Sbjct: 601 VEDIKMEKGHQEAAPSNQDEAGEFVDYKDRKKILENVSDAEVNMESGIENAAPGLIIRKW 660

Query: 661 KQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPTG 720
           KQ  +S GN   S+Q  +S  +  AEF AEDAVALLLKH+RGY+  D+E           
Sbjct: 661 KQVHESKGNDSDSRQQPAS--STGAEFMAEDAVALLLKHKRGYYAPDDE----------- 720

Query: 721 RTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
            T+    +KK KRVLGPEKPSFLDT +D ++WVPPEGQSGDGRT+LN RYGY
Sbjct: 721 -TQELSKDKKPKRVLGPEKPSFLDTNSD-ETWVPPEGQSGDGRTSLNSRYGY 733

BLAST of Cp4.1LG02g14090 vs. NCBI nr
Match: gi|1009127052|ref|XP_015880488.1| (PREDICTED: kanadaptin [Ziziphus jujuba])

HSP 1 Score: 761.1 bits (1964), Expect = 1.8e-216
Identity = 460/792 (58.08%), Positives = 562/792 (70.96%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASP--------MDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPP 60
           MTTAMGPPPP  P+S  P        +D  + +    S+SS    K  MGPP P  P PP
Sbjct: 1   MTTAMGPPPPPPPTSPKPPTDPPPQTLDQPSSSSSSSSSSSDMTEKTLMGPPSPPPPLPP 60

Query: 61  -DSDPPAPTATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPS 120
            ++ P  P +T  +E    S  +D+    +      P+++       +P ++AVPYT P 
Sbjct: 61  PEAGPSQPESTAPEEQLQSSSTTDSDVVAE------PAERTSAEQVSRPHNIAVPYTKPP 120

Query: 121 WSGAPSHRFYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSS 180
           WSG P H+F LEVLKDG IIDQFDVYEKGAYMFGRVDLCDFVL+HPTISRFHAVLQF+ S
Sbjct: 121 WSGPPIHKFSLEVLKDGSIIDQFDVYEKGAYMFGRVDLCDFVLDHPTISRFHAVLQFKRS 180

Query: 181 GDAYLYDLGSTHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQES 240
           GDAY+YDL STHGTFINKNQ             ++RF   SR   F    EL      E+
Sbjct: 181 GDAYIYDLSSTHGTFINKNQVDKKVYVDLHVGDVIRFGHSSRLYIFQGPTELM---PSET 240

Query: 241 DLTMIKKAKIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKG 300
           DL  I+KAK+ E+ LDREASLRRAR EASLADGISWGMGEDA+EEAED+VDE+TWQTYKG
Sbjct: 241 DLKAIRKAKMYEENLDREASLRRARMEASLADGISWGMGEDAIEEAEDDVDEITWQTYKG 300

Query: 301 QLTEKQQKTREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIME 360
           QLTEKQ+KTREKV+KR EKI+HMKKEIDAIRAKDISQGGLTQGQQTQIARNEQR+TQIME
Sbjct: 301 QLTEKQEKTREKVIKRMEKIAHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRMTQIME 360

Query: 361 ELENLEETLNDSIRESLGARSGIRSLGKKQGGMENDEELLSDDDDFYDRTKKPSH-KKTG 420
           ELENLEETLN+SIRESLGAR G  S GKK+G  E+D+E LSDDDDFYDRTKK S  KK G
Sbjct: 361 ELENLEETLNESIRESLGARIGKISHGKKKGATEDDDEFLSDDDDFYDRTKKKSSGKKAG 420

Query: 421 ENQSIETADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDS-GNDALDAYMSGLSSQ 480
           ENQSIETAD+L+DKRDA+ +E+ +KK LLL E+NK+ S T  ++ G DALDAYMSGLSSQ
Sbjct: 421 ENQSIETADTLIDKRDAIKREIGDKKELLLKEKNKITSETTEEAVGGDALDAYMSGLSSQ 480

Query: 481 LVLDKTTKLQNELSSLQSELDRILYLLKIADPSGEAAKKR---------ETSAKKIDSNL 540
           LVLDKT +L+ ++S+LQSELDRILYLLKIADP+GEAAKKR         E + K+     
Sbjct: 481 LVLDKTQQLEKDISALQSELDRILYLLKIADPTGEAAKKRNLKTTDQVGEATQKRDLKEK 540

Query: 541 EAKPENFKVPASVNGKPQKELVKDGESKEQVVDAKQKIKTTQESVEPNES-VTEKVVDDT 600
           E K     +P+ +  +P  E   +  + +      QK  +T E+ + +++    +V+ DT
Sbjct: 541 EPKSNRSVIPSVIKKQPSVEAKDNNGTGKPENGFMQKEGSTDETAKLSKNPEAGEVILDT 600

Query: 601 KDKKTISYTVVKPQWLGAIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPA 660
            + KT  YTV KPQWLGA+ +  +EE+    AP  + ++D+FVDYKDRK VL   ++   
Sbjct: 601 TEGKTAVYTVAKPQWLGAVHDRVAEESNPQPAPSHVHDADEFVDYKDRKKVLDDGNDADT 660

Query: 661 NVDSVIESAAPGLILRKRKQEDQSDG-NLDASQQSTSSLEAERAEFKAEDAVALLLKHQR 720
            ++S +E+AAPGLI+RKRKQ  + +G + DA  Q TSS  A  AE  AEDAV+LLLKH++
Sbjct: 661 KMESGLENAAPGLIVRKRKQVHEFEGKSNDAKPQMTSSPSA--AELMAEDAVSLLLKHKK 720

Query: 721 GYHGSDEEENRHESKRPTGRTRSKKNEKKSKRVLGPEKPSFL-DTKADYDSWVPPEGQSG 757
           GYHG D EEN  E+     +TR    +KK KRVLGPEKPSFL D+ +DY++WVPPEGQSG
Sbjct: 721 GYHGMD-EENITETLDEGHQTR---KDKKPKRVLGPEKPSFLVDSNSDYETWVPPEGQSG 777

BLAST of Cp4.1LG02g14090 vs. NCBI nr
Match: gi|590651557|ref|XP_007032923.1| (SMAD/FHA domain-containing protein [Theobroma cacao])

HSP 1 Score: 747.3 bits (1928), Expect = 2.7e-212
Identity = 461/773 (59.64%), Positives = 550/773 (71.15%), Query Frame = 1

Query: 1   MTTAMGPPPPRNPSSASPMDSDAGTLEGDSTSSSTETKVTMGPPLPKNPTPPDSDP-PAP 60
           MTT MGPPPPRNP+ ++  + +   +  +  S  T  K + GPP P  P PP   P P  
Sbjct: 1   MTTTMGPPPPRNPNPSAEPEPEPEPVTQEE-SEPTTAKASTGPPPP--PPPPAKKPNPQN 60

Query: 61  TATQEDESSVISVNSDASEPVDKVPDTPPSDKAVELAPKQPQSVAVPYTIPSWSGAPSHR 120
              QE ES     NSD SEP            ++E      QS  VPYTIP WSG PSH 
Sbjct: 61  PQDQEKES-----NSD-SEP-----------NSIEKPSNSKQS-PVPYTIPQWSGPPSHH 120

Query: 121 FYLEVLKDGCIIDQFDVYEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGDAYLYDL 180
           F+LE+LKDGCIIDQF V EKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSG AYLYDL
Sbjct: 121 FFLEILKDGCIIDQFKVNEKGAYMFGRVDLCDFVLEHPTISRFHAVLQFRSSGQAYLYDL 180

Query: 181 GSTHGTFINKNQFIS----------IVRF---SRAKSFDATFELFWTEHQESDLTMIKKA 240
           GSTHGTFINK+Q             ++RF   SR   F    EL      E DL ++K A
Sbjct: 181 GSTHGTFINKSQVTKRTYVDLNVGDVIRFGHSSRLYIFQGPSELM---PPEKDLKIMKDA 240

Query: 241 KIREQTLDREASLRRARQEASLADGISWGMGEDAVEEAEDEVDEVTWQTYKGQLTEKQQK 300
           KI+E+ LDREASLRRAR EASLADGISWG+GEDA+EEAED+ DE+TWQTYKGQLTEKQ+K
Sbjct: 241 KIQEEMLDREASLRRARAEASLADGISWGIGEDAIEEAEDDADEMTWQTYKGQLTEKQEK 300

Query: 301 TREKVLKRTEKISHMKKEIDAIRAKDISQGGLTQGQQTQIARNEQRITQIMEELENLEET 360
           T +K++KRTEKI+HMKKEIDAIRAKDI+QGGLTQGQQTQIARNEQRITQIMEELENLEET
Sbjct: 301 THDKIIKRTEKIAHMKKEIDAIRAKDIAQGGLTQGQQTQIARNEQRITQIMEELENLEET 360

Query: 361 LNDSIRESLGARSGIRSLGKKQGGME-NDEELLSDDDDFYDRT-KKPSHKKTGENQSIET 420
           LN+SIRES+GAR+G  S GK++GG E +DE+  SDDD+FYDRT KKP+  K GE QSIET
Sbjct: 361 LNESIRESIGARAGRISHGKRKGGPEDDDEDFSSDDDEFYDRTKKKPTVLKVGETQSIET 420

Query: 421 ADSLLDKRDALNKEMDEKKRLLLIEENKMESHTDLDS-GNDALDAYMSGLSSQLVLDKTT 480
           ADSLLDKRDA+ KE+++KK LLL EENKM S T L++   DALDAYMSGLSSQLVLD+T 
Sbjct: 421 ADSLLDKRDAIMKEIEDKKELLLSEENKMASETALETEAGDALDAYMSGLSSQLVLDRTV 480

Query: 481 KLQNELSSLQSELDRILYLLKIADPSGEAAKKRETSAKKIDSNLEAKPENFKVPASVNGK 540
           +L+ EL +LQSELDRI YLLKIADP+ EAAKKR+T A+         P+  + PA+V  +
Sbjct: 481 QLEKELFALQSELDRIFYLLKIADPTREAAKKRDTKAQ------APAPDKSRTPAAVKKQ 540

Query: 541 PQKELVKDGESKEQVVDAKQKIKTTQESVEPNESVTEKVVDDTKDKKTISYTVVKPQWLG 600
           P  E  K   S E      QK      S+E ++   E ++ DT + +   YTV KPQWLG
Sbjct: 541 PPLE-PKISTSTEPANSPMQKEGVADVSMESSKKPEENILSDTAEVRKAIYTVAKPQWLG 600

Query: 601 AIEELKSEETQKDAAPLDIQESDDFVDYKDRKDVLQSSDNKPANVDSVIESAAPGLILRK 660
           A+E  + +E+Q++   +   + D FVDYKDRK VL S D+      S IE+ A GLI+RK
Sbjct: 601 AVESKEIKESQQE-VEVKTHKVDQFVDYKDRKKVLGSVDDPLVKGHSGIETTASGLIIRK 660

Query: 661 RKQEDQSDGNLDASQQSTSSLEAERAEFKAEDAVALLLKHQRGYHGSDEEENRHESKRPT 720
           +KQ ++S+G+  AS QSTSS  +  AE  A++AVALLLKH RGYH  DEE   HE+    
Sbjct: 661 QKQVEKSEGDDKASDQSTSS--STGAEEIAQNAVALLLKHTRGYHAEDEE--LHETPEML 720

Query: 721 GRTRSKKNEKKSKRVLGPEKPSFLDTKADYDSWVPPEGQSGDGRTTLNERYGY 757
            R + KK EKK KRV+GPEKPSFL++  +Y+SWVPPEGQSGDGRTTLN+RYGY
Sbjct: 721 ARNQLKKKEKKPKRVMGPEKPSFLNSNPEYESWVPPEGQSGDGRTTLNDRYGY 737

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match Name    E-value   Identity  Description
NADAP_HUMAN   7.2e-16   24.91     Kanadaptin OS=Homo sapiens GN=SLC4A1AP PE=1 SV=1
YOT2_CAEEL    3.9e-06   23.17     Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans GN=ZK632.2 PE=3 SV=1

Match Name        E-value   Identity  Description
A0A061ENF0_THECC  1.9e-212  59.64     SMAD/FHA domain-containing protein OS=Theobroma cacao GN=TCM_019051 PE=4 SV=1
A0A067JNE9_JATCU  1.2e-211  57.27     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25614 PE=4 SV=1
A0A0D2QTM4_GOSRA  6.0e-211  57.87     Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206200 PE=4 SV=1
A0A0D2M173_GOSRA  3.9e-210  57.81     Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206200 PE=4 SV=1
M5WX33_PRUPE      9.5e-209  57.83     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002013mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G38840.1  9.8e-181  50.70     SMAD/FHA domain-containing protein
AT5G47790.1  1.6e-10   35.25     SMAD/FHA domain-containing protein
AT3G20550.1  9.3e-06   26.76     SMAD/FHA domain-containing protein

Match Name                         E-value   Identity  Description
gi|449438741|ref|XP_004137146.1|   0.0e+00   81.27     PREDICTED: kanadaptin [Cucumis sativus]
gi|659111073|ref|XP_008455566.1|   0.0e+00   81.45     PREDICTED: kanadaptin [Cucumis melo]
gi|645248488|ref|XP_008230320.1|   1.4e-216  58.68     PREDICTED: kanadaptin [Prunus mume]
gi|1009127052|ref|XP_015880488.1|  1.8e-216  58.08     PREDICTED: kanadaptin [Ziziphus jujuba]
gi|590651557|ref|XP_007032923.1|   2.7e-212  59.64     SMAD/FHA domain-containing protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR008984  SMAD_FHA_dom_sf
IPR000253  FHA_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g14090.1  Cp4.1LG02g14090.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                   Source   Source Term        Source Description                          Alignment
IPR000253  Forkhead-associated (FHA) domain  GENE3D   G3DSA:2.60.200.20                                              coord: 110..191, score: 1.8
IPR000253  Forkhead-associated (FHA) domain  PFAM     PF00498            FHA                                         coord: 142..191, score: 9.0
IPR000253  Forkhead-associated (FHA) domain  SMART    SM00240            FHA_2                                       coord: 141..192, score: 1.1
IPR000253  Forkhead-associated (FHA) domain  PROFILE  PS50006            FHA_DOMAIN                                  coord: 142..192, score: 14
IPR008984  SMAD/FHA domain                   unknown  SSF49879           SMAD/FHA domain                             coord: 92..191, score: 3.44
None       No IPR available                  unknown  Coil               Coil                                        coord: 326..353, score: -; coord: 405..432, score: -; coord: 654..674, scor
None       No IPR available                  PANTHER  PTHR23308          NUCLEAR INHIBITOR OF PROTEIN PHOSPHATASE-1  coord: 41..742, score: 6.1E
None       No IPR available                  PANTHER  PTHR23308:SF2      KANADAPTIN                                  coord: 41..742, score: 6.1E
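
The four IPR000253 rows above place the FHA domain at slightly different residue ranges depending on the source database. One simple way to summarise them is to take the intersection of the reported ranges, which gives the stretch that every method agrees on. The sketch below assumes the `coord: start..end` notation is 1-based and inclusive; the helper names `parse_coord` and `consensus_span` are hypothetical and only illustrate the arithmetic.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string (hypothetical helper)."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(m.group(1)), int(m.group(2))


def consensus_span(spans):
    """Intersection of inclusive ranges, or None if they do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


# FHA domain predictions (IPR000253) as listed in the annotation table.
fha_hits = {
    "GENE3D": parse_coord("coord: 110..191"),
    "PFAM": parse_coord("coord: 142..191"),
    "SMART": parse_coord("coord: 141..192"),
    "PROFILE": parse_coord("coord: 142..192"),
}

core = consensus_span(list(fha_hits.values()))
core_length = core[1] - core[0] + 1  # inclusive residue count
```

Under these assumptions the shared core is residues 142 to 191, a span of 50 residues. The union of the ranges would be the more permissive alternative if the aim were to avoid trimming the domain.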

The following gene(s) are paralogous to this gene:

None