Cp4.1LG11g01240 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG11g01240
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: CW7
Location: Cp4.1LG11 : 613013 .. 623469 (-)
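
As a quick check on the location field, the length of the genomic span can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common display convention but is not stated on this page. `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG11 : 613013 .. 623469 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 10457 bases on the minus strand of Cp4.1LG11.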
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
AACGATGCATTTTTTGCTATAGCATCCGACTCCAACGCACCGCATTTCCCAGAAAAGAAACCGCGCCACCTGGTCAGTAAATTCCTCCTGGGCTTGATCGTCTCCCTTCCCTTCTACTGCAAAAACTCTCTCAGAACTAAGATTCCAGAACTTGAGCTTATTTATAAAACCGATTTTCCTTTAGTTTATACATTATTTTTCATCTTGAGAAAGCAGAAGCTTCCGGGAGAAATGCTCGGAGATGGAGAAGAAACTCCTTGCAGGTGCGACTGATATCCTACCTTCTCTTTAACTAGGGACTTTTGAACCTCGTTTCTTAGTTGCTCCACTGGTATATTCGGTTTCTTGACTTCAAATTCTTGTTTCCGATCATTAAATCGGGCTGCATCAATCGGTTCTTAGCAAACTCGCTGATCATCTTGATTTTCATACATTTGCATTCTTCGAGACTGCAGCGATTAATTATATTTTGCTTGATGTTGTCATAAATCGCGATCGATGTCTAGATGCACAAGTTATTGTTATTGGATAAATTGATAATTGTTATGTTTCGGCTACTTCAACTATCCCGCTTCATCCATTAGTGCTTAAGCTTTGCTTGAAGTTGGATAAAAGCACAAGCATACGAAATTAATGAGTTTTTCGTGTTTATTTTTGTAATTTGGTTGTTAAATTTGTAGATATGAATTGCTGAACATGGTTAAGAAGCACTCCAACTTAATAGGACAAACTGAGGTTGATGAGCAAGATGCTTCGGATGTAGAAATGGATCCTCGTTTTTGGCACGATGTAATGGATTTGTATTTTATTCGTGGTAAGGAATCAAGGGGGCGACAAGACGACGATCTCGTATTTTTTATTAGAAAAGTGGTACAATTTTGCTATTGTCTTCTGCTTTCCTTGGTCGCAGAGCCTTTAATTTATTTAGTGCGGTATGAAAGTTCTAATACACTTTCTGACTTATGTTTTGAAAGAAATCTCAAGGATATGGGAACGATGATGATAATGGAGGCACCTCACCTTATTTTGTTCGCAGGTGGGCATCTAAGGTACACAACCTCCATTCTAGGCAGTGATGTTTTATGCATTTAATGAATTTATTTTTCTTTGCCAATGTTATCACATTTTCTAGAAGTGATTGGTGTTTTGAGCACCTTTATCGTGTAAATATCTTATGTGTGCTGTTATTTGTACAAATTAAACTCTTACCTAAATATTGTATTATGACAATAGAATGAGAGTGCCTTGTTTTCCTTGGTTGTTTCTGTTCATGCTATATGTTTTTCTATTATTAATATTGTTGCCAATACTTGCTATAAATCAGACTTAGCAGGTCTTCATTTACAGTGTTTATACTATACCAAGTAGAGCACGGTAATTTGTGGTATGATATCTCTAGGCTATCAAGATAACAAGCTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTGGTTGGAGATGCTTCAGTAGATGTTGATTGGAGGCGCTCATTTTATTTGAACTTGATTGCTCACAGTTCATTTACTGTGACTGTGGCAATTTGCAGGTATGGACTTTTTATGTGCTCTCTTTATTTTCTTAATAGACTTGTCTGATTTTCTTGAACACATTGCCTGCATATGTCTTGAGAGGTTTGAATAGTATGATTGTGGGCATATTATTTTTTATTACCATGCTAATCACATGTTCTGGGAGGATTGCTTGTTTTGCATAATTCTTATATGTTCTAATTATGAACTGAGAAGAAAAATTCTCTGCAGTATTCAACCTTTTTTCTGTTACCTGGTCAAGTCTTTTTGTGACTTTCACTAGTGTCATCTTTATACACTATATCATCTTATAAGAATGGGATCAAATTATTTTGGTTTAGAATAAATTTGTTAGGGATGCAATGCATATTCTTGTAATGACTAATGTTTCATGTTTTTGCCTATTTTTCTTCCACAAACTTTATAAAATCCACCTTTTTGGAGTGTAACATCATCTTTTA
ATATCCTTTGTGAAAATGACCAACACAAACAGAATCCGTTTAGTTAAATACTATTTGGTGTATGATGTAGTTATTGAGGTTTATTTCCTTATTCCTCTTCCATGAAGCTTCATCAATATTGATTTGTTGTTCTGTTTATTGATCTTTTCTCTTATTTCATTCCTTCTTCAGTCATCTGGTTCTTCGAAATCATCAAGCTGGACATACTACATCATTGTCTCCTATATATAAGGTATCTTCATGTGTACTCTTATTTAGTTGTTGGATTAATGTTTTGTCTAGAACTCATGCCCGGAAGGTCACATACTTTTGATGTTGGTGTAATAACAAATATACCTTGGCTATGTATTATGACGACATTTAGTACTTTGGTTTTCACCTTCTTTTATGACTATAATGATCCTTTGATAAAAACCATGGATTTTTGCTGCAAGAAGTAATTTGATTTTCTGGTCATTCAACTGTGGTTTCATGCAGTCTTGCCCTTTAATAAATTTTATTTTCATCATTTATATTTTTCTGTTCGGTTTGGGGCTCTCTGAAGGTTGTAAAGACTGTATATGCATCTCCTAGCCGTGTCAATTTTCATTTGGATTCTAAAAAGGTAAAGTATTTGTTGTCCAGAAAGATTCTCTATCATGCTATACTATCAATGTCAATTTGACTTTATCATAAATTATATTCAGGAAGTGGAGACAACACCTGCATATCCAGATATTTGTTTTGCAGTTGATGACTTTGACTCCACCTTTGATGCAGTGGTAATGTTCGTAATGAATCTCATCACTAGTTTGTTTGTTTGTCTGTTTTTCTTTTTACTCAATAAATATGCAAGTAATTGGTTGTTGGAATTAAGTTTTCATGATCTATGCATAAGCATTTGCTGACTCTGTTTTAACTGTGCTAAAATTTATATGTATATTTTTTTTATCCATGAAGTTCAATGTCAGAGTGTACACGCCTCGACTAACCTCACACGATATTCCCCCAACCCTACTACATTTGGTTGCCTAGGAAATTTGTAGGATAATCTTAAATAGGTGGCCACTATGACTTGAACCTATTCTCACCACGCCATTTACTATTGACCTTGATCCTAATTGACTAGTAGGCCATATATGATAGTTTTTTATGTTTTTTCTTTCTTTCTGAATAAGGCATTTGTTTCCTTTAATGTAAACACACGCTGAATCATTGAACCAGAATTGAATGTTAACTTTTGGAAAAAATTTAAACATATGTAATCATGCACAATCTTTTGGTCAGTCATACTACTTATACACTTAAGGTACCATTGCATTTGTTGAAATTTCAGTGTTCTTATTATTCAGGTCTTAACAGAGACTGACCATTGCTATTGTGTACTTCTCAATGCACATGGTGGTGCAGCTTTTCCCGGAAAGGAGAATGCAGAAAATTGCAATTCTAGTAATCTTATGGCCTCATGTACGGATTCTGATCCTCAAAGTATGAAGGATTCCAAGGTTTGCGTTGCTTCACTTTCCAGCTGTCTTGCTACTATTATTTACTTGTTGTCAAGAGCTACTGAATTGTCCAAGCATTGCGGATCATGTGAATCTCACTTGTAAAATTATAAATAAAACTTTCATGTTAATGTAAAAAACATCATGTTGCCCTACTTCATGCTTGTTTATGTTGAGTTGCTTGAGCTAAGCCATTAGTGAAATTGTAATTCAAACCAAGAGGCATATGGTAGAGCCATAAAGGACTATATTTATAGAGTTTATTGATAGGACTAGTACTTGGAGGGTCTTCCTTCATCTTCTCTCTTTAAGAAATTTCATCTGCTTTCTTCTTCTCTCTTATAATTTGGATAGCTTTATAGATGTTTTTACTTGAATGATTGTAGAGCTTTTCTTATTGTTATCCCTGACATATTTTCTCAAGGGGGTGCTAGCTACACTGTATGGACTATATGGATGATGGGGAATACTTATTATTAGTAACCCATCTTGCAGAAAAGGGAGTTCTCGTTAAAT
ATTCTAAATTTAGTTTCCCTTGGAAGTCAATTTTTTCCTTTTCAATGATCTGTAAATGCATTTTTTAAGCAAGCGGAAGATTTAAACTTTGCGAAATATGAGTATTGACATGATCCATGAGATCCCCTCATGAGTATTGGATAATGGATACTCACATCGTTCTAAATGCTGCAGATTACTCTTTTCTCTGGATTTGTCAGCTACCAAATGGTTCGAGATGCATATGACGGTATATATATCTAGAATATCTTTTTTTCTTCAATGTGGCATAACATTTATCATACTGAGTTTTCCATTCTTATTCAATGCGGAAGAACTTTACTTCGTTCAAATTATCCAGCCTATTTGTTGATATGGGTTGTTGCATCAAATCATTGTTTCCCAGTATGATATAGGATGTATGACGTAAGTAACTAATGCCTTATTGATTCTCTGAGCTAATAGTGGATGTTTTGTCCGTTAGTTTGAATAGCACAGACCTTTTCTACTAATTCTAGAGTTTAAGCTAGCCTTCCACTTGATACCTTTTTCTTTAATATTTATTAATCAAATATCGAAGTTTCTCTATACAATCTATCTGCAACTCTTTTCTTCAACCACTTTGATGGACATATCCCATTAAGAGAAACTAGAGGACTCTTCAATGTGCTCTTTTGTCCTTAGAAATTGAAAGAAGGCTGTGATGCAATTTTTCCTTTTCATTTTCCATCTTGTTCTCTTATCATCATAAGGTAGACTTCATTTATGTGACGTTTCTGCTGAATTGTTTATGATTAGTATTGACTAAGAGAGAAATTCCTGATAAACAGTATTAGATTTAATGTGGTTTTCACACTTGGTGGATAAATGCGTGGCTTGTTATCCCCTTAGAGATAACTTTGTTGAATCGATTGATATCGATCATATATAAAAAATATGTATATATATTTTATTGTAGTGATATTTTACTATTGCTATGTGGTGGCTGGATCATAGTTCAATAGTAGTATACTCCCGTCTATGCATATGGTTTGTATCTTACGGAAGCCTTTTGCAGAAGCGATTTTTATAGTTTGCTTTTCCAAAGTTTATTTTTATCAGAGAGACTAGGTGGTAAGAGATTAAGTTTCTCACTGTCTTTCTTCTCTCTTGCACTTATTGACAGCTGGCAAGTCTAGATTTGGCAGCCTTCTCTCTCTTGGTCATGCTTCTGGCAAAACAGACAAAATTTACATGAAGGGACCTGGAGGACGTGGGGAAGTTGAAGTTGCTGTTTCTGGTGTTGCAGGTAGGTTAGGAAATAATCTTTTCAATGTTATCAAGCCTTTTATTCTTCTAATTGAGTAATAGTACACAATATTTTCCTTAACTATAGCAATTAAAGATAAATTATCTTTGCACCTTTTGTTAGTCCCAACACCCCCCTGGCTCTTCCTCCATTAGAAGGGACGGATACCATTATGATGTTATGAGGCATTTTTTATTCTTCTCCTTGTTTTTAGTTTGTCAAAAAAATCTTGTTTTGAACCAAATGATCCCCAAAAGTTATCCTCGTAGGAAGCTTATCTTACTGGAAAGTCTTAATTTATTACGATTGTAGTAACATAACATTTGTAGTATCAGTTTTGGTAAGCGGGGTGAGAGAATGAGCATTGGATTATTTGAAGATTTATTGCAGTTTAAGTTATTTTATTGGGTGAATAATATTATGCCATTCCTCTTTTCATATGGGTGATAGTTAATTTCATCTGTCTAATTATTGTCTTATATGTGTTTGTATTATAATATCAGATCAAAGCCTGCAGGATTCTGGGCCCTTTTCACCCGTAGTATCAAAGAGAGGATTTGGGATTGGCACCATAGTTAGAAAAGCAGCATCTGTAGCAACTGTTGCTGCAAAACAAGCTTATGCAGCTGCTAGTAGCACTAGTTCTGATGACGAGATGATACCTCTCAAGTGTTGCTTGATGTCCATTTCGTTGCCCTGGGAATACATCGCACACGATCTTTTGTTCAAGGTA
AGTTCATTATTAACCATCGTAACTTCGCATTATCCATATTTCACCACATTTGATCTCAGGAGGTGGGTTGTGGATGTCAGACTTGGGAATTAGATTGGTATTGATTCTTTAGGATTGGTCTGGTAATATTTATGTTTCTTCTTATCTCTTATTTCTCATATTCTTTTTTCTGTTAAAATGTCTTTGATAATAAGAGAGAGGTTTTGATCCTGGGATGTGGTTTATAGATGTCAAAATTGGGATTGAGATTGAGATTGGTATTGATTTTGTAGGATTGTTCTGATAATATTTATTTTTCTTCTTTCTTGTATATTTCTCCTCTTTTTAAAAAAAAAAATTGTTTTTAAAGAAACAAAAATACATTTATCAGAAAAAGTAATGAAACAATAATTTCACCAATATTTTGGAAAATTAGAAATAAGAAACTAATTTTTTCCCCATAATTTATATAATATGTAGAAGCGTTGATACCATGAAAAAAAAAGGTGGAAAATTGTTTTTTCTTCTTGTTGAGAATGAAAACTAGTTATTTTATCTCTTTGAGACTAGAATACTTCTCTGTATACAAAATAAGCTCATTACTAAATTATTAGCATATTCGATACTGATTAGGCACAAAAGAAAACTATGGAAGGGAGGAAGTTGAGGAACAGCATAATTGGGAATGAATCCTGAATAAATGATTTCATCACCAAAATAAATCTTAGGTTTTTTGGCCGAAAGTATGCATCTTAAGTTTTAGTAGTACGTGAAAAATACCCACATTGGCATATCAGATGTAGGGTGCTGCAATTTATTTATTTATTATTATTATTATTATTATTATTATTGGTTTACTCCTTGCACTTTTGGTTATGGAATTACCTATTCTGTAATAATGATCACTAACTTCTAGAGTCATCAACCAGGGAAGTCCGCCAGTGAACTTGTGAGCATCATAGGTTTCCGTCACTGAGATATAAACTGAAGTGGAACTCACCTTTGGAAGGCTCCAAAGTTTCAGGGAGGTACTCCTTGTACGCACACAACATGAAAGAACTATTTGTAAAATTTGATTGTATTTGGATTATTGTTGTTATTTGCATAGTTTTGTGTCGGGTAATAGAGAGGAATCTTGCTGAAGAGATCTTGTGGAATTTAACTGTGTAGCAAGCGCTTGATTCAATTCATAATCACCTACTTAGCAGCAGGCGCAATTTCTTTTCTTTCTTTTTTTTTTTTAGTCAAAGACAATAGGAGGTTAGAGATTTCAATTCCTGATGGGTAATTTAAACGAAGTTCAAGCTATCTTTCATTTGAGTTGTTCCCAAATATTATGTTCAACAGATGGATGGAAAGCACTTTGCATTGCTTCTCCGTTGGAGTTATTCTGTAATTTGTATTTATATCTTTTACCAATGGGGGGCTGGACAACTATCATCTTGTGTATAAAATGATCTTTCATTTCCTGGAATTTTTTTTGTTTTAATTGTTAGATTTTTACAGACTTAACACGCTTCCCTTCCCGGCTTTGAAGTCTCAACTATTTTCAAAATTTTATTACTCTTTCCCTTCTTGGAACTTGTGTTCATTCTCATTCGTTTCTAATCTTATAGCTTTAGTTTGATCAAGATTAATATACTTAATTAGCAAGTTTTTAGAGCTTGTTATGGACACGGAGGCTTGAGTTGCCATATTCTTGAAGGTCCTATCTTTATTTTCTGGCTTGAGCCTCTAACGCATGAGCAATCTATGCTTAACAAAGCTATTCATAGATTCCAAGTATGAACAGCGCTTTAGTGAGTATCTTACCCACTTTTCTGTCGCATCTAAGATTGGAAATACTTGTGCAATAAAAAGTTGTATTTGCATTGAGATACGCGTTAAAAGTAAGCAGATTTGGCAGTATTACATTGGGGTTACAAAAGTAACTTGCAAAGGGGTACCTGATGACGACATCTCAAGTTCCTGGCGGCATGGGCGTCTAGTTAAGGTAAAAATGCAGTGTAAAACATTGGCTT
CTAAGTTGTAATACCATGTTAATTTTCTTCACTTCGTACAGGTAAAAAGTTCAATTTATAGGGGAGAGGTGCAGAACTTGCATGTTAGCCTCCATTACCGGTAAACCTGACCCCTTCGCTTATGCTAAAGGCTAAAACTAATTGTTAATTAGGAAAGACAACCGTAGGAACATTTGCTGTAAATTTATTATGGAATCTTCTCTACTTGAGATTTGGTGTTGTGCATGCCATGTCTTGAACTATGGGCGAAGCTCCGGATCAAGTGAAACCCCAACTGTGCTTTTCATGGTTGGGTTAAAAAAGCTGCCCTCTTCGATCTAATAAATAGGATTAAAGAACATACTTTTTAGTATATAATTAGCTCTCGTAGGCTGGATATATGTGTTCTAGGGTTATAACCATAGCCAATGTCTAGTATGTGGTGCTCGTGTATCTCGAGTAAATATATCACTAAAATGTTAGACAGACATTGTTGAACTCAGGTTGAAATCAAAATGTTAGACAGACTTTTTAACATGTTATGAACTCAGTTTGAAATCCTTTGACTTCCCTCCCATCCTTTGTAGGCAAAATACACCTTGTTGGTCTCCACACTGGCGGGATCAAGTAAATAGGTTAGCCTTCTTTAGATTTGAAGCTATAATATAATTCTTCAAGTCTCTCATTAGACAAACTTGTGGGTACTGGTTAATGAGAGCGGGGTGAAAATCAGAACAAGAAAAAGAACAAACCCTTAAACGCAAAGGTAAAAAGAAAAAGAAAAAGAAAAAGGACTATGGTCTGCACTTTTGTACAATGGGTGTTGCCCATAGTATTCCAAAACCATGCTGATGAAGCTTGGGGTATTGTCTTTCCTTTTTTAGGCTAATTTTTTTATTCATTCATCCAACTGTTCATCCGTTGGCTTACACTGTGTTGCAGCAAAGGCATGAGCTCAACCATACACCATGCCATTCTCCCTTCAACTTTTTAACCTTCATGACTTCTCCATCCCGCCCTGTCGTTTTTTGCTTGACTCTTCATGACTAAGCTAAACCACAATCTCATATTCCCCAACTACTTTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTGCCCGTCTAAAAGTTTAAAGCAACATTTCTGGTTCAGCACCTTGCCACCGCTGCACAACTCCCCATAAAAGATAAATTTTCTTTTCTTTTCTTTTTTTTTTTTTGTCTGAGAAAACTACTGCTATTATGTAGCAGGAAGCATAGTTTGCCACATTACCATGTCTTTTATTATGTGGGTCTTTGCATAACTTTTAAGTGCTAATGGCATAGCTAAAAATGAATAGGAGTAGTAGTAGAAGAGCATTGGTAATGCTTCTGTTGTCTGCCCGATGTTCCTTAAAGCACCTTTCTAATTTTGTTTCCGAAACTACTCTTAGCTCTGTTCATATAAATGGTTTGAGTTATAGGGTTTGTGTTGCCAAATTAATACTGTTCTTCACGTCTGTAATAATTTTTTTTAAAAAATTAGTTAGTTTCTTGAATATATATGTTCATAAAGAAATAATAATCCAAAAGAACTGAAATTAGTGATATGTCCATTCGTAGGAAACAAATCTTCATGATACCTGGTATCATCATAGGGGAAACAAAACTGTCTAAAGAAAACAACGTAAGTTTGTTGGGTTTGTATTAATTGATTTTATTGTTTCCTCATTCTTCTTTAATTTTCTCTTATATATATATATATAAGGGGGGATGTATTATTAAAAAGAAAAAAGAAAACTTGAAAGAAATCACAAAATTAACATACATTTTTCTCATGAAACAAATCACGTGTTATTTTACAATAAAAGTTGTTTTTTTTTTCTTTCGTGGGCAGGTTTAGCTCACTCCTCCAAGGTTCATGCCTTATTGGGTCCCATGTGTCTCTCCCATTTCTTCCATAAATACCCAACACTTCCTCTTCCTTCATCCCCTTTGTTCTCTCTCTCACAGTTTAGTTTCGTTTACATGTCAATCCTGAA
TATGGAGGAAGAAGTTGGGTTTCAAGGCGAGGTACCTGAGAATCATGAGGAGTTCACTGCCACTGAAATCCAAGAGCTTTTGTCACTCTTTCTAGTCAACGACGGGCCTCCAAGTTCCGGTTCCGATTCACAGGGTTCGATGCCGACCTCGGCAGCTTGTTCAACCAATGACGACGAGAGGAAGCTGCGTCGGATGATATCAAATCGGGAGTCAGCCCGGCGGTCACGGTGTAGGAAAAAGAGGCATTTGGAAGATCTGACTAATGAGGTGAACCGGTTGATGATCCAGAACCGGGAATTGAAGGATCAGCTCGGCCGGATTCTCAGCCGCCGCCATATGGTTCTGAGAGAAAATGATTGGTTGTGGATGGAGTCCGTGGGTCTTCGGGCCAGACTTTCGGATCTTTGCCGGATCTTTGCCGTCATGCAATAACCGTCAAAGAAGCTTACGTAAAAG

mRNA sequence

AACGATGCATTTTTTGCTATAGCATCCGACTCCAACGCACCGCATTTCCCAGAAAAGAAACCGCGCCACCTGGTCAGTAAATTCCTCCTGGGCTTGATCGTCTCCCTTCCCTTCTACTGCAAAAACTCTCTCAGAACTAAGATTCCAGAACTTGAGCTTATTTATAAAACCGATTTTCCTTTAGTTTATACATTATTTTTCATCTTGAGAAAGCAGAAGCTTCCGGGAGAAATGCTCGGAGATGGAGAAGAAACTCCTTGCAGATATGAATTGCTGAACATGGTTAAGAAGCACTCCAACTTAATAGGACAAACTGAGGTTGATGAGCAAGATGCTTCGGATGTAGAAATGGATCCTCGTTTTTGGCACGATGTAATGGATTTGTATTTTATTCGTGGTAAGGAATCAAGGGGGCGACAAGACGACGATCTCAAATCTCAAGGATATGGGAACGATGATGATAATGGAGGCACCTCACCTTATTTTGTTCGCAGGTCTTCATTTACAGTGTTTATACTATACCAAGTAGAGCACGATGCTTCAGTAGATGTTGATTGGAGGCGCTCATTTTATTTGAACTTGATTGCTCACAGTTCATTTACTGTGACTGTGGCAATTTGCAGTCATCTGGTTCTTCGAAATCATCAAGCTGGACATACTACATCATTGTCTCCTATATATAAGGTTGTAAAGACTGTATATGCATCTCCTAGCCGTGTCAATTTTCATTTGGATTCTAAAAAGGAAGTGGAGACAACACCTGCATATCCAGATATTTGTTTTGCAGTTGATGACTTTGACTCCACCTTTGATGCAGTGGTCTTAACAGAGACTGACCATTGCTATTGTGTACTTCTCAATGCACATGGTGGTGCAGCTTTTCCCGGAAAGGAGAATGCAGAAAATTGCAATTCTAGTAATCTTATGGCCTCATGTACGGATTCTGATCCTCAAAGTATGAAGGATTCCAAGATTACTCTTTTCTCTGGATTTGTCAGCTACCAAATGGTTCGAGATGCATATGACGCTGGCAAGTCTAGATTTGGCAGCCTTCTCTCTCTTGGTCATGCTTCTGGCAAAACAGACAAAATTTACATGAAGGGACCTGGAGGACGTGGGGAAGTTGAAGTTGCTGTTTCTGGTGTTGCAGATCAAAGCCTGCAGGATTCTGGGCCCTTTTCACCCGTAGTATCAAAGAGAGGATTTGGGATTGGCACCATAGTTAGAAAAGCAGCATCTGTAGCAACTGTTGCTGCAAAACAAGCTTATGCAGCTGCTAGTAGCACTAGTTCTGATGACGAGATGATACCTCTCAAGTGTTGCTTGATGTCCATTTCGTTGCCCTGGGAATACATCGCACACGATCTTTTGTTCAAGATTTGGCAGTATTACATTGGGGTTACAAAAGTAACTTGCAAAGGGGTACCTGATGACGACATCTCAAGTTCCTGGCGGCATGGGCGTCTAGTTAAGGTTTAGCTCACTCCTCCAAGGTTCATGCCTTATTGGGTCCCATGTGTCTCTCCCATTTCTTCCATAAATACCCAACACTTCCTCTTCCTTCATCCCCTTTGTTCTCTCTCTCACAGTTTAGTTTCGTTTACATGTCAATCCTGAATATGGAGGAAGAAGTTGGGTTTCAAGGCGAGGTACCTGAGAATCATGAGGAGTTCACTGCCACTGAAATCCAAGAGCTTTTGTCACTCTTTCTAGTCAACGACGGGCCTCCAAGTTCCGGTTCCGATTCACAGGGTTCGATGCCGACCTCGGCAGCTTGTTCAACCAATGACGACGAGAGGAAGCTGCGTCGGATGATATCAAATCGGGAGTCAGCCCGGCGGTCACGGTGTAGGAAAAAGAGGCATTTGGAAGATCTGACTAATGAGGTGAACCGGTTGATGATCCAGAACCGGGAATTGAAGGATCAGCTCGGCCGGATTCTCAGCCGCCGCCATATGGTTCTGAGAGAAAATGATTGGTTGTGGATGGAGTCCGTGGGT
CTTCGGGCCAGACTTTCGGATCTTTGCCGGATCTTTGCCGTCATGCAATAACCGTCAAAGAAGCTTACGTAAAAG

Coding sequence (CDS)

AACGATGCATTTTTTGCTATAGCATCCGACTCCAACGCACCGCATTTCCCAGAAAAGAAACCGCGCCACCTGGTCAGTAAATTCCTCCTGGGCTTGATCGTCTCCCTTCCCTTCTACTGCAAAAACTCTCTCAGAACTAAGATTCCAGAACTTGAGCTTATTTATAAAACCGATTTTCCTTTAGTTTATACATTATTTTTCATCTTGAGAAAGCAGAAGCTTCCGGGAGAAATGCTCGGAGATGGAGAAGAAACTCCTTGCAGATATGAATTGCTGAACATGGTTAAGAAGCACTCCAACTTAATAGGACAAACTGAGGTTGATGAGCAAGATGCTTCGGATGTAGAAATGGATCCTCGTTTTTGGCACGATGTAATGGATTTGTATTTTATTCGTGGTAAGGAATCAAGGGGGCGACAAGACGACGATCTCAAATCTCAAGGATATGGGAACGATGATGATAATGGAGGCACCTCACCTTATTTTGTTCGCAGGTCTTCATTTACAGTGTTTATACTATACCAAGTAGAGCACGATGCTTCAGTAGATGTTGATTGGAGGCGCTCATTTTATTTGAACTTGATTGCTCACAGTTCATTTACTGTGACTGTGGCAATTTGCAGTCATCTGGTTCTTCGAAATCATCAAGCTGGACATACTACATCATTGTCTCCTATATATAAGGTTGTAAAGACTGTATATGCATCTCCTAGCCGTGTCAATTTTCATTTGGATTCTAAAAAGGAAGTGGAGACAACACCTGCATATCCAGATATTTGTTTTGCAGTTGATGACTTTGACTCCACCTTTGATGCAGTGGTCTTAACAGAGACTGACCATTGCTATTGTGTACTTCTCAATGCACATGGTGGTGCAGCTTTTCCCGGAAAGGAGAATGCAGAAAATTGCAATTCTAGTAATCTTATGGCCTCATGTACGGATTCTGATCCTCAAAGTATGAAGGATTCCAAGATTACTCTTTTCTCTGGATTTGTCAGCTACCAAATGGTTCGAGATGCATATGACGCTGGCAAGTCTAGATTTGGCAGCCTTCTCTCTCTTGGTCATGCTTCTGGCAAAACAGACAAAATTTACATGAAGGGACCTGGAGGACGTGGGGAAGTTGAAGTTGCTGTTTCTGGTGTTGCAGATCAAAGCCTGCAGGATTCTGGGCCCTTTTCACCCGTAGTATCAAAGAGAGGATTTGGGATTGGCACCATAGTTAGAAAAGCAGCATCTGTAGCAACTGTTGCTGCAAAACAAGCTTATGCAGCTGCTAGTAGCACTAGTTCTGATGACGAGATGATACCTCTCAAGTGTTGCTTGATGTCCATTTCGTTGCCCTGGGAATACATCGCACACGATCTTTTGTTCAAGATTTGGCAGTATTACATTGGGGTTACAAAAGTAACTTGCAAAGGGGTACCTGATGACGACATCTCAAGTTCCTGGCGGCATGGGCGTCTAGTTAAGGTTTAG

Protein sequence

NDAFFAIASDSNAPHFPEKKPRHLVSKFLLGLIVSLPFYCKNSLRTKIPELELIYKTDFPLVYTLFFILRKQKLPGEMLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESRGRQDDDLKSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSFYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTSSDDEMIPLKCCLMSISLPWEYIAHDLLFKIWQYYIGVTKVTCKGVPDDDISSSWRHGRLVKV
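
The start of the protein sequence above can be reproduced by reading the listed coding sequence in frame from its first base. The sketch below is a minimal illustration that assumes the standard genetic code; `translate` is a hypothetical helper, not the tool the database used, and only the first 48 bases of the CDS are translated.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (last position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna):
    """Translate DNA in frame 1; unknown codons (e.g. with N) become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 48 bases of the coding sequence listed above.
cds_start = "AACGATGCATTTTTTGCTATAGCATCCGACTCCAACGCACCGCATTTC"
print(translate(cds_start))
```

The printed peptide, NDAFFAIASDSNAPHF, matches the first 16 residues of the listed protein sequence.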
BLAST of Cp4.1LG11g01240 vs. Swiss-Prot
Match: K0930_XENLA (Uncharacterized protein KIAA0930 homolog OS=Xenopus laevis PE=2 SV=2)

HSP 1 Score: 53.9 bits (128), Expect = 5.7e-06
Identity = 79/296 (26.69%), Positives = 121/296 (40.88%), Query Frame = 1

Query: 121 FWHDVMDLYFIRGKESRGRQDDDL----KSQGYGNDDDNGGTSPYFV---RRSSFTVFIL 180
           FW  +   YF+  ++   RQDD L    +   Y   D N G     V   R+ S  +  L
Sbjct: 31  FWTWMYSTYFM--EKWAPRQDDMLFYVRRKPAYMGPDGNEGRKQVEVEVYRKDSKKLPGL 90

Query: 181 YQVEHDASVDVDWRRSFYLNLIAHS-SFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKT 240
                    D+DW  S YLNLI     + VT A+C+        AG    +    K  + 
Sbjct: 91  ------GDPDIDWEESVYLNLILQKLDYMVTCAVCT-----RSDAG---DIHIHKKKSQQ 150

Query: 241 VYASPSRVNFHLDSKKEVETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLL------ 300
           V+ASPS+    +DSK E E+  +YP+I F +D+F+  F  + + E +   CV L      
Sbjct: 151 VFASPSK--HPMDSKGE-ESKISYPNIFFMIDNFEEVFSDMTVGEGE-MVCVELVARDKT 210

Query: 301 NAHGGAAFPGKENAENCNSSNLMASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKS 360
           N   G  F G    E                + + D+++++ +  ++ +M    Y     
Sbjct: 211 NTFQGVIFQGSIRYEAL--------------KKVYDNRVSV-AAKMAQKMSFGFYKYNNM 270

Query: 361 RFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQSL------QDSGPFSPV 397
            F               + MKGP G+G  E+AVS V+          +DS P SP+
Sbjct: 271 EF---------------VRMKGPQGKGHAEMAVSRVSTGDTSPYGTEEDSNPGSPM 276
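
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them for the Swiss-Prot hit above; `percent` and `check_summary` are hypothetical helpers, and the regular expression only assumes the `count/length (xx.xx%)` pattern seen in these reports.

```python
import re

def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

def check_summary(line):
    """Return (recomputed, reported) percentage pairs found in a summary line."""
    pairs = []
    for matches, length, reported in re.findall(r"(\d+)/(\d+) \(([\d.]+)%\)", line):
        pairs.append((percent(int(matches), int(length)), float(reported)))
    return pairs

summary = "Identity = 79/296 (26.69%), Positives = 121/296 (40.88%), Query Frame = 1"
for recomputed, reported in check_summary(summary):
    print(recomputed, reported)
```

For this hit, 79 identical and 121 similar positions over 296 aligned columns give 26.69% and 40.88%, in agreement with the reported values.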

BLAST of Cp4.1LG11g01240 vs. TrEMBL
Match: A0A0A0LN69_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G354060 PE=4 SV=1)

HSP 1 Score: 652.9 bits (1683), Expect = 3.1e-184
Identity = 334/373 (89.54%), Positives = 345/373 (92.49%), Query Frame = 1

Query: 94  MVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESRGRQDDDL-------KS 153
           MVKKHSNLIGQT VDEQD SDVE DPRFWHDVMDLYFIRGKESRGRQDDDL       KS
Sbjct: 1   MVKKHSNLIGQTVVDEQDVSDVERDPRFWHDVMDLYFIRGKESRGRQDDDLVFFVRKVKS 60

Query: 154 QGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSFYLNLIAHSSFTVTVAI 213
           QGYG+DDDNGGTSPYFVRR +     L  +  DASVDVDWRRSFYLNLIAH+SFTVTVAI
Sbjct: 61  QGYGSDDDNGGTSPYFVRRWASK---LDNLVGDASVDVDWRRSFYLNLIAHTSFTVTVAI 120

Query: 214 CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF 273
           CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF
Sbjct: 121 CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF 180

Query: 274 DSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMASCTDSDPQSMKDSKIT 333
           DSTFDAVVLTETDHCYCVLLNAH GAAFP K+NAENCNSSN +AS  DSD Q+ K+SKIT
Sbjct: 181 DSTFDAVVLTETDHCYCVLLNAHDGAAFPAKDNAENCNSSNFVASSLDSDSQNTKNSKIT 240

Query: 334 LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS 393
           LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS
Sbjct: 241 LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS 300

Query: 394 LQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTSSDDEMIPLKCCLMSIS 453
           LQDSGPFSPVVSK GFGIGTIVR+AASVATVAA+QAYAAASS+SSDDEMIPLKCCLMSIS
Sbjct: 301 LQDSGPFSPVVSKTGFGIGTIVRRAASVATVAARQAYAAASSSSSDDEMIPLKCCLMSIS 360

Query: 454 LPWEYIAHDLLFK 460
           LPWEYIAHDLLFK
Sbjct: 361 LPWEYIAHDLLFK 370

BLAST of Cp4.1LG11g01240 vs. TrEMBL
Match: A0A061FMH8_THECC (Cw7 protein isoform 1 OS=Theobroma cacao GN=TCM_034665 PE=4 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 1.9e-157
Identity = 289/397 (72.80%), Positives = 336/397 (84.63%), Query Frame = 1

Query: 69  LRKQKLPGEMLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDL 128
           L+K+K P  ML  GEETP R+ELL+MVKKHSNL+G+T VDE DASDV+MD +FWHDV +L
Sbjct: 38  LQKKKKPQAMLHIGEETPSRFELLSMVKKHSNLLGKTTVDEHDASDVQMDGQFWHDVFNL 97

Query: 129 YFIRGKESRGRQDDDL-----KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVD 188
           YF+RG+ESRGRQDDDL     K +G+  +D + G +PYFVRR +  +  L         D
Sbjct: 98  YFVRGRESRGRQDDDLIFFVRKWRGHDFNDKDEGFAPYFVRRWAPELDNLVGASLS---D 157

Query: 189 VDWRRSFYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFH 248
           VDWRRSFYLN+IAH+S++VTVAICSH VLRNHQAG  T LSPIYKVVKTVYASPSRVNFH
Sbjct: 158 VDWRRSFYLNMIAHTSYSVTVAICSHHVLRNHQAGQDTPLSPIYKVVKTVYASPSRVNFH 217

Query: 249 LDSKKEVETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENC 308
           LDSKKEVETTPAYPDICFA+DDFDSTFDA+VLT+TDHCYCVLLNA  GAAFP ++   + 
Sbjct: 218 LDSKKEVETTPAYPDICFAIDDFDSTFDAMVLTDTDHCYCVLLNALDGAAFPSEKETNDS 277

Query: 309 NSSNLMASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDK 368
           +SS+ +    D++    K SK+TLFSGFVSYQMVRDAYDAG SRFGSLLSLGH+SGKTD+
Sbjct: 278 SSSDKLPLRVDTNSMKTKTSKLTLFSGFVSYQMVRDAYDAGSSRFGSLLSLGHSSGKTDR 337

Query: 369 IYMKGPGGRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAY 428
           +YMKGPGG GEVEVAVSGVADQS QDSGPFSPV+SKRGFG+G+IVRKAASVA+VAAK AY
Sbjct: 338 LYMKGPGGCGEVEVAVSGVADQSKQDSGPFSPVISKRGFGLGSIVRKAASVASVAAKHAY 397

Query: 429 AAASSTS-SDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           AAA++TS SD+EMIPLKCCLMSI+LPWE+IA+DLLFK
Sbjct: 398 AAAAATSTSDEEMIPLKCCLMSITLPWEHIAYDLLFK 431

BLAST of Cp4.1LG11g01240 vs. TrEMBL
Match: B9R6W6_RICCO (Cw7 protein, putative OS=Ricinus communis GN=RCOM_1586510 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 3.0e-155
Identity = 291/399 (72.93%), Positives = 335/399 (83.96%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQ-TEVDEQDASDVEMDPRFWHDVMDLYFIRGKES 137
           MLGDGE+TP RYELL+MVKKHS L+G+ TEVD  DA+DVEMD  FW+D++DLYFIRG+ES
Sbjct: 1   MLGDGEDTPSRYELLSMVKKHSTLLGKSTEVD--DATDVEMDGHFWYDMLDLYFIRGRES 60

Query: 138 RGRQDDDLK-------SQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRS 197
           RGRQDDDL        ++GYG +D+  G SPYFVRR +  +  L     D S +VDWRRS
Sbjct: 61  RGRQDDDLVFFVKKMGTEGYGFNDNMQGVSPYFVRRWAPKLESLIS---DNS-EVDWRRS 120

Query: 198 FYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKE 257
           FYLNLIAH+SF+VTVAICSH  LRNHQAG  T LSPIYKVVKTVYAS SRVNFHLDSKK 
Sbjct: 121 FYLNLIAHTSFSVTVAICSHQALRNHQAGPNTQLSPIYKVVKTVYASASRVNFHLDSKKA 180

Query: 258 VETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLM 317
           +ETTPAYPDICFAVDDFDSTFDAVVLTE DHCYCVLLNAHGGAAFP +  + + +S+++ 
Sbjct: 181 IETTPAYPDICFAVDDFDSTFDAVVLTEKDHCYCVLLNAHGGAAFPCERESPDSSSNSVS 240

Query: 318 ASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGP 377
               D+     K+SK+TLFSGFVSYQMVR+AYDAGKSRFGSLLSLGH+ GKTDK+YMKGP
Sbjct: 241 PVEVDTSSGKTKNSKLTLFSGFVSYQMVREAYDAGKSRFGSLLSLGHSPGKTDKLYMKGP 300

Query: 378 GGRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAY-AAASS 437
           GGRGEVEVAVSGVADQS QD GPFSPV SKRGFGIG+IVRKAASVA+VAAK AY AAA+S
Sbjct: 301 GGRGEVEVAVSGVADQSQQDFGPFSPVTSKRGFGIGSIVRKAASVASVAAKHAYAAAAAS 360

Query: 438 TSSDDEMIPLKCCLMSISLPWEYIAHDLLFKIWQYYIGV 468
           TSSD+EM+PLKCCLM++SLPWE+IA+DLLFK+  Y + +
Sbjct: 361 TSSDEEMVPLKCCLMAVSLPWEHIAYDLLFKVAGYCLHI 393

BLAST of Cp4.1LG11g01240 vs. TrEMBL
Match: A0A067LAQ9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15997 PE=4 SV=1)

HSP 1 Score: 556.2 bits (1432), Expect = 3.9e-155
Identity = 291/390 (74.62%), Positives = 328/390 (84.10%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESR 137
           M+ DGE+TP RYELL+MVKKHS L+ +    EQDASDVEMD RFW+D++DLYFIRGKESR
Sbjct: 1   MVRDGEDTPSRYELLSMVKKHSKLLSKATESEQDASDVEMDGRFWYDMLDLYFIRGKESR 60

Query: 138 GRQDDDLK-------SQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSF 197
           GRQDDDL        ++GYG +D   G  PYFVRR +  +  L   EH  S ++DWRRSF
Sbjct: 61  GRQDDDLVFFVRKMGTEGYGFNDIVQGVPPYFVRRWAPKLDNLVS-EH--SKEIDWRRSF 120

Query: 198 YLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 257
           YLNLIAH+SFTVTVAICSH VLRNHQAG    LSPIYKVVKTVYASPSRV+F LDSKKEV
Sbjct: 121 YLNLIAHTSFTVTVAICSHQVLRNHQAGQDAPLSPIYKVVKTVYASPSRVDFQLDSKKEV 180

Query: 258 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMA 317
           ETTPAYPDICFA+DDFDSTFDAVVLTE DHCYCVLLNAHGGAAFP ++ + + +SS+   
Sbjct: 181 ETTPAYPDICFAIDDFDSTFDAVVLTEPDHCYCVLLNAHGGAAFPTEKGSPDTSSSSNSP 240

Query: 318 SCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 377
             TD+     K+SK+TLFSGFVSYQMVR+AYDAGKSRFGSLLSLG + GKTDK+YMKGPG
Sbjct: 241 LKTDTTFAKTKNSKLTLFSGFVSYQMVREAYDAGKSRFGSLLSLGQSPGKTDKLYMKGPG 300

Query: 378 GRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAY-AAASST 437
           GRGEVEVAVSGVADQS QD GPFSPV SK+GFGIG+IVRKAASVA+VAAK AY AAA+ST
Sbjct: 301 GRGEVEVAVSGVADQSQQDFGPFSPVTSKKGFGIGSIVRKAASVASVAAKHAYAAAAAST 360

Query: 438 SSDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           SSDDEM+PLKCCLMSISLPWE+IA+DLLFK
Sbjct: 361 SSDDEMLPLKCCLMSISLPWEHIAYDLLFK 387

BLAST of Cp4.1LG11g01240 vs. TrEMBL
Match: E0CNZ2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g08720 PE=4 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 8.8e-155
Identity = 289/392 (73.72%), Positives = 328/392 (83.67%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQTEVD-EQDASDVEMDPRFWHDVMDLYFIRGKES 137
           MLGDG ETP RYELL+MVKKHS+ +  T VD EQDASDVEMDP FWHD++DLYFI G+ES
Sbjct: 1   MLGDGGETPTRYELLSMVKKHSSSLKNTTVDDEQDASDVEMDPSFWHDILDLYFICGRES 60

Query: 138 RGRQDDDL-------KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRS 197
           RGRQDDDL          GYG +D   G  PYFVRR +     L ++  + S +VDWRRS
Sbjct: 61  RGRQDDDLIFFVRKLSLHGYGFNDHMEGIPPYFVRRWAPK---LDKLVSENSAEVDWRRS 120

Query: 198 FYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKE 257
           FYLNLIAH+SFTVTVAICSH VLRNHQ G  + LSPIYKVVKTVYASPSRV FHLDSKKE
Sbjct: 121 FYLNLIAHTSFTVTVAICSHQVLRNHQTGQDSPLSPIYKVVKTVYASPSRVEFHLDSKKE 180

Query: 258 VETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLM 317
           VET PAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAH GAAFP ++ + +C+SSN  
Sbjct: 181 VETIPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHDGAAFPSEKTSPDCSSSNTS 240

Query: 318 ASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGP 377
           +   D+D +  K++K+TLFSGFVSYQMVR+AYDAGKSRFGSLL LG ++GKTD++YMKGP
Sbjct: 241 SLRIDTDSEKSKNTKLTLFSGFVSYQMVREAYDAGKSRFGSLLLLGQSAGKTDRLYMKGP 300

Query: 378 GGRGEVEVAVSGVADQSLQDSGPFSPV-VSKRGFGIGTIVRKAASVATVAAKQAYAAASS 437
           GGRGEVEVAVSGVADQS Q SGP SPV  SKRGFGIGTIVRKAA+VA+VAAK AYAAA++
Sbjct: 301 GGRGEVEVAVSGVADQSQQPSGPSSPVHASKRGFGIGTIVRKAATVASVAAKHAYAAAAA 360

Query: 438 T-SSDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           T SSD+EM+PLKCCLMSISLPWE+IA+DL+FK
Sbjct: 361 TRSSDEEMLPLKCCLMSISLPWEHIAYDLMFK 389

BLAST of Cp4.1LG11g01240 vs. TAIR10
Match: AT1G59520.3 (AT1G59520.3 CW7)

HSP 1 Score: 490.0 bits (1260), Expect = 1.8e-138
Identity = 249/389 (64.01%), Positives = 305/389 (78.41%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESR 137
           ML DGE TP R+E+L+MVKKHS  +G+T +DEQDASDVEMD  FWH V D+YF+R  ESR
Sbjct: 1   MLNDGEVTPSRHEILSMVKKHSKSLGKTSLDEQDASDVEMDSNFWHGVFDVYFVRCMESR 60

Query: 138 GRQDDDL-------KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSF 197
            RQDDDL         + YG  ++    +PYFVRR +     L ++  ++  +VDWR+SF
Sbjct: 61  RRQDDDLLFFVRKLSCKSYGLTENEDAPAPYFVRRWAPK---LDELLGESLAEVDWRKSF 120

Query: 198 YLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 257
           YLN+IAH+SFTVTVAICS+  L+ +Q    T LSPIYKVVKTVYASPSRVNFHLDSKK +
Sbjct: 121 YLNMIAHTSFTVTVAICSNEALKTYQGSKDTKLSPIYKVVKTVYASPSRVNFHLDSKKAM 180

Query: 258 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMA 317
           ETTPAYP+ICFAVDDFDSTFDAVVLT+ DHCYCVLLN+H GAAFP     ++ +S+    
Sbjct: 181 ETTPAYPEICFAVDDFDSTFDAVVLTDKDHCYCVLLNSHDGAAFPSATVKDSSDSN---- 240

Query: 318 SCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 377
             T++DP+++KD K+TLFSGFVSYQMVR+AY+ G++RFGSLLSLGH +GK D++YM+GPG
Sbjct: 241 --TNADPRTVKDPKVTLFSGFVSYQMVREAYEGGRNRFGSLLSLGHITGKADRLYMRGPG 300

Query: 378 GRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTS 437
           GRGEVEVAVSGV DQS    GP SP+ SK+   +G+I RKAASVA+VAAK A AAA+++ 
Sbjct: 301 GRGEVEVAVSGVVDQSQVVLGPVSPMSSKKSIDLGSIFRKAASVASVAAKHAIAAATASY 360

Query: 438 SDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
            +DEM PLKCCLMSISLPW+ IAHDLLFK
Sbjct: 361 DEDEMFPLKCCLMSISLPWDTIAHDLLFK 380

BLAST of Cp4.1LG11g01240 vs. NCBI nr
Match: gi|449450304|ref|XP_004142903.1| (PREDICTED: uncharacterized protein KIAA0930 homolog [Cucumis sativus])

HSP 1 Score: 686.0 bits (1769), Expect = 4.7e-194
Identity = 349/389 (89.72%), Positives = 360/389 (92.54%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESR 137
           MLGDG ETPCRYELLNMVKKHSNLIGQT VDEQD SDVE DPRFWHDVMDLYFIRGKESR
Sbjct: 1   MLGDGGETPCRYELLNMVKKHSNLIGQTVVDEQDVSDVERDPRFWHDVMDLYFIRGKESR 60

Query: 138 GRQDDDL-------KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSF 197
           GRQDDDL       KSQGYG+DDDNGGTSPYFVRR +     L  +  DASVDVDWRRSF
Sbjct: 61  GRQDDDLVFFVRKVKSQGYGSDDDNGGTSPYFVRRWASK---LDNLVGDASVDVDWRRSF 120

Query: 198 YLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 257
           YLNLIAH+SFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV
Sbjct: 121 YLNLIAHTSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 180

Query: 258 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMA 317
           ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAH GAAFP K+NAENCNSSN +A
Sbjct: 181 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHDGAAFPAKDNAENCNSSNFVA 240

Query: 318 SCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 377
           S  DSD Q+ K+SKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG
Sbjct: 241 SSLDSDSQNTKNSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 300

Query: 378 GRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTS 437
           GRGEVEVAVSGVADQSLQDSGPFSPVVSK GFGIGTIVR+AASVATVAA+QAYAAASS+S
Sbjct: 301 GRGEVEVAVSGVADQSLQDSGPFSPVVSKTGFGIGTIVRRAASVATVAARQAYAAASSSS 360

Query: 438 SDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           SDDEMIPLKCCLMSISLPWEYIAHDLLFK
Sbjct: 361 SDDEMIPLKCCLMSISLPWEYIAHDLLFK 386

BLAST of Cp4.1LG11g01240 vs. NCBI nr
Match: gi|659087578|ref|XP_008444527.1| (PREDICTED: uncharacterized protein KIAA0930 homolog [Cucumis melo])

HSP 1 Score: 679.9 bits (1753), Expect = 3.4e-192
Identity = 345/389 (88.69%), Positives = 357/389 (91.77%), Query Frame = 1

Query: 78  MLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESR 137
           MLGDG ETPCRYELLNMVKKHSNLIGQT VDEQD SDVE DPRFWHDVMDLYFIRGKESR
Sbjct: 1   MLGDGGETPCRYELLNMVKKHSNLIGQTVVDEQDVSDVETDPRFWHDVMDLYFIRGKESR 60

Query: 138 GRQDDDL-------KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSF 197
           GRQDDDL       K QGYGNDDDNGGTSPYFVRR +     L  +  DASVDVDWRRSF
Sbjct: 61  GRQDDDLVFFVRKVKPQGYGNDDDNGGTSPYFVRRWASK---LDNLVGDASVDVDWRRSF 120

Query: 198 YLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 257
           YLNLIAH++FTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV
Sbjct: 121 YLNLIAHTTFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEV 180

Query: 258 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMA 317
           ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAH GAAFP K+NAENC+SSN + 
Sbjct: 181 ETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHDGAAFPAKDNAENCDSSNFVV 240

Query: 318 SCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 377
           S  DSD Q+ K+SKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG
Sbjct: 241 SSMDSDSQNTKNSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPG 300

Query: 378 GRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTS 437
           GRGEVEVAVSGVADQSLQDSGPFSPVVSK GFGIGTIVR+AASVATVAA+QAYAAASS+ 
Sbjct: 301 GRGEVEVAVSGVADQSLQDSGPFSPVVSKTGFGIGTIVRRAASVATVAARQAYAAASSSG 360

Query: 438 SDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           SDDEMIPLKCCLMSISLPWEYIAHDLLFK
Sbjct: 361 SDDEMIPLKCCLMSISLPWEYIAHDLLFK 386

BLAST of Cp4.1LG11g01240 vs. NCBI nr
Match: gi|700207310|gb|KGN62429.1| (hypothetical protein Csa_2G354060 [Cucumis sativus])

HSP 1 Score: 652.9 bits (1683), Expect = 4.4e-184
Identity = 334/373 (89.54%), Positives = 345/373 (92.49%), Query Frame = 1

Query: 94  MVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDLYFIRGKESRGRQDDDL-------KS 153
           MVKKHSNLIGQT VDEQD SDVE DPRFWHDVMDLYFIRGKESRGRQDDDL       KS
Sbjct: 1   MVKKHSNLIGQTVVDEQDVSDVERDPRFWHDVMDLYFIRGKESRGRQDDDLVFFVRKVKS 60

Query: 154 QGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDVDWRRSFYLNLIAHSSFTVTVAI 213
           QGYG+DDDNGGTSPYFVRR +     L  +  DASVDVDWRRSFYLNLIAH+SFTVTVAI
Sbjct: 61  QGYGSDDDNGGTSPYFVRRWASK---LDNLVGDASVDVDWRRSFYLNLIAHTSFTVTVAI 120

Query: 214 CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF 273
           CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF
Sbjct: 121 CSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHLDSKKEVETTPAYPDICFAVDDF 180

Query: 274 DSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCNSSNLMASCTDSDPQSMKDSKIT 333
           DSTFDAVVLTETDHCYCVLLNAH GAAFP K+NAENCNSSN +AS  DSD Q+ K+SKIT
Sbjct: 181 DSTFDAVVLTETDHCYCVLLNAHDGAAFPAKDNAENCNSSNFVASSLDSDSQNTKNSKIT 240

Query: 334 LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS 393
           LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS
Sbjct: 241 LFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKIYMKGPGGRGEVEVAVSGVADQS 300

Query: 394 LQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYAAASSTSSDDEMIPLKCCLMSIS 453
           LQDSGPFSPVVSK GFGIGTIVR+AASVATVAA+QAYAAASS+SSDDEMIPLKCCLMSIS
Sbjct: 301 LQDSGPFSPVVSKTGFGIGTIVRRAASVATVAARQAYAAASSSSSDDEMIPLKCCLMSIS 360

Query: 454 LPWEYIAHDLLFK 460
           LPWEYIAHDLLFK
Sbjct: 361 LPWEYIAHDLLFK 370

BLAST of Cp4.1LG11g01240 vs. NCBI nr
Match: gi|590596846|ref|XP_007018449.1| (Cw7 protein isoform 1 [Theobroma cacao])

HSP 1 Score: 563.9 bits (1452), Expect = 2.7e-157
Identity = 289/397 (72.80%), Positives = 336/397 (84.63%), Query Frame = 1

Query: 69  LRKQKLPGEMLGDGEETPCRYELLNMVKKHSNLIGQTEVDEQDASDVEMDPRFWHDVMDL 128
           L+K+K P  ML  GEETP R+ELL+MVKKHSNL+G+T VDE DASDV+MD +FWHDV +L
Sbjct: 38  LQKKKKPQAMLHIGEETPSRFELLSMVKKHSNLLGKTTVDEHDASDVQMDGQFWHDVFNL 97

Query: 129 YFIRGKESRGRQDDDL-----KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVD 188
           YF+RG+ESRGRQDDDL     K +G+  +D + G +PYFVRR +  +  L         D
Sbjct: 98  YFVRGRESRGRQDDDLIFFVRKWRGHDFNDKDEGFAPYFVRRWAPELDNLVGASLS---D 157

Query: 189 VDWRRSFYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFH 248
           VDWRRSFYLN+IAH+S++VTVAICSH VLRNHQAG  T LSPIYKVVKTVYASPSRVNFH
Sbjct: 158 VDWRRSFYLNMIAHTSYSVTVAICSHHVLRNHQAGQDTPLSPIYKVVKTVYASPSRVNFH 217

Query: 249 LDSKKEVETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENC 308
           LDSKKEVETTPAYPDICFA+DDFDSTFDA+VLT+TDHCYCVLLNA  GAAFP ++   + 
Sbjct: 218 LDSKKEVETTPAYPDICFAIDDFDSTFDAMVLTDTDHCYCVLLNALDGAAFPSEKETNDS 277

Query: 309 NSSNLMASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDK 368
           +SS+ +    D++    K SK+TLFSGFVSYQMVRDAYDAG SRFGSLLSLGH+SGKTD+
Sbjct: 278 SSSDKLPLRVDTNSMKTKTSKLTLFSGFVSYQMVRDAYDAGSSRFGSLLSLGHSSGKTDR 337

Query: 369 IYMKGPGGRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAY 428
           +YMKGPGG GEVEVAVSGVADQS QDSGPFSPV+SKRGFG+G+IVRKAASVA+VAAK AY
Sbjct: 338 LYMKGPGGCGEVEVAVSGVADQSKQDSGPFSPVISKRGFGLGSIVRKAASVASVAAKHAY 397

Query: 429 AAASSTS-SDDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           AAA++TS SD+EMIPLKCCLMSI+LPWE+IA+DLLFK
Sbjct: 398 AAAAATSTSDEEMIPLKCCLMSITLPWEHIAYDLLFK 431

BLAST of Cp4.1LG11g01240 vs. NCBI nr
Match: gi|694400055|ref|XP_009375131.1| (PREDICTED: uncharacterized protein KIAA0930 homolog isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 562.4 bits (1448), Expect = 7.9e-157
Identity = 292/396 (73.74%), Positives = 333/396 (84.09%), Query Frame = 1

Query: 74  LPGEMLGDGEETPCRYELLNMVKKHSNLIGQTEV--DEQDASDVEMDPRFWHDVMDLYFI 133
           LP EMLG GE+TP R+ELL+MVKKHS+L+G+T +  DE D SDVE+D RFWHDV+DLYF+
Sbjct: 2   LPIEMLGHGEDTPSRFELLSMVKKHSSLLGKTVIVADEDDTSDVELDHRFWHDVLDLYFV 61

Query: 134 RGKESRGRQDDDL-------KSQGYGNDDDNGGTSPYFVRRSSFTVFILYQVEHDASVDV 193
           RGK+SRGRQDDDL        S GYG +D   G SPYFVRR +     L  +  +++VDV
Sbjct: 62  RGKDSRGRQDDDLVFFVRKLSSYGYGFNDGKEGVSPYFVRRWAPK---LDNLIGESNVDV 121

Query: 194 DWRRSFYLNLIAHSSFTVTVAICSHLVLRNHQAGHTTSLSPIYKVVKTVYASPSRVNFHL 253
           DWRRSFYLNLIAH+SFTVTVAICSH  LRNHQA     LSPIYKVVKTVYASPSRVNF L
Sbjct: 122 DWRRSFYLNLIAHTSFTVTVAICSHQDLRNHQAEQRIPLSPIYKVVKTVYASPSRVNFQL 181

Query: 254 DSKKEVETTPAYPDICFAVDDFDSTFDAVVLTETDHCYCVLLNAHGGAAFPGKENAENCN 313
           DSKKEVETTPAYPDICFA+DDFDSTFDAVVLTETDHCYCV+LNAH GAAFP ++ + +C+
Sbjct: 182 DSKKEVETTPAYPDICFAIDDFDSTFDAVVLTETDHCYCVILNAHDGAAFPREKESNDCS 241

Query: 314 SSNLMASCTDSDPQSMKDSKITLFSGFVSYQMVRDAYDAGKSRFGSLLSLGHASGKTDKI 373
           SS+  +   +++   MK++K+TLFSGFVSYQMVRDAYDAGKSRFG+L  LGH+ GKTDK+
Sbjct: 242 SSDSSSLRVETNSAKMKNTKLTLFSGFVSYQMVRDAYDAGKSRFGNL--LGHSPGKTDKL 301

Query: 374 YMKGPGGRGEVEVAVSGVADQSLQDSGPFSPVVSKRGFGIGTIVRKAASVATVAAKQAYA 433
           YMKGPGGRGEVEVAVS VADQS QD GPFSPV+SKRGFGIG+IVRKAASVA+VAAK AYA
Sbjct: 302 YMKGPGGRGEVEVAVSRVADQSQQDLGPFSPVISKRGFGIGSIVRKAASVASVAAKNAYA 361

Query: 434 AASSTSS-DDEMIPLKCCLMSISLPWEYIAHDLLFK 460
           AASST S DDEM+PLKCCLMSISLPWE IA+DLLFK
Sbjct: 362 AASSTHSFDDEMVPLKCCLMSISLPWEQIAYDLLFK 392

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
K0930_XENLA  5.7e-06  26.69     Uncharacterized protein KIAA0930 homolog OS=Xenopus laevis PE=2 SV=2

Match Name        E-value   Identity  Description
A0A0A0LN69_CUCSA  3.1e-184  89.54     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G354060 PE=4 SV=1
A0A061FMH8_THECC  1.9e-157  72.80     Cw7 protein isoform 1 OS=Theobroma cacao GN=TCM_034665 PE=4 SV=1
B9R6W6_RICCO      3.0e-155  72.93     Cw7 protein, putative OS=Ricinus communis GN=RCOM_1586510 PE=4 SV=1
A0A067LAQ9_JATCU  3.9e-155  74.62     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15997 PE=4 SV=1
E0CNZ2_VITVI      8.8e-155  73.72     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g08720 PE=4 SV=...

Match Name   E-value   Identity  Description
AT1G59520.3  1.8e-138  64.01     CW7

Match Name                        E-value   Identity  Description
gi|449450304|ref|XP_004142903.1|  4.7e-194  89.72     PREDICTED: uncharacterized protein KIAA0930 homolog [Cucumis sativus]
gi|659087578|ref|XP_008444527.1|  3.4e-192  88.69     PREDICTED: uncharacterized protein KIAA0930 homolog [Cucumis melo]
gi|700207310|gb|KGN62429.1|       4.4e-184  89.54     hypothetical protein Csa_2G354060 [Cucumis sativus]
gi|590596846|ref|XP_007018449.1|  2.7e-157  72.80     Cw7 protein isoform 1 [Theobroma cacao]
gi|694400055|ref|XP_009375131.1|  7.9e-157  73.74     PREDICTED: uncharacterized protein KIAA0930 homolog isoform X1 [Pyrus x bretschn...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR019141  DUF2045
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG11g01240.1  Cp4.1LG11g01240.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term     Source Description          Alignment
IPR019141  Protein of unknown function DUF2045  PFAM     PF09741         DUF2045                     coord: 119..382, score: 1.2
None       No IPR available                     PANTHER  PTHR21477       FAMILY NOT NAMED            coord: 56..290, score: 5.3E-128; coord: 319..468, score: 5.3E
None       No IPR available                     PANTHER  PTHR21477:SF13  PROTEIN C16E9.2, ISOFORM A  coord: 56..290, score: 5.3E-128; coord: 319..468, score: 5.3E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG11g01240  Cp4.1LG07g04780  Cucurbita pepo (Zucchini)  cpecpeB146