Cp4.1LG12g03940 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG12g03940
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Descriptionglycosyl transferase family 1 protein
LocationCp4.1LG12 : 2865468 .. 2874146 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGATGCCTATATCCACGTCAACGTCTTTTCTCAAACCTCGCTCTCTAACTTAACCAATTTTTACTCAGCTCACAACCATCCTTCTAGTTTTTTTTGCTCTGAATTTACTGCCTCCTCTACTCTGCTAAATCTATCGCACCCCAGTTCAGCTCTTGTCGGCGATTAGTGGGTCATTTTTTGTTTTTGATTCTGAGTCTAAAATTCCCAAGTTATTGCTGTGAAAATTGGTGTTGTTTGATGCGATTGATCTTGGGATGTTGAGTTGAAGCAAATGGGTTCTCTGGAGAATGGATATCCATTGAAGAGAGACCCGCTTCTTCGTTCTTCATCGATTAGCAGAGGCGAAAGGTACCCATTTCTACAGAGACCCAGATCGAGATTTTCTCGGTTTTTGCTTTTTCAAAAGATTGATTACTTGCAATGGATTTGTACTGTGGGTGTGTTCTTGTTCTTCGTGGTTCTTTTTCAAATGTTCTTGCCTGGATCAGTCATGGAGAAGTCTGACATTGCCTTTAAAGATGTGGAGAAAAGTTTAGGGGATTTGGAGTTCTTGAAGGAGTTGGGTATGTTGGAATTTGGGGAGGATATTCGATTTGAGCCGTCGAAGCTTTTGGAGAAGCTTAAGAAAGAGGCAAGAGAAGGGGGTTTTTCATCTTTCAATAGAACTATTAATCGTTTTGGGTATAGGAAACCTCAGCTTGCTATGGTAAGTTTGCTACCACAATCTTAGAAATATTCAGGAATTGGTTCAAATCTTTGAGCTCTGTGATGTCTGATGTTGTTGAACATTGAAATTTCTTTTATTGTCATTTTGTCTTTGAAGGTGTTTTCAGATCTGTTGGTTGATTCTTACCAAGTTCTAATGGTAACCATTGCATCTGCTCTGCAAGAGATAGGATATGCAATTCAGGTAATTCACTGAATGCAAGAGTTTTTACATATAGCCTTTTTTACTCTCAACTTCTGTTATGCAGTTTATTTTTTGGGGGATTTGTGGGGATGTTTGAGGGAAGGTGTATGTGTGAGATCCTACATCGGTTGAGGAGGAGAATGAAAGAACGAAACATTCTTTATAAGGGCGTGGAAACCTCTTACTAGCAGATGTGTTTTAGAAACCTTGAGGGGAAGTTCGAAAGGGAAAGTCCTAGGAGGACAATATCTCCTAGTGGTGGGCTTGGGTCGTTACAAATGGCATTAGTGCTAGACACCGAGCGATGTGCTAGTGAGGAGGCTGAGCCTTGAAGGAGGGTGGACATGAGGCGGTGTGCCAATAAGGACGCTGGGCCCTGAAGGGGGTGGATTGTGAGATTCCACATTGGTTGGGGAAGAGAACAAAATATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAATTTTTGAGGGAAAGCCTGAAAAGGAAAGCCCAAAGAGAACAATATTTGCTAGCGGTGGGTTTGAGCCGTTACAAATGGTATCAGAGCTAGACATTGAGCGATGTGCCAACGAGGAAGCTGAGTCCCAAAGAGAACAATATTTGCTAGCGGTGGGTTTGAGCCGTTACAAATGGTATCAGAGCTAGACATTGGGCGATGTGCCAACAAGGAAGTTGAGTTCCAAAAGAGGGTGGACGCGAGGCGGTGTGCCAGCAAGGACGTTGGGCCCTGAAGGGGGTGGATTGTGAGATCCCACATCAGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCTTAGTAGACGTGTTTTAAAAATCTTGAGGGGAAGCCTGAAAGGGAAAGCCTAAAGAGGTCAATATCTGCTAACGGTGGGCTTGAGCCGTTACAGTAAGCATACAACATGCTGTATTCTGGAATTTCCTATTTCAAATATGTTCAAGGAAAGGATATAGAACGACGACAGCTGTATAATTGTTGGCTACTCCTTGTTGCAGGATGTGGACTGAAATATTTTTCTTCTTTTCAGAACTAAAAACTTAGTTTTGGGAAGGGGCTTTCCATTTACTTCATCAATCTTTACTTTGATACTCCCTTCATTGCTATATCTTGGAATGCTCGTTACGCTCTTCATAGTTATATATTTGAAACTACATCTTTTCCTCATAATAATCATGTTCATTTTCTTCAATGTTATATATTCGATACACCTTTCGGTGTAAGTAGAAGTCGAGAACTAAGTTCAGCAGCTTAATCTTGCTGTGTTTATTTTTACATGATATCTGTCGAGGAGATTATCTAGACAAGCATATAATTTGATATCTTAAGTTTTTGCTTTACAGGTTTATTCTCTTCAGGGTGGACCAGTGAATGATGCTTGGAGGCACATGGGAGTCCCAGTTACTCTAATTCAAACCTGTGATGAGACCGAGGTCATGGTTGATTGGCTAAAGTATGTTAACGCTACTCGTCACAGTGTTTTGACCCTATGACCTTTATTTCTACAGCATTCGATTAACTAAGTTACGTATTGTTTCTAATTATGTTTATGATGTTATTTCCAGCTATGATGGCATACTTATGCACTCTCTTGGAGTGAAAGACGTCTTTTCCTGGTAAGACGTTATGCAAATAACTTTTTTGCAGTTAGTTTTGTGTTCTTATTCCACGAACCGATCCATCTTATCTCGTACTAATAGCGAACTTATGGAAATTTGCCAAATTCAATAAGTTTTGTTGCTTGTTACAGCTTCCTGCAGGAACCTTTCAAATCCTTACCACTCATCTGGACCATCCATGAAGAAACCCTCGGCTTACGCTCTCGAAACTACGCTTCAAATGGGTTATTTGATCTTCTAAATGATTGGGAGAGAGTATTCAACCATTCAACTGTTGTTGTCTTTCCCAATTATGTCATGCCGGTAATGGCTAGTTGATATTTCTGTTAAGGTTTTCTGCAACCTTTGTAAGTTTATAACTTTATTGTAACCTGTGAAGTAAGGAAGTTGTCCCTTCTTATTAATTGGTTTTGTTATATGTATAATTACCACAGATGATCTATTCTGCATTTGATAGTGGGAATTTCTTCGTGATTCCGAGCTTTCCTGCCGAAGCATTGGAAGCAGAAATTGATATCACCTCCGATGCTGATAATCCGCGTGCAAAAATGGGCTATGCAAATGACGACTTGGTTATTGCCATTGTTGGAAGCCAATTTTTGTATAGGGGCATGTGGCTAGAACATACAATGATGTTGCAGGCCATGTTGCCACTACTTCACAAATTTTCTTTGGATGAGCATTCCAATTCTCATCTCAAGATATTTGTTCTAAGTGGGAATTCAAATAGCAACTACACGATGGCTGTCGAGGTGCGTGTCTGCTTTGGTTTACTTTTTTCGTTAATGAATTGCTAGCTTTTATTGAGTCATCCGATTTTCCGCAGGCGATTGCTCAGAAACTGGAATATCCAAGGAGTGTTGTGAAGCATGTTCCTGTCAATGCAGATTCAGACAATGCTCTAAGTATGGCTGACCTTGTTATATATGGTTCTTTTTTGGAAGAGCAATCTTTTCCACAGGTTTTGGTAAAAGCCATGAGCATGGGAAAACCAATCATCGCCCCGGATCTTGCCAATATTAGGAAACACGTACGTTTCTCATTTCTTATCTTCCTACTTTTCGGTAGCGTGTCGTAGACAGGATGATGCTTATTCATATTCTCAACAGTTTGAACACGTCATACATTTGTTTTCAGACATTGTTTAATTCCTTTCACAAATAAGAAAGAATTCGTAAGTAAGAACGAATATGCAACGGAAAGGGACGATCAGATATTCCCCAAAAAAAACGAAGCCAAATGAGACTGGAAACGAGCTCTCAAATTACAATTGCTTAGGGGGTATTTCCAGAGTTCCTTGTTTTCCCTTTTGGCAGTCTTTGAGTTGATTGAAGTCTTTTCTGTGTGGTCTCTTCTTTGTACTTCATACACACGTACGTACGTACGTACATACATTCATACATATATACTTACGTACGTACAAATCCTGGGCAAACTACTGACTGCATTGGCAATAACAACTAGGTTGATGACAGGGTAAACGGCTATTTGTTTCCCACGGGAAATTTCAATGTACTTTCCCAAATTATTTTGGAAGTGATCTCGAAAGGGAGAGTATCGCCACTTGCTCGTAGCATTGCTTCAACTGGACGAGGCACCGTGAAAAACTTGATGGTTTCAGAAACTGTTGTCGGATATGCCTCACTACTTGATGTTGTTCTTAAGCTTCCATCAGAAGCTGCACCAGCTAAGGAAGTCGCCGAAATCCCTTCCAAACCGAAAGAAAAATGGCAGTGGCAGTTATTTGAAGGGGTATCAAATTTGGCCATCCTGCACAGAAACAAAAAAAGTTACACAATTTTAGATGAATTTGAAAAGCATTGGAACCAAACTAAAAAGGGGAAGCCTGGTAATCCTATTGCTTTTAATGAGTCATTTGTATATGATATATGGGAGGAGGAAAAACAAACAGTGATGTCTAATATCAAAAGAAGAAGAGAGGAAGAAGAGGTGAGATTATAAGCTGTTCATCTTTGGTTTTCTATGATTTATATTGATTTTGTATTTCTTCAATATTCATTGATACTTGAATTTGACAGATAAAAGATAGAACCGAACAACCTCATAGCACGTGGGAGGAGGTGTATCGAAGTGCTAAGAGGGCTGATAGGTCTAAGAATGATTTGCATGAGAGGGATGAAGGGGAGCTTGAAAGGACTGGACAACCATTATGCATCTATGAACCTTACTTTGGGGAAGGAGTTTGGCCTTTCCTGCACCGATATTCTCTTTATCGTGGAATCGGGCTGGTAGGTTGCGCAAAATTTTTGTTATTTTTTTGTTTTCCGTGTGCAAACTTGAGGTTGAAAGACTTTTTCTTTAAATATCATCCTATGACAACCATGTGCTTTTCTGAATTATGCGATCATTATAATGGTTTTTTTATTTTTATTTTTTAACTTCAATGATATTCATATATGTGAATTAATAATGTCAATTCTGCTTGTGAGATCTCACGTCGATTGGAGAGGGAAACAAAACATTCCTTATAAGAGTGTGAAAGACTCTCCCTAGCAGACGTGTTTAAAACCTTGAGGGGAAACCCAGAAGGGAAAGTCCAAAGAGAACAATATCGGCTAGCGGTAGGCTTGAGCTGTTATAAATGGTATCAGAGCTAGAGACCATAACAGTGTGCCAGCAAGGACGCTGGGCCCCTAAAGGGTGGATTGTGAGATCCCACATCAGCTGAAGAGGGGAACGAAGTATTCCTTATAAAGGTGTGGAAACCTCTCCCTAGCATACGCATTTTAAAATCTTAAGGGAAAGCCTAGAAGGGAAAGCCCAAAGAGGACAATATCTGCTAGTAGTAGGTTTAGGTTGTTACAAATGGTATCAGAGCTAGACATCGGGTGGTGTGCCAGCAAGAACACTGACCCCCAAGTGGGTGAATTGTGAGATCCCACCTCAGTTGAAGAGGGGAACGAAGTATTCCTTATAAGGCTCTGGAAACCTCTCCCTAGCAGACGTGTTTTAAAACCTTGAGGTGAAGCCCAAAAGAGAAAGCCCAAAAGACATTATCTACTAACGGTGGGCTTGGACTTGGGCTGTTACACTGCTTAACTTCAATTGTTTGCAAGGTTCGTTGTTGATACTTAACTAATTTAGATAATTGTATTGGAGGTTAGTATTGTTTTCAGGCTTCTTTAAATTTGATATTCTAAAATTTAGATGCAATACAGGAGTGTCTGTTAATCCTTTGTTTATAATCTCTGAATGTTGTTTTGAAGTATTCTGGACGATTCCTCAAGATGGAGTGTCTTAGCTTGAGAGTTTTGTAATTGTTTCCCGCTTTCAACTAATGTTAGTTCGGAAGTTGCTGACCTGGAGTTCGTGTTCTTCGAAATTTTTATATCTTTTCATCAACCAATGAAAAGTCGTTTCTTATTCAAAAAATGAAAAAAAAAGAAAAATGAGTAGTTTTCATGTTTCTTCTTTAGTGAAATATTTCTTTACCAAAGAAAATATGCTTGATGAAAGAACATTTTCATTTCCCATCAAAAGTTGATGTCAGTTACCATCTTAATGTAGATGTTAAATGTTATTCCTGCAGTCGAGTAAAGGCAGGCGACCTGGAACTGATGATGTTGATGCACCGTCACGGCTTCCACTTCTCAGTAACCCTTACTACCGAAATGTACTCGGTGAATATGGAGCCTTCTTTGCAATTGCTAACCGGGTCGACCGCATACACAAGAATGCTTGGATCGGTTTTCAATCTTGGAGAGCCACTGCCAGGAATGTATGTGACATCGCTCTGTCCATGTTTTTTCTGAAGTTTTATTTCTTGAACTCACTTTTACATTTATAGTTTTCTTCTACTGAAACTAAGGAACATAGCTGGTGTTGCATGAGCTTTAGATAGATTATTTACACAAAAGAAGTGAATGTTCAGACTGTTCTATAAGATATTACAAAACTAATCAGCAGATACCTTCCATTGTTTGTTTCTGTGTGAGCGTGGGGCAAGGCAAATCCTCCTTCCTCGTTCCTTGCAATCTTTTGTTAGGAACCACGACCCTGTTAAGAACCATGGATCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCACGGCTTTGCTTTGGGTTTGCCCAAAAGGTCTCATACCAATGGAGATGTATTCCTTACTTATAAACCCATGATCATTCCCTAAATTAGCCAACGTGGGACTCCCTCCCAACAATCCTCGACAATCCTCCCCTCGAACAAAGTACATTATAAAGCCTCCCCCGAGGCCTATGGAGCCCTCGAACAGCCTCTCCTTAATCTAGGCTCGACTCCTTCTTTGGAGCCTTCGAACAAAGTACACCTTTTGTTCGACACTTGAGTCACTTTTGACTACATCTTCGAGGCTCACAACTTCTTTGTTCGATATTTAAGGATTTTCTTGACATGACTTAGTTCAGCCATGGCGGCTCTGATACCATGTTAGGAATTACGGATCTCTACAATGGTATGATATTGTCCACTTTGAGCTTTTCGTTTTCCCAAAAGGCCTTTTATCTATTCCTTACTTATAAACCCACAATCATCCTTTTAATTATCATATGTGGTACTCTCTCCCAACAATCCTCGACATCTTTCTGGATGGTAGAACAATATTTTGACAAAGAAAAGCAAGGCGCTTTTGTCTAACCAATGGTGTGGTTGAATAGGCATCGCTGTCAAAAATCGCTGAAACTGCATTATTAGATGCAATTGAGACGCGACGATATGGAGATGCACTCTACTTTTGGGTGCGGATGGACTCGGATCCAAGAAACCCGCTACAGCTTGATTTTTGGTCATTTTGTGATTCGATAAATGCTGGAAATTGCAAGTAAGACCTGAAAATTCATGCTCAAGTTTTTGTACTGAAACATCTGAACTCTCTTTATACAATCTTCTGATATCAATTCGTTGCCTATGCCCTTTTTGTTTATAGGTTCGCGTTCTCCGAGTCATTGAAGACGATGTATGGTATAAAAAGTGATCTCGAGTTTTTACCGCCCATGCCTGCAGATGGCTATACATGGTCCTCTATGCAGAGCTGGGTTCTACCAACCGGATCTTTTCTCGAATTTGTCATGTTTTCAAGGTTTGTAAGAGATCATTCTCCTATCTTCACTGCACCATGGCTTTAGTTCATTCATTTCTTTTGTCTGCAGAATGTTCGTTGATGCCTTAGATGCGCAAATGTACGACGAACACCGGACAAGTGGACGGTGTTATTTGAGTTTGTCCAAAGTAACGCCCCTCGTCCTTTTAGATGTCAACATGGAACTATGAGCATTTCATTGTAATTGCTAGAGTTGAACTGTACTTTTATTTACAGGACAAGCACTGCTACTCCCGGCTACTCGAGCTCCTCGTAAACGTTTGGGCATATCACAGTGCAAGGCGTATGGTGTACATGAACCCCGAAACCGGCGCAATGCAAGAGCAACACAAGTTCGACACACGGAGAGGCAAAATGTGGATCAAATGGTTTTCATACGGCATGATAAAGAACATGGACGAGGAGTTAGGAGAGGAAGCAGATACCGACCACCCCACGAGACGGTGGTTGTGGCCATCGACAGGCGAAGTGTTTTGGGAAGGCATGTACGAGAGAGAGAAGAATTTACGATATCGACAGAAAGAGAACAGAAAGCAAAAGAGTAAAGCTAAGCTAGACAGAATGAGACATAGGAGACACCAAAAGGTTATAGGAAAGTATGTAAAGCCTCCACCAGAGATGGAAAATTCAACCACAACAATGGGTACAGAAGCTATTTTGTGATGGAAATTGGAAGGTTAGTGAAAAAAGTAGAGTCTAATCTTAAATTCATTTCATTTTTAACATCTTTTTTGTTGTGTTAAAAAATTATTGTTCTACATTGTTCTTGACCGACTGCTGCCATTGTATGTTGTTCACTCACATGTATTATATTATGTTGTTTTTGTATGCCATTTCACTTCTAAATGTAAATCTAATACTCTTTGGAAGTGGGTAAATTTGGATGGTTTATGTTCGGAAGAGTTTTATCGACTTAGACTCGGTTATGAATTTTAGGATATGTTTCTATGACCATTTTGTTCATTGTTTCTCTCCCTCTCTCTCGACTTGGACTCAGTTATGAATTTTAGAGTATGTTTCG

mRNA sequence

CGATGCCTATATCCACGTCAACGTCTTTTCTCAAACCTCGCTCTCTAACTTAACCAATTTTTACTCAGCTCACAACCATCCTTCTAGTTTTTTTTGCTCTGAATTTACTGCCTCCTCTACTCTGCTAAATCTATCGCACCCCAGTTCAGCTCTTGTCGGCGATTAGTGGGTCATTTTTTGTTTTTGATTCTGAGTCTAAAATTCCCAAGTTATTGCTGTGAAAATTGGTGTTGTTTGATGCGATTGATCTTGGGATGTTGAGTTGAAGCAAATGGGTTCTCTGGAGAATGGATATCCATTGAAGAGAGACCCGCTTCTTCGTTCTTCATCGATTAGCAGAGGCGAAAGGTACCCATTTCTACAGAGACCCAGATCGAGATTTTCTCGGTTTTTGCTTTTTCAAAAGATTGATTACTTGCAATGGATTTGTACTGTGGGTGTGTTCTTGTTCTTCGTGGTTCTTTTTCAAATGTTCTTGCCTGGATCAGTCATGGAGAAGTCTGACATTGCCTTTAAAGATGTGGAGAAAAGTTTAGGGGATTTGGAGTTCTTGAAGGAGTTGGGTATGTTGGAATTTGGGGAGGATATTCGATTTGAGCCGTCGAAGCTTTTGGAGAAGCTTAAGAAAGAGGCAAGAGAAGGGGGTTTTTCATCTTTCAATAGAACTATTAATCGTTTTGGGTATAGGAAACCTCAGCTTGCTATGGTGTTTTCAGATCTGTTGGTTGATTCTTACCAAGTTCTAATGGTAACCATTGCATCTGCTCTGCAAGAGATAGGATATGCAATTCAGGTTTATTCTCTTCAGGGTGGACCAGTGAATGATGCTTGGAGGCACATGGGAGTCCCAGTTACTCTAATTCAAACCTGTGATGAGACCGAGGTCATGGTTGATTGGCTAAACTATGATGGCATACTTATGCACTCTCTTGGAGTGAAAGACGTCTTTTCCTGCTTCCTGCAGGAACCTTTCAAATCCTTACCACTCATCTGGACCATCCATGAAGAAACCCTCGGCTTACGCTCTCGAAACTACGCTTCAAATGGGTTATTTGATCTTCTAAATGATTGGGAGAGAGTATTCAACCATTCAACTGTTGTTGTCTTTCCCAATTATGTCATGCCGATGATCTATTCTGCATTTGATAGTGGGAATTTCTTCGTGATTCCGAGCTTTCCTGCCGAAGCATTGGAAGCAGAAATTGATATCACCTCCGATGCTGATAATCCGCGTGCAAAAATGGGCTATGCAAATGACGACTTGGTTATTGCCATTGTTGGAAGCCAATTTTTGTATAGGGGCATGTGGCTAGAACATACAATGATGTTGCAGGCCATGTTGCCACTACTTCACAAATTTTCTTTGGATGAGCATTCCAATTCTCATCTCAAGATATTTGTTCTAAGTGGGAATTCAAATAGCAACTACACGATGGCTGTCGAGGCATCGCTGTCAAAAATCGCTGAAACTGCATTATTAGATGCAATTGAGACGCGACGATATGGAGATGCACTCTACTTTTGGGTGCGGATGGACTCGGATCCAAGAAACCCGCTACAGCTTGATTTTTGGTCATTTTGTGATTCGATAAATGCTGGAAATTGCAAGTTCGCGTTCTCCGAGTCATTGAAGACGATGTATGGTATAAAAAGTGATCTCGAGTTTTTACCGCCCATGCCTGCAGATGGCTATACATGGTCCTCTATGCAGAGCTGGGTTCTACCAACCGGATCTTTTCTCGAATTTGTCATGTTTTCAAGAATGTTCGTTGATGCCTTAGATGCGCAAATGTACGACGAACACCGGACAAGTGGACGGTGTTATTTGAGTTTGTCCAAAGACAAGCACTGCTACTCCCGGCTACTCGAGCTCCTCGTAAACGTTTGGGCATATCACAGTGCAAGGCGTATGGTGTACATGAACCCCGAAACCGGCGCAATGCAAGAGCAACACAAGTTCGACACACGGAGAGGCAAAATGTGGATCAAATGGTTTTCATACGGCATGATAAAGAACATGGACGAGGAGTTAGGAGAGGAAGCAGATACCGACCACCCCACGAGACGGTGGTTGTGGCCATCGACAGGCGAAGTGTTTTGGGAAGGCATGTACGAGAGAGAGAAGAATTTACGATATCGACAGAAAGAGAACAGAAAGCAAAAGAGTAAAGCTAAGCTAGACAGAATGAGACATAGGAGACACCAAAAGGTTATAGGAAAGTATGTAAAGCCTCCACCAGAGATGGAAAATTCAACCACAACAATGGGTACAGAAGCTATTTTGTGATGGAAATTGGAAGGTTAGTGAAAAAAGTAGAGTCTAATCTTAAATTCATTTCATTTTTAACATCTTTTTTGTTGTGTTAAAAAATTATTGTTCTACATTGTTCTTGACCGACTGCTGCCATTGTATGTTGTTCACTCACATGTATTATATTATGTTGTTTTTGTATGCCATTTCACTTCTAAATGTAAATCTAATACTCTTTGGAAGTGGGTAAATTTGGATGGTTTATGTTCGGAAGAGTTTTATCGACTTAGACTCGGTTATGAATTTTAGGATATGTTTCTATGACCATTTTGTTCATTGTTTCTCTCCCTCTCTCTCGACTTGGACTCAGTTATGAATTTTAGAGTATGTTTCG

Coding sequence (CDS)

ATGGGTTCTCTGGAGAATGGATATCCATTGAAGAGAGACCCGCTTCTTCGTTCTTCATCGATTAGCAGAGGCGAAAGGTACCCATTTCTACAGAGACCCAGATCGAGATTTTCTCGGTTTTTGCTTTTTCAAAAGATTGATTACTTGCAATGGATTTGTACTGTGGGTGTGTTCTTGTTCTTCGTGGTTCTTTTTCAAATGTTCTTGCCTGGATCAGTCATGGAGAAGTCTGACATTGCCTTTAAAGATGTGGAGAAAAGTTTAGGGGATTTGGAGTTCTTGAAGGAGTTGGGTATGTTGGAATTTGGGGAGGATATTCGATTTGAGCCGTCGAAGCTTTTGGAGAAGCTTAAGAAAGAGGCAAGAGAAGGGGGTTTTTCATCTTTCAATAGAACTATTAATCGTTTTGGGTATAGGAAACCTCAGCTTGCTATGGTGTTTTCAGATCTGTTGGTTGATTCTTACCAAGTTCTAATGGTAACCATTGCATCTGCTCTGCAAGAGATAGGATATGCAATTCAGGTTTATTCTCTTCAGGGTGGACCAGTGAATGATGCTTGGAGGCACATGGGAGTCCCAGTTACTCTAATTCAAACCTGTGATGAGACCGAGGTCATGGTTGATTGGCTAAACTATGATGGCATACTTATGCACTCTCTTGGAGTGAAAGACGTCTTTTCCTGCTTCCTGCAGGAACCTTTCAAATCCTTACCACTCATCTGGACCATCCATGAAGAAACCCTCGGCTTACGCTCTCGAAACTACGCTTCAAATGGGTTATTTGATCTTCTAAATGATTGGGAGAGAGTATTCAACCATTCAACTGTTGTTGTCTTTCCCAATTATGTCATGCCGATGATCTATTCTGCATTTGATAGTGGGAATTTCTTCGTGATTCCGAGCTTTCCTGCCGAAGCATTGGAAGCAGAAATTGATATCACCTCCGATGCTGATAATCCGCGTGCAAAAATGGGCTATGCAAATGACGACTTGGTTATTGCCATTGTTGGAAGCCAATTTTTGTATAGGGGCATGTGGCTAGAACATACAATGATGTTGCAGGCCATGTTGCCACTACTTCACAAATTTTCTTTGGATGAGCATTCCAATTCTCATCTCAAGATATTTGTTCTAAGTGGGAATTCAAATAGCAACTACACGATGGCTGTCGAGGCATCGCTGTCAAAAATCGCTGAAACTGCATTATTAGATGCAATTGAGACGCGACGATATGGAGATGCACTCTACTTTTGGGTGCGGATGGACTCGGATCCAAGAAACCCGCTACAGCTTGATTTTTGGTCATTTTGTGATTCGATAAATGCTGGAAATTGCAAGTTCGCGTTCTCCGAGTCATTGAAGACGATGTATGGTATAAAAAGTGATCTCGAGTTTTTACCGCCCATGCCTGCAGATGGCTATACATGGTCCTCTATGCAGAGCTGGGTTCTACCAACCGGATCTTTTCTCGAATTTGTCATGTTTTCAAGAATGTTCGTTGATGCCTTAGATGCGCAAATGTACGACGAACACCGGACAAGTGGACGGTGTTATTTGAGTTTGTCCAAAGACAAGCACTGCTACTCCCGGCTACTCGAGCTCCTCGTAAACGTTTGGGCATATCACAGTGCAAGGCGTATGGTGTACATGAACCCCGAAACCGGCGCAATGCAAGAGCAACACAAGTTCGACACACGGAGAGGCAAAATGTGGATCAAATGGTTTTCATACGGCATGATAAAGAACATGGACGAGGAGTTAGGAGAGGAAGCAGATACCGACCACCCCACGAGACGGTGGTTGTGGCCATCGACAGGCGAAGTGTTTTGGGAAGGCATGTACGAGAGAGAGAAGAATTTACGATATCGACAGAAAGAGAACAGAAAGCAAAAGAGTAAAGCTAAGCTAGACAGAATGAGACATAGGAGACACCAAAAGGTTATAGGAAAGTATGTAAAGCCTCCACCAGAGATGGAAAATTCAACCACAACAATGGGTACAGAAGCTATTTTGTGA

Protein sequence

MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLFFVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKEAREGGFSSFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQGGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPLIWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVIPSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPLLHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEASLSKIAETALLDAIETRRYGDALYFWVRMDSDPRNPLQLDFWSFCDSINAGNCKFAFSESLKTMYGIKSDLEFLPPMPADGYTWSSMQSWVLPTGSFLEFVMFSRMFVDALDAQMYDEHRTSGRCYLSLSKDKHCYSRLLELLVNVWAYHSARRMVYMNPETGAMQEQHKFDTRRGKMWIKWFSYGMIKNMDEELGEEADTDHPTRRWLWPSTGEVFWEGMYEREKNLRYRQKENRKQKSKAKLDRMRHRRHQKVIGKYVKPPPEMENSTTTMGTEAIL
BLAST of Cp4.1LG12g03940 vs. TrEMBL
Match: A0A0A0K892_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006880 PE=4 SV=1)

HSP 1 Score: 700.7 bits (1807), Expect = 1.8e-198
Identity = 346/397 (87.15%), Postives = 370/397 (93.20%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLENG+PLKRDPLLRSSS  RGERYPFLQRPRSRFSRFL F+KIDYLQWICTV VF F
Sbjct: 1   MGSLENGFPLKRDPLLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFFF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGSV+EKS++A KDVEKSLGDL+FLKELGML+FGEDIRFEPSKLL K KKE
Sbjct: 61  FVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKKE 120

Query: 121 AREGGFSSFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQG 180
           ARE  FSSFNRT +RFGYRKPQLA+VFSDLLVDSYQVLMVTIASALQEIGY  QVYSLQG
Sbjct: 121 AREADFSSFNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQG 180

Query: 181 GPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPLI 240
           GP ND WR MGVPVTLIQ+CDETEVMVDWLNYDGIL+HSLGVKDVFSC+LQEPFKSLPLI
Sbjct: 181 GPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLPLI 240

Query: 241 WTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVIP 300
           WTIHEE L +RS+NYAS+GL D+LNDW+RVFNHSTVVVFPNYVMPMIYSA+DSGNFFVIP
Sbjct: 241 WTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFVIP 300

Query: 301 SFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPLL 360
           SFPAEALEAEID+TSDADN RAKMGYANDDLVIAIVGSQFLYRGMWLEH M+LQAMLPLL
Sbjct: 301 SFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLPLL 360

Query: 361 HKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEASLSKI 398
           H+FS  EHSNS LKIFVLSG+SNSNYTMAVEA   ++
Sbjct: 361 HEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRL 397

BLAST of Cp4.1LG12g03940 vs. TrEMBL
Match: M5X6C9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000692mg PE=4 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 2.1e-151
Identity = 268/393 (68.19%), Postives = 325/393 (82.70%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLE+G PLKRDPLLRSSS  R ER+PFLQRPRS+FSRFLL +K+DYLQWICTV VFLF
Sbjct: 1   MGSLESGVPLKRDPLLRSSSTGRTERHPFLQRPRSKFSRFLLIKKLDYLQWICTVAVFLF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGSV+EKS +  K+VE +  DL FLKELG+L+FGEDIRFEPSKLLEK +KE
Sbjct: 61  FVVLFQMFLPGSVVEKSRVLMKNVELNSEDLRFLKELGLLDFGEDIRFEPSKLLEKFQKE 120

Query: 121 AREGGFSS-FNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQ 180
           ARE   +S  NRT   FGYRKPQLA+VF+DL V S Q+LMVT+A+ALQEIGYA  VYSL+
Sbjct: 121 AREASLTSAMNRTRQHFGYRKPQLALVFADLSVASQQLLMVTVAAALQEIGYAFSVYSLE 180

Query: 181 GGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPL 240
            GPV+D WR +GVPVT+IQT D++E+ +DWLNYDGIL++SL  K +FSCF+QEPFKSLP+
Sbjct: 181 DGPVHDVWRSLGVPVTIIQTYDQSELNIDWLNYDGILVNSLEAKGIFSCFVQEPFKSLPI 240

Query: 241 IWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVI 300
           +WTIHE+ L  RSR Y+SN   +L NDW+R+F+ STVVVFPNY +PM YS FD+GNFFVI
Sbjct: 241 LWTIHEQALATRSRKYSSNRQIELFNDWKRLFSRSTVVVFPNYFLPMAYSVFDAGNFFVI 300

Query: 301 PSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPL 360
           P  PAEA +A+  +  D ++  AKMGY ++D+VI IVGSQFLYRG+WLEH+++L+A+LPL
Sbjct: 301 PGSPAEACKADSIMVLDKNHLLAKMGYGSEDVVITIVGSQFLYRGLWLEHSIVLRAVLPL 360

Query: 361 LHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEA 393
           L  F LD +S SHLKI VLSG+S SNY+  VEA
Sbjct: 361 LEDFPLDNNSYSHLKIIVLSGDSTSNYSSVVEA 393

BLAST of Cp4.1LG12g03940 vs. TrEMBL
Match: W9R020_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022487 PE=4 SV=1)

HSP 1 Score: 525.4 bits (1352), Expect = 1.0e-145
Identity = 266/396 (67.17%), Postives = 320/396 (80.81%), Query Frame = 1

Query: 1   MGSLENGY--PLKRDPLLRSSSIS-RGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGV 60
           MGSLE G   P KRDP LRS+S + R +R PFLQR RSRFSRF LF+K+DYLQWICTV V
Sbjct: 1   MGSLEGGSATPFKRDPFLRSASFTGRSDRNPFLQRQRSRFSRFFLFKKLDYLQWICTVAV 60

Query: 61  FLFFVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKL 120
           FLFFVVLFQMFLPGSV+EKS    +D E S GDL FLKE G+L+FGEDIRFEPSK+LEK 
Sbjct: 61  FLFFVVLFQMFLPGSVVEKSIKTHRDEEFSSGDLFFLKEYGILDFGEDIRFEPSKVLEKF 120

Query: 121 KKEAREGGFS-SFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVY 180
           ++E +E   S +FNR+  R+ ++KPQLA+VF+DLLVDS Q+LMVT+A+ALQEIGY IQVY
Sbjct: 121 RRENKEVNLSHAFNRSRLRYPHKKPQLALVFADLLVDSQQLLMVTVAAALQEIGYEIQVY 180

Query: 181 SLQGGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKS 240
           SL+GGPV+  WR++GVPV++IQ CD  +V VDWL YDGIL++S   KD+FSCF+QEPFKS
Sbjct: 181 SLEGGPVHGIWRNLGVPVSIIQACDPADVTVDWLIYDGILVNSFEAKDMFSCFVQEPFKS 240

Query: 241 LPLIWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNF 300
           LPL+WTIH+  L  RSRNY SN   +LLNDW+R FN STVVVFPNYV+PMIYS FDSGNF
Sbjct: 241 LPLVWTIHDRALATRSRNYTSNKQIELLNDWKRAFNRSTVVVFPNYVLPMIYSTFDSGNF 300

Query: 301 FVIPSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAM 360
           FVIP  PAEA + E  + S+ D  RAKMGY ++D+VI IVGS+ LYRG+WLEH+++LQA+
Sbjct: 301 FVIPGSPAEAWKIETLMESEKDYLRAKMGYGHEDIVITIVGSELLYRGLWLEHSIVLQAL 360

Query: 361 LPLLHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEA 393
            PLL  FS DE+S SHLKI VLSG+  SNY+ AVEA
Sbjct: 361 FPLLEDFSSDENSFSHLKIIVLSGDPTSNYSSAVEA 396

BLAST of Cp4.1LG12g03940 vs. TrEMBL
Match: A0A067JXC8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14313 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 2.1e-143
Identity = 260/393 (66.16%), Postives = 320/393 (81.42%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQR-PRSRFSRFLLFQKIDYLQWICTVGVFL 60
           MGSLE   PLKR+ LLRSSS  R   + F+QR PRSRFSRFLLF+K+DYLQWICTV VFL
Sbjct: 1   MGSLETVLPLKRESLLRSSSAGR---HSFMQRQPRSRFSRFLLFKKLDYLQWICTVAVFL 60

Query: 61  FFVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKK 120
           FFVVLFQMFLPGSV+EKS+ ++K+VE   GDL +LKE+G  +FGEDI+FEPSK+L+K +K
Sbjct: 61  FFVVLFQMFLPGSVIEKSEDSWKEVENVSGDLMYLKEIGTWDFGEDIKFEPSKILQKFQK 120

Query: 121 EAREGGFSS-FNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSL 180
           E RE  FSS FNRT  RFGY+KPQLA+VF+DL  D  Q+LMVT+A+ALQEIGY+IQV+S+
Sbjct: 121 EVREVNFSSSFNRTQLRFGYKKPQLALVFADLSADPQQLLMVTVATALQEIGYSIQVFSI 180

Query: 181 QGGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLP 240
           Q GPVN  W+ +GVPVT+ Q   + E+ VDWL YDGIL++SL  K +FSCF+QEPFKS+P
Sbjct: 181 QDGPVNGIWKSIGVPVTIFQRNHKMEIAVDWLIYDGILVNSLETKAIFSCFMQEPFKSIP 240

Query: 241 LIWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFV 300
           LIWTIHE TL +RSR YAS+G  +L++DW+RVFN +TVVVFPNY +PM+YSAFD+GN++V
Sbjct: 241 LIWTIHERTLAIRSRQYASDGQTELVSDWKRVFNRATVVVFPNYALPMMYSAFDAGNYYV 300

Query: 301 IPSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLP 360
           IP  PAEA EA++ +    DN R KMGY  DD+VIAIVG QFLYRG+WLEH ++LQA+LP
Sbjct: 301 IPGSPAEAWEADV-MALYKDNVRLKMGYGPDDVVIAIVGGQFLYRGLWLEHALILQALLP 360

Query: 361 LLHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVE 392
               F  D++SNSHLKI VLSGNS SNY++AVE
Sbjct: 361 AFQDFPFDDNSNSHLKIIVLSGNSTSNYSVAVE 389

BLAST of Cp4.1LG12g03940 vs. TrEMBL
Match: F6I683_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g01620 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 1.7e-137
Identity = 249/397 (62.72%), Postives = 310/397 (78.09%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLENG P+KRDPLLRSSS ++G  +   QRP  RFSRFL F K+DYLQW+CTV VF F
Sbjct: 1   MGSLENGVPVKRDPLLRSSS-NKGSAF---QRPIVRFSRFLFFGKLDYLQWVCTVAVFCF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPG +MEKS  + K++E   GDL F+K +G L+FGE IRFEPSKLL+K +KE
Sbjct: 61  FVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQKE 120

Query: 121 AREGGFSSFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQG 180
           A E   SS +R  +RFGYRKPQLA+VF DLLVD  Q+LMVT+ASAL E+GY IQVYSL+ 
Sbjct: 121 ADEVNLSSASRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSLED 180

Query: 181 GPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPLI 240
           GPVN  WR++G PVT+I++  ++  +VDWLNYDGI+++SL  + V SCF+QEPFKSLPLI
Sbjct: 181 GPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLPLI 240

Query: 241 WTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVIP 300
           WTI E TL  R R Y   G  +L+NDW++VFN +T VVFPNYV+PMIYS FDSGN+FVIP
Sbjct: 241 WTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFVIP 300

Query: 301 SFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPLL 360
             PA+A E +  + S  D+PR KMGY  DD VIA+V SQFLY+G+WLEH ++LQA+LPL+
Sbjct: 301 GSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLPLV 360

Query: 361 HKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEASLSKI 398
            +F +D +SNSHLKI + SGNS +NY++AVEA   K+
Sbjct: 361 AEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKL 393

BLAST of Cp4.1LG12g03940 vs. TAIR10
Match: AT4G01210.1 (AT4G01210.1 glycosyl transferase family 1 protein)

HSP 1 Score: 399.8 bits (1026), Expect = 3.3e-111
Identity = 192/275 (69.82%), Postives = 224/275 (81.45%), Query Frame = 1

Query: 389  AVEASLSKIAETALLDAIETRRYGDALYFWVRMDSDPRNPLQLDFWSFCDSINAGNCKFA 448
            A + SLSKIAE ALL+AI+TR++GDALYFWVRMD DPRNPLQ  FWSFCD+INAGNC+FA
Sbjct: 747  ARKESLSKIAEDALLNAIQTRKHGDALYFWVRMDKDPRNPLQKPFWSFCDAINAGNCRFA 806

Query: 449  FSESLKTMYGIKSDLEFLPPMPADGYTWSSMQSWVLPTGSFLEFVMFSRMFVDALDAQMY 508
            ++E+LK MY IK+ L+ LPPMP DG TWS MQSW LPT SFLEFVMFSRMFVD+LDAQ+Y
Sbjct: 807  YNETLKKMYSIKN-LDSLPPMPEDGDTWSVMQSWALPTRSFLEFVMFSRMFVDSLDAQIY 866

Query: 509  DEHRTSGRCYLSLSKDKHCYSRLLELLVNVWAYHSARRMVYMNPETGAMQEQHKFDTRRG 568
            +EH  + RCYLSL+KDKHCYSR+LELLVNVWAYHSARR+VY++PETG MQEQHK   RRG
Sbjct: 867  EEHHRTNRCYLSLTKDKHCYSRVLELLVNVWAYHSARRIVYIDPETGLMQEQHKQKNRRG 926

Query: 569  KMWIKWFSYGMIKNMDEELGEEADTDHPTRRWLWPSTGEVFWEGMYEREKNLRYRQKENR 628
            KMW+KWF Y  +K MDE+L EEAD+D     WLWP TGE+ W G  E+EK  +  +KE +
Sbjct: 927  KMWVKWFDYTTLKTMDEDLAEEADSDRRVGHWLWPWTGEIVWRGTLEKEKQKKNLEKEEK 986

Query: 629  KQKSKAKLDRMRHRR-HQKVIGKYVKPPPEMENST 663
            K+KS+ KL RMR R   QKVIGKYVKPPPE E  T
Sbjct: 987  KKKSRDKLSRMRSRSGRQKVIGKYVKPPPENETVT 1020

BLAST of Cp4.1LG12g03940 vs. TAIR10
Match: AT5G04480.1 (AT5G04480.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 275.0 bits (702), Expect = 1.2e-73
Identity = 136/266 (51.13%), Postives = 186/266 (69.92%), Query Frame = 1

Query: 391  EASLSKIAETALLDAIETRRYGDALYFWVRMDSDPR---NPLQLDFWSFCDSINAGNCKF 450
            + SLS  AE +L + I+    G+ +YFW R+D D     +   L FWS CD +N GNC+ 
Sbjct: 785  KVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRT 844

Query: 451  AFSESLKTMYGIKSDLEFLPPMPADGYTWSSMQSWVLPTGSFLEFVMFSRMFVDALDAQM 510
             F ++ + MYG+   +E LPPMP DG+ WSS+ +WV+PT SFLEFVMFSRMF ++LDA +
Sbjct: 845  TFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDA-L 904

Query: 511  YDEHRTSGRCYL--SLSKDKHCYSRLLELLVNVWAYHSARRMVYMNPETGAMQEQHKFDT 570
            ++    S  C L  SL + KHCY R+LELLVNVWAYHS R+MVY+NP  G+++EQH    
Sbjct: 905  HNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQ 964

Query: 571  RRGKMWIKWFSYGMIKNMDEELGEEA-DTDHPTRRWLWPSTGEVFWEGMYEREKNLRYRQ 630
            R+G MW K+F++ ++K+MDE+L E A D DHP  RWLWP TGEV W+G+YERE+  RYR 
Sbjct: 965  RKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRL 1024

Query: 631  KENRKQKSKAKL-DRMRHRRHQKVIG 650
            K ++K+K+K KL DR+++   QK +G
Sbjct: 1025 KMDKKRKTKEKLYDRIKNGYKQKSLG 1049

BLAST of Cp4.1LG12g03940 vs. NCBI nr
Match: gi|659116602|ref|XP_008458158.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103497681 [Cucumis melo])

HSP 1 Score: 702.6 bits (1812), Expect = 6.7e-199
Identity = 350/397 (88.16%), Postives = 370/397 (93.20%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLENG+PLKRDPLLRSSS  RGER+PFLQRPRSRFSRFL F+KIDYLQWICTV VF F
Sbjct: 1   MGSLENGFPLKRDPLLRSSSSVRGERFPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFXF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGSVMEKS+IA KDVEKSLGDL+FLKELGML+FGEDIRFEPSKLL K KKE
Sbjct: 61  FVVLFQMFLPGSVMEKSEIALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKKE 120

Query: 121 AREGGFSSFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQG 180
           ARE  FSSFNRT +RFGYRKPQLA+VFSDLLVDSYQVLMVTIASALQEIGY  QVYSLQG
Sbjct: 121 AREADFSSFNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQG 180

Query: 181 GPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPLI 240
           GP ND WR MGVPVT+IQTCDETEVMVDWLNYDGILMHSLGVKDVFSC+LQEPFKSLPLI
Sbjct: 181 GPANDVWRQMGVPVTIIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCYLQEPFKSLPLI 240

Query: 241 WTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVIP 300
           WTIHEE L LRS+NYAS+GL DLLNDW+RVFNHSTVVVFPNYVMPMIYSA+DSGNFFVIP
Sbjct: 241 WTIHEEALALRSQNYASDGLLDLLNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFVIP 300

Query: 301 SFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPLL 360
           SFPAEALEAEID+TSDAD  RAKMGYANDDLVIAIVGSQFLYRGMWLEH M+LQAMLPLL
Sbjct: 301 SFPAEALEAEIDVTSDADILRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLPLL 360

Query: 361 HKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEASLSKI 398
           H+FSL EHSNS LKIFVLSG+SNSNYTMAVEA   ++
Sbjct: 361 HEFSLYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRL 397

BLAST of Cp4.1LG12g03940 vs. NCBI nr
Match: gi|449441374|ref|XP_004138457.1| (PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus])

HSP 1 Score: 700.7 bits (1807), Expect = 2.5e-198
Identity = 346/397 (87.15%), Postives = 370/397 (93.20%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLENG+PLKRDPLLRSSS  RGERYPFLQRPRSRFSRFL F+KIDYLQWICTV VF F
Sbjct: 1   MGSLENGFPLKRDPLLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFFF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGSV+EKS++A KDVEKSLGDL+FLKELGML+FGEDIRFEPSKLL K KKE
Sbjct: 61  FVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKKE 120

Query: 121 AREGGFSSFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQG 180
           ARE  FSSFNRT +RFGYRKPQLA+VFSDLLVDSYQVLMVTIASALQEIGY  QVYSLQG
Sbjct: 121 AREADFSSFNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQG 180

Query: 181 GPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPLI 240
           GP ND WR MGVPVTLIQ+CDETEVMVDWLNYDGIL+HSLGVKDVFSC+LQEPFKSLPLI
Sbjct: 181 GPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLPLI 240

Query: 241 WTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVIP 300
           WTIHEE L +RS+NYAS+GL D+LNDW+RVFNHSTVVVFPNYVMPMIYSA+DSGNFFVIP
Sbjct: 241 WTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFVIP 300

Query: 301 SFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPLL 360
           SFPAEALEAEID+TSDADN RAKMGYANDDLVIAIVGSQFLYRGMWLEH M+LQAMLPLL
Sbjct: 301 SFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLPLL 360

Query: 361 HKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEASLSKI 398
           H+FS  EHSNS LKIFVLSG+SNSNYTMAVEA   ++
Sbjct: 361 HEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRL 397

BLAST of Cp4.1LG12g03940 vs. NCBI nr
Match: gi|596047484|ref|XP_007220285.1| (hypothetical protein PRUPE_ppa000692mg [Prunus persica])

HSP 1 Score: 544.3 bits (1401), Expect = 3.0e-151
Identity = 268/393 (68.19%), Postives = 325/393 (82.70%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLE+G PLKRDPLLRSSS  R ER+PFLQRPRS+FSRFLL +K+DYLQWICTV VFLF
Sbjct: 1   MGSLESGVPLKRDPLLRSSSTGRTERHPFLQRPRSKFSRFLLIKKLDYLQWICTVAVFLF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGSV+EKS +  K+VE +  DL FLKELG+L+FGEDIRFEPSKLLEK +KE
Sbjct: 61  FVVLFQMFLPGSVVEKSRVLMKNVELNSEDLRFLKELGLLDFGEDIRFEPSKLLEKFQKE 120

Query: 121 AREGGFSS-FNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSLQ 180
           ARE   +S  NRT   FGYRKPQLA+VF+DL V S Q+LMVT+A+ALQEIGYA  VYSL+
Sbjct: 121 AREASLTSAMNRTRQHFGYRKPQLALVFADLSVASQQLLMVTVAAALQEIGYAFSVYSLE 180

Query: 181 GGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLPL 240
            GPV+D WR +GVPVT+IQT D++E+ +DWLNYDGIL++SL  K +FSCF+QEPFKSLP+
Sbjct: 181 DGPVHDVWRSLGVPVTIIQTYDQSELNIDWLNYDGILVNSLEAKGIFSCFVQEPFKSLPI 240

Query: 241 IWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFVI 300
           +WTIHE+ L  RSR Y+SN   +L NDW+R+F+ STVVVFPNY +PM YS FD+GNFFVI
Sbjct: 241 LWTIHEQALATRSRKYSSNRQIELFNDWKRLFSRSTVVVFPNYFLPMAYSVFDAGNFFVI 300

Query: 301 PSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLPL 360
           P  PAEA +A+  +  D ++  AKMGY ++D+VI IVGSQFLYRG+WLEH+++L+A+LPL
Sbjct: 301 PGSPAEACKADSIMVLDKNHLLAKMGYGSEDVVITIVGSQFLYRGLWLEHSIVLRAVLPL 360

Query: 361 LHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEA 393
           L  F LD +S SHLKI VLSG+S SNY+  VEA
Sbjct: 361 LEDFPLDNNSYSHLKIIVLSGDSTSNYSSVVEA 393

BLAST of Cp4.1LG12g03940 vs. NCBI nr
Match: gi|1000982607|ref|XP_015584512.1| (PREDICTED: uncharacterized protein LOC8286706 [Ricinus communis])

HSP 1 Score: 526.9 bits (1356), Expect = 5.0e-146
Identity = 258/394 (65.48%), Postives = 322/394 (81.73%), Query Frame = 1

Query: 1   MGSLENGYPLKRDPLLRSSSISRGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGVFLF 60
           MGSLENG  LKR+ LLRSSS  R ER+PFLQRPRSRFSRFLLF+K+DYLQWICTV VFLF
Sbjct: 1   MGSLENGGSLKRESLLRSSSAGRNERHPFLQRPRSRFSRFLLFKKLDYLQWICTVAVFLF 60

Query: 61  FVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKLKKE 120
           FVVLFQMFLPGS+++KS+++ K +E   GDL +LK +G L+FGED++F+P KLLEK +KE
Sbjct: 61  FVVLFQMFLPGSMIDKSEVSLKKLEIVPGDLLYLKAMGTLDFGEDVQFQPLKLLEKFQKE 120

Query: 121 AREGGFSS--FNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVYSL 180
            RE   +S  FNRT+ RFGYRKPQLA+VF+DLL D  Q+LMVT+A+ALQEIGYAIQV+S+
Sbjct: 121 NREVNLTSSAFNRTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSV 180

Query: 181 QGGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKSLP 240
             GPV+D W+ +GVPVT+ QT  + E+ VDWL +D I+++SL  K VF CF+QEPFKS+P
Sbjct: 181 NDGPVHDIWKRIGVPVTIFQTNHKMEIAVDWLIFDSIIVNSLEAKVVFPCFMQEPFKSIP 240

Query: 241 LIWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNFFV 300
           LIWTIHE+TLG+RSR Y SNG  +L++DW+RVFN +TVVVFPN+V+PM+YSAFD+ N++V
Sbjct: 241 LIWTIHEKTLGIRSRQYISNGQIELVSDWKRVFNRATVVVFPNHVLPMMYSAFDAENYYV 300

Query: 301 IPSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAMLP 360
           IP  PAE  EAE       D+ R KMGY  DD++IAIVGSQFLYRG+WLEH ++LQA+ P
Sbjct: 301 IPGSPAEVWEAEAMAAVYKDSIRMKMGYRPDDIIIAIVGSQFLYRGLWLEHALILQALSP 360

Query: 361 LLHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEA 393
           L   FS D++SN HLKI VLSGNS SNY++A+EA
Sbjct: 361 LFSDFSFDDNSNPHLKIIVLSGNSTSNYSVAIEA 394

BLAST of Cp4.1LG12g03940 vs. NCBI nr
Match: gi|703086012|ref|XP_010092892.1| (hypothetical protein L484_022487 [Morus notabilis])

HSP 1 Score: 525.4 bits (1352), Expect = 1.5e-145
Identity = 266/396 (67.17%), Postives = 320/396 (80.81%), Query Frame = 1

Query: 1   MGSLENGY--PLKRDPLLRSSSIS-RGERYPFLQRPRSRFSRFLLFQKIDYLQWICTVGV 60
           MGSLE G   P KRDP LRS+S + R +R PFLQR RSRFSRF LF+K+DYLQWICTV V
Sbjct: 1   MGSLEGGSATPFKRDPFLRSASFTGRSDRNPFLQRQRSRFSRFFLFKKLDYLQWICTVAV 60

Query: 61  FLFFVVLFQMFLPGSVMEKSDIAFKDVEKSLGDLEFLKELGMLEFGEDIRFEPSKLLEKL 120
           FLFFVVLFQMFLPGSV+EKS    +D E S GDL FLKE G+L+FGEDIRFEPSK+LEK 
Sbjct: 61  FLFFVVLFQMFLPGSVVEKSIKTHRDEEFSSGDLFFLKEYGILDFGEDIRFEPSKVLEKF 120

Query: 121 KKEAREGGFS-SFNRTINRFGYRKPQLAMVFSDLLVDSYQVLMVTIASALQEIGYAIQVY 180
           ++E +E   S +FNR+  R+ ++KPQLA+VF+DLLVDS Q+LMVT+A+ALQEIGY IQVY
Sbjct: 121 RRENKEVNLSHAFNRSRLRYPHKKPQLALVFADLLVDSQQLLMVTVAAALQEIGYEIQVY 180

Query: 181 SLQGGPVNDAWRHMGVPVTLIQTCDETEVMVDWLNYDGILMHSLGVKDVFSCFLQEPFKS 240
           SL+GGPV+  WR++GVPV++IQ CD  +V VDWL YDGIL++S   KD+FSCF+QEPFKS
Sbjct: 181 SLEGGPVHGIWRNLGVPVSIIQACDPADVTVDWLIYDGILVNSFEAKDMFSCFVQEPFKS 240

Query: 241 LPLIWTIHEETLGLRSRNYASNGLFDLLNDWERVFNHSTVVVFPNYVMPMIYSAFDSGNF 300
           LPL+WTIH+  L  RSRNY SN   +LLNDW+R FN STVVVFPNYV+PMIYS FDSGNF
Sbjct: 241 LPLVWTIHDRALATRSRNYTSNKQIELLNDWKRAFNRSTVVVFPNYVLPMIYSTFDSGNF 300

Query: 301 FVIPSFPAEALEAEIDITSDADNPRAKMGYANDDLVIAIVGSQFLYRGMWLEHTMMLQAM 360
           FVIP  PAEA + E  + S+ D  RAKMGY ++D+VI IVGS+ LYRG+WLEH+++LQA+
Sbjct: 301 FVIPGSPAEAWKIETLMESEKDYLRAKMGYGHEDIVITIVGSELLYRGLWLEHSIVLQAL 360

Query: 361 LPLLHKFSLDEHSNSHLKIFVLSGNSNSNYTMAVEA 393
            PLL  FS DE+S SHLKI VLSG+  SNY+ AVEA
Sbjct: 361 FPLLEDFSSDENSFSHLKIIVLSGDPTSNYSSAVEA 396

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0K892_CUCSA1.8e-19887.15Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006880 PE=4 SV=1[more]
M5X6C9_PRUPE2.1e-15168.19Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000692mg PE=4 SV=1[more]
W9R020_9ROSA1.0e-14567.17Uncharacterized protein OS=Morus notabilis GN=L484_022487 PE=4 SV=1[more]
A0A067JXC8_JATCU2.1e-14366.16Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14313 PE=4 SV=1[more]
F6I683_VITVI1.7e-13762.72Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g01620 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT4G01210.13.3e-11169.82 glycosyl transferase family 1 protein[more]
AT5G04480.11.2e-7351.13 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659116602|ref|XP_008458158.1|6.7e-19988.16PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103497681 [Cucumis me... [more]
gi|449441374|ref|XP_004138457.1|2.5e-19887.15PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus][more]
gi|596047484|ref|XP_007220285.1|3.0e-15168.19hypothetical protein PRUPE_ppa000692mg [Prunus persica][more]
gi|1000982607|ref|XP_015584512.1|5.0e-14665.48PREDICTED: uncharacterized protein LOC8286706 [Ricinus communis][more]
gi|703086012|ref|XP_010092892.1|1.5e-14567.17hypothetical protein L484_022487 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009058 biosynthetic process
biological_process GO:0019375 galactolipid biosynthetic process
biological_process GO:0001666 response to hypoxia
cellular_component GO:0005575 cellular_component
cellular_component GO:0005768 endosome
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG12g03940.1Cp4.1LG12g03940.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 622..642
scor
NoneNo IPR availablePANTHERPTHR12526GLYCOSYLTRANSFERASEcoord: 16..388
score: 2.7E
NoneNo IPR availablePANTHERPTHR12526:SF412GLYCOSYL TRANSFERASE FAMILY 1 PROTEINcoord: 16..388
score: 2.7E