Cp4.1LG12g03990 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g03990
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Golgin candidate 4
Location: Cp4.1LG12 : 2825200 .. 2848445 (-)
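The location field packs the sequence name, start, end and strand into one string. A minimal sketch of how it could be parsed is shown here; the helper names `parse_location` and `span_length` are hypothetical (not part of the database), and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

# Hypothetical helper: parse "<sequence> : <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text):
    """Return (sequence name, start, end, strand) from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG12 : 2825200 .. 2848445 (-)")
print(seqid, strand, span_length(start, end))
```

The "(-)" strand means the gene is read from the reverse strand, so a sequence cut from the reference at these coordinates would need to be reverse-complemented before comparing it with the gene sequence listed on this page.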
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GATGTATGATTTATTATGCAAGTATCGACATGTCGTCTGGTATAAACCGACAGAATATTGCACTGGACGACGCACAATGCATAAAGTCTTAAACGAAAAATCGTCTCTTTACTCTCAAATTAATTCTCTTCTCTCCATTTCTTATACCAAAAATACATTTCGTCTTCGCCTCACCAACTGACAGACTCAGATCTGCAAGATTCAGTTCCTTCCGTTTCTCCATCTGATTTCTTCTCCAGCTCTTGCAGAAATTTGGTGCAAATTCCTTCTATTTCCATCGTTCTTAGCACCTTACGGTGCTCGTGTACTGGAATTTAGGAGCCATTTTGGGTCTCTGGGGTTCTGGTGTATGAATGATGTGGAGCTCGATAGCTAATTTGAAAGAGAATCTGAATAAGATAGCTCTCGATGTGCATCACGACGATGATGAAGAGGAATTTGCGATCTATGGCTCCAATGGAGGGGATGTTGATGTTTCGGTATCTGATCGGAGGAACTCGCATAGCTTTGCTCATTCGAATCCGGTGACGCGGTCCCCGATTGCCAATGGGATTGAGGATGCTCATCATCCTGAGGTGCATTATTGTTTATGGACTTGGTTTCTTGTTATAGCTGAATTGTGGGGAGGTGATGATGCATTTTGATTTTTAGGGGTTTATGAGCACGTTATTGTGAATTGGTTCGGTTTAGTTTATTTTGTAATTTCAATGAGATTATAACTCAATTAGGTTATATGACAGGGAAGAATAGAGGGTTTGGTTTCAGGAAGGGAGTGTTTGGTTACAAATCGAGAACACCTTTGTTAGCTGATTGCTATACTGATGTTTGATTCCGTTCGAACATTCATTTTAATTAAGGTCTGAGGTTTAAAGGATTAAGGCGTGGTGCACGGCATTAGATGTAATAATTTATAATAGCCACTGGTTCTACAGTTTGTTTTTGTCGAAGATATTTTTACAGTTTGTTTCTTCACTGTTCCTTTGTGTCTAAGGCTTGAAATTTTATTTTTGAGAAGTTGGGGAGTGTGACTTGCTTTCCTTGGAGGATCGAAGATTGACTCTACGAAAATCTTAACGATTAGAATTTAAAGGCCACAGCTAGGATTTTGGGAAGCTGCGTTACTAGAGCTCTATTGTGGCATATTTGGCTGGAAAGAAATCTGTATCTTTTCAATATAAATCTCTTAATTTTGATTTTTTTGTCACCTTGTACAAAATAAGGTTTCTTGGTAGGCCTCTATAACAACCCAAGTCCACCGCTAGCAGATATTGTCCTCTTTAGGCTTTCCCTTCCGCACTTTCACTCAAAGCTTTCAAAACGCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCTCCTCTCCACCCAATGTGGGCTCTCACAATCTACCCCCTTCGGGGCCCAGCGTCCTCACTGACACTCGTTCCCTTCTCCAATCGATATGGGACTCCCATGTTGGCACACCACCTTGTATCCACCCCCTTTGGGGCTCAGTCTCCTTGCTAGCACATCGTCCGGTGTCTGGATTTGATACCATTTGTAACAACCCGAGTCCACTTTTAGCAGATATTGTCCACTTTGGGCTTTCCTTTCAGGCTTCCNATCGATATGGGACTCCCATGTTGGCACACCACCTTGTATCCACCTCCTTTGGGGCTCAGTCTCCTTGCTAGCACATCGTCCGGTGTCTGGATTTGATACCATTTGTAACAACCCGAGTCCACTGTTAGCAGATATTGTCCACTTTGGGCTTTCCTTTCAGGCTTCCCATCAAGGCCGATATGGGATCTCACAACCTCTTTACATTCTAAATTCTTTTGTAACTTACAGCCTACTGATGATNTGGGGGATCTCACAACCTCCTTACATTCTAAATTCTTTTGTAACTTACAGCCTACTGATGATTATCAGCAATTGGAAATTGCTTATTCGGAGTTTTCTAGGGGAGGGGGTTTGCTTTACCCAGCCCTTAGGCTGAAAAGAGGCT
GGAGGCTTTTCCCTCGTGGAAAAGGAAGATATGCTTTGATAAAAGCATCTTTAATAAGGTTTTGTATATTATGATTATTGAAGACCTCGAACAATAATTGCCAGTATGTGAAGCATTTTTGTTATTGCTATTAATTGTGTTTTCATACACTGCTTGGAATTTCCAGATTGTGGTCTGGGAGTGGTGTGCCCGTTCAGAGTTCTAATGCTTATTTTCTTTCATTTTAAGAGCTCTTTCTCCCTCCAGAATAAAAGAGGATGAAATTTTAGAAGCTCTTGAAATTGTAAAGGGCTGCTAGTTTTGTTTGTTGTTTTTCTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTCTAGTTTGGTTGGTTTGAAACTCATAGCATATATCCAGGTTCATATATAGCGCTTCTGTTTGTCATTATTTGACTGGCTTATATCTTTGTGTTTCTTTTCTTTTGAGTTTCCTTTGAACTGGTTAACTTAAATCTTGTACGGTGTTTTTCATTATTGTGCTTGTAGATTGAACAATACAAAACAGAAATTAAGAGGCTTCAGGAATCTGAGAGGGATATTAAGTCATTATCGATGAATTATGCAGCTCTGCTAAAGGAAAAAGAGGTATTCTAGTTGCTCCTTGGAGTTTTTTACCTTTGCACTCACTGTTCTATTTCCCTGTTGTATATAGCCATATCCAGTGATTTCTGAGTAATTTGTTTGTAGTAGTATCTATTAGTTGCTGTCGATGGTTTTTTACTTTAATAGTTTGCATTCTTCTATAATGATGGTTGCTATTGATTAGTGAAAATGAATAAGATTGTTTTGCCCTTCCAGTGTTCACAAAATTAAAATTGCAATTTGGTTATTCGCCGAATCTATATACTTCAATGAATCGGAAAACATACAAGAAAATTCTAATTGCATACCAAACAAACCTTTATTGTTATTTCCTGTGTCATATTCCTTTGGCAGGAGCTAATCTTACGATTGAACAAGGAAAATGGCTCGCTAAAACAGAGTTTGGAAGGCACAAATACATCAACAAATTCACCTAGAGCTGAAAGTTCCAAATCACCATCAAATGGAACTAATGAAATGAAGGTAATTATTGGAACGCGGATGAAAACTTATGAGCTTGATAGATTACTATATGTGGGGCTGGTTTTTATTTTATTTTGTGGTTGTGTGTCAGAATTATCACATTTTTGTCAATNTTTTTATTTTATATTTTGTGGTTGTGTGTCAGAATTATCACATTTTTGTCAATGACATTACAATGTTATTCCAGCCCCGGCTTTTGCTTTAATACCTCATGCTGCATGGAAAAATGTTAAAGATGTTATTCCAGTCCCTGCTTGTAAGTGGTTTGTTTGTAGGCTTATATTTCTGNGCCCCGGCTTTTGCTTTAATACCTCATGCTGCATGGAAAAATGTTAAAGATGTTATTCCTGTCCCTGCTTGTAAGTGGTTTGTTTGTAGGCTTATATTTCTGTAGTTGTAATGTAGAGTTACAGCTTGAAATGCAGGTTTCAAAAGTTTTGGTTTGGGAAATCTGTGGGAAGGATTGGGATTGGGGTAGTAAGTTTAGGTAGTTTTCTTTGGATAAGCTGGCTCGACAAAAATGATAAATTTTTCACTAGGAAGGGAAGAAAACTACACTTATCTTTTTGATATTGTTTTGTATACAACAGCCACTTGGTGCAAGCTTGACAAATCTTTTTTGTAATTGCTTCATTAGTATTTTCATTCGTGCTAATTAGAAGGGTTTCTAGCCTCCATTAGCATTTTGTATATTTCATTACCGAATGAAATCTGTTTCTCTTTAAAAAAGAAAGTCTTGGTAACATTGAGACCTATGGGTATGCCCAACAAACTTGAGGCTTGGCAAGCCCGGGAGCAAGCATGTTTTCATCTTAAGCCAAACCCACCTAAAAAACCTTCTAGGGACTTGGACAGGAGGACACATCGGC
ACTTATTATTTATTTATTTTACGTGTTAGTGGTTGATGTATTCAAAAGCTTGTTGGAAAAAGGTATGAAAGAGGATTGTTTACGAGACCTTATGATTGGAATTTATTACATACACTTAAACCACCTACAATTTGCATATGAGACTCCTTTTCTTGGCTAATGATCATGACAACTAATCAACTTAAAGGAGATGTCAAGCTGTTTTAAGTCCTCTTTGTCTCAAGATCAATCTACCTAACTTGTCTGGTCCTAATGTGAATGGCGGGGAGGTGAAGGCCCTTGCTGCTTTGTGAGGTTGTGATGGATCTTGGCCTCTTTCTTGCCTTGCAATGCTGTTGGGCGGGGAGAATAAGTGATTTGGAGTTTTAGGAGCTCATCAAAGATAAGGTCACTAGAAATTTAGCAAATTGGAAGAACTTATACCTTCAAGAGATGCAAGGTTGTCTCTAGCAAATTGATTTCTTTAAAAAATCTCCCTTCACTGTACATTCATCTTCAAATGTTCAAAACAGACAATCAAGAAGCTGGAGGAAGTGTACAGTACAGAGACTTTATTTTGGCGAGGAGGTAAGGGAGCCACTAACCTCATCAAGAGGATCCTTTAAATTTATCAGGCCATGGATCAATATCGCAAGGCCTAAGTGTTCCTCCAAGGTTGACAAACGTTATAAACCAGTAGTTCGTAGGGAGAAATTCTCTTCTAGGAATATCCTTGGCTCATGAGTGCTCCTCTTTGGTTTGCCTTGCCTAGAATATAGTGGCTAATTCCAGGATTTCAGTTATATCTGGAGTTGTATCTAGTGCTAGTCTGATGCTTAAAGGGATTGAGGCCTAAAATTTAGAAGAAACCTCTTTGATGCATAAGTGGAGGAGGGGGCCTTGTTATCTTCTTTCATAGCAACCTTTTCTCCGTACAATATAGAAGTCTTAAGAGCTTGGTCCCTTGAAAGGTGTGTCATTTTCACCGTTAGATCTTTGATCCTCAATATTGTCAACCAGAATGAATGCCTCAACCCCAAAGAAGGTCTTTTTTTATAACGGAAAAATAAAAAGAAATGATGCCCCAATCCGTAGGAGTTGCATACGGCTTCTCCAATTGGACAAAAGAGAATCAAAGTTATAAGAATGAAAAAGAGATGAGAGTTTACCCCAAGATATTGCTAGGAAGATAATATGTCAAAAAAACCTATCNATAGTTTACACCAAGATATTGCTAGGAAGATAATATGTCAAAAAAACCTATCAAAAGATGGATCTTTGTTAGTGAAGACATGATGATTCCTCCCAATCCAAATCAACCATAAGAAATCTTTATTGATATGCATCCAAGAAGCATTTTCTCATCTACGAATGGATGACCACCAAAATTAAGAGAGAGAAGGCTGGGTTTTGGTTTAGGAATATTTTACATTTCCCCGTGATGTTATTTTTTGACTACTATATATATATTTTTTTTTTCCTTTTTAAAATGATTATCATGATATCATGGCGACCTATCACGTTATTGTAGTTTTCTGGATACTTCTTTGTTTTTTGTTTTATCTATATTTCGATCATTTTGCAGGGGAGCGATCAATCACCTACCCGACTGCTTAGGGGGAAGACCCGGCGTAATGGTATTGTGTCTAAGCAGGATGGAATTGCTAATGGAGCTTCACACTCTGGAAAACTTGATTACCAGAGTAAGATGGTACCAGAACATTCAACTTCACAGGTAAATGATACGGTTATGGAACTCTTTACTCCCTGTTACAGAGAAACATTTCTACCAATTTATGCAGGCAAAATTTAATAAATTAACAATCTTAAATAATTTTTGGATTTTTAATCGTCATATAATAATCTTAAATTAACGGTGATGAGCTGGCCTTTGGTCAAGAAAGGGGGCAAGCTAGATGCTTTCAATGTCATGTAATTAAATTTACAGTCATAAAATATAAATAAATTTTCATAAGCCTAATGTTGTTAGGTTAAATAGCAATCTTCCAAATATAATCAAG
GTAACTGAGGTTGAAATAGAGTAGATGCTGCAATTAACTCAATGTACTCAAATATGCATATTACCGAAAATAGGGCATGATTCTATTTGCTTTGAAAATAACCTTGTAAGAAAAATCTTGGAAAATTGATTAGAATTTGAATTATGATTGAAATTCTTTTGTAATTATACTTCTTCTTTTTTTTTTTATTTTTTTTTATTTATTTTTTTTTTTTTTTAATTAAAGGTAATTAGAGAAGCTTTTCGTGATCCCTTGGCTTTGGTTGTTTTTTATCCTTGTGTTGTATGTTTTTGTCATCTAAAGAAGTAATACGGCTCAGTTCTTAGTAAAAATCATTTTTCTATATTGTTTTATGTACAAGGCTCTTTGCTTCAAGTATAAGCTTAGGTACTAGACTTAAGTAGTTGTGGTTCGAGAAAGGATCGTTGGCTGTCTCAATATTTCTTTGTTTCTCTCTTATTCTTATTAGTCTATAACATCATGCAAGGATCTTTGTTAGGCTTGAACCACTTTATAACGAAGAAGAGAAAGGTCAAGGGTTTGAAATGCTACATTAGCTCAGGACAGTCTAGGGATAAAGGAGCTNGTTTGAAATGCTACATTAGCTCAGGACAGTCTAGGGATAAAGGAGCTATAGATGGGTACATCCGAGATCTATTTGGAGAGTGTGGATCACAACAAGGGTATTGTGTTAAAAAAAAAAAGCATTATATATCCGAACCTAATTGTAGTAATTTAATTATTGGGATAGAAGAGGTTCAGTTGTACGTAGAACCAAAAATCCACAGCAAAGAGGCTGGTTTCCACTTTCATAGATGGAGAATTTTTTTCTTTTGCAAAATGGATGGCTTCTATAGTTAGTGTAGAGAAGGAGAGGAAAAAAATGGGTNTTTTTTGAAAAAGGGATGCTTCTATAGTTAGTGTAGAGAAGGAGAGGAAAAAAATGGGTTGGAATACTGTGCTTCGGTACCTTCCAAAATCCAGAAGATGATCAAGGGGCTATTTAGTACTACTGTTAAGCTGGGAGCACTTGGACGTAAATGGAGAAAGTTCCATGGGTCTAAATTGTCAAATTGGTGGAGGTGAAGTAGGCAGAAATGTTACGAAGAAACTGTCAAAAGGGAGGTACAAAGCCAATTTGGAATCCGGCAACAAAGAAATGGGGTTGCACTAGAGAAGGGCTTGGTGTACGAGTGTTGGAGGTATGANATGAAGAAACTGTCAAAAGGGAGGAACAAAGCCAATTTGGAAGCAGGCAACAAAGAAATGGGGTGCACTAGAGAAGGGCTTGGTGTACGAGTGTTGGAGGTATGAAGAGTTGGGGGGAGAGATTCGTAGCTCGTGCAAATAATTAGGGTTAGGAAAATTTTGAGGTTTGGTGCTGGGCATGATAAGTTTTGCTCGAGCCTTTTTTCTGGAGAAAAAACAAGAAAAAGGAAAATCAAGCAGTTGTCCATATTTATGGGCCTGGGGGGCACGATCTAACTGAAATATCTAATCTAATCTGCTGGTGTGGGACTGTTCAGGTACTTTTTAGCAACACCTCAATTAATAATTACGAGAGGGGCTCACAGCCAAGTAGAGGGAATTGGGTAGAGAAAAAAAGAACGGAGAACTATGATTTTCTATTGATTTATTCCATTTGAACTTTTGAACCTGAACCTGCAAAACCAATGGGCATCAATCAAAACTTCATCAATTGACCAAAACTTGGAGACAAGAAATGTTTCAAGGGAAAAGCAGTCTTAGTAGTAATTTTAGAATGAATGTTAGAAATATGATAGACAGAAGCCTCCAATACCTTTTACATATGCTGGGAGAGGCGTATAAAGAAGACAGAAGGGTGTGTGATGGAGAGCGAGTTAGTTAGTGATATGAAATAAGGTGTCTTAATCATGCCACAAAGATTTCATAGTGGATGGCAAGATTGTATCATACACGTTTGGATTTAGGGAGACTTCATTATATTTTTCAATTTTAGAGTATTATTTTTAAGATTT
TAAGTTACAATTAATTAGTGGTCATTATGGTATTGATGGGCATGAGAGGCATAACCTTTCGGTTGACCATAGGGTACTCTAAGGTTTGTACTCAAGGCTATAAATAGTCATTCATGTTGTTGCTTTAGCCGGAGGTTTTGAATATTAAAATGAAAGTTGCATATTCTTAAGCCATGAGTTTGTTCTTTTGCATTTTTTGTGGTTAGATTTGCATGCTAGAGTCATTGAAGTAGTCTTGATCAAGCTTTCTTATGGAGTGATTTGAATCATGAGTTTAGAAACGAGATTTTTTACTCTTGATCTTTGTCATCAAGATAATCCGTTATAAAACATTTCTTGGGTTGATTTCTTATCGTTTTTGGGGTTCTTGGATAATTTAGACCTAAAGATCTTGTTCTTCAATAAGGTTGGTACATTAAATCCCAGAGTTTCGTAACAGTTAGGGTCAGGGAGCAGAGGAATATTTTGTTATTGTATGGGGGCCCCCAGCATAGTATATGTTAGGAGTGTGGAGATTATCTCAGGTATTTTGTATTATGGGAGACTATAGACCTCTCAGTTATCTTTGTTTTCTATGTTTTCCTCTCTTATCTTATAGTATACCCTCAAAAGACTTGCTCTCGATTGTTATATCAGAATACCATTAGGATATTAGAAGCATACTAGCGGAGGTTATTTGTTGGGGGGTTCGTTAGTTGCAAATAAAGTGAGTGGGTTTGAGGAATTTGTGGTAGGCGTATATTGGGATTATTATAGGAAGAAATTTGGTCTTCTTGAACGTTTAACACCTTTTTGATTTTTCTCTTCCAGTAAAATCAATAGCATGTTTCTATTGATACCCTAAAATTTCTGGTTTAATAGTTTTTTTTGCTAGTGTCTGTTTACCATCACAATAGCATATTCCTAATGTTTTTTAATTTGTAAAAAACCAATCACTTATTTGAATTTACTTAGTTTTATTTGTTAATGATAAATGAACTTGACAAACAGATATATGAATTCCTTCTAGGAGCTTACAGATTTTCAAGAAGGGAATATTGGATCACTGCAAGATGTGCAAACTACTCTTGAGATGAAACAGTTAAGGAAGGAACTTCAACAAGAAAGGGATCAGTTGGCAGATGTGCAATTAAGATTACGAGGTCACATTTTTCTCCTTAAAGATNAATATACTCCTATTTATTTCAATTTCTTCTTCCTTCATTGACTCCGATAATGGATATTGATCTCAGTCTATCTCTCTCCCTTACAGAGGAGCAAAAATTGAACAAAAAGTTCCAGGAGGAGTTGAACTCTCTACATATGAACAAGGACAAAGTAAGTTTGAAATTTAAATATTAATTATGAGTTTACTTGGAGCTGAAGTTTCAAACTGAATATACCTTATGCGCAAGTTATTATTTTTTTGATTTCATGTTTTCTGTTAATTGTCTTTTAAGTTTGTTTACTTTGTCTTTCCTCTGTAAATTGTTATTTTCTCTTTTGTAAGGATCCGAAAGTGCCATGCATTCCTGAATATCTAAGAGAATTATTGACCACGTTACTCACTCTTTGGTTGGCAAATATCAATCTATTCCAACTTTTGATTGTGTCAATTAAAACTTCAAACTAACAGTTGTATTAGTTTAGATCCTAAACTTCCATTAGTTAATCAATTTTGACCATTTGTCAGTGATTTGTAGCTCTTTTGAATTAATTCCATTTTAAATCAACTTTAGTCATCGAGATAAAATTATTCTTTTGCAAAAGTCTCATCGAAGTAGGAGAAAAAAAAAAAACCAAGTAATTCTTTCTAAATAAGTACAGAAACATAGAAGAAGCTGAGCAAATAAAAAGTTGATCCTCACATTTCCCTCTGTTATATATATCAATTCTGCACTTAAGGAAGTCTTATACTTGTATGGAAGTAATGCATTAACGGTTTTAAAATCTAATTTTATAGAGGGTTTAAGTTGATTCACTCACGAAAGTTTAGGGTCTAAATTGTATAATTGTTAGCT
TGAGGTTTTAATTCATACAACCCACAAAGTTAAGGGTACAATCTCATTGATTTATGAAATTTACAAAAGGATGGAAAATTCATGATTAAATTAAAAGGCTATCCAGATTTATTAAAAGGGAAGCATAGCTATAAGAAGAGAAGAGATTAGACAATTTACACTGAGATACAATATATTTATTAACAAGATCAAAGACCTTATTGAAAGATTATTCTGTCATTGAAGATCCTATTCCTTTCTTTCCAGAGTAGCCAAAAAAAGTCCATAGAGATGCTAATCCAAATCAAAGTTTTTTCTGTTTATGAAAGTGAGCCCCACGAGAAGATGATAGAAGTCTTCAACATGAAAAGGAAAAACCTCTGACCAGCCAAACTTTACCAACATTTTGTCTCAAAAAACCCGAGTGAAGGGGCAATGAAGCAAGATGTGGCTTTGAGATTCATTTGCAGCCTTACACAAGGAGCACCAACTGGGGAAGATGGCCATATGGGGCAAGCATTTCTGTAAGAGGTCCAAAGTATTAACGTCTGATGTGGGACTTCCCACAAAAGGAATTTTATCTTCTCTAGATAGCGTCCCTTCCAAATAGTTTCAGCCAACAGAGAATCATATAAGGTTGATGAAGAGATCATATCCTGCAAAAGAGATTTCGTAGTGGGAAATCCATTAGTGGTTTGAGACCACATCCACCAGTCAACCCTTCTAAATAGGGTTGCTGAAGCAAATAAATCAATTCGGCCCATTCAATAGCTTCAGCATCCTTGAGGTTCCTACCCAAATGAAATTCCAGAAGCTATCCTTGGGGTGACATGACTCCTTGACGGTGGCAGTTTTCTGTTGAGACGTCTATATTGGAGAGGAGAACAACTAGCCAAGGCCCCTCTTGAAAGCCACTGATAATTCCAAAATGAATATGAGCCCCATCACTCACTAGAATTCAAATACACCTAGTAATGAGGGCGCTGCTGCCTTTTAATGAGGTGCCGTTGCCTTTTAATGACTTGGCATGGCCCTTCTGCTTAGAGTTGAGATGAAAGAGAGGGATGGTCATATTTATAAAAATAAGCCTTCTCCAGAGAGTCTCCCTCTCATGATTATACCGCCATCCACTTAGAACGCAATGCTCTATTCTTGTCTTTTGGGGAATAACCCTTCCCACTTTGTTCAATGGGGAAGAGAAATTTTCTCCATTAGACTAGGTGGATGCCCGACCCATCTGACCCTACTCTCCAAATAAAATTCCTGCAATGCCTTTCCACTTCTTGAGCCACCTTACTAGGAAAATTGATAAGGGGAAGGTAGTGTGTTGGAAGATTGGAGAGGGTGGCTTTCTTCTTCTTGAGCCACCTTATAACGAAAAGGTAGTATGTTGGAAGATTGGAGAGGGTGGCTTGAAGGAGCATAAGCCTCCCTCCTTTGGATATAAATCAGGAGACCTAGGAAGATCATCTTCTCCCGATCTTTTATATCATTGGGGCTCAAAAAGCCAAAAAATTGGGGGAGATGTTGAGGGGGAGTCCCAGGTACGGACTTAGCCAAGTCCCAACTGTACATTCAAACCGACAAGCAAGAGCCTCAATTAGAGAGAACATTGATGCCCAAAATTTAAGAATTGTGGAGATTAATGTTTAGACCCAATACCCTTTCAAAATAATCAACAACTTCGAACAGGTTGTTCAAGGAGTTGTCAATCAAAGGGGAGAACGAAATGGTGTGATAGGTCTTGTCCAACAATTTTATCAGATGGCAAGGTACCAATAAATATTAGAGAGTTTCCTATCTCTCCTAATTCTCAAAGAGAATGCTTAAAAAGGATGNTTCATCATCAGATGGCAAGGTACCAATAAATATTAGAGAGTTTCCTATCTCTCCTAATTCTCAAAGAGAATGCTTAAAAAGGATGCAAAAAGATACTCTACTTAGCAAGACAACCAGCTAAATATCCAACTTCTCTAGCCCATTCTAACTAACAAAATACCGTGAGCCACACTTCACGCGTA
CATGTTTTCCATCTCTCACTACTTTCCTTCCCTTCCTACGTGTATACACTTCTACAATCAGGGGCCTATCATGGTGCCATTGGAAAAATAGATATGGTTGATGCTCACCTTCCCATTTCTCACGTGCTTCAATTTGACCTAAAGATTCAGCCTGTGTGAGTAGTGACTTAGGCTATCCATTACAAGAATGAAGAAAGCAAGGGTACAAAGGGTCACCCTGGGAGACCTCTAGTACAGATCCTCTCCCTAAATGGACCATTAGTGATGATAGCGAATTTACCAATGAGATGCACTTCAATCCACTATCCCGATTTTTGACCAAAACCCTTTGGAGCGAGGATGTTGTTGAGGAACCCCCGATCATCCTTGTCGAATGTCTTTTTCACATCAAGTTTAATGTTCGACCCCTTCCTCTTTTTTCCTTTTCCTTTCATCAATGAATTTGTTGGCCTTGTGTGAGGCATTGAGGATTTGTCTTCCTTAGGCTGTTTGATAGTCAATGATACTTTGGGGGAAAATGGTTTTGAGTAGCTTGGAGAGAACACAAGCCACAATCTTGTAAAGATAAGCGGTGAGGCTAATAGGCTGAAATTCGGCTATCATGCAAGTATCCACTTTCTTGGAATAAGAGAAATGTAAGTCTCGTTAAGTCTGGCTTTGATAATTCCATTCCAAAGAAATCTTGGAACACTCTCATAATGTTACTCCCTTGTTCTTCCTACGGCTAACTCATAGAACATCCTTGTTACGCTCTGCTGTCTATTTGCGAGAGCAAAAAGAACTATTGTGTTGTGAATCACAGCTTCACAATGCATTTTTGTTCTTTTGTAGGCACTAATGAAAGTACTAGGAATCTAGGGCTATCTTCTTAGATTTTGTTATGATGTAATGTGCCTTTAGAGCATAGGTGCACCTAGCATTCACTTGGCTGCGTGAATGCCTAGTATCACCAATACATGGTTTGCACAACACACCTGAAGTCAACTGTTTAAACCAGGATGTGTCTTCCGGCTATTCTAGGGTTAGAGGATGAATTGGAGAGCTGTGTCATTGTTGCAGAGTTGATTATGTTACTTTCTTGTCACTCTTTACTGGAGGTCAGTTCATTCCTTTGAATATGTCTACATATGCATGTTTGTTAGTAACTGTACCCAATATGCCCTTATTTTTGTTTGCTTATTAACATTTTACCTTTTCTTAAGCAGGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGGTCGCTAAATTATTCTTAGTTTTTGGTTGCAATAAGTGAGAAATGTGTTTCATATTTATAAAAATTTCTGTGTTCTCATTGTCTAATATTGGCTGCTTCTCATGTACAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGGTAATTCTGATCACAATCTGGAATACCGATTTTGTTTACCAGCTGTTCAGATTGTTTGATTTCTCATGAAATCACTGCAGATGTGGTTTGCCTTTATGCCCTATATGTGTTAAAAAATTCTTGCGTATACTGGTTAAATGCATTGATAATGGCGTTAATGAATTGTAGGTTGACTTCCCTTTTCTTTTTTTTTCTTTACTTCTTTTACATAGAAATTAGGTCTATCTGGAATTTCCCCTGGAAAAGAAGATATGGATCTATCATTGCAAAAATTAAAGAAAGATCTGAAGGAAATGCAGCAGGAGAGAGACAAAGCTGTGCATGAACTATCACGTCTCAAGCAGCATTTATTGGAAAAGGTTTCCTGTCTTAACCTCGTGGGCTTGATACTTCACACTAGAGTT
TGGCCATTTATTAACCTCTGAAGTTGCATGATGTAATTTATCATATGAAGTGCATATTTATCAAGCTTATGACCTATATTCTTTACTAGTTAATACCATACTGTCAATGAAGGCGGGTGAAATTTTTTTAGTTTACGTAAATTTTCCTAATTAAATCTATGATATGGTTATGAATTTGGACTTCTAGATGCAGAAATACACTATTAATCCTTTTCTGTTCGTTCAAGAAGTGGGATGATATAGGATCTAATTGAAGTCAGTGCATAGAAGCACAAACTATCTCCTCTATTTTATTTTATTTTNTGCTTCAATTTGACCTAAAGATTCAGCCTGTGTGAGTAGTGACTTAGGCTATCCATTACAAGAATGAAGAAAGCAAGGGTACAAAGGGTCACCCTGGGAGACCTCTAGTACAGATCCTCTCCCTAAATGGACCATTATTGATGATAGCGAATTTACCAATGAGACGCACTTCAATCCACTATCCCGATTTTTGACCAAAACCCTTTGNAGTGATGATAGCGAATTTACCAATGAGATGCACTTCAATCCACTATCCCGATTTTTGACCAAAACCCTTTGGAGCGAGGATGTTGTTGAGGAACCCCCGATCATCCTTGTCGAATGTCTTTTTCACATCAAGTTTAATGTTCGACCCCTTCCTCTTTTTTCCTTTTCCTTTCATCAATGAATTTGTTGGCCTTGTGTGAGGCATTGAGGATTTGTCTTCCTTAGGCTGTTTGATAGTCAATGATACTTTGGGGGAAAATGGTTTTGAGTAGCTTGGAGAGAACACAAGCCACAATCTTGTAAAGATAAGCGGTGAGGCTAATAGGCTGAAATTCGGCTATCATGCAAGTATCCACTTTCTTGGAATAAGAGAAATGTAAGTCTCGTTAAGTCTGGCTTTGATAATTCCATTCCAAAGAAATCTTGGAACACTCTCATAATGTTACTCCCTTGTTCTTCCTACGGCTAACTCATAGAACATCCTTGTTACGCTCTGCTGTCTATTTGCGAGAGCAAAAAGAACTATTGTGTTGTGAATCACAGCTTCACAATGCATTTTTGTTCTTTTGTAGGCACTAATGAAAGTACTAGGAATCTAGGGCTATCTTCTTAGATTTTGTTATGATGTAATGTGCCTTTAGAGCATAGGTGCACCTAGCATTCACTTGGCTGCGTGAATGCCTAGTATCACCAATACATGGTTTGCACAACACACCTGAAGTCAACTGTTTAAACCAGGATGTGTCTTCCGGCTATTCTAGGGTTAGAGGATGAATTGGAGAGCTGTGTCATTGTTGCAGAGTTGATTATGTTACTTTCTTGTCACTCTTTACTGGAGGTCAGTTCATTCCTTTGAATATGTCTACATATGCATGTTTGTTAGTAACTGTACCCANTTCCGGCTATTCTAGGGTTAGAGGATGAATTGGAGAGCTGTGTCATTGTTGCAGAGTTGATTATGTTACTTTCTTGTCACTCTTTACTGGATGTCAGTTCATTCCTTTGAATATGTCTACATATGCATGTTTGTTAGTAACTGTACCCAATATGCCCTTATTTTTGTTTGCTTATTAACATTTTACCTTTTCTTAAGCAGGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGGTCGCTAAATTATTTTTAGTTTTTGGTTGCAATAAGTGAGAAATGTGTTTTATATTTATAAAAATTTCTGTGTTCTCATTGTCTAATATTGGCTGCTTCTCATGTACAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGGTA
ATTCTGATCACAATCTGGAATACCGATTTTGTTTACCAGCTGTTCAGATTGTTTGATTTCTCATGAAATCACTGCAGATGTGGTTTGCCTTTATGCCCTATATGTGTTAAAAAATTCTTGCGTATACTGGTTAAATGCATTGATAATGGCGTTAATGAATTGTAGGTTGACTTCCCTTTTCTTTTTTTTTCTTTACTTCTTTTACATAGAAATTAGGTCTATCTGGAATTTCCNTCTTAGTTTTTGGTTGCAATAAGTGAGAAATGTGTTTCATATTTATAAAAATTTCTGTGTTCTCATTGTCTAATATTGGCTGCTTCTCATGTACAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGGTAATTCTGATCACAATCTGGAATACCGATTTTGTTTACCAGCTGTTCAGATTGTTTGATTTCTCATGAAATCACTGCAGATGTGGTTTGCCTTTATGCCCTATATGTGTTAAAAAATTCTTGCGTATACTGGTTAAATGCATTGATAATGGCGTTAATGAATTGTAGGTTGACTTCCCTTTTCTTTTTTTTTCTTTACTTCTTTTACATAGAAATTAGGTCTATCTGGAATTTCCCCTGGAAAAGAAGATATGGATCTATCATTGCAAAAATTAAAGAAAGATCTGAAGGAAATGCAGCAGGAGAGAGACAAAGCTGTGCATGAACTATCACGTCTCAAGCAGCATTTATTGGAAAAGGTTTCCTGTCTTAACCTCGTGGGCTTGATACTTCACACTAGAGTTTGGCCATTTATTAACCTCTGAAGTTGCATGATGTAATTTATCATATGAAGTGCATATTTATCAAGCTTATGACCTATATTCTTTACTAGTTAATACCATACTGTCAATGAAGGCGGGTGAAATTTTTTTAGTTTACGTAAATTTTCCTAATTAAATCTATGATATGGTTATGAATTTGGACTTCTAGATGCAGAAATACACTATTAATCCTTTTCTGTTCGTTCAAGAAGTGGGATGATATAGGATCTGATTGAAGTCAGTGCATAGAAGCACAAACTATCTCCTCTATTTTATTTTATTTTTTTTAGTGGAAATCTGAAATCTTGTCCTGATTCAAAACTTCTTGGAAGAACTAGTTGTTTGCGTTCAAATGTGACTGCTTATAGTCCCTTAATGGAGATCTTGATTGTATTTAGTGATCATTATAATTCAGACTGATGTTTGTCAACTGTTTAAATTAAATATGATTGTTGGTGCTTCAATGTTTCTGCTTATATTCTTTTCGTTGCAAGGGTTCTTGCAAACCTTTGGATCTTCTGTGTGCTAGTGTTTTTCTTTAGAACCTACTGTAGCTGAGTAGTCTTTCATTTCTGGAATTCTGTCAAGGAATCTGAGGAATCAGAAAAGATGGATGAAGATAGCAGAATAATTGAAGAACTTCGGCATGATAATGAATATCAAAGGGGCCAGATATTGCATTTAGAGAAGGCATTGAATCAGGCAATTGCAACTCAGAAGGAGCTTGAGATGTACGGTAAAAATGAACTCCAGAAATCCAAGGAAATTATTGAAGAGCTTAACAGAAAACTTGCAAACTATACGAGTATTATAGATTCCAAGAACGTTGAACTGTTGAATCTTCAAACTGCACTCGGCCAGTACTACGCAGAAATTGAAGCCAAGGTAAAGAAAAATGAAATTTTATTTAAGTTTGTAATAATATATAGAATATGCAATGAGTATCTAAGCATGCCATTTTTTGATTGTGTTATTATCTTATACATACTTTGAACTATAGGAACACTTGGAGAGTGACTTGGCTCGGGAAAGAGAAGGAGAAGCTAAATTGTCTCAAATGCTAAAAGTACGCTGTTCTGACTGTCAATCTATACATACTTCTATGAATAACATGCCTTAGCTCTTGGTTTCTTA
AGTGGAGTGTGTCTTCCTTTTTTTTCTGTTTATAGGAACATTATAGGGGTCGAACAAGTTGGGTTGGGATGCAATCTTGGTCCCTGTAGAGATCTTTAATCTCTATTCTAAACCCATGTCCTGGTTTTGGAGAATCTCCAAATAGGGATTGGATTCCCGCCAAAAAAAAAAAAAACAAAAAATGGTCTTTATTTTTAGTTCTAATATGAACTAAAGATAATGTCTCAGATTTAGTTAGTTCTCACTTATCAACTTAACATTTTCATACTCTTTGTGAGTATCTAGACTTTCAGAATTATCATCTATCACATCTGATTTTTATGGGCTTCTTTGAATTAAGGATTCACATTGCTTTCGTAGTTTGCTGATGAGATTTGACCACTTAAATTGTACAATTAATGCTTATCACTTCTTCTCCTTTGGGTTTTCTCGTTATTGCTTAATACTTTGTTTCATTACCCAGGATGCCAACCAAAGAGAAGATGCATTAAAGAAGGAGAAGGAAGAAATTTTGTCAAAGCTTTCACTTTCTGAACGAGCATTGGGAGAATGGAAAAGCAGAGTAAATAAACTTGAGGAAGATAATTCTAAGCTACGCCGTGCTCTTGATCAGAGTATGACAAGGCTGAATAGGATGTCGGTGGATTCAGATTTCCTTGTTGATAGGTAAATCTTCATAAATATCTTTGATTGGGAATTTACACCCCTGCTCGTTTCTCCCAGCACTTACATTGCATGCTAATAGTCTTCTACTACATGCCTTTTGTTTTTTTGTTTTGTTTTAGAAAAAAGAGGAACATAATTTTTCATTGACAAGTGATATGTGAAAAGTTATTCCAGCCTTAACCATGAAGAACAATGACTATGTTCCTACTGAAAATTTTGTGCTTCAAACTTTAGCCTTGTTGAAAATTTNCCCCCAAAAAAAAAAAAAAACAAAAAATGGTCTTTATTTTTAGTTCTAATATGAACTAAAGATAATGTCTCAGATTTAGTTAGTTCTCACTTATCAACTTAACATTTTCATACTCTTTGTGAGTATCTAGACTTTCAGAATTATCATCTATCACATCTGATTTTTATGGGCTTCTTTGAATTAAGGATTCACATTGCTTTCGTAGTTTGCTGATGAGATTTGACCACTTAAATTGTACAATNGACTTCTTTGAATTAAGGATTCACATTGCTCTCGTAGTTTGCTGATGAGATTTGACCACTTAAATTGTACAATTAATGCTTATCACTTCTTCTCCTTTGGGTTTTCTCGTTATTGCTTAATACTTTGTTTCATTACCCAGGATGCCAACCAAAGAGAAGATGCATTAAAGAAGGAGAAGGAAGAAATTTTGTCAAAGCTTTCACTTTCTGAACGAGCATTGGGAGAATGGAAAAGCAGAGTAAATAAACTTGAGGAAGATAATTCTAAGCTACGCCGTGCTCTTGATCAGAGTATGACAAGGCTGAATAGGATGTCGGTGGATTCAGATTTCCTTGTTGATAGGTAAATCTTCATAAATATCTTTGATTGGGAATTTACACCCCTGCTCGTTTCTCCCAGCACTTACATTGCATGCTAATAGTCTTCTACTACATGCCTTTTGTTTTTTTGTTTTGTTTTAGAAAAAAGAGGAACATAATTTTTCATTGACAAGTGATATGTGAAAAGTTATTCCAGCCTTAACCATGAAGAACAATGACTATGTTCCTACTGAAAATTTTGTGCTTCAAACTTTAGCCTTGTTGAAAATTTTGTTGTTTCTTTTTAACTAAATGCTCCAAAGAATTGACATGATAGCTATAGCATTGATCCGCGAAATCTTTGCTTTCCACTTAAAATGTGCACACAATAATTGTTACAAATTACCGTCTGCGGAGCAATGTGGGTCCCCGAAGCTGAAAATTTGTATCAAAGAGACCCATATCGTCTTGATAAAGGAACACCAGGAGTCGATGATACATTCCTTTGAAAGGAGAACACCTTGATAATTTTAAGGCATATT
GTCTAGGTACTAATACATCAGCTGCATTAAGACCATGAAGACGAATGTGTCCACAGGGTACTTGCCTTTTCAAATTGCCAAGAAGAGATGTTGTTACGTTTCACAAAGACAAGCTACCAACCATTGTTAGGTTGACTAGCCACTATTGTCCCTTCAAATTGAAACTTTGGCTATCTATATTTTCTTTACTCATTTGAAGAATTACGTACTTCATGAACACCTTCTCCTCACATCTGATTCTGGCCTTCAAGCAATACTTTTCAATTTTGTTTTTCCTGTGCNATTAATGCTTATCACTTCTTCTCCTTTGGGTTTTCTCGTTATTGCTTAATACTTTGTTTCATTACCCAGGATGCCAACCAAAGAGAAGATGCATTAAAGAAGGAGAAGGAAGAAATTTTGTCAAAGCTTTCACTTTCTGAACGAGCATTGGGAGAATGGAAAAGCAGAGTAAATAAACTTGAGGAAGATAATTCTAAGCTACGCCGTGCTCTTGATCAGAGTATGACAAGGCTGAATAGGATGTCGGTGGATTCAGATTTCCTTGTTGATAGGTAAATCTTCATAAATATCTTTGATTGGGAATTTACACCCCTGCTCGTTTCTCCCAGCACTTACATTGCATGCTAATAGTCTTCTACTACATGCCTTTTGTTTTTTTGTTTTGTTTTAGAAAAAAGAGGAACATAATTTTTCATTGACAAGTGATATGTGAAAAGTTATTCCAGCCTTAACCATGAAGAACAATGACTATGTTCCTACTGAAAATTTTGTGCTTCAAACTTTAGCCTTGTTGAAAATTTTGGAGTTGTAATACTGATAATTGTTGGGGGGTTGTGGCAGGCGTATCGTGATCAAATTACTGGTGACGTACTTCCAGAAAAACCACAGCAAAGAGGTACTTGTTTCTTTTAATCAACGTATTCATTATATAAGATTCGTGTTTTGTCCGACGCATTTACTACGCCTGCAGACATGTGAACATTATATGAATATGTCCTTTTAGAGTAGGCAACTTTACTTGGTTCTTAGTTGGAATGAAATTAGAATGGATCTTATTTTGACTATAATCAAGATCGTAATTGAAAGTTATTGCACTATTTGGTACAATAATGAAAAGAATGTTGGAGGCTTTAAAGTAATTAATTCTTTTGTGTAGAAGACATCTCTATAATCACTTTCAGGAAGGGTTTTCCATGAAAGGAATTTGCACTTTTTGGGGTATTTATCCTTCCATACACCATAAGGAAGTGTTCAATGAGGCTAGAAAATGGCCCACTTAGCCTTTCTCTTTTTCCCTTTCTAAAAATTTTCATCTAATCAGTTCTTGTGCATTGCTTATGCAGGTTTTGGATCTTATGGTCCGTATGCTTGGATTTTCTGAAGATGACAAGATGAGGATAGGAGCTGCTAAACAAGGTCCAAGCAAGGGTGTTGTACGTGGAGTTTTGGGCCTTCCTGGACGCCTGGTCGGTGGGATTTTGGGAGGAAGCTCAGCGGAGACGCCGGCTAATATGGCCTCTGATAACCAGGTACACAACTATTTTGGTCAAATATTTGCAATATCTATTTTGGTCAAATATAGGGATTTTTAGTGGGTTTTTGTTTGTTTTTAGTCTAGAAGGATGATTTTCTTCTGGTAAATCCTATTTTTGTCCTAGTTATGGGTTTCTGAAAGTTTCAAAATACAATTTAGTCTCCTGATCTGATTCAAAACCCACTGTAGTCGGCGTATATGGTTTCGAACATGCTCACATCATTTCGTTAATATACATATTCCCCGTCGTGCTATTATCTATCCAACAAAATATGGTAGAACAATGCAAGGGACTGAACTGATTACTTATTTTATTTTATTTCTTGAGAGTTCAAAGACTAATCATTTTCATACAAGAATTTGAACCTAACTTTTATCTATATGTACTGGGATTTCTACCAAATGTTGTACCATTTTATCTTTTTGTGAGTAAAAGTAGAAGTCTCTGTTCTAAGCATGGATT
TCCGTTTACAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTATAGTTCAAGACCGCTCCCAAAATACTAAGATCTTCTTCATCGGATTTTGTAATTAATAGAAAACTACAGATTATTGTTTGTTAATGAACTACTACTATAAAAGCCATGTAATGTTCAATTAATTCATTTACTTGGAGTGGTTGGTTAGTGTTGTTCAGCTCACCTGGTGTAATTGTTTGAAAGAGCTATAAATTTCTCAGCTGATTTCATCTTCTGGGTAAAGATTTGGAATAAATACGACGTTTCGATTCGATTTGTTTCGATATTCTTGCTGCATCATCTACAAAATCAGGACTCTAGACCACTGCTCATTTCATTTTTCAAAATTGATGAACCCTTTTTTGAGTCAAAACTTAGGCCCCATTCAACTACTTAGAAACCATATTTCATGCATATAATTAAACCATTTCGTTCCCACTTATCTAACCTAAGAATCAACATAAATACAATATGAACAGCTTGACTTCTTTTTCTACTCTCTCTTTATTAGTGCATTTACAACCACAATATCCCAACCAAAAATTCATNATGTTGTACCATTTTATCTTTTTGTGAGTAAAAGTAGAAGTCTCTGTTCTAAGCATGGATTTTCGTTTACAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTAT

mRNA sequence

GATGTATGATTTATTATGCAAGTATCGACATGTCGTCTGGTATAAACCGACAGAATATTGCACTGGACGACGCACAATGCATAAACTCGATAGCTAATTTGAAAGAGAATCTGAATAAGATAGCTCTCGATGTGCATCACGACGATGATGAAGAGGAATTTGCGATCTATGGCTCCAATGGAGGGGATGTTGATGTTTCGGTATCTGATCGGAGGAACTCGCATAGCTTTGCTCATTCGAATCCGGTGACGCGGTCCCCGATTGCCAATGGGATTGAGGATGCTCATCATCCTGAGATTGAACAATACAAAACAGAAATTAAGAGGCTTCAGGAATCTGAGAGGGATATTAAGTCATTATCGATGAATTATGCAGCTCTGCTAAAGGAAAAAGAGAGTTTGGAAGGCACAAATACATCAACAAATTCACCTAGAGCTGAAAGTTCCAAATCACCATCAAATGGAACTAATGAAATGAAGGGGAGCGATCAATCACCTACCCGACTGCTTAGGGGGAAGACCCGGCGTAATGGTATTGTGTCTAAGCAGGATGGAATTGCTAATGGAGCTTCACACTCTGGAAAACTTGATTACCAGAGTAAGATGGTACCAGAACATTCAACTTCACAGGAGCTTACAGATTTTCAAGAAGGGAATATTGGATCACTGCAAGATGTGCAAACTACTCTTGAGATGAAACAGTTAAGGAAGGAACTTCAACAAGAAAGGGATCAGTTGGCAGATGTGCAATTAAGATTACGAGAGGAGCAAAAATTGAACAAAAAGTTCCAGGAGGAGTTGAACTCTCTACATATGAACAAGGACAAAGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGAAATTAGGTCTATCTGGAATTTCCCCTGGAAAAGAAGATATGGATCTATCATTGCAAAAATTAAAGAAAGATCTGAAGGAAATGCAGCAGGAGAGAGACAAAGCTGTGCATGAACTATCACGTCTCAAGCAGCATTTATTGGAAAAGGAATCTGAGGAATCAGAAAAGATGGATGAAGATAGCAGAATAATTGAAGAACTTCGGCATGATAATGAATATCAAAGGGGCCAGATATTGCATTTAGAGAAGGCATTGAATCAGGCAATTGCAACTCAGAAGGAGCTTGAGATGTACGGTAAAAATGAACTCCAGAAATCCAAGGAAATTATTGAAGAGCTTAACAGAAAACTTGCAAACTATACGAGTATTATAGATTCCAAGAACGTTGAACTGTTGAATCTTCAAACTGCACTCGGCCAGTACTACGCAGAAATTGAAGCCAAGGAACACTTGGAGAGTGACTTGGCTCGGGAAAGAGAAGGAGAAGCTAAATTGTCTCAAATGCTAAAAGATGCCAACCAAAGAGAAGATGCATTAAAGAAGGAGAAGGAAGAAATTTTGTCAAAGCTTTCACTTTCTGAACGAGCATTGGGAGAATGGAAAAGCAGA
GTAAATAAACTTGAGGAAGATAATTCTAAGCTACGCCGTGCTCTTGATCAGAGTATGACAAGGCTGAATAGGATGTCGGTGGATTCAGATTTCCTTGTTGATAGGCGTATCGTGATCAAATTACTGGTGACGTACTTCCAGAAAAACCACAGCAAAGAGGTTTTGGATCTTATGGTCCGTATGCTTGGATTTTCTGAAGATGACAAGATGAGGATAGGAGCTGCTAAACAAGGTCCAAGCAAGGGTGTTGTACGTGGAGTTTTGGGCCTTCCTGGACGCCTGGTCGGTGGGATTTTGGGAGGAAGCTCAGCGGAGACGCCGGCTAATATGGCCTCTGATAACCAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTATAAAGTCTCTGTTCTAAGCATGGATTTTCGTTTACAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTAT

Coding sequence (CDS)

ATGATTTATTATGCAAGTATCGACATGTCGTCTGGTATAAACCGACAGAATATTGCACTGGACGACGCACAATGCATAAACTCGATAGCTAATTTGAAAGAGAATCTGAATAAGATAGCTCTCGATGTGCATCACGACGATGATGAAGAGGAATTTGCGATCTATGGCTCCAATGGAGGGGATGTTGATGTTTCGGTATCTGATCGGAGGAACTCGCATAGCTTTGCTCATTCGAATCCGGTGACGCGGTCCCCGATTGCCAATGGGATTGAGGATGCTCATCATCCTGAGATTGAACAATACAAAACAGAAATTAAGAGGCTTCAGGAATCTGAGAGGGATATTAAGTCATTATCGATGAATTATGCAGCTCTGCTAAAGGAAAAAGAGAGTTTGGAAGGCACAAATACATCAACAAATTCACCTAGAGCTGAAAGTTCCAAATCACCATCAAATGGAACTAATGAAATGAAGGGGAGCGATCAATCACCTACCCGACTGCTTAGGGGGAAGACCCGGCGTAATGGTATTGTGTCTAAGCAGGATGGAATTGCTAATGGAGCTTCACACTCTGGAAAACTTGATTACCAGAGTAAGATGGTACCAGAACATTCAACTTCACAGGAGCTTACAGATTTTCAAGAAGGGAATATTGGATCACTGCAAGATGTGCAAACTACTCTTGAGATGAAACAGTTAAGGAAGGAACTTCAACAAGAAAGGGATCAGTTGGCAGATGTGCAATTAAGATTACGAGAGGAGCAAAAATTGAACAAAAAGTTCCAGGAGGAGTTGAACTCTCTACATATGAACAAGGACAAAGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGGCATCGTTGGAGATGAGCAACATTATAAGAGAATTGAATGAGAAGAAATTAGAAGTAAAGCAATTGCAAGTTGAGTTGAACAGAAGAGAGAATATGAAGTCCGATGATAATGTGGAGGGATTGAAGAGATTAATTACAAAATTGGAGAAAGAAAAAAGTACTCTTGAGATGGGAAAAAAGGAACTTGAAGATACATTGGAAAAATGCCGAACATCTTCAAGTGTTGAAGCCCAATCAAGTTCTTTGGAAATGGTGAATAGGCATCTAAGTGGTTCAAACGAGAAATTAGGTCTATCTGGAATTTCCCCTGGAAAAGAAGATATGGATCTATCATTGCAAAAATTAAAGAAAGATCTGAAGGAAATGCAGCAGGAGAGAGACAAAGCTGTGCATGAACTATCACGTCTCAAGCAGCATTTATTGGAAAAGGAATCTGAGGAATCAGAAAAGATGGATGAAGATAGCAGAATAATTGAAGAACTTCGGCATGATAATGAATATCAAAGGGGCCAGATATTGCATTTAGAGAAGGCATTGAATCAGGCAATTGCAACTCAGAAGGAGCTTGAGATGTACGGTAAAAATGAACTCCAGAAATCCAAGGAAATTATTGAAGAGCTTAACAGAAAACTTGCAAACTATACGAGTATTATAGATTCCAAGAACGTTGAACTGTTGAATCTTCAAACTGCACTCGGCCAGTACTACGCAGAAATTGAAGCCAAGGAACACTTGGAGAGTGACTTGGCTCGGGAAAGAGAAGGAGAAGCTAAATTGTCTCAAATGCTAAAAGATGCCAACCAAAGAGAAGATGCATTAAAGAAGGAGAAGGAAGAAATTTTGTCAAAGCTTTCACTTTCTGAACGAGCATTGGGAGAATGGAAAAGCAGAGTAAA
TAAACTTGAGGAAGATAATTCTAAGCTACGCCGTGCTCTTGATCAGAGTATGACAAGGCTGAATAGGATGTCGGTGGATTCAGATTTCCTTGTTGATAGGCGTATCGTGATCAAATTACTGGTGACGTACTTCCAGAAAAACCACAGCAAAGAGGTTTTGGATCTTATGGTCCGTATGCTTGGATTTTCTGAAGATGACAAGATGAGGATAGGAGCTGCTAAACAAGGTCCAAGCAAGGGTGTTGTACGTGGAGTTTTGGGCCTTCCTGGACGCCTGGTCGGTGGGATTTTGGGAGGAAGCTCAGCGGAGACGCCGGCTAATATGGCCTCTGATAACCAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTATAAAGTCTCTGTTCTAAGCATGGATTTTCGTTTACAGTCCTTTGCAGATCTATGGGTTGACTTTCTTCTCAAGGAGAATGAAGAAAGAGAGAAGAGAGAAGCCAAGGAAAGCCTCAAGCTTCAGGAAGAATCGCAACTTAACGGTCCAAATGTTGGCAGTATCGATCCCGGAACGAGGGCAACTGGTTCGACATCTGAATCTTCAAGAACAGGTTTTCCTTCACATCATCATCATCATCATCATCATCAATCGACTCACCTTCCTTTTGGTGGCGATTTTCGCCTTTCAAGACACCACTCTGAATCTGAATTCTCAACAGTTCCTCTCACATCAACAACTGAGAACACTCATTAT

Protein sequence

MIYYASIDMSSGINRQNIALDDAQCINSIANLKENLNKIALDVHHDDDEEEFAIYGSNGGDVDVSVSDRRNSHSFAHSNPVTRSPIANGIEDAHHPEIEQYKTEIKRLQESERDIKSLSMNYAALLKEKESLEGTNTSTNSPRAESSKSPSNGTNEMKGSDQSPTRLLRGKTRRNGIVSKQDGIANGASHSGKLDYQSKMVPEHSTSQELTDFQEGNIGSLQDVQTTLEMKQLRKELQQERDQLADVQLRLREEQKLNKKFQEELNSLHMNKDKASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQSFADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGSIDPGTRATGSTSESSRTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTSTTENTHYKVSVLSMDFRLQSFADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGSIDPGTRATGSTSESSRTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTSTTENTHY
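The protein sequence is the conceptual translation of the CDS. As a rough illustration, the sketch below translates the first 27 bases of the CDS listed above using the standard genetic code. The function name `translate` is a hypothetical helper, and the use of the standard code (NCBI translation table 1) is an assumption, since the page does not say which table was applied.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered by
# base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1.

    Codons containing ambiguous bases (for example N) become "X";
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First nine codons of the CDS shown above.
print(translate("ATGATTTATTATGCAAGTATCGACATG"))
```

The printed peptide can be compared with the first nine residues of the protein sequence above as a quick consistency check.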
BLAST of Cp4.1LG12g03990 vs. Swiss-Prot
Match: GOGC4_ARATH (Golgin candidate 4 OS=Arabidopsis thaliana GN=GC4 PE=2 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 7.3e-125
Identity = 281/497 (56.54%), Positives = 363/497 (73.04%), Query Frame = 1

Query: 323 EKEKSTLEMGKKELEDTLEKCRTSSSV----EAQSSSLEMVNRHLSGSNEASLEMSNIIR 382
           E+ +S      +ELE   EK      +      Q+ + +   + L    E +L  SN +R
Sbjct: 193 ERTRSMASAQARELEKEREKSANLQILLQEERKQNETFKEELQSLRLDKEKTLMESNKVR 252

Query: 383 -ELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKC 442
            EL+ K  E++QLQ++LN  E      + E LK +   LEKE + L++ + ELE  LE  
Sbjct: 253 RELDAKLAEIRQLQMKLNGGEQHAFGISRENLKEVNKALEKENNELKLKRSELEAALEAS 312

Query: 443 RTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKA 502
           + S+S +    S E ++RHLS  +E+   +G  PGKEDM+ SLQ+L+K+L+E ++E+DKA
Sbjct: 313 QKSTSRKLFPKSTEDLSRHLSSLDEEK--AGTFPGKEDMEKSLQRLEKELEEARREKDKA 372

Query: 503 VHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKEL 562
             EL RLKQHLLEKE+EESEKMDEDSR+I+ELR  NEYQR QIL LEKAL Q +A Q+E+
Sbjct: 373 RQELKRLKQHLLEKETEESEKMDEDSRLIDELRQTNEYQRSQILGLEKALRQTMANQEEI 432

Query: 563 EMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDL 622
           +     E++KSK IIE+LN+KLAN    IDSKNVELLNLQTALGQYYAEIEAKEH E +L
Sbjct: 433 KSSSDLEIRKSKGIIEDLNQKLANCLRTIDSKNVELLNLQTALGQYYAEIEAKEHFEREL 492

Query: 623 AREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLR 682
           A  +E   KLS  LKD +++ ++ KKEKEEI SK+  +E    EWK+RV+K+E+DN+K+R
Sbjct: 493 AVAKEDAMKLSARLKDVDEQLESSKKEKEEITSKVLHAENIAAEWKNRVSKVEDDNAKVR 552

Query: 683 RALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRI 742
           R L+QSMTRLNRMS+DSDFLVDRRIVIKLLVTYFQ+NHS+EVLDLMVRMLGFSE++K RI
Sbjct: 553 RVLEQSMTRLNRMSMDSDFLVDRRIVIKLLVTYFQRNHSREVLDLMVRMLGFSEEEKQRI 612

Query: 743 GAAKQGPS-KGVVRGVLGLPGRLVGGIL--GGSSAETPANMASDNQSFADLWVDFLLKEN 802
           G A+QG + KGVVRGVLG PGRLVGGIL  GG S ++  NMASDNQSFAD+WV+FLLK+ 
Sbjct: 613 GLAQQGAAGKGVVRGVLGFPGRLVGGILGGGGGSPDSHPNMASDNQSFADMWVEFLLKDA 672

Query: 803 EEREKREAKESLKLQEE 812
           EERE+REA+++   ++E
Sbjct: 673 EERERREAEDAANKEQE 687
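
The percentages in each HSP header follow directly from the match counts (matching positions divided by alignment length). A minimal Python sketch of reading one header and recomputing those values is given here. The function names `parse_hsp_header` and `percent` are illustrative assumptions, not part of any BLAST tool, and the regular expressions assume the two-line header layout used on this page.

```python
import re

# Patterns for the two header lines of an HSP as printed on this page.
# "Posi?tives" accepts both the standard spelling and the "Postives" variant.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNTS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def parse_hsp_header(score_line, counts_line):
    """Return bit score, raw score, E-value and match counts for one HSP."""
    match = HSP_SCORE.search(score_line)
    if match is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    result = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    # Each count is stored as (matching positions, alignment length).
    for label, hits, length in HSP_COUNTS.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = (int(hits), int(length))
    return result


def percent(hits, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


hsp = parse_hsp_header(
    "HSP 1 Score: 449.9 bits (1156), Expect = 7.3e-125",
    "Identity = 281/497 (56.54%), Postives = 363/497 (73.04%), Query Frame = 1",
)
identity_pct = percent(*hsp["identity"])
positives_pct = percent(*hsp["positives"])
```

Applied to the GOGC4_ARATH header above, the recomputed values agree with the printed 56.54% identity and 73.04% positives.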

BLAST of Cp4.1LG12g03990 vs. Swiss-Prot
Match: GOGC3_ARATH (Golgin candidate 3 OS=Arabidopsis thaliana GN=GC3 PE=1 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 1.4e-123
Identity = 300/599 (50.08%), Positives = 403/599 (67.28%), Query Frame = 1

Query: 219 GSLQD--VQTTLEMKQLRKELQQERDQLADVQLRLREEQKLNKKFQEELNSLHMNKDKAS 278
           GSL+     T+  +K+ R ++ +  +  A      +   +L+K      +  HM+  K  
Sbjct: 112 GSLKQNLTSTSAALKEARTDISRGSNNYAIKGNNDQSPNRLHKSVSHLKSPNHMSNGKGK 171

Query: 279 LEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKEL 338
            +  + I+E        K L   L  R   KS   V+      T+L KE+  L   +  L
Sbjct: 172 -DTDSFIKE--------KDLADMLEDRT--KSMAAVQA-----TELAKEREKLRDFQLSL 231

Query: 339 EDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNR 398
           +   E+ + S S + +  S+ +        N+ S+E+S +  EL+ K LE+K LQ++L  
Sbjct: 232 Q---EERKRSESFKEELESMRL------DKNKTSMEISKMRSELDAKLLEIKHLQMKLTG 291

Query: 399 RENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRH 458
           +E+      +E LK +   LEKE + L++ + ELE  LE+ R  ++ +    + E + RH
Sbjct: 292 QESHAIGPGMEHLKEVNKALEKENNELKLKRSELEAALEESRKLTNSKVFPDATESLTRH 351

Query: 459 LSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKESEES 518
            S  +++   S   PGKE+M+ SLQ+L+ DLKE Q+ERDKA  EL RLKQHLLEKE+EES
Sbjct: 352 PSTLDKEKPES--FPGKEEMEQSLQRLEMDLKETQRERDKARQELKRLKQHLLEKETEES 411

Query: 519 EKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIEELN 578
           EKMDEDSR+IEELR  NEYQR QI HLEK+L QAI+ Q++  +   N+++K K+ +++LN
Sbjct: 412 EKMDEDSRLIEELRQTNEYQRSQISHLEKSLKQAISNQEDNRLSNDNQIRKLKDTVDDLN 471

Query: 579 RKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQ 638
           +KL N    I+SKNVELLNLQTALGQYYAEIEAKEH E +LA  ++   KLS  LKD+++
Sbjct: 472 QKLTNCLRTIESKNVELLNLQTALGQYYAEIEAKEHFERELAMAKDELMKLSARLKDSDE 531

Query: 639 REDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDF 698
           R ++  KEKE++ SKL  +E+   EWK+RV K+EEDN+K+RR L+QSMTRLNRMS++SD+
Sbjct: 532 RLESSNKEKEDVTSKLLHAEKVAAEWKNRVTKVEEDNAKVRRVLEQSMTRLNRMSMESDY 591

Query: 699 LVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLP 758
           LVDRRIVIKLLVTYFQKNH+KEVLDLMVRMLGFSE+DK RIGAAKQG  KGVVRGVLG P
Sbjct: 592 LVDRRIVIKLLVTYFQKNHNKEVLDLMVRMLGFSEEDKERIGAAKQGGGKGVVRGVLGFP 651

Query: 759 GRLVGGILGGSSAETPANMASDNQSFADLWVDFLLKENEEREKREAKESL--KLQEESQ 814
           GR VGGILGG SAE  AN ASDNQSFADLWVDFLLK+ EERE+REA+E+   K +++S+
Sbjct: 652 GRFVGGILGGKSAELHANAASDNQSFADLWVDFLLKDAEERERREAEEAAASKAKQDSE 683

BLAST of Cp4.1LG12g03990 vs. TrEMBL
Match: A0A0A0K888_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006840 PE=4 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 2.8e-224
Identity = 463/590 (78.47%), Positives = 504/590 (85.42%), Query Frame = 1

Query: 305 NMKSDDNVEGL---KRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNR 364
           NM S  +V+     K+L  +L++E+  L   +  L    E+ + +   + + +SL M   
Sbjct: 206 NMGSLQDVQATLEYKQLRKELQQEREQLADVQLRLR---EEQKLNKKFQEELNSLRM--- 265

Query: 365 HLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKS 424
                ++ASLEMS+I+RELNEKKLEVKQLQVELNRRE MKSDDNVE LKRLIT LEKEKS
Sbjct: 266 ---NKDKASLEMSDILRELNEKKLEVKQLQVELNRREKMKSDDNVEELKRLITTLEKEKS 325

Query: 425 TLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQ 484
           TLEM KKEL+DTLEK +  S VE  S SLEMVNRHLS S+EKLG SGIS GKED DLSLQ
Sbjct: 326 TLEMEKKELKDTLEKSQELSGVETPSKSLEMVNRHLSDSSEKLGPSGISLGKEDRDLSLQ 385

Query: 485 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQIL 544
           KLKKDLKEMQQERDKA HELSRLKQHLLEKESEESEKMDEDSRIIEELRH+NEYQRGQI+
Sbjct: 386 KLKKDLKEMQQERDKAAHELSRLKQHLLEKESEESEKMDEDSRIIEELRHNNEYQRGQIM 445

Query: 545 HLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALG 604
           HLEKALNQAIA QKE EMYG NELQKSKEIIE+L+RKLAN  SIIDSKN+ELLNLQTALG
Sbjct: 446 HLEKALNQAIAMQKEAEMYGNNELQKSKEIIEDLHRKLANCMSIIDSKNIELLNLQTALG 505

Query: 605 QYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGE 664
           QYYAEIEAKEHLES LARERE EAKLSQMLKDANQREDALKKEKEEILSKLS+SERALGE
Sbjct: 506 QYYAEIEAKEHLESVLAREREEEAKLSQMLKDANQREDALKKEKEEILSKLSISERALGE 565

Query: 665 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLD 724
           WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQ+NHSKEVLD
Sbjct: 566 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQRNHSKEVLD 625

Query: 725 LMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 784
           LMVRMLGFSED+K+RIGAAKQGPSKGVVRGVLGLPGRLVGGILGGS+ ETPANMASDNQS
Sbjct: 626 LMVRMLGFSEDEKLRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSTTETPANMASDNQS 685

Query: 785 FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGS-----IDPGTRATGSTSESS 844
           FADLWVDFLLKENEEREKREA+ESLKL+E SQ +  +V S     +DP T+  GST   S
Sbjct: 686 FADLWVDFLLKENEEREKREAEESLKLREASQSSSSDVASAGSPLLDPRTKTIGSTPNPS 745

Query: 845 RTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLT-STTENTH 886
           RTGFPSH       QSTHLPFG DFRLSRHHS+SEFSTVPLT S++ENT+
Sbjct: 746 RTGFPSHL------QSTHLPFGSDFRLSRHHSDSEFSTVPLTSSSSENTY 780

BLAST of Cp4.1LG12g03990 vs. TrEMBL
Match: A0A061DSS6_THECC (GRIP-related ARF-binding domain-containing protein 1 isoform 1 OS=Theobroma cacao GN=TCM_005246 PE=4 SV=1)

HSP 1 Score: 530.0 bits (1364), Expect = 6.2e-147
Identity = 348/617 (56.40%), Positives = 434/617 (70.34%), Query Frame = 1

Query: 275 ASLEMSN-IIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGK 334
           A  +MSN +  + +EK+ E+  L  E NR        +   +K+   +LEKE+  L   +
Sbjct: 169 AGNQMSNGLSSKHDEKEKELADLLEEKNRSLEAVQASHESQIKQFNMELEKERDKLANVQ 228

Query: 335 KELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVE 394
             L    E+ + + S + +   L+      S  +++  E+S I  ELNEK +E+++LQ+E
Sbjct: 229 IRLH---EERKLNESFQEELKLLK------SDKDKSVTELSKIRNELNEKIIEIRRLQME 288

Query: 395 LNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMV 454
           LNRREN  +DD +E L+R+I  LEKE + L+  K ELE  LE  + S + +    + E +
Sbjct: 289 LNRRENDSADDTLENLRRVIATLEKENTHLKKEKNELEAALEISKKSLTGKIHPDAAETL 348

Query: 455 NRHLSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKES 514
           +         +  SG  PGK++M+LSLQKL+ DLKE  +ERDKA+ EL+RLKQHLLEKES
Sbjct: 349 D---------IDSSGCFPGKKEMELSLQKLEDDLKETCRERDKALQELTRLKQHLLEKES 408

Query: 515 EESEKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIE 574
           EESEKMDEDS+IIEEL   NEYQR QI HLEKAL  A+A Q+E++M   NE+QKSKEII+
Sbjct: 409 EESEKMDEDSKIIEELHESNEYQRAQIAHLEKALKLAMANQEEVKMMNNNEIQKSKEIID 468

Query: 575 ELNRKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKD 634
           +LN+KLAN    ID KNVELLNLQTALGQYYAEIEAKEHLE DLA  RE  AKLS +LKD
Sbjct: 469 DLNQKLANCMRTIDLKNVELLNLQTALGQYYAEIEAKEHLERDLALAREESAKLSGLLKD 528

Query: 635 ANQREDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVD 694
           A++R + LK+EKEEIL KLS +ER L E K+RVNKLEEDN KLRRAL+QSMTRLNRMS+D
Sbjct: 529 ADERAELLKREKEEILVKLSQTERMLAEGKARVNKLEEDNGKLRRALEQSMTRLNRMSMD 588

Query: 695 SDFLVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVL 754
           SD+LVDRRIVIKLLVTYFQ+NHSKEVLDLMVRMLGFS++DK RIG A+QG  KGVVRGVL
Sbjct: 589 SDYLVDRRIVIKLLVTYFQRNHSKEVLDLMVRMLGFSDEDKQRIGVAQQGTGKGVVRGVL 648

Query: 755 GLPGRLVGGILGGSSAETPANMASDNQSFADLWVDFLLKENEEREKREAKESLKLQEESQ 814
           GLPGRLVGGILGGSS +  ANMASDNQS ADLWVDFLLKE EEREKRE+ E     +E+ 
Sbjct: 649 GLPGRLVGGILGGSSTDVHANMASDNQSIADLWVDFLLKETEEREKRESAEDASRSKEN- 708

Query: 815 LNGPNVGSID-----PGTRATGSTSESSRTGF-PSHHHHHHHHQSTHLPFGGDFRLSRHH 874
           L+G +  +       P  R T + S  SR+ F PS +       S  +P  G+FR    H
Sbjct: 709 LHGRSPDATGTSPSVPNQRTTTAGSGFSRSSFSPSQN-------SGPVPPQGNFR-QFEH 758

Query: 875 SESEFSTVPLTSTTENT 885
           S+SEFSTVPLTS+  ++
Sbjct: 769 SDSEFSTVPLTSSESSS 758

BLAST of Cp4.1LG12g03990 vs. TrEMBL
Match: W9R007_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022472 PE=4 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 3.1e-146
Identity = 352/639 (55.09%), Positives = 448/639 (70.11%), Query Frame = 1

Query: 255  QKLNKKFQEEL-NSLHMNKDKASLEMSNIIR-ELNEKKLEVKQLQVELNRRE-------N 314
            Q  N+ F +E+ N +   +D  S  +++ ++ +    K+E K    +   RE       N
Sbjct: 480  QAKNRYFGKEIHNGVVSKQDGMSNGITHAVQHDAIHSKVESKYSNFQGKEREYADSLETN 539

Query: 315  MKSDDNVEG---LKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRH 374
             +S   V+G   +++L  +LEKE+  L   + +LE   +K  +S   E +S   E     
Sbjct: 540  NRSSAAVQGTGEIRQLRMELEKERDLLRNIQLKLEGE-QKLNSSLREELKSLKTE----- 599

Query: 375  LSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKST 434
                ++ S +MS I  ELNEK   V++LQ+EL+RRE+ + DD VE LK+ I  LE+E ++
Sbjct: 600  ---KDKTSTDMSKIHAELNEKISAVRRLQMELSRRED-EGDDIVENLKKSIASLERENAS 659

Query: 435  LEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQK 494
            L+M K EL+  +++  T    +  S   E V +H +  NEK+  S   PG+E+M+LSLQK
Sbjct: 660  LKMEKNELKAAMDRIGTD---KKSSVVAETVTKHPNNLNEKVEPSASFPGREEMELSLQK 719

Query: 495  LKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQILH 554
            L K++KE Q ERDKA+ EL+RLKQHLLEKESEESEKMDEDS+IIEELR  NE QR QIL+
Sbjct: 720  LDKEIKETQHERDKALQELTRLKQHLLEKESEESEKMDEDSKIIEELRETNERQRTQILY 779

Query: 555  LEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALGQ 614
            LEKAL QA+A Q+E++M G NE+QK KE+I +LN++LAN T+ ID+KNVELLNLQTALGQ
Sbjct: 780  LEKALKQAVANQEEVKMIGNNEVQKLKEVIGDLNKRLANSTNTIDAKNVELLNLQTALGQ 839

Query: 615  YYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGEW 674
            YYAEIEAKEHLE DLAR RE  +KLS++LK+A+ + D LKKEKEEIL KL  +ER   +W
Sbjct: 840  YYAEIEAKEHLEGDLARAREESSKLSELLKNADYQADVLKKEKEEILFKLLQAERTATDW 899

Query: 675  KSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLDL 734
            KSRVNKLEEDN+KLRRAL+QSMTRLNRMS+DSD+LVDRRIVIKLLVTYFQ+NH+KEVLDL
Sbjct: 900  KSRVNKLEEDNAKLRRALEQSMTRLNRMSMDSDYLVDRRIVIKLLVTYFQRNHNKEVLDL 959

Query: 735  MVRMLGFSEDDKMRIGAA-KQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 794
            MVRMLGFSE+DK RIG A +QG  KGVVRGVLGLPGRLVGGILGGSS + PAN A DNQS
Sbjct: 960  MVRMLGFSEEDKQRIGVAQQQGAGKGVVRGVLGLPGRLVGGILGGSSGQLPANAAMDNQS 1019

Query: 795  FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGSIDPGTRATGSTSESSRTGF- 854
            FADLWVDFLLKE EERE+REA ++     +     PN+ +  P      ++S  SRT   
Sbjct: 1020 FADLWVDFLLKEGEERERREAMDASGKDMDELHKTPNIANAAPPLADPKTSSGLSRTTLS 1079

Query: 855  PSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTS 880
            PS +       S+  PF G+   S  HS+SEFSTVPLTS
Sbjct: 1080 PSQN-------SSPFPFRGNVGQS-DHSDSEFSTVPLTS 1097

BLAST of Cp4.1LG12g03990 vs. TrEMBL
Match: B9RDZ2_RICCO (Structural maintenance of chromosome 1 protein, putative OS=Ricinus communis GN=RCOM_1616980 PE=4 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 9.9e-145
Identity = 345/592 (58.28%), Positives = 426/592 (71.96%), Query Frame = 1

Query: 292 EVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEA 351
           E+  L  E NR        +   +K+L  +LEKE+  +   + +L+   E+ + + S + 
Sbjct: 183 ELADLLEEKNRLVAAMQATHELQIKQLRLELEKERDKVTNVQIKLQ---EEHKLNESFQE 242

Query: 352 QSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKR 411
           Q  +L+M      G ++ S+EMS I  ELNEK  E+++LQ+ L+RRE+  +DD V+GLKR
Sbjct: 243 QVRTLKM------GESKTSMEMSKIRNELNEKISEIRRLQIILSRREDENADDTVKGLKR 302

Query: 412 LITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISP 471
           ++  LEKE + L++ K ELE  LE  R +S  E            L G   K+  SG   
Sbjct: 303 VLATLEKENANLKIAKNELEAALETSRNASPGETS----------LDG---KVDPSGSFN 362

Query: 472 GKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRH 531
            KE M+ SLQKL+K+LKE + ERDKA+ ELSRLKQHLL+KE+EESEKMDEDS+IIEELR 
Sbjct: 363 AKE-MESSLQKLEKELKETRHERDKALQELSRLKQHLLDKENEESEKMDEDSKIIEELRE 422

Query: 532 DNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNV 591
           +NEYQ+ Q+LHLEKAL QAIA Q+E+ M   NE+QKSKEIIE+LN+KLAN  SIIDSKNV
Sbjct: 423 NNEYQKAQVLHLEKALKQAIANQEEVRMINNNEIQKSKEIIEDLNKKLANCMSIIDSKNV 482

Query: 592 ELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSK 651
           ELLNLQTALGQY+AEIEAKE LE +LA  RE  AKLS++LKDA Q  +ALKKEKE+IL+K
Sbjct: 483 ELLNLQTALGQYFAEIEAKEQLERNLALAREETAKLSELLKDAEQGTEALKKEKEKILAK 542

Query: 652 LSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYF 711
           LS +ER L E K+RVNKLEEDN+KLRR L+QSM+RLNRMSVDSDFLVDRRIVIKLLVTYF
Sbjct: 543 LSHNERTLAEGKNRVNKLEEDNAKLRRVLEQSMSRLNRMSVDSDFLVDRRIVIKLLVTYF 602

Query: 712 QKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAET 771
           Q+NHSKEVLDLMVRMLGFS +DK RIG A+QG  +GVVRGVLGLPGRLVGGILGGSS++ 
Sbjct: 603 QRNHSKEVLDLMVRMLGFSNEDKQRIGIAQQG-GRGVVRGVLGLPGRLVGGILGGSSSDA 662

Query: 772 PANMASDNQSFADLWVDFLLKENEEREKREAKESL-KLQEESQLNGPNVGSIDPGT--RA 831
            AN AS+NQSFADLWVDFLLK+ EERE+RE+ E+   L E+SQ   P  GS  P +    
Sbjct: 663 HANAASENQSFADLWVDFLLKQTEERERRESAENRGGLMEDSQGQSPISGSPTPPSIPNT 722

Query: 832 TGSTSESSRTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTST 881
            G+ S  SR  F     +      + LP  G+ R    HS+SEFSTVPLTS+
Sbjct: 723 AGTISGISRPKFSPTPDY------SPLPVQGNLR-PFEHSDSEFSTVPLTSS 743

BLAST of Cp4.1LG12g03990 vs. TrEMBL
Match: A0A0D2P6S2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G104100 PE=4 SV=1)

HSP 1 Score: 515.4 bits (1326), Expect = 1.6e-142
Identity = 330/599 (55.09%), Positives = 417/599 (69.62%), Query Frame = 1

Query: 295 QLQVELNRRENMKSDDNVEGLKRLITKLE-KEKSTLEMGKKELEDTLEKC-----RTSSS 354
           Q+   L+ + + +  D +E   R +  ++   +S ++  K ELE   +K      R    
Sbjct: 139 QMSNGLSSKHDEELADLLEEKTRSLEAIQASHESQIKQFKMELEKEHDKLVNVQIRLQEE 198

Query: 355 VEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEG 414
            +   S  E +    S  ++ S E+S I  E NEK +E+ +LQ ELNR+E+  SDD +E 
Sbjct: 199 HKLNESFQEELKLLKSDKDKRSAELSKIRNESNEKTIEISRLQKELNRQEDESSDDTMEN 258

Query: 415 LKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSG 474
           +KRL+  LEKE + L+M K ELE  LE     SS +A +  ++ ++  +   N K+  SG
Sbjct: 259 MKRLVATLEKENTHLKMEKNELEAALE-----SSKKASTDKIDKIDP-IPSENLKVDSSG 318

Query: 475 ISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEE 534
            SPGK++ +LSLQKL+KDLKE   ERDKA+ EL+RLKQHLLEK SEESE MDEDS++IEE
Sbjct: 319 SSPGKKETELSLQKLEKDLKETCCERDKALQELTRLKQHLLEKASEESETMDEDSKVIEE 378

Query: 535 LRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDS 594
           L   NEYQR QI HLEKALN A+A Q+E+++   NE+QKSKEII+ LN+KL N    ID+
Sbjct: 379 LLERNEYQRAQIAHLEKALNMAMANQEEVKLMNNNEIQKSKEIIDNLNKKLTNRMRTIDA 438

Query: 595 KNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEI 654
           K+VELLNLQTALGQY+AE+EAKEHLE DLA  RE  A+L+ +LKDA++  +  K+EKEEI
Sbjct: 439 KDVELLNLQTALGQYHAELEAKEHLERDLALAREESARLTGLLKDADEHAEFSKREKEEI 498

Query: 655 LSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLV 714
           L+KLS +ER L E K+RVNKLEEDN KLRRAL+QSMTRLNRMS+DSD+LVD RIVIKLLV
Sbjct: 499 LTKLSQTERMLAEGKTRVNKLEEDNGKLRRALEQSMTRLNRMSMDSDYLVDSRIVIKLLV 558

Query: 715 TYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSS 774
           TYFQ+NHSKEVLDLMVRMLGFS++DK RIG A+ GP KGVVRGVLGLPG LVGGILGGS 
Sbjct: 559 TYFQRNHSKEVLDLMVRMLGFSDEDKQRIGVAQHGPGKGVVRGVLGLPGHLVGGILGGSP 618

Query: 775 AETPANMASDNQSFADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGSIDPGT-- 834
           A T ANMASDNQS ADLWVDFLLKE EEREK+E  E +    E  L+G ++ +  P T  
Sbjct: 619 ANTQANMASDNQSIADLWVDFLLKETEEREKKEPIEDVGRSRE-DLHGRSLNTAGPSTFV 678

Query: 835 -RATGSTSESSRTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTSTTENT 885
              T + S+ SR+ F            + LP  G F+    HS+SEFSTVPLTS+  +T
Sbjct: 679 SEQTTAVSDVSRSSF----------SPSPLPSQGSFQ-QLEHSDSEFSTVPLTSSESST 719

BLAST of Cp4.1LG12g03990 vs. TAIR10
Match: AT2G46180.1 (AT2G46180.1 golgin candidate 4)

HSP 1 Score: 449.9 bits (1156), Expect = 4.1e-126
Identity = 281/497 (56.54%), Positives = 363/497 (73.04%), Query Frame = 1

Query: 323 EKEKSTLEMGKKELEDTLEKCRTSSSV----EAQSSSLEMVNRHLSGSNEASLEMSNIIR 382
           E+ +S      +ELE   EK      +      Q+ + +   + L    E +L  SN +R
Sbjct: 193 ERTRSMASAQARELEKEREKSANLQILLQEERKQNETFKEELQSLRLDKEKTLMESNKVR 252

Query: 383 -ELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKC 442
            EL+ K  E++QLQ++LN  E      + E LK +   LEKE + L++ + ELE  LE  
Sbjct: 253 RELDAKLAEIRQLQMKLNGGEQHAFGISRENLKEVNKALEKENNELKLKRSELEAALEAS 312

Query: 443 RTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKA 502
           + S+S +    S E ++RHLS  +E+   +G  PGKEDM+ SLQ+L+K+L+E ++E+DKA
Sbjct: 313 QKSTSRKLFPKSTEDLSRHLSSLDEEK--AGTFPGKEDMEKSLQRLEKELEEARREKDKA 372

Query: 503 VHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKEL 562
             EL RLKQHLLEKE+EESEKMDEDSR+I+ELR  NEYQR QIL LEKAL Q +A Q+E+
Sbjct: 373 RQELKRLKQHLLEKETEESEKMDEDSRLIDELRQTNEYQRSQILGLEKALRQTMANQEEI 432

Query: 563 EMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDL 622
           +     E++KSK IIE+LN+KLAN    IDSKNVELLNLQTALGQYYAEIEAKEH E +L
Sbjct: 433 KSSSDLEIRKSKGIIEDLNQKLANCLRTIDSKNVELLNLQTALGQYYAEIEAKEHFEREL 492

Query: 623 AREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLR 682
           A  +E   KLS  LKD +++ ++ KKEKEEI SK+  +E    EWK+RV+K+E+DN+K+R
Sbjct: 493 AVAKEDAMKLSARLKDVDEQLESSKKEKEEITSKVLHAENIAAEWKNRVSKVEDDNAKVR 552

Query: 683 RALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRI 742
           R L+QSMTRLNRMS+DSDFLVDRRIVIKLLVTYFQ+NHS+EVLDLMVRMLGFSE++K RI
Sbjct: 553 RVLEQSMTRLNRMSMDSDFLVDRRIVIKLLVTYFQRNHSREVLDLMVRMLGFSEEEKQRI 612

Query: 743 GAAKQGPS-KGVVRGVLGLPGRLVGGIL--GGSSAETPANMASDNQSFADLWVDFLLKEN 802
           G A+QG + KGVVRGVLG PGRLVGGIL  GG S ++  NMASDNQSFAD+WV+FLLK+ 
Sbjct: 613 GLAQQGAAGKGVVRGVLGFPGRLVGGILGGGGGSPDSHPNMASDNQSFADMWVEFLLKDA 672

Query: 803 EEREKREAKESLKLQEE 812
           EERE+REA+++   ++E
Sbjct: 673 EERERREAEDAANKEQE 687

BLAST of Cp4.1LG12g03990 vs. TAIR10
Match: AT3G61570.1 (AT3G61570.1 GRIP-related ARF-binding domain-containing protein 1)

HSP 1 Score: 445.7 bits (1145), Expect = 7.8e-125
Identity = 300/599 (50.08%), Positives = 403/599 (67.28%), Query Frame = 1

Query: 219 GSLQD--VQTTLEMKQLRKELQQERDQLADVQLRLREEQKLNKKFQEELNSLHMNKDKAS 278
           GSL+     T+  +K+ R ++ +  +  A      +   +L+K      +  HM+  K  
Sbjct: 112 GSLKQNLTSTSAALKEARTDISRGSNNYAIKGNNDQSPNRLHKSVSHLKSPNHMSNGKGK 171

Query: 279 LEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKEL 338
            +  + I+E        K L   L  R   KS   V+      T+L KE+  L   +  L
Sbjct: 172 -DTDSFIKE--------KDLADMLEDRT--KSMAAVQA-----TELAKEREKLRDFQLSL 231

Query: 339 EDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNR 398
           +   E+ + S S + +  S+ +        N+ S+E+S +  EL+ K LE+K LQ++L  
Sbjct: 232 Q---EERKRSESFKEELESMRL------DKNKTSMEISKMRSELDAKLLEIKHLQMKLTG 291

Query: 399 RENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRH 458
           +E+      +E LK +   LEKE + L++ + ELE  LE+ R  ++ +    + E + RH
Sbjct: 292 QESHAIGPGMEHLKEVNKALEKENNELKLKRSELEAALEESRKLTNSKVFPDATESLTRH 351

Query: 459 LSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKESEES 518
            S  +++   S   PGKE+M+ SLQ+L+ DLKE Q+ERDKA  EL RLKQHLLEKE+EES
Sbjct: 352 PSTLDKEKPES--FPGKEEMEQSLQRLEMDLKETQRERDKARQELKRLKQHLLEKETEES 411

Query: 519 EKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIEELN 578
           EKMDEDSR+IEELR  NEYQR QI HLEK+L QAI+ Q++  +   N+++K K+ +++LN
Sbjct: 412 EKMDEDSRLIEELRQTNEYQRSQISHLEKSLKQAISNQEDNRLSNDNQIRKLKDTVDDLN 471

Query: 579 RKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQ 638
           +KL N    I+SKNVELLNLQTALGQYYAEIEAKEH E +LA  ++   KLS  LKD+++
Sbjct: 472 QKLTNCLRTIESKNVELLNLQTALGQYYAEIEAKEHFERELAMAKDELMKLSARLKDSDE 531

Query: 639 REDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDF 698
           R ++  KEKE++ SKL  +E+   EWK+RV K+EEDN+K+RR L+QSMTRLNRMS++SD+
Sbjct: 532 RLESSNKEKEDVTSKLLHAEKVAAEWKNRVTKVEEDNAKVRRVLEQSMTRLNRMSMESDY 591

Query: 699 LVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLP 758
           LVDRRIVIKLLVTYFQKNH+KEVLDLMVRMLGFSE+DK RIGAAKQG  KGVVRGVLG P
Sbjct: 592 LVDRRIVIKLLVTYFQKNHNKEVLDLMVRMLGFSEEDKERIGAAKQGGGKGVVRGVLGFP 651

Query: 759 GRLVGGILGGSSAETPANMASDNQSFADLWVDFLLKENEEREKREAKESL--KLQEESQ 814
           GR VGGILGG SAE  AN ASDNQSFADLWVDFLLK+ EERE+REA+E+   K +++S+
Sbjct: 652 GRFVGGILGGKSAELHANAASDNQSFADLWVDFLLKDAEERERREAEEAAASKAKQDSE 683

BLAST of Cp4.1LG12g03990 vs. NCBI nr
Match: gi|659116612|ref|XP_008458163.1| (PREDICTED: golgin candidate 3 isoform X2 [Cucumis melo])

HSP 1 Score: 802.4 bits (2071), Expect = 9.3e-229
Identity = 470/589 (79.80%), Positives = 510/589 (86.59%), Query Frame = 1

Query: 305 NMKSDDNVEG---LKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNR 364
           NM S  +V+    LK+L  +L++E+  L   +  L    E+ + +   + + +SL+M   
Sbjct: 166 NMGSLQDVQATLELKQLRKELQQEREQLADVQLRLR---EEQKLNKKFQEELNSLQM--- 225

Query: 365 HLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKS 424
                ++ASLEMS+I+RELNEKKLEVKQLQVELNRRE MKSDDNVE LKRLIT LEKEKS
Sbjct: 226 ---NKDKASLEMSDILRELNEKKLEVKQLQVELNRREKMKSDDNVEELKRLITTLEKEKS 285

Query: 425 TLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQ 484
           TLEM KKEL+DTLEK R SS V   S SLEMVNRHLSGS+EKLG SG    KED DLSLQ
Sbjct: 286 TLEMEKKELKDTLEKSRESSGVGTPSKSLEMVNRHLSGSSEKLGPSG----KEDRDLSLQ 345

Query: 485 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQIL 544
           KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRH+NEYQRGQIL
Sbjct: 346 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHNNEYQRGQIL 405

Query: 545 HLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALG 604
           HLEKALNQAIATQKE EMYG NELQKSKEIIE+LNRKLAN  S IDSKN+ELLNLQTALG
Sbjct: 406 HLEKALNQAIATQKEAEMYGNNELQKSKEIIEDLNRKLANCMSTIDSKNIELLNLQTALG 465

Query: 605 QYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGE 664
           QYYAEIEAKEHLES LARERE EAKLSQMLKDANQREDALKKEKEEILSKLS+SERALGE
Sbjct: 466 QYYAEIEAKEHLESVLAREREEEAKLSQMLKDANQREDALKKEKEEILSKLSISERALGE 525

Query: 665 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLD 724
           WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQ+NHSKEVLD
Sbjct: 526 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQRNHSKEVLD 585

Query: 725 LMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 784
           LMVRMLGFSED+K+RIGAAKQGPSKGVVRGVLGLPGRLVGGILGGS+AETPANMASDNQS
Sbjct: 586 LMVRMLGFSEDEKLRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSAAETPANMASDNQS 645

Query: 785 FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVG-----SIDPGTRATGSTSESS 844
           FADLWVDFLLKENEEREKR+A+ESLKL+EESQ +GP+V      S+DP T+ TGST  SS
Sbjct: 646 FADLWVDFLLKENEEREKRQAEESLKLREESQSSGPDVALTGSPSLDPRTKTTGSTPNSS 705

Query: 845 RTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTSTTENTH 886
           RT FPSH       QSTHLPFG DFRLSRHHS+SEFSTVPLTS++ENT+
Sbjct: 706 RTAFPSHL------QSTHLPFGNDFRLSRHHSDSEFSTVPLTSSSENTY 735

BLAST of Cp4.1LG12g03990 vs. NCBI nr
Match: gi|659116610|ref|XP_008458162.1| (PREDICTED: golgin candidate 4 isoform X1 [Cucumis melo])

HSP 1 Score: 802.4 bits (2071), Expect = 9.3e-229
Identity = 470/589 (79.80%), Positives = 510/589 (86.59%), Query Frame = 1

Query: 305 NMKSDDNVEG---LKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNR 364
           NM S  +V+    LK+L  +L++E+  L   +  L    E+ + +   + + +SL+M   
Sbjct: 205 NMGSLQDVQATLELKQLRKELQQEREQLADVQLRLR---EEQKLNKKFQEELNSLQM--- 264

Query: 365 HLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKS 424
                ++ASLEMS+I+RELNEKKLEVKQLQVELNRRE MKSDDNVE LKRLIT LEKEKS
Sbjct: 265 ---NKDKASLEMSDILRELNEKKLEVKQLQVELNRREKMKSDDNVEELKRLITTLEKEKS 324

Query: 425 TLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQ 484
           TLEM KKEL+DTLEK R SS V   S SLEMVNRHLSGS+EKLG SG    KED DLSLQ
Sbjct: 325 TLEMEKKELKDTLEKSRESSGVGTPSKSLEMVNRHLSGSSEKLGPSG----KEDRDLSLQ 384

Query: 485 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQIL 544
           KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRH+NEYQRGQIL
Sbjct: 385 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHNNEYQRGQIL 444

Query: 545 HLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALG 604
           HLEKALNQAIATQKE EMYG NELQKSKEIIE+LNRKLAN  S IDSKN+ELLNLQTALG
Sbjct: 445 HLEKALNQAIATQKEAEMYGNNELQKSKEIIEDLNRKLANCMSTIDSKNIELLNLQTALG 504

Query: 605 QYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGE 664
           QYYAEIEAKEHLES LARERE EAKLSQMLKDANQREDALKKEKEEILSKLS+SERALGE
Sbjct: 505 QYYAEIEAKEHLESVLAREREEEAKLSQMLKDANQREDALKKEKEEILSKLSISERALGE 564

Query: 665 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLD 724
           WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQ+NHSKEVLD
Sbjct: 565 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQRNHSKEVLD 624

Query: 725 LMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 784
           LMVRMLGFSED+K+RIGAAKQGPSKGVVRGVLGLPGRLVGGILGGS+AETPANMASDNQS
Sbjct: 625 LMVRMLGFSEDEKLRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSAAETPANMASDNQS 684

Query: 785 FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVG-----SIDPGTRATGSTSESS 844
           FADLWVDFLLKENEEREKR+A+ESLKL+EESQ +GP+V      S+DP T+ TGST  SS
Sbjct: 685 FADLWVDFLLKENEEREKRQAEESLKLREESQSSGPDVALTGSPSLDPRTKTTGSTPNSS 744

Query: 845 RTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTSTTENTH 886
           RT FPSH       QSTHLPFG DFRLSRHHS+SEFSTVPLTS++ENT+
Sbjct: 745 RTAFPSHL------QSTHLPFGNDFRLSRHHSDSEFSTVPLTSSSENTY 774

BLAST of Cp4.1LG12g03990 vs. NCBI nr
Match: gi|449441372|ref|XP_004138456.1| (PREDICTED: golgin candidate 4 [Cucumis sativus])

HSP 1 Score: 786.9 bits (2031), Expect = 4.0e-224
Identity = 463/590 (78.47%), Positives = 504/590 (85.42%), Query Frame = 1

Query: 305 NMKSDDNVEGL---KRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNR 364
           NM S  +V+     K+L  +L++E+  L   +  L    E+ + +   + + +SL M   
Sbjct: 206 NMGSLQDVQATLEYKQLRKELQQEREQLADVQLRLR---EEQKLNKKFQEELNSLRM--- 265

Query: 365 HLSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKS 424
                ++ASLEMS+I+RELNEKKLEVKQLQVELNRRE MKSDDNVE LKRLIT LEKEKS
Sbjct: 266 ---NKDKASLEMSDILRELNEKKLEVKQLQVELNRREKMKSDDNVEELKRLITTLEKEKS 325

Query: 425 TLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQ 484
           TLEM KKEL+DTLEK +  S VE  S SLEMVNRHLS S+EKLG SGIS GKED DLSLQ
Sbjct: 326 TLEMEKKELKDTLEKSQELSGVETPSKSLEMVNRHLSDSSEKLGPSGISLGKEDRDLSLQ 385

Query: 485 KLKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQIL 544
           KLKKDLKEMQQERDKA HELSRLKQHLLEKESEESEKMDEDSRIIEELRH+NEYQRGQI+
Sbjct: 386 KLKKDLKEMQQERDKAAHELSRLKQHLLEKESEESEKMDEDSRIIEELRHNNEYQRGQIM 445

Query: 545 HLEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALG 604
           HLEKALNQAIA QKE EMYG NELQKSKEIIE+L+RKLAN  SIIDSKN+ELLNLQTALG
Sbjct: 446 HLEKALNQAIAMQKEAEMYGNNELQKSKEIIEDLHRKLANCMSIIDSKNIELLNLQTALG 505

Query: 605 QYYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGE 664
           QYYAEIEAKEHLES LARERE EAKLSQMLKDANQREDALKKEKEEILSKLS+SERALGE
Sbjct: 506 QYYAEIEAKEHLESVLAREREEEAKLSQMLKDANQREDALKKEKEEILSKLSISERALGE 565

Query: 665 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLD 724
           WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQ+NHSKEVLD
Sbjct: 566 WKSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQRNHSKEVLD 625

Query: 725 LMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 784
           LMVRMLGFSED+K+RIGAAKQGPSKGVVRGVLGLPGRLVGGILGGS+ ETPANMASDNQS
Sbjct: 626 LMVRMLGFSEDEKLRIGAAKQGPSKGVVRGVLGLPGRLVGGILGGSTTETPANMASDNQS 685

Query: 785 FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGS-----IDPGTRATGSTSESS 844
           FADLWVDFLLKENEEREKREA+ESLKL+E SQ +  +V S     +DP T+  GST   S
Sbjct: 686 FADLWVDFLLKENEEREKREAEESLKLREASQSSSSDVASAGSPLLDPRTKTIGSTPNPS 745

Query: 845 RTGFPSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLT-STTENTH 886
           RTGFPSH       QSTHLPFG DFRLSRHHS+SEFSTVPLT S++ENT+
Sbjct: 746 RTGFPSHL------QSTHLPFGSDFRLSRHHSDSEFSTVPLTSSSSENTY 780

BLAST of Cp4.1LG12g03990 vs. NCBI nr
Match: gi|590721690|ref|XP_007051687.1| (GRIP-related ARF-binding domain-containing protein 1 isoform 1 [Theobroma cacao])

HSP 1 Score: 530.0 bits (1364), Expect = 8.9e-147
Identity = 348/617 (56.40%), Positives = 434/617 (70.34%), Query Frame = 1

Query: 275 ASLEMSN-IIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGK 334
           A  +MSN +  + +EK+ E+  L  E NR        +   +K+   +LEKE+  L   +
Sbjct: 169 AGNQMSNGLSSKHDEKEKELADLLEEKNRSLEAVQASHESQIKQFNMELEKERDKLANVQ 228

Query: 335 KELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEASLEMSNIIRELNEKKLEVKQLQVE 394
             L    E+ + + S + +   L+      S  +++  E+S I  ELNEK +E+++LQ+E
Sbjct: 229 IRLH---EERKLNESFQEELKLLK------SDKDKSVTELSKIRNELNEKIIEIRRLQME 288

Query: 395 LNRRENMKSDDNVEGLKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMV 454
           LNRREN  +DD +E L+R+I  LEKE + L+  K ELE  LE  + S + +    + E +
Sbjct: 289 LNRRENDSADDTLENLRRVIATLEKENTHLKKEKNELEAALEISKKSLTGKIHPDAAETL 348

Query: 455 NRHLSGSNEKLGLSGISPGKEDMDLSLQKLKKDLKEMQQERDKAVHELSRLKQHLLEKES 514
           +         +  SG  PGK++M+LSLQKL+ DLKE  +ERDKA+ EL+RLKQHLLEKES
Sbjct: 349 D---------IDSSGCFPGKKEMELSLQKLEDDLKETCRERDKALQELTRLKQHLLEKES 408

Query: 515 EESEKMDEDSRIIEELRHDNEYQRGQILHLEKALNQAIATQKELEMYGKNELQKSKEIIE 574
           EESEKMDEDS+IIEEL   NEYQR QI HLEKAL  A+A Q+E++M   NE+QKSKEII+
Sbjct: 409 EESEKMDEDSKIIEELHESNEYQRAQIAHLEKALKLAMANQEEVKMMNNNEIQKSKEIID 468

Query: 575 ELNRKLANYTSIIDSKNVELLNLQTALGQYYAEIEAKEHLESDLAREREGEAKLSQMLKD 634
           +LN+KLAN    ID KNVELLNLQTALGQYYAEIEAKEHLE DLA  RE  AKLS +LKD
Sbjct: 469 DLNQKLANCMRTIDLKNVELLNLQTALGQYYAEIEAKEHLERDLALAREESAKLSGLLKD 528

Query: 635 ANQREDALKKEKEEILSKLSLSERALGEWKSRVNKLEEDNSKLRRALDQSMTRLNRMSVD 694
           A++R + LK+EKEEIL KLS +ER L E K+RVNKLEEDN KLRRAL+QSMTRLNRMS+D
Sbjct: 529 ADERAELLKREKEEILVKLSQTERMLAEGKARVNKLEEDNGKLRRALEQSMTRLNRMSMD 588

Query: 695 SDFLVDRRIVIKLLVTYFQKNHSKEVLDLMVRMLGFSEDDKMRIGAAKQGPSKGVVRGVL 754
           SD+LVDRRIVIKLLVTYFQ+NHSKEVLDLMVRMLGFS++DK RIG A+QG  KGVVRGVL
Sbjct: 589 SDYLVDRRIVIKLLVTYFQRNHSKEVLDLMVRMLGFSDEDKQRIGVAQQGTGKGVVRGVL 648

Query: 755 GLPGRLVGGILGGSSAETPANMASDNQSFADLWVDFLLKENEEREKREAKESLKLQEESQ 814
           GLPGRLVGGILGGSS +  ANMASDNQS ADLWVDFLLKE EEREKRE+ E     +E+ 
Sbjct: 649 GLPGRLVGGILGGSSTDVHANMASDNQSIADLWVDFLLKETEEREKRESAEDASRSKEN- 708

Query: 815 LNGPNVGSID-----PGTRATGSTSESSRTGF-PSHHHHHHHHQSTHLPFGGDFRLSRHH 874
           L+G +  +       P  R T + S  SR+ F PS +       S  +P  G+FR    H
Sbjct: 709 LHGRSPDATGTSPSVPNQRTTTAGSGFSRSSFSPSQN-------SGPVPPQGNFR-QFEH 758

Query: 875 SESEFSTVPLTSTTENT 885
           S+SEFSTVPLTS+  ++
Sbjct: 769 SDSEFSTVPLTSSESSS 758

BLAST of Cp4.1LG12g03990 vs. NCBI nr
Match: gi|703085954|ref|XP_010092877.1| (hypothetical protein L484_022472 [Morus notabilis])

HSP 1 Score: 527.7 bits (1358), Expect = 4.4e-146
Identity = 352/639 (55.09%), Positives = 448/639 (70.11%), Query Frame = 1

Query: 255  QKLNKKFQEEL-NSLHMNKDKASLEMSNIIR-ELNEKKLEVKQLQVELNRRE-------N 314
            Q  N+ F +E+ N +   +D  S  +++ ++ +    K+E K    +   RE       N
Sbjct: 480  QAKNRYFGKEIHNGVVSKQDGMSNGITHAVQHDAIHSKVESKYSNFQGKEREYADSLETN 539

Query: 315  MKSDDNVEG---LKRLITKLEKEKSTLEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRH 374
             +S   V+G   +++L  +LEKE+  L   + +LE   +K  +S   E +S   E     
Sbjct: 540  NRSSAAVQGTGEIRQLRMELEKERDLLRNIQLKLEGE-QKLNSSLREELKSLKTE----- 599

Query: 375  LSGSNEASLEMSNIIRELNEKKLEVKQLQVELNRRENMKSDDNVEGLKRLITKLEKEKST 434
                ++ S +MS I  ELNEK   V++LQ+EL+RRE+ + DD VE LK+ I  LE+E ++
Sbjct: 600  ---KDKTSTDMSKIHAELNEKISAVRRLQMELSRRED-EGDDIVENLKKSIASLERENAS 659

Query: 435  LEMGKKELEDTLEKCRTSSSVEAQSSSLEMVNRHLSGSNEKLGLSGISPGKEDMDLSLQK 494
            L+M K EL+  +++  T    +  S   E V +H +  NEK+  S   PG+E+M+LSLQK
Sbjct: 660  LKMEKNELKAAMDRIGTD---KKSSVVAETVTKHPNNLNEKVEPSASFPGREEMELSLQK 719

Query: 495  LKKDLKEMQQERDKAVHELSRLKQHLLEKESEESEKMDEDSRIIEELRHDNEYQRGQILH 554
            L K++KE Q ERDKA+ EL+RLKQHLLEKESEESEKMDEDS+IIEELR  NE QR QIL+
Sbjct: 720  LDKEIKETQHERDKALQELTRLKQHLLEKESEESEKMDEDSKIIEELRETNERQRTQILY 779

Query: 555  LEKALNQAIATQKELEMYGKNELQKSKEIIEELNRKLANYTSIIDSKNVELLNLQTALGQ 614
            LEKAL QA+A Q+E++M G NE+QK KE+I +LN++LAN T+ ID+KNVELLNLQTALGQ
Sbjct: 780  LEKALKQAVANQEEVKMIGNNEVQKLKEVIGDLNKRLANSTNTIDAKNVELLNLQTALGQ 839

Query: 615  YYAEIEAKEHLESDLAREREGEAKLSQMLKDANQREDALKKEKEEILSKLSLSERALGEW 674
            YYAEIEAKEHLE DLAR RE  +KLS++LK+A+ + D LKKEKEEIL KL  +ER   +W
Sbjct: 840  YYAEIEAKEHLEGDLARAREESSKLSELLKNADYQADVLKKEKEEILFKLLQAERTATDW 899

Query: 675  KSRVNKLEEDNSKLRRALDQSMTRLNRMSVDSDFLVDRRIVIKLLVTYFQKNHSKEVLDL 734
            KSRVNKLEEDN+KLRRAL+QSMTRLNRMS+DSD+LVDRRIVIKLLVTYFQ+NH+KEVLDL
Sbjct: 900  KSRVNKLEEDNAKLRRALEQSMTRLNRMSMDSDYLVDRRIVIKLLVTYFQRNHNKEVLDL 959

Query: 735  MVRMLGFSEDDKMRIGAA-KQGPSKGVVRGVLGLPGRLVGGILGGSSAETPANMASDNQS 794
            MVRMLGFSE+DK RIG A +QG  KGVVRGVLGLPGRLVGGILGGSS + PAN A DNQS
Sbjct: 960  MVRMLGFSEEDKQRIGVAQQQGAGKGVVRGVLGLPGRLVGGILGGSSGQLPANAAMDNQS 1019

Query: 795  FADLWVDFLLKENEEREKREAKESLKLQEESQLNGPNVGSIDPGTRATGSTSESSRTGF- 854
            FADLWVDFLLKE EERE+REA ++     +     PN+ +  P      ++S  SRT   
Sbjct: 1020 FADLWVDFLLKEGEERERREAMDASGKDMDELHKTPNIANAAPPLADPKTSSGLSRTTLS 1079

Query: 855  PSHHHHHHHHQSTHLPFGGDFRLSRHHSESEFSTVPLTS 880
            PS +       S+  PF G+   S  HS+SEFSTVPLTS
Sbjct: 1080 PSQN-------SSPFPFRGNVGQS-DHSDSEFSTVPLTS 1097
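
The identity and positives percentages in the HSP summary for the Morus notabilis match are the match counts divided by the aligned length (639 positions). The following is a minimal Python sketch that recomputes them, assuming the summary-line layout shown in that HSP header; the function name `hsp_percentages` is hypothetical and not part of any BLAST tool.

```python
import re

# HSP summary line for the Morus notabilis match, as reported in the alignment header.
hsp_line = "Identity = 352/639 (55.09%), Positives = 448/639 (70.11%), Query Frame = 1"


def hsp_percentages(line):
    """Return a list of (label, matches, aligned_length, percent) for each 'x/y' field.

    Fields without a 'count/length' value (such as 'Query Frame = 1') are skipped.
    """
    results = []
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        results.append((label, matches, length, round(100 * matches / length, 2)))
    return results


stats = hsp_percentages(hsp_line)
for label, matches, length, percent in stats:
    print(f"{label}: {matches}/{length} = {percent}%")
```

Both recomputed values agree with the percentages printed in the summary line, so the reported figures are consistent with the raw counts.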

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GOGC4_ARATH7.3e-12556.54Golgin candidate 4 OS=Arabidopsis thaliana GN=GC4 PE=2 SV=1[more]
GOGC3_ARATH1.4e-12350.08Golgin candidate 3 OS=Arabidopsis thaliana GN=GC3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0K888_CUCSA2.8e-22478.47Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006840 PE=4 SV=1[more]
A0A061DSS6_THECC6.2e-14756.40GRIP-related ARF-binding domain-containing protein 1 isoform 1 OS=Theobroma caca... [more]
W9R007_9ROSA3.1e-14655.09Uncharacterized protein OS=Morus notabilis GN=L484_022472 PE=4 SV=1[more]
B9RDZ2_RICCO9.9e-14558.28Structural maintenance of chromosome 1 protein, putative OS=Ricinus communis GN=... [more]
A0A0D2P6S2_GOSRA1.6e-14255.09Uncharacterized protein OS=Gossypium raimondii GN=B456_007G104100 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G46180.14.1e-12656.54 golgin candidate 4[more]
AT3G61570.17.8e-12550.08 GRIP-related ARF-binding domain-containing protein 1[more]
Match NameE-valueIdentityDescription
gi|659116612|ref|XP_008458163.1|9.3e-22979.80PREDICTED: golgin candidate 3 isoform X2 [Cucumis melo][more]
gi|659116610|ref|XP_008458162.1|9.3e-22979.80PREDICTED: golgin candidate 4 isoform X1 [Cucumis melo][more]
gi|449441372|ref|XP_004138456.1|4.0e-22478.47PREDICTED: golgin candidate 4 [Cucumis sativus][more]
gi|590721690|ref|XP_007051687.1|8.9e-14756.40GRIP-related ARF-binding domain-containing protein 1 isoform 1 [Theobroma cacao][more]
gi|703085954|ref|XP_010092877.1|4.4e-14655.09hypothetical protein L484_022472 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g03990.1  Cp4.1LG12g03990.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term  Source Description            Alignment
None      No IPR available  unknown  Coil         Coil                          coord: 98..135   score: -
                                                                                coord: 565..592  score: -
                                                                                coord: 909..930  score: -
                                                                                coord: 791..812  score: -
                                                                                coord: 373..436  score: -
                                                                                coord: 624..686  score: -
                                                                                coord: 473..514  score: -
                                                                                coord: 223..342  scor
None      No IPR available  PANTHER  PTHR18921    MYOSIN HEAVY CHAIN - RELATED  coord: 490..880  score: 1.0E-187
                                                                                coord: 369..437  score: 1.0E-187
                                                                                coord: 27..338   score: 1.0E