CsaV3_5G036430.1 (mRNA) Cucumber (Chinese Long) v3

NameCsaV3_5G036430.1
TypemRNA
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionNuclear receptor corepressor 1
Locationchr5 : 28723723 .. 28732080 (+)
Sequence length4950
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TATAGAAGGAAAATAATTTAATTAAAAAGAAATTGATTTGTTTAAAAAAATAATTTTCCCGGACTTGAAGGGTGATTGAAATGGAGTTTAACAATCGAGTTGAGATCCGGATGCCAATCTCTCTCTCTTTCTCTCTTTTTGCTCAGGGATCTATTTTGTTCCTTATCGTCTTGCTTGCGCGTTCTCTTTTGAAGCAGATTTTCCACCAGCCAAGAAAGAGAGATAAAAAGAGTCACTAAGAAGATTGTAAACCCTAGAAAGTTGAAGAAGGAAGAAGGAAGAAGGAAGAAGGAAGAAGGAAGAAGGAAGAAGGAAGAAGAAGGGCTCTCTCTTATTGTTTTATTAATCTGATTTCAAGCCCTAGAATTTATTTGGCGCGCCGAGGGGTTGTGTGTCTAAAGGATTTAAAGAAGGGTATATTGGTTGTGCAAGGGTTATTTAGCTGGGTGCTCAATTTTTGTTTTGTTTGAGCTAGCTTTTGTTTGTTGCACATCATAGTTCTGGTCTTTGAGGTGGATTTTTGGTTCGTGAAACTGATTGCCGAGTGTTTGGAATTGGAATTGGAATTGGCCATGGTCGTGGTGGGGGCGAGGTGCTAGGGTTTTTATACTGGTTTTTGTGTGTGTGTTTGTGGGGCACCCGTTGTTCGTTGATGCTCATGCAATTTTCTTGCTTTTCATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTTATCATGGATCTCGCGAGTTTAATCGGTGGGGTTCTGCTGACTTACGAAGGCCTACTGGTGAGTTTTGATGTTACTTTACTGTTTCCTCCGTTTCAACTTCTTAGATCCTTGGGCGTCTTAGCTGGTTTATTACAATCGCATTTATTTTTAATAATCTTGAGTTTTTGGCCCGGCTTTCTTTGAAGCTTTGGATGATAAAAGGATTTCGGGTTTCATTCCTGCTGAGATTTTGGGCCGTCCTCAAAACCTGCCTGCATACTTTTTGTCTTCTGATATTGATACTCTTTGCTGTCCTTTTTTCCTGAAGGTCATGGTAAGCAGGGTGGTTGGCATCAGTTTTCTGAAGACTCTAGTCACGGTTATGGACCTTCTCGGTCATTCAGTGACAGGGTGATAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGTGATGGAAAGTATATTAGAATCGGTAGAGAAAGTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCAAGAGATGCTAACAATGGATTTGGGAACCCTTCACGAAGAACGTCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCGTTTCATGGGTTAGAAAATGGCCCGAGGTCTGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCGCTTAAGTGGTCCAGATCTGGGAGTTTGTCTTCTCGGGGATCTGCTTACAGCAGTTCAACAAACTCGAAGAATGAAAAGGCCGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCTGAAGCTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAACCAAGGCTTGGATGGGGTGATGGACTGGCCAAGTACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGTCTGAGAAAAGAAGTGGCTCTTCTTTCAAGCGGTAGTGGTGAATTAACTCATTCGCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTACCCTTCTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCTTGTAGTTCATCATCAGGTAAACTTTTTCCAAAAAAAGAAATAGACGGGATATTTTAGTGTCTTGAAGTTCTTTTTGGTTGTAATTAGTTTAATCTTTGTGATAGTCATGACACAATCGTGTGCTTAACCTATTCCTACAGCATTTCATTAATTTCTTAACATTGTTCTAGGCTCAAGTTGGTTTAGTCAATTTTAAGATTTAAGAATGAGAAAGGGATGGTAGTCTACTTAGAATTTTTTGCATGGCTCGTGTGTGATGTAGGCTGTACGCACTATGGCATGTGCTGTAAATTGCAACCAAACTGCTTTAGTTGAATCTAGACCAAGCAAGCTTTATTGAGTATAAACTTTGAATTGAAGGATAAGCAGTTTGAATTAGCTGATGGAGATTGGGTTCTATGCACTTGTTTTCTGATAAAATTTCATGCATTAAGCTTTCATTGTTTTCTCTAATCTTGGACTTCTTATTTCTATTCCTTCAACTCATGACTCTCACTATTAAACTATGAAGAATTGGTTTCTAGTAAGATTAAGGGGTAGCATCTTTCTGATTGACATGTAGCCATTTATTTTTTTTTTCTGTGTTTTAATTTCTTTTTCTTTTTTGAATTGAATGCAATAGGATTGGAAGATAAACCATTTAGTAAGGGAGCAGGTGCTGATGGCATGATATGTAGTTCACCTGGGTCTGGTTCACAAAATCTTCAGAAGTTATTGTGCAGTATAGAGAAGATGGAGATTAGTTCGGTTGCTAATTTAGGGTCATCACTTGTTGAATTATTTCATTCTGATGATCCAAATACAATAGAATCATGTTTTGGGAAATCTACATTGAATAAGCTGCTAGCGTATAAAGGTGAAATTTCAAAGACATTGGAGATGACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAGTCTTTGAAATCTGTAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGCCACACGTGTGATGGAGAGTTCAACATATTTCAAAGAACAAGATGGTATCTCTTGCATTGCCACTCGTCCTGCTCCGTTGGTGGTAGTTTCTTCTTCTGATGCAACAGTTGAGAAGGTGCCACTCTGCAAGGGTGACGTGGGAGTAGAAGATGTTGATACAAAGGCTGACGAAATTGATAGTCCTGGAACTGTGACATCAAAATTTAATGAGCCGTCCAGAGTAGTAAAGGCGATTGCTTCTGATATTGTTGATAATGGTCATTGCTCTGTAGTTACAGATGCGATTGTTCCTGGAAAGATGGAAGGGAGTTTTCCAATATCAGGGCCTTTTGTGGATGAACACGAAACGATTGGCTCTGGCAATGAATGCACTCTTGCCAAGAGTTGTACCAGTGAATCTGTTTATGGTGATTTGATGGCCCAAGCTGGTAGTAGATCATCTCTTTGTGATTCAATTTTTGCATGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAGATCACCAGTGGGAATGTGCAAGATCAGCAGCAAAAGCACCAAAAATGTGTCCTGTTCAGAGACTGAGAAACTTATTAAAGAGAAATTTGTAATGAGGAAGAAGTTTTTAAAATTTAAGGAGAGTGCATTAACCCTTAGATTTAAATCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGTAAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGACACATTCTGGCCATCAGAAGTACAGGTCGTCTTCAATTCGCTCCCGTTTGGTTCAGCAAGGTATGATTACTTTCATCCCTTCAGCATTTTTTTCCGTTTCCACTCCAGACTTGCAAATAATGTTTTGAAGTTGAACATTCTTCGAATGTTATATTGTCTATTTTCTACTTTAAAAACCATTTGGTTTATTTGTGTTGTTGACAAACCTATGTGTGTGTGTGTGTTTGTTAAAAAATGAGTTAGAATATTAGCAATGAAATTCTAGGGATTTTTCTCATCTGCAATGAAGTATTGTGTTTTATTGTTCTATTGGAACTTCAGTTTCCAATTAATGAAAGCATACCTGTCTGTACACTTCTTATTGTGGTTGACAAATAATGGGAGTTATCATGTTTTTATACTTCCTTATTTTTGATGTTGTTGATTGGTTGATTTTAATTGGTGGATTGTGTAACTGGACTTGTTATGATATATTGAGTTGAGTTCGTTGAGAATTGTGGTAGGCCGTTTTTTCGTCTTCTTTTTTTAACAGGACATTTTAGTGTCTCTGATCAAGGTTATGTATGAACTAAGAAAGCTACAAAAGATTTAGTGGATGTGTTCTATATGCCCCTCTTGTAGTCAGAGATCTTTGATAATTCATCTAGAAAGAGAGACATGCATCTTTTATCGTTTATTTCGTACATTTTTTTTTGACATCTACAAAGTTGTTTGTTGGCCTTAAGATTCAAACCTTTGTTATAGTCTGTGATGTGAATATTTTTCTCTTCTCCCTTTGAGATCTGCTTAGCAGCTGACTATCTCATTCCAGAAGTGCCTTGATTGTAATTTGTTGGATAGTTGAACAAAAGGTTTCTATAAAATTTCCTCTGACAATCATTACGTGTCATTATGATTTAATGAATGCTTAAATTTTGATTTTTCAGGAGCGTGTCAGAGCTCTACATTTAACACAGAAATTGCTGTTCGTCACTCCAGCAAGCTGCTGTTGAACCCACAAATTAAGCTTTACAGGAATACCTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATTGCATTGAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACTTGATAAACCCCTGGACTTCAGCTGAGAAAGAAATATTCTGGGAGAAACTATCTCTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACTACAGCAGACTGTATCCAGTTCTATTACAAAAACCATAAATCTGATAGTTTTAAAAAGAACAAAAATTTGGAGTTGGGCAAGCAAATGAAATCTTCTGCCATCACGTATTTGGTAACATCAGGGAAAAAATGGAATCCGGACGCGAATGCCACTTCCCTTGATATTTTAGGTGTTGCTTCAGTAATGGCAGCACAAGCAGACTATGACATTGAAAACCAGCAGAAATGTACTCGCCATTTGGGTGTAGGAAGGGATGTTGAGTCAAAAGTATCATGGAGTGCTAGCTCTCCAAATAAAAGCAATTTGGATGATCTTCAAACTGAAAAAGAAACAGTTGCTGCTGATGTACTCGCTGGCATAAGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCCATTGATCCCCGTGAGGAACTAAGGGAGCGGAAGTGCTATAGAGTGGATTTTGCAGCGAAATTGCCTTCATTGTCTGACGTCATGCAGAAAACTGATAATGAACCTTGTTCGGATGATAGTTCTGAGGATGTGGATTCTTCAAATTGGACAGATGAGGAGAAGTTGGTCTTCATGCAGGCTGTGTCATCTTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGATCAAAGTCTAGGGACCAGTGCAAGATTTTCTTCAGCAAAGCTCGGAAATGTCTTGGACTGGATTTGATGCATACTTCTGGAGACGTAGGTGAAACACCTGGGAATGGTAACGATGCTAGTGGGAGTGGGAGTGGGACTGACACTGAAGAACACTGTGTGGTGGAAATCTGTGAAGGCCGTGGCAGTGATGAATTTATCTCCAAGTCAATCAATGGTGGATCAACATCGGTTAACATTAATCATGAAGAAACTGTTTCTGCTGTGACTGACAACATGCGGACTAGTATGGAATTTGAGGAAAGTACAGCATTGCAACAGTCGGATGAGAAAGGTGCTGAGGCTGTTGGGAACTTGATTTTTGAGACATTGAAGGAAGAGGATGTGCCTAATCCGAGTCAGCCCACGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACTGAAAGTGGAAAGAGCTGTAATGAACCTGACATTCTGAGATCTGAATCCGTGTCCACAGTAGATGAAAATTCAGCAGCTGTGAGTGAGGGCAGAGCTACTGTGAAGCTTGCGATTGGAGAAGAAGTAGGAAGCGACACTAATTTACACGGTCAGAGTACAATTCTGTGCTCAGGTCAGGATTCAACTGGGAATGATTCCAATATTGCTTTAGAGGGCAGTTCTGTAGGACTTGATCCACATATCTTGCATCCAAATATTCTTAAAGTGGAACCTGTAGAGAAGAAGTCTTGTATCAAGTCTGAGGAGAATTTTCTTTCTGTCAGAAATTCTGATACTGGTGTCATTGGAAGGGAACAGATGCTTAACCAAGACATATTGTCACCAACACTTGTTTTGCAGGAGATTAGTGATGCGAATCAAAAACCTATGAATAGAGATGACGATGCTGAGCATCCAAATAATTTGTTGTGCAATAGTGAATCATCCACGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATAAATAGAAATATCAATCACGCGTATTTTCGTGTTCAAGGGCTGTCAAAGCCAGACATTAATTGTAATAGTAAATATGTTTCTGAGGGGCAATTTCTTCAGAATTGTAACAGTTCCAAGCCGCACAATCTGGCTGAGCCTCCTTTTCTGTCTCAGAATATAGAATTGGGCCATGATCATCAGAAGAATGCTTCTGGCAGTGGCAGTGCTTCAGATTCCGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCCAGCAAAATTCGAGCTCTGGTTCCAACGAGTGTGGAGAGAAAAAGGGACCACTTCACAACTCGAGCAGCAAATCATGCGACATGGGAGAAAATATTCCGTTAAGGAGTTATGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCGGCTTTGCCTGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCGTTCAGTGGCTACTCTGCTACATCTGTTAAAACTGAACAGCAGCCCTTGCAGGCACTTTCAAATAATGGTGATCAAAGTCTTAATGAACTAGTGTCCGCTTTTCCCACCAAGGATGGAGTGGTTGATTATCATTCATATAGAAGCCGGGATGGTGTTAAAATGCGACCATTCCCAGTTGATATATTTTCTGAGATGCACCGAAGAAATGGCTTCGATGCTGTATCCTTGTCGAGTTTACAGCAGCAGGGAAGAGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATACTCATGGGTGGTTCTTGCACTGGTGTTTCAGATCCTGTGGCAGCCATTAAAATGCACTACGCCAAGGCCGACCAGTACGCTGGGCAACCTGCTAGCATGTTCACCAGGGAAGATGGTAGTTGGGGAGGTGGTGGTAATGGTGGAGATTTAGGCAGCAGATAGTAGATGTGCATCGGGGGCCTCTTCCTGGTTAGGGATGGTCCTCGCCTGTATCATATTTAGTCTGTTTCAAGTTTTCTTTTTTCCTTTTTTGAAAAAGAAGTGATGTAGGAGAGTAGAAGTAATTTGAACCAATGGGTTCTGAAAAATCCATCTTTTTTGGTTGAAGAAAAAGGAAAGAGATTTTAGGGGATTGTAATATTGTAGCAGTCTTTTGTATTTGTTGTATTTGATTGCAACACAGAAATAATCTTCTCAATATGGGTGCGGCCATAGTCACGGCTCCGCTCAGCTCAGCTCCATCCTTCAACTTTGATAACCTAACCTACCTGACTCGTTACATTATCCAACTGGAGGTTAGTAGTTCCTTTTTTTTTGTTTTCTTTTCTTTTTGACATTTATTCCCCCTTATACTCTGTTGTATGATTAGGAGTTAGGAGTCTTTTGTAGAATTTTGGTTATATAATATAGTCATCAAAATTTGTATTGTATCATATTTCCCTTTGTTTTTTGTATTTGGGTTTCTGACTAATTTGAAGAATAAAGTCTTCGGAAATGGGACAAATTATTGACTGTTGCCTTTCACTTTCACTGTTTTAACCCATGAATGTATGAGAAAAGATTAACTCTTTTAATCATATATTTAACAGTTGATTGTGATGGTTTACTTTCAATAATTTCGTCAAATAATTGATTACTTGAATGTGACTCGTGGGGTCCTAACATCTTCACAAATATTCTCTATGCATATTTACACATTTCAGCAATCTTTGTTATCATTCCATAATTTTAGTTGAATCGTCTCTTGAAGAACTACTTCAACCTTGATGATGTTCGCAAACTTCTTTATTCTTTATTATAAACAAAAAAAGGTGAGATCTTCCACTGTTTTTGTAAATGTTTCATAATAATTAAAGTAAATGCAAATCAAGTCAACCAATATTTAATTTAATCATCAATTGGATTTTAGGGTAGGCCAGGAAATTGTCGAGCAAACGAACATGGCATTAT

mRNA sequence

ATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTTATCATGGATCTCGCGAGTTTAATCGGTGGGGTTCTGCTGACTTACGAAGGCCTACTGGTCATGGTAAGCAGGGTGGTTGGCATCAGTTTTCTGAAGACTCTAGTCACGGTTATGGACCTTCTCGGTCATTCAGTGACAGGGTGATAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGTGATGGAAAGTATATTAGAATCGGTAGAGAAAGTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCAAGAGATGCTAACAATGGATTTGGGAACCCTTCACGAAGAACGTCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCGTTTCATGGGTTAGAAAATGGCCCGAGGTCTGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCGCTTAAGTGGTCCAGATCTGGGAGTTTGTCTTCTCGGGGATCTGCTTACAGCAGTTCAACAAACTCGAAGAATGAAAAGGCCGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCTGAAGCTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAACCAAGGCTTGGATGGGGTGATGGACTGGCCAAGTACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGTCTGAGAAAAGAAGTGGCTCTTCTTTCAAGCGGTAGTGGTGAATTAACTCATTCGCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTACCCTTCTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCTTGTAGTTCATCATCAGGATTGGAAGATAAACCATTTAGTAAGGGAGCAGGTGCTGATGGCATGATATGTAGTTCACCTGGGTCTGGTTCACAAAATCTTCAGAAGTTATTGTGCAGTATAGAGAAGATGGAGATTAGTTCGGTTGCTAATTTAGGGTCATCACTTGTTGAATTATTTCATTCTGATGATCCAAATACAATAGAATCATGTTTTGGGAAATCTACATTGAATAAGCTGCTAGCGTATAAAGGTGAAATTTCAAAGACATTGGAGATGACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAGTCTTTGAAATCTGTAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGCCACACGTGTGATGGAGAGTTCAACATATTTCAAAGAACAAGATGGTATCTCTTGCATTGCCACTCGTCCTGCTCCGTTGGTGGTAGTTTCTTCTTCTGATGCAACAGTTGAGAAGGTGCCACTCTGCAAGGGTGACGTGGGAGTAGAAGATGTTGATACAAAGGCTGACGAAATTGATAGTCCTGGAACTGTGACATCAAAATTTAATGAGCCGTCCAGAGTAGTAAAGGCGATTGCTTCTGATATTGTTGATAATGGTCATTGCTCTGTAGTTACAGATGCGATTGTTCCTGGAAAGATGGAAGGGAGTTTTCCAATATCAGGGCCTTTTGTGGATGAACACGAAACGATTGGCTCTGGCAATGAATGCACTCTTGCCAAGAGTTGTACCAGTGAATCTGTTTATGGTGATTTGATGGCCCAAGCTGGTAGTAGATCATCTCTTTGTGATTCAATTTTTGCATGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAGATCACCAGTGGGAATGTGCAAGATCAGCAGCAAAAGCACCAAAAATGTGTCCTGTTCAGAGACTGAGAAACTTATTAAAGAGAAATTTGTAATGAGGAAGAAGTTTTTAAAATTTAAGGAGAGTGCATTAACCCTTAGATTTAAATCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGTAAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGACACATTCTGGCCATCAGAAGTACAGGTCGTCTTCAATTCGCTCCCGTTTGGTTCAGCAAGGAGCGTGTCAGAGCTCTACATTTAACACAGAAATTGCTGTTCGTCACTCCAGCAAGCTGCTGTTGAACCCACAAATTAAGCTTTACAGGAATACCTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATTGCATTGAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACTTGATAAACCCCTGGACTTCAGCTGAGAAAGAAATATTCTGGGAGAAACTATCTCTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACTACAGCAGACTGTATCCAGTTCTATTACAAAAACCATAAATCTGATAGTTTTAAAAAGAACAAAAATTTGGAGTTGGGCAAGCAAATGAAATCTTCTGCCATCACGTATTTGGTAACATCAGGGAAAAAATGGAATCCGGACGCGAATGCCACTTCCCTTGATATTTTAGGTGTTGCTTCAGTAATGGCAGCACAAGCAGACTATGACATTGAAAACCAGCAGAAATGTACTCGCCATTTGGGTGTAGGAAGGGATGTTGAGTCAAAAGTATCATGGAGTGCTAGCTCTCCAAATAAAAGCAATTTGGATGATCTTCAAACTGAAAAAGAAACAGTTGCTGCTGATGTACTCGCTGGCATAAGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCCATTGATCCCCGTGAGGAACTAAGGGAGCGGAAGTGCTATAGAGTGGATTTTGCAGCGAAATTGCCTTCATTGTCTGACGTCATGCAGAAAACTGATAATGAACCTTGTTCGGATGATAGTTCTGAGGATGTGGATTCTTCAAATTGGACAGATGAGGAGAAGTTGGTCTTCATGCAGGCTGTGTCATCTTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGATCAAAGTCTAGGGACCAGTGCAAGATTTTCTTCAGCAAAGCTCGGAAATGTCTTGGACTGGATTTGATGCATACTTCTGGAGACGTAGGTGAAACACCTGGGAATGGTAACGATGCTAGTGGGAGTGGGAGTGGGACTGACACTGAAGAACACTGTGTGGTGGAAATCTGTGAAGGCCGTGGCAGTGATGAATTTATCTCCAAGTCAATCAATGGTGGATCAACATCGGTTAACATTAATCATGAAGAAACTGTTTCTGCTGTGACTGACAACATGCGGACTAGTATGGAATTTGAGGAAAGTACAGCATTGCAACAGTCGGATGAGAAAGGTGCTGAGGCTGTTGGGAACTTGATTTTTGAGACATTGAAGGAAGAGGATGTGCCTAATCCGAGTCAGCCCACGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACTGAAAGTGGAAAGAGCTGTAATGAACCTGACATTCTGAGATCTGAATCCGTGTCCACAGTAGATGAAAATTCAGCAGCTGTGAGTGAGGGCAGAGCTACTGTGAAGCTTGCGATTGGAGAAGAAGTAGGAAGCGACACTAATTTACACGGTCAGAGTACAATTCTGTGCTCAGGTCAGGATTCAACTGGGAATGATTCCAATATTGCTTTAGAGGGCAGTTCTGTAGGACTTGATCCACATATCTTGCATCCAAATATTCTTAAAGTGGAACCTGTAGAGAAGAAGTCTTGTATCAAGTCTGAGGAGAATTTTCTTTCTGTCAGAAATTCTGATACTGGTGTCATTGGAAGGGAACAGATGCTTAACCAAGACATATTGTCACCAACACTTGTTTTGCAGGAGATTAGTGATGCGAATCAAAAACCTATGAATAGAGATGACGATGCTGAGCATCCAAATAATTTGTTGTGCAATAGTGAATCATCCACGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATAAATAGAAATATCAATCACGCGTATTTTCGTGTTCAAGGGCTGTCAAAGCCAGACATTAATTGTAATAGTAAATATGTTTCTGAGGGGCAATTTCTTCAGAATTGTAACAGTTCCAAGCCGCACAATCTGGCTGAGCCTCCTTTTCTGTCTCAGAATATAGAATTGGGCCATGATCATCAGAAGAATGCTTCTGGCAGTGGCAGTGCTTCAGATTCCGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCCAGCAAAATTCGAGCTCTGGTTCCAACGAGTGTGGAGAGAAAAAGGGACCACTTCACAACTCGAGCAGCAAATCATGCGACATGGGAGAAAATATTCCGTTAAGGAGTTATGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCGGCTTTGCCTGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCGTTCAGTGGCTACTCTGCTACATCTGTTAAAACTGAACAGCAGCCCTTGCAGGCACTTTCAAATAATGGTGATCAAAGTCTTAATGAACTAGTGTCCGCTTTTCCCACCAAGGATGGAGTGGTTGATTATCATTCATATAGAAGCCGGGATGGTGTTAAAATGCGACCATTCCCAGTTGATATATTTTCTGAGATGCACCGAAGAAATGGCTTCGATGCTGTATCCTTGTCGAGTTTACAGCAGCAGGGAAGAGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATACTCATGGGTGGTTCTTGCACTGGTGTTTCAGATCCTGTGGCAGCCATTAAAATGCACTACGCCAAGGCCGACCAGTACGCTGGGCAACCTGCTAGCATGTTCACCAGGGAAGATGGTAGTTGGGGAGGTGGTGGTAATGGTGGAGATTTAGGCAGCAGATAG

Coding sequence (CDS)

ATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTTATCATGGATCTCGCGAGTTTAATCGGTGGGGTTCTGCTGACTTACGAAGGCCTACTGGTCATGGTAAGCAGGGTGGTTGGCATCAGTTTTCTGAAGACTCTAGTCACGGTTATGGACCTTCTCGGTCATTCAGTGACAGGGTGATAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGTGATGGAAAGTATATTAGAATCGGTAGAGAAAGTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCAAGAGATGCTAACAATGGATTTGGGAACCCTTCACGAAGAACGTCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCGTTTCATGGGTTAGAAAATGGCCCGAGGTCTGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCGCTTAAGTGGTCCAGATCTGGGAGTTTGTCTTCTCGGGGATCTGCTTACAGCAGTTCAACAAACTCGAAGAATGAAAAGGCCGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCTGAAGCTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAACCAAGGCTTGGATGGGGTGATGGACTGGCCAAGTACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGTCTGAGAAAAGAAGTGGCTCTTCTTTCAAGCGGTAGTGGTGAATTAACTCATTCGCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTACCCTTCTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCTTGTAGTTCATCATCAGGATTGGAAGATAAACCATTTAGTAAGGGAGCAGGTGCTGATGGCATGATATGTAGTTCACCTGGGTCTGGTTCACAAAATCTTCAGAAGTTATTGTGCAGTATAGAGAAGATGGAGATTAGTTCGGTTGCTAATTTAGGGTCATCACTTGTTGAATTATTTCATTCTGATGATCCAAATACAATAGAATCATGTTTTGGGAAATCTACATTGAATAAGCTGCTAGCGTATAAAGGTGAAATTTCAAAGACATTGGAGATGACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAGTCTTTGAAATCTGTAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGCCACACGTGTGATGGAGAGTTCAACATATTTCAAAGAACAAGATGGTATCTCTTGCATTGCCACTCGTCCTGCTCCGTTGGTGGTAGTTTCTTCTTCTGATGCAACAGTTGAGAAGGTGCCACTCTGCAAGGGTGACGTGGGAGTAGAAGATGTTGATACAAAGGCTGACGAAATTGATAGTCCTGGAACTGTGACATCAAAATTTAATGAGCCGTCCAGAGTAGTAAAGGCGATTGCTTCTGATATTGTTGATAATGGTCATTGCTCTGTAGTTACAGATGCGATTGTTCCTGGAAAGATGGAAGGGAGTTTTCCAATATCAGGGCCTTTTGTGGATGAACACGAAACGATTGGCTCTGGCAATGAATGCACTCTTGCCAAGAGTTGTACCAGTGAATCTGTTTATGGTGATTTGATGGCCCAAGCTGGTAGTAGATCATCTCTTTGTGATTCAATTTTTGCATGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAGATCACCAGTGGGAATGTGCAAGATCAGCAGCAAAAGCACCAAAAATGTGTCCTGTTCAGAGACTGAGAAACTTATTAAAGAGAAATTTGTAATGAGGAAGAAGTTTTTAAAATTTAAGGAGAGTGCATTAACCCTTAGATTTAAATCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGTAAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGACACATTCTGGCCATCAGAAGTACAGGTCGTCTTCAATTCGCTCCCGTTTGGTTCAGCAAGGAGCGTGTCAGAGCTCTACATTTAACACAGAAATTGCTGTTCGTCACTCCAGCAAGCTGCTGTTGAACCCACAAATTAAGCTTTACAGGAATACCTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATTGCATTGAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACTTGATAAACCCCTGGACTTCAGCTGAGAAAGAAATATTCTGGGAGAAACTATCTCTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACTACAGCAGACTGTATCCAGTTCTATTACAAAAACCATAAATCTGATAGTTTTAAAAAGAACAAAAATTTGGAGTTGGGCAAGCAAATGAAATCTTCTGCCATCACGTATTTGGTAACATCAGGGAAAAAATGGAATCCGGACGCGAATGCCACTTCCCTTGATATTTTAGGTGTTGCTTCAGTAATGGCAGCACAAGCAGACTATGACATTGAAAACCAGCAGAAATGTACTCGCCATTTGGGTGTAGGAAGGGATGTTGAGTCAAAAGTATCATGGAGTGCTAGCTCTCCAAATAAAAGCAATTTGGATGATCTTCAAACTGAAAAAGAAACAGTTGCTGCTGATGTACTCGCTGGCATAAGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCCATTGATCCCCGTGAGGAACTAAGGGAGCGGAAGTGCTATAGAGTGGATTTTGCAGCGAAATTGCCTTCATTGTCTGACGTCATGCAGAAAACTGATAATGAACCTTGTTCGGATGATAGTTCTGAGGATGTGGATTCTTCAAATTGGACAGATGAGGAGAAGTTGGTCTTCATGCAGGCTGTGTCATCTTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGATCAAAGTCTAGGGACCAGTGCAAGATTTTCTTCAGCAAAGCTCGGAAATGTCTTGGACTGGATTTGATGCATACTTCTGGAGACGTAGGTGAAACACCTGGGAATGGTAACGATGCTAGTGGGAGTGGGAGTGGGACTGACACTGAAGAACACTGTGTGGTGGAAATCTGTGAAGGCCGTGGCAGTGATGAATTTATCTCCAAGTCAATCAATGGTGGATCAACATCGGTTAACATTAATCATGAAGAAACTGTTTCTGCTGTGACTGACAACATGCGGACTAGTATGGAATTTGAGGAAAGTACAGCATTGCAACAGTCGGATGAGAAAGGTGCTGAGGCTGTTGGGAACTTGATTTTTGAGACATTGAAGGAAGAGGATGTGCCTAATCCGAGTCAGCCCACGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACTGAAAGTGGAAAGAGCTGTAATGAACCTGACATTCTGAGATCTGAATCCGTGTCCACAGTAGATGAAAATTCAGCAGCTGTGAGTGAGGGCAGAGCTACTGTGAAGCTTGCGATTGGAGAAGAAGTAGGAAGCGACACTAATTTACACGGTCAGAGTACAATTCTGTGCTCAGGTCAGGATTCAACTGGGAATGATTCCAATATTGCTTTAGAGGGCAGTTCTGTAGGACTTGATCCACATATCTTGCATCCAAATATTCTTAAAGTGGAACCTGTAGAGAAGAAGTCTTGTATCAAGTCTGAGGAGAATTTTCTTTCTGTCAGAAATTCTGATACTGGTGTCATTGGAAGGGAACAGATGCTTAACCAAGACATATTGTCACCAACACTTGTTTTGCAGGAGATTAGTGATGCGAATCAAAAACCTATGAATAGAGATGACGATGCTGAGCATCCAAATAATTTGTTGTGCAATAGTGAATCATCCACGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATAAATAGAAATATCAATCACGCGTATTTTCGTGTTCAAGGGCTGTCAAAGCCAGACATTAATTGTAATAGTAAATATGTTTCTGAGGGGCAATTTCTTCAGAATTGTAACAGTTCCAAGCCGCACAATCTGGCTGAGCCTCCTTTTCTGTCTCAGAATATAGAATTGGGCCATGATCATCAGAAGAATGCTTCTGGCAGTGGCAGTGCTTCAGATTCCGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCCAGCAAAATTCGAGCTCTGGTTCCAACGAGTGTGGAGAGAAAAAGGGACCACTTCACAACTCGAGCAGCAAATCATGCGACATGGGAGAAAATATTCCGTTAAGGAGTTATGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCGGCTTTGCCTGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCGTTCAGTGGCTACTCTGCTACATCTGTTAAAACTGAACAGCAGCCCTTGCAGGCACTTTCAAATAATGGTGATCAAAGTCTTAATGAACTAGTGTCCGCTTTTCCCACCAAGGATGGAGTGGTTGATTATCATTCATATAGAAGCCGGGATGGTGTTAAAATGCGACCATTCCCAGTTGATATATTTTCTGAGATGCACCGAAGAAATGGCTTCGATGCTGTATCCTTGTCGAGTTTACAGCAGCAGGGAAGAGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATACTCATGGGTGGTTCTTGCACTGGTGTTTCAGATCCTGTGGCAGCCATTAAAATGCACTACGCCAAGGCCGACCAGTACGCTGGGCAACCTGCTAGCATGTTCACCAGGGAAGATGGTAGTTGGGGAGGTGGTGGTAATGGTGGAGATTTAGGCAGCAGATAG

Protein sequence

MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGGWHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSRDANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASSPNKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDASGSGSGTDTEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLKEEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLAIGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSKPHNLAEPPFLSQNIELGHDHQKNASGSGSASDSDVPRRKGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGGGGNGGDLGSR
BLAST of CsaV3_5G036430.1 vs. NCBI nr
Match: XP_004142488.1 (PREDICTED: uncharacterized protein LOC101222167 [Cucumis sativus] >KGN52286.1 hypothetical protein Csa_5G623500 [Cucumis sativus])

HSP 1 Score: 3134.4 bits (8125), Expect = 0.0e+00
Identity = 1649/1649 (100.00%), Postives = 1649/1649 (100.00%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180
            DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL
Sbjct: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180

Query: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240
            KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK
Sbjct: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240

Query: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300
            KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC
Sbjct: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300

Query: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360
            ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN
Sbjct: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360

Query: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420
            LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV
Sbjct: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420

Query: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480
            NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG
Sbjct: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480

Query: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540
            VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS
Sbjct: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540

Query: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600
            GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI
Sbjct: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600

Query: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660
            FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG
Sbjct: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660

Query: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720
            LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK
Sbjct: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720

Query: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780
            LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE
Sbjct: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780

Query: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840
            IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY
Sbjct: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840

Query: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900
            LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS
Sbjct: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900

Query: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960
            PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK
Sbjct: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960

Query: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020
            LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR
Sbjct: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020

Query: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080
            DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE
Sbjct: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080

Query: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140
            FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK
Sbjct: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140

Query: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200
            EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA
Sbjct: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200

Query: 1201 IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC 1260
            IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC
Sbjct: 1201 IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC 1260

Query: 1261 IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC 1320
            IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC
Sbjct: 1261 IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC 1320

Query: 1321 NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK 1380
            NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK
Sbjct: 1321 NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK 1380

Query: 1381 PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS 1440
            PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS
Sbjct: 1381 PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS 1440

Query: 1441 GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF 1500
            GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF
Sbjct: 1441 GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF 1500

Query: 1501 SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI 1560
            SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI
Sbjct: 1501 SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI 1560

Query: 1561 FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD 1620
            FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD
Sbjct: 1561 FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD 1620

Query: 1621 QYAGQPASMFTREDGSWGGGGNGGDLGSR 1650
            QYAGQPASMFTREDGSWGGGGNGGDLGSR
Sbjct: 1621 QYAGQPASMFTREDGSWGGGGNGGDLGSR 1649

BLAST of CsaV3_5G036430.1 vs. NCBI nr
Match: XP_008446909.2 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 [Cucumis melo])

HSP 1 Score: 2860.9 bits (7415), Expect = 0.0e+00
Identity = 1515/1653 (91.65%), Postives = 1558/1653 (94.25%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180
            D NNGFGNPSRR SSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL
Sbjct: 121  DTNNGFGNPSRRPSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180

Query: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240
            KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSED ISRK
Sbjct: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTISRK 240

Query: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300
            KPRLGWGDGLAKYEKEKV+VPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC
Sbjct: 241  KPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300

Query: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360
            ASPATPSSFACSSSSGLEDKPFSKGA ADGMICSSPGSGSQNLQKLLCSIEKMEISS+AN
Sbjct: 301  ASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSIAN 360

Query: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420
            LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKS 
Sbjct: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSG 420

Query: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480
            NGGNVS+KKSCSATR++ESSTYFKEQDGISCIA RPAPLVVVSSSDATVEKVPLCKGD+G
Sbjct: 421  NGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGDMG 480

Query: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540
            VEDVDTKADEIDSPGTVTSKFNEPSRVVK   SDIVDNGHCSVVTD IVPGKMEG+FPIS
Sbjct: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFPIS 540

Query: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600
             PFVDE +T GSGNEC LAKSC+SES  GDLMAQAGSRSSLCDSIFACNKEYASRAAEVI
Sbjct: 541  EPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600

Query: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660
            FKRSPVG+CKISSKSTK VSCSETEKLIKEKFV RKKFLKFKESALTLRFK+LQQSWK  
Sbjct: 601  FKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWKM- 660

Query: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720
                 K+   +  KKELSLRVTHSGHQKYR SS RSRL+QQGACQS+TFNTEIAVRHSSK
Sbjct: 661  FAAFCKEMSLKATKKELSLRVTHSGHQKYR-SSFRSRLIQQGACQSTTFNTEIAVRHSSK 720

Query: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780
            LLLNPQIKLYRNTLKMPAMILDKKEK+ALRFISHNGLVEDPCAVEKERNLINPWTSAEKE
Sbjct: 721  LLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780

Query: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840
            IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY
Sbjct: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840

Query: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900
            LVTSGKKWNPDANATSLDILGVASVMAAQA+YDI NQQKC+RHLG G+DVESKVSWSAS+
Sbjct: 841  LVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAST 900

Query: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960
            PNKSNLDDLQTEKETVAADVL  XXXXXXXXXXXXXXXXX DPREELRE+KCY+VD AAK
Sbjct: 901  PNKSNLDDLQTEKETVAADVLAGXXXXXXXXXXXXXXXXXIDPREELREQKCYKVDSAAK 960

Query: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020
            LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEK++F+QAVSSYGKDFDMISRCIRSKSR
Sbjct: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRSKSR 1020

Query: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080
            DQCKIFFSKARKCLGLDLMHTSGDVGETPGN             E+HCVVEIC GRGSDE
Sbjct: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGN--GNDISGSGTDTEDHCVVEICGGRGSDE 1080

Query: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140
             ISKSING STSVNINHEE+VSA T NMRTSMEFE STALQQ DEKGAEAVGN+IFETLK
Sbjct: 1081 SISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETLK 1140

Query: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200
            EEDVPNPSQP HD KIEGSSENTE GKSCNEPDILRSESVSTVDENSAAVSE RATVKLA
Sbjct: 1141 EEDVPNPSQPMHDQKIEGSSENTEGGKSCNEPDILRSESVSTVDENSAAVSECRATVKLA 1200

Query: 1201 IG-EEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKS 1260
            IG EEVGSD NLH QST+ CSGQDSTG DSNIALEGSS+GLDP ILHPNILKVEPVEKKS
Sbjct: 1201 IGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDPQILHPNILKVEPVEKKS 1260

Query: 1261 CIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNR--DDDAEHPNN 1320
            CIKSEENFL+VRNSDTGVIGREQMLNQD+ S TLVLQ++SDA+QKPMNR  DDD EH NN
Sbjct: 1261 CIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDADQKPMNRDKDDDDEHRNN 1320

Query: 1321 LLCNSESSTFPRSYPFNKQIFEDINRNINHAYFR-VQGLSKPDINCNSKYVSEGQFLQNC 1380
            LL NSES  FPRSYPFNKQIFEDINRNINH YF  VQGLSKPDINCN+KYV EGQ+LQNC
Sbjct: 1321 LLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPDINCNNKYVPEGQYLQNC 1380

Query: 1381 NSSKPHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQ 1440
            NSSKPHN AE PFLSQNIELGH+HQKNA  XXXXX  DVPRRKGDVKLFGQILSHAPSQQ
Sbjct: 1381 NSSKPHNPAELPFLSQNIELGHNHQKNAXXXXXXXXXDVPRRKGDVKLFGQILSHAPSQQ 1440

Query: 1441 NSSSGSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKY 1500
            NSSSGSNECGEKKG LHNSSSKSCDMGE++PLRSYGFWDGSRIQTGLSALPDSAILQ+KY
Sbjct: 1441 NSSSGSNECGEKKG-LHNSSSKSCDMGEHVPLRSYGFWDGSRIQTGLSALPDSAILQSKY 1500

Query: 1501 PAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPF 1560
            PAAFSGYS TSVKTEQQ LQAL+NN DQSLNE+VSAFPTKDGVVDYHSYRSRDGVKMRPF
Sbjct: 1501 PAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVVDYHSYRSRDGVKMRPF 1560

Query: 1561 PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY 1620
            PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY
Sbjct: 1561 PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY 1620

Query: 1621 AKADQYAGQPASMFTREDGSWGGGGNGGDLGSR 1650
            AKADQYAGQP SMFTREDGSW  GG GGDLGSR
Sbjct: 1621 AKADQYAGQPGSMFTREDGSW-RGGKGGDLGSR 1647

BLAST of CsaV3_5G036430.1 vs. NCBI nr
Match: XP_023516042.1 (uncharacterized protein LOC111780021 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2459.9 bits (6374), Expect = 0.0e+00
Identity = 1309/1689 (77.50%), Postives = 1423/1689 (84.25%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDSYHGSREFNRWGSAD RRPTGHGK GG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSE++SHGYGPSRSFSDRV+EDESFRPSVPRGDGKY RIGRESRGSFS RDWR HS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRGHSK 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQS--------------------FHGL 180
            + +  FGNPSRR SSQD SSDQRS+DDTVTYSSPQS                     +GL
Sbjct: 121  ENSKEFGNPSRRPSSQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVYGL 180

Query: 181  ENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPS 240
             NGP+SDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++SPS
Sbjct: 181  GNGPKSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPHRVASPLQSPS 240

Query: 241  AEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELT 300
            AEATAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEV +LSS S EL 
Sbjct: 241  AEATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAELA 300

Query: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGS 360
            HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK A  DG+ICSSPGS S
Sbjct: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGIICSSPGSSS 360

Query: 361  QN-LQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKT 420
            QN LQKL  SIEK+EISS+ NLGSSLVELF+SDDP+T+ESCFGKSTLNKLLAYKGEISKT
Sbjct: 361  QNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPSTVESCFGKSTLNKLLAYKGEISKT 420

Query: 421  LEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPL 480
            LE TESEID LENELKSLKS NGGNVSH KSCSA  ++ES  YFKEQDG+SCIA RPAPL
Sbjct: 421  LETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIAPRPAPL 480

Query: 481  VVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNG 540
             +VSSSDATVEK+P+C GD+G+EDV TKADEIDSPGTVTSKFNEPSRVVKA+AS++V+N 
Sbjct: 481  KIVSSSDATVEKMPVCIGDMGIEDVGTKADEIDSPGTVTSKFNEPSRVVKAVASNLVEND 540

Query: 541  HCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRS 600
            HCS  TD+IVP KMEGSF  SGPFVDEH TIGSGNEC LAKSCTSES+YGDL  QA S S
Sbjct: 541  HCSEATDSIVPDKMEGSFKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDLTIQADSGS 600

Query: 601  SLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFL 660
            SL D IFA NKE+AS+AAEVIFK  P  MCKIS++STK VSC ETEKL+KEK  MR++FL
Sbjct: 601  SLRDLIFARNKEHASKAAEVIFKELPTEMCKISTQSTKIVSCFETEKLVKEKIAMRRQFL 660

Query: 661  KFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLV 720
            KFKESALTLRFK+LQ SWKEGLLHSVKK RSRPQKKELSLRVTHSGHQKYR SSIRSR V
Sbjct: 661  KFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYR-SSIRSRFV 720

Query: 721  QQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVE 780
            Q G  Q+   N+EIA+R+SSKLLLNPQ+KLYRNTLKMPAMILDK EK+ALRFISHNGLVE
Sbjct: 721  QHGESQNPVVNSEIAIRYSSKLLLNPQVKLYRNTLKMPAMILDKNEKMALRFISHNGLVE 780

Query: 781  DPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSD 840
            DPCAVEKERN+INPWTSAE+E+FWEKLSLFGKDF+KISSFLDLKTTADCIQFYYKNHKSD
Sbjct: 781  DPCAVEKERNMINPWTSAEREMFWEKLSLFGKDFRKISSFLDLKTTADCIQFYYKNHKSD 840

Query: 841  SFKKNKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQK 900
            SFKKNKNLELGKQ+KSSA+TY++TSGKKWNPD NATSLDILGVAS MAAQAD +I NQQ 
Sbjct: 841  SFKKNKNLELGKQVKSSAVTYMLTSGKKWNPDVNATSLDILGVASEMAAQADGNIGNQQN 900

Query: 901  CTRHLGVGRDVESKVSWSASSP-NKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXX 960
            C RHLG+G D+ SKV+WSAS+P NK+NLD LQTEKETVAADVL                 
Sbjct: 901  CNRHLGMGGDIGSKVTWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCIT 960

Query: 961  XXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFM 1020
               DP E+ +ERKC++VD A K PS SDVMQKTDNEPCSDDSSEDVDSSNWTDEEK + M
Sbjct: 961  SAIDPSEDHKERKCHKVDSATKFPSTSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKSILM 1020

Query: 1021 QAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXX 1080
            QAVSSYGKDFDMISRC+ SKSRDQCK+FFSKARKCLGLDL+H SGDVG TPG+       
Sbjct: 1021 QAVSSYGKDFDMISRCVGSKSRDQCKVFFSKARKCLGLDLIHNSGDVG-TPGS--GNDSS 1080

Query: 1081 XXXXXXEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEEST 1140
                  ++HCVVE C  R SDEF+SKS+N  STSV INHEE+VSAVT NMR S EFEEST
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNDLSTSVIINHEESVSAVTANMRNSSEFEEST 1140

Query: 1141 ALQQSDEKGAEAVGNLIFETLKEEDVPN---------------PSQPTHDHKIEGSSENT 1200
            A +Q D  GAEAVGNL+ E  KEEDVPN               PSQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVGNLVSEISKEEDVPNLDSHSACSLTNPAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 ESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLAI-GEEVGSDTNLHGQSTILCSGQ 1260
            E+ K CN+PDILR ESV+TVDENSAAVSE RAT +LA  GEE GSDTNLHGQS +  S Q
Sbjct: 1201 EACKRCNDPDILRPESVATVDENSAAVSESRATTELAFGGEEDGSDTNLHGQSMLQRSVQ 1260

Query: 1261 DSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQ 1320
            DSTG +SN+ALE  ++G DP I HP ILKV+ V  KSCIK +EN L VRNS  GVIGRE+
Sbjct: 1261 DSTGFNSNLALE--ALGFDPQISHPKILKVDSVANKSCIK-DENSLVVRNSGPGVIGREE 1320

Query: 1321 MLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDIN 1380
            MLNQD+   TLVLQ + DA+QKPMNRDD A+H N L  + ESS FP SYPFNKQI EDIN
Sbjct: 1321 MLNQDMFPSTLVLQGVGDAHQKPMNRDDCADHQNRLSRHIESSEFPSSYPFNKQIVEDIN 1380

Query: 1381 RNINHAYF-RVQGLSKPDINCNSKYVSEGQFLQNCNSSKP--HNLAEPPFLSQNIELGHD 1440
            RNINH  F   QGLSK  INCN  YV+E  +LQNCNSSK   H  AE P L +N+ELGHD
Sbjct: 1381 RNINHTDFPAFQGLSK--INCNGTYVAEDCYLQNCNSSKEPCHQAAELPLLPRNVELGHD 1440

Query: 1441 HQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKS 1500
            HQ N S      DSDVPRRKGDVKLFGQILSHAPS QNSSSGSN+CGE+K    +   KS
Sbjct: 1441 HQ-NTSCSGNASDSDVPRRKGDVKLFGQILSHAPSLQNSSSGSNDCGEEKE--FHKLRKS 1500

Query: 1501 CDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALS 1560
             DMGEN+PLRSYGFW+GSR+QTGLSALPDSAILQAKYPAAFSGYSATS+KTEQQPL+AL+
Sbjct: 1501 YDMGENVPLRSYGFWNGSRMQTGLSALPDSAILQAKYPAAFSGYSATSLKTEQQPLRALA 1560

Query: 1561 NNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQ 1620
            NNGD++LNELVSAFPTKDGVVDY SYRSRDGV MRPFPVD+FSEMHRRNG+D +SLSSLQ
Sbjct: 1561 NNGDRNLNELVSAFPTKDGVVDYQSYRSRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQ 1620

Query: 1621 QQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGG 1649
            QQGRV+VGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAK+DQY GQP S FTREDGSW G
Sbjct: 1621 QQGRVVVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVGQPGSTFTREDGSWRG 1676

BLAST of CsaV3_5G036430.1 vs. NCBI nr
Match: XP_022956541.1 (uncharacterized protein LOC111458252 [Cucurbita moschata])

HSP 1 Score: 2454.1 bits (6359), Expect = 0.0e+00
Identity = 1304/1689 (77.21%), Postives = 1416/1689 (83.84%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDSYHGSREFNRWGSAD RRPTGHGK GG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSE++SHGYGPSRSFSDRV+EDESFRPSVPRGDGKY RIGRESRGSFS RDWR HS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRGHSK 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQS--------------------FHGL 180
            + +  FGNPSRR SSQD SSDQRS+DDTVTYSSPQS                     +GL
Sbjct: 121  ENSKEFGNPSRRPSSQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVYGL 180

Query: 181  ENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPS 240
             NGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++SPS
Sbjct: 181  GNGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPRRVASPLQSPS 240

Query: 241  AEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELT 300
             EATAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEV +LSS S ELT
Sbjct: 241  TEATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAELT 300

Query: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGS 360
            HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK A  DG+ICSSPGS S
Sbjct: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGIICSSPGSSS 360

Query: 361  QN-LQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKT 420
            QN LQKL  SIEK+EISS+ NLGSSLVELF+SDDPNT+ESCFGKSTLNKLLAYKGEISKT
Sbjct: 361  QNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPNTVESCFGKSTLNKLLAYKGEISKT 420

Query: 421  LEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPL 480
            LE TESEID LENELKSLKS NGGNVSH KSCSA  ++ES  YFKEQDG+SCIA+RPAPL
Sbjct: 421  LETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIASRPAPL 480

Query: 481  VVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNG 540
             +VSSSDATVEK+P+C GD G+EDV TKADEIDSPGTVTSKFNEPSRVVKA+AS++V+N 
Sbjct: 481  KIVSSSDATVEKMPVCIGDKGIEDVGTKADEIDSPGTVTSKFNEPSRVVKAVASNLVEND 540

Query: 541  HCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRS 600
            HCS  TD+IVP KMEGSF  SGPFVDEH TIGSGNEC LAKSCTSES+YGDL  QA   S
Sbjct: 541  HCSEATDSIVPDKMEGSFKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDLTTQANCGS 600

Query: 601  SLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFL 660
            S  D IFA NKEYAS+A EVIFK  P  MCKIS++STK VSC ETEKL+KEK  MR++FL
Sbjct: 601  SFRDLIFARNKEYASKATEVIFKELPTEMCKISTQSTKIVSCFETEKLVKEKIAMRRQFL 660

Query: 661  KFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLV 720
            KFKESALTLRFK+LQ SWKEGLLHSVKK RSRPQKKELSLRVTHSGHQKYR SSIRSR V
Sbjct: 661  KFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYR-SSIRSRFV 720

Query: 721  QQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVE 780
            Q G  Q+   N+EIA+R+SSKLLLNPQ+KLYRNTLKMPAMILDK EK+ALRFISHNGLVE
Sbjct: 721  QHGETQNPVINSEIAIRYSSKLLLNPQVKLYRNTLKMPAMILDKNEKMALRFISHNGLVE 780

Query: 781  DPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSD 840
            DPCAVEKERN+INPWTSAE+EIFWEKLSLFGKDF+K+SSFLDLKTTADCIQFYYKNHKSD
Sbjct: 781  DPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFRKVSSFLDLKTTADCIQFYYKNHKSD 840

Query: 841  SFKKNKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQK 900
            SFKKNKNLELGKQ+KSSA+TY++TSGKKWNPD NAT+LDILGVAS MAAQAD +I NQQ 
Sbjct: 841  SFKKNKNLELGKQVKSSAVTYMLTSGKKWNPDVNATNLDILGVASEMAAQADGNIGNQQN 900

Query: 901  CTRHLGVGRDVESKVSWSASSP-NKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXX 960
            C RHLG+G D+ SKVSWSAS+P NK+NLD LQTEKETVAADVL                 
Sbjct: 901  CNRHLGMGGDIGSKVSWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCIT 960

Query: 961  XXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFM 1020
               DP E+ +ERKC++VD A K PS SDVMQKTDNEPCSDDSSEDVDSSNWTDEEK + M
Sbjct: 961  SAIDPSEDHKERKCHKVDSATKFPSTSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKSILM 1020

Query: 1021 QAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXX 1080
            QAVSSYGKDFDMISRC+RSKSRDQCK+FFSKARKCLGLDL+H SGDVG TPG+       
Sbjct: 1021 QAVSSYGKDFDMISRCVRSKSRDQCKVFFSKARKCLGLDLIHNSGDVG-TPGS--GNDSS 1080

Query: 1081 XXXXXXEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEEST 1140
                  ++HCVVE C  R SDEF+SKS+NG STSV INHEE+VSAVT NMR S EFEEST
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNGLSTSVIINHEESVSAVTANMRNSSEFEEST 1140

Query: 1141 ALQQSDEKGAEAVGNLIFETLKEEDVPN---------------PSQPTHDHKIEGSSENT 1200
            A +Q D  GAEAV NL+ E  KEEDVPN               PSQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVVNLVSEISKEEDVPNLDSHSACSLTNAAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 ESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLAI-GEEVGSDTNLHGQSTILCSGQ 1260
            E+ K CN+PDILR ESV+TVDENSAAVSE RAT +LA  GEE GSDTNLHGQS +  S Q
Sbjct: 1201 EACKRCNDPDILRPESVATVDENSAAVSESRATTELAFGGEEDGSDTNLHGQSMLQRSVQ 1260

Query: 1261 DSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQ 1320
            DSTG +SN+ LE  S+G DP I HP ILKV+ V  KSCIK E +   VRNS  GV+GRE+
Sbjct: 1261 DSTGFNSNLDLE--SLGFDPRISHPKILKVDSVANKSCIKDENSL--VRNSGLGVVGREE 1320

Query: 1321 MLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDIN 1380
            MLNQD+   TLVLQ + DA+QKPMNRDD ++H N L  + ESS FP SYPFNKQI EDIN
Sbjct: 1321 MLNQDMFPSTLVLQGVGDAHQKPMNRDDCSDHQNRLSRHIESSEFPSSYPFNKQIVEDIN 1380

Query: 1381 RNINHAYF-RVQGLSKPDINCNSKYVSEGQFLQNCNSSKP--HNLAEPPFLSQNIELGHD 1440
            RNINH  F   QGLSK  INCN  YV E  +LQNCNSSK   H  AE P L QN+ELGHD
Sbjct: 1381 RNINHTDFPAFQGLSK--INCNGTYVVEDCYLQNCNSSKEPCHRAAELPLLPQNVELGHD 1440

Query: 1441 HQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKS 1500
            HQ N S      DSDVPR KGDVKLFGQILSHAPS QNSSSGSN+CG++K    +   KS
Sbjct: 1441 HQ-NTSCSGNASDSDVPRSKGDVKLFGQILSHAPSLQNSSSGSNDCGDEKE--FHKLRKS 1500

Query: 1501 CDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALS 1560
             DMGEN+PLRSYGFW+GSR+QTGLSALPDSAILQAKYPAAFSGYS+TS+KTEQQPL+AL+
Sbjct: 1501 YDMGENVPLRSYGFWNGSRMQTGLSALPDSAILQAKYPAAFSGYSSTSLKTEQQPLRALA 1560

Query: 1561 NNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQ 1620
            NNGD++LNELVSAFPTKDGVVDY SYRSRDGV MRPFPVD+FSEMHRRNG+D +SLSSLQ
Sbjct: 1561 NNGDRNLNELVSAFPTKDGVVDYQSYRSRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQ 1620

Query: 1621 QQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGG 1649
            QQGRV+VGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAK+DQY GQP S FTREDGSW G
Sbjct: 1621 QQGRVVVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVGQPGSTFTREDGSWRG 1675

BLAST of CsaV3_5G036430.1 vs. NCBI nr
Match: XP_022989544.1 (uncharacterized protein LOC111486582 [Cucurbita maxima])

HSP 1 Score: 2451.4 bits (6352), Expect = 0.0e+00
Identity = 1306/1688 (77.37%), Postives = 1410/1688 (83.53%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDSYHGSREFNRWGSAD RRPTGHGK GG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSE++SHGYGPSRSFSDRV+EDESFRPSVPRGDGKY RIGRESRGSFS RDWRSHS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRSHSK 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQS--------------------FHGL 180
            + +  FGNPSRR SSQD SSDQRS+DDTVTYSSPQS                     +GL
Sbjct: 121  ENSKEFGNPSRRPSSQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVYGL 180

Query: 181  ENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPS 240
             NGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++SPS
Sbjct: 181  GNGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPRRVASPLQSPS 240

Query: 241  AEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELT 300
            A+ATAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEV +LSS S ELT
Sbjct: 241  ADATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAELT 300

Query: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGS 360
            HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK A  DGMICSSPGS S
Sbjct: 301  HSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGMICSSPGSSS 360

Query: 361  QN-LQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKT 420
            QN LQKL  SIEK+EISS+ NLGSSLVELF+SDDP+++ESCFGKSTLNKLL YKGEISKT
Sbjct: 361  QNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPSSVESCFGKSTLNKLLTYKGEISKT 420

Query: 421  LEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPL 480
            LE TESEID LENELKSLKS NGGNVSH KSCSA  ++ES  YFKEQDG+SCIA RPAPL
Sbjct: 421  LETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIAPRPAPL 480

Query: 481  VVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNG 540
             +VSSSDATVEK+P+C GD+G+ED  TKADEIDSPGTVTSKFNEPSRVVKA+ASD+V+N 
Sbjct: 481  KIVSSSDATVEKMPVCIGDMGIEDGSTKADEIDSPGTVTSKFNEPSRVVKAVASDLVEND 540

Query: 541  HCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRS 600
            HCS  TD+IVP KMEGS   SGPFVDEH TIGSGNEC LAKSCTSES+YGD+  QA S S
Sbjct: 541  HCSEATDSIVPHKMEGSSKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDMTTQADSGS 600

Query: 601  SLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFL 660
            SLCD IFA NKEYAS+AAEVIFK  P  MCKIS++S K VSC ETEKL+KEK  MR++FL
Sbjct: 601  SLCDLIFARNKEYASKAAEVIFKELPTEMCKISTQSIKIVSCFETEKLVKEKIAMRRQFL 660

Query: 661  KFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLV 720
            KFKESALTLRFK+LQ SWKEGLLHSVKK RSRPQKKELSLRVTHSGHQKYR SSIRSR V
Sbjct: 661  KFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYR-SSIRSRFV 720

Query: 721  QQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVE 780
            Q G CQ+   N+EIA+R+SSKLLLNPQ+KLYRNTLKMPAMILDK EK+ALRFISHNGLVE
Sbjct: 721  QHGECQNPVVNSEIAIRYSSKLLLNPQVKLYRNTLKMPAMILDKNEKMALRFISHNGLVE 780

Query: 781  DPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSD 840
            DPCAVEKERN+INPWTSAE+EIFWEKLSLFGKDF+KISSFLDLKTTADCIQFYYKNHKSD
Sbjct: 781  DPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFRKISSFLDLKTTADCIQFYYKNHKSD 840

Query: 841  SFKKNKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQK 900
            SFKKNKNLELGKQ+KSSA TY++TSGKKWNPD NATSLDILGVAS MAAQAD DIENQQK
Sbjct: 841  SFKKNKNLELGKQVKSSAATYMLTSGKKWNPDVNATSLDILGVASEMAAQADVDIENQQK 900

Query: 901  CTRHLGVGRDVESKVSWSASSP-NKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXX 960
            C RHLG+GRD+ SKVSWSAS+P NK+NLD LQTEKETVAADVL                 
Sbjct: 901  CNRHLGMGRDIGSKVSWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCLT 960

Query: 961  XXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFM 1020
               DP E+ +ERKC++VD A KLPS SDVMQKTDNEPCSDDSSEDVDSSNWTDEEK + M
Sbjct: 961  SAIDPSEDHKERKCHKVDSATKLPSTSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKSILM 1020

Query: 1021 QAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXX 1080
            QAVSSYGKDFDMISRC+RSKSRDQCK+FFSKARKCLGLDL+HTSGDVG TPG+       
Sbjct: 1021 QAVSSYGKDFDMISRCVRSKSRDQCKVFFSKARKCLGLDLIHTSGDVG-TPGS--GNDSS 1080

Query: 1081 XXXXXXEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEEST 1140
                  ++HCVVE C  R SDEF+SKS+NG STSV INHEE+VSAVT NMR S +FEEST
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNGLSTSVIINHEESVSAVTANMRNSSQFEEST 1140

Query: 1141 ALQQSDEKGAEAVGNLIFETLKEEDVPN---------------PSQPTHDHKIEGSSENT 1200
            A +Q D  GAEAVGNL+ E  KEED PN               PSQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVGNLVSEISKEEDAPNLDSHSACSLTNAAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 ESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLAIGEEVGSDTNLHGQSTILCSGQD 1260
            E+ K CNEPDILR ESV+TVDENSAAVSE RAT +LA G E GSDTNLHGQS +  S QD
Sbjct: 1201 EACKRCNEPDILRPESVATVDENSAAVSESRATTELAFGGEDGSDTNLHGQSMLQRSFQD 1260

Query: 1261 STGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQM 1320
            STG +SN+ALE  S+G DP I HP ILKV+ V  KSCIK +EN L VRNS  G+IGRE+M
Sbjct: 1261 STGFNSNLALE--SLGFDPQISHPKILKVDSVANKSCIK-DENSLVVRNSGPGIIGREEM 1320

Query: 1321 LNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDINR 1380
            LNQD+    LVLQ + DA+QKPMNRDD A+H N L  + ESS FP SYPFNKQI EDINR
Sbjct: 1321 LNQDMFPSALVLQGVGDAHQKPMNRDDCADHQNRLSRHIESSEFPSSYPFNKQIVEDINR 1380

Query: 1381 NINHAYF-RVQGLSKPDINCNSKYVSEGQFLQNCNSSKP--HNLAEPPFLSQNIELGHDH 1440
            NINH  F   QGLSK  INCN  YV E  + QNCNSSK   H  AE P L +N+ELGHDH
Sbjct: 1381 NINHTDFPAFQGLSK--INCNGTYVVEDCYPQNCNSSKEPCHRAAELPLLPKNVELGHDH 1440

Query: 1441 QKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKSC 1500
            Q N S      DSDVP RKGDVKLFGQILSHAPS QN SSGSN+C E+K   H   SKS 
Sbjct: 1441 Q-NTSCSGNASDSDVPHRKGDVKLFGQILSHAPSLQNLSSGSNDCREEK-EFHKLRSKSY 1500

Query: 1501 DMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSN 1560
            DMGEN+PLRSY FWDGSRIQTGLS LPDSAILQAKYPAAFSGYSATS+KTEQQPL+A +N
Sbjct: 1501 DMGENVPLRSYCFWDGSRIQTGLSTLPDSAILQAKYPAAFSGYSATSLKTEQQPLRAFAN 1560

Query: 1561 NGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQ 1620
            NGD++LNELVSAFPTKDGVVDY SYR RDGV MRPFPVD+FSEMHRRNG+D +SLSSLQQ
Sbjct: 1561 NGDRNLNELVSAFPTKDGVVDYQSYRIRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQQ 1620

Query: 1621 QGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGGG 1649
            QGRV     VVGRGGILMGGSCTGVSDPVAAIKMHYAK+DQY  QP S FTREDGSW GG
Sbjct: 1621 QGRV-----VVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVRQPGSTFTREDGSWRGG 1670

BLAST of CsaV3_5G036430.1 vs. TAIR10
Match: AT3G52250.1 (Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 442.6 bits (1137), Expect = 1.1e-123
Identity = 499/1710 (29.18%), Postives = 747/1710 (43.68%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAA--RWRD---SYHGSREF-NRWGSADLRRPTG 60
            MP +   WDRK+L ++RKH++ E    +   RWRD   S+H  REF +R GS D RRP+ 
Sbjct: 1    MPQDHASWDRKELLRQRKHDRPEQSFESPPFRWRDSPSSHHVPREFSSRLGSGDFRRPSC 60

Query: 61   HGKQGGWHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRD 120
            HGKQGG HQF E++SHGY  SRS S R+   +++RPS  RGD +Y R  R+ R S S ++
Sbjct: 61   HGKQGGRHQFVEETSHGYTSSRS-SARMF--DNYRPSASRGDWRYTRNCRDDRVSVSQKE 120

Query: 121  WRSHSRDANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSS--------------------- 180
            W+ ++ + +NG      R     + + +RSVD+   ++S                     
Sbjct: 121  WKCNTWEMSNGSSRSFERPFG--IRNGRRSVDERPLHASDTHSTVVNSLDPANSAHYLDN 180

Query: 181  ------------------------------------------------------------ 240
                                                                        
Sbjct: 181  EISTPVRSLKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNNYGNKVCSPAKQCNDLM 240

Query: 241  -------------------------------PQ---SFHGLE--NGPRSDV-EVSLGSTD 300
                                           PQ   S HG+   +G R    E SLG+T 
Sbjct: 241  YGRRLVSDNSLDAPIPNAELEGTWEQLRLKDPQDNNSLHGINDIDGDRKCAKESSLGATG 300

Query: 301  WKPLKWSRSGSLSSRGSAYSSST--------NSKNEKADLPLRVASPIESPSAEATACVT 360
              PL W+ SGS +S+ S +S S+        +S + K ++  ++ +  +S S +ATAC T
Sbjct: 301  KLPL-WNSSGSFASQSSGFSHSSSLKSLGAVDSSDRKIEVLPKIVTVTQSSSGDATACAT 360

Query: 361  SSLPSEDAISRKKPRLGWGDGLAKYEKEKVEV---PDGSLRKEVALLSSGSGELTHSLGS 420
            ++  SE+  SRKK RLGWG+GLAKYEK+KV+V    DG+      L+ +G  EL HSL  
Sbjct: 361  TTHLSEEMSSRKKQRLGWGEGLAKYEKKKVDVNPNEDGT-----TLMENGLEEL-HSLNK 420

Query: 421  NFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADGMI---CSSPGS-GSQ 480
            N A+KSP      D  SP TPSS ACSSS G  DK   K A A   +   C SP    S 
Sbjct: 421  NIADKSPTAAIVPDYGSPTTPSSVACSSSPGFADKSSPKAAIAASDVSNMCRSPSPVSSI 480

Query: 481  NLQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKST-LNKLLAYKGEISKTL 540
            +L++   +IE+++  S+   G  L EL  +DD  T +S   + T +N LLA+KGEI K +
Sbjct: 481  HLERFPINIEELDNISMERFGCLLNELLGTDDSGTGDSSSVQLTSMNTLLAWKGEILKAV 540

Query: 541  EMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLV 600
            EMTESEID LEN+ ++LK + G   S     S+      +   KEQ   S       P  
Sbjct: 541  EMTESEIDLLENKHRTLK-LEGRRHSRVVGPSSYCCDGDANVPKEQASCSL-----DPKA 600

Query: 601  VVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGH 660
              SS   T+ + P+     G+  V     E DSPG V       + V +    DI+    
Sbjct: 601  TASSVAKTLVRAPV--HQAGLAKVPADVFE-DSPGEVKPLSQSFATVER--EEDILPIPS 660

Query: 661  CSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSS 720
                  +                  E  T    N+ T+  S   +S+       A     
Sbjct: 661  MKAAVSS-----------------KEINTPAFANQETIEVSSADDSM-------ASKEDL 720

Query: 721  LCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLK 780
                + + NK+YA  ++ V  +  P       +     +  ++ +  ++EK   R   L+
Sbjct: 721  FWAKLLSANKKYACESSGVFNQLLPRDFNSSDNSRFPGICQTQFDSHVQEKIADRVGLLR 780

Query: 781  FKESALTLRFKSLQQSWKEGLLH-SVKKCRSRPQKK-ELSLRVTHSGHQKYRSSSIRSRL 840
             +E  L L+FK+ Q SWK+ L   ++ K +S+  KK EL     + G+ K    S+R R 
Sbjct: 781  AREKILLLQFKAFQLSWKKDLDQLALAKYQSKSSKKTELYPNAKNGGYLKL-PQSVRLRF 840

Query: 841  VQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLV 900
                  + S   T   V +  KLL    +K +R+ LKMPAMILD+KE++  RFIS NGL+
Sbjct: 841  SSSAPRRDSVVPTTELVSYMEKLLPGTHLKPFRDILKMPAMILDEKERVMSRFISSNGLI 900

Query: 901  EDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKS 960
            EDPC VEKER +INPWTS EKEIF   L++ GKDFKKI+S L  KTTADCI +YYKNHKS
Sbjct: 901  EDPCDVEKERTMINPWTSEEKEIFLNLLAMHGKDFKKIASSLTQKTTADCIDYYYKNHKS 960

Query: 961  DSFKK-NKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQ 1020
            D F K  K    GK+ K    TY++   KKW  +  A SLDILG  S++AA A      +
Sbjct: 961  DCFGKIKKQRAYGKEGKH---TYMLAPRKKWKREMGAASLDILGDVSIIAANAGKVASTR 1020

Query: 1021 QKCTRHLGV-GRDVESKVSWSASSPNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXX 1080
               ++ + + G    + +    ++    +       K T  ADVL               
Sbjct: 1021 PISSKKITLRGCSSANSLQHDGNNSEGCSYSFDFPRKRTAGADVL--AVGPLSPEQINSC 1080

Query: 1081 XXXXXDPREELRERKCYRVDFAAKLPSLSDVMQKTD------------NEPCSDDSSEDV 1140
                   RE   +    + +   K P +S  +   +                   S  + 
Sbjct: 1081 LRTSVSSRERCMDH--LKFNHVVKKPRISHTLHNENXXXXXXXXXXXXXXXXXXXSCGET 1140

Query: 1141 DSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLD-LMHTSG 1200
               +WTD+E+  F+Q  S +GK+F  ISR + ++S DQCK+FFSK RKCLGL+ +   SG
Sbjct: 1141 GPIHWTDDERSAFIQGFSLFGKNFASISRYVGTRSPDQCKVFFSKVRKCLGLESIKFGSG 1200

Query: 1201 DVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDEFISK-SINGGSTSVNINHEETVS 1260
            +V  +                E+ C +E   G  ++   +K  +N  ++  N+N +    
Sbjct: 1201 NVSTSVS--VDNGNEGGGSDLEDPCPMESNSGIVNNGVCAKMGMNSPTSPFNMNQDGVNQ 1260

Query: 1261 AVTDNMRTSMEFEESTALQQSDEKGAEAV-----GNLIFETLKEEDVPN-PSQPTHD--- 1320
            + + N++  +   E       +E G + +      NL+         P+  S+   D   
Sbjct: 1261 SGSANVKADLSRSE-------EENGQKYLCLKDDNNLVNNAYVNGGFPSLVSESCRDLVD 1320

Query: 1321 -HKIEGSSENTESGKSCNEPDILRSESVST-VDENSAAVSEGRATVKLAIGE---EVGSD 1380
             + +E  S+     KS +   +   E V T V  +S  +  G + +   I E   E+   
Sbjct: 1321 INTVESQSQAAGKSKSNDLMSMEIDEGVLTSVTISSEPLYCGLSVLSNVIVETPTEISRK 1380

Query: 1381 TNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPH-----ILHPNILKVEPVE------- 1440
             +    +T+      +       A    + GL+P        +P  L   P+E       
Sbjct: 1381 GSGDQGATMPKFSSKNQDGVMQAANRTRNSGLEPESAPSGFRYPECLHHVPIEVCTENPI 1440

Query: 1441 KKSCIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPN 1500
              S  +   N  +   S   ++G+      D+  P   L+   D   + +   +  +   
Sbjct: 1441 GVSAPRGNPNCHAESESGNSLVGQVDE-THDLGWPKNNLE--LDGRLQVLGHVNPEQIGL 1500

Query: 1501 NLLCNSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNC 1516
                N+ES   P+     + + +D++R            SK D+   ++   EG  L  C
Sbjct: 1501 LKATNTESCQNPQ-----RSVTQDLSR---------ISRSKSDLIVKTQRTGEGFSLTKC 1560

BLAST of CsaV3_5G036430.1 vs. Swiss-Prot
Match: sp|Q4KKX4|NCOR1_XENTR (Nuclear receptor corepressor 1 OS=Xenopus tropicalis OX=8364 GN=ncor1 PE=2 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 1.9e-13
Identity = 68/215 (31.63%), Postives = 107/215 (49.77%), Query Frame = 0

Query: 628 IKEKFVMRKKFL----------KFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQK--K 687
           IK   VMRKK +          K +E  +  R+  L ++W++     V +  + P++  K
Sbjct: 271 IKTNQVMRKKLILFFKRRNHARKLREQNICQRYDQLMEAWEK----KVDRIENNPRRKAK 330

Query: 688 ELSLRVTHSGH----QKYRSSSIR-SRLVQQGACQSSTF---NTEIAVRHSSKLLLNPQI 747
           E   R  +       +K R    R  R+ Q+GA  S+T      EI+             
Sbjct: 331 ESKTREYYEKQFPEIRKQREQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE 390

Query: 748 KLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLS 807
           K  R    +P M+ D +++  ++FI+ NGL+EDP  V K+R  +N WT  EKEIF EK  
Sbjct: 391 KQMRQLSVIPPMMFDAEQR-RVKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKEKFV 450

Query: 808 LFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFK 823
              K+F  I+S+L+ KT +DC+ +YY   K+++FK
Sbjct: 451 QHPKNFGLIASYLERKTVSDCVLYYYLTKKNENFK 480

BLAST of CsaV3_5G036430.1 vs. Swiss-Prot
Match: sp|O75376|NCOR1_HUMAN (Nuclear receptor corepressor 1 OS=Homo sapiens OX=9606 GN=NCOR1 PE=1 SV=2)

HSP 1 Score: 76.3 bits (186), Expect = 3.7e-12
Identity = 67/225 (29.78%), Postives = 109/225 (48.44%), Query Frame = 0

Query: 628 IKEKFVMRKKFLKF----------KESALTLRFKSLQQSWKEGLLHSVKKCRSRPQK--K 687
           IK   VMRKK + F          +E  +  R+  L ++W++     V +  + P++  K
Sbjct: 279 IKTNQVMRKKLILFFKRRNHARKQREQKICQRYDQLMEAWEK----KVDRIENNPRRKAK 338

Query: 688 ELSLRVTHSGH----QKYRSSSIR-SRLVQQGACQSSTF---NTEIAVRHSSKLLLNPQI 747
           E   R  +       +K R    R  R+ Q+GA  S+T      EI+             
Sbjct: 339 ESKTREYYEKQFPEIRKQREQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE 398

Query: 748 KLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLS 807
           K  R    +P M+ D +++  ++FI+ NGL+EDP  V K+R  +N WT  EKEIF +K  
Sbjct: 399 KQMRQLSVIPPMMFDAEQR-RVKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKDKFI 458

Query: 808 LFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ 833
              K+F  I+S+L+ K+  DC+ +YY   K++++K       GK+
Sbjct: 459 QHPKNFGLIASYLERKSVPDCVLYYYLTKKNENYKALVRRNYGKR 498

BLAST of CsaV3_5G036430.1 vs. Swiss-Prot
Match: sp|Q60974|NCOR1_MOUSE (Nuclear receptor corepressor 1 OS=Mus musculus OX=10090 GN=Ncor1 PE=1 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 3.7e-12
Identity = 67/225 (29.78%), Postives = 109/225 (48.44%), Query Frame = 0

Query: 628 IKEKFVMRKKFLKF----------KESALTLRFKSLQQSWKEGLLHSVKKCRSRPQK--K 687
           IK   VMRKK + F          +E  +  R+  L ++W++     V +  + P++  K
Sbjct: 279 IKTNQVMRKKLILFFKRRNHARKQREQKICQRYDQLMEAWEK----KVDRIENNPRRKAK 338

Query: 688 ELSLRVTHSGH----QKYRSSSIR-SRLVQQGACQSSTF---NTEIAVRHSSKLLLNPQI 747
           E   R  +       +K R    R  R+ Q+GA  S+T      EI+             
Sbjct: 339 ESKTREYYEKQFPEIRKQREQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE 398

Query: 748 KLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLS 807
           K  R    +P M+ D +++  ++FI+ NGL+EDP  V K+R  +N WT  EKEIF +K  
Sbjct: 399 KQMRQLSVIPPMMFDAEQR-RVKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKDKFI 458

Query: 808 LFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ 833
              K+F  I+S+L+ K+  DC+ +YY   K++++K       GK+
Sbjct: 459 QHPKNFGLIASYLERKSVPDCVLYYYLTKKNENYKALVRRNYGKR 498

BLAST of CsaV3_5G036430.1 vs. Swiss-Prot
Match: sp|Q8QG78|NCOR1_XENLA (Nuclear receptor corepressor 1 OS=Xenopus laevis OX=8355 GN=ncor1 PE=1 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 6.2e-12
Identity = 65/215 (30.23%), Postives = 104/215 (48.37%), Query Frame = 0

Query: 628 IKEKFVMRKKFL----------KFKESALTLRFKSLQQSWKEGLLHSVKKCRSRPQK--K 687
           IK   VMRKK +          K +E  +  R+  L ++W++     V +  + P++  K
Sbjct: 271 IKTNQVMRKKLILFFKRRNHARKLREQNICQRYDQLMEAWEK----KVDRIENNPRRKAK 330

Query: 688 ELSLRVTHSGH----QKYRSSSIR-SRLVQQGACQSSTF---NTEIAVRHSSKLLLNPQI 747
           E   R  +       +K R    R  R+ Q+G   S+T      EI+             
Sbjct: 331 ESKTREYYEKQFPEIRKQREQQERFQRVGQRGTGMSATIARSEHEISEIIDGLSEQENNE 390

Query: 748 KLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLS 807
           K  R    +P M+ D +++  ++FI+ NGL+EDP  V K+R  +N WT  EKEIF EK  
Sbjct: 391 KQMRQLSVIPPMMFDAEQR-RVKFINTNGLMEDPMKVYKDRQFMNVWTDHEKEIFKEKFV 450

Query: 808 LFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFK 823
              K+F  I+S+L+ K  +DC+ +YY   K+++ K
Sbjct: 451 RHPKNFGLIASYLERKNVSDCVLYYYLTKKNENLK 480

BLAST of CsaV3_5G036430.1 vs. Swiss-Prot
Match: sp|Q55DP9|MYBP_DICDI (Myb-like protein P OS=Dictyostelium discoideum OX=44689 GN=mybP PE=3 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 4.1e-11
Identity = 45/129 (34.88%), Postives = 67/129 (51.94%), Query Frame = 0

Query: 695  RSRLVQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISH 754
            R  +    A   + FN EI  R   +   +P  K  +    +P M+  ++  I   +I+H
Sbjct: 1004 RGTITSDVARSDAEFN-EIMDRLKEQDAQDPNCKYQQGVAIIPPMLSPEERNI--HYINH 1063

Query: 755  NGLVEDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYK 814
            NG + DP A EK+R  +  WT  EK+ F +K   + K F KI+SF D +TT D I FYY 
Sbjct: 1064 NGFIVDPIAQEKQRKSLIIWTEDEKQKFVKKYLQYPKKFSKIASFFDNRTTEDMIVFYYN 1123

Query: 815  NHKSDSFKK 824
            N K+ + K+
Sbjct: 1124 NKKTLNLKQ 1129

BLAST of CsaV3_5G036430.1 vs. TrEMBL
Match: tr|A0A0A0KWU7|A0A0A0KWU7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G623500 PE=4 SV=1)

HSP 1 Score: 3134.4 bits (8125), Expect = 0.0e+00
Identity = 1649/1649 (100.00%), Postives = 1649/1649 (100.00%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180
            DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL
Sbjct: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180

Query: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240
            KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK
Sbjct: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240

Query: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300
            KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC
Sbjct: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300

Query: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360
            ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN
Sbjct: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360

Query: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420
            LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV
Sbjct: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420

Query: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480
            NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG
Sbjct: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480

Query: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540
            VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS
Sbjct: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540

Query: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600
            GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI
Sbjct: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600

Query: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660
            FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG
Sbjct: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660

Query: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720
            LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK
Sbjct: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720

Query: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780
            LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE
Sbjct: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780

Query: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840
            IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY
Sbjct: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840

Query: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900
            LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS
Sbjct: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900

Query: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960
            PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK
Sbjct: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960

Query: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020
            LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR
Sbjct: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020

Query: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080
            DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE
Sbjct: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080

Query: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140
            FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK
Sbjct: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140

Query: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200
            EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA
Sbjct: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200

Query: 1201 IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC 1260
            IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC
Sbjct: 1201 IGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKSC 1260

Query: 1261 IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC 1320
            IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC
Sbjct: 1261 IKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNLLC 1320

Query: 1321 NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK 1380
            NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK
Sbjct: 1321 NSESSTFPRSYPFNKQIFEDINRNINHAYFRVQGLSKPDINCNSKYVSEGQFLQNCNSSK 1380

Query: 1381 PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS 1440
            PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS
Sbjct: 1381 PHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSSS 1440

Query: 1441 GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF 1500
            GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF
Sbjct: 1441 GSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAF 1500

Query: 1501 SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI 1560
            SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI
Sbjct: 1501 SGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPFPVDI 1560

Query: 1561 FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD 1620
            FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD
Sbjct: 1561 FSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKAD 1620

Query: 1621 QYAGQPASMFTREDGSWGGGGNGGDLGSR 1650
            QYAGQPASMFTREDGSWGGGGNGGDLGSR
Sbjct: 1621 QYAGQPASMFTREDGSWGGGGNGGDLGSR 1649

BLAST of CsaV3_5G036430.1 vs. TrEMBL
Match: tr|A0A1S3BG74|A0A1S3BG74_CUCME (LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 OS=Cucumis melo OX=3656 GN=LOC103489481 PE=4 SV=1)

HSP 1 Score: 2860.9 bits (7415), Expect = 0.0e+00
Identity = 1515/1653 (91.65%), Postives = 1558/1653 (94.25%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120
            WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  DANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180
            D NNGFGNPSRR SSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL
Sbjct: 121  DTNNGFGNPSRRPSSQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWKPL 180

Query: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAISRK 240
            KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSED ISRK
Sbjct: 181  KWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTISRK 240

Query: 241  KPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300
            KPRLGWGDGLAKYEKEKV+VPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC
Sbjct: 241  KPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFSDC 300

Query: 301  ASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSVAN 360
            ASPATPSSFACSSSSGLEDKPFSKGA ADGMICSSPGSGSQNLQKLLCSIEKMEISS+AN
Sbjct: 301  ASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSIAN 360

Query: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSV 420
            LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKS 
Sbjct: 361  LGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLKSG 420

Query: 421  NGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVG 480
            NGGNVS+KKSCSATR++ESSTYFKEQDGISCIA RPAPLVVVSSSDATVEKVPLCKGD+G
Sbjct: 421  NGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGDMG 480

Query: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPIS 540
            VEDVDTKADEIDSPGTVTSKFNEPSRVVK   SDIVDNGHCSVVTD IVPGKMEG+FPIS
Sbjct: 481  VEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFPIS 540

Query: 541  GPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600
             PFVDE +T GSGNEC LAKSC+SES  GDLMAQAGSRSSLCDSIFACNKEYASRAAEVI
Sbjct: 541  EPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAEVI 600

Query: 601  FKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEG 660
            FKRSPVG+CKISSKSTK VSCSETEKLIKEKFV RKKFLKFKESALTLRFK+LQQSWK  
Sbjct: 601  FKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWKM- 660

Query: 661  LLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSK 720
                 K+   +  KKELSLRVTHSGHQKYR SS RSRL+QQGACQS+TFNTEIAVRHSSK
Sbjct: 661  FAAFCKEMSLKATKKELSLRVTHSGHQKYR-SSFRSRLIQQGACQSTTFNTEIAVRHSSK 720

Query: 721  LLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780
            LLLNPQIKLYRNTLKMPAMILDKKEK+ALRFISHNGLVEDPCAVEKERNLINPWTSAEKE
Sbjct: 721  LLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEKE 780

Query: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840
            IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY
Sbjct: 781  IFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAITY 840

Query: 841  LVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSASS 900
            LVTSGKKWNPDANATSLDILGVASVMAAQA+YDI NQQKC+RHLG G+DVESKVSWSAS+
Sbjct: 841  LVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAST 900

Query: 901  PNKSNLDDLQTEKETVAADVLXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAK 960
            PNKSNLDDLQTEKETVAADVL  XXXXXXXXXXXXXXXXX DPREELRE+KCY+VD AAK
Sbjct: 901  PNKSNLDDLQTEKETVAADVLAGXXXXXXXXXXXXXXXXXIDPREELREQKCYKVDSAAK 960

Query: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSR 1020
            LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEK++F+QAVSSYGKDFDMISRCIRSKSR
Sbjct: 961  LPSLSDVMQKTDNEPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRSKSR 1020

Query: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDE 1080
            DQCKIFFSKARKCLGLDLMHTSGDVGETPGN             E+HCVVEIC GRGSDE
Sbjct: 1021 DQCKIFFSKARKCLGLDLMHTSGDVGETPGN--GNDISGSGTDTEDHCVVEICGGRGSDE 1080

Query: 1081 FISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIFETLK 1140
             ISKSING STSVNINHEE+VSA T NMRTSMEFE STALQQ DEKGAEAVGN+IFETLK
Sbjct: 1081 SISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETLK 1140

Query: 1141 EEDVPNPSQPTHDHKIEGSSENTESGKSCNEPDILRSESVSTVDENSAAVSEGRATVKLA 1200
            EEDVPNPSQP HD KIEGSSENTE GKSCNEPDILRSESVSTVDENSAAVSE RATVKLA
Sbjct: 1141 EEDVPNPSQPMHDQKIEGSSENTEGGKSCNEPDILRSESVSTVDENSAAVSECRATVKLA 1200

Query: 1201 IG-EEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNILKVEPVEKKS 1260
            IG EEVGSD NLH QST+ CSGQDSTG DSNIALEGSS+GLDP ILHPNILKVEPVEKKS
Sbjct: 1201 IGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDPQILHPNILKVEPVEKKS 1260

Query: 1261 CIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEISDANQKPMNR--DDDAEHPNN 1320
            CIKSEENFL+VRNSDTGVIGREQMLNQD+ S TLVLQ++SDA+QKPMNR  DDD EH NN
Sbjct: 1261 CIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDADQKPMNRDKDDDDEHRNN 1320

Query: 1321 LLCNSESSTFPRSYPFNKQIFEDINRNINHAYFR-VQGLSKPDINCNSKYVSEGQFLQNC 1380
            LL NSES  FPRSYPFNKQIFEDINRNINH YF  VQGLSKPDINCN+KYV EGQ+LQNC
Sbjct: 1321 LLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPDINCNNKYVPEGQYLQNC 1380

Query: 1381 NSSKPHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQ 1440
            NSSKPHN AE PFLSQNIELGH+HQKNA  XXXXX  DVPRRKGDVKLFGQILSHAPSQQ
Sbjct: 1381 NSSKPHNPAELPFLSQNIELGHNHQKNAXXXXXXXXXDVPRRKGDVKLFGQILSHAPSQQ 1440

Query: 1441 NSSSGSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSRIQTGLSALPDSAILQAKY 1500
            NSSSGSNECGEKKG LHNSSSKSCDMGE++PLRSYGFWDGSRIQTGLSALPDSAILQ+KY
Sbjct: 1441 NSSSGSNECGEKKG-LHNSSSKSCDMGEHVPLRSYGFWDGSRIQTGLSALPDSAILQSKY 1500

Query: 1501 PAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDGVVDYHSYRSRDGVKMRPF 1560
            PAAFSGYS TSVKTEQQ LQAL+NN DQSLNE+VSAFPTKDGVVDYHSYRSRDGVKMRPF
Sbjct: 1501 PAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVVDYHSYRSRDGVKMRPF 1560

Query: 1561 PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY 1620
            PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY
Sbjct: 1561 PVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHY 1620

Query: 1621 AKADQYAGQPASMFTREDGSWGGGGNGGDLGSR 1650
            AKADQYAGQP SMFTREDGSW  GG GGDLGSR
Sbjct: 1621 AKADQYAGQPGSMFTREDGSW-RGGKGGDLGSR 1647

BLAST of CsaV3_5G036430.1 vs. TrEMBL
Match: tr|M5XR66|M5XR66_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G215600 PE=4 SV=1)

HSP 1 Score: 1169.8 bits (3025), Expect = 0.0e+00
Identity = 791/1745 (45.33%), Postives = 1026/1745 (58.80%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS-YHGSREFNRWGSADLRRPTGHGKQG 60
            MPPEPLPWDRKD FKERKHE+SE++GS ARWRDS +H  R+FNRW SAD RRP GHGKQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSESLGSVARWRDSPHHAPRDFNRWPSADFRRPPGHGKQG 60

Query: 61   GWHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHS 120
            GWH FSEDS HGY  SRS  D+++EDES RPS  RGDG+Y R  R++RGS+S R+ + HS
Sbjct: 61   GWHLFSEDSGHGYASSRS-GDKMLEDESCRPSFSRGDGRYGRNSRDNRGSYSQRECKGHS 120

Query: 121  RDANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQ---------------------SFH 180
             + ++G  N   R +  DV ++QR+ DD +TYSS Q                        
Sbjct: 121  WETSSGSPNTPGRPN--DVINEQRTQDDMLTYSSHQHSDFGSTWDQIQLKDQLDRMGGST 180

Query: 181  GLENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKN--------EKADLPL 240
            GL  G + + E SLGS DWKPLKW+RSGS+SSRGS +S S++SK+         K +   
Sbjct: 181  GLGAGQKCERENSLGSIDWKPLKWTRSGSMSSRGSGFSHSSSSKSIGAIDFNEAKVESQP 240

Query: 241  RVASPIESPSAEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVA 300
            + A+P++SPS EAT CVTS+ PSE+  SRKKPRLGWG+GLAKYEK+KVEVPDGS+ K+ A
Sbjct: 241  KNATPVQSPSGEATTCVTSAAPSEETTSRKKPRLGWGEGLAKYEKKKVEVPDGSMNKDGA 300

Query: 301  LLSSGSGELTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADG 360
            + S G+ E  HSL SN A+KSP+   FSDCASPATPSS ACSSS G+E+K F K A  D 
Sbjct: 301  VCSVGNMEPVHSLSSNLADKSPRVTVFSDCASPATPSSVACSSSPGVEEKSFGKTANVDN 360

Query: 361  ---MICSSPGSGSQNLQK-LLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKST- 420
                 C SP   SQ+  +    ++EK++ +S+ANLGSSL EL  SDDP++++S   + T 
Sbjct: 361  NNRNFCGSPSPMSQSHHEGFTFNLEKLDCNSIANLGSSLRELLQSDDPSSVDSGIVRPTA 420

Query: 421  LNKLLAYKGEISKTLEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKE 480
            +NKLL +KGEISK LE+TESEIDSLENELK L S +G +     + S+  V ++   FKE
Sbjct: 421  MNKLLIWKGEISKVLEVTESEIDSLENELKVLNSDSGASCPRPATSSSLPVEDNDKSFKE 480

Query: 481  QDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPS 540
            Q  ++ + TRPAPL + SS DA VEK+ L  GD        K ++IDSPGT TSKF EP 
Sbjct: 481  QVTVTNLITRPAPLQIHSSGDADVEKMCLGNGDQVEFCGIVKDEDIDSPGTATSKFVEPL 540

Query: 541  RVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSE 600
              V + +SD++ +  CS   D I   K E    + G    + +    GN   L  S    
Sbjct: 541  LKVVS-SSDVMSHNDCSGDLDPIETTKGEAKCLVPGKDEVKTDLSACGNSSMLLGSEIVA 600

Query: 601  SVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETE 660
             V G L        ++C+SI + NKE A+R+ EV  K  P    K+        S  + +
Sbjct: 601  PVSGGLGFCFSVVDTICNSICSSNKESANRSFEVFNKLLPREHYKVDISGVSISSSGKND 660

Query: 661  KLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEGL-LHSVKKCRSRPQKK-ELSLRVTH 720
             LIKEKF MRK+ L+F E  LTL++K+ Q  WKE L L S++K R +  KK ELSLR T+
Sbjct: 661  SLIKEKFAMRKRRLRFMERVLTLKYKAFQHLWKEDLRLLSIRKYRPKSHKKFELSLRATN 720

Query: 721  SGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDK 780
            +G+QK+R SSIRSR        S    TEI +  ++KLL + Q+K YRN+LKMPA+ILDK
Sbjct: 721  NGYQKHR-SSIRSRFSTPAGNLSLVPTTEI-INFTNKLLSDSQVKRYRNSLKMPALILDK 780

Query: 781  KEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLK 840
            KEK+  RFIS NGLVEDPC VEKER L+NPWT  EKE+F EKL+  GKDF+KI+SFLD K
Sbjct: 781  KEKMVTRFISSNGLVEDPCVVEKERALMNPWTPEEKELFIEKLTTCGKDFRKIASFLDHK 840

Query: 841  TTADCIQFYYKNHKSDSFKK-NKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGV 900
            TTADC++FYYK+HKS  F+K  K  ++ KQ KSSA TYL+++GKKWN + NA SLDILG 
Sbjct: 841  TTADCVEFYYKHHKSVCFEKTKKKADMTKQGKSSAKTYLISNGKKWNREMNAASLDILGA 900

Query: 901  ASVMAAQADYDIENQQKCTRHLGVG--RDVESKVSWSASSPNKSNLDDLQTEKETVAADV 960
            AS +AA AD    ++Q  +  L +G  R+         +     + D +  E+ETVAADV
Sbjct: 901  ASAIAAHADGSTRSRQAFSGRLYLGGYRNTNPSRGDDTTVERSCSFDAIGNERETVAADV 960

Query: 961  LXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDS 1020
            L                    DP E  RE KC +VD  A+ P   DVMQ  D+E CS++S
Sbjct: 961  LAGICGSLSSEAVSSCITSSIDPGEGYREWKCQKVDSLARRPLTPDVMQNVDDETCSEES 1020

Query: 1021 SEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMH 1080
              ++D S+WTD EK  F+QAVSSYGKDF MISRC+R++S+ QCK+FFSKARKCLGLDL+H
Sbjct: 1021 CGEMDPSDWTDAEKSSFIQAVSSYGKDFAMISRCVRTRSQHQCKVFFSKARKCLGLDLVH 1080

Query: 1081 TSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEET 1140
                 G + G+             E+ CV+E   G  SD+   +       SV    +E+
Sbjct: 1081 PVAGNGTSVGD----DVNGGGSDTEDACVLETGSGISSDKSGCRMNEDMPLSVINMDDES 1140

Query: 1141 VSAVTDNMRTS-MEFEESTALQQSDEKGAEAVGNLIFETLKEEDVPNPSQPTHDHKIEGS 1200
              A T N++T  +  EE   + Q D +G + + +L  + ++ ED PN      D   +  
Sbjct: 1141 DPAETMNLQTGPLRSEEKNVMGQLDHEGGKTLKSLASDAVETEDRPNLVLDDADCVRDAQ 1200

Query: 1201 SENTESGKSCN----EPDILRSESVSTVDE-NSAAVSEGRATVKLAIGEEVGSDTNLHGQ 1260
                 S  +      E  IL +ES       N    + G    KL    E+ SD N    
Sbjct: 1201 KSRVFSADALKDDAAEEGILIAESEPVGGGINFDPTNPGMDGEKLM--GELPSDGN---T 1260

Query: 1261 STILCS------GQDSTGNDSNIALEGSSVG--LDPHILHPNILKVEPVEKKSCIK-SEE 1320
             T  CS        +S+GN S +A  GS  G  L+P  LH   + +  ++K S I    E
Sbjct: 1261 DTSRCSLPGSVHDSNSSGNASALAGGGSCSGFSLNPECLHQVSVGLNSMQKPSVISMPHE 1320

Query: 1321 NFLSVRNS---DTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNL--LC 1380
            N  +  +S   D+  I  E+  NQDILS TL LQE      K +  D+  +H   L    
Sbjct: 1321 NRHAPADSVSPDSAKIECEKAFNQDILSSTLDLQE--GREPKSVGIDECNKHLPGLPIYT 1380

Query: 1381 NSESSTFPRSYPFNKQIFEDINRNINHAYF-RVQGLSKPDINCNSKYVSEGQFLQNCNSS 1440
            N ESS   + YP      +D N ++       VQ  SKPD   N  Y+++  FLQ  N  
Sbjct: 1381 NVESSQVLKGYPLQMPTKKDTNGDVTSGNLSEVQNFSKPDRKINGHYMTKDGFLQFGNCK 1440

Query: 1441 KPHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSS 1500
               +  + P   + +E      K  S      DSD P R GDVKLFG+ILS+  S   SS
Sbjct: 1441 PQCSEVDFPLAPRKVEQPVGPPKAHS--WSSSDSDKPSRNGDVKLFGKILSNPSSLSKSS 1500

Query: 1501 SGSNECGEKKGPLHNSSSKSCDMG------------------------ENIPLRSYGFWD 1560
            S  +E  EK    H  S+ S ++                         E +P RSYGFW+
Sbjct: 1501 SNIHENEEKGAHNHKLSNTSSNLKFTGHHNADGNSSLLKFDCSSYVGIEKVPRRSYGFWE 1560

Query: 1561 GSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPT 1620
            G+++  G  +  DSAIL AKYPAAF  +  TS K EQQPLQA+  N D+++N  VS FP+
Sbjct: 1561 GNKVHAGYPSFSDSAILLAKYPAAFGNFPTTSSKMEQQPLQAVVKNNDRNING-VSVFPS 1620

Query: 1621 KD-----GVVDYHSY-RSRDGVKMRPFPVDI-------FSEMHRRNGFDAVSLSSLQQQG 1648
            ++     GVVDY  + RSRDG K+ PF VD+         +M RRNGFD  ++SSLQQQG
Sbjct: 1621 REISGSNGVVDYPVFSRSRDGAKVPPFTVDVKQQQRQDVFDMPRRNGFD--TISSLQQQG 1680

BLAST of CsaV3_5G036430.1 vs. TrEMBL
Match: tr|F6HNI1|F6HNI1_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_13s0019g04010 PE=4 SV=1)

HSP 1 Score: 1169.5 bits (3024), Expect = 0.0e+00
Identity = 798/1762 (45.29%), Postives = 1049/1762 (59.53%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60
            MPPEPLPWDRKD FKERKHE+SE++G +ARWRDS+ GSREF RWGSA++RRP GHGKQGG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSESLGFSARWRDSHQGSREFARWGSAEVRRPPGHGKQGG 60

Query: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGD--GKYIRIGRESRGSFSHRDWRSH 120
            WH F E+S HG+ PSRS SD+++EDE+ RP   RGD  GKY R  RE RGSFS +DW+ H
Sbjct: 61   WHIFPEESGHGFVPSRS-SDKMVEDENSRPFTTRGDGNGKYSRNNREIRGSFSQKDWKGH 120

Query: 121  SRDANNGFGNPSRRTSSQDVSSDQRSVDDTVTYS----------------SPQSFHGLEN 180
              +  N   N S R+ +    +DQRSVDD + +S                   S +GL  
Sbjct: 121  PLETGNASPNMSGRSLA---INDQRSVDDMLIHSDFVNGWDQLQLKDQHDKMGSVNGLGT 180

Query: 181  GPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKN-------EKADLPLRVASP 240
            G R++ E SL S DWKPLKW+RSGSLSSRGS +S S++SK+        + DL  R  +P
Sbjct: 181  GQRAERENSLSSIDWKPLKWTRSGSLSSRGSGFSHSSSSKSMGVDSNEARGDLQPRNVTP 240

Query: 241  IESPSAEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSG 300
            ++SPS +A ACV S+ PSE+  SRKKPRLGWG+GLAKYE++KVE PD S+ K   +  + 
Sbjct: 241  VQSPSGDAVACVASTAPSEETSSRKKPRLGWGEGLAKYERKKVEGPDESVNKNGIVFCTS 300

Query: 301  SGELTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGAD---GMI 360
            +GE THSL SN A+KSP+ + FSDCASPATPSS ACSSS G+E+K FSK    D     +
Sbjct: 301  NGESTHSLNSNLADKSPRVMGFSDCASPATPSSVACSSSPGMEEKSFSKAGNVDNDTSTL 360

Query: 361  CSSPGSGSQN-LQKLLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKST-LNKLL 420
              SPG  S N L      +E +E + +ANLG S +EL  SDDP++++S F +ST ++KLL
Sbjct: 361  SGSPGPVSLNHLDGFSFILESLEPNQIANLGFSPIELLQSDDPSSVDSNFMRSTAMSKLL 420

Query: 421  AYKGEISKTLEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKEQDGIS 480
             +KG+ISK+LEMTESEID+LENELKSLKS             +  V   +   +EQ   S
Sbjct: 421  IWKGDISKSLEMTESEIDTLENELKSLKSGXXXXXXXXXXXXSFPVEGKAKPCEEQGAAS 480

Query: 481  CIATRPAPLVVVSSSDATVEKVPLCKGDVGVED--VDTKADEIDSPGTVTSKFNEPSRVV 540
             +  RPAPL +V   D   +K  L  G   +ED   + K ++IDSPGT TSKF EP  +V
Sbjct: 481  NLILRPAPLQIVPPGDMMTDKTLL--GSDAMEDAHAEVKDEDIDSPGTATSKFVEPPCLV 540

Query: 541  K-AIASDIVDNGHCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNEC-TLAKSCTSES 600
            K A  SD+V  G CS          ME    +SGP V+E     SG +   L +S T   
Sbjct: 541  KTASPSDMVIQGECSGNLKITRSTNMEVELLVSGPNVEETGISTSGGDSRLLVESKTGAR 600

Query: 601  VYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETEK 660
            V GD+         + + I A NK+ A+RA+EV  K  P   C+       N +C + + 
Sbjct: 601  VSGDMGVLDDEEDKIYNLILASNKDCANRASEVFNKLLPQNQCQNDILGAANFACRQNDS 660

Query: 661  LIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEGL-LHSVKKCRSRPQKK-ELSLRVTHS 720
            LIK+KF MRK+FL+FKE  +TL+F+  Q  WKE + L S++K R++ QKK ELSLR +H 
Sbjct: 661  LIKQKFAMRKRFLRFKEKVITLKFRVSQHVWKEDMRLLSIRKYRAKSQKKFELSLRTSHC 720

Query: 721  GHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDKK 780
            G+QK+R SSIRSR     A   S   T   + ++SK+L   Q+KL RN LKMPA+ILDKK
Sbjct: 721  GYQKHR-SSIRSRF-SSPAGNLSPVPTAEMINYTSKMLSESQMKLCRNILKMPALILDKK 780

Query: 781  EKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLKT 840
            EK A RFIS NGLVEDPCAVE ER +INPWT+ EKEIF +KL++FGK+FKKI+SFLD KT
Sbjct: 781  EKTASRFISSNGLVEDPCAVENERTMINPWTAEEKEIFMDKLAIFGKEFKKIASFLDHKT 840

Query: 841  TADCIQFYYKNHKSDSFKK-NKNLELGKQMKS-SAITYLVTSGKKWNPDANATSLDILGV 900
            TADC++FYYKNHKSD F+K  K LEL KQ KS SA TYLVTSGKKWN + NA SLD+LG 
Sbjct: 841  TADCVEFYYKNHKSDCFEKTKKKLELRKQGKSLSATTYLVTSGKKWNREMNAASLDMLGA 900

Query: 901  ASVMAAQADYDIENQQKCTRH--LGVGRDVESKVSWSASSPNKSNLDDLQTEKETVAADV 960
            ASVMAA+A   +EN Q C     LG   D  +    +      S+ D ++ E+ETVAADV
Sbjct: 901  ASVMAARAGDSMENLQTCPGKFLLGAHHDYRTPHGDNGVVERSSSYDIIRNERETVAADV 960

Query: 961  LXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDS 1020
            L                    DP E  RE +  +V    K P   +V Q  D E CSD+S
Sbjct: 961  LAGICGSLSSEAMSSCITSSLDPGEGYRELR-QKVGSGVKRPLTPEVTQSIDEETCSDES 1020

Query: 1021 SEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMH 1080
              ++D ++WTDEEK +F+QAVSSYGKDF  ISRC+R++SRDQCK+FFSKARKCLGLDL+H
Sbjct: 1021 CGEMDPADWTDEEKCIFVQAVSSYGKDFAKISRCVRTRSRDQCKVFFSKARKCLGLDLIH 1080

Query: 1081 TSGDVGETPGNXXXXXXXXXXXXXEEHCVVE----ICEGRGSDEFISKSINGGSTSVNIN 1140
               +VG TP +             E+ CVVE    IC  +   +    S+    + +NIN
Sbjct: 1081 PGPNVG-TPES---DDANGGGSDTEDACVVEAGSVICSNKSGSKMEEDSL---LSVLNIN 1140

Query: 1141 HEETVSAVTDNMRTSMEFE-ESTALQQSDEKGAEAVGNLIFETL----KEEDVPNPSQPT 1200
             +E+  +   N++T +    E+  + + D K  E V NL+ +      K E V   S   
Sbjct: 1141 PDESDFSGMKNLQTDLNRSYENNGIGRVDHKDDETVTNLVSDKCHQLEKTEQVFGDSNSL 1200

Query: 1201 HDHKIEGSSENTESGKSCNEPDILRSESVSTVD-----ENSAAVSEGR--------ATVK 1260
            +    +  + + E    C + + +  ESVS V+     + S AVS+              
Sbjct: 1201 NGIDSKSLTLHVEKNGPCTKME-MDHESVSAVEATDPSDRSNAVSQAEDLTEGNLLPETS 1260

Query: 1261 LAIGEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVGLDPHILHPNI---LKVEPV 1320
            L +  E  +D +  GQ ++ C+ +DS   ++ +    +S      I +      + VE  
Sbjct: 1261 LNVRREENNDADTSGQMSLKCTVKDSEVKENALHQVPNSTSCPRFIFNSGCQDQVSVELD 1320

Query: 1321 EKKSCIKS--EENFLSVRNS---DTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDD 1380
             +K  + S  +E+ L   +S   D+ VI  E+ L+Q +   TL L+E  D N K +  D+
Sbjct: 1321 NQKPGVISLLQESSLMAEDSVPKDSSVIQYEKTLDQGMSPSTLDLKETKDKN-KSIGVDE 1380

Query: 1381 DAEHPN-----NLLCNSESSTFPRSYPFNKQIFEDINRNIN--HAYFRVQGLSKPDINCN 1440
              +H +     N   N+E S      P      ED+NR+++  +     + LSK D +  
Sbjct: 1381 YHQHLSGHSLLNNAVNAELSQKVGGCPLQTPPKEDMNRDLSCKNPSSAAERLSKLDRDIQ 1440

Query: 1441 SKY-VSEGQFLQNCNSSKPHNL-AEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGD 1500
            S + +++  +LQ CN SK H+L  E PFLSQ++E    +Q  A G     D++   R GD
Sbjct: 1441 SSHSLAQDCYLQKCNGSKSHSLGTELPFLSQSLE-RTSNQTRAHG-RSLSDTEKTSRNGD 1500

Query: 1501 VKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKSCDMG------------------ 1560
             KLFGQILSH PS QN +S SNE  +K       SSKS ++                   
Sbjct: 1501 FKLFGQILSHPPSLQNPNSCSNENDDKGAHNPKLSSKSVNLKFTGHHCIDGNLGASKVDR 1560

Query: 1561 ------ENIPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGY-SATSVKTEQQPLQ 1620
                  EN+P+ SYGFWDG+RIQTG S+LPDS +L AKYPAAFS Y  ++S K EQQ LQ
Sbjct: 1561 NNYLGLENLPM-SYGFWDGNRIQTGFSSLPDSTLLLAKYPAAFSNYPMSSSTKIEQQSLQ 1620

Query: 1621 ALSNNGDQSLNELVSAFPTKD-----GVVDYHS-YRSRDGVKMRPFPV------DIFSEM 1645
             +  + +++LN  +S FPT+D     GV DYH  +R RD  K++PF V      D+FSEM
Sbjct: 1621 TVVKSNERNLNG-ISVFPTRDMSSSNGVADYHQVFRGRDCTKLQPFTVDMKQRQDLFSEM 1680

BLAST of CsaV3_5G036430.1 vs. TrEMBL
Match: tr|A0A251QJE8|A0A251QJE8_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G215600 PE=4 SV=1)

HSP 1 Score: 1168.3 bits (3021), Expect = 0.0e+00
Identity = 790/1745 (45.27%), Postives = 1025/1745 (58.74%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS-YHGSREFNRWGSADLRRPTGHGKQG 60
            MPPEPLPWDRKD FKERKHE+SE++GS ARWRDS +H  R+FNRW SAD RRP GHGKQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSESLGSVARWRDSPHHAPRDFNRWPSADFRRPPGHGKQG 60

Query: 61   GWHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHS 120
            GWH FSEDS HGY  SRS  D+++EDES RPS  RGDG+Y R  R++RGS+S R+ + HS
Sbjct: 61   GWHLFSEDSGHGYASSRS-GDKMLEDESCRPSFSRGDGRYGRNSRDNRGSYSQRECKGHS 120

Query: 121  RDANNGFGNPSRRTSSQDVSSDQRSVDDTVTYSSPQ---------------------SFH 180
             + ++G  N   R +  DV ++QR+ DD +TYSS Q                        
Sbjct: 121  WETSSGSPNTPGRPN--DVINEQRTQDDMLTYSSHQHSDFGSTWDQIQLKDQLDRMGGST 180

Query: 181  GLENGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKN--------EKADLPL 240
            GL  G + + E SLGS DWKPLKW+RSGS+SSRGS +S S++SK+         K +   
Sbjct: 181  GLGAGQKCERENSLGSIDWKPLKWTRSGSMSSRGSGFSHSSSSKSIGAIDFNEAKVESQP 240

Query: 241  RVASPIESPSAEATACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVA 300
            + A+P++SPS EAT CVTS+ PSE+  SRKKPRLGWG+GLAKYEK+KVEVPDGS+ K+ A
Sbjct: 241  KNATPVQSPSGEATTCVTSAAPSEETTSRKKPRLGWGEGLAKYEKKKVEVPDGSMNKDGA 300

Query: 301  LLSSGSGELTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGAGADG 360
            + S G+ E  HSL SN A+KSP+   FSDCASPATPSS ACSSS G+E+K F K A  D 
Sbjct: 301  VCSVGNMEPVHSLSSNLADKSPRVTVFSDCASPATPSSVACSSSPGVEEKSFGKTANVDN 360

Query: 361  ---MICSSPGSGSQNLQK-LLCSIEKMEISSVANLGSSLVELFHSDDPNTIESCFGKST- 420
                 C SP   SQ+  +    ++EK++ +S+ANLGSSL EL  SDDP++++S   + T 
Sbjct: 361  NNRNFCGSPSPMSQSHHEGFTFNLEKLDCNSIANLGSSLRELLQSDDPSSVDSGIVRPTA 420

Query: 421  LNKLLAYKGEISKTLEMTESEIDSLENELKSLKSVNGGNVSHKKSCSATRVMESSTYFKE 480
            +NKLL +KGEISK LE+TESEIDSLENELK L S +G +     + S+  V ++   FKE
Sbjct: 421  MNKLLIWKGEISKVLEVTESEIDSLENELKVLNSDSGASCPRPATSSSLPVEDNDKSFKE 480

Query: 481  QDGISCIATRPAPLVVVSSSDATVEKVPLCKGDVGVEDVDTKADEIDSPGTVTSKFNEPS 540
            Q  ++ + TRPAPL + SS DA VEK+ L  GD        K ++IDSPGT TSKF EP 
Sbjct: 481  QVTVTNLITRPAPLQIHSSGDADVEKMCLGNGDQVEFCGIVKDEDIDSPGTATSKFVEPL 540

Query: 541  RVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFPISGPFVDEHETIGSGNECTLAKSCTSE 600
              V + +SD++ +  CS   D I   K E    + G    + +    GN   L  S    
Sbjct: 541  LKVVS-SSDVMSHNDCSGDLDPIETTKGEAKCLVPGKDEVKTDLSACGNSSMLLGSEIVA 600

Query: 601  SVYGDLMAQAGSRSSLCDSIFACNKEYASRAAEVIFKRSPVGMCKISSKSTKNVSCSETE 660
             V G L        ++C+SI + NKE A+R+ EV  K  P    K+        S  + +
Sbjct: 601  PVSGGLGFCFSVVDTICNSICSSNKESANRSFEVFNKLLPREHYKVDISGVSISSSGKND 660

Query: 661  KLIKEKFVMRKKFLKFKESALTLRFKSLQQSWKEGL-LHSVKKCRSRPQKK-ELSLRVTH 720
             LIKEKF MRK+ L+F E  LTL++K+ Q  WKE L L S++K R +  KK ELSLR T+
Sbjct: 661  SLIKEKFAMRKRRLRFMERVLTLKYKAFQHLWKEDLRLLSIRKYRPKSHKKFELSLRATN 720

Query: 721  SGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHSSKLLLNPQIKLYRNTLKMPAMILDK 780
            +G+QK+R SSIRSR    G    S   T   +  ++KLL + Q+K YRN+LKMPA+ILDK
Sbjct: 721  NGYQKHR-SSIRSRFSTPG--NLSLVPTTEIINFTNKLLSDSQVKRYRNSLKMPALILDK 780

Query: 781  KEKIALRFISHNGLVEDPCAVEKERNLINPWTSAEKEIFWEKLSLFGKDFKKISSFLDLK 840
            KEK+  RFIS NGLVEDPC VEKER L+NPWT  EKE+F EKL+  GKDF+KI+SFLD K
Sbjct: 781  KEKMVTRFISSNGLVEDPCVVEKERALMNPWTPEEKELFIEKLTTCGKDFRKIASFLDHK 840

Query: 841  TTADCIQFYYKNHKSDSFKK-NKNLELGKQMKSSAITYLVTSGKKWNPDANATSLDILGV 900
            TTADC++FYYK+HKS  F+K  K  ++ KQ KSSA TYL+++GKKWN + NA SLDILG 
Sbjct: 841  TTADCVEFYYKHHKSVCFEKTKKKADMTKQGKSSAKTYLISNGKKWNREMNAASLDILGA 900

Query: 901  ASVMAAQADYDIENQQKCTRHLGVG--RDVESKVSWSASSPNKSNLDDLQTEKETVAADV 960
            AS +AA AD    ++Q  +  L +G  R+         +     + D +  E+ETVAADV
Sbjct: 901  ASAIAAHADGSTRSRQAFSGRLYLGGYRNTNPSRGDDTTVERSCSFDAIGNERETVAADV 960

Query: 961  LXXXXXXXXXXXXXXXXXXXXDPREELRERKCYRVDFAAKLPSLSDVMQKTDNEPCSDDS 1020
            L                    DP E  RE KC +VD  A+ P   DVMQ  D+E CS++S
Sbjct: 961  LAGICGSLSSEAVSSCITSSIDPGEGYREWKCQKVDSLARRPLTPDVMQNVDDETCSEES 1020

Query: 1021 SEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIRSKSRDQCKIFFSKARKCLGLDLMH 1080
              ++D S+WTD EK  F+QAVSSYGKDF MISRC+R++S+ QCK+FFSKARKCLGLDL+H
Sbjct: 1021 CGEMDPSDWTDAEKSSFIQAVSSYGKDFAMISRCVRTRSQHQCKVFFSKARKCLGLDLVH 1080

Query: 1081 TSGDVGETPGNXXXXXXXXXXXXXEEHCVVEICEGRGSDEFISKSINGGSTSVNINHEET 1140
                 G + G+             E+ CV+E   G  SD+   +       SV    +E+
Sbjct: 1081 PVAGNGTSVGD----DVNGGGSDTEDACVLETGSGISSDKSGCRMNEDMPLSVINMDDES 1140

Query: 1141 VSAVTDNMRTS-MEFEESTALQQSDEKGAEAVGNLIFETLKEEDVPNPSQPTHDHKIEGS 1200
              A T N++T  +  EE   + Q D +G + + +L  + ++ ED PN      D   +  
Sbjct: 1141 DPAETMNLQTGPLRSEEKNVMGQLDHEGGKTLKSLASDAVETEDRPNLVLDDADCVRDAQ 1200

Query: 1201 SENTESGKSCN----EPDILRSESVSTVDE-NSAAVSEGRATVKLAIGEEVGSDTNLHGQ 1260
                 S  +      E  IL +ES       N    + G    KL    E+ SD N    
Sbjct: 1201 KSRVFSADALKDDAAEEGILIAESEPVGGGINFDPTNPGMDGEKLM--GELPSDGN---T 1260

Query: 1261 STILCS------GQDSTGNDSNIALEGSSVG--LDPHILHPNILKVEPVEKKSCIK-SEE 1320
             T  CS        +S+GN S +A  GS  G  L+P  LH   + +  ++K S I    E
Sbjct: 1261 DTSRCSLPGSVHDSNSSGNASALAGGGSCSGFSLNPECLHQVSVGLNSMQKPSVISMPHE 1320

Query: 1321 NFLSVRNS---DTGVIGREQMLNQDILSPTLVLQEISDANQKPMNRDDDAEHPNNL--LC 1380
            N  +  +S   D+  I  E+  NQDILS TL LQE      K +  D+  +H   L    
Sbjct: 1321 NRHAPADSVSPDSAKIECEKAFNQDILSSTLDLQE--GREPKSVGIDECNKHLPGLPIYT 1380

Query: 1381 NSESSTFPRSYPFNKQIFEDINRNINHAYF-RVQGLSKPDINCNSKYVSEGQFLQNCNSS 1440
            N ESS   + YP      +D N ++       VQ  SKPD   N  Y+++  FLQ  N  
Sbjct: 1381 NVESSQVLKGYPLQMPTKKDTNGDVTSGNLSEVQNFSKPDRKINGHYMTKDGFLQFGNCK 1440

Query: 1441 KPHNLAEPPFLSQNIELGHDHQKNASGXXXXXDSDVPRRKGDVKLFGQILSHAPSQQNSS 1500
               +  + P   + +E      K  S      DSD P R GDVKLFG+ILS+  S   SS
Sbjct: 1441 PQCSEVDFPLAPRKVEQPVGPPKAHS--WSSSDSDKPSRNGDVKLFGKILSNPSSLSKSS 1500

Query: 1501 SGSNECGEKKGPLHNSSSKSCDMG------------------------ENIPLRSYGFWD 1560
            S  +E  EK    H  S+ S ++                         E +P RSYGFW+
Sbjct: 1501 SNIHENEEKGAHNHKLSNTSSNLKFTGHHNADGNSSLLKFDCSSYVGIEKVPRRSYGFWE 1560

Query: 1561 GSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPT 1620
            G+++  G  +  DSAIL AKYPAAF  +  TS K EQQPLQA+  N D+++N  VS FP+
Sbjct: 1561 GNKVHAGYPSFSDSAILLAKYPAAFGNFPTTSSKMEQQPLQAVVKNNDRNING-VSVFPS 1620

Query: 1621 KD-----GVVDYHSY-RSRDGVKMRPFPVDI-------FSEMHRRNGFDAVSLSSLQQQG 1648
            ++     GVVDY  + RSRDG K+ PF VD+         +M RRNGFD  ++SSLQQQG
Sbjct: 1621 REISGSNGVVDYPVFSRSRDGAKVPPFTVDVKQQQRQDVFDMPRRNGFD--TISSLQQQG 1680

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004142488.10.0e+00100.00PREDICTED: uncharacterized protein LOC101222167 [Cucumis sativus] >KGN52286.1 hy... [more]
XP_008446909.20.0e+0091.65PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 [Cucumis me... [more]
XP_023516042.10.0e+0077.50uncharacterized protein LOC111780021 [Cucurbita pepo subsp. pepo][more]
XP_022956541.10.0e+0077.21uncharacterized protein LOC111458252 [Cucurbita moschata][more]
XP_022989544.10.0e+0077.37uncharacterized protein LOC111486582 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT3G52250.11.1e-12329.18Duplicated homeodomain-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q4KKX4|NCOR1_XENTR1.9e-1331.63Nuclear receptor corepressor 1 OS=Xenopus tropicalis OX=8364 GN=ncor1 PE=2 SV=1[more]
sp|O75376|NCOR1_HUMAN3.7e-1229.78Nuclear receptor corepressor 1 OS=Homo sapiens OX=9606 GN=NCOR1 PE=1 SV=2[more]
sp|Q60974|NCOR1_MOUSE3.7e-1229.78Nuclear receptor corepressor 1 OS=Mus musculus OX=10090 GN=Ncor1 PE=1 SV=1[more]
sp|Q8QG78|NCOR1_XENLA6.2e-1230.23Nuclear receptor corepressor 1 OS=Xenopus laevis OX=8355 GN=ncor1 PE=1 SV=1[more]
sp|Q55DP9|MYBP_DICDI4.1e-1134.88Myb-like protein P OS=Dictyostelium discoideum OX=44689 GN=mybP PE=3 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0KWU7|A0A0A0KWU7_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G623500 PE=4 SV=1[more]
tr|A0A1S3BG74|A0A1S3BG74_CUCME0.0e+0091.65LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 OS=Cucumis melo OX=365... [more]
tr|M5XR66|M5XR66_PRUPE0.0e+0045.33Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G215600 PE=4 SV=1[more]
tr|F6HNI1|F6HNI1_VITVI0.0e+0045.29Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_13s0019g04010 PE=4 SV=... [more]
tr|A0A251QJE8|A0A251QJE8_PRUPE0.0e+0045.27Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G215600 PE=4 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
Vocabulary: INTERPRO
TermDefinition
IPR009057Homeobox-like_sf
IPR017884SANT_dom
IPR001005SANT/Myb
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsaV3_5G036430CsaV3_5G036430gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsaV3_5G036430.1CsaV3_5G036430.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_5G036430.1.exon1CsaV3_5G036430.1.exon1exon
CsaV3_5G036430.1.exon2CsaV3_5G036430.1.exon2exon
CsaV3_5G036430.1.exon3CsaV3_5G036430.1.exon3exon
CsaV3_5G036430.1.exon4CsaV3_5G036430.1.exon4exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_5G036430.1.cds1CsaV3_5G036430.1.cds1CDS
CsaV3_5G036430.1.cds2CsaV3_5G036430.1.cds2CDS
CsaV3_5G036430.1.cds3CsaV3_5G036430.1.cds3CDS
CsaV3_5G036430.1.cds4CsaV3_5G036430.1.cds4CDS


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 393..420
NoneNo IPR availableGENE3DG3DSA:1.20.58.1880coord: 932..1042
e-value: 1.2E-16
score: 63.1
NoneNo IPR availableGENE3DG3DSA:1.10.10.60coord: 750..822
e-value: 3.0E-24
score: 86.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1431..1461
NoneNo IPR availablePANTHERPTHR13992NUCLEAR RECEPTOR CO-REPRESSOR RELATED NCORcoord: 1379..1629
coord: 1..1192
NoneNo IPR availablePANTHERPTHR13992:SF7DNA-BINDING PROTEIN SNT1-RELATEDcoord: 1379..1629
coord: 1..1192
IPR001005SANT/Myb domainSMARTSM00717santcoord: 985..1033
e-value: 2.9E-6
score: 36.8
coord: 770..818
e-value: 3.1E-5
score: 33.4
IPR001005SANT/Myb domainPFAMPF00249Myb_DNA-bindingcoord: 988..1027
e-value: 5.8E-6
score: 26.3
IPR001005SANT/Myb domainCDDcd00167SANTcoord: 988..1027
e-value: 2.54493E-5
score: 42.9478
IPR017884SANT domainPROSITEPS51293SANTcoord: 984..1035
score: 11.622
IPR017884SANT domainPROSITEPS51293SANTcoord: 769..820
score: 13.656
IPR009057Homeobox-like domain superfamilySUPERFAMILYSSF46689Homeodomain-likecoord: 757..819
IPR009057Homeobox-like domain superfamilySUPERFAMILYSSF46689Homeodomain-likecoord: 983..1035