Lsi04G018740 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G018740
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Protein PAT1
Location: chr04 : 25871282 .. 25880247 (-)
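As a rough illustration of how the location field can be read programmatically, the sketch below parses the coordinate string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("chr04 : 25871282 .. 25880247 (-)")
print(seqid, strand, span_length(start, end))
```

The "(-)" strand means the gene is annotated on the reverse strand of chr04, so a sequence extracted from the chromosome at these coordinates would need to be reverse-complemented to read in the gene's own orientation.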
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AATCGTTATCTCTTTTTAGGGTTCCAATTCCAAATTTGGTTTCTCTCTCCGATCCAATTTTCAATTTCAACCCTTAAAAAGCCTTCACTGACACAGCGAAAAGCAAGCTCAATCCATGAGGACGACACACTAGTATTACTCTACCACCAACTGTACATATTTTTTGGGATTTTTTTTCCAGGCCATTTCCGCACAAACCCTAGGATTTTGACCTGTTAGGGTTTTTGATTTATTTCCTTTTCGTATTTCGTTTAGTTTTGGATCCTATTTCGGCGTACTTCTTCTTGTTTATTAAGGTTAAGATTCGATTTCCATGTTCTTGTTCTCCCTCTCGAGTAATTGAGTCGCAGATATGGATGGTTTTGGTAACGGAGCTAGAGTTCAAGTGGCATCTACATCCGAGGATCTCAAGCGTTTTGGAGCCAATTCTACGGGTATTTTTCTTTGAAACCTTGTTACTTCAGATCCATTTTGTTTGAACTTCTTTTTTTAGTTGTGTTGATGAAGTCGGTAATTGAGAATTTTGAGCTCGAATGGTGGTAGTTGTTGTGTATTAATTATTGTTCAACTGTTGTGTTCTCGAATCTTCGGAAGGATGTGAGGATTTTGTCCTCGTCCTGTTATTGAGATGGAATAATCGTCAAGGATTAAGTAGATAACAGTTTGGTTGGGGATTATGGCAGTTGTCTTCGATGGGTCTCAATTCTCTACATAGGAATTTCGTGTCCATTTTTTTAGGAAAGCGTTCTTGGTTTCGAGCAGTGATTACTGTTATTTTATGATGTAAATGTCATTTAGATAATCATTGGAGTTCGTATGTGTTATTGCAGCAGCTGCATTTTAAATTGTTGTTTTCTTTTTCTTTTTTGAAACTTCACCAGAAGGATGGGGAGGGAGCTATAGAGGTGGTTTATTTGATTCTTAAACAGAACTCTTGATTCTGAAAAATCACGTTTATTTACTTCATTTATTTAATGTTAGATCTGAAAAATGTTGTGATCTCTACTTTTATAATGAAGATGCAAGCATTGTGATTTAATTTTCAATGTTGCAGAAGATGCTCTGTTTGATGCATCCCAGTATGCATTTTTTGGCAAGGATGTCATGGAGGAGGTTGAATTGGGGGGATTAGAAGATGAAGAGGATGATACACTTGCTGCTGGGAATGAGGAGGAGGAGTTTTTGTTTGATAAGGAGGTATCTGTTATTTCCATTGTTCTATACAGTTATGTTTCAATTTTTTTTTATCTGGATGCACATGTTTCTTTTTAGAGTCGTTGGTAGTTGCATGTGATTGACTAGAAGTCTAACTTCAATCCCATGCCCCACACTACAAGTTTCGGAATTGAAATTTAAAAAATCACCCAACAGAAAAGTTGGATATCATAAATTTTTGTTTGACTGCAGTTTTTCTGGCTGCTTATTCTACGAGTTTATCGTGTGGTTGTGTGCTTGAAGTCCTTGCATCATCTCCATTTTCTTTATTGATGATAAATCTATCATCAATAAGTTTGAGGGGTGGGGGATATAGTTGTTACTTGTGAGGAATCAAGGAAAGAGGGAAATACTAAAAAGTTCATGAGAAATATGCTAAAGAGGTCAAACCCCAAACGGCTTCGCTGATCTACGATGCTCAAAATAGCTTCTTCCATTTATTTCCAACCAAATATTCTACAATATAGCAATAAAACCAGCTTGCCAACTAGATTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGATAATTGTCCCAAGACAACTTTTTTGTCATGGAGCATGAAAAGTATCTCCAAGTTGGAGGTTAGGTTCAATTTGATCCGCTTCCCCATGCCAGTTTTGTAACTATGAAATCCCTTAGCTTATTATTTTCTTGTTTGTACTATATTTCCTCCAAGAGGCTTCATTTGGTGGCCTCTGTGTTACCTTCTATTGGTTCTCTTTCTCTTGGGTTATGTCGTTCCTATCTCCGATAGGAAAATGACGAACATTGTGGCTCTTTTTT
CCACGTTGGAGTTTTTTTAGTCTGTGTGGGGAGGAAAGGTATTTGTCATTCGGGTACAATCCTTTAGAAGGATTCTCTTGTAGATCCCCTTTTTTAGTGGCCCAATCTTTACCTTGCTTTGGAAGGTTGAAATGCCTAAGTAGGTCTGGATCGCGTGTTGAGCAAGTCCTCTTTGACAGGGACATAGTGTTGTATCCTTTGTAGAAGAGCATCTGATGACCTCGATCACATGTTGTGTATATGTCGTTTTGCCTCATTGGTTTGGGATTGTTTTCACACGGCTTGACCTGTTGTGTGCAGATCTATGCTGGAGGTGTTTCTTCTTCCATCACCCTTTTGCGCTAAGGGTTGTTTGTTTGCCGATGTTTGAGCTATTTTGTGGGTTAAGGTTTCTCTTTGGATTTTCATGGTCAAGAGTTTTATAATTATTCCTTAGGTATCATTTTCTCAGATCGGAGCTTTTTTTTTAGCCTAGGTCGGTTCCTTTTGTGGCTTTTTTTTATTTTGGTATTCTCTTTCTATTCTTTCATCTTCCCTTCATGAAAGTTTGATTAGTGCTATCTTGACCATTTATGTTGTAGATATTTTATTAGTTCTATCCTTTTGGACTACTTTGTAAAGAAAACACATACTGGTTTTAGGTTAAGCCAGACAACATAACAAGGTGGGCCATAGGAATTCATGCTTGTATCTCATATAATTGTAAGTTTTTGAAGTTGCGTGTTGTAGTCTTGATGGGAAAGACTGGTGCTAATGTGAGCTCCGAAATTTTTTAAAGCACTTTGAGCAAATTGGTTTATTACTTGTTCTCTTATTGCACAAGTCGCATTTGGGCTACTGGTTTTTTCTTGCTCTGTTACAAAAATATGCCTTGTTATATTAAGGGCTAACACAACCGTTGGGTGTTCATTTATAATTCATAAGTGCCATGTACTTGGTTTTAAGGTTTTAACTAACTTCTTTCTTTGGTTTGCAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGGTTAGAAATATGAAAGTCTTTGCCCTCCTTCTATTCTTGTTTAGAACTTAGAAGTGGCTCCCTAGGAAGCACGGACACGTTGGTTTTGATGGAGTGTTCGTGTTAGACACGTGTCGGACACCGACACGTCATTTTACGTGTCTTTTTTTTTTTTTTTTTTTTNTTCTTGCTCTGTTACAAAAATATGCCTTGTTATATTAAGGGCTAACACAACCGTTGGGTGTTCATTTATAATTCATAAGTGCCATGTACTTGGTTTTAAGGTTTTAACTAACTTCTTTCTTTGGTTTGCAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGGTTAGAAATATGAAAGTCTTTGCCCTCCTTCTATTCTTGTTTAGAACTTAGAAGTGGCTCCCTAGGAAGCACGGACACGTTGGTTTTGATGGAGTGTTCGTGTTAGACACGTGTCGGACACCGACACGTCATTTTACGTGTTTTTTTTTTTTTTTTTTTATTTATAATTTTTTAATAATCAGACACTCGGTGACACGTTCAGACACGCTCAAACAACTCCATGTTCTATTTCCTTTCCCTCCTATTCTCTCTTTTCTTTCGCCGGAGGACCACTATTTCGTCATCGGATGTTGGAGAGCGTTGGCGGGAGTTGGCGATTGTCAGGAATCTCAGGATGGTGACGGCAGATCAATGGTGAAGGGAGGAGGACTGAACGAAATTGGGGATGAGAGTAGAATAGGGAGAGGGGAGAACATAATCGGGGGAAGAAGAAGAGAAGAAGGCTGAAGAGAGAATGAGATGTAGGTTTTTTAATCGGGGGAGGAGGACTAATAGTTAGGGTTTTCTTGTTTTAATATGGCTGCTGAAGGAAGAATAAGATATTAGGGTTTCTTTTTGTTCTAGTGTTTGTTGGGCCTTTTAATGTAGGTTTTTTTAATGGGTTTTGGGCCTTTGT
CTTTTTTAGTTTTAGTGGGCAAAAATTTATTGTTTCATGCACCAACACCATTAACATTTATAATTAAATTAAACTTTTTTTAAGCAACTGAATAATCAGCATATTTGTATTTACTTTTCATTCATAATTTTTTAATAATTATTTTTAATAGTTTGTGAAACAATTAAAGTATTGGGTGTTTTCTTTTAATATTATTTTAATAATAATAATAAAAAATAACGTGCTCCAACGTGTCGTGTCCTACATTTTTAGAAATTGACGTGTCGCCGTGTCGTGTCGTGTCGCGTGTCCGTGTTCGTGCGTCCTAGGTGGCTCCCTTATCTGATTTTCCATCTCTATGATTTTCTATTCAAACTTTGTATCCGTATCTTCCTTCATATACAAAAGATGGAATTCTGAAGCTTTTAGTGTATATTTGCATCTCCATAAGATCATGTCATCTTCAGGATTTTGTACTCCGTAGTGTTAAGCTAGAATGATAGTTTGCTTTGTAACAAGAGGATACTGGGCAGGGAGGTTAAAAGAGGTGTTGATATCATCATTATTGTGTGTTCTTTCAATTATCTATATAGTTTCTAATGAGCAATGAATATAATTGACAATCGGAAGATAGCTTGATGGATAAAGACACTTGTTACCTTACTTGAAGACAAATGTTCGATCCCCACCATTTCTAATTTTCTACCCAAGAAAATTATGCTTCTTATCATTGATAATTTATTATTATTGTCTCAAATTGTCACATTGCGAAGGAGTCTGGACAAACAATTGCTGTCTTAGATGATATAGAATTTAAAAAGTCAGTTTGGAATTTCAAATTATGCCTGTTTCTAGTGATTTACTATCTAGTTTTATGTGTGGGAAGAACATAATTGATTGCTCCCATTTCAATATTAAGGTCCATTTGTAGAAAGTTTGTAGAAATTTCTTCTGGAGATAACCGTGAAAGCCAATTTGATTTGACATTGTCCATGTTGAGCAACTAAAGATTGATTTGTTTTGGCAGAAGTTGCTGGTTGCATTTTCTTTTCCTCTAATTGGGCATTTTTAGAACTCCCTGCATGCGTGGCATTTTGTTTCTTTATTGTCATCTATATTTGATTAGTATTTTGTGTCATTTCAGTTGAACGAGGTTGGTAGCGGGCCAAGGGGAGTTATTGGAGGCAGAATATTTAGAGACAGTAAGTTTTGCCTATCAACTCTTTGTGATGTTAAATTAGTACTCGGTTGGTCATATGGTAATGTAATAGTTCTGATAGAATAGTTTGCTTCTGTCGAGATGGTCTATGGAACATGGCTTTATGTCACAGACAAACATAACTTGGTGCTCTGAACTCCCTCCAACAGGGCAGGTCTAGAAATTTTCTCCGTGGGAAGATCTACATTTCTTTTTAAATCTCTTTCAATTAAAAAAAATTAAGTGGGTAAATTAAAGGGAATTTTCAAATCTGTACTTAATATAATTTTTATTGTAAGTGTTCACTCCTTGTTCTTCATCATGATATAAGTCAACTTCTTACTCAAGGAAAGTGATCATAGACCTTTTGCAAATGTTAATATAATTTTTATTGTAAGTGTTCACTCCTTGTTCTTCATCATGATATAAGTCAACTTCTTACTCAAGGAAAGTGATCATAGACCTTTTGCAAATGTTAATTTTGACCACTCGTAGCTTGGATGCTATATATGCTATATATGTTAATGTTGGACATGTATTTTAGAGGCTAGGTATTTGCCCCCTTTTTTCCCTCTCTTTTGCTAATCATGGTATGTGATTTATGCCAACTGGTTATTTCATATTTACTCATGACTCTATTGTTGTTGTGAATAGTTGACTGATTTCTTTTATTTTCTTTCGTTTTTTTTCCCATCTACTCTTCTAGGTTCGTCAGTTAATGAATGGGCACGTGAGGAGGGTTTCTCTAATTGGCTTGCCCAACAAGGCTATAATGTTGAAAGTGCTCAGGAAGGCAAAAGATGGTCATCACATCCACATTCTT
CCTCTCTTGCAGAATCTACATCTTTGTATAGGACATCGTCTTTCCCTGATCAGCCGCAGCCGCAGCAATACCACCAACAGTTCTCTAGTGAGCCAATTTTGGTGCCAAAGTCTTCATATCCTCCTAGCGGAATATCTCCTCATGCTTCACCGAACCAGCATTCAAGCCATCTAAATATGCCTTTTGTTCCTGGTGGACGCCATGTAGTATCATTATCTCCATCAAATCTCACACCTCCAAACTCTCAGATTGCTGCTGGTTTTAATCCTGGATCACGGTTTGGAAATGTGCCGCAACTTAACTCTGGCCTCTCTATTAATGGTGGACCGCAGAGCCAATGGGTCAACCAAACTGGCATGTTTCCCGGAGAACATTCTAGTCACCTAAACAATTTATTGCCTCACCAGTTATCAAATCAGAATGGATTTCCGCAGTTACCACCACAGCAGCAGCAACAGCAACAGCAGCAGCAGCATAGATTGCAGCATCCTGTTCAGCCTCCATTTGGTGGTTCTCTACCAGGTTTTCAGTCCCATCTTTTTAATTCCCACCTGTCTTCGGGCCCGCCCCACTTAATGAACAAGTTGGAAGCCATGCTTGGCCTACCAGATATGAGGGATCAAAGGCCTAGGTCTCAGAAAGGTAGACAGAATACTCGTTTTATTCATCAGGGTCATGAGACCAATAGTTTTAGGAATGACATTGGGTGGCCTTTCTATAGATCCAAGTACATGACAGCTGATGAACTAGAAAATATTGTTAGAATGCAGCTTGCAGCAACGCATAGTAATGATCCATATGTAGATGACTACTATCATCAGGCTTGTCTTTCAAGAAAATCTGCAGGTGCAAAATTGAGGCATCATTTTTGTCCTAATCAACTAAGGGATCTTCCACCACGTGCCCGTGCCAATAATGAGCCACATGCTTTTCTTCAGGTTGAAGCGCTTGGTAGGGTCCCATTTTCATCAATTCGTAGACCTCGCCCTCTTCTTGAAGTTGATCCTCCAAGTTCATCTGTTGGTGGAAGCACTGATCAAAAGGTTTCTGAGAAGCCCCTTGAACAGGAGCCTATGCTGGCAGCTAGAGTTACGATTGAAGATGGTCATTGTCTACTTCTTGATGTGGATGATATTGATCGCTTCCTGCAATTCAATCAGTTCCAAGACGGTGGTGCTCAATTAAGAAGACGTCGCCAGGTACTGTTAGAAGGACTGGCTTCATCATTTCACATTGTTGATCCACTCAGTAAAGATGGTCACGCTGTTGGGCTGGCTCCTAAAGATGATTTCGTTTTCTTGAGGTTGGTTTCTCTTCCCAAAGGTCGAAAGCTTCTAGGAAAGTACCTTCAGCTACTCGTACCAGGAGGTGAGCTTATGCGAATAGTTTGCATGGCTATTTTCCGTCACTTAAGATTCTTGTTTGGTAGTGTTCCGTCTGATCCTGCGACAGCAGATTCTGTTAGTAATCTTGCAAGAATTGTTTCATTGCAAACCCATAGTATGGATCTTGGAGCTCTAAGTGCATGTCTTGCGGCTGTAGTTTGTTCCTCAGAGCAACCTCCACTTCGCCCTCTAGGGGCCCCTGCAGGAGATGGGGCGTCCTTGATTTTGAAATCTGTTCTCGAGAGAGCTACGGGACTCTTAACCGATCCTCATGCTGCAAGCAACTATAACATAACTCACCGAGCTCTTTGGCAGGCTTCTTTTGATGAATTTTTTGGACTTCTTGCAAAGTATTGTGTGAACAAGTACGATAGTATAATGCAATCATTACTCAGACAATCTCCACAGAATGCTGCAGCAGCTGTCTCGGATGCAGCCACTGCTATCAGCCAAGAAATGCCAGTTGAAGTATTGCGTGCAAGTCTTCCCCACACCGACGAGCACCAGAGGAAAGTGTTGATAGATTTTGCCCAACGCTCGATGTCTGTTGGTGGATTTATTAACAGTGGGGCCGAGCACAGTGGTCGCAACAATTTTGGTTCCTTATGATCAGGA
GGGAGTAAGTATGTTGCTCTTTATCTATATATACATTCTCAATGAAGAAGAAGCCTTCAGAATAATCTCCATCATCACATATGGAAGGACAAAGAAACCCGTTCCTCCATTTTGAGGTCTCTCTCTCTACTGCTCTTCTACTGCTCCCTTAAATTTTCTTATTCCTCTTCATGTTTTTACTTTCGAAAAATTCGACGCTCGTAAACGACGCCATTGTTATGTTGGGTTTAGTTTTTCATTTTTTCTTTCTTTCTTTCGGGGTTTGGTAGTAGAGTTTTAGATAGAATATAATGCTGTTGAGAATGGTGGGGTGGTCCAGATATGATATGATATTTGTGGATGGTTGAGGGTTAGAATGTACAGGGTTGGTCTGAGACCAATTTCCCATGGTCCTTCCCTATTAATACCTGAGGAGGTGGTGGGATTGTTTTTGTGTTTAAAAATCTACTTTTTTCTTTTTTAATGATGTTATTATAGTGTTCTATGCACACAAGAGAGAAACAAAAGAAAGGCTTTGATTTGATTGATGCAAATATATAAGAATTGGGGTAGTTTTGAGTATGGAATTGAACGTTTGTTTTTTTAGCCCTAAGAAACAAGGGGGATTCATAGATCTTGTTTTGGGAGGGAGTACTTTTTTCATTTATTTTTCTTTTCAGAAAAATGTAAGGATGGAAAATACTGTTTCCACAATTATTTTTTTTGAGTCATTTGTATTTACTAAATTTGGACGTTGTGTTGTGCTCTTTTGTATCTTTCTATGAATGATTAGATGAAAATGGAATAACACCTTTTTGGTTTAGGTGAAAGGTTGTTCTGTTCAGGTGATGTATCTTTTTGTGCTTTTTCTACAATGGAAATTAATTATTGTATTTCTCACCCCCCTAATGTTTGCTACTGATGGTGCATGCCATAATCAATACAATCAGTGTTGTGGGAATAAAAACTTATATTGGTTGTCTCCTT

mRNA sequence

AATCGTTATCTCTTTTTAGGGTTCCAATTCCAAATTTGGTTTCTCTCTCCGATCCAATTTTCAATTTCAACCCTTAAAAAGCCTTCACTGACACAGCGAAAAGCAAGCTCAATCCATGAGGACGACACACTAGTATTACTCTACCACCAACTGTACATATTTTTTGGGATTTTTTTTCCAGGCCATTTCCGCACAAACCCTAGGATTTTGACCTGTTAGGGTTTTTGATTTATTTCCTTTTCGTATTTCGTTTAGTTTTGGATCCTATTTCGGCGTACTTCTTCTTGTTTATTAAGGTTAAGATTCGATTTCCATGTTCTTGTTCTCCCTCTCGAGTAATTGAGTCGCAGATATGGATGGTTTTGGTAACGGAGCTAGAGTTCAAGTGGCATCTACATCCGAGGATCTCAAGCGTTTTGGAGCCAATTCTACGGAAGATGCTCTGTTTGATGCATCCCAGTATGCATTTTTTGGCAAGGATGTCATGGAGGAGGTTGAATTGGGGGGATTAGAAGATGAAGAGGATGATACACTTGCTGCTGGGAATGAGGAGGAGGAGTTTTTGTTTGATAAGGAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGTTGAACGAGGTTGGTAGCGGGCCAAGGGGAGTTATTGGAGGCAGAATATTTAGAGACAGTTCGTCAGTTAATGAATGGGCACGTGAGGAGGGTTTCTCTAATTGGCTTGCCCAACAAGGCTATAATGTTGAAAGTGCTCAGGAAGGCAAAAGATGGTCATCACATCCACATTCTTCCTCTCTTGCAGAATCTACATCTTTGTATAGGACATCGTCTTTCCCTGATCAGCCGCAGCCGCAGCAATACCACCAACAGTTCTCTAGTGAGCCAATTTTGGTGCCAAAGTCTTCATATCCTCCTAGCGGAATATCTCCTCATGCTTCACCGAACCAGCATTCAAGCCATCTAAATATGCCTTTTGTTCCTGGTGGACGCCATGTAGTATCATTATCTCCATCAAATCTCACACCTCCAAACTCTCAGATTGCTGCTGGTTTTAATCCTGGATCACGGTTTGGAAATGTGCCGCAACTTAACTCTGGCCTCTCTATTAATGGTGGACCGCAGAGCCAATGGGTCAACCAAACTGGCATGTTTCCCGGAGAACATTCTAGTCACCTAAACAATTTATTGCCTCACCAGTTATCAAATCAGAATGGATTTCCGCAGTTACCACCACAGCAGCAGCAACAGCAACAGCAGCAGCAGCATAGATTGCAGCATCCTGTTCAGCCTCCATTTGGTGGTTCTCTACCAGGTTTTCAGTCCCATCTTTTTAATTCCCACCTGTCTTCGGGCCCGCCCCACTTAATGAACAAGTTGGAAGCCATGCTTGGCCTACCAGATATGAGGGATCAAAGGCCTAGGTCTCAGAAAGGTAGACAGAATACTCGTTTTATTCATCAGGGTCATGAGACCAATAGTTTTAGGAATGACATTGGGTGGCCTTTCTATAGATCCAAGTACATGACAGCTGATGAACTAGAAAATATTGTTAGAATGCAGCTTGCAGCAACGCATAGTAATGATCCATATGTAGATGACTACTATCATCAGGCTTGTCTTTCAAGAAAATCTGCAGGTGCAAAATTGAGGCATCATTTTTGTCCTAATCAACTAAGGGATCTTCCACCACGTGCCCGTGCCAATAATGAGCCACATGCTTTTCTTCAGGTTGAAGCGCTTGGTAGGGTCCCATTTTCATCAATTCGTAGACCTCGCCCTCTTCTTGAAGTTGATCCTCCAAGTTCATCTGTTGGTGGAAGCACTGATCAAAAGGTTTCTGAGAAGCCCCTTGAACAGGAGCCTATGCTGGCAGCTAGAGTTACGATTGAAGATGGTCATTGTCTACTTCTTGATGTGGATGATA
TTGATCGCTTCCTGCAATTCAATCAGTTCCAAGACGGTGGTGCTCAATTAAGAAGACGTCGCCAGGTACTGTTAGAAGGACTGGCTTCATCATTTCACATTGTTGATCCACTCAGTAAAGATGGTCACGCTGTTGGGCTGGCTCCTAAAGATGATTTCGTTTTCTTGAGGTTGGTTTCTCTTCCCAAAGGTCGAAAGCTTCTAGGAAAGTACCTTCAGCTACTCGTACCAGGAGGTGAGCTTATGCGAATAGTTTGCATGGCTATTTTCCGTCACTTAAGATTCTTGTTTGGTAGTGTTCCGTCTGATCCTGCGACAGCAGATTCTGTTAGTAATCTTGCAAGAATTGTTTCATTGCAAACCCATAGTATGGATCTTGGAGCTCTAAGTGCATGTCTTGCGGCTGTAGTTTGTTCCTCAGAGCAACCTCCACTTCGCCCTCTAGGGGCCCCTGCAGGAGATGGGGCGTCCTTGATTTTGAAATCTGTTCTCGAGAGAGCTACGGGACTCTTAACCGATCCTCATGCTGCAAGCAACTATAACATAACTCACCGAGCTCTTTGGCAGGCTTCTTTTGATGAATTTTTTGGACTTCTTGCAAAGTATTGTGTGAACAAGTACGATAGTATAATGCAATCATTACTCAGACAATCTCCACAGAATGCTGCAGCAGCTGTCTCGGATGCAGCCACTGCTATCAGCCAAGAAATGCCAGTTGAAGTATTGCGTGCAAGTCTTCCCCACACCGACGAGCACCAGAGGAAAGTGTTGATAGATTTTGCCCAACGCTCGATGTCTGTTGGTGGATTTATTAACAGTGGGGCCGAGCACAGTGGTCGCAACAATTTTGGTTCCTTATGATCAGGAGGGAGTAAGTATGTTGCTCTTTATCTATATATACATTCTCAATGAAGAAGAAGCCTTCAGAATAATCTCCATCATCACATATGGAAGGACAAAGAAACCCGTTCCTCCATTTTGAGGTCTCTCTCTCTACTGCTCTTCTACTGCTCCCTTAAATTTTCTTATTCCTCTTCATGTTTTTACTTTCGAAAAATTCGACGCTCGTAAACGACGCCATTGTTATGTTGGGTTTAGTTTTTCATTTTTTCTTTCTTTCTTTCGGGGTTTGGTAGTAGAGTTTTAGATAGAATATAATGCTGTTGAGAATGGTGGGGTGGTCCAGATATGATATGATATTTGTGGATGGTTGAGGGTTAGAATGTACAGGGTTGGTCTGAGACCAATTTCCCATGGTCCTTCCCTATTAATACCTGAGGAGGTGGTGGGATTGTTTTTGTGTTTAAAAATCTACTTTTTTCTTTTTTAATGATGTTATTATAGTGTTCTATGCACACAAGAGAGAAACAAAAGAAAGGCTTTGATTTGATTGATGCAAATATATAAGAATTGGGGTAGTTTTGAGTATGGAATTGAACGTTTGTTTTTTTAGCCCTAAGAAACAAGGGGGATTCATAGATCTTGTTTTGGGAGGGAGTACTTTTTTCATTTATTTTTCTTTTCAGAAAAATGTAAGGATGGAAAATACTGTTTCCACAATTATTTTTTTTGAGTCATTTGTATTTACTAAATTTGGACGTTGTGTTGTGCTCTTTTGTATCTTTCTATGAATGATTAGATGAAAATGGAATAACACCTTTTTGGTTTAGGTGAAAGGTTGTTCTGTTCAGGTGATGTATCTTTTTGTGCTTTTTCTACAATGGAAATTAATTATTGTATTTCTCACCCCCCTAATGTTTGCTACTGATGGTGCATGCCATAATCAATACAATCAGTGTTGTGGGAATAAAAACTTATATTGGTTGTCTCCTT

Coding sequence (CDS)

ATGGATGGTTTTGGTAACGGAGCTAGAGTTCAAGTGGCATCTACATCCGAGGATCTCAAGCGTTTTGGAGCCAATTCTACGGAAGATGCTCTGTTTGATGCATCCCAGTATGCATTTTTTGGCAAGGATGTCATGGAGGAGGTTGAATTGGGGGGATTAGAAGATGAAGAGGATGATACACTTGCTGCTGGGAATGAGGAGGAGGAGTTTTTGTTTGATAAGGAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGAGTGAGGACTTTAGACCTCCATCTGATATTGACGATCTTGTTTCTTCATTTGAAAAGTTGAACGAGGTTGGTAGCGGGCCAAGGGGAGTTATTGGAGGCAGAATATTTAGAGACAGTTCGTCAGTTAATGAATGGGCACGTGAGGAGGGTTTCTCTAATTGGCTTGCCCAACAAGGCTATAATGTTGAAAGTGCTCAGGAAGGCAAAAGATGGTCATCACATCCACATTCTTCCTCTCTTGCAGAATCTACATCTTTGTATAGGACATCGTCTTTCCCTGATCAGCCGCAGCCGCAGCAATACCACCAACAGTTCTCTAGTGAGCCAATTTTGGTGCCAAAGTCTTCATATCCTCCTAGCGGAATATCTCCTCATGCTTCACCGAACCAGCATTCAAGCCATCTAAATATGCCTTTTGTTCCTGGTGGACGCCATGTAGTATCATTATCTCCATCAAATCTCACACCTCCAAACTCTCAGATTGCTGCTGGTTTTAATCCTGGATCACGGTTTGGAAATGTGCCGCAACTTAACTCTGGCCTCTCTATTAATGGTGGACCGCAGAGCCAATGGGTCAACCAAACTGGCATGTTTCCCGGAGAACATTCTAGTCACCTAAACAATTTATTGCCTCACCAGTTATCAAATCAGAATGGATTTCCGCAGTTACCACCACAGCAGCAGCAACAGCAACAGCAGCAGCAGCATAGATTGCAGCATCCTGTTCAGCCTCCATTTGGTGGTTCTCTACCAGGTTTTCAGTCCCATCTTTTTAATTCCCACCTGTCTTCGGGCCCGCCCCACTTAATGAACAAGTTGGAAGCCATGCTTGGCCTACCAGATATGAGGGATCAAAGGCCTAGGTCTCAGAAAGGTAGACAGAATACTCGTTTTATTCATCAGGGTCATGAGACCAATAGTTTTAGGAATGACATTGGGTGGCCTTTCTATAGATCCAAGTACATGACAGCTGATGAACTAGAAAATATTGTTAGAATGCAGCTTGCAGCAACGCATAGTAATGATCCATATGTAGATGACTACTATCATCAGGCTTGTCTTTCAAGAAAATCTGCAGGTGCAAAATTGAGGCATCATTTTTGTCCTAATCAACTAAGGGATCTTCCACCACGTGCCCGTGCCAATAATGAGCCACATGCTTTTCTTCAGGTTGAAGCGCTTGGTAGGGTCCCATTTTCATCAATTCGTAGACCTCGCCCTCTTCTTGAAGTTGATCCTCCAAGTTCATCTGTTGGTGGAAGCACTGATCAAAAGGTTTCTGAGAAGCCCCTTGAACAGGAGCCTATGCTGGCAGCTAGAGTTACGATTGAAGATGGTCATTGTCTACTTCTTGATGTGGATGATATTGATCGCTTCCTGCAATTCAATCAGTTCCAAGACGGTGGTGCTCAATTAAGAAGACGTCGCCAGGTACTGTTAGAAGGACTGGCTTCATCATTTCACATTGTTGATCCACTCAGTAAAGATGGTCACGCTGTTGGGCTGGCTCCTAAAGATGATTTCGTTTTCTTGAGGTTGGTTTCTCTTCCCAAAGGTCGAAAGCTTCTAGGAAAGTACCTTCAGCTACTCGTACCAGGAGGTGAGCTTATGCGAATAGTTTGCATGGCTATTTTCCGTCACTTAAGATTCTTGTTTGGTAGTGTTCCGTCTGATCCTGCGACAGCAGATTCTGTTAGTAATCTTGCAAGAATTGTTTC
ATTGCAAACCCATAGTATGGATCTTGGAGCTCTAAGTGCATGTCTTGCGGCTGTAGTTTGTTCCTCAGAGCAACCTCCACTTCGCCCTCTAGGGGCCCCTGCAGGAGATGGGGCGTCCTTGATTTTGAAATCTGTTCTCGAGAGAGCTACGGGACTCTTAACCGATCCTCATGCTGCAAGCAACTATAACATAACTCACCGAGCTCTTTGGCAGGCTTCTTTTGATGAATTTTTTGGACTTCTTGCAAAGTATTGTGTGAACAAGTACGATAGTATAATGCAATCATTACTCAGACAATCTCCACAGAATGCTGCAGCAGCTGTCTCGGATGCAGCCACTGCTATCAGCCAAGAAATGCCAGTTGAAGTATTGCGTGCAAGTCTTCCCCACACCGACGAGCACCAGAGGAAAGTGTTGATAGATTTTGCCCAACGCTCGATGTCTGTTGGTGGATTTATTAACAGTGGGGCCGAGCACAGTGGTCGCAACAATTTTGGTTCCTTATGA

Protein sequence

MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDTLAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSGPRGVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSLYRTSSFPDQPQPQQYHQQFSSEPILVPKSSYPPSGISPHASPNQHSSHLNMPFVPGGRHVVSLSPSNLTPPNSQIAAGFNPGSRFGNVPQLNSGLSINGGPQSQWVNQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQGHETNSFRNDIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAEHSGRNNFGSL
BLAST of Lsi04G018740 vs. TrEMBL
Match: A0A0A0KZM3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G604530 PE=4 SV=1)

HSP 1 Score: 1385.2 bits (3584), Expect = 0.0e+00
Identity = 716/827 (86.58%), Positives = 754/827 (91.17%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEED T
Sbjct: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEED-T 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
           LA G EEEEFLFDKESEDFRPPSDIDD VSSF K+                   NE+ S 
Sbjct: 61  LATGIEEEEFLFDKESEDFRPPSDIDDPVSSFGKA-------------------NELASR 120

Query: 121 PRGVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSL 180
           PRGVIG  + R+SSSVNEWAREEGFSNWL Q    VESAQEGKRWSSHPHSSSLAESTSL
Sbjct: 121 PRGVIGS-LLRESSSVNEWAREEGFSNWLGQY---VESAQEGKRWSSHPHSSSLAESTSL 180

Query: 181 YRTSSFPDQPQPQQYHQQFSSEPILVPKSSYPPSGISPHASPNQHSSHLNMPFVPGGRHV 240
           YRTSS+PDQPQ  QYHQQFSSEPILVPK+SYPPSGISPHASPNQHSSHLNMPFVPGGRHV
Sbjct: 181 YRTSSYPDQPQ--QYHQQFSSEPILVPKTSYPPSGISPHASPNQHSSHLNMPFVPGGRHV 240

Query: 241 VSLSPSNLTPPNSQIAAGFNPGSRFGNVPQLNSGLSINGGPQSQWVNQTGMFPGEHSSHL 300
            SLSPSNLTPPNSQIA GFNPGSRFGN+ QLNSGLSINGGPQ+QWVNQTGM PGE+SSHL
Sbjct: 241 ASLSPSNLTPPNSQIA-GFNPGSRFGNMQQLNSGLSINGGPQNQWVNQTGMLPGEYSSHL 300

Query: 301 NNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQSHLFNSHLSSGP 360
           NNLLP QLSNQNGFPQLPPQQ QQ+Q    +LQHPVQPPFGGSLPGFQSHLFNSH SSGP
Sbjct: 301 NNLLPQQLSNQNGFPQLPPQQPQQRQ----KLQHPVQPPFGGSLPGFQSHLFNSHPSSGP 360

Query: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQGHETNSFRNDIGWPFYRSKYMTADE 420
           PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTR IHQG+ET+SFRN+ GWPFYRSKYMTADE
Sbjct: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRLIHQGYETHSFRNEFGWPFYRSKYMTADE 420

Query: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480
           LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH
Sbjct: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480

Query: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQEPMLAARVTIEDG 540
           AFLQVEALGRVPFSSIRRPRPLLEVDPPSS   GS DQKVSEKPLEQEPMLAARVTIEDG
Sbjct: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSCGSGSADQKVSEKPLEQEPMLAARVTIEDG 540

Query: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD 600
           HCLLLDVDDIDRFLQFNQFQDGGAQL+RRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD
Sbjct: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLKRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD 600

Query: 601 DFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPATADSVS 660
           DFVFLRLVSLPKG KL+ KYL+LLVPGGELMRIVCMAIFRHLRFLFGSVPSDPA+ADSV+
Sbjct: 601 DFVFLRLVSLPKGLKLITKYLKLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPASADSVT 660

Query: 661 NLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLILKSVLERATGLL 720
            LAR VSL+ + MDLGA+SACLAAVVCSSEQPPLRPLG+PAGDGASLILKS LERAT LL
Sbjct: 661 ELARTVSLRVYGMDLGAISACLAAVVCSSEQPPLRPLGSPAGDGASLILKSCLERATLLL 720

Query: 721 TDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSPQNAAAAVSDAAT 780
           TDP+AA NYN+THR+LWQASFD+FF +L KYCVNKYD+IMQSL+R S QNAAAA S+AA 
Sbjct: 721 TDPNAACNYNLTHRSLWQASFDDFFDILTKYCVNKYDTIMQSLVRHSQQNAAAAASEAAA 780

Query: 781 AISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAEHS 828
           A+S+EMPVEVLRASLPHTD +Q+K+L++FAQRSM VGGF NS AE S
Sbjct: 781 AMSREMPVEVLRASLPHTDGYQKKMLLNFAQRSMPVGGFANSVAEQS 796
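
The percentages in the HSP header for this hit are simply each count divided by the alignment length. The sketch below recomputes them from the header line as a consistency check; it is an illustration only, and `hsp_ratios` and `percent` are hypothetical helper names rather than functions from any BLAST toolkit.

```python
import re


def percent(matches, length):
    """Format matches/length as a percentage with two decimals."""
    return f"{100 * matches / length:.2f}%"


def hsp_ratios(header):
    """Return (matches, length, reported_percent) for each ratio in an HSP header line."""
    found = re.findall(r"(\d+)/(\d+) \((\d+\.\d+%)\)", header)
    return [(int(m), int(n), p) for m, n, p in found]


header = "Identity = 716/827 (86.58%), Positives = 754/827 (91.17%), Query Frame = 1"
for matches, length, reported in hsp_ratios(header):
    # Compare the recomputed value with the one printed in the report.
    print(matches, length, reported, percent(matches, length) == reported)
```

The first ratio counts identical residues and the second counts identical plus similar residues, so the second is always at least as large as the first.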

BLAST of Lsi04G018740 vs. TrEMBL
Match: A0A061FH59_THECC (Topoisomerase II-associated protein PAT1, putative OS=Theobroma cacao GN=TCM_032439 PE=4 SV=1)

HSP 1 Score: 956.1 bits (2470), Expect = 2.9e-275
Identity = 527/842 (62.59%), Positives = 619/842 (73.52%), Query Frame = 1

Query: 16  SEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDTLAAGNEEEEFLFDKE 75
           SEDLK+FG +ST  A+FDASQYAFFGKDV+EEVELGGL+DEE +  A G E+EEFLFD+E
Sbjct: 4   SEDLKQFGDSSTGAAVFDASQYAFFGKDVLEEVELGGLDDEEAELPAVGLEQEEFLFDRE 63

Query: 76  SEDFRPPSDIDDLVSSFEKSEDF----------------------------RPPSDIDDL 135
             D      +   +  F +  DF                            R  SDIDD+
Sbjct: 64  EIDAIALVSVWHAICYFNEVPDFIAIYGVKLCGRSYSAMFHFSWQDTGEVLRSLSDIDDI 123

Query: 136 VSSFEKLNEVGSGPRG--VIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRW 195
            S+F KLN   SGPRG  +IG R  R+SSSV EWA  E F NW  QQ    ES  EGKRW
Sbjct: 124 ASTFSKLNTAVSGPRGSGIIGDRGSRESSSVAEWAHGEEFRNWFDQQALETESIPEGKRW 183

Query: 196 SSHPHSS-SLAESTSLYRTSSFPDQPQPQQYH---QQFSSEPILVPKSSY----PPSGIS 255
           SS P+SS    +S  LYRTSS+P+Q Q Q  H   Q FSSEPILVPKSSY    PP G S
Sbjct: 184 SSQPYSSVPNLDSEHLYRTSSYPEQQQQQLQHHHNQHFSSEPILVPKSSYTSYPPPGGRS 243

Query: 256 PHASPNQHSSHLNMPFVPGGRHVVSLSPSNLTPPNSQIAA-GFNPGSRF-GNVPQLNSGL 315
           P ASPN HS HLN+P + GG  + S SP+  +  NSQ+   G + GS + GN+PQ   GL
Sbjct: 244 PQASPNHHSGHLNIPHMAGGSQMAS-SPNLSSFSNSQLQLPGLHHGSHYAGNMPQFPPGL 303

Query: 316 SINGGPQSQWVNQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHP 375
           S+N  P +QW +Q  ++ G+++S LNN+L  QLS+QNG   +P Q   Q Q  Q RLQHP
Sbjct: 304 SVNNRPSNQWGSQPNLYGGDNTSVLNNMLQQQLSHQNGL--IPSQLMPQLQSHQQRLQHP 363

Query: 376 VQPPFGGSLPGFQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRS-QKGRQNTRFIH 435
           VQP FG  L G QS LFN HLS  PP LMNK EA+LGL D+RDQRP+S Q+ RQN RF  
Sbjct: 364 VQPSFG-HLSGIQSQLFNPHLSPSPP-LMNKFEAILGLGDLRDQRPKSAQRSRQNPRFSQ 423

Query: 436 QGHETNSFRNDIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAG 495
           QG + +  ++DIGWP +RSKYM+ DE+E I+RMQLAATHSNDPYVDDYYHQACL+RK AG
Sbjct: 424 QGFDNSGLKSDIGWPQFRSKYMSTDEIEGILRMQLAATHSNDPYVDDYYHQACLARKYAG 483

Query: 496 AKLRHHFCPNQLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGG 555
           AKLRHHFCP  LRDLPPRARAN EPHAFLQV+ALGRVPFSSIRRPRPLLEVDPP+SS   
Sbjct: 484 AKLRHHFCPTHLRDLPPRARANTEPHAFLQVDALGRVPFSSIRRPRPLLEVDPPNSSAVS 543

Query: 556 STDQKVSEKPLEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLL 615
           + +QKVS+ PLEQEPMLAARVTIEDG CLLLDVDDIDRFLQFNQ QD GAQLR+RRQVLL
Sbjct: 544 NNEQKVSDMPLEQEPMLAARVTIEDGLCLLLDVDDIDRFLQFNQLQDSGAQLRQRRQVLL 603

Query: 616 EGLASSFHIVDPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIV 675
           EGLA+S  +VDPL K+GH   LA KDDFVFLR+VSLPKGRKLL +YLQL+ PGGELMR+V
Sbjct: 604 EGLAASLQLVDPLGKNGHTDELAHKDDFVFLRIVSLPKGRKLLARYLQLVFPGGELMRVV 663

Query: 676 CMAIFRHLRFLFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPL 735
           CMAIFRHLRFLFG +PSDP  A++ +NLAR+VS   H MDL ALS CLAAVVCSSEQPPL
Sbjct: 664 CMAIFRHLRFLFGGLPSDPGAAETTNNLARVVSSCVHGMDLRALSVCLAAVVCSSEQPPL 723

Query: 736 RPLGAPAGDGASLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVN 795
           RP+G+PAGDGASLILKSVL+RAT L+ D  AA NYN+T+++LW+ASFDEFF LL KYCVN
Sbjct: 724 RPVGSPAGDGASLILKSVLDRATKLMIDFRAAGNYNMTNQSLWKASFDEFFNLLTKYCVN 783

Query: 796 KYDSIMQSLLRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSM 817
           KYD++MQSL  Q   + A   SDA  AI +EMPV++L A LPH ++ Q+K++ D +QRS+
Sbjct: 784 KYDTVMQSLRLQVKPDMAIDESDATRAIKREMPVDLLHACLPHINDQQKKLIWDLSQRSV 840

BLAST of Lsi04G018740 vs. TrEMBL
Match: A0A067JMJ1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22657 PE=4 SV=1)

HSP 1 Score: 950.7 bits (2456), Expect = 1.2e-273
Identity = 531/840 (63.21%), Positives = 631/840 (75.12%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           M+G  +G  +Q  S  +D K+ G NSTEDA+FDASQYAFFGKD++EEVELGGL+DEE+  
Sbjct: 1   MEGIESGIGIQEISKVDDPKQTGDNSTEDAVFDASQYAFFGKDLVEEVELGGLDDEEEAL 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
            AA  +EEEFLF ++                  + E  R  SDIDDL S+F KLN+V SG
Sbjct: 61  PAAELDEEEFLFGRQ------------------EGEIVRSLSDIDDLASTFSKLNKVVSG 120

Query: 121 PRG--VIGGRIFRDSSSVNEWAREEGFSNWLAQQGY-NVESAQEGKRWSSHPHSSS--LA 180
           PRG  VIG R  R+SSS  EWA+ + F NW  QQ   + E  Q+GKRWSS P+SSS  L+
Sbjct: 121 PRGAGVIGDRGSRESSSAAEWAQGDDFPNWFDQQQLLDPEGFQDGKRWSSQPYSSSARLS 180

Query: 181 ESTSLYRTSSFPDQPQPQQYHQQFSSEPILVPKSSYP--PSGISPHASPNQHSSHLNMPF 240
           E   LYRTSS+P+Q   QQ+HQ FSSEPILVPKSSY   P G SP ASPN   SHLN+P+
Sbjct: 181 ELKPLYRTSSYPEQ---QQHHQHFSSEPILVPKSSYTSYPPGQSPQASPNH--SHLNIPY 240

Query: 241 VPGGRHVVSLSPSNLTP---PNSQIAAGFNPGSRFG-NVPQLNSGLSINGGPQSQWVNQT 300
           + GG  + ++S  NL+P   P  Q+    +    FG N+ Q +SG S N  P +QW+N T
Sbjct: 241 LGGGPQM-AISLPNLSPFSGPQLQLTGLHHGSPHFGGNLSQFSSGPSANSRPPNQWMNHT 300

Query: 301 GMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQS 360
           G++PG+H + LNN+L  QL +QNG   + PQ   Q Q QQHR+ HPVQPP G  L G QS
Sbjct: 301 GLYPGDHPNRLNNML-QQLPHQNGL--MAPQLMSQLQSQQHRMHHPVQPPLG-HLSGMQS 360

Query: 361 HLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRS-QKGRQNTRFIHQGHETNSFRNDIGW 420
            LFN H SS P HLMNK EA+LG+ D RDQRP++ QKGRQN  +   G ++N  + +  W
Sbjct: 361 QLFNLHPSSSP-HLMNKFEAVLGMGDNRDQRPKTAQKGRQNLYYSQHGFDSNGQKIESFW 420

Query: 421 PFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRD 480
           P +RSKYMTADE+E+I+RMQLAATHSNDPYVDDYYHQACLS+KSAGAKL+HHFCP  LRD
Sbjct: 421 PQFRSKYMTADEIESILRMQLAATHSNDPYVDDYYHQACLSKKSAGAKLKHHFCPTHLRD 480

Query: 481 LPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQE 540
           LPPRARANNEPHAFLQV+ALGR PFSSIRRPRPLLEVDPP+SS+ G+TDQKVSEKPLEQE
Sbjct: 481 LPPRARANNEPHAFLQVDALGRAPFSSIRRPRPLLEVDPPNSSISGATDQKVSEKPLEQE 540

Query: 541 PMLAARVTIEDGHCLLLDVDDIDRFLQFN--QFQDGGAQLRRRRQVLLEGLASSFHIVDP 600
           PMLAARVTIEDG CLLLDVDDIDRFL+FN  Q QDGG QL+RRRQVLLEGLA+S  +VDP
Sbjct: 541 PMLAARVTIEDGLCLLLDVDDIDRFLEFNFNQLQDGGVQLKRRRQVLLEGLAASMQLVDP 600

Query: 601 LSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLF 660
           L K+GH+VGLAPKDD VFLRLVSLPKGRKLL KYLQ L PGGELMRIVCMAIFRHLRFLF
Sbjct: 601 LGKNGHSVGLAPKDDLVFLRLVSLPKGRKLLAKYLQFLSPGGELMRIVCMAIFRHLRFLF 660

Query: 661 GSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGAS 720
           G +PSD   A++ +NLA++VSL    MDL +LSACLAAVVCSSE PPLRPLG  AG+GAS
Sbjct: 661 GGLPSDVGAAETTNNLAKVVSLCVRRMDLSSLSACLAAVVCSSEPPPLRPLGNSAGNGAS 720

Query: 721 LILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQ 780
           LIL SVLERAT LL +   A+NYN+T+RALW+ASFDEFFGLL KYC+NKYDSIMQS L  
Sbjct: 721 LILMSVLERATELLIELQDANNYNMTNRALWKASFDEFFGLLIKYCINKYDSIMQSSL-- 780

Query: 781 SPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSV----GGFINS 823
                    SD A AI +E+P+E+LRAS+PH +++Q+K+L D +QRS++     GG +NS
Sbjct: 781 ---------SDPAEAIKRELPMELLRASVPHVNDYQKKLLYDLSQRSLASQDGNGGHMNS 800

BLAST of Lsi04G018740 vs. TrEMBL
Match: W9S1T2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_002129 PE=4 SV=1)

HSP 1 Score: 949.1 bits (2452), Expect = 3.5e-273
Identity = 537/847 (63.40%), Positives = 627/847 (74.03%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           M+ F +G+R+Q A  S+DLK+FG +ST D +FDASQYAFFGKDV+EEVELGGLEDEE+D 
Sbjct: 1   MEAFESGSRIQEAPNSQDLKQFGNDST-DTVFDASQYAFFGKDVLEEVELGGLEDEEEDL 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
            AAG EEEEFL+DKE                  ++   R  SD+DDL S+F K   V SG
Sbjct: 61  PAAGFEEEEFLYDKE------------------ENAVLRSLSDVDDLASTFSK---VMSG 120

Query: 121 PR--GVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSS-LAES 180
           PR  G++G    R +SS  EWA+EE F N +     + +   EGKRWSS P S++ L ES
Sbjct: 121 PRNTGIVGDIGSRQNSSAAEWAQEE-FPNGINHH-LDSDGIPEGKRWSSQPFSAARLTES 180

Query: 181 TSLYRTSSFPDQPQPQQ-YHQQFSSEPILVPKSSYP----PSGISPHASPNQHSSHLNMP 240
             LYRTSS+P+  Q QQ  H  +SSEPI VPKSS+P    P G +P  SPN HS HLNM 
Sbjct: 181 KPLYRTSSYPEPQQQQQPQHTHYSSEPIPVPKSSFPSYPSPGGRTPQDSPNHHSGHLNMQ 240

Query: 241 FVPGGRHVVSLSPSNLTP-PNSQI-AAGFNPGSRF-GNVPQLNSGLSINGGPQSQWVNQT 300
           +  GG H   LS  NL P  NSQ+  AG   GS F GN+PQL   LS+N    SQW+NQ 
Sbjct: 241 YHAGGPH-GGLSSPNLPPFSNSQVPLAGLAHGSHFGGNLPQLPPCLSVNNRLPSQWINQP 300

Query: 301 GMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQS 360
           GMFPG++S+ LN+++  QLS+QNG   +PP    Q   QQHR+   VQP F   L G QS
Sbjct: 301 GMFPGDNSALLNSMMQPQLSHQNGL--MPP----QLMTQQHRIHPTVQPSF-NHLSGMQS 360

Query: 361 HLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRS-QKGRQNTRFIHQGHETNSFRNDIGW 420
            LFN HLS  PP LM+K +AMLGL D+RDQ+P+S QKGR N R+   G +T++ + D GW
Sbjct: 361 QLFNPHLSPSPP-LMSKFDAMLGLGDLRDQKPKSFQKGRLNLRYSQLGFDTSNQKGDGGW 420

Query: 421 PFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRD 480
           P +RSKYMTA+E++ I+RMQLAATHSNDPYVDDYYHQA L++ SAGAKLRHHFCP  LR+
Sbjct: 421 PPFRSKYMTAEEIDGILRMQLAATHSNDPYVDDYYHQASLAKNSAGAKLRHHFCPTHLRE 480

Query: 481 LPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQE 540
           LPPRARANNEPHAFLQV+ALGR+PFSSIRRPRPLLEVD P+SS  GSTDQK SEKPLEQE
Sbjct: 481 LPPRARANNEPHAFLQVDALGRIPFSSIRRPRPLLEVDSPNSSGHGSTDQKASEKPLEQE 540

Query: 541 PMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLS 600
           PMLAARV IEDG CLLLDVDDIDRFLQFNQ  DGG   + RRQ LLE LA+S  +VDPL 
Sbjct: 541 PMLAARVAIEDGICLLLDVDDIDRFLQFNQLPDGGVHYKHRRQALLEDLAASLQLVDPLG 600

Query: 601 KDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGS 660
           K G  +GL PKDD VFLRLVSLPKGRKLL +YLQLL   GELMRIVCMAIFRHLRFLFG 
Sbjct: 601 KSGGTIGLVPKDDLVFLRLVSLPKGRKLLARYLQLLFLDGELMRIVCMAIFRHLRFLFGF 660

Query: 661 VPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLI 720
           +PSDP  A++ +NLA++VS     MDLG+LSACLAAVVCSSEQPPLRPLG+ AGDGASLI
Sbjct: 661 LPSDPGAAETANNLAKVVSSCIQEMDLGSLSACLAAVVCSSEQPPLRPLGSSAGDGASLI 720

Query: 721 LKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSP 780
           LKSVLERAT LLTDP+AASNYN+ +RALWQASFDEFFGLL KYC NKYDSIMQSLL Q P
Sbjct: 721 LKSVLERATELLTDPNAASNYNMQNRALWQASFDEFFGLLTKYCSNKYDSIMQSLLTQGP 780

Query: 781 QNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAEHSG 836
            N A   +DAA AIS+EMPVE++RASLPHTD  QR++L+DF QRSMS+G        + G
Sbjct: 781 TNTAVIGADAARAISREMPVELVRASLPHTDVRQRQLLLDFTQRSMSLGASNTPPGGNDG 814

BLAST of Lsi04G018740 vs. TrEMBL
Match: B9RI49_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1576440 PE=4 SV=1)

HSP 1 Score: 939.9 bits (2428), Expect = 2.2e-270
Identity = 530/845 (62.72%), Positives = 628/845 (74.32%), Query Frame = 1

Query: 1   MDGFGNGAR-VQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDD 60
           M+ FG+G   +Q A  ++DLK+FG NS+E A+FDASQYAFFG D++E+VELGGLEDEE+D
Sbjct: 1   MERFGSGGGGIQEALKADDLKQFGDNSSEGAVFDASQYAFFGNDLVEDVELGGLEDEEED 60

Query: 61  TLAAGN--EEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEV 120
             A G   +EEEF+F ++           +L  SF         SDIDDL S+F KLN+V
Sbjct: 61  LPAVGGRFDEEEFIFGRQE---------GELARSF---------SDIDDLASTFSKLNKV 120

Query: 121 GSGPR--GVIGGRIFRDSSSVNEWAREEGFSNWLAQQG-YNVESAQEGKRWSSHPHSSS- 180
            SGPR  GVIG R  R+SSS  EWA+ E F NWL QQ  ++ +  Q+GKRWSS P+SSS 
Sbjct: 121 VSGPRTAGVIGDRGSRESSSATEWAQGEEFQNWLDQQQLFDPDGIQDGKRWSSQPYSSSS 180

Query: 181 -LAESTSLYRTSSFPDQPQPQQYHQQFSSEPILVPKSSY----PPSGISPHASPNQHSSH 240
            L+E   LYRTSS+P+Q   QQ+HQ FSSEPILVPKSSY    PP G SP ASPN   SH
Sbjct: 181 RLSELKPLYRTSSYPEQ---QQHHQHFSSEPILVPKSSYTSYPPPGGQSPQASPNH--SH 240

Query: 241 LNMPFVPGGRHVVSLSPSNLTP---PNSQIAAGFNPGSRFG-NVPQLNSGLSINGGPQSQ 300
           +NM ++ GG  + ++S  NL+P   P  Q+    +    FG N+ QL+SGLS N  P +Q
Sbjct: 241 MNMHYLGGGPQM-AISLPNLSPFSSPQLQLTGLHHGSQHFGRNLSQLSSGLSGNNRPPNQ 300

Query: 301 WVNQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSL 360
           W N  G++ G+H + LNN+L  QL +QNG   +PPQ   Q Q QQHRL H VQP  G  L
Sbjct: 301 WANHAGLYLGDHPNRLNNMLQQQLPHQNGL--MPPQLMAQLQTQQHRLHHLVQPSLG-HL 360

Query: 361 PGFQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRS-QKGRQNTRFIHQGHETNSFR 420
            G QS LFN H S  P  LM K + +LGL D+RDQRPRS QK R N R+  QG + NS +
Sbjct: 361 SGMQSQLFNPHHSPSPA-LMGKFDPVLGLGDIRDQRPRSAQKARPNMRYSQQGFDLNSQK 420

Query: 421 NDIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCP 480
            D  WP +RSK+MTADE+E+I+RMQLAA HSNDPYVDDYYHQACL++KS GAKL+HHFCP
Sbjct: 421 IDGIWPQFRSKHMTADEIESILRMQLAAMHSNDPYVDDYYHQACLAKKSVGAKLKHHFCP 480

Query: 481 NQLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEK 540
             LRDLPPRARAN EPHAFLQV+ALGR  FSSIRRPRPLLEVDPP+SSV G TDQKVSEK
Sbjct: 481 THLRDLPPRARANAEPHAFLQVDALGRAAFSSIRRPRPLLEVDPPNSSVSGGTDQKVSEK 540

Query: 541 PLEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHI 600
           PLEQEPMLAARV IEDG CLLLDVDDIDRFL+FNQFQDGGAQLRRRRQVL+EGLA+S  +
Sbjct: 541 PLEQEPMLAARVAIEDGLCLLLDVDDIDRFLEFNQFQDGGAQLRRRRQVLMEGLATSMQL 600

Query: 601 VDPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLR 660
           VDPL K+GH VGLAPKDD VFLRLVSLPKGRKLL KYLQLL PG +LMRIVCMAIFRHLR
Sbjct: 601 VDPLGKNGHTVGLAPKDDLVFLRLVSLPKGRKLLAKYLQLLSPGSDLMRIVCMAIFRHLR 660

Query: 661 FLFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGD 720
           FLFG +PSD   A++ +NLAR+VSL    MDLG+LSACLAAVVCSSEQPPLRPLG+ AG+
Sbjct: 661 FLFGGLPSDLGAAETTNNLARVVSLCACRMDLGSLSACLAAVVCSSEQPPLRPLGSSAGN 720

Query: 721 GASLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSL 780
           GASLIL SVLERA  LL +   ASNYN+T+RALW+ASFDEFF LL KYC+NKYDSIMQS 
Sbjct: 721 GASLILMSVLERAAELLGELQDASNYNVTNRALWKASFDEFFVLLVKYCINKYDSIMQS- 780

Query: 781 LRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSM----SVGGF 825
                      + D A AI +E+P+E+LR S+PHT+++Q+K+L D +QRS+    S GG 
Sbjct: 781 ----------PIQDPAEAIKRELPMELLRVSVPHTNDYQKKMLYDLSQRSLVGQNSNGGH 806

BLAST of Lsi04G018740 vs. TAIR10
Match: AT1G79090.1 (AT1G79090.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 739.6 bits (1908), Expect = 2.2e-213
Identity = 441/828 (53.26%), Positives = 552/828 (66.67%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           MD FG G+ +  A  ++DLK+FG NST + +FDASQYAFFG DV+EEVELGGLE EED+ 
Sbjct: 1   MDAFGIGSSLNQAPVTQDLKKFGDNSTGNTMFDASQYAFFGNDVVEEVELGGLE-EEDEI 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGS- 120
           L+     E+F FDK                  E+  D R  SD+DDL S+F KLN     
Sbjct: 61  LSFTGIAEDFSFDK------------------EEVGDSRLLSDVDDLASTFSKLNREPDV 120

Query: 121 -GPRGVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSS-SLAES 180
               G I  R    +S   EW   E   NW  +Q  + ++ ++ K WS+ P SS    E 
Sbjct: 121 YSNTGPITDRRSSQNSLAAEWTHGEELPNWYGRQILDSDAIKDDKVWSAQPFSSLDRVEQ 180

Query: 181 TSLYRTSSFPD-QPQPQQYH--QQFSSEPILVPKS---SYPPSGISPHASPNQHSSHLNM 240
               RT  +P+ Q Q  Q H  QQFSSEPILVPKS   SYPP G     SP+Q   H N+
Sbjct: 181 RIPDRTKLYPEPQRQLHQDHNQQQFSSEPILVPKSSFVSYPPPG---SISPDQRLGHPNI 240

Query: 241 PFVPGGRHVVS--LSP-SNLTPPNSQIAAGFNPGSRFGNVPQLNSGLSINGGPQSQWVNQ 300
           P+  GG  + S   SP  NL P    +  G       GN PQ    L +N  P +QW+N+
Sbjct: 241 PYQSGGPQMGSPNFSPFPNLQPQLPSMHHG--SPQHTGNRPQFRPALPLNNLPPAQWMNR 300

Query: 301 TGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQ 360
             M PG+ S  +NN +  Q  +QNG   +PP    Q Q  Q+RL HP+QPP  G +PG Q
Sbjct: 301 QNMHPGDSSGIMNNAMLQQPPHQNGL--MPP----QMQGSQNRLPHPMQPPL-GHMPGMQ 360

Query: 361 SHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRSQKG-RQNTRFIHQGHETNSFRNDIG 420
             LFNSHLS          + MLG  D+R+ RP S  G RQN RF  QG +    R    
Sbjct: 361 PQLFNSHLSRSSS--SGNYDGMLGFGDLREVRPGSGHGNRQNVRFPQQGFDAGVQRR--- 420

Query: 421 WPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLR 480
           +PF RSKYM+A E+ENI+RMQL ATHSNDPYVDDYYHQACL++KSAGAKL+HHFCPN LR
Sbjct: 421 YPF-RSKYMSAGEIENILRMQLVATHSNDPYVDDYYHQACLAKKSAGAKLKHHFCPNHLR 480

Query: 481 DLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQ 540
           DL  RAR+NNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPP+S+  G+ + K ++KPL+Q
Sbjct: 481 DLQQRARSNNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPNSAKFGNAEHKPTDKPLDQ 540

Query: 541 EPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPL 600
           EPMLAARV IEDG CLLL+VDDIDRFL+FNQ QDGG QL++RRQ LL+ LA S  + DPL
Sbjct: 541 EPMLAARVYIEDGLCLLLEVDDIDRFLEFNQLQDGGHQLKQRRQALLQSLAVSLQLGDPL 600

Query: 601 SKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFG 660
           +K+G +  L   DDF+FLR++SLPKGRKLL +YLQL+ PG +LMRIVCMAIFRHLR LFG
Sbjct: 601 AKNGQSQSL---DDFLFLRVISLPKGRKLLIRYLQLIFPGSDLMRIVCMAIFRHLRSLFG 660

Query: 661 SVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASL 720
            + SDP    + + LA +++L   +M+LG +S CLAAV CSSEQ PLRPLG+P GDGAS 
Sbjct: 661 VLSSDPDIIKTTNKLATVINLCIQNMELGPVSTCLAAVSCSSEQAPLRPLGSPVGDGAST 720

Query: 721 ILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQS 780
           +LKS+L+RA+ L+     A+N+N    ALW+ASF+EFF +L +YC++KYDSIMQSL  Q 
Sbjct: 721 VLKSILDRASELI----RANNFNNAGIALWRASFNEFFNMLMRYCISKYDSIMQSL--QL 780

Query: 781 PQNAAAAVS-DAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSM 815
           P + A  +S +AA AI +EMP+E+LR+S PH DE Q+++L++F +RSM
Sbjct: 781 PPHFATEISEEAAKAIVREMPIELLRSSFPHIDEQQKRILMEFLKRSM 782

BLAST of Lsi04G018740 vs. TAIR10
Match: AT3G22270.1 (AT3G22270.1 Topoisomerase II-associated protein PAT1)

HSP 1 Score: 562.0 bits (1447), Expect = 6.2e-160
Identity = 369/836 (44.14%), Positives = 507/836 (60.65%), Query Frame = 1

Query: 14  STSEDLKRFGANSTED---ALFDASQYAFFGKDVMEEVELGGLEDEE--DDTLAAGNEEE 73
           S S DL  F   S+ D    LFDASQY FFG++ ++++ELGGL+D+      L   +++E
Sbjct: 4   SDSRDLYNFVRASSLDKNSTLFDASQYEFFGQN-LDDMELGGLDDDGVIAPVLGHADDDE 63

Query: 74  EFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSGPR--GVIG 133
             LFDK                   +       SD+DDL ++F KLN V +GP+  GVIG
Sbjct: 64  YHLFDK------------------GEGAGLGSLSDMDDLATTFAKLNRVVTGPKHPGVIG 123

Query: 134 ----GRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSLYR 193
               G   R+SSS  +W ++   ++WL +Q       QE KRWSS P   S A S  LYR
Sbjct: 124 DRGSGSFSRESSSATDWTQDAELTSWLDEQD------QEAKRWSSQP--QSFAHSKPLYR 183

Query: 194 TSSFPDQPQPQQYHQQFSSEPILVPKSSY----PPSGISPHASP-NQHSSHLNMPFVPGG 253
           TSS+P Q QPQ  H  ++SEPI++P+S++    PP   SP ASP N H +    P +PGG
Sbjct: 184 TSSYPQQ-QPQLQH--YNSEPIILPESNFTSFPPPGNRSPQASPGNLHRA----PSLPGG 243

Query: 254 RHVVSLSPSNLTPPNSQIAAGFNPGSRF-GNVPQLNS-GLSINGGPQSQWVNQTGMFPGE 313
             +   +PS L+     + +G + G  + GN+ +  S G ++    Q  WV   G   G+
Sbjct: 244 SQLTYSAPSPLSNSGFHL-SGLSQGPHYGGNLTRYASCGPTLGNMVQPHWVTDPGHLHGD 303

Query: 314 HSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQSHLFNSH 373
           HS  L+NL+  Q        QLPP   +     QH L    +  +   L   QS L++S+
Sbjct: 304 HSGLLHNLVQQQ------HQQLPP---RNAIMSQHLLALQQRQSY-AQLAALQSQLYSSY 363

Query: 374 LSSGPPHLMNKLEAMLGLPDMRDQRPR-SQKGRQNTRFIHQGHETNSFRNDIGWPFYRSK 433
            S          +   G+ ++R+ + + S + R+N     Q  +  S +++ G  F RSK
Sbjct: 364 PSP-------SRKVPFGVGEVREHKHKSSHRSRKNRGLSQQTSDAASQKSETGLQF-RSK 423

Query: 434 YMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRAR 493
           +MT++E+E+I++MQ + +HSNDPYV+DYYHQA L++KSAG+K   HF P QL+D  PR+R
Sbjct: 424 HMTSEEIESILKMQHSNSHSNDPYVNDYYHQAKLAKKSAGSKAISHFYPAQLKDHQPRSR 483

Query: 494 ANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQEPMLAAR 553
            ++E H  + V+ALG++   S+RRP  LLEVD       GS D K S K LEQEP++AAR
Sbjct: 484 NSSEQHPQVHVDALGKITLPSVRRPHALLEVDSSPGFNDGSGDHKGSGKHLEQEPLVAAR 543

Query: 554 VTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLSKDGHAV 613
           VTIED   +L+D+ DIDR LQ  + QDGGAQL+R+RQ+LLEGLA++  + DP SK G   
Sbjct: 544 VTIEDALGVLIDIVDIDRTLQNTRPQDGGAQLKRKRQILLEGLATALQLADPFSKTGQKS 603

Query: 614 GLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPA 673
           G+  KDD VFLR+ +LPKGRKLL KYLQLLVPG E  R+VCMAIFRHLRFLFG +PSD  
Sbjct: 604 GMTAKDDIVFLRIATLPKGRKLLTKYLQLLVPGTENARVVCMAIFRHLRFLFGGLPSDTL 663

Query: 674 TADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLILKSVLE 733
            A+++SNLA+ V++   +MDL ALSACLAAVVCSSEQPPLRP+G+ AGDGAS++L S+LE
Sbjct: 664 AAETISNLAKAVTVCVQAMDLRALSACLAAVVCSSEQPPLRPIGSSAGDGASVVLISLLE 723

Query: 734 RATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSPQNAAAA 793
           RA  ++  P     +  ++  LW+ASFDEFF LL KYC +KYD+I         QN  +A
Sbjct: 724 RAAEVVVVPRVM--HGNSNDGLWRASFDEFFNLLTKYCRSKYDTI-------RGQNQGSA 777

Query: 794 VSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSV--------GGFINS 823
                 AI +EMP E+LRASL HT++ QR  L++F ++  ++        GG INS
Sbjct: 784 ADVLELAIKREMPAELLRASLRHTNDDQRNYLLNFGRKPSAISESASHARGGQINS 777

BLAST of Lsi04G018740 vs. TAIR10
Match: AT4G14990.1 (AT4G14990.1 Topoisomerase II-associated protein PAT1)

HSP 1 Score: 543.1 bits (1398), Expect = 3.0e-154
Identity = 370/846 (43.74%), Positives = 509/846 (60.17%), Query Frame = 1

Query: 14  STSEDLKRFGANSTED--ALFDASQYAFFGKDVMEEVELGGLEDEEDDTLAAGNEEEEF- 73
           S S D   F   S+++  ALFDASQY FFG+  +EEVELGGL+D  D T+    ++EE+ 
Sbjct: 4   SDSRDFYNFAKTSSDNNSALFDASQYEFFGQS-LEEVELGGLDD--DGTVRGHVDDEEYH 63

Query: 74  LFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSGPR--GVIG-- 133
           LFDK     R  + +  L             SD+DDL ++F KLN   +GP+  GVIG  
Sbjct: 64  LFDK-----REGAGLGSL-------------SDMDDLATTFAKLNRNVTGPKHLGVIGDR 123

Query: 134 --GRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSLYRTS 193
             G   R+SS+  +W ++  F++WL Q  + VE   +   WSS P SS    S SLYRTS
Sbjct: 124 GSGSFSRESSTATDWTQDNEFTSWLDQ--HTVEEQVQEASWSSQPQSS--PNSNSLYRTS 183

Query: 194 SFPDQPQPQQYHQQFSSEPILVPKSSYPPSGISPHASPNQHSSHLN-MPFVPGGRHVVSL 253
           S+P Q   Q   Q +SSEPI+VP+S++         S     SH++  P +PGG      
Sbjct: 184 SYPQQ---QTQLQHYSSEPIIVPESTFTSFPSPGKRSQQSSPSHIHRAPSLPGG------ 243

Query: 254 SPSNLTPPNSQ-------IAAGFNPG-SRFGN--VPQLNSGLSINGGPQS--QWVNQTGM 313
           S SN + PN+          +G + G S +GN      + G ++    Q    WV   G+
Sbjct: 244 SQSNFSAPNASPLSNSTFHLSGLSHGPSHYGNNLARYASCGPTLGNMVQQPPHWVTDPGL 303

Query: 314 FPGEHSSHLNNLLPHQLSNQNGFPQLPPQ-----QQQQQQQQQHRLQHPVQPPFGGSLPG 373
             G+HS+     L H L  Q    QLPP+     QQ    QQ+  L H         L  
Sbjct: 304 LHGDHSA-----LLHSLMQQQHLQQLPPRNGFTSQQLISLQQRQSLAH---------LAA 363

Query: 374 FQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPR-SQKGRQNTRFI-HQGHETNSFRN 433
            QS L++S+ S  P H     +A+ G+ ++R+ + + S + R+N   I  Q  +  S ++
Sbjct: 364 LQSQLYSSYPS--PSH-----KALFGVGEVREHKHKSSHRSRKNRGGISQQTSDLASQKS 423

Query: 434 DIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPN 493
           + G  F RSKYMT++E+E+I++MQ + +HS+DPYV+DYYHQA L++KS+G++ +    P+
Sbjct: 424 ESGLQF-RSKYMTSEEIESILKMQHSNSHSSDPYVNDYYHQARLAKKSSGSRTKPQLYPS 483

Query: 494 QLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKP 553
            L+D   R+R +++    + V+ALG++   SI RPR LLEVD P SS           K 
Sbjct: 484 HLKDHQSRSRNSSDQQPQVHVDALGKITLPSICRPRALLEVDSPPSS---------GHKH 543

Query: 554 LEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIV 613
           LE EP++AARVTIED   +L+D+ DIDR LQFN+ QDGGAQLRR+RQ+LLEGLA+S  +V
Sbjct: 544 LEDEPLVAARVTIEDAFGVLIDIVDIDRTLQFNRPQDGGAQLRRKRQILLEGLATSLQLV 603

Query: 614 DPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRF 673
           DP SK G   GL  KDD VFLR+ +LPKGRKLL KYLQLLVPG E+ R+VCMA+FRHLRF
Sbjct: 604 DPFSKTGQKTGLTTKDDIVFLRITTLPKGRKLLTKYLQLLVPGTEIARVVCMAVFRHLRF 663

Query: 674 LFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDG 733
           LFG +PSD   A++++NLA+ V++   +MDL ALSACLAAVVCSSEQPPLRP+G+ +GDG
Sbjct: 664 LFGGLPSDSLAAETIANLAKAVTVCVQAMDLRALSACLAAVVCSSEQPPLRPIGSSSGDG 723

Query: 734 ASLILKSVLERATGLLTD--PHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQS 793
           AS++L S+LERA  ++    P   SN+   +  LW+ASFDEFF LL KYC +KY++I   
Sbjct: 724 ASVVLVSLLERAAEVIVAVVPPRVSNHGNPNDGLWRASFDEFFSLLTKYCRSKYETIH-- 777

Query: 794 LLRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINS 829
              Q+  NAA  +     AI +EMP E+LRASL HT+E QR  L++  + +  V     +
Sbjct: 784 --GQNHDNAADVLE---LAIKREMPAELLRASLRHTNEDQRNFLLNVGRSASPVSESTTT 777

BLAST of Lsi04G018740 vs. NCBI nr
Match: gi|659087137|ref|XP_008444289.1| (PREDICTED: protein PAT1 homolog 1 [Cucumis melo])

HSP 1 Score: 1415.6 bits (3663), Expect = 0.0e+00
Identity = 725/827 (87.67%), Positives = 761/827 (92.02%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           MDGFGNGARVQVASTSEDL RFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT
Sbjct: 1   MDGFGNGARVQVASTSEDLNRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
           LAAG EEEEFLFDKESEDFRPPSDIDD VSSFEK                   +NEV S 
Sbjct: 61  LAAGIEEEEFLFDKESEDFRPPSDIDDPVSSFEK-------------------VNEVASR 120

Query: 121 PRGVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSL 180
           PRGVIGG + R+SSSVN+WA EEGFSNWL Q   +VESAQEGKRWSSHPHSSSLAESTSL
Sbjct: 121 PRGVIGG-LLRESSSVNQWAHEEGFSNWLGQ---HVESAQEGKRWSSHPHSSSLAESTSL 180

Query: 181 YRTSSFPDQPQPQQYHQQFSSEPILVPKSSYPPSGISPHASPNQHSSHLNMPFVPGGRHV 240
           YRTSS+PDQPQ QQYHQQFSSEPILVPK+SYPPSGISPHASPNQHSSHLNMPFV GGRH+
Sbjct: 181 YRTSSYPDQPQVQQYHQQFSSEPILVPKTSYPPSGISPHASPNQHSSHLNMPFVSGGRHI 240

Query: 241 VSLSPSNLTPPNSQIAAGFNPGSRFGNVPQLNSGLSINGGPQSQWVNQTGMFPGEHSSHL 300
            SLSPSNLTPPNSQIA GFNPGSRFG++ QLNSGLS NGGPQSQWVNQTGMFPGEHSSHL
Sbjct: 241 ASLSPSNLTPPNSQIA-GFNPGSRFGSMLQLNSGLSNNGGPQSQWVNQTGMFPGEHSSHL 300

Query: 301 NNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQSHLFNSHLSSGP 360
           NNLLP QLSNQNGFPQLPPQ       Q+H+LQHPVQPPFGGSLPGFQSHLFNSH SSGP
Sbjct: 301 NNLLPQQLSNQNGFPQLPPQ-------QRHKLQHPVQPPFGGSLPGFQSHLFNSHPSSGP 360

Query: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQGHETNSFRNDIGWPFYRSKYMTADE 420
           PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQG+ETNSFRN+ GWPFYRSKYMTADE
Sbjct: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQGYETNSFRNEFGWPFYRSKYMTADE 420

Query: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480
           LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH
Sbjct: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480

Query: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQEPMLAARVTIEDG 540
           AFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGS DQKVSEKPLEQEPMLAARVTIEDG
Sbjct: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSADQKVSEKPLEQEPMLAARVTIEDG 540

Query: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD 600
           HCLLLDVDDIDRFLQFNQFQDGGAQL+RRRQVLLEGLASSFHI+DPLSKDGHAVGLAPKD
Sbjct: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLKRRRQVLLEGLASSFHIIDPLSKDGHAVGLAPKD 600

Query: 601 DFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPATADSVS 660
           DFVFLRLVSLPKG KLL KYL+LLVPGGELMRIVCMAIFRHLRFLFGSVPSDPA+ADSVS
Sbjct: 601 DFVFLRLVSLPKGLKLLTKYLKLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPASADSVS 660

Query: 661 NLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLILKSVLERATGLL 720
            LARIVSL+ +SMDLGA+SACLAAVVCS EQPPLRPLG+PAGDGASLILKS LERAT LL
Sbjct: 661 ELARIVSLRIYSMDLGAISACLAAVVCSPEQPPLRPLGSPAGDGASLILKSCLERATLLL 720

Query: 721 TDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSPQNAAAAVSDAAT 780
           TDP+AA NYN+THR+LWQASFD+FF +L KYCVNKYD+IMQSL+R SPQNAAAA SDAA 
Sbjct: 721 TDPNAACNYNLTHRSLWQASFDDFFNILTKYCVNKYDTIMQSLVRHSPQNAAAAASDAAA 780

Query: 781 AISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAEHS 828
           A+S+EMPVEVLRASLPHTD +Q+K+L++FAQRSM VGGF NS AE S
Sbjct: 781 AMSREMPVEVLRASLPHTDGYQKKMLLNFAQRSMPVGGFTNSVAEQS 796

BLAST of Lsi04G018740 vs. NCBI nr
Match: gi|449453874|ref|XP_004144681.1| (PREDICTED: protein PAT1 homolog 1 [Cucumis sativus])

HSP 1 Score: 1385.2 bits (3584), Expect = 0.0e+00
Identity = 716/827 (86.58%), Positives = 754/827 (91.17%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEED T
Sbjct: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEED-T 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
           LA G EEEEFLFDKESEDFRPPSDIDD VSSF K+                   NE+ S 
Sbjct: 61  LATGIEEEEFLFDKESEDFRPPSDIDDPVSSFGKA-------------------NELASR 120

Query: 121 PRGVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSSLAESTSL 180
           PRGVIG  + R+SSSVNEWAREEGFSNWL Q    VESAQEGKRWSSHPHSSSLAESTSL
Sbjct: 121 PRGVIGS-LLRESSSVNEWAREEGFSNWLGQY---VESAQEGKRWSSHPHSSSLAESTSL 180

Query: 181 YRTSSFPDQPQPQQYHQQFSSEPILVPKSSYPPSGISPHASPNQHSSHLNMPFVPGGRHV 240
           YRTSS+PDQPQ  QYHQQFSSEPILVPK+SYPPSGISPHASPNQHSSHLNMPFVPGGRHV
Sbjct: 181 YRTSSYPDQPQ--QYHQQFSSEPILVPKTSYPPSGISPHASPNQHSSHLNMPFVPGGRHV 240

Query: 241 VSLSPSNLTPPNSQIAAGFNPGSRFGNVPQLNSGLSINGGPQSQWVNQTGMFPGEHSSHL 300
            SLSPSNLTPPNSQIA GFNPGSRFGN+ QLNSGLSINGGPQ+QWVNQTGM PGE+SSHL
Sbjct: 241 ASLSPSNLTPPNSQIA-GFNPGSRFGNMQQLNSGLSINGGPQNQWVNQTGMLPGEYSSHL 300

Query: 301 NNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPGFQSHLFNSHLSSGP 360
           NNLLP QLSNQNGFPQLPPQQ QQ+Q    +LQHPVQPPFGGSLPGFQSHLFNSH SSGP
Sbjct: 301 NNLLPQQLSNQNGFPQLPPQQPQQRQ----KLQHPVQPPFGGSLPGFQSHLFNSHPSSGP 360

Query: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRFIHQGHETNSFRNDIGWPFYRSKYMTADE 420
           PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTR IHQG+ET+SFRN+ GWPFYRSKYMTADE
Sbjct: 361 PHLMNKLEAMLGLPDMRDQRPRSQKGRQNTRLIHQGYETHSFRNEFGWPFYRSKYMTADE 420

Query: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480
           LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH
Sbjct: 421 LENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQLRDLPPRARANNEPH 480

Query: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPLEQEPMLAARVTIEDG 540
           AFLQVEALGRVPFSSIRRPRPLLEVDPPSS   GS DQKVSEKPLEQEPMLAARVTIEDG
Sbjct: 481 AFLQVEALGRVPFSSIRRPRPLLEVDPPSSCGSGSADQKVSEKPLEQEPMLAARVTIEDG 540

Query: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD 600
           HCLLLDVDDIDRFLQFNQFQDGGAQL+RRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD
Sbjct: 541 HCLLLDVDDIDRFLQFNQFQDGGAQLKRRRQVLLEGLASSFHIVDPLSKDGHAVGLAPKD 600

Query: 601 DFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPATADSVS 660
           DFVFLRLVSLPKG KL+ KYL+LLVPGGELMRIVCMAIFRHLRFLFGSVPSDPA+ADSV+
Sbjct: 601 DFVFLRLVSLPKGLKLITKYLKLLVPGGELMRIVCMAIFRHLRFLFGSVPSDPASADSVT 660

Query: 661 NLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGASLILKSVLERATGLL 720
            LAR VSL+ + MDLGA+SACLAAVVCSSEQPPLRPLG+PAGDGASLILKS LERAT LL
Sbjct: 661 ELARTVSLRVYGMDLGAISACLAAVVCSSEQPPLRPLGSPAGDGASLILKSCLERATLLL 720

Query: 721 TDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLRQSPQNAAAAVSDAAT 780
           TDP+AA NYN+THR+LWQASFD+FF +L KYCVNKYD+IMQSL+R S QNAAAA S+AA 
Sbjct: 721 TDPNAACNYNLTHRSLWQASFDDFFDILTKYCVNKYDTIMQSLVRHSQQNAAAAASEAAA 780

Query: 781 AISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAEHS 828
           A+S+EMPVEVLRASLPHTD +Q+K+L++FAQRSM VGGF NS AE S
Sbjct: 781 AMSREMPVEVLRASLPHTDGYQKKMLLNFAQRSMPVGGFANSVAEQS 796

BLAST of Lsi04G018740 vs. NCBI nr
Match: gi|1009130114|ref|XP_015882122.1| (PREDICTED: uncharacterized protein LOC107417972 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 1013.1 bits (2618), Expect = 2.9e-292
Identity = 555/863 (64.31%), Positives = 659/863 (76.36%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           M+GFG+G+ +  A   +DLK+FG NSTE  +FDASQYAFFGK+V+EEVELGGLED ++D 
Sbjct: 1   MEGFGSGSGIHEAPNPQDLKQFGDNSTEGTVFDASQYAFFGKNVLEEVELGGLEDGQEDF 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
            AAG +EEEF+FD+                  E+ E  R  SDIDDL S+F KLN+  +G
Sbjct: 61  PAAGFDEEEFIFDR------------------EQREVLRSLSDIDDLASTFSKLNKGVTG 120

Query: 121 PR--GVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSS-LAES 180
           PR  GVIG R  R+SS+ +EWA+EE F NW+  Q Y+  S+ EGKRWSS P SS+ L E 
Sbjct: 121 PRNTGVIGDRRSRESSTASEWAQEE-FPNWVDHQFYDAGSSLEGKRWSSQPISSAQLMEL 180

Query: 181 TSLYRTSSFPDQPQ-----------PQQYHQQ------FSSEPILVPKSSY----PPSGI 240
             LYRTSS+P++ Q           P+Q HQ+      F+SEPILVPKSS+    PP G 
Sbjct: 181 KPLYRTSSYPEEEQQVKPLYRTSSYPEQEHQRQNHLQHFASEPILVPKSSFTSYPPPGGR 240

Query: 241 SPHASPNQHSSHLNMPFVPGGRHVVSLSPSNLTP-PNSQIAAGFNPGSRF--GNVPQLNS 300
           S HASPN HS HLN+P++ G  H   LS  NL    NSQ+     P S    GN+PQLNS
Sbjct: 241 SQHASPNHHSGHLNIPYLVG--HQGGLSSPNLNAFSNSQLQLSGPPHSPHFGGNLPQLNS 300

Query: 301 GLSINGGPQSQWVNQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQ 360
           G+ +NG P +QWVNQ GM+PG+HS+ LNNLL  QLS+QNG   +PPQ   Q QQQQHR+ 
Sbjct: 301 GVRVNGRPPNQWVNQAGMYPGDHSTLLNNLLQQQLSHQNGI--MPPQLVTQSQQQQHRMH 360

Query: 361 HPVQPPFGGSLPGFQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPR-SQKGRQNTRF 420
           H +QP F   LPG QSH+FN HLSS    LM+K EAMLGL D+R+QRP+ SQK R+N+R 
Sbjct: 361 HHIQPSF-NHLPGMQSHVFNPHLSS---PLMSKFEAMLGLADLREQRPKLSQKSRRNSRS 420

Query: 421 IHQGHETNSFRNDIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKS 480
             Q  +++S ++D G P +RSKYMTADE+E+I+RMQLAATHSNDPYVDDYYHQACL++KS
Sbjct: 421 SQQSSDSSSQKSDGGLPQFRSKYMTADEIESILRMQLAATHSNDPYVDDYYHQACLAKKS 480

Query: 481 AGAKLRHHFCPNQLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSV 540
           +G KLRH FCP  LRDLPPR R+N+EPHAFLQV+ALGR+ FSSIRRPRPLLEVDPP+SS 
Sbjct: 481 SGGKLRHQFCPTHLRDLPPRGRSNSEPHAFLQVDALGRILFSSIRRPRPLLEVDPPNSSG 540

Query: 541 GGSTDQKVSEKPLEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQV 600
            GST+QK SEKPLEQEPMLAARVTIEDG CLLLDVDDIDRFLQ +Q QDGG QLRRRRQV
Sbjct: 541 PGSTEQKASEKPLEQEPMLAARVTIEDGLCLLLDVDDIDRFLQCSQLQDGGTQLRRRRQV 600

Query: 601 LLEGLASSFHIVDPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMR 660
           LLEGLA+S  + DPL K+GH+VGL PKDD VFLRLV+LPKGRKLL +YLQLL PG ELMR
Sbjct: 601 LLEGLAASLQLADPLGKNGHSVGLVPKDDLVFLRLVALPKGRKLLSRYLQLLFPGSELMR 660

Query: 661 IVCMAIFRHLRFLFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQP 720
           IVCMAIFRHLRFLFG++PSDP  A++ ++LAR+VSL  H MDLGALSACLAAVVCSSEQP
Sbjct: 661 IVCMAIFRHLRFLFGALPSDPGAAETTNDLARVVSLCVHGMDLGALSACLAAVVCSSEQP 720

Query: 721 PLRPLGAPAGDGASLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYC 780
           PLRPLG+ AGDGASLILKSVLERAT LL DPHAASNYN+T+RALWQASF+EFFGLL KYC
Sbjct: 721 PLRPLGSSAGDGASLILKSVLERATELLMDPHAASNYNMTNRALWQASFNEFFGLLTKYC 780

Query: 781 VNKYDSIMQSLLRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQR 836
           VNKY+SIMQSLL Q P N     +DA  +I +EMPVE+LRASLPHTDEHQR++L+DF +R
Sbjct: 781 VNKYNSIMQSLLMQGPPNITVVGTDATKSIIREMPVELLRASLPHTDEHQRQLLMDFTRR 836

BLAST of Lsi04G018740 vs. NCBI nr
Match: gi|731434305|ref|XP_010645006.1| (PREDICTED: uncharacterized protein LOC100267869 isoform X1 [Vitis vinifera])

HSP 1 Score: 1009.2 bits (2608), Expect = 4.1e-291
Identity = 563/850 (66.24%), Positives = 644/850 (75.76%), Query Frame = 1

Query: 5   GNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDTLAAG 64
           G GA +  AS   DL +FG  ST   +FDASQYAFFGKDV+EEVELGGLEDE  D   AG
Sbjct: 8   GGGAGIHEASNHPDLNQFGDTST---VFDASQYAFFGKDVVEEVELGGLEDE--DLPVAG 67

Query: 65  NEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSGPR-- 124
            +EEEFL D+E                  + E  R  SDIDDL S+F KL    SGPR  
Sbjct: 68  FDEEEFLLDRE------------------EGEVLRSLSDIDDLASTFSKLETGVSGPRNA 127

Query: 125 GVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSS--LAESTSL 184
           G+IG R  R+SSS  EWA+EE    W  Q  +  ES Q+GKRWSS PH+SS  L+E   L
Sbjct: 128 GIIGDRGSRESSSAAEWAQEEDLHYWFDQHMFETESLQDGKRWSSQPHASSAHLSELKPL 187

Query: 185 YRTSSFPDQPQPQQY--HQQ----FSSEPILVPKSS---YPPSG-ISPHASPNQHSSHLN 244
           YRTSS+P+Q QPQQ   HQQ    +SSEPILVPKSS   YPP+G  S   SPN HS H++
Sbjct: 188 YRTSSYPEQQQPQQLQQHQQQQHHYSSEPILVPKSSFTSYPPTGGRSLEGSPNHHSRHIS 247

Query: 245 MPFVPGGRHVVSLSPSNLTP---PNSQIAAGFNPGSRFG-NVPQLNSGLSINGGPQSQWV 304
              + GG  + +LSPSNL P   P  Q+ +  + GS+FG N+PQ   GLS+N  P SQWV
Sbjct: 248 --HLSGGPQI-ALSPSNLPPFSNPQLQLPS-LHHGSQFGGNLPQFAPGLSVNSRPPSQWV 307

Query: 305 NQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQHPVQPPFGGSLPG 364
           NQT +FPG+H S LNNLL  QL +QNG   +PPQ   QQQ QQHRL HPVQP FG  L G
Sbjct: 308 NQTNIFPGDHPSILNNLLQQQLPHQNGL--MPPQLMLQQQPQQHRLHHPVQPSFG-HLSG 367

Query: 365 FQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPRS-QKGRQNTRFIHQGHETNSFRND 424
            QS LFN HLS  PP +MNK EAMLG+ D+RDQRP+S QKGR N RF  QG +T+S ++D
Sbjct: 368 LQSQLFNPHLSPAPP-IMNKYEAMLGIGDLRDQRPKSMQKGRPNHRFSQQGFDTSSQKSD 427

Query: 425 IGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKSAGAKLRHHFCPNQ 484
           +GWP +RSKYMTADE+E+I+RMQLAATHSNDPYVDDYYHQACL++KSAGA+L+HHFCP  
Sbjct: 428 VGWPQFRSKYMTADEIESILRMQLAATHSNDPYVDDYYHQACLAKKSAGARLKHHFCPTH 487

Query: 485 LRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSVGGSTDQKVSEKPL 544
           LR+LPPRARAN+EPHAFLQV+ALGRVPFSSIRRPRPLLEVDPP+SSV GST+QKVSEKPL
Sbjct: 488 LRELPPRARANSEPHAFLQVDALGRVPFSSIRRPRPLLEVDPPNSSVAGSTEQKVSEKPL 547

Query: 545 EQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQVLLEGLASSFHIVD 604
           EQEPMLAARVTIEDG CLLLDVDDIDRFLQFNQ QDGG QLRRRRQ LLEGLA+S  +VD
Sbjct: 548 EQEPMLAARVTIEDGLCLLLDVDDIDRFLQFNQLQDGGTQLRRRRQNLLEGLAASLQLVD 607

Query: 605 PLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMRIVCMAIFRHLRFL 664
           PL K GH VGLAPKDD VFLRLVSLPKGRKLL KYLQLL P  EL+RIVCMAIFRHLRFL
Sbjct: 608 PLGKPGHTVGLAPKDDLVFLRLVSLPKGRKLLSKYLQLLFPAVELIRIVCMAIFRHLRFL 667

Query: 665 FGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQPPLRPLGAPAGDGA 724
           FG +PSD   A++ +NL+R+VS     MDLGALSAC AAVVCSSEQPPLRPLG+ AGDGA
Sbjct: 668 FGGLPSDSGAAETTTNLSRVVSSCVRGMDLGALSACFAAVVCSSEQPPLRPLGSSAGDGA 727

Query: 725 SLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYCVNKYDSIMQSLLR 784
           S+ILKSVLERAT +LTDPH A N N+ +RALWQASFDEFFGLL KYC+NKYDSIMQSLL 
Sbjct: 728 SVILKSVLERATEILTDPHVAGNCNMNNRALWQASFDEFFGLLTKYCLNKYDSIMQSLLM 787

Query: 785 QSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQRSMSVGGFINSGAE 836
           Q+  N  A  +DAA AIS+EMPVE+LRASLPHT+EHQ+K+L+DFA RSM V GF + G  
Sbjct: 788 QASSNMTAVGADAARAISREMPVELLRASLPHTNEHQKKLLLDFAHRSMPVMGFNSQGGG 826

BLAST of Lsi04G018740 vs. NCBI nr
Match: gi|1009130116|ref|XP_015882123.1| (PREDICTED: uncharacterized protein LOC107417972 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 1006.5 bits (2601), Expect = 2.7e-290
Identity = 554/863 (64.19%), Positives = 658/863 (76.25%), Query Frame = 1

Query: 1   MDGFGNGARVQVASTSEDLKRFGANSTEDALFDASQYAFFGKDVMEEVELGGLEDEEDDT 60
           M+GFG+G+ +  A   +DLK+FG NST   +FDASQYAFFGK+V+EEVELGGLED ++D 
Sbjct: 1   MEGFGSGSGIHEAPNPQDLKQFGDNST-GTVFDASQYAFFGKNVLEEVELGGLEDGQEDF 60

Query: 61  LAAGNEEEEFLFDKESEDFRPPSDIDDLVSSFEKSEDFRPPSDIDDLVSSFEKLNEVGSG 120
            AAG +EEEF+FD+                  E+ E  R  SDIDDL S+F KLN+  +G
Sbjct: 61  PAAGFDEEEFIFDR------------------EQREVLRSLSDIDDLASTFSKLNKGVTG 120

Query: 121 PR--GVIGGRIFRDSSSVNEWAREEGFSNWLAQQGYNVESAQEGKRWSSHPHSSS-LAES 180
           PR  GVIG R  R+SS+ +EWA+EE F NW+  Q Y+  S+ EGKRWSS P SS+ L E 
Sbjct: 121 PRNTGVIGDRRSRESSTASEWAQEE-FPNWVDHQFYDAGSSLEGKRWSSQPISSAQLMEL 180

Query: 181 TSLYRTSSFPDQPQ-----------PQQYHQQ------FSSEPILVPKSSY----PPSGI 240
             LYRTSS+P++ Q           P+Q HQ+      F+SEPILVPKSS+    PP G 
Sbjct: 181 KPLYRTSSYPEEEQQVKPLYRTSSYPEQEHQRQNHLQHFASEPILVPKSSFTSYPPPGGR 240

Query: 241 SPHASPNQHSSHLNMPFVPGGRHVVSLSPSNLTP-PNSQIAAGFNPGSRF--GNVPQLNS 300
           S HASPN HS HLN+P++ G  H   LS  NL    NSQ+     P S    GN+PQLNS
Sbjct: 241 SQHASPNHHSGHLNIPYLVG--HQGGLSSPNLNAFSNSQLQLSGPPHSPHFGGNLPQLNS 300

Query: 301 GLSINGGPQSQWVNQTGMFPGEHSSHLNNLLPHQLSNQNGFPQLPPQQQQQQQQQQHRLQ 360
           G+ +NG P +QWVNQ GM+PG+HS+ LNNLL  QLS+QNG   +PPQ   Q QQQQHR+ 
Sbjct: 301 GVRVNGRPPNQWVNQAGMYPGDHSTLLNNLLQQQLSHQNGI--MPPQLVTQSQQQQHRMH 360

Query: 361 HPVQPPFGGSLPGFQSHLFNSHLSSGPPHLMNKLEAMLGLPDMRDQRPR-SQKGRQNTRF 420
           H +QP F   LPG QSH+FN HLSS    LM+K EAMLGL D+R+QRP+ SQK R+N+R 
Sbjct: 361 HHIQPSF-NHLPGMQSHVFNPHLSS---PLMSKFEAMLGLADLREQRPKLSQKSRRNSRS 420

Query: 421 IHQGHETNSFRNDIGWPFYRSKYMTADELENIVRMQLAATHSNDPYVDDYYHQACLSRKS 480
             Q  +++S ++D G P +RSKYMTADE+E+I+RMQLAATHSNDPYVDDYYHQACL++KS
Sbjct: 421 SQQSSDSSSQKSDGGLPQFRSKYMTADEIESILRMQLAATHSNDPYVDDYYHQACLAKKS 480

Query: 481 AGAKLRHHFCPNQLRDLPPRARANNEPHAFLQVEALGRVPFSSIRRPRPLLEVDPPSSSV 540
           +G KLRH FCP  LRDLPPR R+N+EPHAFLQV+ALGR+ FSSIRRPRPLLEVDPP+SS 
Sbjct: 481 SGGKLRHQFCPTHLRDLPPRGRSNSEPHAFLQVDALGRILFSSIRRPRPLLEVDPPNSSG 540

Query: 541 GGSTDQKVSEKPLEQEPMLAARVTIEDGHCLLLDVDDIDRFLQFNQFQDGGAQLRRRRQV 600
            GST+QK SEKPLEQEPMLAARVTIEDG CLLLDVDDIDRFLQ +Q QDGG QLRRRRQV
Sbjct: 541 PGSTEQKASEKPLEQEPMLAARVTIEDGLCLLLDVDDIDRFLQCSQLQDGGTQLRRRRQV 600

Query: 601 LLEGLASSFHIVDPLSKDGHAVGLAPKDDFVFLRLVSLPKGRKLLGKYLQLLVPGGELMR 660
           LLEGLA+S  + DPL K+GH+VGL PKDD VFLRLV+LPKGRKLL +YLQLL PG ELMR
Sbjct: 601 LLEGLAASLQLADPLGKNGHSVGLVPKDDLVFLRLVALPKGRKLLSRYLQLLFPGSELMR 660

Query: 661 IVCMAIFRHLRFLFGSVPSDPATADSVSNLARIVSLQTHSMDLGALSACLAAVVCSSEQP 720
           IVCMAIFRHLRFLFG++PSDP  A++ ++LAR+VSL  H MDLGALSACLAAVVCSSEQP
Sbjct: 661 IVCMAIFRHLRFLFGALPSDPGAAETTNDLARVVSLCVHGMDLGALSACLAAVVCSSEQP 720

Query: 721 PLRPLGAPAGDGASLILKSVLERATGLLTDPHAASNYNITHRALWQASFDEFFGLLAKYC 780
           PLRPLG+ AGDGASLILKSVLERAT LL DPHAASNYN+T+RALWQASF+EFFGLL KYC
Sbjct: 721 PLRPLGSSAGDGASLILKSVLERATELLMDPHAASNYNMTNRALWQASFNEFFGLLTKYC 780

Query: 781 VNKYDSIMQSLLRQSPQNAAAAVSDAATAISQEMPVEVLRASLPHTDEHQRKVLIDFAQR 836
           VNKY+SIMQSLL Q P N     +DA  +I +EMPVE+LRASLPHTDEHQR++L+DF +R
Sbjct: 781 VNKYNSIMQSLLMQGPPNITVVGTDATKSIIREMPVELLRASLPHTDEHQRQLLMDFTRR 835

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KZM3_CUCSA  0.0e+00   86.58     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G604530 PE=4 SV=1
A0A061FH59_THECC  2.9e-275  62.59     Topoisomerase II-associated protein PAT1, putative OS=Theobroma cacao GN=TCM_032...
A0A067JMJ1_JATCU  1.2e-273  63.21     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22657 PE=4 SV=1
W9S1T2_9ROSA      3.5e-273  63.40     Uncharacterized protein OS=Morus notabilis GN=L484_002129 PE=4 SV=1
B9RI49_RICCO      2.2e-270  62.72     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1576440 PE=4 SV=1

Match Name   E-value   Identity  Description
AT1G79090.1  2.2e-213  53.26     FUNCTIONS IN: molecular_function unknown
AT3G22270.1  6.2e-160  44.14     Topoisomerase II-associated protein PAT1
AT4G14990.1  3.0e-154  43.74     Topoisomerase II-associated protein PAT1

Match Name                         E-value   Identity  Description
gi|659087137|ref|XP_008444289.1|   0.0e+00   87.67     PREDICTED: protein PAT1 homolog 1 [Cucumis melo]
gi|449453874|ref|XP_004144681.1|   0.0e+00   86.58     PREDICTED: protein PAT1 homolog 1 [Cucumis sativus]
gi|1009130114|ref|XP_015882122.1|  2.9e-292  64.31     PREDICTED: uncharacterized protein LOC107417972 isoform X1 [Ziziphus jujuba]
gi|731434305|ref|XP_010645006.1|   4.1e-291  66.24     PREDICTED: uncharacterized protein LOC100267869 isoform X1 [Vitis vinifera]
gi|1009130116|ref|XP_015882123.1|  2.7e-290  64.19     PREDICTED: uncharacterized protein LOC107417972 isoform X2 [Ziziphus jujuba]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi04G018740.1  Lsi04G018740.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term  IPR Description   Source   Source Term    Source Description                        Alignment
None      No IPR available  PANTHER  PTHR21551      TOPOISOMERASE II-ASSOCIATED PROTEIN PAT1  coord: 1..94, score: 0.0; coord: 114..818, score:
None      No IPR available  PANTHER  PTHR21551:SF4  SUBFAMILY NOT NAMED                       coord: 114..818, score: 0.0; coord: 1..94, score: