Lsi03G004430 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi03G004430
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: UPF0400 protein C337.03 isoform X1
Location: chr03: 4993550 .. 5014091 (-)
RNA-Seq Expression: Lsi03G004430
Synteny: Lsi03G004430
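The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention that this page does not state explicitly), the genomic span of the feature could be computed as below. The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr03: 4993550 .. 5014091 (-)")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, strand, span)
```

The minus strand marker suggests the gene is annotated on the reverse strand of chr03; whether the sequences listed on this page are already reverse-complemented is not stated here.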
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAGGCGCGAGAGAAGACCTGGCGCGCCCAAGTAGCGGCGCCAAATCCCGACCCGATTCACAATCGCTAAGCCAAAATCAGTTCCATTCCACTACCACTTTCCACCAGTTCCCTCACCCGTGCGGCCCAAACTGTAGCGACGTCGTGTCCAGCCAAAAATACTCCTAAACCACACCGCTATTCTGAAAACCCACCCATCTTATCTAGCCAGGCCACCTGATTCCAAAATCCCCGCTTCCCCTTTCTGGGCTTTCATCTTCTTCCCTTTTTTGGCTCATCTTCTTATTCACACACTCTCCAACTTCCCCTTTTCCCCTTCTATTGCCCCTTTGCTCTCTGTTTTCGCAAACCCTTTTACCGTTCCCTGCAATTCCCCTCCTTTCTTCCATGGAGAAAGGCCTCCAACGGCCTAATTTTACCCCCTTTTAGTTGCTTTCTCTCTAACCCACATTCCCTTTTCCAGGTATCTTCTGTTTTTTCTTCTACCTTGTCTATGTTGTTGAGCTCTGGAATTGTAGTTCTATGGGGGTTGTAATTTGTCTCGTGATCGTTCTTACTTTGTTGCTTTCGAGTATTTTGCTGTTTTTCGTTTGCTTTTAGCCATCTACTTTCGAGTGCTTTGTTTTATTTTATAGCTCATTTGCATCTTTTTGTTTTTGTTTTTCCCCTTGATGAGTGAAGGGTAGTTTCGAAAAGTAGATCTTGACTTGCGGCTACATTATTAGTTTTTTGATGTCTGAATTTTCTGTTTTGTGTAGGAGGGGTGAATGATTGTTTATTTTTTGATTTATTCTTCGAACTGGCTGGAATTAGTATAACTGCAGTGGATTAAAATATGCATTCTAAAATTTAGTCATTTATGGTCAGGAACTTGTTTAAATTTTGGGGAAAACTTATTATTTATACTTTGCTATTGAGTCAGTCATTGAAATTGACTCTGAATTTGCATAAGATATTCTCCTAAAAGCCAAAGTTTAGATACCACACGTAAATAACTCTTACCTTCCCTCAACTTTCTTTTGCATTTTCTTTTGGAGTTGATGCTTTGTCTCTTTTTTGAAAAAGGTGGCACTGACACTGTGCACGTATGTGATCTTGTTATAAAGAATCCAGAAAATTTTCTTATACTTACATCAACAACTCCGTTTTTAATAACTAAAATTGTAAGTTGCACCAAATAGCTTCAATCTTTCCATCCTCTCTCTTCTACATCTTCTTTTGGAATACACTGTTCTGTCTAGGAAAAAGGTGCGCTAAGCACTCTTGGGATATTATTACAAAGAATACATAAGTTTATTTTACTTACATTATTGACTTTATTTTTACAACTAAAACTGTAAGTTGACAAAAAAAAATGTCTGAGTTGCGTCCTCCAAATTTTAAATAAACCAAGTTTGTAGTTGCATATCAGTCAGTATCACATTTGAGGTGATGTTTTATGTTTTAGTAATATGTTGAGATCACACATCCACTTTATTTATTGTTGGGCCTTTGTTCGATATGTAGTCGATATCTGTAAATCAAACAAAGCACAAGCGCAAGCACAAAAAAAAGGAGAGAAAAATAGTGACACCAAGATTTACGTGGAAAACCCCTCAGTGTAAGGGGAAAAACCACGGGACGGCTCCGAAAAATACTTCCACTATATGTATAATGGGTTACAACAGTTTCTTCCCTAGTCGTAACTAGAGGATACAACAATCATCAGCAAAATAAATCACCGGCAAACAAAATAATACATAAAGCACCCTTCATCAACAATAGATGGAATACAAATGCTAGTGGAGAGATCTCACCAATGAAGACAACTTTGAGAGAACAATAAAGAAGGAAATAAACCACTTCTTCGGACACAACCAGCACCTTCTTCTTCTTTTTCTCGCGTCTCTCTCGGTCTCACACTTGCTGATTTTTTTTTTTTTTGTCTCTGTGTGTATCTATCTCCAGCAGCCCACCAACTTCTATTTTGTCCAAGAAAAAGAAGGAACAAGCTTGCTGAG
CAAGCTTTTTAATTTCAACCACATAATTATAATAATATAAAACTTAATATGGTCATTTTGGGTCATAAATAAATTTGGGCCCAACTTTATTTCTAAAGCAGTCGGGACCCAACAAACTCCCCCTCCCGACTATTCAGAAGGGTCAACCATACCCGCCTTCGATCTACACAATTCATGCTTTCCTCTAGGTAAGGCCTTTGTCATCATATTTGAGCCGTTCTCATCTGTGTGTATCTTTTCAATCTCTAACAACCTTTCATTCAACGTGTCTCGAATCCAATGATATCTGACATCAATATGCTTGGACTTCGAGTGGAAAGTTGGTTTCTTGCTAAGATAAATAGCGCTCTGATTGTCACAGTACAACACATATCTTCTCTGTTCAACTCCCAACTCTTGCAAGAATTTCTTCATCCAAAGCACTTCTTTGCAAGCTTCAGTCGCCGCTATAAACTCAGCTTCTGTAGTTGAAAGAGCAACACACTTCTGTAACTTGGATTGCCACAATACAGCTCCCCCTGCAAAAGTGATCAAATAACCTGAAGTGGATTTTCTAGAATCAACGTCACCTGCCATGTCTACATCTGTGAACCCATTAAGCACAATTTCCCCATTCCCAAAGCATAAGCATACTCTGGAAGTACCCCTGAGATATCTGAAAATCCATTTCACTGCATTCCAATGTTCTCTACCTGGATTGGAGAGAAATCGACTGACAATGCCAACAGCGTGAGCTATATCTGGCCTGGTACAGACCATTGCATACATCAAGCTACCAACGGCTGAAGAGTATGGTACTCCATTCATGTCTTCTTTCTCTTTCTCATTAGAAGGGCTCTGTTTGGAACTGAGCTTGAAGTGATTGTCTAGTGGAACACTTACTGGCTTTGCTTTGTCCATGCAAAACCTCTCAAGCACCCTCTCAATATATTTTTCTTGTGATAGCCACAACTTTTTAACATTTCTATCACGGGTGATTCTCATGCCAAGAATTTGTATCGCTGGGCCCAAATCTTTCATAGCAAAGGATTTATTCAACTCCATCTTCAATTCACGAATCTTTTTGCTATCAGCACCAACAATCAACATGTCATCCACGTATAGTAGAAGAATAATGAAATTTCCATCAGCGAATTTCTTAACGAAAACACAATGATCAGAAGTTGTTTTCTTATAACCATGCTCCCCCATGAACGAATCAAACTTCTTATACTACTGCCTCGGTGCTTGCTTTAACCCATAGAGACTCTTCTTTAATTTGCACACCAGGTGTTCTTTACCCTTTACTTAGAATCCCTCTGGTTGTTCCATATATATCTCTTCCTCTAAATCACCATGAAGGAATGCAGTTTTCACATCAAGCTGCTCAATTTCCAAATTTAAACTAGCTGCTAATCCTAGCACAACCCGTATGGATGACATCTTCACCACTGGTGAGAAAATTTCTTCAAAATCTATGCCCTTTTTCTGATTAAACCCTTTCACCACTAATCTTGCTTTGTACCTTGGTTGTGAGCTACTTTCATTTATCTTCAGCCTGAACACCCATTTGTTCTTGAGTGCTCTCTTGCCCTTAGGCAACTTCACCAGTTCATACGTATGGTTCTCTTTCAATGACTTCATCTCCTCCTGCATTGCTTCGAACCACTCATCTTTACTTTCATCAGTTATAGCTTCCTCATAGCTTTCTGGCTCTCCCCCATCAGTAAGCAATAGATATTCATGTGGAGAGTAAGTTCTAGAGGGCTGGCGATCCCTGGTGGATCTTCTAACTTCAACAGTTTCTTGAACTGGTGGCTCTGAAATTGGTAACTCATTAGGGGGCACAACATCATCAATATTGTCTGAGGGGGTGTGATTATCTCTAACATCGTCTTCCATCTCTGAAAGAGAAGGACTCATATCTACCAGATCACCTTTTTCAAGTTGTGGTTCCGTTGCAGTCTCTATGTCTTCAATAGTCTGATCTTCAAGAAATACAACATCTCGGCTTCTAACAA
GCTTCTTCTCAACTGGATCCCACAAACAGTAACCAAACTCATCATATCCATAACCCAAAAATATACATTGTCTAGTCTTACTGTCTAGCTTTGATCTCTCATCTCTGGGGATATGTACAAATGCTTTGCATCCAAAGACCTTCAGATAATCATAAGAAGAATACTTTCCTGTCCATATCTTGTGTGGCATTTCACCTTTCAACGGTACAGCAGGAGAAAGATTTATCAAATCCAGTTTTCACAGCTTCACCCCAAAAACATTTTGGCAACTTTGCATTTGACAACATACATTTTGCTCTTTCTATAATGGTGCGATTCATCCTTTCAGCTACACCATTATGTTGAGGAGTCTTGGGAACTGTCATTTGATGTCTAATCCCATGTCTTTTACAATACTGATCAAATGGACCTCTATACTCACCACCATTATCTGTTCTAATGCATTTTAGTTGCCGTCCAGTCTCTCTTTCTACCTTGACTTGAAACTCTTTAAACACATCTAAGACCTGGTCTTTAGATTTTAGTGTATACACCCACACTTTTCTAGAGTGATCGTTGATGAATGTAACAAAATACAAAGCCCCACCAAGAGATTTTTCTTTCAAGGGGCCACAAACATCGGAATGCACCATATCCAACACATTGGAACTTCTATGTGCATGAGTCTGGAATGAAACTCTGTGTTGTTTACCAGCTAGACAGTGTGAACAAGTTTTGAGAGACATACCTTTTTCCTGCGGTATGAGACTTTTCTTCGCAAGAATGTCTAACCCCTTTTGGCTCATATGGCCCAGTCGTTTATGCCATAACTCCATGAGCAACTCATTTTCAACTGCATTTACAAGACTATTTGCAATCTTTGCATTCATCACATACATTGCAGAGTCTCTGTTTCCTCTTGCTACTACCATTGAGCCTCTAGTGAGTTTCCACTGATTATCGCCAAAATAACTGTGATAGCCGTCGTCGTCAAGTTGTCCTGTGGAAATGAGATTCAACCGCATATTTGAAACATGTCGAACATTTTTCAGAAGTAATCGACAACCATTATTGGTATCCAAGCACACATCTCCAATACCAATGATTTTTGATGTACCGTTATTTCCCATCTTTACTGTTCCAAAGTCACCTGATTTATAGGATGAGAAGAACTCCTGCTTGGAAGTAACATGGATTGCCGCACCACTGTCAATTATCCAATCAGTATCTTGACATGTGAAATTAACTGCACTTTCATCACAAACAACATATAAATCACTTGTGACTATTGCAACATCCTTTTCTTCATTTCCTTTTTCATTCTTTTCTTTGTTCTGTTCTTGTTTCCATTTTCGACAATGGATCTTCTTATGACCCATCTTCTTGCAATGGAAGCATTGTAAATCTTTGTTTGATCTACTCCCAGATTTCCCTCTATTTTGATACCGGCTACGACTCTGGCTTCTCCCCCAGTTTTCTGTGACAAGAGCTTCTGACTGACTTGTTGAAGGATTCATTGTTTTCCTTCTCACCTCTTCATTTAGAAGACTACTCTTGACAATACTCGTAGTAACTTTTCCATCGGGTGAGCAATTACTCAAAGAGACTACCAATGTCTCAAAGCTATCGGGTAATGAATTGAGGAGCAGCAGTGCTTGAAGTTCATCATCCAAGTTCATTTGCATATTTGATAGCTGATCCACAATGCTCTGCATTTCATTTAGATGTTCTGCAATTGGAGTCCCTTCTTTATATTTCAAATTTACAAGCTTTCGTATCAAGAAAGCCTTATTGCCTGCAGTCTTTCTTTCATATAAGTCTTCCAACTTTTTCCATAAAGAATATGCTTTTGTCTCTTTGGCAATATTGTGATATACGCTGTCATCAACCCACTGGCGTATAAATCCAACAACTTGCCTATTCAAGAGTTCCCAGTCTTCATCGGACATATCTCCCTTAGCTGACTCTCCCTTGATCGGAGCATGAAACTTTTTGCAATAAAGCAAGTCTTCCATTTTTCC
CTTCCATAATTGCCAATTTGTTCCATTCAAACTGACCATTCGGCTTGTGTTAACATCCATGGCTACACACCAAAAAATTGACCAACCAACTAAGTCAACGAAATCAACACAAACTTTGTTGGCTCGCCAATTTGTCGCTAAATTTAACCTGGCGCTCTGATACCACTTGTTGGGCCTTTGTTCGATATGTAGCCGATATCTGTAAATCAAACAAAGCACAAGCGCAAGCACAAAAAAAAGGAGAGAAAAATAGTGACACTAAGATTTACGTGGAAAACCCCTCAGTGTAAGGGGAAAAACCACGGGACGGCTCCGAAAAATACTTCCACTATATGTATAATGGGTTACAACAGTTTCTTCCCTAGTCGTAACTAGAGGATACAACAATCATCAGCAAAATAAATCACCGGTAAGCAAAATAATACATAAAGCACCCTTCATCAACAATAGATGGAATACAAATGCTAGTGGAGAGATCTCACCAATGAAGACAACTTTGAGAGAACAATAAAGAAGGAAATAAACCACTTCTTCGGACACAACCGGCACCTTCTTCTTCTTTTTCTCGTGTCTCTCTCGGTCTCACACTTGCTGATTTTTTTTTTTTGTCTCTGTGTGTATCTATCTCCAGCAGCCCACCAACTTCTATTTTGTCCAAGAAAAAGAAGGAACAAGCTTGCTGAGCAAGCTTTTTAATTTCAACCACATAATTATAATAATATAAAACTTAATATGGTCATTTTGGGCCATAAATAAATTTGGGCCCAACTTTATTTCTAAAGCAGTCGGGACCCAACATTTATTTTATTAGAGAGAGATTTTTGTTGAGGGGAAAAAGAACAAGCAACTGCTGCTACAAAATACAGAAGCTAAGAGAACATGCATAAATTATTTACTTCATTTATTCCCTTTTCTTTGAACATTTTATCCTTGCACAGATTGGCCTGCATATTCTACCTTATTTCCGCAATAAAGTTATCCAAATCAATGGGTGGTACATTCAATCCACAAATTTTGGTAGACAAGCTAGCCAAGCTCAACAACTCACAGGCGAGCATTGAGAGTATCCTTTTTGAAATTTGTACTCTGTTTCGCCCAAGTTATTATTATTTTTTTTTGTTCTCTTCTTTTGACTAAATTAATTTTTTTTCCTTATTAGTTTTATCATGAATATTTATTGCTTTAGTTCTCTTGTTTTGTTGGCAAAACTCTGTATTGAGTTCTTACAGGTGTTTACCCCCCATTATTTCTTTGGCATTATTGTTGTGGTGAAGTTGTTGTGTTAACCCAGTGAATCTGTAATGAATTACTTCCACTGTTACACATTTGGAAATTGTTTAGCTTCCATTAGTTTAGTAGTTAGACTACCTATATTTAATCTTATGTTTTGGGAGAAGTTCTACTTCTCTTCATCATTGGTGTCTTTGTTTAACAGTCCTGGTGACGTTCCTTTGCATTTGCTTTTACAAGATGGGGTTTGAATGTATGTTCTCAGTTTGTCTTTTAGGACTTATTGTGGTTTTTTTCCTTTCTGAGTTCCATCTTTTCTATGTCTGTGTATGTATATGATATGGTATAATGGTACAGTACTGTGGACCTGTGGTACTGCTGTGCTGGTACCTAATAGTCCATTTCTGGTATTCTTCTATGACACTATAACTATATGAAATGAGAATATGAGAGTATGGTTATTGGTACTAGCCACTATGGCGAGGCACTAAAAGTGCATTAGAGAGTGCCTAATTCCTTGGTGTGCCTTATTACTTATCTGTTTGGTATATTAGTTTCATCCTTAATGTGATTAAGCTTTATCCCACTGGTGTATATTTCACATGAACAAAGCCAAGCAAGTTGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGTGAGCAGAGGTTGGCCTATCTGTATCTCGCAAATGACATTTTGCAGAACAGTAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCTGATGCACTT
CGTGATGTAATTGGGAATGGGGATGAATTTGGAAGAAATGCTGCCTTACGACTGGTACATCAATGCTTACCGTGCATCCTTATCATTTGTCAATTTTTTTACTCGTGCATAAAATTTCTCGAACCTCCATGTTGTATGTTTCTGTCCTTAGGCGAAAATCTAATTGGATACTTGGCATTCAAGTCTCTTGTACTTTGTGAAATCCCTACGTGATGACAACAAATATTGATGTTGTGGGTTGAGGGAAACCACAGAATTTGTAGAGTCTTCTTTTAGCCATTTCCAACATCATCCCCCGATCTCTCCATCTCTCTGAAAATCCTATTCCTCTTGATCCAAATACCTCACATAAAAACGTTGATGACAACAAATATTGATTCTCTCCCTCTTCATCACCCTTTCAAGGAGACCAAATTGTGGCTCAACTTTATTATTACTTTGGTGTTGTGGGTTGAGGGAAACCATAGAATTTTTGAGGCAAGGATGAAATTTTGCTCTCCTTACCTGCTATTTGGTGTAAGCTTCGTAATTTCTTTAATAACTCAATTTCTGATACTTTGGCTAACTGGAACCTTCTCATGGCTTAGGCTAGAGGCTTCCCCCACTTTTGTAAATTTATTTGTCTTCATTTGTAGCTAACTTATACTCAATTTACTGCCCTCTCCCATTTAATCTGTTGGCCACCAACAAAAGTAGCATACTCGTTCCAACTTTCCCATACATGTACTATAGTTCCTTTACTAACTAAACTCCAATGTGCATTGAATATGGATCTTCAACATGGGAATAAGTTAGTTGTAAATGAAGATCTTCACAATTTTTCCGAAGTCTTCACAATTTTCTGTGTCATTCTTTTGTGTTCAATGATTCTTGATATGTTGAAGTGGAAAGATTTCTGATTGTTTTGTCTGGACTCTGTTATAACTTATCATGTGTTTTATCTATATACCTATTTTAATTTTACGCATAGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGACAGAGTCTTAAAGAAGAGATAATGGGAAAGCATTTGGAAACTAGTAATCGAAATGGGAAGCCATTCAGCAGTAAGCTGGTAAGATATTATTGCTGTTTCATCATGTCCGAGTTATGTTCATGAATAGTCTGACCATTAAATCACCTATTCTCACTTTTCTTTTCAACAGAAACAATCTGCCAGTGTATCATTGGATAAAATAGTCTCTGGTTACCAAGTTGTTTATGGAAATGAGATTGATGAAGATGCAGTATTGAGCAAATGCAGGAATTCTATTAGCTATCTCGAGAAACTGGACAAAGAAATTGGCGCTGATGTCAGTTCAGGTACTTATACATTGTTGATGGGCTTACAAATATTTTTGCTGCATAGTGAGTATTATATTAGCTCTCTCCTGTGTAAAACTTTTAAGATTGGGGAACAGTGCGTTTCTGAATTGGGGAACTAATTAACTGTCAGTGTGTTTTTTTTGGTTCCCTCCACCTCCACCTGACGACCTCAGACTCAGACCACAATAAACAGGGTTTTTTATTTTAAAAAAAAATTAATTTTTCGTTTTTGAGAAACCAAACACGCGAATAATTGGAAAAAGGTCTTTATTTTCAGTTTCAGCGCTTTGTTTGGCTGGTTGGGTTTAAAGAAACAATCGCATCTTGGAAAGATAATGCAGTGTTGGAAGTGCTAACTGCAAGTATTTCGGGATAAATATTACTGTTTGTTTTGTGGTCTAGATATGTAAAATTGGCTTCCTTGTGTTCTAGATGATCAATATATATACATATGCACGGATATGTGTATGTGTGTGTGCATATATATATATATATTAAATTTGCTCTTTAATCTTCTTTGAAAAAGGATTTTTCGTCCCTTCTCATGGATACCTTTATTTTTCTTTAACCAACATATCATGCGGTTCCTATCTGAAAGAAAAACCTCTAGTTGTAGATTTAGATGTAAGAAGTACAAAGTAATTTGGTGATATTATTGA
ACTTATTTATTTATTTACTTTTTAACTGTTCTTTTCTAATATTGTTTGGGTTGTATAACAAGCCCTTATGATTTCATTTCAGTGTTTCTAGTTCTCATCTCCCTTTGGTAACTTACATAACTTGTTACATTGTCATGCAATGTGAAATTAAAAAATAAATCATGTTTATTGTTTATCCACTTGTTTCTTGGAAGCACAAACACTCTTTTTGACAGTGCCCATGTTAGACACGTGCCGGGCACGGATATGATTGGACACTTCGAACACATCTCATACATGTGTAAAAAAAGGTTCCTATTTTATTTTTAACCTTTTCAATTTTGAACACATCAAGGGCATGCCTGAGACATGTCTAGACACGTAGGGGACACACAAACACTTCAACTTAAAATTTTTTGTTGAGAATTTAAAAAGACAAAAAAACAGTTAACCTTGTCATTACTTCATGCCCATTGTCCAACCCATTTTCTTGTTTATAATCATCCCTAAAATTTAACCTGACCTTAAAAATTCACGAAGACAAATAAATTAGAGCATAATACGTCTAGGGGTTTGAGGTTGATGTCTATTTGGTTCCTAGGGTTTAAAAAAAAATGCTTTAGTGCTTGAGATTTCAAAATTAGCTCTAAATGGTTTATCAATTGATTTGATTGTTAGGTGACTAACAAAAAAATGACTTGGTATGTTGAGGTGACAAAACTTAAATAAGTTGACTATTTTTACTAGTTTTATATTATTTTTTATGGATTGCTTGGATTTTTCTTCCTTCTCTCTTCCTTTCTCTCTTTGGCCTCCCTCTCTTTCCCTTCTTCTTGCCCTTTCTTTCCCTCATCCTTCTGCTCATGGTCTTCCCCAGTTCATTAGTTTCATCAAACTTGTCGTCCCTTTTCATTCTTCCCCATTGTCAATATCCTCCTCCTTATAGATTAAAAAATGGATGTTAATTTCAAAACCAATTTCAAGAATCCCACTTCCAGAACCACTAAGAACTGTGATGAACCTCAAACACTCATAAAGCCCCCAAAATCCCACCCATAAAGCTAAACCACTCACCAATCCCCTAATTCCTCTTTGCTCTTTATTTCCTTGGTTATTTCATTTTTCCTCTATGAAAAATTGTTGGGTGGGGGCTATTTTGGCTCAGTTCACGTGGTTGCCCAAAGAGAAAGGATATTTGTGGTTGCCCAAAGAGAAAGGATATTCATATTTGGAAGTTGGAACCGTAGTCCTTGGGAAGGGTTCTCCTGTAACCCTAGTCCTTTGATTTCATATTTGCCTCGTGAAAGGTGAAACTTTTCAAGGAGTTTAGGTTTTTTGTTTGACTAGTTATCCATGGAAGTTTTTTGTTTTGTTTTGTTTTTTTAAAAAAAGGAAACAAGACTTTCATGGATAAAATGAAACAAGAAAATAGACTAATACTTTTTTTTTATATCCGTGAGTGTCTGGGCTAGCTTACACACACCTGACTAATCTCACGGGACAACCCACCTGATCATACAACATTTGGGTGTCAAGGAAACTTGTAGGAAATTAATTCATAGGTTGGTGACCACCATAGATTGAACCCATGACCTCTTAGCCATTTATTGAGACTTTGTCTCCTTTTTTACCACTAGGCCAACCCATGATGGTTAGAAAAGAGACTAATACTCAAAATACAATGAGACAAGATAAAACAAAATAAAGCAATTGCGGCCATTACAGCTAGATACAATTAATAATAGCACTCAAAAGAGATGTTGAAACACTTCGTATTAGCCAAGCATACAGGACAGTTGAAATAAGGATGTGAAAAAACCCATAATTTCATACTCAATGTCTCAAAATGAAAAAAGAGGAATCCTATTGCTGGACAGAATATCAGTGATCCAAAACTTCCTTTTTATGGCAGCCAAGAAACTGTGAAAAAAAGCTGGGAATTTCATCTCCTAACTTCAGTCAATTAACCCTAAATTTTCAAATCAGATTTCTTTCTTCCGTTGGCTATAACTCTTCATAAA
TTCCTCAACCCAATTTTTGGCTAGGGTGATTAAGCCAACCATTGACAAATTTGAAAGGGATCAGAGCCAATTTTTCATCTCCAAGAGTAAAGAAAATATAGAAGTGGTAGGATGTAATCTTTTTAAGTACCCATGGAAGAATTAACACCTTGGAAAGGGTGTCGAGGAGGGTTCCCAATTTGCTGGGCCTGTTTAGTTGCATCTTGTGTAGGAGAGCGAAAAAGAAGTTTAATCACATTCTTTAAGAGTGTGGCTTTGTTTTTTCAGTGCGGAGACAATTTTTGGAGGAGCTTGATGTTAGCTTGGCTAGATGTCTGCAAAGATATGGTTGTGGAGTTCTTGTTCCATCTTCCTTTGTGAGAGAAAGGAGGGTTTTTGTGGAAAATTGGGTTGCTTTGGAGAGAGAAACCATACGATTTTTAGAGGGTTTGAGAGATCTCCTTGCGAGGCTTGGTCTTTTATTCAGTTCAACATGTCTATGTGGGCTAAGTTTTTTGCAATTATTTGGCTGGTCTTATATTGCTAGACTAGAGGTCTTTTCTTGTAAGTTGCTTCTCTTTTGTGGGCTCTCTTTTTGTATGGCTTAATTCTTTCATTTTTTTCCTATAAAAGTTCAGTTATTCATCTTAAGAAACAACAAGCTTATAACCGCTTTGGGGTTCAATGTTTGACCGTTGAGCGCCTCTTCCAACAGTCCATTACAATTGCTCTTGTGTCTTCTTCCAACTCAACATCCCTGCAGTCCCAATGATTCAACAGTCCTTTTCTTGCTATAGCTGAATTGACTTCCTCCTACGTTGTCGCCTTCATCGACGATTTTGGGTATGGTCTCAAAATAATTTCTGGCATTGGTGCTTTCCCCCAAATTGATTCCACCAACCCTAGAAAGCGAATTAAGAAATTGGATAAATTTGCTTCTTGTTTACTACATCTTCTTGGTCGCCCTCCTTGGAGCTGTCATTCTTGGTGGTTTTGGTAGTTTCTGCTGATACCAATATTTGTCTTGTCCACAATCCCCACGCTTGTGTCAATGTTGTCGATACCAACAACCTCAACCTCACTGTTAGCGAAGGTGAAGTGAAAAGATGAAGGGTGGGGGGAGAAAAGAGGAGAGAAAAGGAAAGATAGAAGGGAAAAAAATCTATGTGATCCATAAAAATAATATAGAATTGTAAAAATTTTCAACTCATTTAAAATTTTTCCTCTCAACATATCACGTCACTTTTCTGTTAGTCAACTAATGGTCAAATTAAACTAGGGACTATTTAGAAGTTAGAACAATTTCCCAAACCTCAGGAACTAAAGTGACTTATTTTGAAATTCATGGACCAAATAAAAAAACCTCCAACCTCATGGACAAAAAATGTATTTTGCCCAAGAAATTAGGTGCGGAGGTCCATGCTCATTTTTGTTAGTCAACTAACGGTCAAATCAAACATATTTAATTTTTTATTGTTTAATTTATGTGTCCTCGATTTTGAAGTATCAAACCTTTTAGTATATATATACATATTTCTTTCTAAAAAACATATCCTAGTGTGTCCATGTCCTAGATTTCTAGAAATGGATGTGTTGCCATGTTGTATCCATGCCCATTTTTCTTAGCTTGTTTTGTTTTAGTTAAAACTTTTAGTTTGTTGCCGTTTTGTTGTTATTCTTCTTCTTTAACTACACAGGGCAATACCGTGGATCTTCAATGGCAAATGATTTGCGGGGACATCATACCATTTTGAGGGACTGCATCGATCAATTAACAACAATTGAAACATCAAGGGCAAGCCTCGTGTCTCATCTGAGAGAGGCTATTCAAGAACAGGTATTTTGGCATCCCTCTGGAGAAGAATTAATGACATGGTTAATTGAAGTAAATCTGATTTTTTTTTTTGAAATAAAAATTTTATTTTCAGGAATTCAAATTGGAGCAAGTCCGAAATCAACTTCAGGTTTGCATTTCTGCTTCGCATTGTCTTGGAGTATAGGGTCAATGATTCTTCTTTATT
TTTCTAATTACTTGGTCTAGATTGAACCTATATGAAAACTTCATTTGTTTTGTGGGATCGGTTCTTTTTAATATGAAACTAACCTTCCTAGCCTCAAGTTGAAATTCATGCCTAGTCGTTCTTTTCCATTTTAAAGTTCATACTTAACCAAACCTTGAAATATATATATATATAGGAAAATTATTATAAACAGAAAAAATATCAAACTAATTACAAATATAGAAAAATTTCACTGTCTATCGCAGTCTATTGTTGATAGACAGTGAAATTTTTCTATATTTGTAAATAGTTTGGCTCATTTTTCTATATTTGAAAACAACCCATATATATATATATGTATGTATGCATATATGTTCATATATATATATGTATGCATGATGTACATATACATATAATGAATGCTCAACCAAACCTTGATAATGCAAGAGAATCTAATCCATGGATAGGCTTTTCGCATTAGTGTCATTTGAACCATACTGTATATGTGTGAGTTGATATAAAGCAAGTTAGGGTTCCTTGTATCTATCCTAAGAAGCACAGATACAAAAACGAGACACGAATACGATACGACATGGACACGGCGACACGCCATATTTTAAATATCTAGGACACAACACGGCAAGGACACGTTTATTAAAATATACATTTTTAAAAATATATATCATTTTCATACCAAAATAAAATTCAAAGTAAATGGGTTGATGCATTTATATGCTTAAAAGATTTAGCTTGATGTATTTCACACTCAAAAGTTATTATTATCGTCATATATGTGTCTTTTTAGTCTACTCAAGTGTTCTATGCATGTCTAACACGTTTGTTGCACTAACAAGTGTCCAATACGTGTAGAACAAGTGGTGGAGCATCCAAGTGTCGGACACGGACACGCTAGCCTTTCTGAGTTGATGAATATTATGATTATTCATTCCTTGTTTAGTGGATTGGGGAACAATCGTATAAGTTTTACTGGAAGCTTTATGCTATAGTCTCTTTCTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTAATTTTTAATATCCATGAGTGTTCAGGCCAGCTTTACACGCATCTCAACGAATCTCACGGGACAACCTGCCTGACCCTACAACAATTGGGTGTCAAGAAAACTCGTAGGATATTAATTCCTAGGTAGTGGCCACCATGGATTGAACCCATGACCTCTTAGTTATTAACTGAGACTATGTCTCCTTTTTTACCACTACACTCATGAGGTAGGAGTGCGAGGAAAAACTGACAAAACCGCCTAAACTGCTTAAACCGCTCATCAATTCAAGTTGAAACCGACCGGGTTGGCGGTTTCAGGTTGAACATATATAAAAACTGAAATTTTCGGGTTGGGTTCGGGTTTAGCAGAAAAAGGGAACTGGAAAACCGAGCCGACCGATAAATTATATATAATAAGCATCAAACCTTAATTTCATAAAACACTTTGGTAGGTTGGTGAGTAGGTGCATTTATATACACGAGGTGGTGAGATCGAAATTGAGGGTATGCAAGGTTTTGTTTTATATTCGGAATTTCCAAAAAGAATAGTCCCTCTTCCTCAGTAAATATCCGGGCACCGTTATTTGTAAATTGAGAACCGAACCGCCCCAAACTGCCAGTCTGAGCAGTTCGGGCGTCGCAGTTCATCATTGAACCCGGTTCAGTTTTGGTTCATGCATTTCTCTTAACCAATTAATCGGTTCGGTACCGATTTTCTATAAAAACCGAGCCGAACTGAACCATTTACACCCCTACCATGAGGCACTGTTTTCTCAAACCGAGGAAGACTTTGACTTATCTTTTTTATGCTATAGTTTGAAGAAGAAACTTAAAAAAGATAAGTCAAGGTCTTCCTCTGTTTGAGAAAACAGTGCCTCAAGTATCTACGGCTGGTTTTTGGTGCAAACACTTCTCTCCTTTTGCTGATTATAGAGTTACTTTGGGTTTTTTTTATCTAAGAAAAACGATTATATTCCTTTGCTTTCGGGGTTGGCAAGGCTGGACCAAACATACTTTAGCCTTAATATTTATTTTAAATATAAAACATATAATTGAATTATGTTACCAGTGCAATTGGCTATATATACACTCTTGTAACTTCCAGGTCTTGAGTCGGCCCCTGTGAATGCGTCTTTTTTAATGTGAATTACAACCAATGCAGTTGGGTTTCAGCGGCACAAGCCTAGTGTCAGCACCCATTCCTATTTCAAGGGCCCAAGGCCCAACAACATGAAGATAAAGTGTTTGGATTTAACGCAAATGGCGAACTGCTAAACCGAACTAACTGAACTTTCAACTTATCAGTTTGGGGCCGTTCGGCTCAATTTTTATTTCTTTATATTTTGGCAGTTCAGCAATCTAACCAAGCTGAATCAAACTGATTACAACCCCCGTTCTATACTTTTGGCCATGTGGAAATCCTTTTGTAATTCCATTTTGGGCGCCACTCCATCCCCTCTTTGTATACTCATATTTTCAATTTCTTCTTAAAAAAGGACTTCCCCTCTTTATATATGTGACGAGCTAATCTTCATTGAAATAACTCACTCAAAATTTTCTTTTCTAGTTCCATTTTCCAATCATAAGATTATAAGGAGTATAGAGATATATTAAAAGATGGCAGTGTTTTATATTTGAACATGATCTTTGAAGAGATAATGTAAAAAATGCGTGCACTCACACAAATCTGGGACTTCCACTGTCAAATCCACTTTCTTGAATAGTTTCTCAGTCTTTCAGAGCTATGGATACA
ACCTTTAGTTAAGCACGTCTAGGAAGGAAAATCGGAATGAAAGGTTTTCTTTTAGTCTAGGTTGATTGCTCGCACACAGACTGACACATAGAATCCCCAACATTGATGGAAAAGTTCCAAAAGAAAAACCCTAGCTAGTTAAACTCCCAAATGTTTGCGGCATGCGCTTTAGTTCCTGCATTGCAATTTTTCTGCTAGACATTTGTCCCTCCGGCTGGAATTTTTTAATACCTTGTCTATACTACGTAAACAGTAAAGGTCTTTGAGGGAATGAGTTTAAGCCACAATGACTATTGTCCTAAGATTTTATATCATATGACTTTTCTTGGAAACCAAATTTAGTATGCATGCAAGCTGACCCATGCGCTCACGGATATAAAAGAAAATTGTTGTGTGTACTCGTGAATACGATTCACTCATGGCTGTCCGGTATTTTTGTGGGTTGGTGGCTAAAGATAGAGCCAACGCTAAAGTTTTTTGGATGTGTGGATAGATTAGATTCTCTTGTAGGAGTCCTTGGATTGAGGAATTATGGATTTTTGAGATGTAGTTGAACTTTTTGAATCCTTTTACAAACTTGCACTGTATACTGCCTCTATGATTATGTATTTTTTTTTTTGAAAAAATTGGATTGGAGATTGTAATGGTCTTTTCCCAGCCACTGGATCTTGTGATATTTGTCTTTTGAGGAGGCAAATCCATTGACATGCTGAAGATACCAGCATGTTTAGGTAGAAGAGGCTGGGCTTTCTGGAGGGAACCAATTTGCAACTCTACATTTTGTGCTGCAGGAAGCCTGAGAAGTAGAATGGAGATGTACCCTAACACGCCTATGAGTTTTCCTAAACATGTTGAGTAGTTAGGAAAACTGGGTTTAACAAACTGAACAGCTGTGGAACTGTCAGTCAAACATGGGCTGGTTTTGGTGGCATAGCCTAAATCCCAGATCCATCGTTTTTGGAGACTTTGTTAATATATTTTTACCGTCTTCCCTGCTGGCTATCTTACAGAAGAACTGCTAAATTGATGGAGGATACATCAGAGAAATAGGGTAGAGAGAAATAACAGGGAGAGGAGAAAACCAGAGTTTCCAAATCCAAATAAAATATCCAAAACTGCAAAGTTCAAAGATTTGTTTTGCTTTTTTTTTTTTTTTTGGGTGGGGTGGGGGGATTAAACAAAGCCTTCGGGTATCCTTTTTAGCTTGGGTTTCCATTTTCCTTATTTGGTTGGTCGGTCATCCTTGTTACCAGCTGAAACCTAATTAATGGCTTTGGGAGACATGTTTCAAACAATACTTTGTGATCTCTTACATGTTTGTATTTTGACAGGCTTCTCATTCCCAGTCTGAACAAACTCAGAATCTCTGCCGTCAGTTTTTGAATGGTGAAAATGTGCAACCTATGGCTGAGGAGGTATCAAAAGATGCTCAGACCTCGATAGCACCTCACAGCCTTGTACCAAGGGACAGGGAACAATCCGCTCCAGTAATGTATGCAGGCTCAGTACCTTTTCCTGCAAAACCTGGACCAAATGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCAAAGCTAACTGCATCAACATCCTCGGTTCAGATGCTCTCTTACGTCCTATCTTCTCTGGCATCGGAAGGCGTAATTGGAAATCCAAATAAAGAGTTACCTGGTGATTATCCATCTGAGAAGAGGCCGAAACTTGAAAATGACCAGTTGCCTTACACATTGCCTCCCAATCCTCAGCGACCACCAGTCTCTTCCTTCCCACACCCGGAGTCTCTCCAACACAACGCTTCGTCCACCAGTCAACAATACACTCCTTCTGACCCTCCACCTCCTCCATCGTCATCTCCGCCGCCCATGCCTCCTTTACCTCCTGTCGCACAGTTCCCTCTGCCCCAGTTTACTCAGAATGCAGGGTCTGTAAGTAGCATACCTTACAGCTACAGTATGACACAATCACTGCCACCATTAGCCATGCCTGGATATCCA
AATATCGGTGCCCCAGTGACGGGGATGTCTCCTTTTACAATACCAACGAATTCTTACCAGAGTTTTCAGGCTCCAGATGGTAATTTCTATAATCCGTCTTCATCCATGCCGATGGCGCCAATTTCTAGGCAATAGAGCTATGTAATACTAGTGCTACTTGATCTGTTGTGCTAACAAATCTTTCAGGAAGTGGTGCATCCCATTAAGGCCCTTCGAGGAAATCTTGCATCCTTGTATGCTCCTTAAGTTCTGACGACTTAACTACATGGATTTTGCACATAGTAATAGTTTGTAATTTATCATCACTTTCCAATGTATTGCTTGGAGTCCCTGACTATATTTCACCTAGTTGAATGGAGTAGTATGATAATTAATTATGTTGACTTGATAGATTTTTGCACATAGTAAGAGTTTGTAATTTAACTTCATTTACTAATGTAATGCTCAAGGCTAGAACAGAACTTCATGTGGTTGACTGAATTCTGATCGTTTATTTTGTTTTCGGTTTGGATTTGCTTTAGCTAACAATTGGGAGAGTTAAA

mRNA sequence

GGAGGCGCGAGAGAAGACCTGGCGCGCCCAAGTAGCGGCGCCAAATCCCGACCCGATTCACAATCGCTAAGCCAAAATCAGTTCCATTCCACTACCACTTTCCACCAGTTCCCTCACCCGTGCGGCCCAAACTGTAGCGACGTCGTGTCCAGCCAAAAATACTCCTAAACCACACCGCTATTCTGAAAACCCACCCATCTTATCTAGCCAGGCCACCTGATTCCAAAATCCCCGCTTCCCCTTTCTGGGCTTTCATCTTCTTCCCTTTTTTGGCTCATCTTCTTATTCACACACTCTCCAACTTCCCCTTTTCCCCTTCTATTGCCCCTTTGCTCTCTGTTTTCGCAAACCCTTTTACCGTTCCCTGCAATTCCCCTCCTTTCTTCCATGGAGAAAGGCCTCCAACGGCCTAATTTTACCCCCTTTTAGTTGCTTTCTCTCTAACCCACATTCCCTTTTCCAGATTGGCCTGCATATTCTACCTTATTTCCGCAATAAAGTTATCCAAATCAATGGGTGGTACATTCAATCCACAAATTTTGGTAGACAAGCTAGCCAAGCTCAACAACTCACAGGCGAGCATTGAGACTTTATCCCACTGGTGTATATTTCACATGAACAAAGCCAAGCAAGTTGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGTGAGCAGAGGTTGGCCTATCTGTATCTCGCAAATGACATTTTGCAGAACAGTAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCTGATGCACTTCGTGATGTAATTGGGAATGGGGATGAATTTGGAAGAAATGCTGCCTTACGACTGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGACAGAGTCTTAAAGAAGAGATAATGGGAAAGCATTTGGAAACTAGTAATCGAAATGGGAAGCCATTCAGCAGTAAGCTGAAACAATCTGCCAGTGTATCATTGGATAAAATAGTCTCTGGTTACCAAGTTGTTTATGGAAATGAGATTGATGAAGATGCAGTATTGAGCAAATGCAGGAATTCTATTAGCTATCTCGAGAAACTGGACAAAGAAATTGGCGCTGATGTCAGTTCAGGGCAATACCGTGGATCTTCAATGGCAAATGATTTGCGGGGACATCATACCATTTTGAGGGACTGCATCGATCAATTAACAACAATTGAAACATCAAGGGCAAGCCTCGTGTCTCATCTGAGAGAGGCTATTCAAGAACAGGAATTCAAATTGGAGCAAGTCCGAAATCAACTTCAGGTTTGCATTTCTGCTTCGCATTGTCTTGGAGCTTCTCATTCCCAGTCTGAACAAACTCAGAATCTCTGCCGTCAGTTTTTGAATGGTGAAAATGTGCAACCTATGGCTGAGGAGGTATCAAAAGATGCTCAGACCTCGATAGCACCTCACAGCCTTGTACCAAGGGACAGGGAACAATCCGCTCCAGTAATGTATGCAGGCTCAGTACCTTTTCCTGCAAAACCTGGACCAAATGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCAAAGCTAACTGCATCAACATCCTCGGTTCAGATGCTCTCTTACGTCCTATCTTCTCTGGCATCGGAAGGCGTAATTGGAAATCCAAATAAAGAGTTACCTGGTGATTATCCATCTGAGAAGAGGCCGAAACTTGAAAATGACCAGTTGCCTTACACATTGCCTCCCAATCCTCAGCGACCACCAGTCTCTTCCTTCCCACACCCGGAGTCTCTCCAACACAACGCTTCGTCCACCAGTCAACAATACACTCCTTCTGACCCTCCACCTCCTCCATCGTCATCTCCGCCGCCCATGCCTCCTTTACCTCCTGTCGCACAGTTCCCTCTGCCCCAGTTTACTCAGAATGCAGGGTCTGTAAGTAGCATACCTTACAGCTACAGTATGACACAATCACTGCCACCATTAGCCATGCCTGGATATCCAAATATCGGTGCC
CCAGTGACGGGGATGTCTCCTTTTACAATACCAACGAATTCTTACCAGAGTTTTCAGGCTCCAGATGGTAATTTCTATAATCCGTCTTCATCCATGCCGATGGCGCCAATTTCTAGGCAATAGAGCTATGTAATACTAGTGCTACTTGATCTGTTGTGCTAACAAATCTTTCAGGAAGTGGTGCATCCCATTAAGGCCCTTCGAGGAAATCTTGCATCCTTGTATGCTCCTTAAGTTCTGACGACTTAACTACATGGATTTTGCACATAGTAATAGTTTGTAATTTATCATCACTTTCCAATGTATTGCTTGGAGTCCCTGACTATATTTCACCTAGTTGAATGGAGTAGTATGATAATTAATTATGTTGACTTGATAGATTTTTGCACATAGTAAGAGTTTGTAATTTAACTTCATTTACTAATGTAATGCTCAAGGCTAGAACAGAACTTCATGTGGTTGACTGAATTCTGATCGTTTATTTTGTTTTCGGTTTGGATTTGCTTTAGCTAACAATTGGGAGAGTTAAA

Coding sequence (CDS)

ATGGGTGGTACATTCAATCCACAAATTTTGGTAGACAAGCTAGCCAAGCTCAACAACTCACAGGCGAGCATTGAGACTTTATCCCACTGGTGTATATTTCACATGAACAAAGCCAAGCAAGTTGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGTGAGCAGAGGTTGGCCTATCTGTATCTCGCAAATGACATTTTGCAGAACAGTAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCTGATGCACTTCGTGATGTAATTGGGAATGGGGATGAATTTGGAAGAAATGCTGCCTTACGACTGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGACAGAGTCTTAAAGAAGAGATAATGGGAAAGCATTTGGAAACTAGTAATCGAAATGGGAAGCCATTCAGCAGTAAGCTGAAACAATCTGCCAGTGTATCATTGGATAAAATAGTCTCTGGTTACCAAGTTGTTTATGGAAATGAGATTGATGAAGATGCAGTATTGAGCAAATGCAGGAATTCTATTAGCTATCTCGAGAAACTGGACAAAGAAATTGGCGCTGATGTCAGTTCAGGGCAATACCGTGGATCTTCAATGGCAAATGATTTGCGGGGACATCATACCATTTTGAGGGACTGCATCGATCAATTAACAACAATTGAAACATCAAGGGCAAGCCTCGTGTCTCATCTGAGAGAGGCTATTCAAGAACAGGAATTCAAATTGGAGCAAGTCCGAAATCAACTTCAGGTTTGCATTTCTGCTTCGCATTGTCTTGGAGCTTCTCATTCCCAGTCTGAACAAACTCAGAATCTCTGCCGTCAGTTTTTGAATGGTGAAAATGTGCAACCTATGGCTGAGGAGGTATCAAAAGATGCTCAGACCTCGATAGCACCTCACAGCCTTGTACCAAGGGACAGGGAACAATCCGCTCCAGTAATGTATGCAGGCTCAGTACCTTTTCCTGCAAAACCTGGACCAAATGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCAAAGCTAACTGCATCAACATCCTCGGTTCAGATGCTCTCTTACGTCCTATCTTCTCTGGCATCGGAAGGCGTAATTGGAAATCCAAATAAAGAGTTACCTGGTGATTATCCATCTGAGAAGAGGCCGAAACTTGAAAATGACCAGTTGCCTTACACATTGCCTCCCAATCCTCAGCGACCACCAGTCTCTTCCTTCCCACACCCGGAGTCTCTCCAACACAACGCTTCGTCCACCAGTCAACAATACACTCCTTCTGACCCTCCACCTCCTCCATCGTCATCTCCGCCGCCCATGCCTCCTTTACCTCCTGTCGCACAGTTCCCTCTGCCCCAGTTTACTCAGAATGCAGGGTCTGTAAGTAGCATACCTTACAGCTACAGTATGACACAATCACTGCCACCATTAGCCATGCCTGGATATCCAAATATCGGTGCCCCAGTGACGGGGATGTCTCCTTTTACAATACCAACGAATTCTTACCAGAGTTTTCAGGCTCCAGATGGTAATTTCTATAATCCGTCTTCATCCATGCCGATGGCGCCAATTTCTAGGCAATAG

Protein sequence

MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKCRNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHLREAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSKDAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYSMTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ
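The coding and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGGGTGGTACATTCAATCCACAAATTTTG"))
```

Applied to the first 30 bases of the CDS, this gives the first ten residues of the listed protein; running it on the full CDS is a quick way to confirm the two records agree.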
Homology
BLAST of Lsi03G004430 vs. ExPASy Swiss-Prot
Match: Q8VDS4 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Mus musculus OX=10090 GN=Rprd1a PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 2.9e-22
Identity = 84/273 (30.77%), Positives = 131/273 (47.99%), Query Frame = 0

Query: 5   FNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLA 64
           F+   L  KL++L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 65  NDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGSRG-Q 124
           ND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 125 SLKEEIMG------------KHLETSNRNGKPFSSKLKQS-----ASVSLDKIVSGYQVV 184
            LK  + G            K  E  N +     S+  Q+     A   L+   SG   V
Sbjct: 124 QLKHALYGDKKARKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQDLENAASGDAAV 183

Query: 185 YGNEIDEDAVLSKCRNSISYLEKL-DKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQ 244
           +       A L      +S LEK+ DKE G  +S        +  D  G      D   Q
Sbjct: 184 H----QRIASLPVEVQEVSLLEKITDKESGERLSKMVEDACMLLADYNGRLAAEIDDRKQ 243

Query: 245 LTTIETSRASLVSHLREAIQEQEFKLEQVRNQL 259
           LT +    A  +   +EA+ E+E KLE+ + +L
Sbjct: 244 LTRM---LADFLRCQKEALAEKEHKLEEYKRKL 269
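The percentages in each HSP summary line follow directly from the raw counts: matches divided by the aligned length, including gap columns. A small sketch using the figures from the first hit above (the helper name `percent` is illustrative):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the first Swiss-Prot hit (Q8VDS4).
identity = percent(84, 273)     # identical residues
positives = percent(131, 273)   # identical or positively scoring residues
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```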

BLAST of Lsi03G004430 vs. ExPASy Swiss-Prot
Match: Q0P5J9 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus OX=9913 GN=RPRD1A PE=2 SV=2)

HSP 1 Score: 107.8 bits (268), Expect = 3.8e-22
Identity = 83/273 (30.40%), Positives = 132/273 (48.35%), Query Frame = 0

Query: 5   FNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLA 64
           F+   L  KL++L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 65  NDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGSRG-Q 124
           ND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 125 SLKEEIMG------------KHLETSNRNGKPFSSKLKQS-----ASVSLDKIVSGYQVV 184
            LK+ + G            K  E  N +     S+  Q+     A   L+   SG   V
Sbjct: 124 QLKQALYGDKKPRKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQDLENAASGDAAV 183

Query: 185 YGNEIDEDAVLSKCRNSISYLEKL-DKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQ 244
           +       A L      +S L+K+ DKE G  +S        +  D  G      D   Q
Sbjct: 184 H----QRIASLPVEVQEVSLLDKITDKESGERLSKMVEDACMLLADYNGRLAAEIDDRKQ 243

Query: 245 LTTIETSRASLVSHLREAIQEQEFKLEQVRNQL 259
           LT +    A  +   +EA+ E+E KLE+ + +L
Sbjct: 244 LTRM---LADFLRCQKEALAEKEHKLEEYKRKL 269

BLAST of Lsi03G004430 vs. ExPASy Swiss-Prot
Match: Q96P16 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens OX=9606 GN=RPRD1A PE=1 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 3.8e-22
Identity = 83/273 (30.40%), Positives = 132/273 (48.35%), Query Frame = 0

Query: 5   FNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLA 64
           F+   L  KL++L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 65  NDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGSRG-Q 124
           ND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 125 SLKEEIMG------------KHLETSNRNGKPFSSKLKQS-----ASVSLDKIVSGYQVV 184
            LK+ + G            K  E  N +     S+  Q+     A   L+   SG   V
Sbjct: 124 QLKQALYGDKKPRKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQDLENAASGDAAV 183

Query: 185 YGNEIDEDAVLSKCRNSISYLEKL-DKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQ 244
           +       A L      +S L+K+ DKE G  +S        +  D  G      D   Q
Sbjct: 184 H----QRIASLPVEVQEVSLLDKITDKESGERLSKMVEDACMLLADYNGRLAAEIDDRKQ 243

Query: 245 LTTIETSRASLVSHLREAIQEQEFKLEQVRNQL 259
           LT +    A  +   +EA+ E+E KLE+ + +L
Sbjct: 244 LTRM---LADFLRCQKEALAEKEHKLEEYKRKL 269

BLAST of Lsi03G004430 vs. ExPASy Swiss-Prot
Match: Q5R8Y3 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Pongo abelii OX=9601 GN=RPRD1A PE=2 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 3.8e-22
Identity = 83/273 (30.40%), Positives = 132/273 (48.35%), Query Frame = 0

Query: 5   FNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLA 64
           F+   L  KL++L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 65  NDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGSRG-Q 124
           ND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 125 SLKEEIMG------------KHLETSNRNGKPFSSKLKQS-----ASVSLDKIVSGYQVV 184
            LK+ + G            K  E  N +     S+  Q+     A   L+   SG   V
Sbjct: 124 QLKQALYGDKKPRKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQDLENAASGDAAV 183

Query: 185 YGNEIDEDAVLSKCRNSISYLEKL-DKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQ 244
           +       A L      +S L+K+ DKE G  +S        +  D  G      D   Q
Sbjct: 184 H----QRIASLPVEVQEVSLLDKITDKESGERLSKMVEDACMLLADYNGRLAAEIDDRKQ 243

Query: 245 LTTIETSRASLVSHLREAIQEQEFKLEQVRNQL 259
           LT +    A  +   +EA+ E+E KLE+ + +L
Sbjct: 244 LTRM---LADFLRCQKEALAEKEHKLEEYKRKL 269

BLAST of Lsi03G004430 vs. ExPASy Swiss-Prot
Match: Q9NQG5 (Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens OX=9606 GN=RPRD1B PE=1 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 3.8e-22
Identity = 50/116 (43.10%), Positives = 71/116 (61.21%), Query Frame = 0

Query: 4   TFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYL 63
           +F+   L  KL++L+NSQ S++TLS W I H   A  +V  W ++   +   ++L +LYL
Sbjct: 3   SFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYL 62

Query: 64  ANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFG 120
           AND++QNS+RKG EF  EF  VL DA   V    DE  +    RL+ IW+ER V+G
Sbjct: 63  ANDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYG 118

BLAST of Lsi03G004430 vs. ExPASy TrEMBL
Match: A0A1S3B3N2 (UPF0400 protein C337.03 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485437 PE=4 SV=1)

HSP 1 Score: 981.9 bits (2537), Expect = 1.1e-282
Identity = 504/538 (93.68%), Positives = 517/538 (96.10%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+SGQYRGSS+A+DLRGHHTILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNSGQYRGSSIADDLRGHHTILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLVPR+REQSAPVMYA SVPFP+KPGP+EEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVPREREQSAPVMYAASVPFPSKPGPSEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYALPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNTSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDGNFYN SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGNFYNQSSSMPMAPISRQ 528

BLAST of Lsi03G004430 vs. ExPASy TrEMBL
Match: A0A0A0LMU3 (CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G404750 PE=4 SV=1)

HSP 1 Score: 973.0 bits (2514), Expect = 5.1e-280
Identity = 500/538 (92.94%), Positives = 516/538 (95.91%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIG DV+SGQYRGSS+A+DLRGHH+ILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGNDVNSGQYRGSSIADDLRGHHSILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLV R+REQSAPVMYA SVPFP+KPGPNEEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVSREREQSAPVMYAASVPFPSKPGPNEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYPLPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN+SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNSSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDG+FY+ SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGSFYSQSSSMPMAPISRQ 528

BLAST of Lsi03G004430 vs. ExPASy TrEMBL
Match: A0A1S3B3S0 (UPF0400 protein C337.03 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485437 PE=4 SV=1)

HSP 1 Score: 957.6 bits (2474), Expect = 2.2e-275
Identity = 496/538 (92.19%), Positives = 508/538 (94.42%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+S         +DLRGHHTILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNS---------DDLRGHHTILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLVPR+REQSAPVMYA SVPFP+KPGP+EEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVPREREQSAPVMYAASVPFPSKPGPSEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYALPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNTSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDGNFYN SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGNFYNQSSSMPMAPISRQ 519

BLAST of Lsi03G004430 vs. ExPASy TrEMBL
Match: A0A6J1DG44 (UPF0400 protein C337.03 OS=Momordica charantia OX=3673 GN=LOC111020199 PE=4 SV=1)

HSP 1 Score: 926.0 bits (2392), Expect = 7.1e-266
Identity = 474/536 (88.43%), Positives = 497/536 (92.72%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFN  ILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAY
Sbjct: 1   MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+FGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKH+ET NRNGK FS KLKQS S SLDKIV+GYQVVYG EIDED VLSKC
Sbjct: 121 RGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+SGQY GSS++ DL+ HHTILR CI+QLT IE+SRA+LVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKL++VRNQLQ          ASHSQSEQTQNL RQFLNGENVQPMAEE SK
Sbjct: 241 REALQEQEFKLDEVRNQLQ----------ASHSQSEQTQNLSRQFLNGENVQPMAEEASK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTSIAPHSLVPR+REQSAPVMYA S+PFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNP KE   DYPSEKRPKLENDQ PYTLPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYSMT 480
           SLQHNASSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+PYSYS+T
Sbjct: 421 SLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLT 480

Query: 481 QSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           Q L PLAMPGYPN+G PVTGMSPFTIPTNSYQ+FQA DGNFYN SSSMPMAP+SRQ
Sbjct: 481 QPLQPLAMPGYPNVGTPVTGMSPFTIPTNSYQNFQASDGNFYNQSSSMPMAPMSRQ 526

BLAST of Lsi03G004430 vs. ExPASy TrEMBL
Match: A0A6J1FFD5 (UPF0400 protein C337.03-like OS=Cucurbita moschata OX=3662 GN=LOC111445219 PE=4 SV=1)

HSP 1 Score: 921.0 bits (2379), Expect = 2.3e-264
Identity = 481/538 (89.41%), Positives = 498/538 (92.57%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+FGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIEHGDDFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGK LET NRNGK FSSKLKQS S+SLDKIV GYQVVY +E+DEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKSLETGNRNGKHFSSKLKQSGSISLDKIVCGYQVVYRSEVDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+SGQYRG+S A DLRGHH ILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNSGQYRGTSAAEDLRGHHHILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMA-EEVS 300
           REA+QEQEFKLEQVRNQLQV          SHSQSEQTQNLCRQFLNGENV+ M  EE S
Sbjct: 241 REALQEQEFKLEQVRNQLQV----------SHSQSEQTQNLCRQFLNGENVEAMTKEEAS 300

Query: 301 KDAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSS 360
           KDAQTSIAPH+LVPR+R+QSAPVMYA S+PFPAKPGP EEDPRKSAAAAVAAKLTASTSS
Sbjct: 301 KDAQTSIAPHTLVPRERDQSAPVMYAPSLPFPAKPGPLEEDPRKSAAAAVAAKLTASTSS 360

Query: 361 VQMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHP 420
           VQMLSYVLSSLASEGVIGNPNKELPGDYPSEKR KLENDQ PYTLPPNPQRPPV  FPHP
Sbjct: 361 VQMLSYVLSSLASEGVIGNPNKELPGDYPSEKRLKLENDQSPYTLPPNPQRPPVPPFPHP 420

Query: 421 ESLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQN-AGSVSSIPYSYS 480
           ESLQHNASSTSQQYTPSD PPPPSSSPPP+PPLPPV Q PLPQFTQN AGSVSSI YSYS
Sbjct: 421 ESLQHNASSTSQQYTPSDLPPPPSSSPPPVPPLPPVGQLPLPQFTQNAAGSVSSIAYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSL PLA PGYPN+GAPVTGMSP TIPTNSYQSFQ  DGNFYNPSSSMPMAPISRQ
Sbjct: 481 MTQSLQPLARPGYPNLGAPVTGMSPCTIPTNSYQSFQGSDGNFYNPSSSMPMAPISRQ 528

BLAST of Lsi03G004430 vs. NCBI nr
Match: XP_038884747.1 (UPF0400 protein C337.03 [Benincasa hispida] >XP_038884750.1 UPF0400 protein C337.03 [Benincasa hispida])

HSP 1 Score: 986.9 bits (2550), Expect = 7.0e-284
Identity = 504/536 (94.03%), Positives = 517/536 (96.46%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET +RNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGSRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIG DV+SGQYRGSS+A+DLRGHHTILRDCI+QLT+IETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGTDVNSGQYRGSSIADDLRGHHTILRDCIEQLTSIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEASK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTSIAPHSLVPRDREQSAPVMYAGS+PFP KPGP+EEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSLPFPTKPGPSEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYSMT 480
           SLQ N SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFP+PQFTQN GSVSSIPYSYSMT
Sbjct: 421 SLQLNTSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPVPQFTQNVGSVSSIPYSYSMT 480

Query: 481 QSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           QSLPPLAMPGYPN+GAPVTG+SPFTIPTNSYQSFQAPDGNFYN SSSMPMAPISRQ
Sbjct: 481 QSLPPLAMPGYPNVGAPVTGLSPFTIPTNSYQSFQAPDGNFYNQSSSMPMAPISRQ 526

BLAST of Lsi03G004430 vs. NCBI nr
Match: XP_008441251.1 (PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo])

HSP 1 Score: 981.9 bits (2537), Expect = 2.2e-282
Identity = 504/538 (93.68%), Positives = 517/538 (96.10%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+SGQYRGSS+A+DLRGHHTILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNSGQYRGSSIADDLRGHHTILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLVPR+REQSAPVMYA SVPFP+KPGP+EEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVPREREQSAPVMYAASVPFPSKPGPSEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYALPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNTSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDGNFYN SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGNFYNQSSSMPMAPISRQ 528

BLAST of Lsi03G004430 vs. NCBI nr
Match: XP_004138638.1 (UPF0400 protein C337.03 [Cucumis sativus] >XP_031737380.1 UPF0400 protein C337.03 [Cucumis sativus] >KGN63123.1 hypothetical protein Csa_021963 [Cucumis sativus])

HSP 1 Score: 973.0 bits (2514), Expect = 1.0e-279
Identity = 500/538 (92.94%), Positives = 516/538 (95.91%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIG DV+SGQYRGSS+A+DLRGHH+ILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGNDVNSGQYRGSSIADDLRGHHSILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLV R+REQSAPVMYA SVPFP+KPGPNEEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVSREREQSAPVMYAASVPFPSKPGPNEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYPLPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN+SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNSSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDG+FY+ SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGSFYSQSSSMPMAPISRQ 528

BLAST of Lsi03G004430 vs. NCBI nr
Match: XP_008441252.1 (PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo])

HSP 1 Score: 957.6 bits (2474), Expect = 4.5e-275
Identity = 496/538 (92.19%), Positives = 508/538 (94.42%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFNPQILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY
Sbjct: 1   MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKHLET NRNGKPF+SKLKQSASVSLDKIVSGYQVVYG EIDEDAVLSKC
Sbjct: 121 RGQSLKEEIMGKHLETGNRNGKPFNSKLKQSASVSLDKIVSGYQVVYGKEIDEDAVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+S         +DLRGHHTILRDCI+QLTTIETSRASLVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNS---------DDLRGHHTILRDCIEQLTTIETSRASLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKLEQVRNQLQ          ASHSQSEQTQNLCRQFLNGENVQPM EE SK
Sbjct: 241 REALQEQEFKLEQVRNQLQ----------ASHSQSEQTQNLCRQFLNGENVQPMTEEGSK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTS+APHSLVPR+REQSAPVMYA SVPFP+KPGP+EEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSVAPHSLVPREREQSAPVMYAASVPFPSKPGPSEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNPNK+LPGDYPSEKRPKLENDQLPY LPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPNKDLPGDYPSEKRPKLENDQLPYALPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYS 480
           SLQHN SSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSYS
Sbjct: 421 SLQHNTSSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPIPYSYS 480

Query: 481 MTQSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           MTQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQAPDGNFYN SSSMPMAPISRQ
Sbjct: 481 MTQSLPPLAMPGYPNAGAPVTGMSPFTIPTNSYQNFQAPDGNFYNQSSSMPMAPISRQ 519

BLAST of Lsi03G004430 vs. NCBI nr
Match: XP_022152479.1 (UPF0400 protein C337.03 [Momordica charantia] >XP_022152481.1 UPF0400 protein C337.03 [Momordica charantia])

HSP 1 Score: 926.0 bits (2392), Expect = 1.5e-265
Identity = 474/536 (88.43%), Positives = 497/536 (92.72%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MGGTFN  ILVDKLA+LNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAY
Sbjct: 1   MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+FGRNAALRLIGIWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNAALRLIGIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKC 180
           RGQSLKEEIMGKH+ET NRNGK FS KLKQS S SLDKIV+GYQVVYG EIDED VLSKC
Sbjct: 121 RGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKC 180

Query: 181 RNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHL 240
           RNSISYLEKLDKEIGADV+SGQY GSS++ DL+ HHTILR CI+QLT IE+SRA+LVSHL
Sbjct: 181 RNSISYLEKLDKEIGADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHL 240

Query: 241 REAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSK 300
           REA+QEQEFKL++VRNQLQ          ASHSQSEQTQNL RQFLNGENVQPMAEE SK
Sbjct: 241 REALQEQEFKLDEVRNQLQ----------ASHSQSEQTQNLSRQFLNGENVQPMAEEASK 300

Query: 301 DAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360
           DAQTSIAPHSLVPR+REQSAPVMYA S+PFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV
Sbjct: 301 DAQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSV 360

Query: 361 QMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPE 420
           QMLSYVLSSLASEGVIGNP KE   DYPSEKRPKLENDQ PYTLPPNPQRPPVSSFPHPE
Sbjct: 361 QMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQRPPVSSFPHPE 420

Query: 421 SLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYSMT 480
           SLQHNASSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+PYSYS+T
Sbjct: 421 SLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLT 480

Query: 481 QSLPPLAMPGYPNIGAPVTGMSPFTIPTNSYQSFQAPDGNFYNPSSSMPMAPISRQ 537
           Q L PLAMPGYPN+G PVTGMSPFTIPTNSYQ+FQA DGNFYN SSSMPMAP+SRQ
Sbjct: 481 QPLQPLAMPGYPNVGTPVTGMSPFTIPTNSYQNFQASDGNFYNQSSSMPMAPMSRQ 526

BLAST of Lsi03G004430 vs. TAIR 10
Match: AT3G26990.1 (ENTH/VHS family protein )

HSP 1 Score: 483.8 bits (1244), Expect = 1.8e-136
Identity = 295/560 (52.68%), Positives = 363/560 (64.82%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           MG +FN QILV+KLAKLNNSQASIETLSHWCIFHMNKAK VVETW +QFHC+PREQRLAY
Sbjct: 1   MGSSFNAQILVEKLAKLNNSQASIETLSHWCIFHMNKAKHVVETWGRQFHCAPREQRLAY 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNSRRKGSEFVGEFWKVLPDALRD+I NGD+FGR +A RL+ IWEERKVFGS
Sbjct: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDMIENGDDFGRKSARRLVNIWEERKVFGS 120

Query: 121 RGQSLKEEIMGKHLETSNRNGKPFSSKL----KQSASVSLDKIVSGYQVVYGNEIDEDAV 180
           RGQ LKEE++G+  E   RNG     KL    +Q    +L+K+VS  +V++G +IDEDA+
Sbjct: 121 RGQILKEELLGRQPENGTRNGNLVPLKLSVPQRQVNGSTLEKVVSAVEVLHGVQIDEDAL 180

Query: 181 LSKCRNSISYLEKLDKEIGADVSSGQYRGSSMANDLRGHHTILRDCIDQLTTIETSRASL 240
           + K  N+  YLEK  +E+  D+SSG   G ++  +L+G H ILRDCI+QL  +ETSR SL
Sbjct: 181 VGKSTNAAGYLEKATQEVERDLSSGHAPGPAVVKELQGQHVILRDCIEQLGAMETSRTSL 240

Query: 241 VSHLREAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQFL-NGENVQPMA 300
           +SHLREA+QEQE KLEQVRN LQ+          +  QS++T +LCRQ L +G + QP A
Sbjct: 241 ISHLREALQEQELKLEQVRNHLQI----------ARFQSDRTGDLCRQLLDHGGSSQPPA 300

Query: 301 ------EEVSKDAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAV 360
                 +EV K + T+ AP S    D EQSAPVM+A      + P  + EDPRK+AAAAV
Sbjct: 301 TEEEESKEVIKVSSTAAAPQSFTHSDVEQSAPVMFA------SNPTQSLEDPRKTAAAAV 360

Query: 361 AAKLTASTSSVQMLSYVLSSLASEGVIGNPNKEL------PGDYPSEKRPKLENDQLPYT 420
            AKLTASTSS +MLSYVLSSLASEG+IGN N           D+P EKRPKL+N    Y 
Sbjct: 361 VAKLTASTSSAEMLSYVLSSLASEGIIGNNNPPAVTETLSSVDFPPEKRPKLQNHDQSYL 420

Query: 421 LPPNPQRPPVSSFPHPESLQHNASSTSQQYTPSDPPPPPSSSPPPMPPLPPVAQFPLPQF 480
            P                  H  ++T+   TP  P PPP       PP     QF  P  
Sbjct: 421 SP-----------------HHQNTATTSSSTPPQPLPPP-------PPFQLQPQFLQP-- 480

Query: 481 TQNAGSVSSIPYSYSM------TQSLPPLAMPGYPNIGAPVTGMSPFTIPT-NSYQSFQA 537
            Q  G V+  P++Y++      TQ       P  P +    T +S  + P+ NSYQ FQ 
Sbjct: 481 LQPPGPVNHTPFNYTIATSTATTQQQQQEQGPWVPGL----TQLSTTSAPSENSYQKFQG 513

BLAST of Lsi03G004430 vs. TAIR 10
Match: AT5G10060.1 (ENTH/VHS family protein )

HSP 1 Score: 253.4 bits (646), Expect = 3.9e-67
Identity = 184/525 (35.05%), Positives = 276/525 (52.57%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           M   F+ QIL+DKLAKLN+SQ SIETLSHWCIF+ +KA+ +V TW+KQFH +  +Q++  
Sbjct: 1   MSSVFSDQILIDKLAKLNSSQQSIETLSHWCIFNRSKAELIVTTWEKQFHSTEMDQKVPL 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNS+R+G+EFV EFW VLP AL+D++  GD+ G++A  R+I IWEER+VFGS
Sbjct: 61  LYLANDILQNSKRQGNEFVQEFWNVLPKALKDIVSQGDDNGKSAVARVIKIWEERRVFGS 120

Query: 121 RGQSLKEEIMGKHL--------------ETSNRNGKPFSSKLKQSASVSLDKIVSGYQVV 180
           R +SLK+ ++G+ +              ++S R  K   +KL  S  V+ +KI S Y +V
Sbjct: 121 RSKSLKDVMLGEDVPLPLDISKKRPRGSKSSKRESKSSRTKLASSGGVA-EKIASAYHLV 180

Query: 181 YGNEIDEDAVLSKCRNSISYLEKLDKEI--GADVSSGQYRGSSMANDLRGHHTILRDCID 240
                +E+A ++KC++++  + K++K++      +    +  S+A +L     +LR CI+
Sbjct: 181 VAENSNEEAEMNKCKSAVKRIRKMEKDVEEACSTAKDNPKRKSLAKELEEEEYLLRQCIE 240

Query: 241 QLTTIETSRASLVSHLREAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQ 300
           +L +++ SR+SLV+ L++A++EQE +L+ ++ Q+QV          +  Q+E+ QN+ ++
Sbjct: 241 KLKSVQGSRSSLVNQLKDALREQESELDNLKAQIQV----------AKEQTEEAQNMQKR 300

Query: 301 FLNGENVQPMAEEVSKDAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKS 360
            LN E+            QT+ A       D  +S                       K 
Sbjct: 301 -LNDEDY--------TSKQTTAATTITETNDNTKSG-------------------QASKM 360

Query: 361 AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTL 420
             A++AA LTASTSS  ++  VLSS A+E        +  G   SE            T+
Sbjct: 361 TPASIAAMLTASTSSHMIMQSVLSSFAAEAT------KTSGLSKSES-----------TV 420

Query: 421 PPNPQRPPVSSFPHPESLQHNASSTSQQYTPSDPPPPPSSSPPP----------MPPLPP 480
           P +      +SFP   + Q+   +T  QY     PPPP    PP          +P +PP
Sbjct: 421 PVSDTN---ASFPSYNNSQNQTPTTQGQYHVIPNPPPPQFLKPPVMNNPYAFGNIPLMPP 466

Query: 481 VAQFPLPQFTQNAGSVSSIPYSYSMTQSL--PPLAMPGYPNIGAP 498
               P P           IP S S  QS   P    PG    GAP
Sbjct: 481 GLPPPPPPPHLIGNQQPQIPQSNSAQQSQQGPTFQPPGIMYYGAP 466

BLAST of Lsi03G004430 vs. TAIR 10
Match: AT5G65180.1 (ENTH/VHS family protein )

HSP 1 Score: 236.1 bits (601), Expect = 6.5e-62
Identity = 161/467 (34.48%), Positives = 257/467 (55.03%), Query Frame = 0

Query: 1   MGGTFNPQILVDKLAKLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAY 60
           M   F+ +IL+D LAKLN++Q SI+TLS WCI H ++A+ VV TW+KQFH +   Q++  
Sbjct: 1   MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL 60

Query: 61  LYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIGNGDEFGRNAALRLIGIWEERKVFGS 120
           LYLANDILQNS+R+G+EFV EFWKVLP AL+D++  GD++G+    RL+ IWEER+VFGS
Sbjct: 61  LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS 120

Query: 121 RGQSLKEEIMGKHL--------------ETSNRNGKPFSSKLKQSASVSLDKIVSGYQVV 180
           R +SLK+ ++ +                +++ R+ K  S+K K S+    +KIVS + +V
Sbjct: 121 RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSK--STKTKLSSGGVSEKIVSAFNLV 180

Query: 181 YGNEIDEDAVLSKCRNSISYLEKLDKEIGADVSSGQ-YRGSSMANDLRGHHTILRDCIDQ 240
                +E+  ++KC++++  + K++K++    S+ +  R  S+A +L     ILR  +++
Sbjct: 181 RAENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEK 240

Query: 241 LTTIETSRASLVSHLREAIQEQEFKLEQVRNQLQVCISASHCLGASHSQSEQTQNLCRQF 300
           L ++E SR SLV+HLREA++EQE +LE +++Q+QV          +  Q+E+ QN+ ++ 
Sbjct: 241 LKSVEESRTSLVNHLREALREQESELENLQSQIQV----------AQEQTEEAQNMQKR- 300

Query: 301 LNGENVQPMAEEVSKDAQTSIAPHSLVPRDREQSAPVMYAGSVPFPAKPGPNEEDPRKSA 360
           LN E        V+ +  TS     + P                              ++
Sbjct: 301 LNNET------PVNNNNGTSGQSAKITP------------------------------AS 360

Query: 361 AAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELPGDYPSEKRPKLENDQLPYTLP 420
            AA+A  LT+ST+S  ++  VLSS A+E               S       +D   + +P
Sbjct: 361 IAAMAEMLTSSTNSSMIMHSVLSSFAAEAT-----------QTSGLTKSNTSDTNAFVVP 405

Query: 421 PNPQRPPVSSFPHPESLQ---HNASSTSQQYTPSDPPPPPSSSPPPM 450
           PNPQ+  +   P+P + Q   +   +       + PPPPP + PP M
Sbjct: 421 PNPQQYHI--IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHM 405

BLAST of Lsi03G004430 vs. TAIR 10
Match: AT5G65180.2 (ENTH/VHS family protein )

HSP 1 Score: 80.9 bits (198), Expect = 3.5e-15
Identity = 87/319 (27.27%), Positives = 154/319 (48.28%), Query Frame = 0

Query: 135 ETSNRNGKPFSSKLKQSASVSLDKIVSGYQVVYGNEIDEDAVLSKCRNSISYLEKLDKEI 194
           +++ R+ K  S+K K S+    +KIVS + +V     +E+  ++KC++++  + K++K++
Sbjct: 21  KSAKRDSK--STKTKLSSGGVSEKIVSAFNLVRAENSNEETEMNKCKSAVRRIRKMEKDV 80

Query: 195 GADVSSGQ-YRGSSMANDLRGHHTILRDCIDQLTTIETSRASLVSHLREAIQEQEFKLEQ 254
               S+ +  R  S+A +L     ILR  +++L ++E SR SLV+HLREA++EQE +LE 
Sbjct: 81  EDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTSLVNHLREALREQESELEN 140

Query: 255 VRNQLQVCISASHCLGASHSQSEQTQNLCRQFLNGENVQPMAEEVSKDAQTSIAPHSLVP 314
           +++Q+QV          +  Q+E+ QN+ ++ LN E        V+ +  TS     + P
Sbjct: 141 LQSQIQV----------AQEQTEEAQNMQKR-LNNET------PVNNNNGTSGQSAKITP 200

Query: 315 RDREQSAPVMYAGSVPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASE 374
                                         ++ AA+A  LT+ST+S  ++  VLSS A+E
Sbjct: 201 ------------------------------ASIAAMAEMLTSSTNSSMIMHSVLSSFAAE 260

Query: 375 GVIGNPNKELPGDYPSEKRPKLENDQLPYTLPPNPQRPPVSSFPHPESLQ---HNASSTS 434
                          S       +D   + +PPNPQ+  +   P+P + Q   +   +  
Sbjct: 261 AT-----------QTSGLTKSNTSDTNAFVVPPNPQQYHI--IPNPAASQQFPYGFGNIP 277

Query: 435 QQYTPSDPPPPPSSSPPPM 450
                + PPPPP + PP M
Sbjct: 321 LMPPGALPPPPPGTLPPHM 277

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8VDS42.9e-2230.77Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Mus musculus OX=1... [more]
Q0P5J93.8e-2230.40Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus OX=991... [more]
Q96P163.8e-2230.40Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens OX=9... [more]
Q5R8Y33.8e-2230.40Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Pongo abelii OX=9... [more]
Q9NQG53.8e-2243.10Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens OX=9... [more]
Match NameE-valueIdentityDescription
A0A1S3B3N21.1e-28293.68UPF0400 protein C337.03 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485437 PE=4 ... [more]
A0A0A0LMU35.1e-28092.94CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G404750 PE=4 SV... [more]
A0A1S3B3S02.2e-27592.19UPF0400 protein C337.03 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485437 PE=4 ... [more]
A0A6J1DG447.1e-26688.43UPF0400 protein C337.03 OS=Momordica charantia OX=3673 GN=LOC111020199 PE=4 SV=1[more]
A0A6J1FFD52.3e-26489.41UPF0400 protein C337.03-like OS=Cucurbita moschata OX=3662 GN=LOC111445219 PE=4 ... [more]
Match Name      E-value   Identity (%)  Description
XP_038884747.1  7.0e-284  94.03         UPF0400 protein C337.03 [Benincasa hispida] >XP_038884750.1 UPF0400 protein C337...
XP_008441251.1  2.2e-282  93.68         PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo]
XP_004138638.1  1.0e-279  92.94         UPF0400 protein C337.03 [Cucumis sativus] >XP_031737380.1 UPF0400 protein C337.0...
XP_008441252.1  4.5e-275  92.19         PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo]
XP_022152479.1  1.5e-265  88.43         UPF0400 protein C337.03 [Momordica charantia] >XP_022152481.1 UPF0400 protein C3...
Match Name   E-value   Identity (%)  Description
AT3G26990.1  1.8e-136  52.68         ENTH/VHS family protein
AT5G10060.1  3.9e-67   35.05         ENTH/VHS family protein
AT5G65180.1  6.5e-62   34.48         ENTH/VHS family protein
AT5G65180.2  3.5e-15   27.27         ENTH/VHS family protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term   IPR Description   Source       Source Term     Source Description                                 Alignment
None       No IPR available  COILS        Coil            Coil                                               coord: 237..257
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 375..462
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 517..536
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 434..458
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 416..433
None       No IPR available  PANTHER      PTHR12460       CYCLIN-DEPENDENT KINASE INHIBITOR-RELATED PROTEIN  coord: 1..536
None       No IPR available  PANTHER      PTHR12460:SF23  OS01G0925000 PROTEIN                               coord: 1..536
None       No IPR available  CDD          cd16981         CID_RPRD_like                                      coord: 7..130; e-value: 4.16084E-64; score: 203.196
IPR006569  CID domain        SMART        SM00582         558neu5                                            coord: 9..130; e-value: 1.4E-47; score: 174.1
IPR006569  CID domain        PFAM         PF04818         CID                                                coord: 9..120; e-value: 5.6E-33; score: 113.8
IPR006569  CID domain        PROSITE      PS51391         CID                                                coord: 2..134; score: 44.194637
IPR008942  ENTH/VHS          GENE3D       1.25.40.90                                                         coord: 2..131; e-value: 5.2E-37; score: 128.7
IPR008942  ENTH/VHS          SUPERFAMILY  48464           ENTH/VHS domain                                    coord: 10..117

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi03G004430.1  Lsi03G004430.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0031124      mRNA 3'-end processing
cellular_component  GO:0016591      RNA polymerase II, holoenzyme
molecular_function  GO:0000993      RNA polymerase II complex binding