HG10022266 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022266
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: regulator of nonsense transcripts UPF2-like
Location: Chr05: 22460110 .. 22479547 (+)
RNA-Seq Expression: HG10022266
Synteny: HG10022266
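
The Location field gives the chromosome, the start and end coordinates, and the strand. A minimal Python sketch of how the genomic span could be derived from it is shown here. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a string like 'Chr05: 22460110 .. 22479547 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr05: 22460110 .. 22479547 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span length should match the length of the gene sequence (with introns) listed under Sequences.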
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGACTAGCGAGTCCAGAACCGGGTCAATACCAGTTGGGCCTGGATCCAAGCCCAAGCCCAACAAAGCAGTATAAAGATGAGCCCAACCCGTCTTCGCGCGCGTGTTCGTCAACCCCTCGTTTCAGTTCTCAGTTCACAATCCTCCCTTCCCACCGTTTCTCCGTCTCCGTCAGTGCTCCACTGCCGACGATCACTTTCTCTTCACTTTCCGGCGGAACTGTAAGTGCATAAACTGCTGCTGCTAAACCCTAATTGTCTACTCTTCTATCGCTTTCTCTCTCATGTGAGCACTATCATTCGTATCGCCGGCGGACTAATTTGATAATTTTATCGGTGTGAGATCCTCGTGCTAACGGTACGCGGTTCTTGTTCTCTTTCTTTTCGCCAGAGGCTAGGTTTTGATCATGAATTAGCTTCCTGTTTGGTGTTAGGCCATGTTGATTTTCGATTACGGAATTGGATGATAGAAATGTCGTTGAATTTTATTTGGTTTTTTTTTTTTTTTTTCGTGGTATACTTTTGAATGCATCGTTTTATGACAATATTACTTTGTTAACTTAGAGATACGTCCTCCGTTTGACTTAATTCAATGTATGAAGCCTTGGTCTTTTGAAGTTTGAAGCCCCCTCCCCTTCTACTCTTCCAAATCCTTTGCAATCTCAATCCTCACTCACAGCTCTCCTCTTCGATTCTCCTTCGCTGTCATTTTTTTTTTCTCTGACTGTAGAATCGTCAATCCCCGAGAACCTCTACTAACTGCCCCGACTCCTCTCTGGACCCTCCCTCCACTTTAATCCCACTCACTCTGGCTGTTTGCTTCAAGGTTTATATGTTTAAATATATATTATTGTTATTATTATTATTTAATATTTGGCTGAATCTTTCCCTCTCTTTCACAGTTTCAAAACGATGTTTAAATGTACAAAATCATCATGTTTATCAACCCATGTCCACTTTCTTCTAGCTTCAGCATGTTATAAAAACAAACTTCATGTAAAGGAATCAATTTAGGTTTGATTGAGAAAAAAGTAGAAGGCCTTGTCAATTGAGGCCTTGTCAAAATGTGTGCTTGGGGCTCCTTATTCTCTTGGGCTCCCCATGTGACTCAAACTGCGGCGTATTGGGTTCTGCTCCCTTCCGGGGGGGGGGGGGGGGGGGGGGGGGCTGATAAAATTGGAAGTCCACTGCAAATGTCTCTTTCTCTGTGCTTCCTGGTGTAATTTTAAGTATATAATCTGGCTGCTAAGCCATTATTGTCCACCCTCCTATCCTTGAGTAAACACCATACTTTCTATCACCCAAATACCAATTTTAATAATTTTTTCCTAGTGAGATCTTGGTTCCAGTGGTACGTGGTTCTTATCTCTTTCCTGTTGATAGAGTTTAGGTTTTGATCTTGAATTAACTTCCTATTATGCTTTATGGCATGTTGATTTTTGATTACTAAGTTGGACAGCAAAAGTATTGTAAAACATTGAGCATTTTGGTTTTTTCATGGAAGACAATCACTAATGGTTGGCCTGGTGGTCATTGGGCCTTTGGGAAAATTTTTTTAAGGGCCTAGAAAGAATCGGTTCAAATTATGGTGGCGATCTTTCTAGGATTTAATGTCTCATGTATTTCATATGGAACAAAATGTAGTAAGGTTAGTTGGTTGCCTTGACCCACGTGATAGTCACGGAATACATTGAATTTGTCCTCCTTATGAGAATGTCTTTATTGTCCATTAAGCTTAAGAGAGAATGTCTTTAGTCAATTTTATTCAATGGATGGAAGTGTGTTAGGATACCTCCTAACAAGAACACACGGAAAACTCAAGAATACTGAATATGGAAATATGCTATAATATCATAAGACTTTCGAGAGGGTTAGTCTCTCCCAAATTCCCCAAAAGAAATCCCACACAAAATTCTTCACACCTTGCTCCCAACCCTCGCTCCCTCTATTTATAACTAAAAGCCCTAACAAACTTAATGTCTAATTACTCGAATATCCCT
TACTACCAATCCTACGAATAATCCTAATATTTCCCCAACTAGGAGTCTTACAAGGTGTTGGGTGGGCTGGGTGGTTGAGAGAAGAAAACTACCCGAGTTAGTGGATACGATCATTGTGTGATTAACCTTTATTGTCTGTCATTTTACCTTCTAATTCATGCTCTCCATTTTTCTTGTCCAGCTCTTTCTAAAGTGTTCTTTCAATTGGCCATGTTTCAATATTTGTTAGTAAGTCCTATAAAGAATTATTTGTCACTAAGAACCGAGCAGTTGGCAGCTTTATGTATTTAGTGACGTGTAAACATGGTAGTTATCTTTATATCAATTAATTTATCGCAAGTACATACTACACAGGACATGGACCATCATGAAGATGATGGCCGCCCAGGGGGTGAAAGTCAACCCAAGAGAGATGATGAGGTGAGATTGATGTACAAGTCTAGTCTAACTGTTCTCCATTTAGTTCTATTAGCTTTTTATTTATTTCATTTGTTTACGACTGCTTCATAAATCATTGGATACCGTGATATCTATGGTTTGAAACTTCATCCTCCCAACCTTGCTACCCATACACCATCATTAAGTGTATGTCCCATTCTTGCACGTATATGCTTTTTTGATACATCTAATTATGTTTTCTATCATTTTCTTTTGGTGCGTCTCATGCTGGATGTTTCTCATGTAGGTCACTCTACACTGCTGATTATTTTTCTCATGTTGTAGGAAACTGTTGCTCGTCAGGAAGAAATTAAAAAGTCATTTGAGGCTAAAATGGCTCTTCGACAAAGTAATCTGAATCCTGAGAGGCCTGGTTAGTGAATTATCTTTAGCATTTTGATGATCATGTAATTAAATGTGTGATGGTAATTCTAAATTTTTCATGGCGAGTAAAATGTTTATTATAACCTATATGATGAATTATGAAAAACAGAAAATGTTCTCATGCCACTTTATAGATTGGGTTTTTAATTTGAATCTTTATGCCTGATAAAAGCTTATGTTTTTAAATTTTGTCATCAATGTTTGAAGATGTGGAAGTAGGTCTTTGTGAACTCTTGGTTATGCTCTATCGGGCAAATAAAATGTGTTATAGCCTGCTTAAAGACATTGAAAAAGTCAGGTTGAGATCGGGGATATCTTCTCTTGATTGAAGGGGACATGATAATCTCATATAAAATCCTTTGAAGCCTACTAGGCATTATTGCAGGTCTAAGTTGATAAGAAATAGCAGTCCTGTCCATAATTTTTGAAAGCTTTGAACAAATTTTCAGCCTCAATCATCCTGTCTAGCTAGGAAAGTAGTTAAATGAAGAAAAAATAATTATTCAATTGCTAAGCCCTTTTGAAAATTCAGATTATCAATCATCAATTTTCCATCAAAATAGTTATAAAGAAAAAAGAAAAAGAAGATCCTTCATGTGTTTTTGCTTTGAAAAAAAGATACATTTCATCTCATCACAAGTCTTGAACATTCATGGGTGATGCTCATTATTCTATATTCACCTAAACACAGCCTATAGAAGATTCAACAAACGGACATAAAGAACTCTTAACAAATCTCGACTAGAGGAGCTTCTTTTGAAAAAAATCTCACTAGCACATTTTCAATTTATAACCTTTCATATGTTCTTTAGATAGTCTGGTATGATGTCTTTGTACTGCAAATGATACGCATACTCCCAAACATCTTCCCTAGTAGACCATTTAGTCACAAGAAGATCCTGATTGGCTGTGGTTTCAACTGAGAGCTTTTCAGTTCTTTATCCAGAGCAAGTCACACCTATCTAGAGTCTTTTCACTTAACTTTCTAAATCAAGGTTTCAAGAGAATCAAACTGAGTGTTGACCCATCCTAGAGCTTCTTTTGGAGACTTACTACCCAGTTCATACATGAATGAGGGCAAGGTTGTTCCAAAAAATTGAATGATTTGTGATCTCTGCCATTGAATTTGATGGCATTTTGGAACTTGACTACAGTGGAAGTGTGGCCTTTGTCAATGA
CCTCGTGAAGCTGCTGAAGTGTTTTGTTGTAGTTGGTGATGTAGACTTGGTGTTTCTGATGATGGAATTGCATGATTGCAGTGTTGACGACCGGCTCAAGAGTGCCATAGTCATTATGGAGGTTAGCAAGGAGAAGGTTTGCAAACCACGAAACTGTCCATGCCCTAATACTTAAGAAATTGTTCTAATAAATCCCCTGGTCCGGATGGTTTTACTGCTGAATTTTTCAAGTTCTTTTGGCCAACTATAAAGGAGGATTTCATGAGTATGATTACCAATTTTCACCAATCAGGCATTATTAACGCTTCACTGAATGAAACCTACATTTGCCTCATCCCTAAGAGAATTGACTCCTAGACTGTGTCTGACTACCGTCCTATTAGCCTTATTTCTTGTGCATACAAGATCATTGCGAGGGTTCTATCTAATCGTCTGAAATCGGTGCTACCTTCCACCATTGCACCAAACCAGTTGGCATTTGTGGAAAACAGACAAATCTTAGATGCTTCTCTGATGGCTAATGAGTTGATTGATGATTGGTCTCGCACAGGGAAGCAGGGTATGGTTTTTAAACTGGACCTTGAGAAGGCTTTTGACACTGTTGATTCTGTTCTGCAATCCAATGGTTTTGGAAACTTATGGCGTTCCTGGATTCGTGGCTGTGTCTTTAGTGCAAATTTCTCCATCATCATTAATGGTAAACCTCGAGGAAAGATCTTCCCATTGAGAGGTATTCGTCAAGGGGATCCCTTATCTCCTTTTCTATTCATTTTGGTTGCTGACTGCTTGAGTCGTCTGTTGGAGCATGCTCTTACGAGGGGGCTTATTGAGACGCACCCGGTTGGATCTTCCACTTTTGTCCTTAGCCACCTGCAGTTTGCGGACGACAGTCTTCTGTTTTCTACGGTGGATAGAACGGCTCTGCATCGCTTGTTTGAGATTATTCACATTTTTGAACAAGCTTCTGGCTTATCCATTAACCTTTCTAAAAGTGAGCTGATTGGGATTAATACTCCAGACACTGAATTTGCATGGATGGTTTCTGCTTTTGGGTGTAAACAAGGTCATTGGCCTACAAACTATTAGCTTCTGGAACCCTGTCATTGAGCGTATACAACATAAACTCCACAACTGGAAATACGCATATATCTCTAAGGGGGGCATACACACTCTTGTTCAAGCTACCTTATCTAGTCTTCCTACCTATTACTTATCATTATATTATGCCCCTACTAAAGTCATCAATCGTCTTGATAAACTGATTCAGGATTTTTTCTGGGAAGGCTCAAACGGCGAGGATGGGATGCACAGTGTGAACTGGAAGCTGACCCAACGTCCAAAATTACAGGGAGGGCTTGGAATTGGTAACTTCAGGAATCGTAATTCTGCTTTATTGGCAAAATGGATTTGGCGTTTTACCCAGGAAAAAGACGCTTTATGGAGACGTTTGATTGCTGCTAAGTACTACTCTCCTTCAGTTACATGGCCATCTCCTATTCGCATTTCGACAAAAGCTCCATGGAGATTTATTTGTCAAACCATTGACCTGGTGTCTAATCGGGCTCATCTTAGACTTGGCACAGGTACAACCATCTCTTTCTGGAAAGATCAATGGCTTAGCTGTGGTACTTTCTTTACTGCTTTCCCCGTTTATTTCGTCTCGCCCGTGTGCCCAATATTACTGTGGCTGAGGTTTGGAGGGTGGATACAGAAGCTTGGGACCTTCGCCTTCGCCGTAATCTAAATGACTTGGAAATTCTTGAATGGGTCTCTCTATCTCAATTATTGTCAACTGTTCGCCTTAGACACTCCCCAGACACGTGGATTTGGGCTCTCTCCCCCACTTTTTCTGTCAAATCACTGATGGACGACCTTGCTGGTAATGCTGATACTCATGTCAAAGACTTATACTACGCTATCTGGCACGAACACTTCCCTAAGAAAATCAAGATTTTCCTCTGGGAGCTCAGTTTGGGTGCCATTAATACAGCTGATCGTCT
CCAACGTCGTATGCCATACATGTCTATCTCTCCAACGTGGTGTCTTTTGTGTCAACGCCATTCTGAATCTGCTGCCCACCTGTTTCTTCACTGTTCTTTTGCTGCTCGTTTTTGGCATATCATCTTGGATGCTTTTGGTTGGACAATAGTCCACTCCAACAATATGTTTGAAATTTTGGCTTCTCTTATGGTGGGTCATCCGTTTGCTGGAGACAAAAGATTACTATGGCTGGCTATTTTGTGCGCTTTTTTCTGGACTTTATGGGGTGAACGCAACAAGCGTCTTTTTAGAGACATCTCATCAAGTTTTGATTTTTTTATGGATTCGGTGTTATCTACTGCTATGTTTTGGTGCAAAAACAGGCACCCTTTTAAGAACTTTAGTCTTTCATTTTTAGTTTCCAATTGGAAATCGTTTATGTAACACCACCTTTTGGTGATTTGGGGTTTTCCCCTTATTTCATTTTATCAATGAAATATTTCTTCTCTAAAAAAATACTTAAGAAATTGTTGGTGTCAGGTTTTTCTTGCAAAGAATATGAAGCGCCTCGATCACTTTGGGCTCATCCTTGTTGATGGTTTTGTGTGGGTAATACTTAGGAAGATGTGGATGAAATGAATAAAAAGGGGTGGAGCTAAAATTTCTATCTTCCTTGGTGGAAGATTTGAGGGGCTGAATGATTGGAGATAGGCATAGGCTTGAGGTGAGGGGACCCACTTTCTCCATTTCTTTCTGTCATGGTCGATTGAATGTGAAATTCTTGGGGGTGTTTCATCAAAGATCTACCTTCTATTCTTTTTTGATAAAGAAACAATTTCATTGATGAATGAAATACAAGAGTAAACTCCAAACACCTTAGGTAAATTACGAAAAGATTCTCTATTAGAGATTAAATAGGAAGACTCTTCTATTACAAAAAGACTCTTCAAATTTTAATTTCTACCTTATATTTTAGTTTGAATGATTTAGGGTATTCTCATTAGGGGTGACTCTTTGGAAGCTTTGGTTGGAGGGGACATGTGATTGTGGAGGAAAAGCTATGGACTTTATGGAACTTTAGGACTTATCATAACTTTGTCTGCCTGCCATTTGGTGGTTGTGATGAAAACTCTTTTGTAATGATAAACCTTGCTGGTCAATACGGAGACCAACTCAAAGGAGTTCTTTGATTTTTGTAGGTTTGGACTTTGGAGGATGCCATTTCTTCCCTCTTTGTACTTCCAATTTGGTTTTTCATCTCTTTGTACTTCCATTTTTGACAACAGAATTAATTTTATAGTAAGCATAAGTTGATCTTTCTACTTCGGAAGAGACTTTCTTTTGTGGAGCTTAGCTACTGTCTACTACCTTTTGAACAATCTATTGGAGGCAGTGTACAGTTAGTTCAATATTTTGTCAGGTGGAGAATTGTTTTCCTTGCGGTAGTTAAATTTGATGCAATGATTTGAGTGCATTTGTAAATGTTTCCTGACAATCTGTATTGTCCTTTTAATTAATGACAGCTCCGAGGAAAGTGAACATAGAACATTTTTTTAATTTCACCATAGTACATATTTAAGTTACTTTTATTTGACACATATTTGATACATTGTTCCCCATGTACATCCAATTTGTTAACATTTGTTTTTGCCTGCAGACTCTGGATTCCTTAGAACTTTGGATTCTAGTATTAGACGCAACACAACAGTTATTAAAAAATTAAAGCAGATAAATGAGGAACAACGAGAAGGGCTGATGGATGAGTTACGAAATGTAAATATGAGTAAATTTGTTAGTGAGGCTGTTTCTGCTATCTGTGATGCGAAGCTTAGAACTTCTGATATTCAGGCAGCTGTTCAGGTAACCTCCTTTCAGTTCTTTTCTTTTGTTTTTTAGTTCAATTTTGTTGTTATAGTTGGGTGGTGGGGCAATAGAAACTTATTCACTTCAAAAGATTTGAACAAAAGTGTTCTTACAAATATCACATCATCTTTTCATAATTTCCCTTCCTCCAATTGGA
TTAATATGTTTTCTTTATTTTACACTGGCACTTTACCTTTTAGCCATGTAATTGGCTTAAATTTCAATCTAGGGTCTCATTTATTTCATGAGAAATGAGGATGGTAGCCAGGATTCTAGCTGTGCTATGGAGTATTAGTATGGGGACGAACTAATGGGTGCTGAACGTAGTGATTCAAATCTTGGGAGGGAGTATGGTCCTGATGCTTCAACACCCCCCCTTTTTTTTCTTATTGTAAACTGGACACTTTTTAATTCTTTTTTTTGGTATTCTTTTTATGTTTGGGTAACCCTTTGGGTTGGTACTTTGTTCACTGTCTGGCCTCTCCTTGCTCTTTCATTCATTATCCATGAAAGCTCCTGTTCTTACAAAAAAAAAAAAAAAAAAAAAGAGAAGAAAAAAGAAAAAAAGAAAAGAAAAGAAAAAGATTCACGGGACCAAATCAATCTGCCTAGAACATAGATTTATGAGATTATTGGCATAGTTTGATAACCACACTTTCACATGTTGAACAACAAAAATAATAGTAAAAGTTCAGGCTCTAATCTATATTTCTGATGTGTTCCCTTGTCCCCTCCCCACCAAAAAAAAAAAAAAGATCTGTTCGCTCCTTCACCAAAGGTACAAAGACTTCTCACCATGCCTGATTCAAGGACTCTTGAAAGTGTTCTTTCCTGGAAAGTCCGGGGATGAGTTGGATGCTGATAGAAACTTGAAGGCCATGAAGAAGCGCAGCACTCTCAAACTTCTTATGGAACTTTTTTTTGTTGGAGTTGTAGAAGACACTGCAATTTTCAATAATATTATTAAGGATCTTACAAGTATTGAACATTTGAGGGATCGGGATACTACCCTAACTAATTTGACTCTCCTTGCTAGCTTTGCTCGGCAAGGGAGAATCCTTTTGGGTCTACTGCCTACTGGACAAGATCACGAGGAGGTAAACAACTTTTTCTCAATTTCTACACACCCATGGTACTGTTCACATGACTTCTGTATATGTTTCTTGTGGTTTTATGAAATTTGTCTCGTTCCTGGTCATGCATTACTCTAAACTCACTGGGTAACATTCTGTTACAGTTTTTTAAGAGCCTCAGTATTACTGCCGACCAAAAGAAATTCTTCAGAAAGGCTTTTCATACATATTATGACGCTGCTGCAGAATTGCTTCAGTCTGAACATACTGTAAGAGATTCACTGTCTTCTCTAGAGCTAGTTATTTGATTTATCATTCACCTAAGAAGCACGGACACTTTAGTTTGGCTAGTGTGTCCGTGTCCGACACTTCGACACTTGTTGGACACGCATCGGACACTTGTTAGTACAACAAGTGTGTTAGACATGCATTGAACACTTGTTGAGTAGACTAAAAAGACACATATATGACAATAATAATAACTTTTGAGTGTGAAATACATCAAGGTAAGTCTTTTAAGCATATAAATGCATCATATTTTGGTATAAAAATGATATATATTTTTAAAAATGTATATTTTAATAAACATGTCCTTGCCGTGTCGTGTCTTAGATATTTAAAATATGGTATGTCACCGTGTCCGTGTCGTATCGGTGTCTCGTATCCGTGTCCGTGCTTCTTAGTCATTCACTTTCATCTTCCCTCTTGTTTTTCTCCTTCAATTCAAAGTTAAGGTTCATGTTTTGGTTGTTTATTATTAATTCATTAGTATATGTTTATTTTTATAATTTTTTTGGGCCATTATGTTAACAAAATGATGATGCTTTCTATTTCATTATCATATTTTTATTGGTAACTAACTTCCATCATGATGAATTAAAGAATAATATATTTTACACTTGTAATGATATGTTTGCCTGTCACATTCTTTTAAATGGCAGTCACTTCGCCAAATGGAGCAGGAAAATGCAAAGATCTTGAATGCAAAAGGAGAGCTTAATGATGAAAATGTCTCATCATATGAGAAGTTGCGAAAATCTTATGACCATTTATATCGAAATGTCTCATCGTAAGTTGCATTATA
ACACTTGTGGTATTAGGAATTTAATAATCAATTGATGTCTTGATGGATGTATATATGTTTATTCCTCAACATTGTCCATGTCAAAATCTGACATTTGCGGCATCATTCGCTAATGAATTCTTTATTGCACCCTGCATTGAATTGAAACTTCTTATTTTTTTTGTTTCGTCTAATTGATAATTGTTGATTCTTAATGTTGCACTCCTTGTTTAATGAATACTGATTTATTCTTATAATATATTTCTTGGTGTCCTTGTATTCTGCTGAAGCAGTTTCTTATTTAATATAATCTGGAAGAATTTCTAATGCAAATATGAGAAGTAATTCTTTTTAGCCGAGTGTCCACACAATATAGAGATCAGATGTAGTTGAGATGCATTAGAGAAATTTAAGTTCTCTTCTAATAAGAAGGCATAGATGGAGACAAAAATTAATTAAAAATAAAGTATGGGTAGTCAGGTTTATCATGTAAAGTAAATTATGAAGATGTGATAGAAATATTATTTTTTGAGCCTACCACAAATTAAAATAGATTCCATCAACTATTTTTCATTTGCAAATTAATGGAATTCTACATTACACAGCTTAGCAGAAGCACTTGATATGCAACCCCCAGTGATGCCAGAGGATGGCCACACGACCAGGGTTTCTGCAGGAGAAGATGTTTCATCACCTGCTGCTGGAAAAGATTCTTCTGTAATTGAAGCCATATGGGATGATGAAGACACCCGGGCTTTCTATGAATGCTTACCAGATCTCAGGTTTGGCTTGGTTAGATTATATAGTTTGCTGAGATCCTTTGGGGGAAATTTGTCTTGCATCCATCTTAAAGTAGACTTCTATAGATTACCTAGACTTAAAGATACTTTTGGACACTTACGTGTAAGTGTGTTCTGTATATATGCACATTCTTGGATTTCAATAATAACAATAAAAATAACTTTTAGGACATGCTCATGCATGTTGTGATTGCCAGTCTCCTTCATGTTTACTTAAGTATAGGGTAGTCCATTGGGTCCATACAAGAACTATATTGGTTGGTATATATTTGTTTTATAAGAAGATTAATTGGTATATTGCATGAACTTCTCACTTCCCAGGGCGTTTGTTCCAGCGGTACTGTTGGGAGAAGCAGAGCCTAAAGCAAATGAGCAATCTGCAAAGCCGGCAGAAAATCTGGCTGTAATTTCTCTCGCCTGCAATTTTTTTTTTTTTAAACTTCAAAATTTCTTCCTCTGTTTTACAATAGTCTCTCCTCTTTCACTTCTCCCATGCCTGCTCCTATTTAAACTTTTCACAGTTTTTCCTTCTCTCTCTGTGTGTTGGATTGGCACACATGATATTTTTGGCCATAATTTAAAATTAATCAAGCATAAGTTAGGTTTTAAATGATCTTTACTTTAGAAGTACACTGAATTTATTTTCTTAGTTTTCAGATGAAACTTCAATTAGTATTCGAATAAACTAAGTGATAATGTCAAGAATTTTGTAGTCTAGCAATGTATTTAGAACTTTTAACACTATAATTTATATTGAAACGAATAATGTAAAAGTGTTCTTAGTTTTGCCCATTTTAATTATAACACAATCTAATATATGATATTAACTTAATTTCAGAAACTAAAGTACTTTTTATAGTATTAGCCTTTCAGCACATATATTTATATTTATCCTAGTATATATTTTTTTTTCTCATAATACCTTTTTTCAGTAGTTTTGTACGGGTAAAAGTAAAATAATTTAGAAAATAATACTTGGTTATTCTTTTTAATGTTCCAATGATTTATAGTAATTGATACCTAAATTTTTTGTTTATTACCATATTAAAAATATTGTTTTCTTAAAAAAAAAAAATTGAACTGTTAATGGGAAGTTTAAAAATGTAAAATTCTAATTATTACTTTACTATATTTTCTATTTTATTTGAGTCGATTATGATATTAAATAACATGTTATGGATATTTTTTTTTTTAAATTTTAATAACGACGAGAATTGGATA
TGTATGTATTTTATGAATGGGGGAGTGGACAAAATGGTCCAATAACGATTTCTCAAGCATTCTTCCAAATTGGGAGCTTAGTCTAAGAAATCTTCTTAGGAAAATTTGGAAGCATCGTTATCCTAAGAAAATCAAAATGTTCCTTTGGAAGCTTAGTCTTGGAGTTGTAAATATTGCTTATAGATTACAACGATGAATGCCTTGCATTGCATCTCTCGCTTTCTTGGTGGCTCATGTGTCAACAACGTGCTGAGACTCAAGATTATTTATTATGCATTGGCTTTTTGCATCTCATTTATGGCGCATTATTTTGGAGGCTTTTGGTTGGTCACTGCCGTGTCCCAATATTACTTTTGATCTTCTAGATCTTCTTGCATCCTTATTGGTGGGACATTCTTTTGGTGGTAGTAAAAAGACGATATGTTGGCCCTTTTGCGAGCGTTCTTTTGGACTTAGTGGGGCGAGAGAAATGGACGTCTTTTTCACTATTAATTGTTTTATGGATTTGATTCTGTCTACTACTTTATTTTGGTGCAAAACTAAGCACCCTCTTAGGGCATGTCTGATAGTGATTCTGTAATTGTTAAAATCATTTTTGTCATTTTCAAAAACACTTTGAAACATTCTTTTAATAACTGAAAACAAATTTTGATGATTTGAAAATTGCATTTTGAAGTGTAAAATTAAAATTAAATTAACTTTAAGTGATTAAAAGTGTGTTTTGAGCGATTTTGAAAATAGCAAAAGTGATTTTAACCATTTTAAAATCACTCCCAAACATGCACTTAGTCATTATAGCTTATCCTTTTTAGTTTCTAATAGGCGTACCTTGTAGTAATTACCTATTGCTGCTGGGGTTTCCCCCTAATTTCATCATATCACTAAAATGTTTCATATTAAAGAAAAAAGGTATTCTCCCTTTTAAATATGTATTATAAAGATTTGATCCTTTGTTGTCCTTATTAGCCTATTCCATGAAGGTGTCTTTTCTGTTGAAGAATTTGAATGTCTTCTGTAGGAATCTGAAGCAGACCAAGGTCAGCAAACTGCCCTAGAGGCCATTGAGGTTTCTACAGATTGTTCGCTACAGGAAGGCAAAATTAATGAGAAAGGGAAGGAGAAAGAAGAAAAAGATAAAGAAAAGAATAAGGATACAGACAAGGAGAAGGGGAAAGAAAAAGACGCAGATAGAAAGGTGGAAAATGAAAAAGAAAAACTTAAAAATATTGAAGGAACAAATTTGGATGCTTTGTTACAGAGACTCCCTGGTTGTGTCAGCCGTGACCTTATTGATCAGTTGACTGTAATTACTTCTCTTTCTCCCAGCTTAATTGTGATTTTAAGAGGGGATGGAAGGCTGGTTCTTTGTTAAGAATTTGATGCTAATATTCTTGTATATTTGCAGGTAGAGTTTTGTTATTTGAATTCCAAGGCTAACCGAAAGAAGCTTGTAAGGGCTTTGTTTAATGTACCTAGGACGTCTTTGGAGTTGCTGCCTTACTACTCACGCATGGTTGCGACATTGTCAACTTGTATGAAGGATGTATCTGTCATTCTCCTCCAGATGTTAGAGGAGGAGTTCAACTTTTTATTGAACAAAAAGGTCTCATTTAGTTCATATAACTTCTCTTTGGAATGGAGTTCCCCTTTTCTCTTCTATTATTATTATTTTTATTATTATTTTTTTTTTTTAAATCAAACAGGACCAAATGAACATTGAAACGAAGATTAGAAATATTAGGTTTATTGGAGAGCTTTGCAAGTTCAAAATTGCTTCAGCCGGCCTGGTTTTTAGCTGTTTGAAGGTATGAGTGGAGGAGACACCTGTTGATTTTTGTTCTTTTCTTTCATCATGTTCTATTGTTTTATGTATTTTGAAATCATATTTTTGTCACATGCATGCCAAAACTTTGTCAAATGTGGCGTGTGGTGTCAAATTTCCTTAGTCTATTGGGATTATTTTGTTTATCTAAAAAGCTTCATGCTTTTAATTGATTGTT
TCGACCACTTTGGGTGTTTCATTTTCATTACCTGTATATCATTTTTTCGGGGCGAGAAACAATTTTACTGACGAGGAAATTACAAAAATGGGGTAGGAAAACACAGCCCACAGCTAAGGAAGATTAAAAAGAGCTCCGCATAGAGAGTAATTATAAAGAATTCCAAATAGAGAGTATTCAAATGTGATTGTTGGAAATAGTACGAAATTTGCCCAAGAAAGAGCTGAGAAATAGTGTGTTGAAAACAATAATAGGCTTTCCTTTGTTGTTGAAGACTCTCTTGTTTTTAACTTGTCATAAAAGCCAAAAGAAAGGTCTAACTGCAAGTTGAACGAGCAATTAGCCTATGCTAGTTTGTAAACCAGAAGTTGATGTTTCTGTTGTGGTTGGAAAAATGATGGTTAATTGATGAACTCATTTTATCTCTTTTTGGCAGGCATGTTTAGATGATTTCACTCACCATAATATTGATGTTGCTTGTAATCTTCTTGAGACCTGTGGACGTTTTCTTTATCGGTCTCCTGAAACTACAGTAAGGATGGCTAACATGTTGGAGATATTGATGCGCTTGAAAAATGTAAAAAATTTGGATCCCCGGCACAGCACCCTGGTAGAAAATGCATACTATCTTTGCAAGCCTCCTGAGAGATCTGCGCGGGTGTCAAAAGTTCGTCCACCATTGCATCAGGTATGGGACGCATTCAATGTTCCCTACTTTCTCCCTACGTACAGTTGGTTCATATGTTGTATTTACTCATTGTATCCTGCAGTATATTAGAAAGTTGCTCTTCTCAGATCTAGACAAGTCTGCCATTGAGAATGTGTTGAGGCAACTTCGGAAACTTCCATGGAGTGAATGTGAGCAATACCTTTTAAAATGTTTTATGAAAGTTCACAAAGGGAAGTATGGACAGATTCACTTGATTGCTTCTCTAACGTCTGGTTTGAGTCGATATCATGACGAATTCTCAGTTGCCGTTGTTGATGAGGTCAGTTCACTGAAAACATTTTCAACTTAGAATGGGACTTGTCAGTCTACCTGATAAGATCATTTTACCTATAGAGCGTGCTTTATTGACGTCATTCCATATGTTTTTACATTTTTATGTATATTGAGGGTTGTTCAGTACTTCTTTTAATCGTGTAAAGATGCTTTCCTGTGATTAGAATCAAATTATATTTCAGTAGTAGTAGTAGCAGTACATATGTTAAATTTTTTGCTGCAGCTCAGAAAGTATTTTTGAACGTTTTAAAAACACAAGAACACGTGAAAGAGTTTTTTAGGAATGAAGAAAAGTAATTATTTTTTTGTTAATTGGCTTTGGTTCATATGTATATCAATTATCATTAATCTCTCTGTATAACTACACATTCTACTTCTCAGTAGCCTTTCTAGATTTATGTTCTCTTACTGGTTCCTTAACACATTTTCCTTTCACTAATATTTTTCGGTACATTTTCTTTTTCTCTTTACACCCACTTTATTCTATTAATCTTAACTAACTTCTTTGCAGGGCATTAAGGGGTCTTTTAGGGCACATACTTTTTCTTAACCAACTACAGAACTCATCCATAAAATTTGATTAATAGCAATTTCTTAGTTTATCTACTCTTAAAGAACATCTGGGAGCATATTTTCCCCCTTTTGCACATCTAAATATTTGGCAATTTTTTTTTTTTTTTTTGGATATGAAGTCTGGTTATTTGAAACTTATTGAATGGAAGTTATTTTGTACTACATGATGAATGTAGTTTGCATATTTGTCCAGGTTCTCTTGTTTATATCTGATTTATCTATTTTATTCATCTTGGTGGTGGTTACTTATCTCTATGATTTCTAGGTATTAGAAGAAATTAGGCTTGGACTTGAAGTAAATGACTATGGGATGCAGCAAAAACGCATTGCCCATATGCGGTTTTTGGGGGAGTTGTACAACTATGAACTTGTAGATTCATCTGTGGTCTTTGATACCCTTTATTTGATTATTGTCTTTGGCCAT
GGCACTTCAGAGGTACGGAATTGTTTTGTTACTCATTATAATTTTTTTTTTGTTCTTTCACCTTGATATGTAAGCACCTCAACAGGCTTTCCCACTTATATATATATTTTAAAGGAAACTGAGAAACTATATATTAGCAGAAAGTAAAAAGAACAACCTACGATCAGGTAGAGAGAACCCCTCCCCTAAGCAACTGTGTAGGAAAGCTTTCCAATTCTTATTAATCATGGATAGACTATAATTACAAAATAATTTTGTATGATTCGTGCACCACCCAGAAGCATCATGTTACACACTAGAACAAAAAGAGTTGAAAGAAAAGGACTTGAACTTCAAATGTCCTAAGAATGTCGCACAACGCCAAATAGTATTTACTTTAACTTTTGTAAAGCGCCAACTACTAAATTTTCTCACATCCCATGAACTGGACTTGTGATTACATCAACAGTTTTTCTTTGATGTGTCTTTCAGTGATCTTTGATTATTTATTAGATACTACTAGAATTGTATTTAAAATTTTAAATTTCATAAAAAAATATGAATGACTAGTTGTACCATACCTTAACACAAGTAGAGACCTTAGCTATAAGCAATCAATGGGCTTCCCTTTTGCACTACTGATGGCACCAAGAATCTTGGTAATCTAATCAAGTTGGATGGAAATTTGAAAGCATACTTTCTGTCATTCTGATCACAAAAATGGAGGAGGACTTGGCATTAGCAACTTTAAAGATTAAAAAAAATGTAGTCTTATCAACATGTGGGTTTGGAGATTTACACGAGAAAGAGATTCTAGATCTCTTTTGGTACTCTTAATTGTCAGAAGATTTTTCCGTGACATTCATCAGTTCTCATTAACAGTTGTGGTAAACCCGAAGAAATAATGTATGGAAGTATTGAAATGTGGGCAGTTTCTGAGATCCCTTTTGGCACTAAGATTCTTATTGGAAATTTTTTGTAGGATATTCTTCTTTGCCATGGGATTTGGCTTCCCTTCTGTTTTGAAAATTTTCTAGTTTCTTCAACAATGAATTTTATAGGAGAATCACTTCTTTTTTTTTTAGTGCCTTATATCTGCATTTGGATCTTGGGCATGGTTAAAAATATAGTTTCTGTGCCAACATGAGCAGCTTAGTGGTAATTGGTATGTAATCATTTTCAAAATATATAGTTTTAGTTCCATAGTAGTATTTCGATATGGTTTGCTTATGATCTGGTTGTCATTTTATCAAATTATAATTGCCTTCATTGAATTATTTTTATCCTTGGTAATCGGGTAAAAATTTGGGGGTGAGGCAGTAAAATAGCAAGGACTTGAGCTTATAGGATATGTGATCTGTTAATGTCCGATGAATGAAGTAGACCAAATAAATGTTCAGTGCTTGTGGAGGATACTGGGTCTTGGCTTATCCAGTACAGTACAAGTTTGTTTCTTTGTGCTTGGCCAAATGATCCCCCCCCCCCCCCCTCCAAGTCTAATGTGATTGATCTTTGTTGCTTTTAGCAAGATGTCTTGGATCCACCTGAGGACACTTTCCGCATAAGGATGATTATCACCCTTCTTCAAACCTGTGGTCACTACTTTGATCGAGGATCTTCAAAGAGGAAACTTGATCGATTTTTTATACACTTCCAGAAATATATTCTCAGCAAAGGAGCACTTCCATTGGACATTGAGTTCGACTTACAGGTCAGTTGTTTATCATGTTAGTTTACTTGCTAATATCTTTCCCATTGTATGTTAGCGTAAAATAACAGCTTATTCAAGTTGATAGTTTTCATCCTTGTGTTGTTATAGACTTGGTTCTATCTCCAAGCCGTGGAGATAACTTCGATAATAATTTGGCATTAAATGCAAAAGTCTGGATGTGGGCATTGCATTGCATTGCATGGACATGTTGCTGCAAGAACATATTTATTATAATACTTATTTTTTTAGAGATAATTTTTATACAAAAAGGAACTCTAAGCTTAAGCTTACACATTTATAAACTTACAAA
AATTTAACAAAATGAGTTTTGATGTATTTTGCTCTCAAATTTTTTTATTTTCGTGTATACATTTTGTACCATGAGTGTTCACAGGCCAATTACGCTAATAAGTGTCCTAAATCTTTAGGTTCATTGTAATCATGTTCTATTACGTTTGCAAATTATTCATGCAAGATTATTTAATGGTCACTATGATTTTTTTTCTCCAGTTTAAATTCTTTTTTTCTCCCTCAATTCTTGCACGCCAAATTATCTGATGAAGTTGCCTTTACAGGATTTATTTGCAGAATTGCAACCAAACATGACCAGATACTCATCCATTGAAGAGATAAATGCTGCTTTTGTAGAACTTGAGGAGCATGAACGTTCGGTCTCAAATGACAAGCCTAATACTGAGAAACACCTTGATGCTGAAAAGCCCAGCAGAGCAACTTCCAATACCGCCTCAGCTAATGGAAGGGACACAGTGAATGGCTCTAAAGAAAATGGTGGAGCTCATGAAGACGGTGTTGATAGCGACAGTGATACAGGAAGTGGCACTATTGAGGCAGAGGGACGTGATGATGAAGAGTCAGACTTGGAGAATCATGAGGATGGATGTGACACTGAGGATGACGAAGACGATGAAGAAGCGGGTGGGCCTGCTTCTGATGAGGATGATGAGGTTCACGTTAGACAGAAAGTGCCTGAGGTAGACCCCAGAGAAGAAGCCAATTTCGAGCAAGAGCTCAGGGCTGTAATGCAGGTAAGATTTATTTTTGTGATCACTTTACTTTCTTGCCAGTACTCTTACACTTCTATACTCAAGGTTTCAATTTGATGAATGCAACAGGGCATTTAAAATTCTAATTTTAAAAGGCTTAGTTGGTTCGTAGGGGCACTTGTAGACGTTCTATTATTTTAAAAAATTTGTTGATGTCTGATATATTTCTGAATGCAGGAGAGTATGGATCAGCGCAGGCAAGAGCTTCGTGGCCGACCAACATTGAATATGATGATACCAATGAATTTGTTTGAGGGGTCGACGAGGGACCACCACGGAAGGGGGGTTGGGGGAGAGAGTGGGGACGAGGGGTTGGACGAGGATGCTGGTGGAAGCAAGGAGGTTCAGGTGAAAGTTCTTGTTAAGCGTGGGAACAAGCAGCAAACCAAGAAAATGTACATCCCTCGTGATTGCACACTTTTACAGAGCACTAAACAGAAAGAAGCAGCGGAGCTAGAAGAGAAGCAAGATATTAAGAGGTTAATTTTAGAGTACAATGACCGGGAAGAGGAAGAACTTAATGGATTAGGCTCCCAAACGATGAATTGGATGCAGACAGGGGGCAATAGGGGAGTCCCTACTAGAGGCAACAATTGGGAAGCTTCGGGCGGGAGGAGTGGTGGGTCACGTCATCCCCATCATCGGTATCCTGGCAGTGGCGTGCATTATAGTAGAAAGAAGTGA

mRNA sequence

ATGCGACTAGCGAGTCCAGAACCGGGTCAATACCAGTTGGGCCTGGATCCAAGCCCAAGCCCAACAAAGCAGTATAAAGATGAGCCCAACCCGTCTTCGCGCGCGTGTTCGTCAACCCCTCGTTTCAGTTCTCAGTTCACAATCCTCCCTTCCCACCGTTTCTCCGTCTCCGTCAGTGCTCCACTGCCGACGATCACTTTCTCTTCACTTTCCGGCGGAACTGACATGGACCATCATGAAGATGATGGCCGCCCAGGGGGTGAAAGTCAACCCAAGAGAGATGATGAGGAAACTGTTGCTCGTCAGGAAGAAATTAAAAAGTCATTTGAGGCTAAAATGGCTCTTCGACAAAGTAATCTGAATCCTGAGAGGCCTGACTCTGGATTCCTTAGAACTTTGGATTCTAGTATTAGACGCAACACAACAGTTATTAAAAAATTAAAGCAGATAAATGAGGAACAACGAGAAGGGCTGATGGATGAGTTACGAAATGTAAATATGAGTAAATTTGTTAGTGAGGCTGTTTCTGCTATCTGTGATGCGAAGCTTAGAACTTCTGATATTCAGGCAGCTGTTCAGTCCGGGGATGAGTTGGATGCTGATAGAAACTTGAAGGCCATGAAGAAGCGCAGCACTCTCAAACTTCTTATGGAACTTTTTTTTGTTGGAGTTGTAGAAGACACTGCAATTTTCAATAATATTATTAAGGATCTTACAAGTATTGAACATTTGAGGGATCGGGATACTACCCTAACTAATTTGACTCTCCTTGCTAGCTTTGCTCGGCAAGGGAGAATCCTTTTGGGTCTACTGCCTACTGGACAAGATCACGAGGAGTTTTTTAAGAGCCTCAGTATTACTGCCGACCAAAAGAAATTCTTCAGAAAGGCTTTTCATACATATTATGACGCTGCTGCAGAATTGCTTCAGTCTGAACATACTTCACTTCGCCAAATGGAGCAGGAAAATGCAAAGATCTTGAATGCAAAAGGAGAGCTTAATGATGAAAATGTCTCATCATATGAGAAGTTGCGAAAATCTTATGACCATTTATATCGAAATGTCTCATCCTTAGCAGAAGCACTTGATATGCAACCCCCAGTGATGCCAGAGGATGGCCACACGACCAGGGTTTCTGCAGGAGAAGATGTTTCATCACCTGCTGCTGGAAAAGATTCTTCTGTAATTGAAGCCATATGGGATGATGAAGACACCCGGGCTTTCTATGAATGCTTACCAGATCTCAGGGCGTTTGTTCCAGCGGTACTGTTGGGAGAAGCAGAGCCTAAAGCAAATGAGCAATCTGCAAAGCCGGCAGAAAATCTGGCTGAATCTGAAGCAGACCAAGGTCAGCAAACTGCCCTAGAGGCCATTGAGGTTTCTACAGATTGTTCGCTACAGGAAGGCAAAATTAATGAGAAAGGGAAGGAGAAAGAAGAAAAAGATAAAGAAAAGAATAAGGATACAGACAAGGAGAAGGGGAAAGAAAAAGACGCAGATAGAAAGGTGGAAAATGAAAAAGAAAAACTTAAAAATATTGAAGGAACAAATTTGGATGCTTTGTTACAGAGACTCCCTGGTTGTGTCAGCCGTGACCTTATTGATCAGTTGACTGTAGAGTTTTGTTATTTGAATTCCAAGGCTAACCGAAAGAAGCTTGTAAGGGCTTTGTTTAATGTACCTAGGACGTCTTTGGAGTTGCTGCCTTACTACTCACGCATGGTTGCGACATTGTCAACTTGTATGAAGGATGTATCTGTCATTCTCCTCCAGATGTTAGAGGAGGAGTTCAACTTTTTATTGAACAAAAAGGACCAAATGAACATTGAAACGAAGATTAGAAATATTAGGTTTATTGGAGAGCTTTGCAAGTTCAAAATTGCTTCAGCCGGCCTGGTTTTTAGCTGTTTGAAGGCATGTTTAGATGATTTCACTCACCATAATATTGATGTTGCTTGTAATCTTCTTGAGACCTGTGGACGTTTTCTTTATCGGTCTCC
TGAAACTACAGTAAGGATGGCTAACATGTTGGAGATATTGATGCGCTTGAAAAATGTAAAAAATTTGGATCCCCGGCACAGCACCCTGGTAGAAAATGCATACTATCTTTGCAAGCCTCCTGAGAGATCTGCGCGGGTGTCAAAAGTTCGTCCACCATTGCATCAGTATATTAGAAAGTTGCTCTTCTCAGATCTAGACAAGTCTGCCATTGAGAATGTGTTGAGGCAACTTCGGAAACTTCCATGGAGTGAATGTGAGCAATACCTTTTAAAATGTTTTATGAAAGTTCACAAAGGGAAGTATGGACAGATTCACTTGATTGCTTCTCTAACGTCTGGTTTGAGTCGATATCATGACGAATTCTCAGTTGCCGTTGTTGATGAGGTATTAGAAGAAATTAGGCTTGGACTTGAAGTAAATGACTATGGGATGCAGCAAAAACGCATTGCCCATATGCGGTTTTTGGGGGAGTTGTACAACTATGAACTTGTAGATTCATCTGTGGTCTTTGATACCCTTTATTTGATTATTGTCTTTGGCCATGGCACTTCAGAGCAAGATGTCTTGGATCCACCTGAGGACACTTTCCGCATAAGGATGATTATCACCCTTCTTCAAACCTGTGGTCACTACTTTGATCGAGGATCTTCAAAGAGGAAACTTGATCGATTTTTTATACACTTCCAGAAATATATTCTCAGCAAAGGAGCACTTCCATTGGACATTGAGTTCGACTTACAGGATTTATTTGCAGAATTGCAACCAAACATGACCAGATACTCATCCATTGAAGAGATAAATGCTGCTTTTGTAGAACTTGAGGAGCATGAACGTTCGGTCTCAAATGACAAGCCTAATACTGAGAAACACCTTGATGCTGAAAAGCCCAGCAGAGCAACTTCCAATACCGCCTCAGCTAATGGAAGGGACACAGTGAATGGCTCTAAAGAAAATGGTGGAGCTCATGAAGACGGTGTTGATAGCGACAGTGATACAGGAAGTGGCACTATTGAGGCAGAGGGACGTGATGATGAAGAGTCAGACTTGGAGAATCATGAGGATGGATGTGACACTGAGGATGACGAAGACGATGAAGAAGCGGGTGGGCCTGCTTCTGATGAGGATGATGAGGTTCACGTTAGACAGAAAGTGCCTGAGGTAGACCCCAGAGAAGAAGCCAATTTCGAGCAAGAGCTCAGGGCTGTAATGCAGGAGAGTATGGATCAGCGCAGGCAAGAGCTTCGTGGCCGACCAACATTGAATATGATGATACCAATGAATTTGTTTGAGGGGTCGACGAGGGACCACCACGGAAGGGGGGTTGGGGGAGAGAGTGGGGACGAGGGGTTGGACGAGGATGCTGGTGGAAGCAAGGAGGTTCAGGTGAAAGTTCTTGTTAAGCGTGGGAACAAGCAGCAAACCAAGAAAATGTACATCCCTCGTGATTGCACACTTTTACAGAGCACTAAACAGAAAGAAGCAGCGGAGCTAGAAGAGAAGCAAGATATTAAGAGGTTAATTTTAGAGTACAATGACCGGGAAGAGGAAGAACTTAATGGATTAGGCTCCCAAACGATGAATTGGATGCAGACAGGGGGCAATAGGGGAGTCCCTACTAGAGGCAACAATTGGGAAGCTTCGGGCGGGAGGAGTGGTGGGTCACGTCATCCCCATCATCGGTATCCTGGCAGTGGCGTGCATTATAGTAGAAAGAAGTGA

Coding sequence (CDS)

ATGCGACTAGCGAGTCCAGAACCGGGTCAATACCAGTTGGGCCTGGATCCAAGCCCAAGCCCAACAAAGCAGTATAAAGATGAGCCCAACCCGTCTTCGCGCGCGTGTTCGTCAACCCCTCGTTTCAGTTCTCAGTTCACAATCCTCCCTTCCCACCGTTTCTCCGTCTCCGTCAGTGCTCCACTGCCGACGATCACTTTCTCTTCACTTTCCGGCGGAACTGACATGGACCATCATGAAGATGATGGCCGCCCAGGGGGTGAAAGTCAACCCAAGAGAGATGATGAGGAAACTGTTGCTCGTCAGGAAGAAATTAAAAAGTCATTTGAGGCTAAAATGGCTCTTCGACAAAGTAATCTGAATCCTGAGAGGCCTGACTCTGGATTCCTTAGAACTTTGGATTCTAGTATTAGACGCAACACAACAGTTATTAAAAAATTAAAGCAGATAAATGAGGAACAACGAGAAGGGCTGATGGATGAGTTACGAAATGTAAATATGAGTAAATTTGTTAGTGAGGCTGTTTCTGCTATCTGTGATGCGAAGCTTAGAACTTCTGATATTCAGGCAGCTGTTCAGTCCGGGGATGAGTTGGATGCTGATAGAAACTTGAAGGCCATGAAGAAGCGCAGCACTCTCAAACTTCTTATGGAACTTTTTTTTGTTGGAGTTGTAGAAGACACTGCAATTTTCAATAATATTATTAAGGATCTTACAAGTATTGAACATTTGAGGGATCGGGATACTACCCTAACTAATTTGACTCTCCTTGCTAGCTTTGCTCGGCAAGGGAGAATCCTTTTGGGTCTACTGCCTACTGGACAAGATCACGAGGAGTTTTTTAAGAGCCTCAGTATTACTGCCGACCAAAAGAAATTCTTCAGAAAGGCTTTTCATACATATTATGACGCTGCTGCAGAATTGCTTCAGTCTGAACATACTTCACTTCGCCAAATGGAGCAGGAAAATGCAAAGATCTTGAATGCAAAAGGAGAGCTTAATGATGAAAATGTCTCATCATATGAGAAGTTGCGAAAATCTTATGACCATTTATATCGAAATGTCTCATCCTTAGCAGAAGCACTTGATATGCAACCCCCAGTGATGCCAGAGGATGGCCACACGACCAGGGTTTCTGCAGGAGAAGATGTTTCATCACCTGCTGCTGGAAAAGATTCTTCTGTAATTGAAGCCATATGGGATGATGAAGACACCCGGGCTTTCTATGAATGCTTACCAGATCTCAGGGCGTTTGTTCCAGCGGTACTGTTGGGAGAAGCAGAGCCTAAAGCAAATGAGCAATCTGCAAAGCCGGCAGAAAATCTGGCTGAATCTGAAGCAGACCAAGGTCAGCAAACTGCCCTAGAGGCCATTGAGGTTTCTACAGATTGTTCGCTACAGGAAGGCAAAATTAATGAGAAAGGGAAGGAGAAAGAAGAAAAAGATAAAGAAAAGAATAAGGATACAGACAAGGAGAAGGGGAAAGAAAAAGACGCAGATAGAAAGGTGGAAAATGAAAAAGAAAAACTTAAAAATATTGAAGGAACAAATTTGGATGCTTTGTTACAGAGACTCCCTGGTTGTGTCAGCCGTGACCTTATTGATCAGTTGACTGTAGAGTTTTGTTATTTGAATTCCAAGGCTAACCGAAAGAAGCTTGTAAGGGCTTTGTTTAATGTACCTAGGACGTCTTTGGAGTTGCTGCCTTACTACTCACGCATGGTTGCGACATTGTCAACTTGTATGAAGGATGTATCTGTCATTCTCCTCCAGATGTTAGAGGAGGAGTTCAACTTTTTATTGAACAAAAAGGACCAAATGAACATTGAAACGAAGATTAGAAATATTAGGTTTATTGGAGAGCTTTGCAAGTTCAAAATTGCTTCAGCCGGCCTGGTTTTTAGCTGTTTGAAGGCATGTTTAGATGATTTCACTCACCATAATATTGATGTTGCTTGTAATCTTCTTGAGACCTGTGGACGTTTTCTTTATCGGTCTCC
TGAAACTACAGTAAGGATGGCTAACATGTTGGAGATATTGATGCGCTTGAAAAATGTAAAAAATTTGGATCCCCGGCACAGCACCCTGGTAGAAAATGCATACTATCTTTGCAAGCCTCCTGAGAGATCTGCGCGGGTGTCAAAAGTTCGTCCACCATTGCATCAGTATATTAGAAAGTTGCTCTTCTCAGATCTAGACAAGTCTGCCATTGAGAATGTGTTGAGGCAACTTCGGAAACTTCCATGGAGTGAATGTGAGCAATACCTTTTAAAATGTTTTATGAAAGTTCACAAAGGGAAGTATGGACAGATTCACTTGATTGCTTCTCTAACGTCTGGTTTGAGTCGATATCATGACGAATTCTCAGTTGCCGTTGTTGATGAGGTATTAGAAGAAATTAGGCTTGGACTTGAAGTAAATGACTATGGGATGCAGCAAAAACGCATTGCCCATATGCGGTTTTTGGGGGAGTTGTACAACTATGAACTTGTAGATTCATCTGTGGTCTTTGATACCCTTTATTTGATTATTGTCTTTGGCCATGGCACTTCAGAGCAAGATGTCTTGGATCCACCTGAGGACACTTTCCGCATAAGGATGATTATCACCCTTCTTCAAACCTGTGGTCACTACTTTGATCGAGGATCTTCAAAGAGGAAACTTGATCGATTTTTTATACACTTCCAGAAATATATTCTCAGCAAAGGAGCACTTCCATTGGACATTGAGTTCGACTTACAGGATTTATTTGCAGAATTGCAACCAAACATGACCAGATACTCATCCATTGAAGAGATAAATGCTGCTTTTGTAGAACTTGAGGAGCATGAACGTTCGGTCTCAAATGACAAGCCTAATACTGAGAAACACCTTGATGCTGAAAAGCCCAGCAGAGCAACTTCCAATACCGCCTCAGCTAATGGAAGGGACACAGTGAATGGCTCTAAAGAAAATGGTGGAGCTCATGAAGACGGTGTTGATAGCGACAGTGATACAGGAAGTGGCACTATTGAGGCAGAGGGACGTGATGATGAAGAGTCAGACTTGGAGAATCATGAGGATGGATGTGACACTGAGGATGACGAAGACGATGAAGAAGCGGGTGGGCCTGCTTCTGATGAGGATGATGAGGTTCACGTTAGACAGAAAGTGCCTGAGGTAGACCCCAGAGAAGAAGCCAATTTCGAGCAAGAGCTCAGGGCTGTAATGCAGGAGAGTATGGATCAGCGCAGGCAAGAGCTTCGTGGCCGACCAACATTGAATATGATGATACCAATGAATTTGTTTGAGGGGTCGACGAGGGACCACCACGGAAGGGGGGTTGGGGGAGAGAGTGGGGACGAGGGGTTGGACGAGGATGCTGGTGGAAGCAAGGAGGTTCAGGTGAAAGTTCTTGTTAAGCGTGGGAACAAGCAGCAAACCAAGAAAATGTACATCCCTCGTGATTGCACACTTTTACAGAGCACTAAACAGAAAGAAGCAGCGGAGCTAGAAGAGAAGCAAGATATTAAGAGGTTAATTTTAGAGTACAATGACCGGGAAGAGGAAGAACTTAATGGATTAGGCTCCCAAACGATGAATTGGATGCAGACAGGGGGCAATAGGGGAGTCCCTACTAGAGGCAACAATTGGGAAGCTTCGGGCGGGAGGAGTGGTGGGTCACGTCATCCCCATCATCGGTATCCTGGCAGTGGCGTGCATTATAGTAGAAAGAAGTGA

Protein sequence

MRLASPEPGQYQLGLDPSPSPTKQYKDEPNPSSRACSSTPRFSSQFTILPSHRFSVSVSAPLPTITFSSLSGGTDMDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQSGDELDADRNLKAMKKRSTLKLLMELFFVGVVEDTAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK
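
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the function name `translate` is hypothetical and this is not necessarily how the database derived the polypeptide.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGCGACTAGCGAGTCCAGAACCGGGT"))
```

Applied to the first nine codons of the CDS, this yields the first nine residues of the protein sequence (MRLASPEPG); the terminal TGA of the CDS is read as a stop codon and is not emitted.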
Homology
BLAST of HG10022266 vs. NCBI nr
Match: XP_038890797.1 (regulator of nonsense transcripts UPF2 [Benincasa hispida])

HSP 1 Score: 2152.1 bits (5575), Expect = 0.0e+00
Identity = 1139/1195 (95.31%), Positives = 1153/1195 (96.49%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGR GGESQPKRDDEETVARQEEIKK FEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRQGGESQPKRDDEETVARQEEIKKIFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ  
Sbjct: 61   SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLSPTGQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESE DQGQQ +LEAIE+STDCSLQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEVDQGQQASLEAIEISTDCSLQ 420

Query: 496  EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
            +GKIN   EKGK++EEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421  DGKINEKGEKGKDREEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVEFCYLNSK+NRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481  LPGCVSRDLIDQLTVEFCYLNSKSNRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781  LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPN+TRYSSIEEIN AFVELEEHE
Sbjct: 841  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNVTRYSSIEEINTAFVELEEHE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
            RSVSNDKPNTEKHLDAEKPSRATSN+ SANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI
Sbjct: 901  RSVSNDKPNTEKHLDAEKPSRATSNSTSANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
            EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961  EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
            ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGS RDHHGRGVGGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGS-RDHHGRGVGGESGDEGLDEDAGGS 1080

Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
            KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140

Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
             NGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGG RHPHHRYPGSG+HYSRKK
Sbjct: 1141 HNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGLRHPHHRYPGSGMHYSRKK 1193

BLAST of HG10022266 vs. NCBI nr
Match: XP_008463566.1 (PREDICTED: LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 [Cucumis melo])

HSP 1 Score: 2150.9 bits (5572), Expect = 0.0e+00
Identity = 1136/1195 (95.06%), Positives = 1148/1195 (96.07%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ  
Sbjct: 61   SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQ+EQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQLEQENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKP ENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPTENLAESEAEQGQQTSLEAIEVSTDCPLQ 420

Query: 496  EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
            +GKIN   EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEK KLKNIEGTNLDALLQR
Sbjct: 421  DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKXKLKNIEGTNLDALLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781  LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
            RSVSNDKPNTEKHLDAEKPSRATS T SANGRD VNGSKEN GAHEDG DSDSDTGSGTI
Sbjct: 901  RSVSNDKPNTEKHLDAEKPSRATSTTTSANGRDRVNGSKENSGAHEDGADSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
            EAEGRDDEESDLENHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961  EAEGRDDEESDLENHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
            ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGGS 1080

Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
            KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140

Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LNGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGGSRHPHHRYPGSG+HYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGSRHPHHRYPGSGMHYSRKK 1194

BLAST of HG10022266 vs. NCBI nr
Match: XP_004143811.1 (regulator of nonsense transcripts UPF2 [Cucumis sativus] >KGN51237.1 hypothetical protein Csa_008908 [Cucumis sativus])

HSP 1 Score: 2150.6 bits (5571), Expect = 0.0e+00
Identity = 1138/1196 (95.15%), Positives = 1151/1196 (96.24%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRPGGESQPKRDDEE+VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPGGESQPKRDDEESVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTTVIKKLKQINEEQREGLMD+LRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ  
Sbjct: 61   SIKRNTTVIKKLKQINEEQREGLMDDLRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAEQGQQTSLEAIEVSTDCLLQ 420

Query: 496  EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
            +GKIN   EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421  DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKEKLKNIEGTNLDALLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            VSVILLQMLEEEF+FLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541  VSVILLQMLEEEFSFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781  LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
            RSVSNDKPNTEKHLDAEKPSRATSN  SANGRDTVNGSKENGGAHEDG DSDSDTGSGTI
Sbjct: 901  RSVSNDKPNTEKHLDAEKPSRATSNITSANGRDTVNGSKENGGAHEDGADSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLE-NHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFE 1095
            EAEGRDDEESDLE NHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFE
Sbjct: 961  EAEGRDDEESDLENNHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFE 1020

Query: 1096 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGG 1155
            QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGG
Sbjct: 1021 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGG 1080

Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
            SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE
Sbjct: 1081 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1140

Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            ELNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGGSRHPHHRYPGSGVHYSRKK
Sbjct: 1141 ELNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGSRHPHHRYPGSGVHYSRKK 1195

BLAST of HG10022266 vs. NCBI nr
Match: KAG6578740.1 (Regulator of nonsense transcripts UPF2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2127.4 bits (5511), Expect = 0.0e+00
Identity = 1128/1229 (91.78%), Positives = 1158/1229 (94.22%), Query Frame = 0

Query: 43   SSQFTILPS---HRFSVSVSAPLPTITFSSL-SGGTDMDHHEDDGRPGGESQPKRDDEET 102
            S+  T+ PS    R S+S+  P   +    L +  TDMDHHEDDGRP  ESQPKRDDEET
Sbjct: 28   STIHTVSPSVLHCRRSLSLHLPAELLVPKLLQNNSTDMDHHEDDGRPVSESQPKRDDEET 87

Query: 103  VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSIRRNTTVIKKLKQINEEQREGL 162
             ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSI+RNTT+IKKLKQIN+EQREGL
Sbjct: 88   AARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSIKRNTTIIKKLKQINDEQREGL 147

Query: 163  MDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ------------------------- 222
            MDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ                         
Sbjct: 148  MDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQICSLLHQRYKDFSPCLIQGLLKVFF 207

Query: 223  ---SGDELDADRNLKAMKKRSTLKLLMELFFVGVVEDTAIFNNIIKDLTSIEHLRDRDTT 282
               SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED+AIFNNIIKDLTSIEHLRDRDTT
Sbjct: 208  PGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVEDSAIFNNIIKDLTSIEHLRDRDTT 267

Query: 283  LTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQ 342
            LTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+ITADQKKFFRKAFHTYYDAAAELLQ
Sbjct: 268  LTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNITADQKKFFRKAFHTYYDAAAELLQ 327

Query: 343  SEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMP 402
            SEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEA+DMQPPVMP
Sbjct: 328  SEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEAVDMQPPVMP 387

Query: 403  EDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK 462
            EDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK
Sbjct: 388  EDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK 447

Query: 463  ANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTD 522
            ANEQSAKPAENLAESEADQGQQT+LEA+E+STD  LQ+GKINEKGK+KEEKDKEKNKDTD
Sbjct: 448  ANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLQDGKINEKGKDKEEKDKEKNKDTD 507

Query: 523  KEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR 582
            KEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR
Sbjct: 508  KEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR 567

Query: 583  KKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE 642
            KKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE
Sbjct: 568  KKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE 627

Query: 643  TKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT 702
            TKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT
Sbjct: 628  TKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT 687

Query: 703  VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS 762
            VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS
Sbjct: 688  VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS 747

Query: 763  DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV 822
            DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV
Sbjct: 748  DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV 807

Query: 823  AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGT 882
            AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLI+VFGHGT
Sbjct: 808  AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLILVFGHGT 867

Query: 883  SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDIE 942
            SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLD+E
Sbjct: 868  SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDVE 927

Query: 943  FDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRATSNT 1002
            FDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSVSN KPNTEKHLDA+KPSRATSNT
Sbjct: 928  FDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSVSNVKPNTEKHLDAKKPSRATSNT 987

Query: 1003 ASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDED 1062
             SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAEG DDEESDLENHEDGCDTEDDED
Sbjct: 988  TSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAEGHDDEESDLENHEDGCDTEDDED 1047

Query: 1063 DEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM 1122
            DEEAGGPASDEDDEVHVR KV EVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM
Sbjct: 1048 DEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM 1107

Query: 1123 IPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDC 1182
            IPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEVQVKVLVKRGNKQQTKKMYIPRD 
Sbjct: 1108 IPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDS 1167

Query: 1183 TLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVPTRGNN 1240
             LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTG NR  PTRGNN
Sbjct: 1168 ALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGNNR-APTRGNN 1227

BLAST of HG10022266 vs. NCBI nr
Match: XP_023550316.1 (regulator of nonsense transcripts UPF2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2121.7 bits (5496), Expect = 0.0e+00
Identity = 1116/1192 (93.62%), Positives = 1140/1192 (95.64%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRP  ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ  
Sbjct: 61   SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGD+LDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDDLDADRNLKAMKKRSTLKLLLELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LE +E+STD  LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEPVEISTDSLLQ 420

Query: 496  EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
            +GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421  DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480

Query: 556  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
            CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540

Query: 616  ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
            ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541  ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600

Query: 676  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
            NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660

Query: 736  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
            ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720

Query: 796  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
            YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780

Query: 856  YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
            YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781  YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840

Query: 916  LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
            LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841  LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900

Query: 976  SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
            SN KPNTEKHLDA+KPSRATSNT SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901  SNVKPNTEKHLDAKKPSRATSNTTSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAE 960

Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
            G DDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961  GHDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020

Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
            AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080

Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
            QVKVLVKRGNKQQTKKMYIPRD  LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140

Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LGSQTMNWMQTG NR  PTRGNNW+ASGGRSGGS HPHHRYPG GVHYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGVHYSRKK 1191

BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match: F4IUX6 (Regulator of nonsense transcripts UPF2 OS=Arabidopsis thaliana OX=3702 GN=UPF2 PE=2 SV=1)

HSP 1 Score: 1538.5 bits (3982), Expect = 0.0e+00
Identity = 831/1196 (69.48%), Positives = 992/1196 (82.94%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDH ED+         K+DDEE +AR EEIKKS EAK+ LRQ+NLNPERPDS +LRTLDS
Sbjct: 1    MDHPEDESH-----SEKQDDEEALARLEEIKKSIEAKLTLRQNNLNPERPDSAYLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNT VIKKLKQINEEQREGLMD+LR VN+SKFVSEAV+AIC+AKL++SDIQAAVQ  
Sbjct: 61   SIKRNTAVIKKLKQINEEQREGLMDDLRGVNLSKFVSEAVTAICEAKLKSSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      S ++L+AD+N KAMKKRSTLKLL+EL++VGV+ED
Sbjct: 121  SLLHQRYKEFSASLTQGLLKVFFPGKSAEDLEADKNSKAMKKRSTLKLLLELYYVGVIED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            + IF NIIKDLTS+E L+DRDTT TNLTLL SFARQGRI LGL  +GQD E+FFK L +T
Sbjct: 181  SNIFINIIKDLTSVEQLKDRDTTQTNLTLLTSFARQGRIFLGLPISGQD-EDFFKGLDVT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKK F+KAF+TYYDA A+LLQSEH  L QME+ENAK++NAKGEL++++ SSYEKLRKS
Sbjct: 241  ADQKKSFKKAFNTYYDALADLLQSEHKLLLQMEKENAKLVNAKGELSEDSASSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRN+SSLAEALDMQPPVMPEDG TTR++AG++ S     KD+SV E IWDDEDT+ 
Sbjct: 301  YDHLYRNISSLAEALDMQPPVMPEDG-TTRLTAGDEASPSGTVKDTSVPEPIWDDEDTKT 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAD--QGQQTALEAIEVSTDCS 495
            FYECLPDLRAFVPAVLLGEAEPK+NEQSAK  E L+ES ++  + QQT  +  EVS D +
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKSNEQSAKAKEKLSESSSEVVENQQTTEDTTEVSADSA 420

Query: 496  LQEGKIN-EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
              + + N E+ KEKEE +KEK KDT KEKGKEKD+++K+E+EKEK K+++  N + LLQR
Sbjct: 421  SMDDRSNAEQPKEKEEVEKEKAKDTKKEKGKEKDSEKKMEHEKEKGKSLDVANFERLLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVE+CYLNSK NRKKLV+ALFNVPRTSLELL YYSRMVATL++CMKD
Sbjct: 481  LPGCVSRDLIDQLTVEYCYLNSKTNRKKLVKALFNVPRTSLELLAYYSRMVATLASCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            +  +L+QMLE+EFN L++KKDQMNIETKIRNIRFIGELCKFKI  AGLVFSCLKACLD+F
Sbjct: 541  IPSMLVQMLEDEFNSLVHKKDQMNIETKIRNIRFIGELCKFKIVPAGLVFSCLKACLDEF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETT+RM NML+ILMRLKNVKNLDPR STLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTLRMTNMLDILMRLKNVKNLDPRQSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSAR+SKVRPPLHQY+RKLLFSDLDK +I NVL+QLRKLPWSECEQY+LKCFMKVH
Sbjct: 661  KPPERSARISKVRPPLHQYVRKLLFSDLDKDSIANVLKQLRKLPWSECEQYILKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSR+HDEF VAVVDEVLEEIR+GLE+N+YG QQKR+AHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRHHDEFVVAVVDEVLEEIRVGLELNEYGAQQKRLAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYE VDSSV+F+TLYL +++GH TSEQ+VLDPPED FR+RM+I LL+TCGHYFDRGSS
Sbjct: 781  LYNYEHVDSSVIFETLYLTLLYGHDTSEQEVLDPPEDFFRVRMVIILLETCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            K++LD+F IHFQ+YILSKG LPLDIEFDLQDLFA L+PNMTRYS+I+E+NAA ++LEE E
Sbjct: 841  KKRLDQFLIHFQRYILSKGHLPLDIEFDLQDLFANLRPNMTRYSTIDEVNAAILQLEERE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
             + S DK + E+H D +  ++++S+  S+NG+ T    +ENG AH  G +SDSD+GSG++
Sbjct: 901  HASSGDKVSIERHSDTKPSNKSSSDVISSNGKSTAKDIRENGEAH--GEESDSDSGSGSV 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
              +G+ +EE D  NHE G ++ D +D ++  GP SD DD+  VRQKV  VD  E+A+F+Q
Sbjct: 961  VRDGQ-NEELDDGNHERGSESGDGDDYDDGDGPGSD-DDKFRVRQKVVTVDLEEQADFDQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG-VGGESGDEGLDEDAGG 1155
            EL+A++QESM+QR+ ELRGRP LNM IPM++FEGS +DHH  G V GE+G+E LDE+ G 
Sbjct: 1021 ELKALLQESMEQRKLELRGRPALNMTIPMSVFEGSGKDHHHFGRVVGENGEEVLDEENGE 1080

Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
             +EVQVKVLVKRGNKQQT++M IP DC L+QSTKQKEAAELEEKQDIKRL+LEYN+R+EE
Sbjct: 1081 QREVQVKVLVKRGNKQQTRQMLIPSDCALVQSTKQKEAAELEEKQDIKRLVLEYNERDEE 1140

Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            E NGLG+Q +NW  +GG+RG    G       G+SGGSRH  + + G G  Y  ++
Sbjct: 1141 EANGLGTQILNW-TSGGSRGSTRTGE----GSGKSGGSRHRFYYHQGGGGSYHARR 1180

BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match: A2AT37 (Regulator of nonsense transcripts 2 OS=Mus musculus OX=10090 GN=Upf2 PE=1 SV=1)

HSP 1 Score: 589.3 bits (1518), Expect = 9.8e-167
Identity = 430/1212 (35.48%), Positives = 658/1212 (54.29%), Query Frame = 0

Query: 80   EDDGRPGGESQPKRDDEETVARQEEIKKSF----------EAKMALRQSNLN--PERPDS 139
            E++ R   E Q KR  EE  A+ +E ++S           + +  LR  N N    RP+ 
Sbjct: 98   EEEERKKQEEQAKRQQEEAAAQLKEKEESLQLHQEAWERHQLRKELRSKNQNAPDNRPEE 157

Query: 140  GFLRTLDSSIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSD 199
             F   LDSS+++NT  +KKLK I E+QR+ L  +   +N+SK+++EAV++I +AKL+ SD
Sbjct: 158  NFFSRLDSSLKKNTAFVKKLKTITEQQRDSLSHDFNGLNLSKYIAEAVASIVEAKLKLSD 217

Query: 200  IQAAV----------------------QSGDELDADRNLKAMKKRSTLKLLMELFFVGVV 259
            +  A                       +  +    ++     K R+ L+ + EL  VG+ 
Sbjct: 218  VNCAAHLCSLFHQRYSDFAPSLLQVWKKHFEARKEEKTPNITKLRTDLRFIAELTIVGIF 277

Query: 260  EDTAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQ-GRILLGLLP------TGQDHE 319
             D    + I + L SI +  DR++  T+++++ SF R  G  + GL+P        + + 
Sbjct: 278  TDKEGLSLIYEQLKSIIN-ADRESH-THVSVVISFCRHCGDDIAGLVPRKVKSAAEKFNL 337

Query: 320  EFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENV 379
             F  S  I+ ++++ F+     Y+ +  + L+ +H  L+  E++N +IL++KGEL+++  
Sbjct: 338  SFPPSEIISPEKQQPFQNLLKEYFTSLTKHLKRDHRELQNTERQNRRILHSKGELSEDRH 397

Query: 380  SSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRV-SAGEDVSSPAAGKDSSVIE 439
              YE+   SY  L  N  SLA+ LD   P +P+D  T      G D+ +P    +  +  
Sbjct: 398  KQYEEFAMSYQKLLANSQSLADLLDENMPDLPQDKPTPEEHGPGIDIFTPGKPGEYDLEG 457

Query: 440  AIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEA 499
             IW+DED R FYE L DL+AFVPA+L  + E   N+ S K     A+   D  + ++ + 
Sbjct: 458  GIWEDEDARNFYENLIDLKAFVPAILFKDNEKSQNKDSNKDDSKEAKEPKDNKEASSPDD 517

Query: 500  IEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTN 559
            +E+     L+  +IN+   E E  D+   +D  K+   E++ + +  +    LK I    
Sbjct: 518  LEL----ELENLEINDDTLELEGADEA--EDLTKKLLDEQEQEDEEASTGSHLKLI---- 577

Query: 560  LDALLQRLPGCVSRDLIDQLTVEFCY-LNSKANRKKLVRALFNVPRTSLELLPYYSRMVA 619
            +DA LQ+LP CV+RDLID+  ++FC  +N+KANRKKLVRALF VPR  L+LLP+Y+R+VA
Sbjct: 578  VDAFLQQLPNCVNRDLIDKAAMDFCMNMNTKANRKKLVRALFIVPRQRLDLLPFYARLVA 637

Query: 620  TLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSC 679
            TL  CM DV+  L  ML  +F F + KKDQ+NIETK + +RFIGEL KFK+ +      C
Sbjct: 638  TLHPCMSDVAEDLCSMLRGDFRFHVRKKDQINIETKNKTVRFIGELTKFKMFTKNDTLHC 697

Query: 680  LKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTL 739
            LK  L DF+HH+I++AC LLETCGRFL+RSPE+ +R + +LE +MR K   +LD R+ T+
Sbjct: 698  LKMLLSDFSHHHIEMACTLLETCGRFLFRSPESHLRTSVLLEQMMRKKQAMHLDARYVTM 757

Query: 740  VENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPW--SECEQ 799
            VENAYY C PP     V K RPPL +Y+RKLL+ DL K   E VLRQ+RKLPW   E + 
Sbjct: 758  VENAYYYCNPPPAEKTVRKKRPPLQEYVRKLLYKDLSKVTTEKVLRQMRKLPWQDQEVKD 817

Query: 800  YLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQK 859
            Y++ C + +   KY  IH +A+L +GL  Y ++  + VVD VLE+IRLG+EVN     Q+
Sbjct: 818  YVICCMINIWNVKYNSIHCVANLLAGLVLYQEDVGIHVVDGVLEDIRLGMEVNQPKFNQR 877

Query: 860  RIAHMRFLGELYNYELVDSSVVFDTLYLIIVFG-HGTSEQDVLDPPEDTFRIRMIITLLQ 919
            RI+  +FLGELYNY +V+S+V+F TLY    FG +       LDPPE  FRIR++ T+L 
Sbjct: 878  RISSAKFLGELYNYRMVESAVIFRTLYSFTSFGVNPDGSPSSLDPPEHLFRIRLVCTILD 937

Query: 920  TCGHYFDRGSSKRKLDRFFIHFQKYILSKGAL---------PLDIEFDLQDLFAELQPNM 979
            TCG YFDRGSSKRKLD F ++FQ+Y+  K +L         P+DI++ + D    L+P +
Sbjct: 938  TCGQYFDRGSSKRKLDCFLVYFQRYVWWKKSLEVWTKDHPFPIDIDYMISDTLELLRPKI 997

Query: 980  TRYSSIEEINAAFVELEEH---ERSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNG 1039
               +S+EE      +LE     +  + NDK + +   + E                  + 
Sbjct: 998  KLCNSLEESIRQVQDLEREFLIKLGLVNDKESKDSMTEGENLEE--------------DE 1057

Query: 1040 SKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDE 1099
             +E GGA  +    +    +   E EG ++EE   E  E+  D   D + E       +E
Sbjct: 1058 EEEEGGAETEEQSGNESEVNEPEEEEGSEEEEEGEEEEEENTDYLTDSNKE---NETDEE 1117

Query: 1100 DDEVHVR----QKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMMIPMNLFE 1159
            + EV ++    + VP V   E+ +F Q L  +M E++ QR  E      L++ IP++L  
Sbjct: 1118 NAEVMIKGGGLKHVPCV---EDEDFIQALDKMMLENLQQRSGESVKVHQLDVAIPLHL-- 1177

Query: 1160 GSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTK 1219
              ++   G  +GG  G      +   +  +   +L ++GNKQQ K + +P    L  +  
Sbjct: 1178 -KSQLRKGPPLGGGEG------ETESADTMPFVMLTRKGNKQQFKILNVPMSSQLAANHW 1237

Query: 1220 QKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGR 1230
             ++ AE EE+  +K+L L+ N+R+E+E           +Q+   R  P   N        
Sbjct: 1238 NQQQAEQEERMRMKKLTLDINERQEQE------DYQEMLQSLAQRPAPANTNR------- 1252

BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match: Q9HAU5 (Regulator of nonsense transcripts 2 OS=Homo sapiens OX=9606 GN=UPF2 PE=1 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 2.2e-166
Identity = 434/1224 (35.46%), Positives = 662/1224 (54.08%), Query Frame = 0

Query: 79   HEDDGRPGGESQPKRDDEETVA-----RQEEIKKSFEA------KMALRQSNLN--PERP 138
            H+++ R   E Q KR  EE  A     ++E I+   EA      +  LR  N N    RP
Sbjct: 97   HQEEERKKQEEQAKRQQEEEAAAQMKEKEESIQLHQEAWERHHLRKELRSKNQNAPDSRP 156

Query: 139  DSGFLRTLDSSIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRT 198
            +  F   LDSS+++NT  +KKLK I E+QR+ L  +   +N+SK+++EAV++I +AKL+ 
Sbjct: 157  EENFFSRLDSSLKKNTAFVKKLKTITEQQRDSLSHDFNGLNLSKYIAEAVASIVEAKLKI 216

Query: 199  SDIQAAV----------------------QSGDELDADRNLKAMKKRSTLKLLMELFFVG 258
            SD+  AV                      +  +    ++     K R+ L+ + EL  VG
Sbjct: 217  SDVNCAVHLCSLFHQRYADFAPSLLQVWKKHFEARKEEKTPNITKLRTDLRFIAELTIVG 276

Query: 259  VVEDTAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQ-GRILLGLLP------TGQD 318
            +  D    + I + L +I +  DR++  T+++++ SF R  G  + GL+P        + 
Sbjct: 277  IFTDKEGLSLIYEQLKNIIN-ADRESH-THVSVVISFCRHCGDDIAGLVPRKVKSAAEKF 336

Query: 319  HEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDE 378
            +  F  S  I+ ++++ F+     Y+ +  + L+ +H  L+  E++N +IL++KGEL+++
Sbjct: 337  NLSFPPSEIISPEKQQPFQNLLKEYFTSLTKHLKRDHRELQNTERQNRRILHSKGELSED 396

Query: 379  NVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRV-SAGEDVSSPAAGKDSSV 438
                YE+   SY  L  N  SLA+ LD   P +P+D  T      G D+ +P    +  +
Sbjct: 397  RHKQYEEFAMSYQKLLANSQSLADLLDENMPDLPQDKPTPEEHGPGIDIFTPGKPGEYDL 456

Query: 439  IEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEP--------KANEQSAKPAENLAESEA 498
               IW+DED R FYE L DL+AFVPA+L  + E         K + + AK ++   E  +
Sbjct: 457  EGGIWEDEDARNFYENLIDLKAFVPAILFKDNEKSCQNKESNKDDTKEAKESKENKEVSS 516

Query: 499  DQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEK 558
                +  LE +E++ D    EG    +   K+  D+++ +D +   G             
Sbjct: 517  PDDLELELENLEINDDTLELEGGDEAEDLTKKLLDEQEQEDEEASTGSH----------- 576

Query: 559  EKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCY-LNSKANRKKLVRALFNVPRTSLE 618
              LK I    +DA LQ+LP CV+RDLID+  ++FC  +N+KANRKKLVRALF VPR  L+
Sbjct: 577  --LKLI----VDAFLQQLPNCVNRDLIDKAAMDFCMNMNTKANRKKLVRALFIVPRQRLD 636

Query: 619  LLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFK 678
            LLP+Y+R+VATL  CM DV+  L  ML  +F F + KKDQ+NIETK + +RFIGEL KFK
Sbjct: 637  LLPFYARLVATLHPCMSDVAEDLCSMLRGDFRFHVRKKDQINIETKNKTVRFIGELTKFK 696

Query: 679  IASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNV 738
            + +      CLK  L DF+HH+I++AC LLETCGRFL+RSPE+ +R + +LE +MR K  
Sbjct: 697  MFTKNDTLHCLKMLLSDFSHHHIEMACTLLETCGRFLFRSPESHLRTSVLLEQMMRKKQA 756

Query: 739  KNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRK 798
             +LD R+ T+VENAYY C PP     V K RPPL +Y+RKLL+ DL K   E VLRQ+RK
Sbjct: 757  MHLDARYVTMVENAYYYCNPPPAEKTVKKKRPPLQEYVRKLLYKDLSKVTTEKVLRQMRK 816

Query: 799  LPW--SECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGL 858
            LPW   E + Y++ C + +   KY  IH +A+L +GL  Y ++  + VVD VLE+IRLG+
Sbjct: 817  LPWQDQEVKDYVICCMINIWNVKYNSIHCVANLLAGLVLYQEDVGIHVVDGVLEDIRLGM 876

Query: 859  EVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFG-HGTSEQDVLDPPEDTF 918
            EVN     Q+RI+  +FLGELYNY +V+S+V+F TLY    FG +       LDPPE  F
Sbjct: 877  EVNQPKFNQRRISSAKFLGELYNYRMVESAVIFRTLYSFTSFGVNPDGSPSSLDPPEHLF 936

Query: 919  RIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGAL---------PLDIEFDLQ 978
            RIR++ T+L TCG YFDRGSSKRKLD F ++FQ+Y+  K +L         P+DI++ + 
Sbjct: 937  RIRLVCTILDTCGQYFDRGSSKRKLDCFLVYFQRYVWWKKSLEVWTKDHPFPIDIDYMIS 996

Query: 979  DLFAELQPNMTRYSSIEEINAAFVELEEH---ERSVSNDKPNTEKHLDAE--KPSRATSN 1038
            D    L+P +   +S+EE      +LE     +  + NDK + +   + E  +       
Sbjct: 997  DTLELLRPKIKLCNSLEESIRQVQDLEREFLIKLGLVNDKDSKDSMTEGENLEEDEEEEE 1056

Query: 1039 TASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDE 1098
              +     + N S+ N    E+G D+D D        EG ++EE + +   D    +++E
Sbjct: 1057 GGAETEEQSGNESEVNEPEEEEGSDNDDD--------EGEEEEEENTDYLTD--SNKENE 1116

Query: 1099 DDEEAGGPASDEDDEVHVR----QKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRP 1158
             DE        E+ EV ++    + VP V   E+ +F Q L  +M E++ QR  E     
Sbjct: 1117 TDE--------ENTEVMIKGGGLKHVPCV---EDEDFIQALDKMMLENLQQRSGESVKVH 1176

Query: 1159 TLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMY 1218
             L++ IP++L    ++   G  +GG  G      +A  +  +   +L ++GNKQQ K + 
Sbjct: 1177 QLDVAIPLHL---KSQLRKGPPLGGGEG------EAESADTMPFVMLTRKGNKQQFKILN 1236

Query: 1219 IPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVP 1230
            +P    L  +   ++ AE EE+  +K+L L+ N+R+E+E           +Q+   R  P
Sbjct: 1237 VPMSSQLAANHWNQQQAEQEERMRMKKLTLDINERQEQE------DYQEMLQSLAQRPAP 1255
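
The identity and positives percentages in each HSP summary line are the matched-column counts divided by the alignment length (for the Q9HAU5 hit above, 434/1224 and 662/1224). The following is a minimal Python sketch, assuming the summary-line layout used in this report; `parse_hsp_stats` is a hypothetical helper written for illustration and is not part of BLAST or of this database.

```python
import re

# Assumed layout: "Identity = a/n (x%), <second label> = b/n (y%), ..."
# The second label is matched loosely (Pos\w*) so that spelling variants
# of "Positives" in exported reports are still accepted.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w* = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Recompute percent identity and percent positives from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percentages are counts over alignment columns (gaps included).
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 434/1224 (35.46%), Positives = 662/1224 (54.08%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])  # 35.46 54.08
```

Recomputing the percentages from the raw counts, rather than reading the printed values, makes it easy to spot a summary line that was damaged during export.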

BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match: O13824 (Nonsense-mediated mRNA decay protein 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=upf2 PE=1 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 9.2e-64
Identity = 282/1165 (24.21%), Positives = 508/1165 (43.61%), Query Frame = 0

Query: 99   VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRT---LDSSIRRNTTVIKKLK-QINEEQ 158
            ++R+E+IKK     +  R+     +  D     T   LDSS+++NT  +K+ K  +  E 
Sbjct: 1    MSREEQIKK-LNQYLDNRELAFRAKDGDKNIFHTESQLDSSLKKNTAFMKRCKSSLTSEN 60

Query: 159  REGLMDELRNVNMSKFVSEAVSAICDAKLR---TSDIQAAV------------------- 218
             +  + E++ +++ KF+ E  +AI +  ++   T DI ++V                   
Sbjct: 61   YDSFIKEIKTLSLKKFIPEITAAIVEGMMKCKATKDILSSVKIVWALNLRFSTAFTGPML 120

Query: 219  ------------------------QSGDEL-DADRNLKAMKKRSTLKLLMELFFVGVV-- 278
                                    Q+ +E+ + DR+   +K R  L+ L+E +  GVV  
Sbjct: 121  ANLYCALYPNPGYSLCHESYFELKQNENEVSEKDRSSHLLKVRPLLRFLIEFWLNGVVGT 180

Query: 279  -EDTAIF------------------NNIIKDLTSI--EHLRDRDTTLTNLTLLASFARQG 338
             ED   +                   N+ K L  +    L D       L +L S  R  
Sbjct: 181  PEDFVSYLPSTDSNDKKFRKPWFEEQNLKKPLVVLLFNDLMDTRFGFLLLPVLTSLVRTF 240

Query: 339  RILLGLLPTGQDHE--EFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQE 398
               L      +D E  E    L+     + + RK+ ++Y D      Q   +   ++ ++
Sbjct: 241  SCELFTTEDFEDKETLELVNRLNPVV-WRTYLRKSLNSYVDKLEVYCQKRKSLFEELNKQ 300

Query: 399  NAKILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPV-MPEDGHTTRVSAG 458
              +    + + N+E         KS +  + + +SL+E L+ +    + E     + S+G
Sbjct: 301  YQEQSIIRADPNNEKFQRLANFSKSIESEFSSYASLSEVLNRKASEDLLELNFMEKASSG 360

Query: 459  EDVSSPAAGK--DSSVIEA--IWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAK 518
             +    A+G+  +S+ +E   +WDD +   FYE  P+                 NE S  
Sbjct: 361  TNSVFNASGERSESANVETAQVWDDREQYFFYEVFPNF----------------NEGS-- 420

Query: 519  PAENLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEK 578
                +AE                       +  I E  +E      E NK  D  K    
Sbjct: 421  ----IAE----------------------MKSSIYESSQEGIRSSSENNKKEDDLKDSTG 480

Query: 579  DADRKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANRKKLVRAL 638
            D +    + +          +D  L +LP  VS +L +++ +EF  LN+KA+R +L++AL
Sbjct: 481  DLNTTQVSSR----------VDNFLLKLPSMVSLELTNEMALEFYDLNTKASRNRLIKAL 540

Query: 639  FNVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIR 698
              +PRTS  L+PYY R+   LS    + S  L+      F  ++++K +   +T++  +R
Sbjct: 541  CTIPRTSSFLVPYYVRLARILSQLSSEFSTSLVDHARHSFKRMIHRKAKHEYDTRLLIVR 600

Query: 699  FIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANML 758
            +I EL KF++    +VF C K C+++FT  +++V   LLE+CGRFL R PET ++M + L
Sbjct: 601  YISELTKFQLMPFHMVFECYKLCINEFTPFDLEVLALLLESCGRFLLRYPETKLQMQSFL 660

Query: 759  EILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAI 818
            E + + K    L  +   ++ENA +   PP+R   VSK +    +++  L+   L    +
Sbjct: 661  EAIQKKKLASALASQDQLVLENALHFVNPPKRGIIVSKKKSLKEEFLYDLIQIRLKDDNV 720

Query: 819  ENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVL 878
               L  LRK  W +  Q L    M+V   KY  ++ +A L S L ++H EF + V+D+ L
Sbjct: 721  FPTLLLLRKFDWKDDYQILYNTIMEVWNIKYNSLNALARLLSALYKFHPEFCIHVIDDTL 780

Query: 879  EEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGTS-----E 938
            E +   +  +D+  +QKR+A  RF+ EL    ++D   + + L+ ++      S      
Sbjct: 781  ESLFSAVNNSDHVEKQKRLAQARFISELCVIHMLDVRAITNFLFHLLPLEKFESFLTMKA 840

Query: 939  QDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDIEFD 998
              + +   D FR+R+I+ +LQTCG    R  +K+ +  + + +Q Y L +  +PLD+ ++
Sbjct: 841  STLTNINNDMFRLRLIVVVLQTCGPSIIRSKTKKTMLTYLLAYQCYFLIQPEMPLDMLYE 900

Query: 999  LQDLFAELQPNMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRATSNTAS 1058
             +D+   ++P+M  Y   EE   A   L E  +++S+D        D  +P    +N   
Sbjct: 901  FEDVIGYVRPSMKVYMHYEEARNA---LTERLQAISDDWEE-----DDTRPVFQGANDGD 960

Query: 1059 ANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDEDDE 1118
                  ++ ++E+    ED            I  E   DEES      D  D+ED+    
Sbjct: 961  ------ISSNEESVYLPED------------ISDESETDEESSGLEESDLLDSEDE---- 1020

Query: 1119 EAGGPASDEDDEVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMMIP 1178
                   D D+E+ + +++           ++E   +  ES+  R  E    P  ++ +P
Sbjct: 1021 -------DIDNEMQLSREL-----------DEEFERLTNESLLTRMHE--KNPGFDVPLP 1048

BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match: P38798 (Nonsense-mediated mRNA decay protein 2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=NMD2 PE=1 SV=2)

HSP 1 Score: 209.9 bits (533), Expect = 1.6e-52
Identity = 267/1175 (22.72%), Positives = 481/1175 (40.94%), Query Frame = 0

Query: 131  RTLDSSIRRNTTVIKKLKQ-INEEQREGLMDELRNVNMSKFVSEAVSAICDAKL----RT 190
            + LDSSI+RNT  IKKLK+   +     L+ +L   ++ K++SE +  + +  L    + 
Sbjct: 28   KKLDSSIKRNTGFIKKLKKGFVKGSESSLLKDLSEASLEKYLSEIIVTVTECLLNVLNKN 87

Query: 191  SDIQAAVQ--SG---------------------------DELDADRNLKAMKKRSTLKLL 250
             D+ AAV+  SG                            E + D   +  + +  L++ 
Sbjct: 88   DDVIAAVEIISGLHQRFNGRFTSPLLGAFLQAFENPSVDIESERDELQRITRVKGNLRVF 147

Query: 251  MELFFVGV------VEDTAIFNNII------KDLTSIEHLRDRDTTLTNLTLLASFARQG 310
             EL+ VGV      +E      N +      KD      LR+       L    + A   
Sbjct: 148  TELYLVGVFRTLDDIESKDAIPNFLQKKTGRKDPLLFSILREILNYKFKLGFTTTIAT-- 207

Query: 311  RILLGLLPTGQDHEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENA 370
              +    P  +D +  +  L   +  K   +  F  + DA        H  + ++++E+ 
Sbjct: 208  AFIKKFAPLFRDDDNSWDDLIYDSKLKGALQSLFKNFIDATFARATELHKKVNKLQREHQ 267

Query: 371  KILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDV 430
            K     G+L DE V  Y+KL   +     +  +L E   ++ P +         ++ +D+
Sbjct: 268  KCQIRTGKLRDEYVEEYDKLLPIFIRFKTSAITLGEFFKLEIPEL-------EGASNDDL 327

Query: 431  SSPAAGKDSSVI----EAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAKPAE 490
               A+   ++ I    + +W++EDTR FYE LPD+                         
Sbjct: 328  KETASPMITNQILPPNQRLWENEDTRKFYEILPDI------------------------- 387

Query: 491  NLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDAD 550
                                               K  EE    K               
Sbjct: 388  ----------------------------------SKTVEESQSSKT-------------- 447

Query: 551  RKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEF--CYLNSKANRKKLVRALF 610
                   EK  N+   N++     L     +D+ID L+  +   YL++KA R ++++  F
Sbjct: 448  -------EKDSNVNSKNINLFFTDLEMADCKDIIDDLSNRYWSSYLDNKATRNRILK--F 507

Query: 611  NVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRF 670
             +       LP YSR +AT S  M ++    +  L+  F   L+       +  ++NI F
Sbjct: 508  FMETQDWSKLPVYSRFIATNSKYMPEIVSEFINYLDNGFRSQLHSN-----KINVKNIIF 567

Query: 671  IGELCKFKIASAGLVFSCLKACLDDF-THHNIDVACNLLETCGRFLYRSPETTVRMANML 730
              E+ KF++  + ++F  ++  +      +N+++   LLE  G+FL   PE    M  M+
Sbjct: 568  FSEMIKFQLIPSFMIFHKIRTLIMYMQVPNNVEILTVLLEHSGKFLLNKPEYKELMEKMV 627

Query: 731  EILMRLKNVKNLDPRHSTLVENAYYLCKPPE-RSARVS-KVRPPLHQYIRKLLFSDLDKS 790
            +++   KN + L+    + +EN   L  PP  +S  V+ K   P  Q+ R L+ S+L   
Sbjct: 628  QLIKDKKNDRQLNMNMKSALENIITLLYPPSVKSLNVTVKTITPEQQFYRILIRSELSSL 687

Query: 791  AIENVLRQLRKLPWSE--CEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVV 850
              +++++ +RK  W +   ++ L   F K HK  Y  I L+  +  GL  Y  +F +  +
Sbjct: 688  DFKHIVKLVRKAHWDDVAIQKVLFSLFSKPHKISYQNIPLLTKVLGGLYSYRRDFVIRCI 747

Query: 851  DEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGTSEQ 910
            D+VLE I  GLE+NDYG    RI+++R+L E++N+E++ S V+ DT+Y II FGH  ++ 
Sbjct: 748  DQVLENIERGLEINDYGQNMHRISNVRYLTEIFNFEMIKSDVLLDTIYHIIRFGHINNQP 807

Query: 911  DVL-----DPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLD 970
            +       DPP++ FRI+++ T+L          + K KL   F  +  +I  +  LP +
Sbjct: 808  NPFYLNYSDPPDNYFRIQLVTTILLNINRTPAAFTKKCKLLLRFFEYYTFI-KEQPLPKE 867

Query: 971  IEFDLQDLFAELQP--NMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRA 1030
             EF +   F + +     T++   E +  +   LE   +S++  K          K  R 
Sbjct: 868  TEFRVSSTFKKYENIFGNTKFERSENLVESASRLESLLKSLNAIK---------SKDDRV 927

Query: 1031 TSNTASA-NGRDT-------VNGSKENGGAHEDGVD----------SDSDTGSGTIEAEG 1090
              ++AS  NG+++           ++    ++DGVD          S  +T S   + + 
Sbjct: 928  KGSSASIHNGKESAVPIESITEDDEDEDDENDDGVDLLGEDEDAEISTPNTESAPGKHQA 987

Query: 1091 RDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDD------------------------- 1150
            + DE  D ++ +D  D +DD+DD++  G   DEDD                         
Sbjct: 988  KQDESEDEDDEDDDEDDDDDDDDDDDDGEEGDEDDDEDDDDEDDDDEEEEDSDSDLEYGG 1047

Query: 1151 --------------EVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQE--------L 1177
                          E + R+   E + + E   E++ + +MQES+D R+ E        +
Sbjct: 1048 DLDADRDIEMKRMYEEYERKLKDEEERKAEEELERQFQKMMQESIDARKSEKVVASKIPV 1085

BLAST of HG10022266 vs. ExPASy TrEMBL
Match: A0A1S3CJK4 (LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 OS=Cucumis melo OX=3656 GN=LOC103501680 PE=4 SV=1)

HSP 1 Score: 2150.9 bits (5572), Expect = 0.0e+00
Identity = 1136/1195 (95.06%), Positives = 1148/1195 (96.07%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ  
Sbjct: 61   SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQ+EQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQLEQENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKP ENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPTENLAESEAEQGQQTSLEAIEVSTDCPLQ 420

Query: 496  EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
            +GKIN   EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEK KLKNIEGTNLDALLQR
Sbjct: 421  DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKXKLKNIEGTNLDALLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781  LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
            RSVSNDKPNTEKHLDAEKPSRATS T SANGRD VNGSKEN GAHEDG DSDSDTGSGTI
Sbjct: 901  RSVSNDKPNTEKHLDAEKPSRATSTTTSANGRDRVNGSKENSGAHEDGADSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
            EAEGRDDEESDLENHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961  EAEGRDDEESDLENHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
            ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGGS 1080

Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
            KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140

Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LNGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGGSRHPHHRYPGSG+HYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGSRHPHHRYPGSGMHYSRKK 1194

BLAST of HG10022266 vs. ExPASy TrEMBL
Match: A0A0A0KS34 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G497020 PE=4 SV=1)

HSP 1 Score: 2150.6 bits (5571), Expect = 0.0e+00
Identity = 1138/1196 (95.15%), Positives = 1151/1196 (96.24%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRPGGESQPKRDDEE+VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPGGESQPKRDDEESVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTTVIKKLKQINEEQREGLMD+LRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ  
Sbjct: 61   SIKRNTTVIKKLKQINEEQREGLMDDLRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAEQGQQTSLEAIEVSTDCLLQ 420

Query: 496  EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
            +GKIN   EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421  DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKEKLKNIEGTNLDALLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            VSVILLQMLEEEF+FLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541  VSVILLQMLEEEFSFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781  LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
            RSVSNDKPNTEKHLDAEKPSRATSN  SANGRDTVNGSKENGGAHEDG DSDSDTGSGTI
Sbjct: 901  RSVSNDKPNTEKHLDAEKPSRATSNITSANGRDTVNGSKENGGAHEDGADSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLE-NHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFE 1095
            EAEGRDDEESDLE NHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFE
Sbjct: 961  EAEGRDDEESDLENNHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFE 1020

Query: 1096 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGG 1155
            QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGG
Sbjct: 1021 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGG 1080

Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
            SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE
Sbjct: 1081 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1140

Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            ELNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGGSRHPHHRYPGSGVHYSRKK
Sbjct: 1141 ELNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGSRHPHHRYPGSGVHYSRKK 1195

BLAST of HG10022266 vs. ExPASy TrEMBL
Match: A0A6J1FEV2 (regulator of nonsense transcripts UPF2-like OS=Cucurbita moschata OX=3662 GN=LOC111445073 PE=4 SV=1)

HSP 1 Score: 2118.2 bits (5487), Expect = 0.0e+00
Identity = 1114/1192 (93.46%), Positives = 1139/1192 (95.55%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGRP  ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRPVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ  
Sbjct: 61   SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LEA+E+STD  LQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLQ 420

Query: 496  EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
            +GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421  DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480

Query: 556  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
            CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540

Query: 616  ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
            ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541  ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600

Query: 676  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
            NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660

Query: 736  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
            ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720

Query: 796  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
            YG IHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721  YGHIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780

Query: 856  YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
            YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781  YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840

Query: 916  LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
            LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841  LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900

Query: 976  SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
            SN KPN EKHLDA+KPSRATSNT SANGRDT+NGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901  SNVKPNIEKHLDAKKPSRATSNTTSANGRDTMNGSKENGAAHEDGVDSDSDTGSGTIEAE 960

Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
            G DDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961  GHDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020

Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
            AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080

Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
            QVKVLVKRGNKQQTKKMYIPRD  LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140

Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LGSQTMNWMQTG NR  PTRGNNW+ASGGRSGGS HPHHRYPG G+HYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGMHYSRKK 1191

BLAST of HG10022266 vs. ExPASy TrEMBL
Match: A0A6J1JX56 (regulator of nonsense transcripts UPF2-like OS=Cucurbita maxima OX=3661 GN=LOC111489130 PE=4 SV=1)

HSP 1 Score: 2117.4 bits (5485), Expect = 0.0e+00
Identity = 1114/1192 (93.46%), Positives = 1139/1192 (95.55%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGR   ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRSVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ  
Sbjct: 61   SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            +AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181  SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LEA+E+STD  L+
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLE 420

Query: 496  EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
            +GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421  DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480

Query: 556  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
            CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540

Query: 616  ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
            ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541  ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600

Query: 676  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
            NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660

Query: 736  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
            ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720

Query: 796  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
            YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780

Query: 856  YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
            YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRM+ITLLQTCGHYFDRGSSKRK
Sbjct: 781  YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMVITLLQTCGHYFDRGSSKRK 840

Query: 916  LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
            LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841  LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900

Query: 976  SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
            SN KPNTEKHLDA+KPSRATSNT SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901  SNVKPNTEKHLDAKKPSRATSNTTSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAE 960

Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
            G  DEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961  GHGDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020

Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
            AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080

Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
            QVKVLVKRGNKQQTKKMYIPRD  LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140

Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LGSQTMNWMQTG NR  PTRGNNW+ASGGRSGGS HPHHRYPG GVHYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGVHYSRKK 1191

BLAST of HG10022266 vs. ExPASy TrEMBL
Match: A0A6J1BVY0 (regulator of nonsense transcripts UPF2 OS=Momordica charantia OX=3673 GN=LOC111006232 PE=4 SV=1)

HSP 1 Score: 2092.8 bits (5421), Expect = 0.0e+00
Identity = 1109/1195 (92.80%), Positives = 1140/1195 (95.40%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDHHEDDGR GGESQPKRDDEETVARQEEIKKSFEAK+ALRQSNLNPERPDSGFLRTLDS
Sbjct: 1    MDHHEDDGRLGGESQPKRDDEETVARQEEIKKSFEAKIALRQSNLNPERPDSGFLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNTT+IKKLKQINEEQREGLMDELRNVNMSKFVSEAV+AICDAKLR SDIQAAVQ  
Sbjct: 61   SIKRNTTIIKKLKQINEEQREGLMDELRNVNMSKFVSEAVAAICDAKLRASDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      SGDELDADRNLKAMKKRSTLKLL+ELFFVGV+ED
Sbjct: 121  SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVIED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
             AIFNNIIKDLTSIEHLRDRD TLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181  CAIFNNIIKDLTSIEHLRDRDATLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKKFFRK FHTYYDAA+ELLQSEHTSLRQME ENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241  ADQKKFFRKVFHTYYDAASELLQSEHTSLRQMEHENAKILNAKGELNDENVSSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
            FYECLPDLRAFVPAVLLGEAEPKAN+QS KPAEN+AESEADQGQQT+LEA+E+STDCSLQ
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKANDQSTKPAENMAESEADQGQQTSLEAVEISTDCSLQ 420

Query: 496  EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
            +GKINEKGK+KEEKDKEK+KDTDKEKGKEKDADRK+ENEKEKLKN+EGTNLDALLQRLPG
Sbjct: 421  DGKINEKGKDKEEKDKEKSKDTDKEKGKEKDADRKMENEKEKLKNVEGTNLDALLQRLPG 480

Query: 556  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
            CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV
Sbjct: 481  CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 540

Query: 616  ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
            ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKI+SAGLVFSCLK+CLDDFTHH
Sbjct: 541  ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKISSAGLVFSCLKSCLDDFTHH 600

Query: 676  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
            NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601  NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660

Query: 736  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
            ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661  ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720

Query: 796  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
            YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721  YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780

Query: 856  YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
            YELV+SSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781  YELVESSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840

Query: 916  LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
            LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841  LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900

Query: 976  -SNDKPNTEKHLDAEK-PSRATSNTASANGRDTVNGSKENG-GAHEDGVDSDSDTGSGTI 1035
             S+DKPNTEKHLDAEK PSR TSNT SANGRDTVNGS+ENG  AHED  DSDSDTGSGTI
Sbjct: 901  SSSDKPNTEKHLDAEKTPSRTTSNTTSANGRDTVNGSRENGAAAHEDVADSDSDTGSGTI 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
            EAEGRDDEESDLENHEDG D+EDDEDDEE GGPASDEDDEVHVRQKV EVDPREEANFEQ
Sbjct: 961  EAEGRDDEESDLENHEDG-DSEDDEDDEEGGGPASDEDDEVHVRQKVAEVDPREEANFEQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
            ELRAVMQESMDQRRQE+RGRPTLNMMIPMNLFEG TRDHHGRGVGGESGDE LDEDAGG+
Sbjct: 1021 ELRAVMQESMDQRRQEIRGRPTLNMMIPMNLFEG-TRDHHGRGVGGESGDEALDEDAGGT 1080

Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
            KEVQVKVLVKRGNKQQTKKM+IPRDC LLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMFIPRDCALLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140

Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            LNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGG RHPHHRY G GVHYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGPRHPHHRYIGGGVHYSRKK 1192

BLAST of HG10022266 vs. TAIR 10
Match: AT2G39260.1 (binding;RNA binding )

HSP 1 Score: 1538.5 bits (3982), Expect = 0.0e+00
Identity = 831/1196 (69.48%), Positives = 992/1196 (82.94%), Query Frame = 0

Query: 76   MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
            MDH ED+         K+DDEE +AR EEIKKS EAK+ LRQ+NLNPERPDS +LRTLDS
Sbjct: 1    MDHPEDESH-----SEKQDDEEALARLEEIKKSIEAKLTLRQNNLNPERPDSAYLRTLDS 60

Query: 136  SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
            SI+RNT VIKKLKQINEEQREGLMD+LR VN+SKFVSEAV+AIC+AKL++SDIQAAVQ  
Sbjct: 61   SIKRNTAVIKKLKQINEEQREGLMDDLRGVNLSKFVSEAVTAICEAKLKSSDIQAAVQIC 120

Query: 196  --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
                                      S ++L+AD+N KAMKKRSTLKLL+EL++VGV+ED
Sbjct: 121  SLLHQRYKEFSASLTQGLLKVFFPGKSAEDLEADKNSKAMKKRSTLKLLLELYYVGVIED 180

Query: 256  TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
            + IF NIIKDLTS+E L+DRDTT TNLTLL SFARQGRI LGL  +GQD E+FFK L +T
Sbjct: 181  SNIFINIIKDLTSVEQLKDRDTTQTNLTLLTSFARQGRIFLGLPISGQD-EDFFKGLDVT 240

Query: 316  ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
            ADQKK F+KAF+TYYDA A+LLQSEH  L QME+ENAK++NAKGEL++++ SSYEKLRKS
Sbjct: 241  ADQKKSFKKAFNTYYDALADLLQSEHKLLLQMEKENAKLVNAKGELSEDSASSYEKLRKS 300

Query: 376  YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
            YDHLYRN+SSLAEALDMQPPVMPEDG TTR++AG++ S     KD+SV E IWDDEDT+ 
Sbjct: 301  YDHLYRNISSLAEALDMQPPVMPEDG-TTRLTAGDEASPSGTVKDTSVPEPIWDDEDTKT 360

Query: 436  FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAD--QGQQTALEAIEVSTDCS 495
            FYECLPDLRAFVPAVLLGEAEPK+NEQSAK  E L+ES ++  + QQT  +  EVS D +
Sbjct: 361  FYECLPDLRAFVPAVLLGEAEPKSNEQSAKAKEKLSESSSEVVENQQTTEDTTEVSADSA 420

Query: 496  LQEGKIN-EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
              + + N E+ KEKEE +KEK KDT KEKGKEKD+++K+E+EKEK K+++  N + LLQR
Sbjct: 421  SMDDRSNAEQPKEKEEVEKEKAKDTKKEKGKEKDSEKKMEHEKEKGKSLDVANFERLLQR 480

Query: 556  LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
            LPGCVSRDLIDQLTVE+CYLNSK NRKKLV+ALFNVPRTSLELL YYSRMVATL++CMKD
Sbjct: 481  LPGCVSRDLIDQLTVEYCYLNSKTNRKKLVKALFNVPRTSLELLAYYSRMVATLASCMKD 540

Query: 616  VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
            +  +L+QMLE+EFN L++KKDQMNIETKIRNIRFIGELCKFKI  AGLVFSCLKACLD+F
Sbjct: 541  IPSMLVQMLEDEFNSLVHKKDQMNIETKIRNIRFIGELCKFKIVPAGLVFSCLKACLDEF 600

Query: 676  THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
            THHNIDVACNLLETCGRFLYRSPETT+RM NML+ILMRLKNVKNLDPR STLVENAYYLC
Sbjct: 601  THHNIDVACNLLETCGRFLYRSPETTLRMTNMLDILMRLKNVKNLDPRQSTLVENAYYLC 660

Query: 736  KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
            KPPERSAR+SKVRPPLHQY+RKLLFSDLDK +I NVL+QLRKLPWSECEQY+LKCFMKVH
Sbjct: 661  KPPERSARISKVRPPLHQYVRKLLFSDLDKDSIANVLKQLRKLPWSECEQYILKCFMKVH 720

Query: 796  KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
            KGKYGQIHLIASLTSGLSR+HDEF VAVVDEVLEEIR+GLE+N+YG QQKR+AHMRFLGE
Sbjct: 721  KGKYGQIHLIASLTSGLSRHHDEFVVAVVDEVLEEIRVGLELNEYGAQQKRLAHMRFLGE 780

Query: 856  LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
            LYNYE VDSSV+F+TLYL +++GH TSEQ+VLDPPED FR+RM+I LL+TCGHYFDRGSS
Sbjct: 781  LYNYEHVDSSVIFETLYLTLLYGHDTSEQEVLDPPEDFFRVRMVIILLETCGHYFDRGSS 840

Query: 916  KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
            K++LD+F IHFQ+YILSKG LPLDIEFDLQDLFA L+PNMTRYS+I+E+NAA ++LEE E
Sbjct: 841  KKRLDQFLIHFQRYILSKGHLPLDIEFDLQDLFANLRPNMTRYSTIDEVNAAILQLEERE 900

Query: 976  RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
             + S DK + E+H D +  ++++S+  S+NG+ T    +ENG AH  G +SDSD+GSG++
Sbjct: 901  HASSGDKVSIERHSDTKPSNKSSSDVISSNGKSTAKDIRENGEAH--GEESDSDSGSGSV 960

Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
              +G+ +EE D  NHE G ++ D +D ++  GP SD DD+  VRQKV  VD  E+A+F+Q
Sbjct: 961  VRDGQ-NEELDDGNHERGSESGDGDDYDDGDGPGSD-DDKFRVRQKVVTVDLEEQADFDQ 1020

Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG-VGGESGDEGLDEDAGG 1155
            EL+A++QESM+QR+ ELRGRP LNM IPM++FEGS +DHH  G V GE+G+E LDE+ G 
Sbjct: 1021 ELKALLQESMEQRKLELRGRPALNMTIPMSVFEGSGKDHHHFGRVVGENGEEVLDEENGE 1080

Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
             +EVQVKVLVKRGNKQQT++M IP DC L+QSTKQKEAAELEEKQDIKRL+LEYN+R+EE
Sbjct: 1081 QREVQVKVLVKRGNKQQTRQMLIPSDCALVQSTKQKEAAELEEKQDIKRLVLEYNERDEE 1140

Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
            E NGLG+Q +NW  +GG+RG    G       G+SGGSRH  + + G G  Y  ++
Sbjct: 1141 EANGLGTQILNW-TSGGSRGSTRTGE----GSGKSGGSRHRFYYHQGGGGSYHARR 1180

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038890797.1	0.0e+00	95.31	regulator of nonsense transcripts UPF2 [Benincasa hispida]
XP_008463566.1	0.0e+00	95.06	PREDICTED: LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 [Cucumis ...
XP_004143811.1	0.0e+00	95.15	regulator of nonsense transcripts UPF2 [Cucumis sativus] >KGN51237.1 hypothetica...
KAG6578740.1	0.0e+00	91.78	Regulator of nonsense transcripts UPF2, partial [Cucurbita argyrosperma subsp. s...
XP_023550316.1	0.0e+00	93.62	regulator of nonsense transcripts UPF2-like [Cucurbita pepo subsp. pepo]

Match Name	E-value	Identity	Description
F4IUX6	0.0e+00	69.48	Regulator of nonsense transcripts UPF2 OS=Arabidopsis thaliana OX=3702 GN=UPF2 P...
A2AT37	9.8e-167	35.48	Regulator of nonsense transcripts 2 OS=Mus musculus OX=10090 GN=Upf2 PE=1 SV=1
Q9HAU5	2.2e-166	35.46	Regulator of nonsense transcripts 2 OS=Homo sapiens OX=9606 GN=UPF2 PE=1 SV=1
O13824	9.2e-64	24.21	Nonsense-mediated mRNA decay protein 2 OS=Schizosaccharomyces pombe (strain 972 ...
P38798	1.6e-52	22.72	Nonsense-mediated mRNA decay protein 2 OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name	E-value	Identity	Description
A0A1S3CJK4	0.0e+00	95.06	LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 OS=Cucumis melo OX=3...
A0A0A0KS34	0.0e+00	95.15	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G497020 PE=4 SV=1
A0A6J1FEV2	0.0e+00	93.46	regulator of nonsense transcripts UPF2-like OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JX56	0.0e+00	93.46	regulator of nonsense transcripts UPF2-like OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A6J1BVY0	0.0e+00	92.80	regulator of nonsense transcripts UPF2 OS=Momordica charantia OX=3673 GN=LOC1110...

Match Name	E-value	Identity	Description
AT2G39260.1	0.0e+00	69.48	binding;RNA binding
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 590..610
None	No IPR available	COILS	Coil	Coil	coord: 496..516
None	No IPR available	COILS	Coil	Coil	coord: 316..336
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..41
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 964..981
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 429..451
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1097..1123
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 68..102
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 74..102
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1187..1239
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 466..508
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1040..1063
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 941..963
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 365..390
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1008..1039
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1220..1239
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 15..41
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 469..508
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 941..1063
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1187..1216
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1099..1123
None	No IPR available	PANTHER	PTHR12839:SF8	SUBFAMILY NOT NAMED	coord: 193..1237; coord: 76..193
IPR003890	MIF4G-like, type 3	SMART	SM00543	if4_15	coord: 722..923, e-value: 4.1E-45, score: 165.9; coord: 519..707, e-value: 1.4E-48, score: 177.4
IPR003890	MIF4G-like, type 3	PFAM	PF02854	MIF4G	coord: 724..918, e-value: 5.8E-44, score: 150.3; coord: 522..706, e-value: 3.4E-31, score: 108.5
IPR007193	Up-frameshift suppressor 2, C-terminal	PFAM	PF04050	Upf2	coord: 1040..1176, e-value: 3.8E-34, score: 118.3
IPR016021	MIF4G-like domain superfamily	GENE3D	1.25.40.180		coord: 89..196, e-value: 1.0E-29, score: 105.7; coord: 197..370, e-value: 4.7E-39, score: 136.4
IPR016021	MIF4G-like domain superfamily	GENE3D	1.25.40.180		coord: 717..948, e-value: 1.1E-92, score: 311.9
IPR016021	MIF4G-like domain superfamily	GENE3D	1.25.40.180		coord: 476..715, e-value: 2.1E-39, score: 137.2
IPR039762	Nonsense-mediated mRNA decay protein Nmd2/UPF2	PANTHER	PTHR12839	NONSENSE-MEDIATED MRNA DECAY PROTEIN 2 UP-FRAMESHIFT SUPPRESSOR 2	coord: 193..1237
IPR039762	Nonsense-mediated mRNA decay protein Nmd2/UPF2	PANTHER	PTHR12839	NONSENSE-MEDIATED MRNA DECAY PROTEIN 2 UP-FRAMESHIFT SUPPRESSOR 2	coord: 76..193
IPR016024	Armadillo-type fold	SUPERFAMILY	48371	ARM repeat	coord: 516..710
IPR016024	Armadillo-type fold	SUPERFAMILY	48371	ARM repeat	coord: 717..941
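
Each `coord:` entry in the Alignment column gives the start and end of a match on the protein. A minimal Python sketch for turning such an entry into a residue span, assuming the coordinates are 1-based and inclusive (an assumption about the annotation output, not stated on this page); the helper name `coord_span` is hypothetical:

```python
import re

def coord_span(text: str) -> tuple:
    """Return (start, end, length) for a 'coord: START..END' entry."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range found in: {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    # Assumed 1-based inclusive coordinates, so both endpoints count.
    return start, end, end - start + 1

# Upf2 C-terminal Pfam match (PF04050) from the table above.
print(coord_span("coord: 1040..1176"))
```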

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10022266.1	HG10022266.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0000184	nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
cellular_component	GO:0005737	cytoplasm
cellular_component	GO:0035145	exon-exon junction complex
cellular_component	GO:0005844	polysome
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003723	RNA binding
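
For downstream use, the GO assignments can be grouped by ontology category. A minimal Python sketch with the six terms copied from the table above; the names `GO_TERMS` and `group_by_category` are hypothetical and chosen only for illustration:

```python
from collections import defaultdict

# GO annotations copied from this page: (category, accession, term name).
GO_TERMS = [
    ("biological_process", "GO:0000184",
     "nuclear-transcribed mRNA catabolic process, nonsense-mediated decay"),
    ("cellular_component", "GO:0005737", "cytoplasm"),
    ("cellular_component", "GO:0035145", "exon-exon junction complex"),
    ("cellular_component", "GO:0005844", "polysome"),
    ("molecular_function", "GO:0005515", "protein binding"),
    ("molecular_function", "GO:0003723", "RNA binding"),
]

def group_by_category(terms):
    """Map each GO category to the list of accessions assigned to the gene."""
    grouped = defaultdict(list)
    for category, accession, _name in terms:
        grouped[category].append(accession)
    return dict(grouped)

for category, accessions in group_by_category(GO_TERMS).items():
    print(category, accessions)
```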