Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTCCAATTCCGGCGGATTCCTCTTCCACTTCCCACATCGGTGAAATCTCTCCTCTCCTTTGCACGGGTTTTCATCAATCAGACATCAACTTGAAAATCCTGAATATAAAGAAATCCCTGAATTTGGAGCTTTTAAATTTTCCCCTCGAAAGCATCTGACACTTCCTTGCAATTCTGTTTACACTGCTCAAATCTCAAATCTGGGCTTTGTTTCGAAGCTGGAGGGAATATGTGGTTTATTTTGATTTTGCTTTGTGCGACTGAAAGTTCAATTCTCAGAGAATGAAATTCACAAGAGTAGCTTCGGACATTTCCAATCCCACAGGCTAGAGAATTATCCTGATTCGGTTGAAAAGAAGTGCAATTCAATGGCGATGCGCAGAGATTGCTTTCTGGCATTGTTTTTTTGTTATGCAGTTGTTTCGGAGGTAACTAATGCCTCGCTTTGGGACTCAAGAAAGTTGTTAGACCCGGCTTCAAAAGATAACTCCAAGAGAACGTCGCCAGTAATCCTCTACTTTCTTGAAAAAAAAGAAAAAGAATTAAAGAAAGAATCCTATATCTCTGTACTATTTTATGAACTTTCCGTTTTCGCCTTTTAGGTTTCCCCACTCATTAGTCCGGTTACTTCTGTGAATAACTCGGATTCTGTACCCGTTGACGGAAAGAAGAAGCAAATTGAACCACCCCCGCCGGCGCCCAACAACTCATCGTCAAATGTGGATTCGAAAGTACTCAATAATTCGAGTTCCGGTAGTTCGCCTAATGTTAGCGAGACTGGTAACGGTAAGCCTTTTGACACAAAGCAGGAGAACGGGACTGAAGCCAGTCCTCAATCAAGTAATCGAGAAACTTGTGATGGTGTACCCGACAACAAGAAGTGCAAGGACCAGAAGAAGTTGGTTGCTTGTATCCAAAGTAATCTTATTGGTAAACACGAGTTGCTCTTCGGCCTTGTTAGCTTTCTAACTTGATTTTTACCATGCTTTTCCGTTTTCTTCTGGGGTGGATTCTGATATCATTTTCTGCTGCGTTAGGATTTTATACCTCTAGAATCCTGCTACCTCTAGAACTGTCGCTTATTAAATTTGTCCCGATACGCTGTATGGTTGCTTTTGTTTGTTTATATATATCCCTTTTGTTTCTTATAAAAAAGAATAATAATAAATCTGTCCCGATAGTTAATTGGTATTTTTGTGTATCTGCTTGATCCGATAATTTTTTAAAATTATTATTGGAACCGGTCAAGTTCCATCTTCTGCGGCCTAGAAGCCAGTTGAGTATTTCTGGAAGTTGCATGCACTCTCCAAGGTAGGAAACCAAACCTTTATTATGGAATTGAAGTTTGACATATGATCTCCAATGCGAAGCTTCAAATAGTTAACCTGCGCTAGGAGTATGTGTATGTCAGTAATGGGTTTTCCCCCATAATTGTATGTGGTGAAATTCTCGTCACTTCAAAGAGTATTGCATTATAATCTCTATAAATTTTAAGTTTTGGTCTTTAGGATCGTTTGGTTGAGTTTCACTATACATATAGTCTATAGATATAGGTTTATTAAACGGTATATGGTAGGTTAATTTTTTGTGTTCCTTTACACTCTTTTGCGGATTTCTAAGCCTGTTATTGCTAATGGTTGTTTCAACATGGGGAATGTCTTAGAGCTCTCTGTTCCTTGCATGATAAAGGGCAACCTTTTATTAGTTCATCTTTGAGTCTAAAAGCTAGTGTAACCAGAATCTTATTTATATTTATATCTAGTCTAATCTCAATATGAGGATCATTTCGTCTAAAACATCTTCATTTTGTGTATTTCTTAAATGTATATGGAATACTAAATGTTAATAGTTTGTTCTACATTGGCGTATACAATGATTTAGACTATTATATCATATTTTCTTCATCAAATATATCCAAGCTATCATATGGGTCTGTCTACTTAAGGTGTAGCCCAGGAGTGGTTCCTAGTCCCTAGTTTTGAATGAAATGATAATAGATATGATTTTCTTGGTGTCTTACATTTTATAACAACATCATATTTTTCTATTTTATCAATTTTCCATTTTACATATTAGTATTAAATTGCTGAAAAAACCTTCAGGTTTTAGACTGCCCTATTACTTGGTTAGCAAGGGGTGATTGTACTGGCATGTGTTATCACATCCTTTACCATGATCGGGAGTTAGATTAGCTTTTGTGGCATAGATGACCTTTTAGAAAAGGTCTCTGAGAATGAGGTATATATTGATTTCTTCTTGCTTTCATTTTCTGCCATCGTTTAATTTTGATTGATTTTTTCATACAAAAGGTTAGAAAATGTTTACCAAGTTTTCTATTTGTTTCAAAATTGGTTGTATGAAGGATGAAAATTTGAATTAACATGACTGCATTCATGCTGTAAAAAGTATACGATGTGATGCATAACATGGAAAAATACTATTGTTGGTTGATAATTAGTCTTGCACTTTTATGTTGGCAATTTGAGCAAGTTCTAATTTCTGATGTTTCTCTATTATGTCCTAGAATTTGTTCTGGGTTTACATGGTTTTGAGTTATTCAAATCCTGTGGGTTGGCGCCAGCCATCTCTAAGTAGGGGAGTTGTGGTAGTTTAATGGATGAAAGTATTGGTTTTTGAGAGCATTGATATTTTTGAATTAGCTCTTTAGTCAAGAAAAAGAAACATATTGAGCATTCACCCCTCTGGATTTTCTCTCCGTAATCATCCCCCATTTTCCTCCCCCCAACCATCCCTCCCTTTACCGACTTTCTTTCTCCCTTCCTTCCAGTTGATCAAGGCAATTATGTTATGAAAATATTTATCATCGGAATAAAAACTCTTAAAACCTTCTTGTTTTAACTAAATATTGTACATCCCAAATAATATTATAAGATGTGTTGTGCATAAATCATAGTGAGATGTGTTGACCTTGTGTAAATAGTCAACAAGTATTATAACTTATAAGTGCATAGTTTGTGTGTCATGCATCCCTAAGCCTGAATTAACTTAAGATATCCGTGCAAAGGTAATAGTACGATATTCCCTTGGCATTGAGATGTATTAAACCTCCCTAAACTAACGAATCACTTTTACTTTGTTTAGGGCTCCGTATGGGGTTTGCAAGGATTTTAAGAGACTCATGAGGAAATTTTTATGGAAGGGCATCGATGAGGGGAGAGGGGAGAATCTTGTTAGTTGAGAGGTGGTGGAGAAACCTGTGGGTCTTGGGGTCTAGAACCAGGCAATTTAAGGGTCCACAATAGAGCCTTATTAGCGAAATGGTTGTGGTATTTCCCCCTCGAGTCTACATCTTTATGGCACCGTGTAATTGTGAGTAAATATGGTACTCACCCCATGAGTGGTTGTCGGCTGGGGCCAAGGGGATGTCCAGGAACCCTTGGATAGACATTGCTTATGAGCTCCCTAGTGATCATTTTCCGGTGGGGGATGGTAGAGGGATTTATTTTTGGGATGACAAGTGGGTGGGGGACAGTCCCCTTTGCTCTAAATTTTCTAATCTATATCGCTTGTCCTCAATGAAAAATCGTTTGATTATCGATGTGTTCAACCCTATTGGGAGCTCCATGTCTGTTTCGTTGGGTTTCCATCGTTCTTTGACCAATAGGGAAACAGTGGATGTCACTAATCTCCTGTCTTTGATTGGGGATTTCCAGTTTAGGGCGGGAAGAAAGGATGTTTGGTGTTGGAGTCTTATCCCCTCCGAGGGTTTTTCTTGTGGTTCCATATTTTGTTGGTTGCTCAACCCTTCTCCCCCTAGTGAGTTTGTGTTTTCTATTCTGTGGAGGGTGAAAATTCCAAAGAAGATTAGGTTCTTTATTTGGCAAGCTATCCACAGAAGAGTCAATACGATTGACAAACTCTCAAGAAAGATGACTTCGTTGGTTGGGCCGTTTTGTTGCATCTTTTGTTGGAAGGTGGAGGAAGATCTGGACCATATTCTGTGGATGTACGAGTTTGCCCATTTGGTGTGGGCCTGTGGGGCTTGCTCTATGAGGCATTTGGGATTTAGGTGAGGAGTTTTATAGATTACAAGGAGATGATCCAGGAGTTCCTCTTCCATCCACCATTTTGAGACAGGGGAGGTTTTTATGGTTAGCAGGAGTGTGAGTTGTAATGTGGGGTCTTTGAGGGGAGAAAAATAATAGAGTGTTTCGTGGCCTTGAGAGAGACCCGTCTAACGTTTGGTCCCTCACTAGGTTCTATGTTTCTCTCTCGACTTCCGTGTCCAGGTCTTTTTGTAATTACTTTCTAGGCACTATTATTCTTGATTGGCGACCCTTCCTTTTTAAGGGTTTTTTTTTTGTGGGATCGGTTTTTTTAGATGTCCTTGTATTCTTTCCTTTTTTCTCGCTTTTTCATTTTAAAAAAATTTGTAAAGCTTTTAACCAAATGTAAGAGGAGAGCTATAGTGTGGAATTGATAGGCATGTTTCCAAACTACAGAAAACTTGTTCAGAACTCTGTACACTGATGTCGTGTAAGAATGGCTTGATAAGATATATATAATTATATACAGTCATTTATTTATTCATTGAAAGCTTGATTAGATCATGTTGCACTATTCAGATTGATGGTCTCAATGCTTTACTTCATACTCTAGTAGTATAGTGTTCTTAAAGTGAAATTGCAATAATCCTGTTTTGTGATATTTTTCACTTTTATCAATGAAATATACAAAAGGGAGGGAAGGAAAGCCAAACCCTCCCATTAGCCCAAATATCACTAATTACGAAGGGATTTACTAAACCTACTCCAAGAAGTTGTAACAAAAGAGGAGATTGTTCATACGAGATAAAAAAATTTCTATTTTTTCCCCAAAAATATCTTAGATTTCTTTCCAATCAAGTGCACCAAAGGATATTTCTAATAAATTTGTGATAGTTTCTCAATCACCGTGGATGTTAGAGGGAATGATTCAAGTCTTGGTGACCACATGTCTAGGATTGAATATCCTGCAAGTTTCCTTGATAACCAAATGTAGTCGGATTTGACGGTTGTCCTATGAAAATAACCAAGGTGTATGCAAGTTGGTCTAGACGACACTAATGGATAATAAAAAACTATATATATTTGTGACATTTAGTCTTGTCTTTGAGTGTATTGATGAAAAGAGAGAAGAGATGCTGACCTCCTTGAAATCAAAAGCATAAGACACCCAAAAGAGGAAAGCATGTCCCAAGCTGTAGGAAAACATGCTCCAAAATACTGTCAAAGATTTCCCAAATGTGAACTTCCACCTTTTTGGGGTAGTTATTCTTCCAAATAATGACATAATTGGCGCTCAGGAGCTTTATGGAGGAATTAAGATCTTGGAGGAAATATTTGCAACTGAAACGCCCTGAGGATTCAAAAGACAAATTCCAAGAGTCCACATTACCATCAAGGGAGAGATTTCTATGCTGCCTATTGACGAAAGCCTGTCTTCGAATTTTTCCTCCTTTTGATTTCTCCTAAAAGATGAACTCCAAAAAAAGCCACTGGTTTCATTTATCCGAGGAGACAGAGATTGAAAAAGAGTATTTCTTTTTAACAAAGGTCAAAGGTCTAAGGCAAATACCTCAGATTATGGCCAGAGAAGATATTGCCAGCCATCTATTTTCCCAAGAAAGTGTATTATGGCCGTTTCCACCTCGATACTTCTAGTGGAGATCAAAGGTCTAAGAAGGAAAATATCTTTCCAAAGCCTTTTAGATTGGCTTGATCATTAGACCGCCAATCTGAGGATGAGGATCCCACCTTGGCCTTAATCACTTTCCTCCAAAGAACTGATTTCTTATGAGCTAATCACCAGGAGTCCTGGACCATTAGTCAAGAGGGGCAGGCTCTATTCTTCTACTAAACGTCCCTTATACACAATCTGCCTGGAGAATCTAACATAAGACTTTCTTCCAACTTACAAGGTGATAAGCTGCACTTTTCGAATGACATTCTCAATAGAAACTTACCATGATTCTCTCAATAGCTTCAGAAACTTTCTTCTGCATTTTAGAAAGACATAAAATAAGAGGTAAGGCTTGAGAGAGAAGCTTGGATCAAGGTGAGCCAGCCATCTTCAGAGTGAAGAGATTTCTCCCATTGGATCCTTCTTGAGAATAGGGTCCCAGAAGGAAGCTTCTTTTTGGATCATCATTTAAGGGAAGACCAAGGTAGTTACTTGGCCATTAATCCAAAGAAAAATAAAAATGTGGCCTTTTTTGTGCACCAAGGGATCTTCACAATTTACCCCCAACATAGTAGTCATTCCAAGGTTGAAGTTCAATTCTGAGGCTGCTTCAGAAACCTTAATAACTTTCAAAAGATTTTAGAGCTTTGAATCCAGAATGGATGAGAATAGGATGGTATTAAACGAATTGAAGATGGTTGGCCTTTAAGGAGTATAACTTACTAAAATAATATATATCATGAGTGTGAATACAAAGGGAGATAGGGGATTTGTCTTCCCTATGAGCTCATTTTCAATAAAGCAAGAGATAATCATTTTCAATTATTGCTCATTCATGGAATAAGGAGGAATACTTTTAGCCTCTCCGCTAACACTTTGGTGATAATTTTGTATATACTTGCAAGGCTAATGGTCTGAAGTCTTTCTGATTTTTTGTTTCGATTTTCTTAGGAATGAGGCACAGGTATGTTTCGTTAACATTTGTAATTCCATTCTGGAAATGTCCCAGGATTCAGGAATTCTTTATTATGTTAAGCATGCTCCATGTTTGAATTTTTAGCCTCTGCACTTTCTAAATAACCGAATTATTTGGTCCATCTGAATTAAAGTTTAATTATAATTTAAAAACCCGACTGCGCCCATTGTCCTCTTCAAATGTTTACTTTGTGTAGTTCTCTAGCCCAAACTCAATATTTGACTTGAATTTTACATGGAACGAAGAGGAAAGGGAAGTAAATACTTGCAATTGTGTAGAGTTTTATAGAAAAGAAGAAATGTACTTCACATGTTTTTTTTTTTGTTAATCATCAAATGGTAACATAGGTAAAAGAGACAAATTGGTCACTTAAGAACGTAAAGAAATTTTTTACTTTTTTTTTTTTGGGTGTGTGGCCCTGTGTCAGTTCACTCTCTTTTTTCATCAACATCTAAGAAAAATTTTGTCAATCCCCCAGCAGTCCCTTTTTGGACAACATCAATGGAAAATTGCAGTTAGTTCACTCTCTTATGTGATTAGTAAGTGACTGAAGCAGACAAATCCTTTCAGGTGCTAGTATCCCTTTGTTTTGGTTTATGAATTTTATTCTTACATGATTCTATCGAAGGAAATGCTATTTCTGCATACAAATCCTGTGCTCCAAATTATTGCCAATTTCTGTATGAAACCAGGGACTGTCTCGTTAACCCTTAGGTTTTGATTGAGTGATTGACTATTGTTACCTCACGTATATGTTATGAGGGTTTCTTTAATGCATGTTCACTTGGTAGAACCCAATGACTTTTACCGCCGTGTAACTTTTGGGGTTCAGTCGATGAGTTTCTTAAACTGATAGACATTGCTTAATTTCTGAAATATTTAATATTAATACGTGATGAGAATAACTTTATATTTACATCTTTTTCCTTGTTGAAGAGTCCAGAGATTTGGCTGTTCTAGTCCTGAACAAAGGAGAAGATGCATTAGAAGTGAATGTTACGGGAGGAAACTTTTTCAAGGGGCTTAAGATTTTAAAACATCGTGCAAAAAGGGTAAGGGATTGTGATTTCAACCTACTTTTTCCCAAATGGAAGCCATGTGTATTTTTGGTTCTTATGTTATACTACCATTGTTTCTTTATTCGAAATAAGTTTCTCTTCATTTTGGGGAATGCTGTAACTTTTTAAAAATTTCTTTACACAGTTAATGTAATCAATTAGACTTTCAGCCATCAGCTGCAGTTTTAAGAATTTTTGTTTTTCCACTTCTCAAGTGAATGTTTACACCTCTTACCGTTCAAAGCAAAATCTCTACATTCTAATGGTTTAGTTTATATATCAAATTGTGTTAGTTCTTTTTTGTGGCCGGGTTGAATTCGTTGATATTATTGTGCATATGTCCTGTGAAAATATTAGGACAACTTCCATTATGTTTTAGTTCCAGTTATTTTTCCATTCATTGCAGTCGGTCCTTTAAAGATCTTACTCTTGTTTGTTTGACCAGCTTGAATCTGCCAGGATCTTTTTCCCTTTTTCATAAAAAATTCAAAGACCTTATTCTTGTTTGTCTGTTACTAATAACTCACTGTTGCATCCTAACATTTTTGGGTTACTGGTTACATGTTTTGAAGTCATTAGTAATCCGCAAAATTTAATTTCCGTAACTATCTGCTGTGAGTGAGCACTTGGAAATAGAACTAGAGTTTTCTGTTCTCTTATTTTTCAGATCACCATTCCACTGCCGACCGAAAAAGGCATGAAATTAATATTAAATGCTGGAAATGGAGAATGCGTGCTTTACACCAGCCCTCTTGTTTCTGGAGAAGACCACCTTCTGCATCTCCCTTCATTTGACCAGCTAGCAACACCAGTAAACGCTGCTTACTTCTTAATTCTTGCAGTCCTAATCTTTGGGGGGTCCTTGGCTTGCTGCAAGTTTAGGAGGAGACAAGCGGGTGGTATCCCATATCAAGAGCTTGAAATGGCACTGCCAGAATCTTCCTCAGCTGTCAACGTTGAGACTGCTGAAGGTTGGGATCAGGGTTGGGATGATGACTGGGATGAAGAAAATGCTGTGAAATCACCAGGGGGGCGCCATATTGGAAGCATTTCAGCTAATGGCCTTACTGCTAGACCCAACAAAGATGGGTGGGAGAATGATTGGGATGTCTAGTTAATGAAGCAGCCAGAGAAGACAACAAAACTTCTGGTCTCTCGATAATTTGATCAGTGGTAGGTGACTGACTGGAAGGCAGAAAGGACAACGTGAAGCACATAACATTACCACTGAGGTTATTCAATAGGTGAATGTAAGCCTTCCTCCCACTTCTGATTAGATAGAGTTTGTAGCTGTCGCATATTCGGTCAGAATCCATAAAAGCCGGTCCATATAAATTTTTTGGGTTTGTTTTGGTGGGATGTAGAATGCATGTTTTTTTTATGTTTTTTTTTATTTGAATTTTTACTTTAAGAATACAGTTCCAATTTTTTTACATGATAGCTTTACAATATATAAATGTTCATGCGAGCTTTGAAATTATGATGGTGTCTGCCATTGCAAAACCTTGCAAAGAGAACCGTAAATAATACAATACACATGCTCATGTGCGTGCGTATCTTTTTGAGAATGTGGAACAGGTTCTCTCTCTCTCTAATCTGTGAAATAATTTTGAACAGACACTCTTCTGAATTACATTTTTTTGTATTGATGCCTGTGAGAAGCATCACCTCTCATGT
mRNA sequence
ATGGCGATGCGCAGAGATTGCTTTCTGGCATTGTTTTTTTGTTATGCAGTTGTTTCGGAGGTAACTAATGCCTCGCTTTGGGACTCAAGAAAGTTGTTAGACCCGGCTTCAAAAGATAACTCCAAGAGAACGTCGCCAGTTTCCCCACTCATTAGTCCGGTTACTTCTGTGAATAACTCGGATTCTGTACCCGTTGACGGAAAGAAGAAGCAAATTGAACCACCCCCGCCGGCGCCCAACAACTCATCGTCAAATGTGGATTCGAAAGTACTCAATAATTCGAGTTCCGGTAGTTCGCCTAATGTTAGCGAGACTGGTAACGGTAAGCCTTTTGACACAAAGCAGGAGAACGGGACTGAAGCCAGTCCTCAATCAAGTAATCGAGAAACTTGTGATGGTGTACCCGACAACAAGAAGTGCAAGGACCAGAAGAAGTTGGTTGCTTGTATCCAAAGTAATCTTATTGGATTTTATACCTCTAGAATCCTGCTACCTCTAGAACTGTCGCTTATTAAATTTGTCCCGATACGCTGTATGAATTTGTTCTGGGTTTACATGGTTTTGAGTTATTCAAATCCTGTGGGTTGGCGCCAGCCATCTCTAAGTAGGGGAGTTGTGTGGTTGTCGGCTGGGGCCAAGGGGATGTCCAGGAACCCTTGGATAGACATTGCTTATGAGCTCCCTAGTGATCATTTTCCGGTGGGGGATGGTAGAGGGATTTATTTTTGGGATGACAAGTGGGTGGGGGACAGTCCCCTTTGCTCTAAATTTTCTAATCTATATCGCTTGTCCTCAATGAAAAATCGTTTGATTATCGATGTGTTCAACCCTATTGGGAGCTCCATGTCTGTTTCGTTGGGTTTCCATCGTTCTTTGACCAATAGGGAAACAGTGGATGTCACTAATCTCCTGTCTTTGATTGGGGATTTCCAGTTTAGGGCGGGAAGAAAGGATGTTTGGTGTTGGAGTCTTATCCCCTCCGAGGGTTTTTCTTGTGGTTCCATATTTTGTTGGTTGCTCAACCCTTCTCCCCCTAGTGAGTTTGTGTTTTCTATTCTGTGGAGGGTGAAAATTCCAAAGAAGATTAGGTTCTTTATTTGGCAAGCTATCCACAGAAGAGTCAATACGATTGACAAACTCTCAAGAAAGATGACTTCGTTGGTTGGGCCGTTTTGTTGCATCTTTTGTTGGAAGGTGGAGGAAGATCTGGACCATATTCTGTGGATGTACGAGTTTGCCCATTTGGTGTGGGCCTGTGGGGCTTGCTCTATGAGGCATTTGGGATTTAGGAATGAGGCACAGATCACCATTCCACTGCCGACCGAAAAAGGCATGAAATTAATATTAAATGCTGGAAATGGAGAATGCGTGCTTTACACCAGCCCTCTTGTTTCTGGAGAAGACCACCTTCTGCATCTCCCTTCATTTGACCAGCTAGCAACACCAGTAAACGCTGCTTACTTCTTAATTCTTGCAGTCCTAATCTTTGGGGGGTCCTTGGCTTGCTGCAAGTTTAGGAGGAGACAAGCGGGTGGTATCCCATATCAAGAGCTTGAAATGGCACTGCCAGAATCTTCCTCAGCTGTCAACGTTGAGACTGCTGAAGGTTGGGATCAGGGTTGGGATGATGACTGGGATGAAGAAAATGCTGTGAAATCACCAGGGGGGCGCCATATTGGAAGCATTTCAGCTAATGGCCTTACTGCTAGACCCAACAAAGATGGGTGGGAGAATGATTGGGATGTCTAG
Coding sequence (CDS)
ATGGCGATGCGCAGAGATTGCTTTCTGGCATTGTTTTTTTGTTATGCAGTTGTTTCGGAGGTAACTAATGCCTCGCTTTGGGACTCAAGAAAGTTGTTAGACCCGGCTTCAAAAGATAACTCCAAGAGAACGTCGCCAGTTTCCCCACTCATTAGTCCGGTTACTTCTGTGAATAACTCGGATTCTGTACCCGTTGACGGAAAGAAGAAGCAAATTGAACCACCCCCGCCGGCGCCCAACAACTCATCGTCAAATGTGGATTCGAAAGTACTCAATAATTCGAGTTCCGGTAGTTCGCCTAATGTTAGCGAGACTGGTAACGGTAAGCCTTTTGACACAAAGCAGGAGAACGGGACTGAAGCCAGTCCTCAATCAAGTAATCGAGAAACTTGTGATGGTGTACCCGACAACAAGAAGTGCAAGGACCAGAAGAAGTTGGTTGCTTGTATCCAAAGTAATCTTATTGGATTTTATACCTCTAGAATCCTGCTACCTCTAGAACTGTCGCTTATTAAATTTGTCCCGATACGCTGTATGAATTTGTTCTGGGTTTACATGGTTTTGAGTTATTCAAATCCTGTGGGTTGGCGCCAGCCATCTCTAAGTAGGGGAGTTGTGTGGTTGTCGGCTGGGGCCAAGGGGATGTCCAGGAACCCTTGGATAGACATTGCTTATGAGCTCCCTAGTGATCATTTTCCGGTGGGGGATGGTAGAGGGATTTATTTTTGGGATGACAAGTGGGTGGGGGACAGTCCCCTTTGCTCTAAATTTTCTAATCTATATCGCTTGTCCTCAATGAAAAATCGTTTGATTATCGATGTGTTCAACCCTATTGGGAGCTCCATGTCTGTTTCGTTGGGTTTCCATCGTTCTTTGACCAATAGGGAAACAGTGGATGTCACTAATCTCCTGTCTTTGATTGGGGATTTCCAGTTTAGGGCGGGAAGAAAGGATGTTTGGTGTTGGAGTCTTATCCCCTCCGAGGGTTTTTCTTGTGGTTCCATATTTTGTTGGTTGCTCAACCCTTCTCCCCCTAGTGAGTTTGTGTTTTCTATTCTGTGGAGGGTGAAAATTCCAAAGAAGATTAGGTTCTTTATTTGGCAAGCTATCCACAGAAGAGTCAATACGATTGACAAACTCTCAAGAAAGATGACTTCGTTGGTTGGGCCGTTTTGTTGCATCTTTTGTTGGAAGGTGGAGGAAGATCTGGACCATATTCTGTGGATGTACGAGTTTGCCCATTTGGTGTGGGCCTGTGGGGCTTGCTCTATGAGGCATTTGGGATTTAGGAATGAGGCACAGATCACCATTCCACTGCCGACCGAAAAAGGCATGAAATTAATATTAAATGCTGGAAATGGAGAATGCGTGCTTTACACCAGCCCTCTTGTTTCTGGAGAAGACCACCTTCTGCATCTCCCTTCATTTGACCAGCTAGCAACACCAGTAAACGCTGCTTACTTCTTAATTCTTGCAGTCCTAATCTTTGGGGGGTCCTTGGCTTGCTGCAAGTTTAGGAGGAGACAAGCGGGTGGTATCCCATATCAAGAGCTTGAAATGGCACTGCCAGAATCTTCCTCAGCTGTCAACGTTGAGACTGCTGAAGGTTGGGATCAGGGTTGGGATGATGACTGGGATGAAGAAAATGCTGTGAAATCACCAGGGGGGCGCCATATTGGAAGCATTTCAGCTAATGGCCTTACTGCTAGACCCAACAAAGATGGGTGGGAGAATGATTGGGATGTCTAG
Protein sequence
MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNSDSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTEASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMNLFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGIYFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDVTNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPKKIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACGACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLATPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQGWDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV
Homology
BLAST of Spg033181 vs. NCBI nr
Match:
XP_023528429.1 (uncharacterized protein LOC111791362 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 424.9 bits (1091), Expect = 1.1e-114
Identity = 272/582 (46.74%), Postives = 301/582 (51.72%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMRRDCFLALF C+AVVSEVTNASLWDSRKLL+PAS +NS RTS SP+ISP++SVN S
Sbjct: 1 MAMRRDCFLALFLCHAVVSEVTNASLWDSRKLLEPASNNNSTRTSQGSPVISPISSVNIS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PV GKKKQ+EPPPPAPNNSSS+++S VLNNSSSGSS VSETG+GKPFDTK+ENGTE
Sbjct: 61 DSAPVIGKKKQVEPPPPAPNNSSSSLESNVLNNSSSGSSSTVSETGSGKPFDTKKENGTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
AS SS ETCDGVPDNKKC+DQKKL+ACIQSNLI
Sbjct: 121 ASALSSKHETCDGVPDNKKCRDQKKLIACIQSNLI------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
+ RG
Sbjct: 181 -------------------------------------------------------ESRG- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
L + V N ++ V++
Sbjct: 241 -----------------------------LAVLVLNEGEDTLEVNV-------------- 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
+ G+ F L KI K
Sbjct: 301 ------------------------------TGGNFFKGL-----------------KILK 342
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 342
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
++ +ITIPL TE MKLIL+AGNGECVLY SPLVSGE+ LHLPSFDQLA
Sbjct: 421 ---------HHKERITIPLTTETSMKLILDAGNGECVLYRSPLVSGENLFLHLPSFDQLA 342
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFL LAVLIFGGS ACCKFRRRQAGGIPYQELEMALPESSSAVN+ETA+GWDQG
Sbjct: 481 TPVNGAYFLFLAVLIFGGSWACCKFRRRQAGGIPYQELEMALPESSSAVNIETADGWDQG 342
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTAR NKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARSNKDGWENDWDV 342
BLAST of Spg033181 vs. NCBI nr
Match:
XP_022971211.1 (uncharacterized protein LOC111470002 [Cucurbita maxima])
HSP 1 Score: 419.9 bits (1078), Expect = 3.7e-113
Identity = 273/582 (46.91%), Postives = 295/582 (50.69%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMR DCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNS TSPV PLISP +S+N+S
Sbjct: 1 MAMRGDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSTGTSPVLPLISPNSSMNHS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PVDGKK++ EPPPPAPNNSSSN+DSK LNNSSSGSS NVSE GNGKP DT + N TE
Sbjct: 61 DSAPVDGKKRR-EPPPPAPNNSSSNLDSKGLNNSSSGSSHNVSEAGNGKPLDTNKRNRTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
QSSNRE CDGVPDNKKC+DQKKLVACI+SN+I
Sbjct: 121 GGLQSSNREICDGVPDNKKCRDQKKLVACIESNII------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
S V+ L+ G
Sbjct: 181 ------------------ESKDLAVLVLNKG----------------------------- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
+T+DV
Sbjct: 241 ------------------------------------------------------EDTLDV 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
++ G F+ +KI K
Sbjct: 301 ----NVTGGNFFKG-----------------------------------------LKILK 341
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 341
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
+ +ITIPL T K MKL+LNAGNGECVLYTSPLV+GED LHLPSFDQL
Sbjct: 421 ---------HHSKRITIPLTTLKDMKLVLNAGNGECVLYTSPLVAGEDLFLHLPSFDQLV 341
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS AC K RRRQAGGIPYQELEMALPESSSAV VETA GWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACFKLRRRQAGGIPYQELEMALPESSSAVKVETAVGWDQG 341
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 341
BLAST of Spg033181 vs. NCBI nr
Match:
KAG7017185.1 (hypothetical protein SDJN02_19047 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 419.1 bits (1076), Expect = 6.3e-113
Identity = 273/582 (46.91%), Postives = 299/582 (51.37%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMRRDCFLALF C+AVVSEVTNASLWDSRKLL+ AS +NS RTS SP+ISP++SVN S
Sbjct: 1 MAMRRDCFLALFLCHAVVSEVTNASLWDSRKLLESASNNNSTRTSQGSPVISPISSVNIS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PV GKKKQ+EPPPP+PNNSSS+++S VLNNSSSGSS VSETG+GK DTK+ENGTE
Sbjct: 61 DSAPVIGKKKQVEPPPPSPNNSSSSLESDVLNNSSSGSSSTVSETGSGKTIDTKKENGTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
AS SS ETCDGVPD+KKC+DQKKL+ACIQSNLI
Sbjct: 121 ASALSSKHETCDGVPDDKKCRDQKKLIACIQSNLIE------------------------ 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
SRG+ L
Sbjct: 181 ---------------------SRGLAVLVLNE---------------------------- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
G+ PL V+V
Sbjct: 241 --------GEDPL-------------------------------------------EVNV 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
T G+ F L KI K
Sbjct: 301 TG------------------------------GNFFKGL-----------------KILK 342
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 342
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
++ +ITIPL TE MKLIL+AGNGECVLY SPLVSGE+ LHLPSFDQLA
Sbjct: 421 ---------HHKERITIPLTTETSMKLILDAGNGECVLYRSPLVSGENLFLHLPSFDQLA 342
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS ACCKFRRRQAGGIPYQELEMALPESSSAVNVETA+GWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACCKFRRRQAGGIPYQELEMALPESSSAVNVETADGWDQG 342
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTAR NKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARSNKDGWENDWDV 342
BLAST of Spg033181 vs. NCBI nr
Match:
XP_022941631.1 (uncharacterized protein LOC111446930 [Cucurbita moschata])
HSP 1 Score: 418.3 bits (1074), Expect = 1.1e-112
Identity = 274/582 (47.08%), Postives = 292/582 (50.17%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMR DCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNS TSPV PLISP +S N+S
Sbjct: 1 MAMRGDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSTGTSPVLPLISPNSSTNHS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PVDG KKQ EPPPPAPNNSSSN+DSK LNNSSSGSS NVSE GNGKP DT N TE
Sbjct: 61 DSAPVDG-KKQREPPPPAPNNSSSNLDSKGLNNSSSGSSHNVSEAGNGKPLDTNNGNRTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
QSSN E CDGVPDNKKC+DQKKLVACIQ+N+I
Sbjct: 121 GGLQSSNHEICDGVPDNKKCRDQKKLVACIQNNII------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
S V+ L+ G
Sbjct: 181 ------------------ESKDLAVLVLNKG----------------------------- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
+T+DV
Sbjct: 241 ------------------------------------------------------EDTLDV 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
++ G F+ +KI K
Sbjct: 301 ----NVTGGNFFKG-----------------------------------------LKILK 341
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 341
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
+ +ITIPL T K MKL+L+AGNGECVLYTSPLVSGED LHLPSFDQL
Sbjct: 421 ---------HHSKRITIPLTTLKDMKLVLDAGNGECVLYTSPLVSGEDLFLHLPSFDQLV 341
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS AC K RRRQAGGIPYQELEMALPESSSAV VETAEGWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACFKLRRRQAGGIPYQELEMALPESSSAVKVETAEGWDQG 341
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 341
BLAST of Spg033181 vs. NCBI nr
Match:
KAG6580427.1 (hypothetical protein SDJN03_20429, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 417.9 bits (1073), Expect = 1.4e-112
Identity = 270/582 (46.39%), Postives = 299/582 (51.37%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMRRDCFLALF C+AVVSEVTNASLWDSRKLL+PAS +NS RTS SP+ISP++SVN S
Sbjct: 27 MAMRRDCFLALFLCHAVVSEVTNASLWDSRKLLEPASNNNSTRTSQGSPVISPISSVNIS 86
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PV GKKKQ+EPPPP+PNNSSS+++S VLNNSSS SS VSETG+GKP DTK+ENGTE
Sbjct: 87 DSAPVIGKKKQVEPPPPSPNNSSSSLESNVLNNSSSRSSSTVSETGSGKPIDTKKENGTE 146
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
AS SS ETCDGVPDNKKC+DQKKL+ACIQSNLI
Sbjct: 147 ASALSSKHETCDGVPDNKKCRDQKKLIACIQSNLI------------------------- 206
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
+ RG
Sbjct: 207 -------------------------------------------------------ESRG- 266
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
L + V N ++ V++
Sbjct: 267 -----------------------------LAVLVLNEGEDTLEVNV-------------- 326
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
+ G+ F L KI K
Sbjct: 327 ------------------------------TGGNFFKGL-----------------KILK 368
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 387 ------------------------------------------------------------ 368
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
++ +ITIPL TE KLIL+AGNGECVLY SPLVSGE+ LHLPSFDQLA
Sbjct: 447 ---------HHKERITIPLTTETSYKLILDAGNGECVLYRSPLVSGENLFLHLPSFDQLA 368
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS ACCKFRRRQAGGIPYQELEMALPESSSAVNVETA+GWDQG
Sbjct: 507 TPVNGAYFLILAVLIFGGSWACCKFRRRQAGGIPYQELEMALPESSSAVNVETADGWDQG 368
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTAR NKDGWENDWDV
Sbjct: 567 WDDDWDEENAVKSPGGRHIGSISANGLTARSNKDGWENDWDV 368
BLAST of Spg033181 vs. ExPASy Swiss-Prot
Match:
P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)
HSP 1 Score: 56.2 bits (134), Expect = 1.4e-06
Identity = 44/176 (25.00%), Postives = 73/176 (41.48%), Query Frame = 0
Query: 235 GDGRGIYFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTN 294
GDG+ I FW D+WV PL + N R + + D++ P G+ + +
Sbjct: 178 GDGQQIRFWTDRWVSGKPLL-ELDNGERPTDCDTVVAKDLWIP-------GRGWDFAKID 237
Query: 295 RETVDVTNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLL---NPSPPSEFVFS 354
T + T L G +D W FS S + L P P F+
Sbjct: 238 PYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNMASFFN 297
Query: 355 ILWRVKIPKKIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHIL 408
LW+V++P++++ F+W ++ V T ++ R+ S C C E + H+L
Sbjct: 298 CLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASN--VCQVCKGGVESMLHVL 343
BLAST of Spg033181 vs. ExPASy TrEMBL
Match:
A0A6J1I2Q4 (uncharacterized protein LOC111470002 OS=Cucurbita maxima OX=3661 GN=LOC111470002 PE=4 SV=1)
HSP 1 Score: 419.9 bits (1078), Expect = 1.8e-113
Identity = 273/582 (46.91%), Postives = 295/582 (50.69%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMR DCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNS TSPV PLISP +S+N+S
Sbjct: 1 MAMRGDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSTGTSPVLPLISPNSSMNHS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PVDGKK++ EPPPPAPNNSSSN+DSK LNNSSSGSS NVSE GNGKP DT + N TE
Sbjct: 61 DSAPVDGKKRR-EPPPPAPNNSSSNLDSKGLNNSSSGSSHNVSEAGNGKPLDTNKRNRTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
QSSNRE CDGVPDNKKC+DQKKLVACI+SN+I
Sbjct: 121 GGLQSSNREICDGVPDNKKCRDQKKLVACIESNII------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
S V+ L+ G
Sbjct: 181 ------------------ESKDLAVLVLNKG----------------------------- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
+T+DV
Sbjct: 241 ------------------------------------------------------EDTLDV 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
++ G F+ +KI K
Sbjct: 301 ----NVTGGNFFKG-----------------------------------------LKILK 341
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 341
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
+ +ITIPL T K MKL+LNAGNGECVLYTSPLV+GED LHLPSFDQL
Sbjct: 421 ---------HHSKRITIPLTTLKDMKLVLNAGNGECVLYTSPLVAGEDLFLHLPSFDQLV 341
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS AC K RRRQAGGIPYQELEMALPESSSAV VETA GWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACFKLRRRQAGGIPYQELEMALPESSSAVKVETAVGWDQG 341
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 341
BLAST of Spg033181 vs. ExPASy TrEMBL
Match:
A0A6J1FLM5 (uncharacterized protein LOC111446930 OS=Cucurbita moschata OX=3662 GN=LOC111446930 PE=4 SV=1)
HSP 1 Score: 418.3 bits (1074), Expect = 5.2e-113
Identity = 274/582 (47.08%), Postives = 292/582 (50.17%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMR DCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNS TSPV PLISP +S N+S
Sbjct: 1 MAMRGDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSTGTSPVLPLISPNSSTNHS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PVDG KKQ EPPPPAPNNSSSN+DSK LNNSSSGSS NVSE GNGKP DT N TE
Sbjct: 61 DSAPVDG-KKQREPPPPAPNNSSSNLDSKGLNNSSSGSSHNVSEAGNGKPLDTNNGNRTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
QSSN E CDGVPDNKKC+DQKKLVACIQ+N+I
Sbjct: 121 GGLQSSNHEICDGVPDNKKCRDQKKLVACIQNNII------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
S V+ L+ G
Sbjct: 181 ------------------ESKDLAVLVLNKG----------------------------- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
+T+DV
Sbjct: 241 ------------------------------------------------------EDTLDV 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
++ G F+ +KI K
Sbjct: 301 ----NVTGGNFFKG-----------------------------------------LKILK 341
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 341
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
+ +ITIPL T K MKL+L+AGNGECVLYTSPLVSGED LHLPSFDQL
Sbjct: 421 ---------HHSKRITIPLTTLKDMKLVLDAGNGECVLYTSPLVSGEDLFLHLPSFDQLV 341
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS AC K RRRQAGGIPYQELEMALPESSSAV VETAEGWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACFKLRRRQAGGIPYQELEMALPESSSAVKVETAEGWDQG 341
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 341
BLAST of Spg033181 vs. ExPASy TrEMBL
Match:
A0A6J1F4I5 (uncharacterized protein LOC111442092 OS=Cucurbita moschata OX=3662 GN=LOC111442092 PE=4 SV=1)
HSP 1 Score: 416.0 bits (1068), Expect = 2.6e-112
Identity = 269/582 (46.22%), Postives = 298/582 (51.20%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMRRDCFLALF C+AVVSEVTNASLWDSRKLL+PAS +NS RTS SP+ISP++SVN S
Sbjct: 1 MAMRRDCFLALFLCHAVVSEVTNASLWDSRKLLEPASNNNSTRTSQGSPVISPISSVNIS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PV GKKKQ+EPPPP+PNNSSS+++S VLNNSSS SS VSETG+GKP DTK+ENGTE
Sbjct: 61 DSAPVIGKKKQVEPPPPSPNNSSSSLESNVLNNSSSRSSSTVSETGSGKPIDTKKENGTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
AS SS ETCDGVPDNKKC+DQ KL+ACIQSNLI
Sbjct: 121 ASALSSKHETCDGVPDNKKCRDQNKLIACIQSNLI------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
+ RG
Sbjct: 181 -------------------------------------------------------ESRG- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
L + V N ++ V++
Sbjct: 241 -----------------------------LAVLVLNEGEDTLEVNV-------------- 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
+ G+ F L KI K
Sbjct: 301 ------------------------------TGGNFFKGL-----------------KILK 342
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 342
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
++ +ITIPL TE KLIL+AGNGECVLY SPLVSGE+ LHLPSFDQLA
Sbjct: 421 ---------HHKERITIPLTTETSYKLILDAGNGECVLYRSPLVSGENLFLHLPSFDQLA 342
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVLIFGGS ACCKFRRRQAGGIPYQELEMALPESSSAVNVETA+GWDQG
Sbjct: 481 TPVNGAYFLILAVLIFGGSWACCKFRRRQAGGIPYQELEMALPESSSAVNVETADGWDQG 342
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTAR NKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARSNKDGWENDWDV 342
BLAST of Spg033181 vs. ExPASy TrEMBL
Match:
A0A6J1J516 (uncharacterized protein LOC111481426 OS=Cucurbita maxima OX=3661 GN=LOC111481426 PE=4 SV=1)
HSP 1 Score: 407.1 bits (1045), Expect = 1.2e-109
Identity = 269/582 (46.22%), Postives = 296/582 (50.86%), Query Frame = 0
Query: 1 MAMRRDCFLALFFCYAVVSEVTNASLWDSRKLLDPASKDNSKRTSPVSPLISPVTSVNNS 60
MAMRRDCFLALF CYAVVSEVTNASLWDSRKLL+PAS +NS RTS SPLISP++SVN S
Sbjct: 1 MAMRRDCFLALFLCYAVVSEVTNASLWDSRKLLEPASNNNSTRTSQGSPLISPISSVNIS 60
Query: 61 DSVPVDGKKKQIEPPPPAPNNSSSNVDSKVLNNSSSGSSPNVSETGNGKPFDTKQENGTE 120
DS PV GKKKQ+EPPPPAPNNSSS+++S V NNSSSGSS VSETG+GKPFDTK+E GTE
Sbjct: 61 DSAPVIGKKKQVEPPPPAPNNSSSSLESNV-NNSSSGSSSTVSETGSGKPFDTKKEKGTE 120
Query: 121 ASPQSSNRETCDGVPDNKKCKDQKKLVACIQSNLIGFYTSRILLPLELSLIKFVPIRCMN 180
AS SS ETCDGVPDN KC+DQKKL+ACIQSNLI
Sbjct: 121 ASALSSKHETCDGVPDNNKCRDQKKLIACIQSNLI------------------------- 180
Query: 181 LFWVYMVLSYSNPVGWRQPSLSRGVVWLSAGAKGMSRNPWIDIAYELPSDHFPVGDGRGI 240
+ RG
Sbjct: 181 -------------------------------------------------------ESRG- 240
Query: 241 YFWDDKWVGDSPLCSKFSNLYRLSSMKNRLIIDVFNPIGSSMSVSLGFHRSLTNRETVDV 300
L + V N ++ V++
Sbjct: 241 -----------------------------LAVLVLNEGEDTLEVNV-------------- 300
Query: 301 TNLLSLIGDFQFRAGRKDVWCWSLIPSEGFSCGSIFCWLLNPSPPSEFVFSILWRVKIPK 360
+ G+ F L KI K
Sbjct: 301 ------------------------------TGGNFFKGL-----------------KILK 340
Query: 361 KIRFFIWQAIHRRVNTIDKLSRKMTSLVGPFCCIFCWKVEEDLDHILWMYEFAHLVWACG 420
Sbjct: 361 ------------------------------------------------------------ 340
Query: 421 ACSMRHLGFRNEAQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLA 480
++ +ITIPL TE KL+L+AGNGECVLY SPLVS E+ LHLPSFDQLA
Sbjct: 421 ---------HHKERITIPLTTETSTKLVLDAGNGECVLYRSPLVSRENLFLHLPSFDQLA 340
Query: 481 TPVNAAYFLILAVLIFGGSLACCKFRRRQAGGIPYQELEMALPESSSAVNVETAEGWDQG 540
TPVN AYFLILAVL FGGS ACCKFRRRQAGGIPYQELEMALPESSSAVNVETA+GWDQG
Sbjct: 481 TPVNGAYFLILAVL-FGGSWACCKFRRRQAGGIPYQELEMALPESSSAVNVETADGWDQG 340
Query: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARPNKDGWENDWDV 583
WDDDWDEENAVKSPGGRHIGSISANGLTAR NKDGWENDWDV
Sbjct: 541 WDDDWDEENAVKSPGGRHIGSISANGLTARSNKDGWENDWDV 340
BLAST of Spg033181 vs. ExPASy TrEMBL
Match:
A0A2N9I2C9 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46035 PE=4 SV=1)
HSP 1 Score: 228.0 bits (580), Expect = 1.0e-55
Identity = 105/151 (69.54%), Postives = 125/151 (82.78%), Query Frame = 0
Query: 433 AQITIPLPTEKGMKLILNAGNGECVLYTSPLVSGEDHLLHLPSFDQLATPVNAAYFLILA 492
++I + L EKG KLILNAGNGEC L P GE + +H PS+D+L TPVN AYFLI
Sbjct: 111 SRINVSLTNEKGSKLILNAGNGECALLVDP--PGEGNFMHFPSYDKLVTPVNGAYFLIFT 170
Query: 493 VLIFGGSLACCKFRRR-QAGGIPYQELEMALPESSSAVNVETAEGWDQGWDDDWDEENAV 552
VL+FGG+ ACCKFR++ GG+PYQELEM++PES SA+NVE+AEGWDQGWDDDWDEENAV
Sbjct: 171 VLVFGGTWACCKFRKKSHHGGVPYQELEMSMPESVSAINVESAEGWDQGWDDDWDEENAV 230
Query: 553 KSPGGRHIGSISANGLTAR-PNKDGWENDWD 582
KSPGGRH+GSISANGLT+R N+DGWENDW+
Sbjct: 231 KSPGGRHVGSISANGLTSRSSNRDGWENDWN 259
BLAST of Spg033181 vs. TAIR 10
Match:
AT3G51580.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; Has 1768 Blast hits to 1607 proteins in 294 species: Archae - 2; Bacteria - 552; Metazoa - 381; Fungi - 236; Plants - 306; Viruses - 38; Other Eukaryotes - 253 (source: NCBI BLink). )
HSP 1 Score: 147.5 bits (371), Expect = 3.3e-35
Identity = 78/145 (53.79%), Postives = 101/145 (69.66%), Query Frame = 0
Query: 446 KLILNAGNGECVLYTSPLVSGEDHL-LHLPSFDQLATPVNAAYFLILAVLIFGG--SLAC 505
K+IL+ G G+C L+ P S E L H PS+++L TP+N AYFLI++V+IFGG +
Sbjct: 248 KIILDTGKGQCALHMYP--SEESTLPFHFPSYEKLVTPINGAYFLIVSVIIFGGIWAFCL 307
Query: 506 CKFRRRQAGGIPYQELEMA----LPESSSAVNVETAEGWDQGWDDDWDEENAVKSPGGRH 565
C+ RR G+PY+ELE++ L S +VETA+ WD+GWDDDWDE NAVKSPG
Sbjct: 308 CRKNRRAGSGVPYRELELSGGPGLENESGVHDVETAD-WDEGWDDDWDENNAVKSPGSAA 367
Query: 566 IG-SISANGLTAR-PNKDGWENDWD 582
SISANGLTAR PN+DGW++DWD
Sbjct: 368 KSVSISANGLTARAPNRDGWDHDWD 389
BLAST of Spg033181 vs. TAIR 10
Match:
AT3G51580.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages. )
HSP 1 Score: 147.5 bits (371), Expect = 3.3e-35
Identity = 78/145 (53.79%), Postives = 101/145 (69.66%), Query Frame = 0
Query: 446 KLILNAGNGECVLYTSPLVSGEDHL-LHLPSFDQLATPVNAAYFLILAVLIFGG--SLAC 505
K+IL+ G G+C L+ P S E L H PS+++L TP+N AYFLI++V+IFGG +
Sbjct: 268 KIILDTGKGQCALHMYP--SEESTLPFHFPSYEKLVTPINGAYFLIVSVIIFGGIWAFCL 327
Query: 506 CKFRRRQAGGIPYQELEMA----LPESSSAVNVETAEGWDQGWDDDWDEENAVKSPGGRH 565
C+ RR G+PY+ELE++ L S +VETA+ WD+GWDDDWDE NAVKSPG
Sbjct: 328 CRKNRRAGSGVPYRELELSGGPGLENESGVHDVETAD-WDEGWDDDWDENNAVKSPGSAA 387
Query: 566 IG-SISANGLTAR-PNKDGWENDWD 582
SISANGLTAR PN+DGW++DWD
Sbjct: 388 KSVSISANGLTARAPNRDGWDHDWD 409
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_023528429.1 | 1.1e-114 | 46.74 | uncharacterized protein LOC111791362 [Cucurbita pepo subsp. pepo] | [more] |
XP_022971211.1 | 3.7e-113 | 46.91 | uncharacterized protein LOC111470002 [Cucurbita maxima] | [more] |
KAG7017185.1 | 6.3e-113 | 46.91 | hypothetical protein SDJN02_19047 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022941631.1 | 1.1e-112 | 47.08 | uncharacterized protein LOC111446930 [Cucurbita moschata] | [more] |
KAG6580427.1 | 1.4e-112 | 46.39 | hypothetical protein SDJN03_20429, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
P0C2F6 | 1.4e-06 | 25.00 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1I2Q4 | 1.8e-113 | 46.91 | uncharacterized protein LOC111470002 OS=Cucurbita maxima OX=3661 GN=LOC111470002... | [more] |
A0A6J1FLM5 | 5.2e-113 | 47.08 | uncharacterized protein LOC111446930 OS=Cucurbita moschata OX=3662 GN=LOC1114469... | [more] |
A0A6J1F4I5 | 2.6e-112 | 46.22 | uncharacterized protein LOC111442092 OS=Cucurbita moschata OX=3662 GN=LOC1114420... | [more] |
A0A6J1J516 | 1.2e-109 | 46.22 | uncharacterized protein LOC111481426 OS=Cucurbita maxima OX=3661 GN=LOC111481426... | [more] |
A0A2N9I2C9 | 1.0e-55 | 69.54 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46035 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G51580.1 | 3.3e-35 | 53.79 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT3G51580.2 | 3.3e-35 | 53.79 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |