Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGACACGTTCATCAACATTCCAACTCTTATGGATCAAATCGAACGCGGCCTGGTAGTCATGCAGCCGGAGGAGGAATGGTGGTCATTTCGAGGCCTTAAAGTTCCCAGAAACCTGGCCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTTCCTTCCTTGTGAAAGAAGTATCAGAGATTTTTAGAGATGCTTCTTCAGCCTTTATGGATAGAGGGAAATATGTTAATTCATGGAGAAGAGATTTTTATAAGAGAGGAAGTGGCTCTCAATTTGTTCTACTGGACCAAAGTACTGTTATAATGGTCAAAGGCGAGAGCGATCAGTTGGTGTGCAGTTATCTTCTAGGAAAGAGTTTTATGTGGGAGCTGGGTTTACAACTTCTAGAATATCTCATGGAAAAGGTATTACCAAACCAAAGTCTAATGATTATTCTCAGCTAAGTATGCAGAGACCTAACCTTTCTGGGAGTGGTGATCAAAATAATATGAGCCAAAAGTTTGACTCAGAATTTCAGGATAGTTTCGAGAATTTTGGTGATCCAGGATGGAGGTAAGAGGGTGGTCACAACAACATCTATTTTCCTTACCCTGAACGATTAAATCCAATTTCTGAGACTGATGGGTCCTATTATAGTGGAAGATCACGTTATTCCCAGAAGCAACATCGAGTTCTTCCTCCTCCATCTTTGATACAGAAATCTTCTATCAGAGGTGCATTTGAATCTGTTCCCCAGGATATCATCAATAGTGAGATACAATATAATCATTCGGTAAGAAATGTTTCTACTGCTCAGACAGGGTATATTCATCATGAAAACTTCACGCTATCAACTATAATTGATGTTAATTTAGATAATATTGAGAATGGGGAGCAAAAACCAGATGGTGGCATAACTCTACGGTGTGATTCACAGTCAACCCTTTTTGTATTTAGCCCTCCAACCTCTCAAACTCATCTATCTCACGAGGACTTGGATGATTCTGGCGATTCACTTGTTTTATCTGCTAGTAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTTCTATACCAGCCAAGGCTGGTAAAGTGATCATGATTACCTCTACTAAAGTATCTACAGGTGATGAAGATGATTGGGCTGTTGCAAACGAGCATGTTCAGGAACAAGAAGAATATGATGAAGATGATGATGAGTATCAAGAAGAAGATGAAGTTCATGAAGGGGAGGACGAGAACATTGACCTTGCACAAGATTTTGATGATTTACATTTAGACGATACTAATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGAGTTGGAGTGGGGATGTCAAATGATGAGTTTGAAAGAATTCCTGGAAATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAATTGCATCAAAGAAGAACAGGGGTCTTCGGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATACGTGGATGCTTCTCAAGTAAGGATTCCTGACCCTGAAGAGATGCATGACATGATCATGCAATTTAAAACTTCCCAAGCATTACCAGAACCTGAAATTACCAAGCAAGGAAATTCTTGCAGATCTAGTATGTCTTTTCAACAGCCAGTCTCGTCTTCAGTTTCAATGGCCTCACAATCTTCATCTGGTCAAGTTATTGTGTCGAATGAAGTCCTTTCAGGTCAAGTTGAGCCTCCTCTTAAGCTTCAGTTTGGTTTGTTCTCAGGTCCTTCTCTCATACCATCACATGTACCATCTATACAGATAGGCTCTATACAGATGCCTCTTCATTTACATCCCTAGATTACTCCATCTATAATTAAGAAGCATTCATCACAGCCCCCTTTATTCCAGTTTGGACAACTTAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTTCCTTTGCCTCCTCAACCGCCAACATTTGTTCAGCCCACTGCTCAAACTGGTTTTCCTTCAAGTGAAAACCCAAGGGATGCTCTGGTTCGTTCAAACTT
TTTAGGAAACTTGTTTTAACAAGTCCCAAAAAAATGATGTGTTGCCTTTTATGAAGGATAATCAAGTCCTTGTGTCAAGATCTCTGAATATGAATTCACCAAGAGATTCACAGTCATTACCTTTAACAGAAAGTATAGAAGTCAAAGTTATAAATCAGTAGGATCAAACTGCAGTTTCTTGCATTAACGAGAGCAATATCAAATCTCAACCAGGTTTTCAAGCAGAAAATCAGAGGCACCATGTTCCAACTTCAGGCAATCACTATCACTACATGGTATAAGGGGAAAAGAATCGGAAGGACGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTTAAGAGATAAGGGTAGGAGGTTTAAAACTCGTGGGCAATTTCCTAGTGGAAGAGGGAAAACATTTATCTTTACAGTAAGAAATTATGGATCTAGATTGCCTTTTGTAGGTTCTGAATCTATTCGTTTATATAATGGTGGACTTCAGAGGCGACTCAAGCGCAATACTCCCGTACTGAGTTTTGTGTTTGGGAAATTGTGGATAAAAAATTATCTAACAGTCAAGTTTCTTCGAACTACGTAGAGGTAGATGATAAACCAACTATTAATTGAATAAGCGCAGTCAATTTTTGCAAAAATGGGACTAGAAAGGTTGTCATATCTAATAAGCCATCAAAAAGAGCACTAGAGTCTGAAGGATTTAGCTCTGCAGCGAGTACTTCTTTAAAGCTTGATTCTGGTTATAGATATGTAAAGGGAGTGAAAAAAGATTATTTGGGCAAGAGCCAGGAAAGCCAATATTCGAGATAGAGTATCTTCAGAAAGAATATTGGTTCTTAGGAGGATGGTTATGCTCCTTTGCAGAGTGGGATCATAGGTGTATTTGAGCAACCTGGCATAGAAGCTCCCAGTGATGAGGATGATTTCATTGAGGTACGATCTAAAAGGCAAATGTTGAATGATAGGCGTGAACAAAGAGAGAAAGATACCAAGGCAAAGTCCCACAATTCTAAGGTTCATACGCTATTCTTCATTTCAATGCTCCACATGTCAACTTTTAAATGGTCCATGATGCACACTTAGCTGCCATGTCACCATTTCGGGATTTTCTCATTCAAGTTTGTTTTGATCATAACTTCTTCGTTTCAACTCCGATTTGAGTAATTCAAATTGCGTTAGAATCCTTGTTTCGAGCACTACACTGTTGACATATTAAATCATCAAAATTGTTAGTAAATAAGATCAGGTTTGTTATCATCAAAATATTAATTAATTAATAGGATTTATTAACTTCTCTTCGAACCAAACGAGATTTTATTAAGAGAATTTACCAATAGTACAACATTGTACCTCTTGTATTATATTCGCAACAAAAGTTTTAAAATAATTCATCTATAGCATATTGATTTTATCACTCAACTAGTGATAGAACTCAATTTGGATTTCTATCACTTGATAGAAAAATCAAGCAGTGATAGAATGAGTATATCATTAATTGAATTATATCACTAATAAAACTCAAACTAGTAATAGAACTTAATTTTAATTTTGTGCTACAAATGAATTAATTTCGATTTTGCGCTATAGATGGAAATACTTTGCCCTTTTTTGTTATACATGCAAAATCTTCTTTTATTAACTCCTCATTTTTACTTTTAAAAATAACTTTTATTTCTAACCATTAATTTTTAATCATTCATTTATAACTACTCAATCATTTATAATCACTCATTTTTAATCATTTCTTTAAGATCAATCAAAATAACTCATTTCAACTACTTATTTTTAATAAATTCTTTTTTAACTATTAAAAATAACTACTCATTTTAAATCTAAGTTCAAACAAGATTTTGACTCTAACTTCTCTACATAAAAACTACATCCAAACGAGTTTTTTTTACCATAACTCTCAATTTTTAACTACTCAATATAACTCCTTAAGCTAAATCCTACCTAAATATGTCAAACAAACACCACTGAGCTGAAATTATTATAAACTAGAT
TTGAAGAAATTTTATTATTATTATTATTATTTTAGTACAACTTGGAGGGGGACCAACAAATCCTACATAAAGGGAACCATTGCTCTCAAATAATATGAGAGTCAAATAATTGTGAAATTGACATGAATTAACATCAATCAAGATAATCAACTGCGTCAATGACTCATTCATAGAATATGAGATTTATGTTTTAATTTGATGTTTAATATAACCTCTAAACTCTCCACTGACCCATGACCGTGTGCCTAATATAATCTCCAAACTCTTGTCTATGCATCAACTTTGTTGTGATAATGATTGTAGGTTTTTTTTTTCTTTTTTCTTTTATGCTAATGAATTTATTATTCACGATATATCAAATAATAAAATTTTGTTCAAAGGCCTAGCATTGTCGATTTAGATCTTATTCGTACTAATTCACAACCCGATCATATTCAAGCTTATATGAGTATGAAAGTCTCGCCTTTTGTGTGGCATGACAGATTGGGCCACCCAAATACTGATGTTTTATGTTATGTTTGTTCTTCTTTACCTTGTACTACTAAATTCGTGAATGTTACATGTAAAGATTGTATACTTGGCAATATGACAAGGCAAACTTTTCCAAATTCTAGCTTTTTGTCTAGCTTTTCTCTATAACTCATTCATAGTGACGTTTGGGATTCGATTTCTGAAAATTCATTTAATGGAAAAAGATTCTACGTTTCCATAATAGACGACTTCTCTCGTTATACGTGGCTCTTCCCTATTTCTTATAAATCTGACGTGTTTGATGTTATCATCAAATTCATTACGTACATAGAAGTTTCTCTTCCATTTAAAGTTAAGAAATTTTGTATTGACGATGGAGGTGAATATGTTAACATTAGGCTTGGTTCTTCCTTTGTATTCATAGGAAATTTTTTTTTATGTTTTAATTAACAAGAATTTTTTAGTATGAAATTTAAATTTATGATTTCGTAATCATACAGTAAACACCATGCTCTATGTTTATTTTGAATGTAAATTAAATTTAACAAAAGTATTTGAATATAAAAATTGATTTATAGAGATTCAGTGGACAAAATTTATTTCATTGATAAGGTTGATGAGGTTGAGTGTAGTTCGTGGATCAATTTTTTACTTATTTTTACTGTTCCATAAAGTCTAATGATTTTGATATAAAGTTTAGTCATATCGAAAATAGAGCATCAAAGGTTTATTAGTTGATCTTTTTCATGAACCTTGATTTATTTTTTTATTGATATGGAAATTGCACAATCTAGCTACTCTTTTCTCTTACCTGTGCAATTGTTAACACGAGCATAAAAATCGACAAGGCGATGACAAATTTGGTATTGTCGGTACCATCCTCCTTCCCCAAGCACACCTTTAGAGAAAGTTTGTGGGAACAATAATGATAATAATTATGAGGGAATGAGAATTAGGGGAAAAATAAAAAAGAGAGTGATTATTTATGTTTGGAATGAAAGTAAGGATTACTTGTGTATTTAGGTGAAAAGAATGATTATGAAGAAAATAAAATAATGAATACAAGGAGAGTGTGAAATAGTAATATGATGTATAAAAAGTTGTAGTTGAGAGAATGATAGTTATGAGGAGAATGGAAATGTGATGTTTGAAAAACATGATTAAGAGAGTAATGATTTTTTGTTTGGGGTGTGCATGGGTTGGGTTGGGTAGGGTTGAGGGAGATTTTTAGCCCAACCCAAAAATTTGGGTTGATAATTTCTTCAACTCAACCCAATCAAAGAGACAACCCGACCCGACCCGACCCTTATATTTTTGGGTTGAGTTAAGTTGTGTCATCGAGTTCACAATAATTCTAAGCTCAAAGCTTGACCCCAAACTTAACAAGAGCACACTTGAGATTTTCTTCGATCTTCAAAATTTTAATTTCAATATAAGATTTCATTAAACCTTTTAACTATGAAATTAAATAAAAATAAAAAAAATTTATGTCTAATTTGTAAAAACAACTTGTTTTAATTCAACCTAAG
ATTAAATTATAAGAATTATTAAAATAAAGGTGATTAAAAATATAAAATTAGTATCAAAACCACTCCAAGGCCTTCTTTAATGATTTATCTAAATTATACATAATATACAGGTTTTGTTCTGGTTGAACCCAAACTAGAAAGGGCCAACCCAAGAACCGACCCAAAAATTCAATTTTCTTTATTTTTCAACCCAATCCAACTCGGAATAATACTTAATCCAACTCAACCCTTACGGTTTGGGTTGGGTAGTGTTTGCGAAATCGTGAACACACAATAATTAAAATAATAAGAAAATTAGAATGATGACGTGGTAGCTTGGATTTCGCTAAAGCGAAACCTCTACTCAGATTAAGCAAAATGTGTATGATTTCTCAAAGATGTGCAAGTATACACAGTCTTGTTGTAGCAAGTTTTAAAGTAAGTAAAAGTCATCTCTACAGGGATTAGTGATTAACAACATGCTTAAGCGAAAATTAGAGAGTTATGGTAACAAGGGGAATATTGATGTGATGAAACTCAAACTAAGTTGCTGAAATAAGTTTGCTAACTCAATTTGCTAAAGTGAAATAAATGTGAATCTCAATTGACAAAATACTTAAGATGCAACGAATGATAAACTAACTATAATCATATTAAAGACATCGGAAGAAGGTTAAGACTTTTCTGAGAGTTTCGCTAAAGCGAAATGGCATTTAGCTAAGAAGTGAACAACTGATGGCTTCTCCTTGGCTCTCTCAAACTCTCTAAGTTGGCTCTCTCGAGCTCCTAGTTTGGTTCTCTCGAACTCAAACAACAAATAAACCAAGCGATTCAGCACTAGGTGAACCTTATCTGTAAGTGTTCTTTTCAAATGAAAGTAAACTTAGCAAAACCTAATATTATGACATGCAAGGTTGCTCTATTCCTTTTCTCTAGAAAGCGAAATAAAGTTCATACAAATAGTTAGCTAGACAAAATGTATGTCAACACAACATGCATAAATAGTTGACATTTTGCTACCACTTTGCTTAAACGAAATACTCATACTTAGCCTATTTCCTTCCGACTAAAAACTTAGCTACACATAATGATAAATAGATTCAACGCAAAGTAAAAAAGAAAAGTTCATCTTCAATTAAGTAAGGAAATAACAACGTATGCTTACAAAACAAAGTTAAAGTGAAAATACTAAAATGAAAGTAAAAAGAAACAAAGTTAAGAAGAAGCCTAATCAGAATCACTCAAACAATGTCTCTGAATCAAGCTTCAGCTATACTCTGCAACGTTGGCTAACTACTTCGCTAGAGCTAACTAGCTAAACAAGTGGAAATGGAAAACAACTAAAACGACCAAGCTAACTATAAAAAAATGACAATGAATTTCGTCGGAAGAAAACTTGACGATGATCTCTCCAACAAAATGGATGATGCCCTTCACAATGACAAAATGTTGCTATTTATATTGCTTCAGACAAGACTCTACTCCACGATCTCATGCGCATGTCCTTTGCTCCATGATCTCAAGTGCAACGGTTGAGATTGAAAGACTTTTCTCGGATTTTTGTCACGTGTCTTCCACTCTAGCTATTTTTCCGATAGCTCATCTCGTTGGATTTCAGTTCGCTTAAGAGAAACTTTTAACAACTGCACTTAGCTCCGTAAAACTTTTCTCTCTGGATTTGTGTTCGCTTAAACCAAATGTTTAACAACTACACTTAGCTCCAAAAAGCTTTTAGCACTTCTGCATTTCGCTCCCTTGGTATGCTAGAGCAAAAACAAAATATCAAGTTTATAGAGATGGAACTTAAAGATTTACACATTCATGTACTTTTGCTAAAGCAAACCTCTAAAATGTTATCCTAAACAGAAAACTACAGGATTTTGCAATTCGCTCAAATAATTACGCTAAAGCGAACAGACTTAGCGAATTCTTGCTATGATGTAGCGAAAGTTCCAGCGAACTGCTATTAAACTTAGCCCTCAACTCTCAATTATTGAGAGTTATCAGGTAGTCCGGGTTCA
TCGGGTTATCGGTTCTTTTGTGCGCCCTTATTTTTTGTGGCCAAAACATAGAAATCATCTCCATATCTATTCTCTCCTCTTATTTTTGGTGACACAAGCGTATCCTTAATAAGTGCCATTTATTAAAGAGAGAAGTTTAGGTGCTAGCTGGGTAGTAAATATTTGTTGCTACCCACCACCATGTGGCATCTTTTCATTGGATCAATAATTTTCTTTTTAAAACAACTGGTTTTAAAACAATTGAATAAAATTAATTTTTTTAAAAAATTATTGGTCCAATGAAAAGATGCTACATGGCTTCAAGTAGCAACAAATATTTGCTACCCAGCTAGCACCCAAACTTCTCTCTTTATTAAAATCATTAAACCTTCAATAAACTAGTGGCATTTTTGTAATTTCATTGCAGTTTTTTTTTTTTTGCTATTCTTGATTTATTATTTTTGTAATTTCAAAATTTACTGATGACATTACTCCTTCAATATCTATCGGTGTCTCGCTTAGGGACGGACACAATTACCCTTGTTTAAAAAAAACGAATTTATCATTTTAAATTTGTTTAATTATTTATGTTTTCTTTTAATTCCTAAAAAAGCCCTTACAAATACTTTTATTTTTATACGCATAAAACATAGTTATAATAAATTGTGTTTTTATGTTTGGGTGGTTTAACGTTTGGGCTTTCTGTTCTTCTCCGCCCCATCTTCGTCGTGGTTAGTGTAGGGGTGGTTCCTTTCTTCTGACGTTTACTGTAGAGACGTCTTCTTCCTCGCCGCGCTCGATAATCTCCCAAACTCACAGAGGTAACTTCATTCTTCTTCTATCTTCCCTTTGATTCTATCTCAAATCACGCGTTCCATTGCACTGAAACCCTAATTGCAGGGCACAACTTCGTCTCTTTTCTTTCATTTTGATTGGTTGATTTGATCCATTGTATTGAGCTTTCCGAATCGGTAATACCTTCTGCTCAATTCTTGTTTTGATTTCATCTACCAGGAACAAGAACTTTTTGGGTTC
mRNA sequence
ATGGACACGTTCATCAACATTCCAACTCTTATGGATCAAATCGAACGCGGCCTGGTAGTCATGCAGCCGGAGGAGGAATGGTGGTCATTTCGAGGCCTTAAATACTGTTATAATGGTCAAAGGCGAGAGCGATCAGTTGGTGTGCAGTTATCTTCTAGGAAAGAGTTTTATGTGGGAGCTGGGTTTACAACTTCTAGAATATCTCATGGAAAAGGTATTACCAAACCAAAGTCTAATGATTATTCTCAGCTAAGTATGCAGAGACCTAACCTTTCTGGGAGTGGTGATCAAAATAATATGAGCCAAAAGTTTGACTCAGAATTTCAGGATAGTTTCGAGAATTTTGGTGATCCAGGATGGAGTGGAAGATCACGTTATTCCCAGAAGCAACATCGAGTTCTTCCTCCTCCATCTTTGATACAGAAATCTTCTATCAGAGGTGCATTTGAATCTGTTCCCCAGGATATCATCAATAGTGAGATACAATATAATCATTCGGTAAGAAATGTTTCTACTGCTCAGACAGGGTATATTCATCATGAAAACTTCACGCTATCAACTATAATTGATGTTAATTTAGATAATATTGAGAATGGGGAGCAAAAACCAGATGGTGGCATAACTCTACGGTGTGATTCACAGTCAACCCTTTTTGTATTTAGCCCTCCAACCTCTCAAACTCATCTATCTCACGAGGACTTGGATGATTCTGGCGATTCACTTGTTTTATCTGCTAGTAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTTCTATACCAGCCAAGGCTGGTAAAGTGATCATGATTACCTCTACTAAAGTATCTACAGGTGATGAAGATGATTGGGCTGTTGCAAACGAGCATGTTCAGGAACAAGAAGAATATGATGAAGATGATGATGAGTATCAAGAAGAAGATGAAGTTCATGAAGGGGAGGACGAGAACATTGACCTTGCACAAGATTTTGATGATTTACATTTAGACGATACTAATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGAGTTGGAGTGGGGATGTCAAATGATGAGTTTGAAAGAATTCCTGGAAATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAATTGCATCAAAGAAGAACAGGGGTCTTCGGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATACGTGGATGCTTCTCAAGTAAGGATTCCTGACCCTGAAGAGATGCATGACATGATCATGCAATTTAAAACTTCCCAAGCATTACCAGAACCTGAAATTACCAAGCAAGGAAATTCTTGCAGATCTAGTATGTCTTTTCAACAGCCAGTCTCGTCTTCAGTTTCAATGGCCTCACAATCTTCATCTGGTCAAGTTATTGTGTCGAATGAAGTCCTTTCAGGTCAAGTTGAGCCTCCTCTTAAGCTTCAGTTTGGTTTGTTCTCAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTTCCTTTGCCTCCTCAACCGCCAACATTTGTTCAGCCCACTGCTCAAACTGGTTTTCCTTCAAGTGAAAACCCAAGGGATGCTCTGGTTTTCAAGCAGAAAATCAGAGGCACCATGTTCCAACTTCAGGCAATCACTATCACTACATGGTATAAGGGGAAAAGAATCGGAAGGACGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTTAAGAGATAAGGGTAGGAGGTTTAAAACTCGTGGGCAATTTCCTAGTGGAAGAGGGAAAACATTTATCTTTACAAGTGGGATCATAGGTGTATTTGAGCAACCTGGCATAGAAGCTCCCAGTGATGAGGATGATTTCATTGAGGTACGATCTAAAAGGCAAATGTTGAATGATAGGCGTGAACAAAGAGAGAAAGATACCAAGGCAAAGTCCCACAATTCTAAGGGGTGGTTCCTTTCTTCTGACGTTTACTGTAGAGACGTCTTCTTCCTCGCCGCGCTCGATAATCTCCCAAACTCACAGAG
GTAACTTCATTCTTCTTCTATCTTCCCTTTGATTCTATCTCAAATCACGCGTTCCATTGCACTGAAACCCTAATTGCAGGGCACAACTTCGTCTCTTTTCTTTCATTTTGATTGGTTGATTTGATCCATTGTATTGAGCTTTCCGAATCGGTAATACCTTCTGCTCAATTCTTGTTTTGATTTCATCTACCAGGAACAAGAACTTTTTGGGTTC
Coding sequence (CDS)
ATGGACACGTTCATCAACATTCCAACTCTTATGGATCAAATCGAACGCGGCCTGGTAGTCATGCAGCCGGAGGAGGAATGGTGGTCATTTCGAGGCCTTAAATACTGTTATAATGGTCAAAGGCGAGAGCGATCAGTTGGTGTGCAGTTATCTTCTAGGAAAGAGTTTTATGTGGGAGCTGGGTTTACAACTTCTAGAATATCTCATGGAAAAGGTATTACCAAACCAAAGTCTAATGATTATTCTCAGCTAAGTATGCAGAGACCTAACCTTTCTGGGAGTGGTGATCAAAATAATATGAGCCAAAAGTTTGACTCAGAATTTCAGGATAGTTTCGAGAATTTTGGTGATCCAGGATGGAGTGGAAGATCACGTTATTCCCAGAAGCAACATCGAGTTCTTCCTCCTCCATCTTTGATACAGAAATCTTCTATCAGAGGTGCATTTGAATCTGTTCCCCAGGATATCATCAATAGTGAGATACAATATAATCATTCGGTAAGAAATGTTTCTACTGCTCAGACAGGGTATATTCATCATGAAAACTTCACGCTATCAACTATAATTGATGTTAATTTAGATAATATTGAGAATGGGGAGCAAAAACCAGATGGTGGCATAACTCTACGGTGTGATTCACAGTCAACCCTTTTTGTATTTAGCCCTCCAACCTCTCAAACTCATCTATCTCACGAGGACTTGGATGATTCTGGCGATTCACTTGTTTTATCTGCTAGTAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTTCTATACCAGCCAAGGCTGGTAAAGTGATCATGATTACCTCTACTAAAGTATCTACAGGTGATGAAGATGATTGGGCTGTTGCAAACGAGCATGTTCAGGAACAAGAAGAATATGATGAAGATGATGATGAGTATCAAGAAGAAGATGAAGTTCATGAAGGGGAGGACGAGAACATTGACCTTGCACAAGATTTTGATGATTTACATTTAGACGATACTAATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGAGTTGGAGTGGGGATGTCAAATGATGAGTTTGAAAGAATTCCTGGAAATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAATTGCATCAAAGAAGAACAGGGGTCTTCGGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATACGTGGATGCTTCTCAAGTAAGGATTCCTGACCCTGAAGAGATGCATGACATGATCATGCAATTTAAAACTTCCCAAGCATTACCAGAACCTGAAATTACCAAGCAAGGAAATTCTTGCAGATCTAGTATGTCTTTTCAACAGCCAGTCTCGTCTTCAGTTTCAATGGCCTCACAATCTTCATCTGGTCAAGTTATTGTGTCGAATGAAGTCCTTTCAGGTCAAGTTGAGCCTCCTCTTAAGCTTCAGTTTGGTTTGTTCTCAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTTCCTTTGCCTCCTCAACCGCCAACATTTGTTCAGCCCACTGCTCAAACTGGTTTTCCTTCAAGTGAAAACCCAAGGGATGCTCTGGTTTTCAAGCAGAAAATCAGAGGCACCATGTTCCAACTTCAGGCAATCACTATCACTACATGGTATAAGGGGAAAAGAATCGGAAGGACGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTTAAGAGATAAGGGTAGGAGGTTTAAAACTCGTGGGCAATTTCCTAGTGGAAGAGGGAAAACATTTATCTTTACAAGTGGGATCATAGGTGTATTTGAGCAACCTGGCATAGAAGCTCCCAGTGATGAGGATGATTTCATTGAGGTACGATCTAAAAGGCAAATGTTGAATGATAGGCGTGAACAAAGAGAGAAAGATACCAAGGCAAAGTCCCACAATTCTAAGGGGTGGTTCCTTTCTTCTGACGTTTACTGTAGAGACGTCTTCTTCCTCGCCGCGCTCGATAATCTCCCAAACTCACAGAG
GTAA
Protein sequence
MDTFINIPTLMDQIERGLVVMQPEEEWWSFRGLKYCYNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGDQNNMSQKFDSEFQDSFENFGDPGWSGRSRYSQKQHRVLPPPSLIQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLSTIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSASREGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEEDEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNEENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDASQVRIPDPEEMHDMIMQFKTSQALPEPEITKQGNSCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLFSGTRLLSPKVYFLCLLNRQHLFSPLLKLVFLQVKTQGMLWFSSRKSEAPCSNFRQSLSLHGIRGKESEGRAQDGMWPFDSVLRDKGRRFKTRGQFPSGRGKTFIFTSGIIGVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKDTKAKSHNSKGWFLSSDVYCRDVFFLAALDNLPNSQR
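The protein sequence above is consistent with a standard-genetic-code translation of the CDS: its first codons, ATG GAC ACG TTC, give MDTF, and its last codons, CAG AGG TAA, give QR followed by a stop. A minimal Python sketch of that translation follows. The function name `translate` and the compact codon-table construction are illustrative choices, not part of the database record, and the standard nuclear code (NCBI translation table 1) is assumed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown above.
print(translate("ATGGACACGTTCATCAACATTCCAACTCTT"))
```

Applied to the full CDS (with any line breaks removed first), the same function should reproduce the protein sequence listed above, provided the assumption about the genetic code holds.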
Homology
BLAST of Sed0026550 vs. NCBI nr
Match:
XP_022950041.1 (uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950043.1 uncharacterized protein LOC111453246 [Cucurbita moschata])
HSP 1 Score: 674.5 bits (1739), Expect = 9.5e-190
Identity = 448/909 (49.28%), Positives = 500/909 (55.01%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RRE + G ++SSRKEFY GAG TSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMITST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+VA E+SNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVS ASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRLLSPKVYFLCL----------------LNRQHLFSPLL----KLVFLQVKTQGML- 576
SG L+ V + + + H P L +L + +QG+L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 ------------------------------------WFSSRKSE---------------- 636
+SRK++
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 ------------------------------APCSNFRQSLSLHGI--------------- 641
C + S S G
Sbjct: 1379 LNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDNH 1438
BLAST of Sed0026550 vs. NCBI nr
Match:
KAG7034343.1 (hypothetical protein SDJN02_04070, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 673.3 bits (1736), Expect = 2.1e-189
Identity = 451/911 (49.51%), Positives = 500/911 (54.88%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RRE + G ++SSRKEFY GAG TSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMI+ST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+VA EISNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFVAPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVS ASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRL------------------------------------------------------- 576
SG L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 LSPK----------------------------------------VYFLCLLNRQHLFS-- 636
L+P+ V L + N+Q L S
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 ------------PLLKLVFLQV-----KTQGMLWFSSRKSEAPCSNFRQSLSLHGI---- 641
PL + + QV +T G S P F+ H +
Sbjct: 1379 LNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEP--GFQSEHQRHHVSTSD 1438
BLAST of Sed0026550 vs. NCBI nr
Match:
KAG6604182.1 (hypothetical protein SDJN03_04791, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 671.8 bits (1732), Expect = 6.2e-189
Identity = 451/911 (49.51%), Positives = 499/911 (54.77%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RRE + G ++SSRKEFY GAG TSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++ AKAGK IMITST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVTAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+VA EISNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFVAPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVS ASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRL------------------------------------------------------- 576
SG L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 LSPK----------------------------------------VYFLCLLNRQHLFS-- 636
L+P+ V L + N+Q L S
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 ------------PLLKLVFLQV-----KTQGMLWFSSRKSEAPCSNFRQSLSLHGI---- 641
PL + + QV +T G S P F+ H +
Sbjct: 1379 LNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEP--GFQSEHQRHHVSTSD 1438
BLAST of Sed0026550 vs. NCBI nr
Match:
XP_022978332.1 (uncharacterized protein LOC111478360 [Cucurbita maxima] >XP_022978333.1 uncharacterized protein LOC111478360 [Cucurbita maxima] >XP_022978334.1 uncharacterized protein LOC111478360 [Cucurbita maxima])
HSP 1 Score: 670.6 bits (1729), Expect = 1.4e-188
Identity = 448/909 (49.28%), Positives = 498/909 (54.79%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RE + G ++SSRKEFY GAG TTSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPWREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD W GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHAWRQEGSRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMI+ST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+ EISNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFATPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVSMASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSMASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRLLSPKVYFLCL----------------LNRQHLFSPLL----KLVFLQVKTQGMLW 576
SG L+ V + + + H P L +L + +QG+L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 FS-------------------------------------------------------SRK 636
+ SR
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALLIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 SEAPCSNFRQSLSL---------------------------------------------H 641
S S +SL L H
Sbjct: 1379 SNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDENNSRSELGFQAEHQRQHVSTSDNH 1438
BLAST of Sed0026550 vs. NCBI nr
Match:
XP_023543957.1 (uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023543958.1 uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023543959.1 uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 667.9 bits (1722), Expect = 8.9e-188
Identity = 446/909 (49.06%), Positives = 498/909 (54.79%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RRE + G ++SSRKEFY GAG TTSRI + +G+ +P+S+DYSQL QRPNLSG GD
Sbjct: 779 YPGPRREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMAEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVST+QT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTSQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMITST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+VA EISNCI+EE GSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFVAPEISNCIREELGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSV ASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVLTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRLLSPKVYFLCL----------------LNRQHLFSPLL----KLVFLQVKTQGML- 576
SG L+ V + + + H P L +L + +QG+L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 ------------------------------------WFSSRKSE---------------- 636
+SRK++
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 ------------------------------APCSNFRQSLSLHGI--------------- 641
C + S S G
Sbjct: 1379 LNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDNH 1438
BLAST of Sed0026550 vs. ExPASy TrEMBL
Match:
A0A6J1GDR0 (uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC111453246 PE=4 SV=1)
HSP 1 Score: 674.5 bits (1739), Expect = 4.6e-190
Identity = 448/909 (49.28%), Positives = 500/909 (55.01%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RRE + G ++SSRKEFY GAG TSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMITST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+VA E+SNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVS ASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRLLSPKVYFLCL----------------LNRQHLFSPLL----KLVFLQVKTQGML- 576
SG L+ V + + + H P L +L + +QG+L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 ------------------------------------WFSSRKSE---------------- 636
+SRK++
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 ------------------------------APCSNFRQSLSLHGI--------------- 641
C + S S G
Sbjct: 1379 LNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDNH 1438
BLAST of Sed0026550 vs. ExPASy TrEMBL
Match:
A0A6J1IST3 (uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360 PE=4 SV=1)
HSP 1 Score: 670.6 bits (1729), Expect = 6.7e-189
Identity = 448/909 (49.28%), Positives = 498/909 (54.79%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
Y G RE + G ++SSRKEFY GAG TTSRI + +G+T+P+S+DYSQL QRPNLSG GD
Sbjct: 779 YTGPWREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGGD 838
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
Q N SQ+FDSEFQD+ ENFGD W GRSRYSQ
Sbjct: 839 QYNRSQEFDSEFQDNVENFGDHAWRQEGSRNNFYFPYPERVNPISEADGSYSVGRSRYSQ 898
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG F SV +DI SEIQY+H RNVSTAQT YIHHEN TL
Sbjct: 899 RQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTLP 958
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQKPDG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 959 EIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1018
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDNES++PAKAGK IMI+ST+ STGDED+W V +EHVQEQEEYDEDDD Y+EE
Sbjct: 1019 REGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYREE 1078
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDLAQ+FDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1079 DEVHEGEDENIDLAQNFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNE 1138
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+ EISNCI+EEQGSSEGLQVDGKVCQY DA SQ+RI DPEEM D++MQ +T+QAL
Sbjct: 1139 ENMFATPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQAL 1198
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
PEPEI +QGN SCRSS+S QQP+SSSVSMASQSSSGQVIV N SGQ EPP+KLQFGLF
Sbjct: 1199 PEPEINEQGNSSCRSSVSVQQPISSSVSMASQSSSGQVIVPNAAGSGQAEPPVKLQFGLF 1258
Query: 517 SGTRLLSPKVYFLCL----------------LNRQHLFSPLL----KLVFLQVKTQGMLW 576
SG L+ V + + + H P L +L + +QG+L
Sbjct: 1259 SGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1318
Query: 577 FS-------------------------------------------------------SRK 636
+ SR
Sbjct: 1319 LAPQPLTFVPPAVQTGFPLNKNPGDALLIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRS 1378
Query: 637 SEAPCSNFRQSLSL---------------------------------------------H 641
S S +SL L H
Sbjct: 1379 SNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDENNSRSELGFQAEHQRQHVSTSDNH 1438
BLAST of Sed0026550 vs. ExPASy TrEMBL
Match:
A0A6J1BUR5 (uncharacterized protein LOC111005936 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111005936 PE=4 SV=1)
HSP 1 Score: 658.7 bits (1698), Expect = 2.6e-185
Identity = 440/909 (48.40%), Positives = 491/909 (54.02%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
YNG RRE +G + +SRKEFY GAGFTTSRISH +GIT+P+S+DYSQL RPNLSG GD
Sbjct: 750 YNGPRREAPIGGRATSRKEFYGGAGFTTSRISHRRGITEPQSDDYSQLRGHRPNLSGGGD 809
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
+ S FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 810 HYSQSPDFDSEFQDNVENFGDHGWRQETGRNNFYFPYPERVNPISETDGSYSVGRSRYSQ 869
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG +ESVP+DI+ SEIQY+H RNVSTAQTGYIHHEN +
Sbjct: 870 RQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPARNVSTAQTGYIHHENRSFP 929
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNLDN EN EQKPD TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 930 EIIDVNLDNAENEEQKPDCDTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 989
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIED ES++P K GK IMI+ST+VSTGDED+WAV NEHVQEQEEYDEDDD Y EE
Sbjct: 990 REGTLSIEDTESAVPTKCGKEIMISSTRVSTGDEDEWAVGNEHVQEQEEYDEDDDGYDEE 1049
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHE EDENIDLAQDFDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1050 DEVHEVEDENIDLAQDFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERISGNE 1109
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+V EIS+CI+EEQGSSE LQVD +CQY DA SQVRIPD EEM D+++ K +QAL
Sbjct: 1110 ENMFVTPEISSCIREEQGSSEVLQVDSNICQYPDASSQVRIPDTEEMKDLVILSKPAQAL 1169
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
P E+T+QG SCRS +S Q P+SSSVSMASQS GQVIV N +SGQ EPP+KLQFGLF
Sbjct: 1170 PGSELTEQGKFSCRSGVSVQLPISSSVSMASQSPPGQVIVPNAAISGQAEPPVKLQFGLF 1229
Query: 517 SGTRLLSPKVYFLCLLNRQ--------------HLFS----------------------- 576
SG L+ V + + + Q H+ S
Sbjct: 1230 SGPSLIPSPVPAIQIGSIQMPLHLHAQITPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1289
Query: 577 -----------------PLLK------------------------LVFLQVKTQGM---- 636
PL K L FL QG+
Sbjct: 1290 LAPQPLTFVPPTVQTGFPLNKNPGDAPSIQTSQETCAHNSRKNDVLPFLMDNQQGLASRS 1349
Query: 637 LWFSSRKSEAPCSNFRQSLSL------------------------------HGI------ 641
L S P ++ ++S + H +
Sbjct: 1350 LNSSGESKSLPLTDNKESEVITQQDQTAGSCIDESNSRSESGFQAEHQRHHHNVSTSDDH 1409
BLAST of Sed0026550 vs. ExPASy TrEMBL
Match:
A0A6J1BUX9 (uncharacterized protein LOC111005936 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005936 PE=4 SV=1)
HSP 1 Score: 658.7 bits (1698), Expect = 2.6e-185
Identity = 440/909 (48.40%), Positives = 491/909 (54.02%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
YNG RRE +G + +SRKEFY GAGFTTSRISH +GIT+P+S+DYSQL RPNLSG GD
Sbjct: 783 YNGPRREAPIGGRATSRKEFYGGAGFTTSRISHRRGITEPQSDDYSQLRGHRPNLSGGGD 842
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
+ S FDSEFQD+ ENFGD GW GRSRYSQ
Sbjct: 843 HYSQSPDFDSEFQDNVENFGDHGWRQETGRNNFYFPYPERVNPISETDGSYSVGRSRYSQ 902
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ IQKSS+RG +ESVP+DI+ SEIQY+H RNVSTAQTGYIHHEN +
Sbjct: 903 RQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPARNVSTAQTGYIHHENRSFP 962
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNLDN EN EQKPD TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 963 EIIDVNLDNAENEEQKPDCDTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1022
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIED ES++P K GK IMI+ST+VSTGDED+WAV NEHVQEQEEYDEDDD Y EE
Sbjct: 1023 REGTLSIEDTESAVPTKCGKEIMISSTRVSTGDEDEWAVGNEHVQEQEEYDEDDDGYDEE 1082
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHE EDENIDLAQDFDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERI GNE
Sbjct: 1083 DEVHEVEDENIDLAQDFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERISGNE 1142
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDGKVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQAL 456
ENM+V EIS+CI+EEQGSSE LQVD +CQY DA SQVRIPD EEM D+++ K +QAL
Sbjct: 1143 ENMFVTPEISSCIREEQGSSEVLQVDSNICQYPDASSQVRIPDTEEMKDLVILSKPAQAL 1202
Query: 457 PEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGLF 516
P E+T+QG SCRS +S Q P+SSSVSMASQS GQVIV N +SGQ EPP+KLQFGLF
Sbjct: 1203 PGSELTEQGKFSCRSGVSVQLPISSSVSMASQSPPGQVIVPNAAISGQAEPPVKLQFGLF 1262
Query: 517 SGTRLLSPKVYFLCLLNRQ--------------HLFS----------------------- 576
SG L+ V + + + Q H+ S
Sbjct: 1263 SGPSLIPSPVPAIQIGSIQMPLHLHAQITPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1322
Query: 577 -----------------PLLK------------------------LVFLQVKTQGM---- 636
PL K L FL QG+
Sbjct: 1323 LAPQPLTFVPPTVQTGFPLNKNPGDAPSIQTSQETCAHNSRKNDVLPFLMDNQQGLASRS 1382
Query: 637 LWFSSRKSEAPCSNFRQSLSL------------------------------HGI------ 641
L S P ++ ++S + H +
Sbjct: 1383 LNSSGESKSLPLTDNKESEVITQQDQTAGSCIDESNSRSESGFQAEHQRHHHNVSTSDDH 1442
BLAST of Sed0026550 vs. ExPASy TrEMBL
Match:
A0A1S3B1H0 (LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 OS=Cucumis melo OX=3656 GN=LOC103484772 PE=4 SV=1)
HSP 1 Score: 650.2 bits (1676), Expect = 9.3e-183
Identity = 441/909 (48.51%), Positives = 498/909 (54.79%), Query Frame = 0
Query: 37 YNGQRRERSVGVQLSSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGD 96
YNG RRE S G ++SSRKEFY GA FTTS+ SH +GIT+P+S++YSQL QRPNLSG D
Sbjct: 789 YNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGVD 848
Query: 97 QNNMSQKFDSEFQDSFENFGDPGWS----------------------------GRSRYSQ 156
N +Q+FDS+FQD+ ENFGD GW GRSRYSQ
Sbjct: 849 HYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSVGRSRYSQ 908
Query: 157 KQHRVLPPPSL--IQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLS 216
+Q RVLPPPS+ +QKSS+R +ESVP+D I SEIQY+H N+STAQT YIHHEN L
Sbjct: 909 RQPRVLPPPSVASMQKSSVRNEYESVPRD-IESEIQYDHPASNISTAQTMYIHHENRALP 968
Query: 217 TIIDVNLDNIENGEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSAS 276
IIDVNL+N EN EQK DG TLRCDSQSTL VFSPPTS THLSHEDLDDSGDS VLSAS
Sbjct: 969 EIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSAS 1028
Query: 277 REGTLSIEDNESSIPAKAGKVIMITSTKVSTGDEDDWAVANEHVQEQEEYDEDDDEYQEE 336
REGTLSIEDN+S++PAKAGK IMITST+VSTGDED+W +EHVQEQEEYDEDDD YQEE
Sbjct: 1029 REGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDEDDDGYQEE 1088
Query: 337 DEVHEGEDENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNE 396
DEVHEGEDENIDL DFDDLHLDD +GSPHMLDNLVLGFNEGV VGM NDEFERIPGNE
Sbjct: 1089 DEVHEGEDENIDLVPDFDDLHLDD--KGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNE 1148
Query: 397 ENMYVATEISNCIKEEQGSSEGLQVDG-KVCQYVDA-SQVRIPDPEEMHDMIMQFKTSQA 456
EN+YVA+EISN I+EE+GSSEGLQVDG KVCQYVDA SQ+RI DPEEM D++MQ KT+QA
Sbjct: 1149 ENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRI-DPEEMQDLVMQSKTAQA 1208
Query: 457 LPEPEITKQGN-SCRSSMSFQQPVSSSVSMASQSSSGQVIVSNEVLSGQVEPPLKLQFGL 516
LP+ EIT+QGN SCRSS+S +QP+SSSVSMASQS SGQVIV + V SGQ EPP+KLQFGL
Sbjct: 1209 LPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPSAV-SGQAEPPVKLQFGL 1268
Query: 517 FSGTRLLS---PKVYFLCLLNRQHL----------------------------------- 576
FSG L+ P + + HL
Sbjct: 1269 FSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSPGVL 1328
Query: 577 --------FSPLLKLVFLQVKTQG----------MLWFSSRKSEA--------------- 636
F+P ++ F K G SSRK+++
Sbjct: 1329 PLAPQPLTFAPTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQQGLVSRS 1388
Query: 637 -------------------------------PCSNFRQSLSLHGI--------------- 641
C + S S G
Sbjct: 1389 LNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLHVSTSDNH 1448
BLAST of Sed0026550 vs. TAIR 10
Match:
AT3G50370.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; Has 27734 Blast hits to 16708 proteins in 1259 species: Archae - 81; Bacteria - 3434; Metazoa - 10876; Fungi - 2514; Plants - 987; Viruses - 212; Other Eukaryotes - 9630 (source: NCBI BLink). )
HSP 1 Score: 172.6 bits (436), Expect = 1.1e-42
Identity = 166/491 (33.81%), Positives = 238/491 (48.47%), Query Frame = 0
Query: 51 SSRKEFYVGAGFTTSRISHGKGITKPKSNDYSQLSMQRPNLSGSGDQNNMSQKFDSEFQD 110
S ++EF+ AG+ ++ KP ++S Q + G G + + +SE ++
Sbjct: 660 SPQEEFFGTAGYLSA-----PSYFKPGFPEHS--IDQSWRIPGDGRTHGRNYGMESESRE 719
Query: 111 SF-ENFGDPGWS------------------------------GRSRYSQKQHRVLPPP-S 170
+F E +GDPGW GR RYS +Q RVLPPP
Sbjct: 720 NFGEQYGDPGWGQSQGRPRHGPYSPYPEKLYQNPEGDDYYPFGRPRYSVRQPRVLPPPQE 779
Query: 171 LIQKSSIRGAFESVPQDIINSEIQYNHSVRNVSTAQTGYIHHENFTLSTIIDVNLDNIEN 230
QK+S R E I Y+H R ST YI + + ID +
Sbjct: 780 SRQKTSFRSEVEHPGPSTSIGGINYSHKGRTNSTVLANYIEDHHVLPGSGIDEH------ 839
Query: 231 GEQKPDGGITLRCDSQSTLFVFSPPTSQTHLSHEDLDDSGDSLVLSASREGT---LSIED 290
++ D +T RCDSQS+L V SPP S HLSH+DLD+S DS VL SR G L +
Sbjct: 840 --RRFDTKLTGRCDSQSSLSVTSPPDSPVHLSHDDLDESADSTVLPTSRMGEDAGLLEKG 899
Query: 291 NESSIPAKAGK-VIMITSTKVSTGDEDDWAV-ANEHVQEQEEYDEDDDEYQEEDEVHEGE 350
I + GK +M+ + VS D ++W + +NE +QEQEEYDED+D YQEED++H G
Sbjct: 900 GAPIISSDIGKDSLMMATGSVSCWDNEEWTLDSNERLQEQEEYDEDEDGYQEEDKIH-GV 959
Query: 351 DENIDLAQDFDDLHLDDTNRGSPHMLDNLVLGFNEGVGVGMSNDEFERIPGNEENMY-VA 410
DENIDLAQ+ +++HLDD + NLVLGFNEGV V + +D+FE+ N E+ + +
Sbjct: 960 DENIDLAQELEEMHLDDKD-------SNLVLGFNEGVEVEIPSDDFEKCQRNSESTFPLH 1019
Query: 411 TEISNCIKEEQGSSEGLQVDGKVCQYVDASQVRIPDPEEMHDMIMQFKTSQALPE----- 470
+ + +E+ S E + + A + DP MH+ F+ ++ +
Sbjct: 1020 QHTVDSLDDERPSIETSRGEQA------AQPAVVSDPLGMHNASRTFQGAETTMQNLTVH 1079
Query: 471 PEITKQG-------NSCRSSMSFQQPVSSSVSMASQSSSGQVIVSN-EVLSGQVEPPLKL 491
P I +Q +S +S PV S A SS Q + S Q+E P+K
Sbjct: 1080 PNIGRQSFEVASKVDSTSNSTVSTHPVIPLHSAALHPSSLQTAIPPVSTTSAQMEEPVKF 1121
HSP 2 Score: 77.4 bits (189), Expect = 4.8e-14
Identity = 39/51 (76.47%), Positives = 44/51 (86.27%), Query Frame = 0
Query: 592 GIIGVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKDTKAKSHNSKGW 643
GI+ VFEQ GIEAPSD+DDFIEVRSKRQMLNDRREQREK+ K KS +K +
Sbjct: 1431 GIVRVFEQQGIEAPSDDDDFIEVRSKRQMLNDRREQREKEIKEKSQAAKAF 1481
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022950041.1 | 9.5e-190 | 49.28 | uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 unchar...
KAG7034343.1 | 2.1e-189 | 49.51 | hypothetical protein SDJN02_04070, partial [Cucurbita argyrosperma subsp. argyro...
KAG6604182.1 | 6.2e-189 | 49.51 | hypothetical protein SDJN03_04791, partial [Cucurbita argyrosperma subsp. sorori...
XP_022978332.1 | 1.4e-188 | 49.28 | uncharacterized protein LOC111478360 [Cucurbita maxima] >XP_022978333.1 uncharac...
XP_023543957.1 | 8.9e-188 | 49.06 | uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name | E-value | Identity (%) | Description
A0A6J1GDR0 | 4.6e-190 | 49.28 | uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC1114532...
A0A6J1IST3 | 6.7e-189 | 49.28 | uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360...
A0A6J1BUR5 | 2.6e-185 | 48.40 | uncharacterized protein LOC111005936 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1BUX9 | 2.6e-185 | 48.40 | uncharacterized protein LOC111005936 isoform X1 OS=Momordica charantia OX=3673 G...
A0A1S3B1H0 | 9.3e-183 | 48.51 | LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 OS=Cucumis melo OX=365...
Match Name | E-value | Identity (%) | Description
AT3G50370.1 | 1.1e-42 | 33.81 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...