Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGTTGGTAAAAAAATCCTCTAAGCTTTGGGAACTCACTGATGTCCCTTCTCCTTGAACTCTTTTGCATCCAGATAAGGTTCTCTTGAAATGAAAAAAGATAAAGATCAAGGCCAGGTTTTTGTTACCAAAAGGATCGGTGCTTGGTTGGCGATTAATTCCTTCTATGGCTACTATAATTTCTAGTTCTCATCTTGTTAGAACAGAACCCTCTACCTGGCCACCATCCATTCCTTGCACTCAAGTGAGAGAATGTCAAGTACTTTTGACAAGGTTGACAAGGATCACAGTGTTCAACCATTCAAGCTTTTGTTGAAGCAAGAACCCTTAAATCAAAGAAATTAACACCTATTTGTGCAAAATTTAGATCAACCAAGAGATAAAGCTAGAATTAGATTATCTCTTAAGATCAAAGATCTTGAAACAAGATTTGGAACCTTGTTCTAAATTGAGTCACTCCACAATCAAACTTGATCAAGGCTTGATTGAATAGCTCATATGCAATTTACCACAAGAATTGCATGAAACTTAGAAATGACAAAAATGATGCTCAAATTTCTAAGGGTGAAAGTCTCAATTTGAGAGCTAAATTTCATTCATCCAAAAATCTAACTTTTACATAGTATTTTGTGGGTATATAGTCTATCCACAAAAGAAAGATCCAATTCTTCAAATGCTCACTAAGGAAGTTAGTGGCCATGATTTGTGAGTTACCCCCAATACCCTTTAAATATTCACAAAAATAGATGTTATGACCACTTTGACAAGTAAATTCCCACTCTTGTTCCCACTCCAAGAAATTATGGAATTAAAATAATAAATAAAGTTTTCTAACAAATTATCCATCAAGTGCTAGTTATAGCAATCTCATCCGCTACTTCAACTATGAAAATATTCGCTTCTGAAGCTCCTAATGGTTCTTCTTCCTCAATCGAATCAGTTTGTAACACATGAAATAACGATTGAACCGAGTCTATCCAATTTTGTAGATAAAAAATGAATGCTTGTTGAATCCTCTCGGTCTTGGGTCGTGTAATAGGTCTAGTTGGAACATTTGGAACACCGTGATCCTTATCATCTTCTTCCTCTTCAAAAGAATTCGTCCTTGAATCAGGAAAATCATCAGCTACGTCAGAAGGACTCAAATAAGTAACATTGAAAGTAGAATGTACTGAAAATTCTCCAGGAAGATCAATTTTATAGACATTGTTATTAATTCTTTCAAGAACTTGGTAAGGACAATCCCCTCTTGGATGTAACTTTGTCTTCCTTTAAGATGGAAATTGCTCCTTGCGAAAATGTATCCAAACCGAATCACCAGGTTGAAAAATGATTATCTGTCCTTTGTTAATTCTTGAAACAACTTTGAAATTTTGTTTTTCAATTCTCTCCTTCACCATTTTGTGCAATGTTTTGATCAAATCTGCCTTACTTGTGGCTACTTCACTCACAAAAGTATAAGAAGGAATAGGAAGCAAATCAAGAGGCGTAAGTGGATTAAATCCATAAACAACTTCAAAAGGAGAACAATGAGTGGTACTATGAATTGCTCTATTATATGCAAACTCTACAAATGTTAATATGCAAACTCTACAAATGGCAAACATTCGTCCCAAGACTTAATATTCTTAGAAATAATAGCCCTAAGCAACACTCCTAATGTTCTATTAACCTCCTTAGTTTGCCCATCAGTCTGAGGATGAGATGTAGTTGAAAACAACAACTTGGTACCCAATTTACCTCACAATACTTTCCAAATGTTCCAACTCGACTTATTACATGATTAAAGATCGAGAAGATTAAGAAAACATTCATTCTTTATCTACAAAATTGGATAGACTTGGGTCAATCGTTATTTCATGTGTTACAAGCTGATTCAATTGAGGAAGAAGGATCGCTTGGAGCTTTGGAAGTGAATATTTTCACGTTTGAAGTAGCGGATGAGATTGCTATAACTAGCATTTAATGGATGATTTGTTAGAAAACTTTATTTATTATTT
TAATTCCATAATTTCTTGTAGTGGTAACAAGGGTGGCAATTTACTTGTCAAAGTGGTCATGACATCTATTTTGTGGAGATTTAAAGGATAGTGGGAGTAGCTCACAAATCATGGCCACTAACTTCCTTAATGGGCATTTGAAGAGTTGGATATTTCTTTTGTGGATAGACTATAAATACCCACAAAATACTATATAAAAGTTAGTTTTTTTTAGATGAATGAAATTTAGCTCACAAATTGACACTTTCACCTTTAGAAATTAGAAATTTAAGCATTAGTCTTGCAATTTCTAAGTTTCATGCAATTCTTGTGGTAGATTACATATGAGCTATTCAAACAAGTCTTGATCAAGTTCGATTGTGGAGTGACTCAATTTAGAACAAGGTTTCAAATCTTACTTTTAAATCTTTGATCTTAAGAGGTAATCTAATTCTAGTCTTATCTCTTGGTTGAGCCAAATTTTGTATAAGGAGGTGTTAATTTCTTTGATTCAAGAGTTCTTGCTTTAACAAAAGCTTGGATCGTTGGGCTAAGGATTAGGGTTGCCTTATCAACTTTTAGCATGAGTTAATTATTAAACCTGATGTAGGCAATGCTTTTTATATCTCCCAAAATTCTTAGTGGTTAGGCCTTCAAGGTACCTTTTTTGGCCTAGCTGCCTCATAAGTGGATCGTTCCTTTTTCGATAGATTGCAGTAGTTTTAAGTTCACATATCTCTATTATTTGGGATTAAAAAATCTTTTTTACATTATAGCATTTTTTTTATTTGACCAATTGTCGTTTGAACATCCAGGGAATTCAAATTGGAGCTGTCACTTCTCTATATTTACTTTGTTGCATTTTGCGCATAGTTAAAATAAAAGATCTGGCAAACACCATCTCTACTGCCTTTTTTTGTCCATTGGACTCTTTCTCCCCACATTGTGAAGGCAGACTGATCGAAAATATGAATTGGTTATCTTGTGCAAATAGAAGCCAGTCATCAGGAAGTGATAGCATTGTAAGGCAGCCCTTGGATGCCGAGTCTTTAAGAAAAGAAGTATTATATTCTTCTACTCCTAAAACTGAGTTAGAAGGTGCGTCTATGAAAAATGGTTTTCGAGGCTCCTGCTTGGATTTGAGGTAACGTTTTAGCCAATTAGAGCTAGCTTAAAGTTCTCCTGTCATCTTATTCATATTCAAACACCCATCTCAACTTTCAACTGGAAAAGAGGTTTCAATGACTGAACTTATGCCTTTAAATGATTAATCAAGAGCAAATAAACTCATAGCATTGGCTTCCTGAACTCTGCAACATATGATATCTGAAATCACAATGTTCTATTTTACTTGTCGAAAGAATTTGGTTGCAAATAATTAGCTGCCAGTTTTATTAATTTTTCCTACACTTCTACTGTTTTAGAGTTTTGCATTTTTCTTCTTCTCTTCATCACCTTGTAGGATTTATCGTCTTGCCATGTTTTGAACTCCATGTTCTTTTTAAATAGTTCTTGTTTCTTGTAGAACAATATGCAAGCGGTGGCACTAAAATTTAGTTCCTGCACCACAAAGTTTGGATCCCAGTTACAGACTGGAGAAGTGTCTGGGCTTGTAGTTGACATCCTTTTTTTTTTTTTCTTTTTGTAATTATTATGAACACCTTGTATAGGCTTTTCTTTTTGTTTTGTTTGTTTGTTTTTTCTAACCAGAAATGTTTTTTTTTTTTTTATTATTATTATTATCAACACCTCCATCAAACTATTTGTTCTACTTCTAACATTGTTTCTATCTCTCATTCTTGTTTTTTCTTTTTTTTTTTTTTTTCAATACCAAATGTTTCAAGGGGGGAAAATCAATGTCAAATAAAAAGGAGCCATGCTAAAACAAAAAAGGGCTCCAATCAAGTAAAAGGAAACTTAAAAGATAATTATAAAAAGTCTTAATGATTGAGGCTCAAAGAGAGACATGCAACCTAAAATGAGCTCAAGGGTCCCCTTAAATCTCTTTGACTCGCTAAAAC
CTTGTTGTTCCTCTCGATCCAAATACCTCACAAAAGTTGAGATGGGTGTTTGAATATGAATAAGATGACAGGAGAACTTTTGTGCATATCAGATCCAATCTGAAACTTCCAATTTCTATTTTCTAAAGTGGTTCTTGCAAATGTTAGAAACTCTTGATTCAATAGATTTTCTTTTTCCTTTCCCTTTATTGGGAGGCAGGTTGACTCAGCATAATATATGTACTCACATTAGTCCAAATAATATTTAATGTTTGTCAGCAATGTGTTCCAAAATATCCCAACATGTATGTTTTTGCAGGGAAGCTTTGCTTTCTCATATAACAACTGGGGACGATGTAGAAGTCTTGGGTGCTCTAAGTGTTCTGGCTACACTATTGCAGACTGAAGGTCAGATCAATGCAGTTATTCAACTTCCTTCACTCCGTTATTCTTTTCTTATCATGCATCCCATCTCTGTAGAGCTGGACGAATCAATGCTGGATGCGCTTGCAATCCTTCCTCAAAGAAAACAACATAAGAAATTGTTATTGGTACTCTTCCATGATATACAATATTGCAATTTTTGGACTCTTTCAATAGATAGCATGTGTTGGTCCTTTTCCATTCTGTTCTCGTCAGGCTTAATTTTGTTTTCTTGACTGATATTAACGAACATTATTGAGCACTCCTCAAATGCACAATATGTGAAGAAGAAAAAAAAAAAAAGTATGCCTGCCTGGCATTTCCATATTTTAATTTGATCTCTTGATACTAAAGATTTATGTAGCAATTTATTGATTTTTGATTGCTCCATCTATCACACTGAACTTGATCATTGGCATTTTTCTGAATTTATTTTTAAATTATTGCATCTGAACTTGTGATTTATTTTTTGTAATCTTGGAAGTATAGATATTTGAAGATTTATATCAGTTTATAATCCATTTCCAGTAGATTACAGCATTTGTGTCAACTTCAATGAAGTAAATTTTAAGGCTGTTTGTGGGGTTGAATCCTCAATCACGTTGTTAACATATTGGTCATCTTCAATGGACTCTGTAGTGTTAAATTTTTATTTGTATTTTTATATGCTTGTATGGATAAAGTTCGAAAAATTTGGGTCTGTAACTTTAAATTGAGTTTTATTCTTTCCTGATGCTCTTGCATATGTAGCTCTGTCTTGTCACTGTTTTTACATGTTGCAGTTTACGGCCACATTTGATCATGTTTTCAGAATTTGAGTACTTACTACCTGTTAAGAACCTAAGATTATGCTGTTTTATTTTTGCACCTGCTTTTTGACAAACACGTTCAGACAAGTCTCAAATTATAAATTATATTCAACAGGAAGCCTTAGTTGGTGAGGATTCTGGCGAACAACAACTCTTTTCTTCAGAAAACGCCTCATCGAAAGGTGGCGTCAATGTTGAACTTGATGGTTACCTAAAGAAGCTTAAGGTACTCTTTTTGTCTTCCATGCTTCCACTAACCTTCTTTCATGCTAATATTAAGAAAGGAATCATATTGCATACAAGAATTGTAATACTGAAAGGCTTTCAAGGGCTACTCTGATAATATTTTAGTTGCATTTCTGAAGTTAAACTCTGGTTTTTTATTATGGCACCTTTGTTTGAGGACAAAAAGATTTAATTTCATGCTGCTTGATTCTTATTTAATTTCAGGATTATGGCATTTCATATTTTCTTAAAGTAGGTGCAAGCCCTCGTGCCCTTAGGTTTGTGGTACTTCTCTATATCTTATGGATATCATTTGGATCAGAAACACTTTCATATATTGTGTGTTAGTGATTATAAAAGTTCCTCACTTATGCTGAAGCGAAAACCGAACGAAAAGAACAGACTGTTAAAGCTACAGTTAATGATAAATTATTCTGTTGTGAAATGATATCATTTTATTCACATAACAGCGTGAAATCATTCTTGTACTACGAGCGTGGGTGGGTTAATCCATTTTTATTGGCCATGTTGTAGGTGCTAGATGCATTGGTCAGCCTCTTTTGT
CGTTCAAATATATCTGCAGAAATTTTGTGGGATGGCGGGTGGCTTCTGCGGCAGTTGTTACCTTATAGTGAGGCAGAGTTTAACAGTCATCATCTAAAATTGCTGAAAGTAAGTTATATATGGAAGAGGAATGAGAAAGCACATTCTATTTCTGTTATTGATTCCTTGAAGGAAATGCATCATGATCATTGCTTGTATCTGTAAAAGAAAATGTTTAGCCTTGTTTTTTCTGATCTTTTTCTCTTTGATATCTAATAGGATTCATACAAGTACTGGGCTACTGAGCTCTTACAGGAAGCTAGAGGAATTTGGTCTGATTTCCTCGTAATAATTCTTTCTGACAAGTGGAAAAAGTGCAAAAGAGGTACTTAATGATGAATGATTAGCTAAGAAGCACAAATACGAATTTGAGACATGGATACAATACAACACGTACTAGAAAGAAATTCAAAATCAATAAATTTATGCATTTATACGCTAAAGTTGAGTGTTAATATATCTCACTCTGATATGATATGGAATCATTACAAGGGAGTAAGATTCGGATTACCTCGTTGATTAAATATCTCAAGGCAAGAACATTTGTTTGAGATTCGAATCACTCCACAAGCAAGATTGATCATGTCTAGCTTGAATGGTTCTTGTTGATTAAATATCTCAAGGAAAAAATACTTGTTTGAGATTCGAATCACTCCACAAGCAAGATTGATCATAGATTGATCATGTCTAGCTTGAGTGATTCTACTTGCAACCTAAACTACATAGAATTGCAAATAAACTTAGCCATTGGTTGAAAGCACAAATGCTCCTTTTACTATCTTTTCCAAGTCTCCCTTACAATAATAACATACATGGCTTTACGACTATAATTAGCCTAATTAGCCACTATATAAATAAGCCTTAAAATACATTAAAGAAACACCATAACTCTAAATTACTCTAAATTATGATCCACCCAAAATTTATAACAATGAAACTTGATTCTTCTTTAATGTGACATGAATTGAAACATCTTTTGATAACTTTGACAATATTTTCTTCCCATCATCCTTGAAGTTTATATTACATGATTGATGTCTTGGTTCATATCACATTCTCTAAATTATTACTTTTGTCAACCTGTATTGATTTTGGATCATTAGTGTCTAGGACATATTTGTTGTGTTTACGAGTGTACGATATGAGTCCAACAAGTTTCAGAGTGTCCAAGTGTCTGACACATGTCAGACACGGACTTAGATGATAAATGTATTAAGAGTTTTATGCGTGCTGAACCAATAAACTTTTTTAGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAGTGCATGCTCTTGTATTCTGCAAAGGCTTCTGTCGTAGGTAAGACTGCTGACTAGTTATAAGATTCATAGTTTGTGATTGATTTCCTTAAATTCAACTTCTTGTCCATACAGATGCTGTTCCACCCGAATCATCGCTCGCTGCTGGTCAAAGAATGTCCGAGTTGGTAAAGGTGTCTCTCGTCCCCCTTCCCTCTTCGATTAACTATTTTGTGTGAAGTTTCTTTTAACAATATCTTCCCGCCTTTCATTCAAGGTATTTGTTCTTCTACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAACCCTTTATGGACCCTCCCTCAGAAGCTTCTGAATGCTCCCGTGCAAAGGTTGCTGGGCTCGATGCTTCGGGACCTAAACCGGGTGCGGAGTTGAGACTTGGTTAGTTTTAAACTTTGAATTGCTTACTTTAGGTGTTTTACACAATTTTATATGGCAAAGTTGTGTAGGATTCGAACGTATTTTTAGTTCAATCACGTCGGCTGCTGTGAGTTTGTGAATAAACCTCCATAGTTGTTGCTTTCTAGATGGATCTGTGCCTTGTAGAATTTCATTTGAGAGAGGCAAAGAGCGCCATTTTTACTTTCTTGGAACTTCCATGGGAACTTCCGGATGGATAATTCTTGCTGAAGAACTGCCATC
AAAACTGAATTGTGGAATTATTCGAGTTGCTGCACCTCTTGCTGGATCAAATGTAAGAGTGAACGAACTCTCGATTTCGAAAAAATTGATCTTTGGCCAATCTTTAATCTTTCCTTGCTTGGATGAGAACACTTTTTTTGGTTCTTATCAGCCTAGAATGGATGAAAAGCATCCAAGATGGCTGCATTTGAGGATTCGTCCATCAACTTTACCCTTTTTGGATCATCCTGCTAAATATGGTACCCCCTTAAACTTAAAGACAAAGCCTTTTGTGGATGGGAGATGGATCCTGGCATTCCAGGACGACAATACTTGCAAATCCGCTTTATCTATGGTTTTGGAGGAGATTAATCTGCAAAGCCAGGAGGTCGAGAGACGACTTAAACCATTGATTGACCTCGAAAGAGCTGTAGATTCGTCCAAGATGCATCGTTAAGTTCTACTAAGTAAGTCATTGACTTGTGTGTGTGAATGTATTAGCTCAGAAATGCAAAGGGAAAAGGAAAGTAGTTTTAGGATGTGAGAGTTTGTATCATTGTTTGTTTGTATAGTAATAGTAAGTATTCATTCATTTGTTGATCTGTAACCATCGGCTTCTGCTCTCTCCCTCATTGTGTAATCAGTATAATAAACATTAGTATATAATTGATTGATTGAACTTTAAGGTTTTCTTTCG
mRNA sequence
ATGGGAATTCAAATTGGAGCTGTCACTTCTCTATATTTACTTTGTTGCATTTTGCGCATAGTTAAAATAAAAGATCTGGCAAACACCATCTCTACTGCCTTTTTTTGTCCATTGGACTCTTTCTCCCCACATTGTGAAGGCAGACTGATCGAAAATATGAATTGGTTATCTTGTGCAAATAGAAGCCAGTCATCAGGAAGTGATAGCATTGTAAGGCAGCCCTTGGATGCCGAGTCTTTAAGAAAAGAAGTATTATATTCTTCTACTCCTAAAACTGAGTTAGAAGGTGCGTCTATGAAAAATGGTTTTCGAGGCTCCTGCTTGGATTTGAGGGAAGCTTTGCTTTCTCATATAACAACTGGGGACGATGTAGAAGTCTTGGGTGCTCTAAGTGTTCTGGCTACACTATTGCAGACTGAAGGTCAGATCAATGCAGTTATTCAACTTCCTTCACTCCGTTATTCTTTTCTTATCATGCATCCCATCTCTGTAGAGCTGGACGAATCAATGCTGGATGCGCTTGCAATCCTTCCTCAAAGAAAACAACATAAGAAATTGTTATTGGAAGCCTTAGTTGGTGAGGATTCTGGCGAACAACAACTCTTTTCTTCAGAAAACGCCTCATCGAAAGGTGGCGTCAATGTTGAACTTGATGGTTACCTAAAGAAGCTTAAGGATTATGGCATTTCATATTTTCTTAAAGTAGGTGCAAGCCCTCGTGCCCTTAGGATTCATACAAGTACTGGGCTACTGAGCTCTTACAGGAAGCTAGAGGAATTTGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAGTGCATGCTCTTGTATTCTGCAAAGGCTTCTGTCGTAGATGCTGTTCCACCCGAATCATCGCTCGCTGCTGGTCAAAGAATGTCCGAGTTGGTAAAGGTATTTGTTCTTCTACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAACCCTTTATGGACCCTCCCTCAGAAGCTTCTGAATGCTCCCGTGCAAAGGTTGCTGGGCTCGATGCTTCGGGACCTAAACCGGGTGCGGAGTTGAGACTTGATGGATCTGTGCCTTGTAGAATTTCATTTGAGAGAGGCAAAGAGCGCCATTTTTACTTTCTTGGAACTTCCATGGGAACTTCCGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTCGAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATGGATGAAAAGCATCCAAGATGGCTGCATTTGAGGATTCGTCCATCAACTTTACCCTTTTTGGATCATCCTGCTAAATATGGTACCCCCTTAAACTTAAAGACAAAGCCTTTTGTGGATGGGAGATGGATCCTGGCATTCCAGGACGACAATACTTGCAAATCCGCTTTATCTATGGTTTTGGAGGAGATTAATCTGCAAAGCCAGGAGGTCGAGAGACGACTTAAACCATTGATTGACCTCGAAAGAGCTGTAGATTCGTCCAAGATGCATCGTTAAGTTCTACTAAGTAAGTCATTGACTTGTGTGTGTGAATGTATTAGCTCAGAAATGCAAAGGGAAAAGGAAAGTAGTTTTAGGATGTGAGAGTTTGTATCATTGTTTGTTTGTATAGTAATAGTAAGTATTCATTCATTTGTTGATCTGTAACCATCGGCTTCTGCTCTCTCCCTCATTGTGTAATCAGTATAATAAACATTAGTATATAATTGATTGATTGAACTTTAAGGTTTTCTTTCG
Coding sequence (CDS)
ATGGGAATTCAAATTGGAGCTGTCACTTCTCTATATTTACTTTGTTGCATTTTGCGCATAGTTAAAATAAAAGATCTGGCAAACACCATCTCTACTGCCTTTTTTTGTCCATTGGACTCTTTCTCCCCACATTGTGAAGGCAGACTGATCGAAAATATGAATTGGTTATCTTGTGCAAATAGAAGCCAGTCATCAGGAAGTGATAGCATTGTAAGGCAGCCCTTGGATGCCGAGTCTTTAAGAAAAGAAGTATTATATTCTTCTACTCCTAAAACTGAGTTAGAAGGTGCGTCTATGAAAAATGGTTTTCGAGGCTCCTGCTTGGATTTGAGGGAAGCTTTGCTTTCTCATATAACAACTGGGGACGATGTAGAAGTCTTGGGTGCTCTAAGTGTTCTGGCTACACTATTGCAGACTGAAGGTCAGATCAATGCAGTTATTCAACTTCCTTCACTCCGTTATTCTTTTCTTATCATGCATCCCATCTCTGTAGAGCTGGACGAATCAATGCTGGATGCGCTTGCAATCCTTCCTCAAAGAAAACAACATAAGAAATTGTTATTGGAAGCCTTAGTTGGTGAGGATTCTGGCGAACAACAACTCTTTTCTTCAGAAAACGCCTCATCGAAAGGTGGCGTCAATGTTGAACTTGATGGTTACCTAAAGAAGCTTAAGGATTATGGCATTTCATATTTTCTTAAAGTAGGTGCAAGCCCTCGTGCCCTTAGGATTCATACAAGTACTGGGCTACTGAGCTCTTACAGGAAGCTAGAGGAATTTGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAGTGCATGCTCTTGTATTCTGCAAAGGCTTCTGTCGTAGATGCTGTTCCACCCGAATCATCGCTCGCTGCTGGTCAAAGAATGTCCGAGTTGGTAAAGGTATTTGTTCTTCTACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAACCCTTTATGGACCCTCCCTCAGAAGCTTCTGAATGCTCCCGTGCAAAGGTTGCTGGGCTCGATGCTTCGGGACCTAAACCGGGTGCGGAGTTGAGACTTGATGGATCTGTGCCTTGTAGAATTTCATTTGAGAGAGGCAAAGAGCGCCATTTTTACTTTCTTGGAACTTCCATGGGAACTTCCGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTCGAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATGGATGAAAAGCATCCAAGATGGCTGCATTTGAGGATTCGTCCATCAACTTTACCCTTTTTGGATCATCCTGCTAAATATGGTACCCCCTTAAACTTAAAGACAAAGCCTTTTGTGGATGGGAGATGGATCCTGGCATTCCAGGACGACAATACTTGCAAATCCGCTTTATCTATGGTTTTGGAGGAGATTAATCTGCAAAGCCAGGAGGTCGAGAGACGACTTAAACCATTGATTGACCTCGAAAGAGCTGTAGATTCGTCCAAGATGCATCGTTAA
Protein sequence
MGIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLSSYRKLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
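The protein sequence above appears to be the translation of the CDS under the standard genetic code. The sketch below is a minimal illustration, not the database's own pipeline: the `translate` helper and the `CODON_TABLE` name are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGGAATTCAAATTGGAGCT"))  # MGIQIGA
# Last five codons of the CDS, ending in the TAA stop codon.
print(translate("AAGATGCATCGTTAA"))  # KMHR
```

Both outputs agree with the start (MGIQIGA...) and end (...KMHR) of the listed protein sequence.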
Homology
BLAST of CmoCh05G007590 vs. ExPASy Swiss-Prot
Match:
Q8W4P9 (Protein TRANSPARENT TESTA 9 OS=Arabidopsis thaliana OX=3702 GN=TT9 PE=2 SV=1)
HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-109
Identity = 262/566 (46.29%), Positives = 325/566 (57.42%), Query Frame = 0
Query: 3 IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRS 62
I + VTSLYLL CILRIVKIKDLAN + FCP+ +F L++ + L+ +
Sbjct: 299 ISVDPVTSLYLLSCILRIVKIKDLANMTAATLFCPVKAF---ISSSLVKPNSSLAPEGLT 358
Query: 63 QSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMKNGFRGSCLDLREALLSHITT 122
+G D V + + + S + L + K+ F S + RE LL +I+
Sbjct: 359 YVNGHPDKGVTEEANQQCSSTAAGMSDDGNSHLCSEDTPKSIFNNSHMTFRETLLQYISE 418
Query: 123 GDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQR 182
GDDV+ G+L VLATLLQT+ EL+ESMLDA ILPQR
Sbjct: 419 GDDVQAQGSLFVLATLLQTK------------------------ELEESMLDAFGILPQR 478
Query: 183 KQHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-S 242
KQHKKLLL++LVGED+GE+QLFS N S + G++ ELD YL++L++ +G+ L A
Sbjct: 479 KQHKKLLLQSLVGEDTGEEQLFSPRNGSMRDGLSSELDWYLRRLEEQFGVCCSLPGAARC 538
Query: 243 PRALR---IHTSTGLLS-------------------------------------SYRK-- 302
PR R + T LL SY K
Sbjct: 539 PRVHRHQVVDTLVTLLCRENISAETLWDGGWLLRQLLPYSEAEFNRKHLKMLNVSYEKCK 598
Query: 303 -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESS 362
L+E+ IEAPSP+KEPK +LL ++S D ESS
Sbjct: 599 NSLTREIKGIWPDLLIRVLLDEWRKCKRVIEAPSPQKEPKSVLLQLDRSSSNDNSVSESS 658
Query: 363 LAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPG 422
AG+RM E+VKVFVLLHQLQ FSLG++L EQP + PP++ SE SRA AGLD S PKPG
Sbjct: 659 FTAGERMCEVVKVFVLLHQLQIFSLGRSLPEQPPIYPPADRSETSRATRAGLDVSVPKPG 718
Query: 423 AELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGS 482
EL+L +VPCRI+FERGKER F FL S G SGWI+LA+ + GI+RV APLAG
Sbjct: 719 TELKLVDAVPCRIAFERGKERDFSFLALSSGESGWIVLADP-----DNGIVRVTAPLAGC 778
Query: 483 NPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSAL 499
PR+DEKHPRWLHLRIRPSTLP LD P K G LK+K VDGRWILAF+DD +C SA
Sbjct: 779 KPRIDEKHPRWLHLRIRPSTLPLLD-PTKRGVYEKLKSKGLVDGRWILAFRDDESCHSAY 831
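The Identity and Positives figures in the hit header above are match counts divided by the alignment length (566 columns, gaps included). A minimal sketch of that arithmetic follows; the `percent` helper is a hypothetical name introduced only for illustration.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the Q8W4P9 (TT9) hit header above.
print(percent(262, 566))  # 46.29 (identical residues)
print(percent(325, 566))  # 57.42 (identical plus conservative substitutions)
```

Positives is never smaller than Identity, because every identical column also counts as a positive-scoring column.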
BLAST of CmoCh05G007590 vs. ExPASy TrEMBL
Match:
A0A6J1ENM5 (uncharacterized protein LOC111436147 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111436147 PE=3 SV=1)
HSP 1 Score: 872.8 bits (2254), Expect = 6.7e-250
Identity = 470/572 (82.17%), Positives = 473/572 (82.69%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 174 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 233
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 234 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 293
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 294 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 353
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 354 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 413
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 414 LRFVVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 473
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 474 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 533
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Sbjct: 534 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 593
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 594 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 653
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV
Sbjct: 654 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 713
BLAST of CmoCh05G007590 vs. ExPASy TrEMBL
Match:
A0A6J1EN85 (uncharacterized protein LOC111436147 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436147 PE=3 SV=1)
HSP 1 Score: 872.8 bits (2254), Expect = 6.7e-250
Identity = 470/572 (82.17%), Positives = 473/572 (82.69%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 474 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 533
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 534 LRFVVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 593
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 594 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 653
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Sbjct: 654 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 713
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 714 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 773
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV
Sbjct: 774 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 833
BLAST of CmoCh05G007590 vs. ExPASy TrEMBL
Match:
A0A6J1ESN2 (uncharacterized protein LOC111436147 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436147 PE=3 SV=1)
HSP 1 Score: 845.5 bits (2183), Expect = 1.2e-241
Identity = 457/552 (82.79%), Positives = 461/552 (83.51%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASP 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLK D +S F + S
Sbjct: 474 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKVLDALVSLFCRSNISA 533
Query: 242 RAL------------------RIHTSTGLLSSYR-------------------------- 301
L H L SY+
Sbjct: 534 EILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATELLQEARGIWSDFLVIILSDKW 593
Query: 302 KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF
Sbjct: 594 KKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 653
Query: 362 SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 421
SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF
Sbjct: 654 SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 713
Query: 422 YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF 481
YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Sbjct: 714 YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF 773
Query: 482 LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID 508
LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID
Sbjct: 774 LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID 821
BLAST of CmoCh05G007590 vs. ExPASy TrEMBL
Match:
A0A6J1K759 (uncharacterized protein LOC111491772 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491772 PE=3 SV=1)
HSP 1 Score: 815.8 bits (2106), Expect = 9.8e-233
Identity = 441/571 (77.23%), Positives = 455/571 (79.68%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLI NMNWL CANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISAAFFCPLDSFSPHCEGRLIGNMNWLCCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLD ESLRKEVLYSS PKTELEGAS KNG RGS LDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDTESLRKEVLYSSAPKTELEGASTKNGCRGSRLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDAL ILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALGILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALV EDSGEQQLFSSENASSKGG++VE+DGYLKKLKDYGISYFLKVGASPRA
Sbjct: 474 QHKKLLLEALVSEDSGEQQLFSSENASSKGGIDVEIDGYLKKLKDYGISYFLKVGASPRA 533
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 534 LRFEVLDALVSLFYRSNISAEILWDGGWLLRQLLPYSDAEFNSHHLKLLKDSYKYWATEL 593
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSP KEPKC+LLYSAKASVVDAVPPESSLAA
Sbjct: 594 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPMKEPKCILLYSAKASVVDAVPPESSLAA 653
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQLQSFSLGKAL EQPFMDPPSE SECSRAKVAGLDASGPKPGAEL
Sbjct: 654 GQRMSELVKVFVLLHQLQSFSLGKALPEQPFMDPPSEVSECSRAKVAGLDASGPKPGAEL 713
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYF+GTSMGTSGWIILA+ELPSKLN GIIRVAAPLAGSNPR
Sbjct: 714 RLDGSVPCRISFERGKERHFYFIGTSMGTSGWIILADELPSKLNRGIIRVAAPLAGSNPR 773
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 507
+DEKHPRWLHLRIRPSTLPFLDHPAKYGT LNLKT+PFVDGRWILAFQDD+TCKSALSMV
Sbjct: 774 IDEKHPRWLHLRIRPSTLPFLDHPAKYGTLLNLKTEPFVDGRWILAFQDDDTCKSALSMV 833
BLAST of CmoCh05G007590 vs. ExPASy TrEMBL
Match:
A0A6J1K559 (uncharacterized protein LOC111491772 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111491772 PE=3 SV=1)
HSP 1 Score: 787.3 bits (2032), Expect = 3.7e-224
Identity = 428/551 (77.68%), Positives = 443/551 (80.40%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLI NMNWL CANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISAAFFCPLDSFSPHCEGRLIGNMNWLCCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLD ESLRKEVLYSS PKTELEGAS KNG RGS LDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDTESLRKEVLYSSAPKTELEGASTKNGCRGSRLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDAL ILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALGILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASP 241
QHKKLLLEALV EDSGEQQLFSSENASSKGG++VE+DGYLKKLK D +S F + S
Sbjct: 474 QHKKLLLEALVSEDSGEQQLFSSENASSKGGIDVEIDGYLKKLKVLDALVSLFYRSNISA 533
Query: 242 RAL------------------RIHTSTGLLSSYR-------------------------- 301
L H L SY+
Sbjct: 534 EILWDGGWLLRQLLPYSDAEFNSHHLKLLKDSYKYWATELLQEARGIWSDFLVIILSDKW 593
Query: 302 KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 361
K + AIEAPSP KEPKC+LLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF
Sbjct: 594 KKCKRAIEAPSPMKEPKCILLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 653
Query: 362 SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 421
SLGKAL EQPFMDPPSE SECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF
Sbjct: 654 SLGKALPEQPFMDPPSEVSECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 713
Query: 422 YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF 481
YF+GTSMGTSGWIILA+ELPSKLN GIIRVAAPLAGSNPR+DEKHPRWLHLRIRPSTLPF
Sbjct: 714 YFIGTSMGTSGWIILADELPSKLNRGIIRVAAPLAGSNPRIDEKHPRWLHLRIRPSTLPF 773
Query: 482 LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID 507
LDHPAKYGT LNLKT+PFVDGRWILAFQDD+TCKSALSMVLEEINLQS EVERRLKPL+D
Sbjct: 774 LDHPAKYGTLLNLKTEPFVDGRWILAFQDDDTCKSALSMVLEEINLQSLEVERRLKPLVD 820
BLAST of CmoCh05G007590 vs. NCBI nr
Match:
KAG7029854.1 (hypothetical protein SDJN02_08197, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 899.4 bits (2323), Expect = 1.4e-257
Identity = 483/572 (84.44%), Positives = 488/572 (85.31%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 27 GIQIGAVTSLYLLCCILRIVKIKDLANTISAAFFCPLDSFSPHCEGRLIENMNWLSCANR 86
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRK VLYSSTPKTELEGAS KNG RGS LDLREALLSHITTG
Sbjct: 87 SQSSGSDSIVRQPLDAESLRK-VLYSSTPKTELEGASTKNGCRGSRLDLREALLSHITTG 146
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDAL ILPQRK
Sbjct: 147 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALGILPQRK 206
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 207 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 266
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 267 LRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 326
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 327 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 386
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQL SFSLGKALSEQPFMDPPSEASECSRAKVAGLDAS PKPGAEL
Sbjct: 387 GQRMSELVKVFVLLHQLLSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASEPKPGAEL 446
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 447 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 506
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
+DEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTK FVDGRWILAFQDD+TCKSALSMV
Sbjct: 507 IDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKRFVDGRWILAFQDDDTCKSALSMV 566
BLAST of CmoCh05G007590 vs. NCBI nr
Match:
XP_022929612.1 (uncharacterized protein LOC111436147 isoform X1 [Cucurbita moschata])
HSP 1 Score: 872.8 bits (2254), Expect = 1.4e-249
Identity = 470/572 (82.17%), Positives = 473/572 (82.69%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 474 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 533
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 534 LRFVVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 593
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 594 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 653
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Sbjct: 654 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 713
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 714 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 773
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV
Sbjct: 774 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 833
BLAST of CmoCh05G007590 vs. NCBI nr
Match:
XP_022929615.1 (uncharacterized protein LOC111436147 isoform X4 [Cucurbita moschata] >XP_022929616.1 uncharacterized protein LOC111436147 isoform X4 [Cucurbita moschata])
HSP 1 Score: 872.8 bits (2254), Expect = 1.4e-249
Identity = 470/572 (82.17%), Positives = 473/572 (82.69%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 174 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 233
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 234 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 293
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 294 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 353
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 354 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 413
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 414 LRFVVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 473
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 474 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 533
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Sbjct: 534 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 593
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 594 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 653
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV
Sbjct: 654 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 713
BLAST of CmoCh05G007590 vs. NCBI nr
Match:
XP_022929613.1 (uncharacterized protein LOC111436147 isoform X2 [Cucurbita moschata])
HSP 1 Score: 845.5 bits (2183), Expect = 2.4e-241
Identity = 457/552 (82.79%), Positives = 461/552 (83.51%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDALAILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALAILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASP 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLK D +S F + S
Sbjct: 474 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKVLDALVSLFCRSNISA 533
Query: 242 RAL------------------RIHTSTGLLSSYR-------------------------- 301
L H L SY+
Sbjct: 534 EILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATELLQEARGIWSDFLVIILSDKW 593
Query: 302 KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF
Sbjct: 594 KKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSF 653
Query: 362 SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 421
SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF
Sbjct: 654 SLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHF 713
Query: 422 YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF 481
YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Sbjct: 714 YFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF 773
Query: 482 LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID 508
LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID
Sbjct: 774 LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLID 821
BLAST of CmoCh05G007590 vs. NCBI nr
Match:
KAG6598900.1 (Protein TRANSPARENT TESTA 9, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 842.4 bits (2175), Expect = 2.0e-240
Identity = 459/572 (80.24%), Positives = 465/572 (81.29%), Query Frame = 0
Query: 2 GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANR 61
GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLIENMNWLSCANR
Sbjct: 294 GIQIGAVTSLYLLCCILRIVKIKDLANTISAAFFCPLDSFSPHCEGRLIENMNWLSCANR 353
Query: 62 SQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDLREALLSHITTG 121
SQSSGSDSIVRQPLDAESLRK VLYSSTPKTELEGAS KNG RGS LDLREALLSHITTG
Sbjct: 354 SQSSGSDSIVRQPLDAESLRK-VLYSSTPKTELEGASTKNGCRGSRLDLREALLSHITTG 413
Query: 122 DDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRK 181
DDVEVLGALSVLATLLQTE ELDESMLDAL ILPQRK
Sbjct: 414 DDVEVLGALSVLATLLQTE------------------------ELDESMLDALGILPQRK 473
Query: 182 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 241
QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA
Sbjct: 474 QHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRA 533
Query: 242 LRIHTSTGLLS----------------------------------------SYR------ 301
LR L+S SY+
Sbjct: 534 LRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKDSYKYWATEL 593
Query: 302 --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 361
K + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA
Sbjct: 594 LQEARGIWSDFLVIILSDKWKKCKRAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAA 653
Query: 362 GQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL 421
GQRMSELVKVFVLLHQL SFSLGKALSEQPFMDPPSEASECSRAKVAGLDAS PKPGAEL
Sbjct: 654 GQRMSELVKVFVLLHQLLSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASEPKPGAEL 713
Query: 422 RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 481
RLDGSVPCRISFERGKER+FYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR
Sbjct: 714 RLDGSVPCRISFERGKERYFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR 773
Query: 482 MDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMV 508
+DEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDD+TCKSALSMV
Sbjct: 774 IDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDDTCKSALSMV 833
BLAST of CmoCh05G007590 vs. TAIR 10
Match:
AT3G28430.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function FPL (InterPro:IPR019155); Has 243 Blast hits to 233 proteins in 101 species: Archae - 0; Bacteria - 0; Metazoa - 110; Fungi - 0; Plants - 53; Viruses - 0; Other Eukaryotes - 80 (source: NCBI BLink). )
HSP 1 Score: 398.3 bits (1022), Expect = 9.4e-111
Identity = 262/566 (46.29%), Positives = 325/566 (57.42%), Query Frame = 0
Query: 3 IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRS 62
I + VTSLYLL CILRIVKIKDLAN + FCP+ +F L++ + L+ +
Sbjct: 299 ISVDPVTSLYLLSCILRIVKIKDLANMTAATLFCPVKAF---ISSSLVKPNSSLAPEGLT 358
Query: 63 QSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMKNGFRGSCLDLREALLSHITT 122
+G D V + + + S + L + K+ F S + RE LL +I+
Sbjct: 359 YVNGHPDKGVTEEANQQCSSTAAGMSDDGNSHLCSEDTPKSIFNNSHMTFRETLLQYISE 418
Query: 123 GDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQR 182
GDDV+ G+L VLATLLQT+ EL+ESMLDA ILPQR
Sbjct: 419 GDDVQAQGSLFVLATLLQTK------------------------ELEESMLDAFGILPQR 478
Query: 183 KQHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-S 242
KQHKKLLL++LVGED+GE+QLFS N S + G++ ELD YL++L++ +G+ L A
Sbjct: 479 KQHKKLLLQSLVGEDTGEEQLFSPRNGSMRDGLSSELDWYLRRLEEQFGVCCSLPGAARC 538
Query: 243 PRALR---IHTSTGLLS-------------------------------------SYRK-- 302
PR R + T LL SY K
Sbjct: 539 PRVHRHQVVDTLVTLLCRENISAETLWDGGWLLRQLLPYSEAEFNRKHLKMLNVSYEKCK 598
Query: 303 -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESS 362
L+E+ IEAPSP+KEPK +LL ++S D ESS
Sbjct: 599 NSLTREIKGIWPDLLIRVLLDEWRKCKRVIEAPSPQKEPKSVLLQLDRSSSNDNSVSESS 658
Query: 363 LAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPG 422
AG+RM E+VKVFVLLHQLQ FSLG++L EQP + PP++ SE SRA AGLD S PKPG
Sbjct: 659 FTAGERMCEVVKVFVLLHQLQIFSLGRSLPEQPPIYPPADRSETSRATRAGLDVSVPKPG 718
Query: 423 AELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGS 482
EL+L +VPCRI+FERGKER F FL S G SGWI+LA+ + GI+RV APLAG
Sbjct: 719 TELKLVDAVPCRIAFERGKERDFSFLALSSGESGWIVLADP-----DNGIVRVTAPLAGC 778
Query: 483 NPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSAL 499
PR+DEKHPRWLHLRIRPSTLP LD P K G LK+K VDGRWILAF+DD +C SA
Sbjct: 779 KPRIDEKHPRWLHLRIRPSTLPLLD-PTKRGVYEKLKSKGLVDGRWILAFRDDESCHSAY 831
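Each HSP summary in the alignments above gives identity and positives both as a ratio over the alignment length and as a percentage. The following is a minimal sketch, not part of the database output, of how those two lines could be parsed and the percentages recomputed from the ratios. The names `HSP_RE` and `parse_hsp` are hypothetical, and the pattern assumes the exact two-line layout shown on this page.

```python
import re

# Hypothetical pattern for the two-line HSP summary shown on this page.
# "Posi?tives" accepts either spelling of the label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\), "
    r"Posi?tives = (?P<pos>\d+)/(?P<pos_len>\d+) \((?P<pos_pct>[\d.]+)%\)"
)


def parse_hsp(text):
    """Parse one HSP summary and recompute its percentages from the ratios."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident, length, pos = int(m["ident"]), int(m["length"]), int(m["pos"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "alignment_length": length,
        # Recomputed from the ratio, rounded the same way as the report.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(m["ident_pct"]),
        "reported_positives_pct": float(m["pos_pct"]),
    }


# Summary of the AT3G28430.1 hit, copied from the TAIR 10 alignment above.
sample = (
    "HSP 1 Score: 398.3 bits (1022), Expect = 9.4e-111\n"
    "Identity = 262/566 (46.29%), Positives = 325/566 (57.42%), Query Frame = 0"
)
hsp = parse_hsp(sample)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed and reported percentages is a cheap consistency check when these summaries are copied between tools.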
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8W4P9 | 1.3e-109 | 46.29 | Protein TRANSPARENT TESTA 9 OS=Arabidopsis thaliana OX=3702 GN=TT9 PE=2 SV=1
Match Name | E-value | Identity | Description
A0A6J1ENM5 | 6.7e-250 | 82.17 | uncharacterized protein LOC111436147 isoform X4 OS=Cucurbita moschata OX=3662 GN...
A0A6J1EN85 | 6.7e-250 | 82.17 | uncharacterized protein LOC111436147 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1ESN2 | 1.2e-241 | 82.79 | uncharacterized protein LOC111436147 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K759 | 9.8e-233 | 77.23 | uncharacterized protein LOC111491772 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1K559 | 3.7e-224 | 77.68 | uncharacterized protein LOC111491772 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
KAG7029854.1 | 1.4e-257 | 84.44 | hypothetical protein SDJN02_08197, partial [Cucurbita argyrosperma subsp. argyro...
XP_022929612.1 | 1.4e-249 | 82.17 | uncharacterized protein LOC111436147 isoform X1 [Cucurbita moschata]
XP_022929615.1 | 1.4e-249 | 82.17 | uncharacterized protein LOC111436147 isoform X4 [Cucurbita moschata] >XP_0229296...
XP_022929613.1 | 2.4e-241 | 82.79 | uncharacterized protein LOC111436147 isoform X2 [Cucurbita moschata]
KAG6598900.1 | 2.0e-240 | 80.24 | Protein TRANSPARENT TESTA 9, partial [Cucurbita argyrosperma subsp. sororia]
Match Name | E-value | Identity | Description
AT3G28430.1 | 9.4e-111 | 46.29 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
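The summary rows above are pipe-delimited, so they can be loaded for sorting or filtering with a few lines of code. The sketch below is illustrative only: `parse_hit_row` is a hypothetical helper, and it assumes the first four cells of each row are match name, E-value, percent identity and description, with any further trailing cells ignored.

```python
def parse_hit_row(row):
    """Split one pipe-delimited summary row into typed fields.

    Only the first four cells are used; trailing cells, if any, are ignored.
    """
    fields = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),
        "identity_pct": float(identity),
        "description": description,
    }


# Rows copied from the summary tables above, including one header row.
rows = [
    "Match Name | E-value | Identity | Description",
    "Q8W4P9 | 1.3e-109 | 46.29 | Protein TRANSPARENT TESTA 9 OS=Arabidopsis thaliana OX=3702 GN=TT9 PE=2 SV=1",
    "XP_022929613.1 | 2.4e-241 | 82.79 | uncharacterized protein LOC111436147 isoform X2 [Cucurbita moschata]",
    "KAG6598900.1 | 2.0e-240 | 80.24 | Protein TRANSPARENT TESTA 9, partial [Cucurbita argyrosperma subsp. sororia]",
]

# Skip the repeated header rows, then parse the remaining hits.
hits = [parse_hit_row(r) for r in rows if not r.startswith("Match Name")]

# The smallest E-value marks the most significant hit among the sampled rows.
best = min(hits, key=lambda h: h["evalue"])
print(best["match"], best["evalue"], best["identity_pct"])
```

E-values from different target databases are not directly comparable, so in practice the ranking would be done within one table at a time.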