Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTCAAGTCACATTTTATGATAACTAATGGAATAGGGATTGAATGTCACTTTCCACAAAAGTTTAGCTCAACCAAGGTTAGAAGTTGAAAGGAACTTAGCTCAAAGCTACAAAGTTGGTTTCTCCATCTCAATTTGGCCATAGATATGAGCATGACCACAGAAATTTTGGCGCTGAAGGAAGAATGATACAGCGTCTTCAACTCAAGGCTCTACATTTATGGCCTACTCTTCGCTCTTCGTCCTCTCCCCATCTCCATTCTCAAACCCTCAACCTAAGTTCCTCTTCTTCTCTTCAATACACGCCTTGGTCTGGTCTCAAAGCTTGGAAACAGAGTCCCCTTAATGAGAATCGATTCTGGGGACCCAATGGACCAGAGCCTCTGCTTGAATCTTCATCAACTGGGGTTTTCTTTGATAGCCGAATCGAATCGGCTTCGTCTCTTGCGGAATTGGGTGCATTGGTTCTCTCTACAAGTGACCCTTTAACCAAATCCAAACTCTCTCATCTTGCTTACTCCAGATGGTCTCAGGAAGGTCTTCCCATTGGCGTTTTCGAAGCTCCTTCTCATCCTGCTCGACCTTCGTTGCCGAAATTGGTTTGTTTTCTATTTTCCGGTGGATGTTTTTGTCTATGGAAAATGTTTGATTCAATTCTTCGAAGAAACTGCTATTTTGTTTGCTTGATGTTGAACTTGCACGGGGTTATGAATTTGATCAGTTATTGTGATTAGATCATTTCATAGAAACAGTAATCACCTTTTTGAAATCTTCCGTGTTAATGTATTTCTTGACCATAGTTTCACATTGATTGGATAAAGGGATCATAATGGGTATAAAATGATGATAACTATCTCCCTTGGTATGCGGCCAACATTACTTTTGCTCTTGGGATTGAAAACAAAAGCAAAGCTATAAGAGTTTATGCCGTGGACAGTATCATACCGTGGCGGGAGATATGTGGAGGGTTCACTGTCACTAACAATTTCAATCATTTATTCGGAGAATTGTTCTTTCCATCTCTTCTTGCTTTTGAGTTATGGTATGTTGGAGTGAAAATACGCAGTTCAATTGTTAATTTTACACTTTTAAATATATTTTTCATACTACTAAAATTTATTTTGGAATGATCTTGAGCATTTTTAAGTGATTTTGAAAATTACTCCGAAACATGCCATTAGTTGTTCAGATGAAGTTTTTGATTTTAAATAGTATACTTTTTTTTCATGGAGAAGAATGAAAAAATATTTTATTAAGTTGTCTTTTAATCGTTTGTCAGGTCTCTCCAAAGGAAATTCCGGCTCCTAAAAACTCAGGATTACCTCTGAATGCTTATATGTTACACAATCTTGCTCATGTTGAGCTTAATGCAATTGATTTGGCATGGGACACTGTCGTTCGATTTTCTCTCTTCAGTGATGTTCTTGGGGAGGGTTTTTTTGCTGACTTTGCTCATGTTGCTGATGATGAGAGTCGCCATTTTATGTGGTGTTCACAGAGACTTGCTGAACTTGGTTTCAAGTGAGGAGATCATAAACCTTGGGTGTAATAGTTGATGTTTATTCGTTGTTACTCACCTTGATTGCTTTGCTACAAGATATGGAGATATGGCTGCTCATAATTTGCTTTGGAGGGAGTGTGAGAAATCATCCAACAATGTAGCTGCACGCTTGGCAGCAATACCGCTTGTCCAGGTAAATTTGTGTTCATGCTTGTGGTTAGTTATTTGTTTCTTTATACAATGCTTGAATCCCTTCAATTTTTCATTATTCTGATTTTGATTATCTAGAATGCAATATGTGAAAGTGGAAGGTTTAAGCAGTTCTTAAGAGAATGCAATCTTCAAAATGTAGTGTAGTAAGTAATAGGCTGCTGCTCTAGGCTTGCTTCGCTTATTCAAGCTCCACCCCTTATGTTTTTGATTAGGCATGCGTTTTGGCCTTCGTCATAAGGAAATGGTATACTAATCAATTACTCCCCTCACTCAATTAGTTTTTGGGCTTAAGTTGAACTACAAGATACCTTTATGGCTTTACCTTCTCCCTTTAGTTGGTGTCTTTTGAGTAAGTAGAGTGGAAATATGGTGATTACTTTTATCTTTGGTGAATGTACTTTGTAAACATTTGAACAGCTCATTAGGGCATCTTTTGGTCGTATAAGACCTTTCTTCTTGATGTAAGAGATTTTTTTACTTTTGCTTTTTGTAGATACCTGTTTAAGTTTAATGACACTATAGAGCTATATTGTGGGGATTATGACTAGATAGGAGAAGTAAAATTTCTAAATGGTGAAGAACCACGATGGATAACCCAGAGAACAACAGCCTAAACAACCTTTGCCTAGATGGTGGGTGACCAAACGAAATCAGGAACTATTTGTGTTGTTGGTGAAGTTTAGCCTACTTAAGAGAACCCATGGGAGAGCCTAAGCAAGCTTAGACAGAGTTTGGTTACCTATGAGGTGAGCTTAATGGAGCTCAAAGAAGCGAAGAGAGAGCTCGGTGGTTGAGCTATAGAGCCGGGCTTAAAAGAACCTAGGGGAGAGCTCAGAGGAGCTAAGAGAGAACTCATAAGTCGAGCTATAGTTGAAGAGCTAAAAGATTGACCTATAAAGGAGAGCTAAGGGGCTCATAGGTCAAGCTAGAGTTGAGGAGCTCAGAGGTCACTCTATAAAGAAGAGCTTAGAGGCTGACCTATAAAGGAAAATTGAGGAGCTCATGAGTCGAGCTATAGTTGAAGAGCTCAAAGGTCAAGCGATAGAGGAGGGCTCAAAGGTCGAGCTAAGATTCTCATAGGTTGAGCGATAGACAACATACGGTTCATAAATGTGATATTTAAGATTTGAATGGGTGGTGCACATTAAAGCACCTAGATAGATGATATAATCGTCTAAAGAGTGTTGGCTCAACCATTCCAAACTTTCAAGAGGTGCAGTTATGTGCTCAAATGGCTGTTACACTTTTTTACCTATATAAACGACTAAGACATTTGTCAAAGGTACTCTGAAAAATACCCTACTATTTCCTATTCTTTAATCACTCACAAACTCAATACTGGTTTGAGCATCGAAGTGTGTGTGGCCAGACACCACATCAGTGCTTAATCTCTTTTGTCTTGCAAACACTTGAACGATCTTCCCCCGTAATAAAAATTTATTGTTGGTGCGATCTTTGGCATCAACAATTGACACTGTTTGTGGGAACGATTTTTGTTGTGCCTACTACTGAGAATCTAAGTCGTTACTATGAGTAACATGATGGATCACCAAAACTTAGCAAAAGATGTCGGTGATGAACCTTGTTATGACACCATCTGAGAAGGACACTTGTATGAGATCAAAATGACACGTTGATAGAAATTTGTCAGCTGACGGAGAAACCTATTAATGACTATGCCTATGCTGCTTTAGTGGACTAGGTGAAAGAAATGGACCAAGATCTAACAGAGATCTTGGAGCTACTTAGGCATCCAAGCTCCGTCATAGAAGTCGTGGCTAGGGTAGCCGAGGAAGTACCCCAAGGTAGTCCAAGATGGGAGCAGCAAAGGAAAAATTCGTTGGTACTAGAAAATAAAGTTACCACTTTTGTTCTCCGAGATCAGGTTAAGGGAAAGGCTCTGGTAGATGAAGGGGTCAAAGAATCCCAAAATATAGGACCAGGATCGATGGAGAAGAGGATAGATGTCTCAAAAGGGAAATTCGTTACCAAATTGCTTTTGACACAACCACAAAATTTCCCTAAACTTCTTCAACTTGAGGAAAGGATTGCTATCTTAGAAGCCTCGTTCGAAACTTTAGGAAAACAAGTTCAAAAGGAAAGTGACTCGAGTGGCAATGACTCAATGTTGCTACAAGGCGTCGTCAAGGCAAATATAAGCATGCCTGCAAGACAATCTCTTTTCACTAAAGCATAAGATCACTCAGCTGAGGGTTGCAATCAAAGTCCAAACTCAAGTTTTGGTGCAACAGAACTTTGACAATGTAACCCTCAGATCGAGAGAACAGTTACTTCAGGTTTCTCGTACACGAGACTCAGAGATAAGACCTGAGGTCATCTAGATAGATAGTGACCACGAGGATGACTACAAGCTTGTTGAATGTTTGAAACATCTTGAAGTCAATGAGTACACATTGTGCGAAGACTGTCATGACTTTAAAGAATGTCCATGGGTGACAACCTGAGCACAGATGGACCCTACAGGTTCTTTTGGTTTCTAGATTTGCTAAAGATGAAAGCCAAGTTGCAGTATATAGAAATAAGGCATCTGAGTTGGTCTTGAAATGTTTGAAAGTTTTAAATTGTTTTCAGTTGCGCGTGCTAATTTGTCTATAGTTTCACCTAATGAAAATTTCTTTTTCATAAATGTGACATTACTCAGGCATGTTATTTTTACTGAGATAAAAAAATCAAATGTGCATGATTGGAGGCAAGTTAGGATGCATGTTATGAGTTCAATAGAGATCACCATCCTAAGTGAACCTGCTTGGGGGCAGATCATTGTATATCAAATAGGTACCCTTACATGGTGTAGAAATTGAAAAAAGAGTATGTCACCTTAGCAGCCAAGGTGAAGGGAGTAGACTAAAGCTTGAGTGAAATCCTGGAGCCACTTAAGTGTTTGAACCCACTCGTGGCATCAACAATTGGGATAAGAAATTTGATGAACTGGGAAAAAAGTATTATTGGGTAAAAGCAATAAGGAAATAAAAAAAGGGATAAGCTCAGAGAAAGGATGGAATATCGAAGATATAGAGTGGGGCATTGCCATTGTAATGATTATGATCACCAAAACTTAGAGAGATCAAAGCAATTGTATGTCAAACCCATTTCTTGTTGACATGGCCTTGCTCCAATGTTCTTGTAAGGATCAAGGAAGAGGTTGATTCTTTTGCAAGCATGAAGGACGAAATAAAATGAGGAAGCTTTCTCTTGAATTTGAAAGTTGGAATGCACAGAAGCATGAAAATGTAGAGGTAATTTCAACTTTCCAAGTTATGAAGGTTGAATCAAATTGAGGTGGAATACCTCAATTGTCACGGAAGGAGCTGATATTTGAAGGATAATTGTGGAGGATTTATGATATGGGCTCATTGAACTACTATTATGATTGATTGTTTGGAAGCCAGTTTAGAAATTAAAAGGATTTTTTTTATTAGTCCCTGCTGAAGCATCCATCGAGCATTCAAAATATGGGAGATTTGTTGTTCACTGCAACTGCAAGTTGTGGTGACTTGATGATTGAGATCGCTGCTTGAAAGAGTACAAAAATTCATGGAACTCCTTCAGATGTGGCTATGGGAACTGAAAGAATGCATAAAATTTCTTCATTCAATGTTGCTTTGAGATTGGGATATTTAGGAGAGATAAAGTCTGAGCAGTATAAAGATGAAGATGAATAGCCGGCGTGGAGGAAAGGGAGGGAAGCGAAGGGACGGGATTACTAGAAGAAATGGCTTTTGAAAATACTATGGAAGAGGAACTGTCAAATTCAAACCTAAAGGAAGTGGAAAGGGTGGATGCTAATGACATCATAAGCATGGCTGAAAGTTGAAGAGTTGGTTACCTTAAATGAGATCATAGAAAGAAGTGGAAGAAAATTAACAAATAGGCTGGAATGAAATCAGAAGGGTTTTTTGGGGGAAACAAACTGATGTAATTCAAAATTGGATGTATTTTCCAAATGAATTCTCTGCCTTCAAAGGCTAAGAGATGGGGTATTTATTAAGAATCTTGAGAGCTTGTTACAAGGTAGTTATAGGAAGGTTGTTACAAAAACAGTTATAATGGTTAACTGTTTAAACTATCTTATCTTGTTCTACTTATTACATCACAAATGGTTGTAAAGATCTAAGGAGGCTTGAAAGAATTAACTGAGATGAACATTAGTAAGAAAAGAGTCAGATGAAAGAATGGGAACAGCTTTTATGGGACTTTAATTGCTAAGTTTGCTTTGGTTGGATGGTTGTCTCCCATTGGGTCGATGAATGGATGATGGGGGGTTTGTACGGGGCAACTTCTGCAGAAAAGAAAAAAAAGGAAAAAATTTGTGGAGTTGTGCAACCCATGCTCTCTTGTGGCATCTCTGAAATAAAAGAAATAGTAGGGTGTTCAATTATAAGTTTAGCTCTTTTGATTATTTTTGGGCTCTTGTGAAGCATACGGCTTCTTGGTGGTGTACAAACTACACAAAGTTATTTTGTAATTACGGCCTTCTCATGATTATTAACAACTGGAAGGTGCTTATTGTTTAGTTTTTTTGTTAGGGGGAGGGGTCCTCAACCCCTCGCACTTTGACTGTTTTCAACATGCTGATTTATATATATTAATCTCTATTTCTTAACAAAAAAAAAAAAAAAGAAAGACGATTTTTTTGCTATTAGATACAAAAATAAAGTTTAAGATTCTATTAGATACAAATTTAAATTTATATCTATTCAATCAATTATTTTTTAAAAACTCTCGAATTTTTTGGGGCCTATTAAACCAAAAATTGTAAGTTCAAGGATCGATTAGGGATGAAATTAAAAGTTCATGGAATTGTTGGACACAAAATTGAAAGTTCGGGGATTTATGAGAGAAAACTGAAAGTTTAGGGACCTATTAGACATTTTTTAAAACTCAAAGACCAAATTCACACAAATTTCATTTAACCTTTTATGTCTACATTCACCATTGGAGTGGAATCTTTATTGGAAGACTGACTACGTGTATTCTCCTAGAACGTTTGTTGTTTTGATGCCTGGCACATCTTTTGGTTAACCTGAACATCTTTTGGTTAAATCTGATCAAATGGATTGAAAAAGAAAAAATTTCTCTTGGTCATTGATGTTGCTTTGAGGGGAAATATTGATTAATGATACAAATGGAGTGGGTTACAGGATACGAGCAGATGAGCAATCAAGCAGATTTTGGAGTATAGTTCCACTACCAAGCGTCTCTAAAAGGGCTTGAAAACTTGGTTGTTTATTTAATTTTATTTTTTAATACAACAACTGTAAGGTGGGAATCAAACATCCAAGCTCTAGGAGGAAGGCCATGCCAAATTACCTTAAACTCACTTTAGCTAGAAACTTGTATTTATTTTATGTTTGTCTATTCCAATAAATTGTATATTATCTCGTTCTGGTTATCACTTTTCTTGTATTAAAGGCACATATTTGTATTTAATATCGCACATTGTGTAGGAGGCTAGAGGACTCGATGCTGGGCCTCGACTAGTGAAAAAATTAGTTGGCTTCGGGGATCATAGAACATCTGATATTGTAGCTAAAATTGCTGATGAAGAAGTTGCACATGTAGCTGTTGGTGTCTACTGGTTTGTCCTAGTCTGTCAGAAAATGGAACGTGCTCCATGCTCAACTTTTAAAGGTTTATATCCGTCATTACTTCAGTTTGTACTATTTCATTAAAATGATACAAGTCCTAGAGCTTGCATTACTGGATAACATCTTTGGTGGCACTAACGCAGAGTTGTTAAAAGAATACAGTGTGGAACTAAAAGGGCCTTTCAATTATTCAGCTCGAGATGAAGCTGGCCTTCCACGTGATTGGTATGAAAACTTCCCATCTAGTTTACAGTCGGAATTTTTCTTTACCCAACTCTTCTGCAGCTTTTATGAACCATTATACTACTGCTCACACTTTATAGGTATGACATATCAAACACGAATGTACAGGACGAGTTGAGTGGAGACACTAAAAATGAACAGCTATCTGTGGTACATAATCATTAATCTAACTTCTTGTAACATGCCTGTAATTGTGTGAACCATGTACATGCGTTTTTTTTTTTTCATTAGTAAGATTTTGGTATCACTGATATTTCAGGTTTACGACAGGCTTGCTTCAGTAATATCCATGGAGTGCAAGAACTCCAGTTTGCATGGGCCTTCAGAATGAGGTGTTATTCACCATCATCCGGTGTTTTTGTAACGTGCTTCGTGGGCCTGAGATAATGTTTAATGATTCTGGAGCACAGTTTTGTATGGTTAGTATCCTGTTTTATCTCAGCTTCAAGATTAAGAGCATGATTTCGTCCCCAGGATGTAGAACTGCTTGACGATCTCTAGATCCATCTTTTTAAGGAGTATAGTCGTGTATAGTTGGTGTTCTGTTTTATCTTAGTCATAAAAAATTCAAGATCAACAACACATATTGGTCTCTCTACTAATGTTGAACTGCCTGAGGTATCCTTAAATCATCCCATTGTTTCAATCAGTAAGTTTTGATACGAAAGATATGAAAAAAAAGCACAACTTAGAAACTCTTTTGATTCAGTTCATGGAATTGAAATAATCTATTTGCCCTGTGGATGATAGTCTTCGTTATGAAACTGATTTGAATGGATGCGTCCTGACACGAACTATCGACTTTTAAAATGATAATTTGGTGTATGTATCATCCTCCTTTTTGAAACTTCAATACTTTGACTGCTTACTTTCTCCAAACCGTCGTGTTTGATCTAGTGACAAAGAAAGAACGTTAAGTTTGATAAACGAGCTTAGAAGGACGCACTTAATCTATGATGACCACTTGTTTAAAATTTAATATCTTACGAATTTTCTTGACATCCAAATGTAAGGTC
mRNA sequence
GTTCAAGTCACATTTTATGATAACTAATGGAATAGGGATTGAATGTCACTTTCCACAAAAGTTTAGCTCAACCAAGGTTAGAAGTTGAAAGGAACTTAGCTCAAAGCTACAAAGTTGGTTTCTCCATCTCAATTTGGCCATAGATATGAGCATGACCACAGAAATTTTGGCGCTGAAGGAAGAATGATACAGCGTCTTCAACTCAAGGCTCTACATTTATGGCCTACTCTTCGCTCTTCGTCCTCTCCCCATCTCCATTCTCAAACCCTCAACCTAAGTTCCTCTTCTTCTCTTCAATACACGCCTTGGTCTGGTCTCAAAGCTTGGAAACAGAGTCCCCTTAATGAGAATCGATTCTGGGGACCCAATGGACCAGAGCCTCTGCTTGAATCTTCATCAACTGGGGTTTTCTTTGATAGCCGAATCGAATCGGCTTCGTCTCTTGCGGAATTGGGTGCATTGGTTCTCTCTACAAGTGACCCTTTAACCAAATCCAAACTCTCTCATCTTGCTTACTCCAGATGGTCTCAGGAAGGTCTTCCCATTGGCGTTTTCGAAGCTCCTTCTCATCCTGCTCGACCTTCGTTGCCGAAATTGGTCTCTCCAAAGGAAATTCCGGCTCCTAAAAACTCAGGATTACCTCTGAATGCTTATATGTTACACAATCTTGCTCATGTTGAGCTTAATGCAATTGATTTGGCATGGGACACTGTCGTTCGATTTTCTCTCTTCAGTGATGTTCTTGGGGAGGGTTTTTTTGCTGACTTTGCTCATGTTGCTGATGATGAGAGTCGCCATTTTATGTGGTGTTCACAGAGACTTGCTGAACTTGGTTTCAAATATGGAGATATGGCTGCTCATAATTTGCTTTGGAGGGAGTGTGAGAAATCATCCAACAATGTAGCTGCACGCTTGGCAGCAATACCGCTTGTCCAGGAGGCTAGAGGACTCGATGCTGGGCCTCGACTAGTGAAAAAATTAGTTGGCTTCGGGGATCATAGAACATCTGATATTGTAGCTAAAATTGCTGATGAAGAAGTTGCACATGTAGCTGTTGGTGTCTACTGGTTTGTCCTAGTCTGTCAGAAAATGGAACGTGCTCCATGCTCAACTTTTAAAGAGTTGTTAAAAGAATACAGTGTGGAACTAAAAGGGCCTTTCAATTATTCAGCTCGAGATGAAGCTGGCCTTCCACGTGATTGGTATGACATATCAAACACGAATGTACAGGACGAGTTGAGTGGAGACACTAAAAATGAACAGCTATCTGTGGTTTACGACAGGCTTGCTTCAGTAATATCCATGGAGTGCAAGAACTCCAGTTTGCATGGGCCTTCAGAATGAGGTGTTATTCACCATCATCCGGTGTTTTTGTAACGTGCTTCGTGGGCCTGAGATAATGTTTAATGATTCTGGAGCACAGTTTTGTATGGTTAGTATCCTGTTTTATCTCAGCTTCAAGATTAAGAGCATGATTTCGTCCCCAGGATGTAGAACTGCTTGACGATCTCTAGATCCATCTTTTTAAGGAGTATAGTCGTGTATAGTTGGTGTTCTGTTTTATCTTAGTCATAAAAAATTCAAGATCAACAACACATATTGGTCTCTCTACTAATGTTGAACTGCCTGAGGTATCCTTAAATCATCCCATTGTTTCAATCAGTAAGTTTTGATACGAAAGATATGAAAAAAAAGCACAACTTAGAAACTCTTTTGATTCAGTTCATGGAATTGAAATAATCTATTTGCCCTGTGGATGATAGTCTTCGTTATGAAACTGATTTGAATGGATGCGTCCTGACACGAACTATCGACTTTTAAAATGATAATTTGGTGTATGTATCATCCTCCTTTTTGAAACTTCAATACTTTGACTGCTTACTTTCTCCAAACCGTCGTGTTTGATCTAGTGACAAAGAAAGAACGTTAAGTTTGATAAACGAGCTTAGAAGGACGCACTTAATCTATGATGACCACTTGTTTAAAATTTAATATCTTACGAATTTTCTTGACATCCAAATGTAAGGTC
Coding sequence (CDS)
ATGATACAGCGTCTTCAACTCAAGGCTCTACATTTATGGCCTACTCTTCGCTCTTCGTCCTCTCCCCATCTCCATTCTCAAACCCTCAACCTAAGTTCCTCTTCTTCTCTTCAATACACGCCTTGGTCTGGTCTCAAAGCTTGGAAACAGAGTCCCCTTAATGAGAATCGATTCTGGGGACCCAATGGACCAGAGCCTCTGCTTGAATCTTCATCAACTGGGGTTTTCTTTGATAGCCGAATCGAATCGGCTTCGTCTCTTGCGGAATTGGGTGCATTGGTTCTCTCTACAAGTGACCCTTTAACCAAATCCAAACTCTCTCATCTTGCTTACTCCAGATGGTCTCAGGAAGGTCTTCCCATTGGCGTTTTCGAAGCTCCTTCTCATCCTGCTCGACCTTCGTTGCCGAAATTGGTCTCTCCAAAGGAAATTCCGGCTCCTAAAAACTCAGGATTACCTCTGAATGCTTATATGTTACACAATCTTGCTCATGTTGAGCTTAATGCAATTGATTTGGCATGGGACACTGTCGTTCGATTTTCTCTCTTCAGTGATGTTCTTGGGGAGGGTTTTTTTGCTGACTTTGCTCATGTTGCTGATGATGAGAGTCGCCATTTTATGTGGTGTTCACAGAGACTTGCTGAACTTGGTTTCAAATATGGAGATATGGCTGCTCATAATTTGCTTTGGAGGGAGTGTGAGAAATCATCCAACAATGTAGCTGCACGCTTGGCAGCAATACCGCTTGTCCAGGAGGCTAGAGGACTCGATGCTGGGCCTCGACTAGTGAAAAAATTAGTTGGCTTCGGGGATCATAGAACATCTGATATTGTAGCTAAAATTGCTGATGAAGAAGTTGCACATGTAGCTGTTGGTGTCTACTGGTTTGTCCTAGTCTGTCAGAAAATGGAACGTGCTCCATGCTCAACTTTTAAAGAGTTGTTAAAAGAATACAGTGTGGAACTAAAAGGGCCTTTCAATTATTCAGCTCGAGATGAAGCTGGCCTTCCACGTGATTGGTATGACATATCAAACACGAATGTACAGGACGAGTTGAGTGGAGACACTAAAAATGAACAGCTATCTGTGGTTTACGACAGGCTTGCTTCAGTAATATCCATGGAGTGCAAGAACTCCAGTTTGCATGGGCCTTCAGAATGA
Protein sequence
MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVCQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQLSVVYDRLASVISMECKNSSLHGPSE*
Homology
BLAST of CsGy6G021730 vs. ExPASy Swiss-Prot
Match:
P43935 (Uncharacterized protein HI_0077 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_0077 PE=4 SV=1)
HSP 1 Score: 132.5 bits (332), Expect = 1.0e-29
Identity = 90/266 (33.83%), Postives = 138/266 (51.88%), Query Frame = 0
Query: 95 LSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEAP------SHPARPSLPKLVSPKEIPAPK 154
L T++P K +L + Y + I + + P + A P P LV+PK++P
Sbjct: 16 LKTANPQEKCRLVNDLYDNLLPQIQLIKLEDFPEIVPQDNIAAFPEKPLLVAPKDVPKRS 75
Query: 155 NSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSL-FSDVLGEG--FFADFAHVADDESRH 214
+ A LH +AH+E NAI+L D RF + LGEG F D+ VA +ES H
Sbjct: 76 FATEEGYAATLHAIAHIEFNAINLGLDAAWRFGRNAQEELGEGLAFVKDWLRVAREESTH 135
Query: 215 FMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQEARGLDAGPRLVKK 274
F ++ L LG++YGD AH LW + +++++ R+A +P V EARGLDA P L +K
Sbjct: 136 FSLVNEHLKTLGYQYGDFEAHAGLWEMAQATAHDIWERMALVPRVLEARGLDATPVLQEK 195
Query: 275 LVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVCQKMERAPCSTFKELLKEYSVEL-KG 334
+ D +I+ I +E+ HV +G +W+ + +K F ELL +Y + + KG
Sbjct: 196 IAQRKDFAAVNILDIILRDEIGHVYIGNHWYHALSKKRGLDAMKCFTELLHKYRIVIFKG 255
Query: 335 PFNYSARDEAGLPR---DW-YDISNT 347
N AR +AG + DW Y++ T
Sbjct: 256 VINTDARIQAGFTQHELDWIYEVEQT 281
BLAST of CsGy6G021730 vs. NCBI nr
Match:
XP_004140115.1 (uncharacterized protein LOC101207410 isoform X3 [Cucumis sativus])
HSP 1 Score: 771 bits (1991), Expect = 3.83e-281
Identity = 386/386 (100.00%), Postives = 386/386 (100.00%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG
Sbjct: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
Query: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP
Sbjct: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
Query: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF
Sbjct: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
Query: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV
Sbjct: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
Query: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC
Sbjct: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
Query: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ
Sbjct: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
Query: 361 LSVVYDRLASVISMECKNSSLHGPSE 386
LSVVYDRLASVISMECKNSSLHGPSE
Sbjct: 361 LSVVYDRLASVISMECKNSSLHGPSE 386
BLAST of CsGy6G021730 vs. NCBI nr
Match:
XP_008449389.1 (PREDICTED: uncharacterized protein HI_0077 [Cucumis melo] >KAA0057373.1 DUF455 domain-containing protein [Cucumis melo var. makuwa] >TYK30062.1 DUF455 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 746 bits (1927), Expect = 2.26e-271
Identity = 376/387 (97.16%), Postives = 379/387 (97.93%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFW 60
MIQRLQLKALHLWPTLRSSS H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFW
Sbjct: 1 MIQRLQLKALHLWPTLRSSSCLHFHSQTLNVNSSSSSPQYTPWSGLKAWKQSPLNENRFW 60
Query: 61 GPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL 120
GPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKS+LSHLAYSRWSQE L
Sbjct: 61 GPNGPEPLVESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSRLSHLAYSRWSQESL 120
Query: 121 PIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
PIGVFEAPSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR
Sbjct: 121 PIGVFEAPSHPARPPLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
Query: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN
Sbjct: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
Query: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV
Sbjct: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
Query: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNE 360
CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDE SGDTKNE
Sbjct: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDESSGDTKNE 360
Query: 361 QLSVVYDRLASVISMECKNSSLHGPSE 386
QLSVVYDRLASVISMECKNSSLHGPSE
Sbjct: 361 QLSVVYDRLASVISMECKNSSLHGPSE 387
BLAST of CsGy6G021730 vs. NCBI nr
Match:
XP_011657545.1 (uncharacterized protein LOC101207410 isoform X1 [Cucumis sativus] >KAE8647338.1 hypothetical protein Csa_003477 [Cucumis sativus])
HSP 1 Score: 726 bits (1873), Expect = 6.50e-263
Identity = 363/363 (100.00%), Postives = 363/363 (100.00%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG
Sbjct: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
Query: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP
Sbjct: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
Query: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF
Sbjct: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
Query: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV
Sbjct: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
Query: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC
Sbjct: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
Query: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ
Sbjct: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
Query: 361 LSV 363
LSV
Sbjct: 361 LSV 363
BLAST of CsGy6G021730 vs. NCBI nr
Match:
XP_031743610.1 (uncharacterized protein LOC101207410 isoform X2 [Cucumis sativus])
HSP 1 Score: 713 bits (1840), Expect = 5.97e-258
Identity = 359/363 (98.90%), Postives = 359/363 (98.90%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG
Sbjct: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
Query: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP
Sbjct: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
Query: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF
Sbjct: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
Query: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV
Sbjct: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
Query: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC
Sbjct: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
Query: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
QKMERAPCSTFKE YSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ
Sbjct: 301 QKMERAPCSTFKE----YSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 359
Query: 361 LSV 363
LSV
Sbjct: 361 LSV 359
BLAST of CsGy6G021730 vs. NCBI nr
Match:
XP_038887628.1 (uncharacterized protein HI_0077 [Benincasa hispida])
HSP 1 Score: 702 bits (1812), Expect = 7.04e-254
Identity = 349/386 (90.41%), Postives = 367/386 (95.08%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWG 60
M+QRL+LK+L LWPTLRSSS PHLHSQTL L+SSS LQYTPWSG+KAW+QSPLNENRFWG
Sbjct: 1 MMQRLRLKSLLLWPTLRSSSYPHLHSQTLKLNSSS-LQYTPWSGIKAWRQSPLNENRFWG 60
Query: 61 PNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLP 120
PNGPEPL ESSSTG FDSRIESASSLAELGALVLSTSDPLTKS LSHLAYSRWSQE LP
Sbjct: 61 PNGPEPLAESSSTGFLFDSRIESASSLAELGALVLSTSDPLTKSILSHLAYSRWSQEDLP 120
Query: 121 IGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
IGVF+APS PARP +PKLVSPKEIPAPKN+GLPLNAYMLHNLAHVELNAIDLAWDTVVRF
Sbjct: 121 IGVFDAPSRPARPPVPKLVSPKEIPAPKNAGLPLNAYMLHNLAHVELNAIDLAWDTVVRF 180
Query: 181 SLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNV 240
S FS+VLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSS+NV
Sbjct: 181 SPFSEVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSDNV 240
Query: 241 AARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVC 300
AARLA IPLVQEARGLDAGPRLVKKL+GFGDHRTSDIVA+IADEEVAHVAVGVYWFVLVC
Sbjct: 241 AARLATIPLVQEARGLDAGPRLVKKLIGFGDHRTSDIVARIADEEVAHVAVGVYWFVLVC 300
Query: 301 QKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNEQ 360
QKM RAPCSTF++LLKEY+VELKGPFNYSARDEAG PRDWYDISNTNVQDE SG +NEQ
Sbjct: 301 QKMARAPCSTFRDLLKEYNVELKGPFNYSARDEAGFPRDWYDISNTNVQDESSGSARNEQ 360
Query: 361 LSVVYDRLASVISMECKNSSLHGPSE 386
LSVVYDRL S+ISMECKNSSLHGPSE
Sbjct: 361 LSVVYDRLTSLISMECKNSSLHGPSE 385
BLAST of CsGy6G021730 vs. ExPASy TrEMBL
Match:
A0A5D3E2D9 (DUF455 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G00330 PE=4 SV=1)
HSP 1 Score: 746 bits (1927), Expect = 1.10e-271
Identity = 376/387 (97.16%), Postives = 379/387 (97.93%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFW 60
MIQRLQLKALHLWPTLRSSS H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFW
Sbjct: 1 MIQRLQLKALHLWPTLRSSSCLHFHSQTLNVNSSSSSPQYTPWSGLKAWKQSPLNENRFW 60
Query: 61 GPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL 120
GPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKS+LSHLAYSRWSQE L
Sbjct: 61 GPNGPEPLVESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSRLSHLAYSRWSQESL 120
Query: 121 PIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
PIGVFEAPSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR
Sbjct: 121 PIGVFEAPSHPARPPLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
Query: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN
Sbjct: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
Query: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV
Sbjct: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
Query: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNE 360
CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDE SGDTKNE
Sbjct: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDESSGDTKNE 360
Query: 361 QLSVVYDRLASVISMECKNSSLHGPSE 386
QLSVVYDRLASVISMECKNSSLHGPSE
Sbjct: 361 QLSVVYDRLASVISMECKNSSLHGPSE 387
BLAST of CsGy6G021730 vs. ExPASy TrEMBL
Match:
A0A1S3BMU7 (uncharacterized protein HI_0077 OS=Cucumis melo OX=3656 GN=LOC103491286 PE=4 SV=1)
HSP 1 Score: 746 bits (1927), Expect = 1.10e-271
Identity = 376/387 (97.16%), Postives = 379/387 (97.93%), Query Frame = 0
Query: 1 MIQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFW 60
MIQRLQLKALHLWPTLRSSS H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFW
Sbjct: 1 MIQRLQLKALHLWPTLRSSSCLHFHSQTLNVNSSSSSPQYTPWSGLKAWKQSPLNENRFW 60
Query: 61 GPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL 120
GPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKS+LSHLAYSRWSQE L
Sbjct: 61 GPNGPEPLVESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSRLSHLAYSRWSQESL 120
Query: 121 PIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
PIGVFEAPSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR
Sbjct: 121 PIGVFEAPSHPARPPLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVR 180
Query: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN
Sbjct: 181 FSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNN 240
Query: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV
Sbjct: 241 VAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLV 300
Query: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKNE 360
CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDE SGDTKNE
Sbjct: 301 CQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDESSGDTKNE 360
Query: 361 QLSVVYDRLASVISMECKNSSLHGPSE 386
QLSVVYDRLASVISMECKNSSLHGPSE
Sbjct: 361 QLSVVYDRLASVISMECKNSSLHGPSE 387
BLAST of CsGy6G021730 vs. ExPASy TrEMBL
Match:
A0A6J1JES7 (uncharacterized protein LOC111486257 OS=Cucurbita maxima OX=3661 GN=LOC111486257 PE=4 SV=1)
HSP 1 Score: 670 bits (1728), Expect = 2.88e-241
Identity = 335/388 (86.34%), Postives = 358/388 (92.27%), Query Frame = 0
Query: 2 IQRLQLKALHLWPTLRSSSSPHLHSQTLNL---SSSSSLQYTPWSGLKAWKQSPLNENRF 61
+QRLQLKAL +WPT R S ++HSQ++ L SSSSSL+YTPWSGLKAW+QSP+NENRF
Sbjct: 6 MQRLQLKALLIWPTFRCSFRANIHSQSVKLNSSSSSSSLEYTPWSGLKAWRQSPINENRF 65
Query: 62 WGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEG 121
WG NGPE L+ESSS G FDSRIESASSLAELGALVLSTSDPL KS+LSHLAYSRWS E
Sbjct: 66 WGSNGPEALVESSSNGFLFDSRIESASSLAELGALVLSTSDPLIKSRLSHLAYSRWSLED 125
Query: 122 LPIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVV 181
LPIGVFEAP PARP PKLVSP+EIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVV
Sbjct: 126 LPIGVFEAPRRPARPPQPKLVSPREIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVV 185
Query: 182 RFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSN 241
RFS FS++LGEGFFADFAHVADDESRHF WCSQRLAELGFKYGDMAAHNLLWRECEKSS+
Sbjct: 186 RFSCFSELLGEGFFADFAHVADDESRHFTWCSQRLAELGFKYGDMAAHNLLWRECEKSSD 245
Query: 242 NVAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVL 301
NVAARLAAIPLVQEARGLDAGPRLVKKL+GFGDHRTSDIVA+IADEEVAHVAVG+ WF+L
Sbjct: 246 NVAARLAAIPLVQEARGLDAGPRLVKKLIGFGDHRTSDIVARIADEEVAHVAVGIDWFIL 305
Query: 302 VCQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSGDTKN 361
VCQKMERAPCSTFK+LLKEY+VELKGPFNYSARDEAGLPRDWYD+SNTN QDE SG KN
Sbjct: 306 VCQKMERAPCSTFKDLLKEYNVELKGPFNYSARDEAGLPRDWYDMSNTNKQDETSGGAKN 365
Query: 362 EQLSVVYDRLASVISMECKNSSLHGPSE 386
EQLSVVY+RLASVISME KNSSLHGPSE
Sbjct: 366 EQLSVVYNRLASVISMESKNSSLHGPSE 393
BLAST of CsGy6G021730 vs. ExPASy TrEMBL
Match:
A0A6J1EXG4 (uncharacterized protein LOC111437070 OS=Cucurbita moschata OX=3662 GN=LOC111437070 PE=4 SV=1)
HSP 1 Score: 658 bits (1697), Expect = 1.31e-236
Identity = 333/392 (84.95%), Postives = 355/392 (90.56%), Query Frame = 0
Query: 2 IQRLQLKALHLWPTLRSSSSPHLHSQTLNL-------SSSSSLQYTPWSGLKAWKQSPLN 61
+QRLQLKAL +WPT R S ++HSQ++ L SSSSSL+YTPWSGLKAW+QSP+N
Sbjct: 1 MQRLQLKALLIWPTFRCSFRANIHSQSVKLNSSSSSSSSSSSLEYTPWSGLKAWRQSPIN 60
Query: 62 ENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRW 121
ENRFWG NGPE L+ESSS G FDSRIESASSLAELGALVLSTSDPL KS+LSHLAYSRW
Sbjct: 61 ENRFWGSNGPEALVESSSNGFLFDSRIESASSLAELGALVLSTSDPLIKSRLSHLAYSRW 120
Query: 122 SQEGLPIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW 181
S E LPIGVFEAP PARP PKLVSP+EIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW
Sbjct: 121 SLEDLPIGVFEAPRRPARPPQPKLVSPREIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW 180
Query: 182 DTVVRFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECE 241
DTVVRFS FS+VLGEGFFADFAHVADDESRHF WCSQRLAELGFKYGDMAAHNLLWRECE
Sbjct: 181 DTVVRFSCFSEVLGEGFFADFAHVADDESRHFTWCSQRLAELGFKYGDMAAHNLLWRECE 240
Query: 242 KSSNNVAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVY 301
KSS+NVAARLAAIPLVQEARGLDAGPRLVKKL+GFGDHRTSDIVA+IADEEVAHVAVG+
Sbjct: 241 KSSDNVAARLAAIPLVQEARGLDAGPRLVKKLIGFGDHRTSDIVARIADEEVAHVAVGID 300
Query: 302 WFVLVCQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSG 361
WF+LVCQKMERAPCSTFK+LLKEY+VELKGPFNYSARDEAGLPRDWYD+SNTN QDE SG
Sbjct: 301 WFILVCQKMERAPCSTFKDLLKEYNVELKGPFNYSARDEAGLPRDWYDMSNTNKQDETSG 360
Query: 362 DTKNEQLSVVYDRLASVISMECKNSSLHGPSE 386
KNEQLSVVY+RLASVISME KNS GPSE
Sbjct: 361 GAKNEQLSVVYNRLASVISMESKNS---GPSE 389
BLAST of CsGy6G021730 vs. ExPASy TrEMBL
Match:
A0A6J1D299 (uncharacterized protein LOC111016701 OS=Momordica charantia OX=3673 GN=LOC111016701 PE=4 SV=1)
HSP 1 Score: 625 bits (1613), Expect = 7.27e-224
Identity = 322/388 (82.99%), Postives = 345/388 (88.92%), Query Frame = 0
Query: 2 IQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSS------SSLQYTPWSGLKAWKQSPLNE 61
+QRLQ K+ H LR LHSQ++ + SS SSL+Y PWSGL+AW++SPLNE
Sbjct: 1 MQRLQPKSFH----LRCFFCCDLHSQSVKIKSSHSSYSSSSLKYAPWSGLEAWRESPLNE 60
Query: 62 NRFWGPNGPEPL-LESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRW 121
+RFWG NGPEP + SSTG D RIESASSLAELGALVLSTSDPL+KS+LSHLA+SRW
Sbjct: 61 SRFWGLNGPEPQPVVESSTGFLHDGRIESASSLAELGALVLSTSDPLSKSRLSHLAFSRW 120
Query: 122 SQEGLPIGVFEAPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW 181
SQ+ LPIGV EAP P+RP PKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW
Sbjct: 121 SQKRLPIGVSEAPPPPSRPPEPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAW 180
Query: 182 DTVVRFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECE 241
DTVVRFS FS+VLGEGFFADFAHVADDESRHF WCSQRLAELGFKYGDMAAHNLLWRECE
Sbjct: 181 DTVVRFSPFSEVLGEGFFADFAHVADDESRHFTWCSQRLAELGFKYGDMAAHNLLWRECE 240
Query: 242 KSSNNVAARLAAIPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVY 301
KSS+NVAARLAAIPLVQEARGLDAGPRLVKKL+GFGDHRTSDIVA+IADEEVAHVAVGVY
Sbjct: 241 KSSDNVAARLAAIPLVQEARGLDAGPRLVKKLIGFGDHRTSDIVARIADEEVAHVAVGVY 300
Query: 302 WFVLVCQKMERAPCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDISNTNVQDELSG 361
WF+LVCQKMERAPCSTFKELLKEY+VELKGPFNYSARDEAGLPRDWYDISNTNVQDE S
Sbjct: 301 WFILVCQKMERAPCSTFKELLKEYNVELKGPFNYSARDEAGLPRDWYDISNTNVQDETSR 360
Query: 362 DTKNEQLSVVYDRLASVISMECKNSSLH 382
KNEQLSVVYDRLA+VISME KNSSLH
Sbjct: 361 GAKNEQLSVVYDRLATVISMESKNSSLH 384
BLAST of CsGy6G021730 vs. TAIR 10
Match:
AT1G06240.1 (Protein of unknown function DUF455 )
HSP 1 Score: 482.3 bits (1240), Expect = 3.8e-136
Identity = 253/381 (66.40%), Postives = 296/381 (77.69%), Query Frame = 0
Query: 11 HLWPTLRSSSSPH-LHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLE 70
HL P +S SP L T SSS+ Q+ WSGL+ W++SP+N+ R WGP G PLL
Sbjct: 5 HLRPFSTASLSPRSLTISTALSSSSTKQQHKLWSGLENWRKSPVNDLRLWGPTG--PLLP 64
Query: 71 SSSTGVFFD--SRIESASSLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGLPIG-VFEA 130
SSS + D + +ASSLA+LGALVLSTSDPL+KS +SHLA+SRW +E LP+G +
Sbjct: 65 SSSDSISADFYGLVSAASSLADLGALVLSTSDPLSKSHISHLAFSRWRRENLPVGSISHL 124
Query: 131 PSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDV 190
PS PARP P LV+ ++P PK+S LPLNA+MLHNLAHVELNAIDLAWDTV RFS F D+
Sbjct: 125 PSSPARPPKPLLVATNQVPNPKDSNLPLNAHMLHNLAHVELNAIDLAWDTVARFSPFFDL 184
Query: 191 LGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAA 250
LG FF DFAHVADDESRHF+WCSQRLAELGFKYGD+ A+NLL RECEK+SNNVAARLA
Sbjct: 185 LGHNFFDDFAHVADDESRHFLWCSQRLAELGFKYGDIPANNLLMRECEKTSNNVAARLAC 244
Query: 251 IPLVQEARGLDAGPRLVKKLVGFGDHRTSDIVAKIADEEVAHVAVGVYWFVLVCQKMERA 310
IPLVQEARGLDAGPRLVK+L GFGD+RTS IVAKIA+EEVAHVAVGV WF+ VCQKM RA
Sbjct: 245 IPLVQEARGLDAGPRLVKRLTGFGDNRTSKIVAKIAEEEVAHVAVGVDWFLSVCQKMNRA 304
Query: 311 PCSTFKELLKEYSVELKGPFNYSARDEAGLPRDWYDIS-NTNVQDELSGDTKNEQLSVVY 370
P TFK+L+KEY VEL+GPFN+SAR+ AG+PRDWYD S T V + EQLS VY
Sbjct: 305 PSPTFKDLIKEYGVELRGPFNHSAREVAGIPRDWYDPSCGTEVDKGDNEQGDKEQLSAVY 364
Query: 371 DRLASVISMECKNSSLHGPSE 387
DRL +ISME +NSSL P++
Sbjct: 365 DRLTHIISMESENSSLEKPAK 383
BLAST of CsGy6G021730 vs. TAIR 10
Match:
AT5G04520.1 (Protein of unknown function DUF455 )
HSP 1 Score: 155.6 bits (392), Expect = 8.1e-38
Identity = 101/281 (35.94%), Postives = 143/281 (50.89%), Query Frame = 0
Query: 86 SLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL-----PIGVFEAPSHPARPSLP-KLV 145
+L E +L+TSDP K++L +W Q + P F P PAR LP KLV
Sbjct: 5 TLIESAIRILNTSDPHEKARLGDSIAVKWLQGAIAEPYDPTVDFPVPDRPAR--LPVKLV 64
Query: 146 SPKEIPAPKNSG-LPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHV 205
SP +P +G L ++H+LAH E AIDL+WD + RF + + FF DF V
Sbjct: 65 SPSLMPKLGRAGSLQSRQAIVHSLAHTESWAIDLSWDIIARFGK-QEKMPRDFFTDFVRV 124
Query: 206 ADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQEARGLDA 265
A DE RHF + RL E+G YG + AH+ LW +S+++ ARLA V EARGLD
Sbjct: 125 AQDEGRHFTLLAARLEEIGSSYGALPAHDGLWDSATATSHDLLARLAIEHCVHEARGLDV 184
Query: 266 GPRLVKKLVGFGDHRTSDIVAKIA-DEEVAHVAVGVYWFVLVCQKME------------- 325
P + + GD+ T+D++ K+ EE+ H A GV WF +C++ +
Sbjct: 185 LPTTISRFRNGGDNETADLLEKVVYPEEITHCAAGVKWFKYLCERSKDPEFTISSKESDD 244
Query: 326 --RAPCSTFKELLKE-YSVELKGPFNYSARDEAGLPRDWYD 343
+ F +++E + LK PFN AR AG WY+
Sbjct: 245 SNEEIINKFHSVVREHFRGPLKPPFNAEARKAAGFGPQWYE 282
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P43935 | 1.0e-29 | 33.83 | Uncharacterized protein HI_0077 OS=Haemophilus influenzae (strain ATCC 51907 / D... | [more] |
Match Name | E-value | Identity | Description | |
XP_004140115.1 | 3.83e-281 | 100.00 | uncharacterized protein LOC101207410 isoform X3 [Cucumis sativus] | [more] |
XP_008449389.1 | 2.26e-271 | 97.16 | PREDICTED: uncharacterized protein HI_0077 [Cucumis melo] >KAA0057373.1 DUF455 d... | [more] |
XP_011657545.1 | 6.50e-263 | 100.00 | uncharacterized protein LOC101207410 isoform X1 [Cucumis sativus] >KAE8647338.1 ... | [more] |
XP_031743610.1 | 5.97e-258 | 98.90 | uncharacterized protein LOC101207410 isoform X2 [Cucumis sativus] | [more] |
XP_038887628.1 | 7.04e-254 | 90.41 | uncharacterized protein HI_0077 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3E2D9 | 1.10e-271 | 97.16 | DUF455 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... | [more] |
A0A1S3BMU7 | 1.10e-271 | 97.16 | uncharacterized protein HI_0077 OS=Cucumis melo OX=3656 GN=LOC103491286 PE=4 SV=... | [more] |
A0A6J1JES7 | 2.88e-241 | 86.34 | uncharacterized protein LOC111486257 OS=Cucurbita maxima OX=3661 GN=LOC111486257... | [more] |
A0A6J1EXG4 | 1.31e-236 | 84.95 | uncharacterized protein LOC111437070 OS=Cucurbita moschata OX=3662 GN=LOC1114370... | [more] |
A0A6J1D299 | 7.27e-224 | 82.99 | uncharacterized protein LOC111016701 OS=Momordica charantia OX=3673 GN=LOC111016... | [more] |