Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAATGGGACATTTGTGCGTTCTTCTATTCTACCGCGAAGATTGTGCATCGAAGCGAATGGGCAATGCAGAAGGTTATGGCTTCATCTCAAATTGGTAAAGTGAACAACTCTCTCTCAACTCAAAAATTCTTCAGCATTGCATGTTTTACACTCAATTCATTCCACTTATTTTGCAATTTCGATTAGTTTCCATCTTTCCACACACTCTTTTCTGTTCTACTTTGATTCACTTCTAAATTCCTCGAGTAGACGAAGAAAATCGACATTACAAGCGTTTAATCTCGCTGGAATCCTTCGATTCTTCATTTATTGTAGATTGAATCAACACCGTTTCAATTTCATCTCATAATTTATGGTTTCTATTACGGAATGGCGTTTGATTTTTCATTCGTCAAATGTTGCGTCGTAGATAACGCGTTTGGATTGAAATAGAGAACTGAAGAATCTCGAAACTCGAAACGAAGAACATTTTAGTGCGAAATGAAGAAGGGGGACGGCAACCGTGGACCGGGGGTTTCAGGTTCTCGTCGGACGCGGTCTCAGATAGAACCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGCTTTGTCTAGCTATCAGAAATGGAAGATTGTTGCTGAGAACTGCACGTCTTTAGATGTGGTTCGGAATTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTGGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATTGATAATGTTGCCACAATGAGGGAAAATCAGTCGGATACCGAGCCAGATAGCGATCCCGAGGCTGCAGTTGAGATTGTTGATGAAGTTGCAGAGCCTGGTATGGTTCTTTGTTTCCTAGAGTGCTTGTTATAGATCACATTTCCATGGTACTAAGTTTTCACGTGAAATTGTTCACTCTAGAGTTTCTTTGAATACACTTGATGCTAATTATTTTTTGCTTCACTTAATGACTTAAAACCCTTTGTCTCCCGGCTTTGGGCAGGCAAGGCGTCCCTAGAGATTAGTGCCGCGCGTCACGAGCATAATGCAAGGATTAGCTCTGATGGCTTGGTTATATGAAAAAGTCTTTTTTTGCTTCTTGATGAGTGATTGAGGAAGTTTTCGATACGTTTTTTTGTTTCTTCATGCTTTTAATGCTGTAAAATTAATAGCAACAAATATAACCTGGTGCTTGAAATCCTTGTATTTCTTGTTTTTGGTATTAAATTTTGAGCAACATGATAGCCAACCTAAGCATATCTTACTTGATTAAGGCATATATTATCAACCAAAAGGTCAGAAATTCAAATCTCCACCTCCCATATGTTGTTGAACTTAAAATCTTGAGCAACATGATTGTTTCTATTTTATCACATCAATGACAAAAATGTTTTATGTTTCAATTTTTGTGAGCTCCTCGAAGTTATAGAGGGATTGGCTTTGGAACCTTTTTATGTGGCGGTTGTGTATATATTAAACAGCAAATAAAAGCTAAACATTGTAAGGAAAGGGTATTTAAACTACTTGAGAATCTCAGATACAGTGCCTGCTTCAAACATTTCATGCATTTATATGATTGTATCTAGAAATTATTTCTATATCACTTGTATATATTATAATGCTTCACTGGTCCTATTCTTCAATTGCTGTTCGTATTGTATCCATTCCAGGATCTAGATTTTTTGGTTATTTTCCTTCTAATTCATTAGGTGAAATTTGAACTTAGTATATTCTGCTAATTTATATCAAACAAGGCCTTCTGATACGGCGTTGCCTTTCTGTGTTCTTTTGACACCCTAATTTTCTCATGGATACGAAAGACATTTCTAACTGCATTTTCATAAGGCTTAATTGTGAACTTTTTGAAACCAAAAAGGCTGGTTGACAAAAATGTCATGACTGCATAACTTGCTTTGCACCAACTTCTAAAGTGTGGTTAAAGAGTGAGATGGATCTTATTATTGCTGTCTTCAAATCACTTCAATAATATTCGTCTTTTTCTTCTTTCGAATTTTGTCATATTAGGCCCTAAAAGACAAAGACGGCGTTCAATGTCTAAGAGAAATCAAGTCCTTGAGAAATCTTTAAAATGTGAAGAAGAAGAAGAAGAAAAACCTCCATTGAGCTTTCCCGAAGTAGAGCCTCGTGAATGCCACCTCAAAAGCAACGGAGAAAAGGCAACCAATAATGCTGAACCCAAAGAGCAAATGATGGCGAAGAATTTGCTTGAAACTGCAGCAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGTGGCTTCTGATGCAAAGAACGTCGGCCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATCCCTTGGAGATTTTCTCAACACCATTAATGATCTCAGTGGCCTGCTCGAAGATTGTGAGTGACTCTCATTTTGCTTCTTTTCAGCCTACCAAGTAAGGAATGTTATCCAATGATCTTCTCAACTCCTTGTGTAAATGTTTGTTTAAATCATTTTGTTAGAGTGCATCTCTCAGGTGAGCTCATTTAGCACGTCTTAGGAAATTGCCAATTTGATGACATATTAGAGTTTTGAAACTTCCCATCTCTTTCTTACAGACAAACATGTCAAGTTGGCCAATTCGAGCAACAGCAAAACGTTGCCGACATTTTCCAGGTTACAGATTAACCCAATCCAATGTTTATTTATGAAACCCCTTCTGTCTTATCTCTCTGTTATTCTCTGTTATCATTATTATTTGGTTAAAATATTGTTTTGGTCCGTGTAATTAGAAAGTACGCATTCCATTTTAGTTTTTATACTTTC
mRNA sequence
GAATGGGACATTTGTGCGTTCTTCTATTCTACCGCGAAGATTGTGCATCGAAGCGAATGGGCAATGCAGAAGGTTATGGCTTCATCTCAAATTGATAACGCGTTTGGATTGAAATAGAGAACTGAAGAATCTCGAAACTCGAAACGAAGAACATTTTAGTGCGAAATGAAGAAGGGGGACGGCAACCGTGGACCGGGGGTTTCAGGTTCTCGTCGGACGCGGTCTCAGATAGAACCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGCTTTGTCTAGCTATCAGAAATGGAAGATTGTTGCTGAGAACTGCACGTCTTTAGATGTGGTTCGGAATTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTGGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATTGATAATGTTGCCACAATGAGGGAAAATCAGTCGGATACCGAGCCAGATAGCGATCCCGAGGCTGCAGTTGAGATTGTTGATGAAGTTGCAGAGCCTGGCCCTAAAAGACAAAGACGGCGTTCAATGTCTAAGAGAAATCAAGTCCTTGAGAAATCTTTAAAATGTGAAGAAGAAGAAGAAGAAAAACCTCCATTGAGCTTTCCCGAAGTAGAGCCTCGTGAATGCCACCTCAAAAGCAACGGAGAAAAGGCAACCAATAATGCTGAACCCAAAGAGCAAATGATGGCGAAGAATTTGCTTGAAACTGCAGCAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGTGGCTTCTGATGCAAAGAACGTCGGCCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATCCCTTGGAGATTTTCTCAACACCATTAATGATCTCAGTGGCCTGCTCGAAGATTACAAACATGTCAAGTTGGCCAATTCGAGCAACAGCAAAACGTTGCCGACATTTTCCAGGTTACAGATTAACCCAATCCAATGTTTATTTATGAAACCCCTTCTGTCTTATCTCTCTGTTATTCTCTGTTATCATTATTATTTGGTTAAAATATTGTTTTGGTCCGTGTAATTAGAAAGTACGCATTCCATTTTAGTTTTTATACTTTC
Coding sequence (CDS)
ATGAAGAAGGGGGACGGCAACCGTGGACCGGGGGTTTCAGGTTCTCGTCGGACGCGGTCTCAGATAGAACCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGCTTTGTCTAGCTATCAGAAATGGAAGATTGTTGCTGAGAACTGCACGTCTTTAGATGTGGTTCGGAATTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTGGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATTGATAATGTTGCCACAATGAGGGAAAATCAGTCGGATACCGAGCCAGATAGCGATCCCGAGGCTGCAGTTGAGATTGTTGATGAAGTTGCAGAGCCTGGCCCTAAAAGACAAAGACGGCGTTCAATGTCTAAGAGAAATCAAGTCCTTGAGAAATCTTTAAAATGTGAAGAAGAAGAAGAAGAAAAACCTCCATTGAGCTTTCCCGAAGTAGAGCCTCGTGAATGCCACCTCAAAAGCAACGGAGAAAAGGCAACCAATAATGCTGAACCCAAAGAGCAAATGATGGCGAAGAATTTGCTTGAAACTGCAGCAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGTGGCTTCTGATGCAAAGAACGTCGGCCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATCCCTTGGAGATTTTCTCAACACCATTAATGATCTCAGTGGCCTGCTCGAAGATTACAAACATGTCAAGTTGGCCAATTCGAGCAACAGCAAAACGTTGCCGACATTTTCCAGGTTACAGATTAACCCAATCCAATGTTTATTTATGAAACCCCTTCTGTCTTATCTCTCTGTTATTCTCTGTTATCATTATTATTTGGTTAAAATATTGTTTTGGTCCGTGTAA
Protein sequence
MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLEDYKHVKLANSSNSKTLPTFSRLQINPIQCLFMKPLLSYLSVILCYHYYLVKILFWSV
Homology
BLAST of Tan0009744 vs. NCBI nr
Match:
XP_038897371.1 (trihelix transcription factor ASR3 [Benincasa hispida])
HSP 1 Score: 462.2 bits (1188), Expect = 3.8e-126
Identity = 245/305 (80.33%), Postives = 257/305 (84.26%), Query Frame = 0
Query: 2 KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 61
K+ GNRG GVSGSRRTRSQI PDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA
Sbjct: 3 KENAGNRGLGVSGSRRTRSQIAVAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62
Query: 62 ENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
ENCTSLDVVR SNQCRRKWDCLLIEHDVIKQWEL+MP+DDSYWCLESGRRKELGLPDNFD
Sbjct: 63 ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPEDDSYWCLESGRRKELGLPDNFD 122
Query: 122 EELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSL 181
EELFKAIDNVATMR NQSDTEPDSDPEAAVE +DE+AEPGPKRQRRRSMSK NQVLEKSL
Sbjct: 123 EELFKAIDNVATMRANQSDTEPDSDPEAAVENIDEIAEPGPKRQRRRSMSKSNQVLEKSL 182
Query: 182 KCE--------------------EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQ 241
+CE EEEEEKP LSFPEVEPREC++K+NG K T+N EPKEQ
Sbjct: 183 ECERNRALEKSLECKEEEEVEDGEEEEEKPLLSFPEVEPRECYIKNNGSKVTDNLEPKEQ 242
Query: 242 MMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLS 285
MMAK LLE A KVQAIVSENAEY SD KN + +LVR QGSKLIR LGD LNTINDL
Sbjct: 243 MMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTINDLR 302
BLAST of Tan0009744 vs. NCBI nr
Match:
XP_022139752.1 (trihelix transcription factor ASR3 [Momordica charantia] >XP_022139827.1 trihelix transcription factor ASR3 [Momordica charantia])
HSP 1 Score: 457.6 bits (1176), Expect = 9.4e-125
Identity = 231/285 (81.05%), Postives = 251/285 (88.07%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DGNRG GVSGSRRTRSQI PDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAE
Sbjct: 1 MKQEDGNRGAGVSGSRRTRSQIAPDWTAAECLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV R SNQCRRKWDCLLIEHDVIKQWELEMPDDDSYW LESGRRKELGLP+NFD+
Sbjct: 61 NCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWSLESGRRKELGLPENFDK 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
ELFKAIDNVATMR NQSDTEPDSDPEA VE++DE++EPGPKRQRRRS+SKR+Q LEKSL+
Sbjct: 121 ELFKAIDNVATMRANQSDTEPDSDPEAGVEMLDEISEPGPKRQRRRSISKRSQALEKSLE 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSEN 240
CEEE+EEKPPL+ PE EPREC +KSNGEK ++ E +EQMM K LLE ++QAIVSEN
Sbjct: 181 CEEEDEEKPPLNSPEAEPRECFIKSNGEKQIDSLELEEEQMMTKKLLENVEQIQAIVSEN 240
Query: 241 AEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
AEY SD KN HRID VRRQG+ LIR LGD LN INDL GL ED
Sbjct: 241 AEYATSDEKNTNHRIDSVRRQGNTLIRCLGDILNAINDLHGLFED 285
BLAST of Tan0009744 vs. NCBI nr
Match:
KAG7037576.1 (Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 445.3 bits (1144), Expect = 4.8e-121
Identity = 232/286 (81.12%), Postives = 251/286 (87.76%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DG RG VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAE
Sbjct: 1 MKEEDGKRGTEVSGSRRTRSRIAPDWTAADCLVLVNVIAAVEADCWKALSSFQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE
Sbjct: 61 NCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
E+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RN+ LEKS+K
Sbjct: 121 EVFKAIDNVVSMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSIRNRSLEKSVK 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENA 240
CEEE+EE PP+S PEVE R C++KS+GEKAT++ EP+EQ MAK LLETA KVQAIVSENA
Sbjct: 181 CEEEDEE-PPVSSPEVERRGCYIKSHGEKATDSTEPEEQTMAKKLLETAEKVQAIVSENA 240
Query: 241 EYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
EY SD KN + R + +RRQGSKLI+ L DFLNTINDL LLED
Sbjct: 241 EYATSDEKNDNYNVRTNSIRRQGSKLIKCLEDFLNTINDLHSLLED 285
BLAST of Tan0009744 vs. NCBI nr
Match:
XP_023524323.1 (trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] >XP_023524324.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] >XP_023524325.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] >XP_023524326.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 444.9 bits (1143), Expect = 6.3e-121
Identity = 231/288 (80.21%), Postives = 248/288 (86.11%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DG RG VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAE
Sbjct: 1 MKEEDGKRGTEVSGSRRTRSRIAPDWTAADCLVLVNVIAAVEADCWKALSSFQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV RNSNQCRRKWDCLLIEHDVI+QWELEMPDDDSYWCLES RRKELGLPDNFDE
Sbjct: 61 NCTSLDVARNSNQCRRKWDCLLIEHDVIRQWELEMPDDDSYWCLESERRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
ELFKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+K
Sbjct: 121 ELFKAIDNVVSMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVK 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENA 240
CE EEEE+PP+S PEVE R C++KS+GEK T++ EP+EQ MAK LLETA KVQAIVSENA
Sbjct: 181 CEGEEEEEPPVSSPEVERRGCYIKSHGEKTTDSTEPEEQTMAKKLLETAEKVQAIVSENA 240
Query: 241 EYVASDAK----NVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
EY SD K N R + +RRQGSKLI+ L DFLNTINDL LLED
Sbjct: 241 EYATSDEKNDNDNYNVRTNSIRRQGSKLIKCLEDFLNTINDLHSLLED 288
BLAST of Tan0009744 vs. NCBI nr
Match:
KAG6608224.1 (Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 443.7 bits (1140), Expect = 1.4e-120
Identity = 230/286 (80.42%), Postives = 249/286 (87.06%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DG RG VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKI+AE
Sbjct: 1 MKEEDGKRGTEVSGSRRTRSRIAPDWTAADCLVLVNVIAAVEADCWKALSSFQKWKIIAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE
Sbjct: 61 NCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
E+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+
Sbjct: 121 EVFKAIDNVVSMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQSLEKSVN 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENA 240
CEEE+EE PP+S PEVE R C++KS+GEKAT++ EP+EQ M K LLETA KVQAIVSENA
Sbjct: 181 CEEEDEE-PPVSSPEVERRGCYIKSHGEKATDSTEPEEQTMVKKLLETAEKVQAIVSENA 240
Query: 241 EYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
EY SD KN + R + +RRQGSKLI+ L DFLNTINDL LLED
Sbjct: 241 EYATSDEKNDNYNVRTNSIRRQGSKLIKCLEDFLNTINDLHSLLED 285
BLAST of Tan0009744 vs. ExPASy TrEMBL
Match:
A0A6J1CEU7 (trihelix transcription factor ASR3 OS=Momordica charantia OX=3673 GN=LOC111010559 PE=4 SV=1)
HSP 1 Score: 457.6 bits (1176), Expect = 4.5e-125
Identity = 231/285 (81.05%), Postives = 251/285 (88.07%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DGNRG GVSGSRRTRSQI PDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAE
Sbjct: 1 MKQEDGNRGAGVSGSRRTRSQIAPDWTAAECLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV R SNQCRRKWDCLLIEHDVIKQWELEMPDDDSYW LESGRRKELGLP+NFD+
Sbjct: 61 NCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWSLESGRRKELGLPENFDK 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
ELFKAIDNVATMR NQSDTEPDSDPEA VE++DE++EPGPKRQRRRS+SKR+Q LEKSL+
Sbjct: 121 ELFKAIDNVATMRANQSDTEPDSDPEAGVEMLDEISEPGPKRQRRRSISKRSQALEKSLE 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSEN 240
CEEE+EEKPPL+ PE EPREC +KSNGEK ++ E +EQMM K LLE ++QAIVSEN
Sbjct: 181 CEEEDEEKPPLNSPEAEPRECFIKSNGEKQIDSLELEEEQMMTKKLLENVEQIQAIVSEN 240
Query: 241 AEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
AEY SD KN HRID VRRQG+ LIR LGD LN INDL GL ED
Sbjct: 241 AEYATSDEKNTNHRIDSVRRQGNTLIRCLGDILNAINDLHGLFED 285
BLAST of Tan0009744 vs. ExPASy TrEMBL
Match:
A0A6J1FMB3 (trihelix transcription factor ASR3-like OS=Cucurbita moschata OX=3662 GN=LOC111446515 PE=4 SV=1)
HSP 1 Score: 442.6 bits (1137), Expect = 1.5e-120
Identity = 231/286 (80.77%), Postives = 249/286 (87.06%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DG RG VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAE
Sbjct: 1 MKEEDGKRGTEVSGSRRTRSRIAPDWTAADCLVLVNVIAAVEADCWKALSSFQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE
Sbjct: 61 NCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
E+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+K
Sbjct: 121 EVFKAIDNVVSMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQSLEKSVK 180
Query: 181 CEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENA 240
CEEE+EE P +S PEVE R C++KS+GEKAT++ EP+EQ MAK LLETA KVQAIVSENA
Sbjct: 181 CEEEDEE-PQVSSPEVERRGCYIKSHGEKATDSTEPEEQTMAKKLLETAEKVQAIVSENA 240
Query: 241 EYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
EY SD KN + R + +R QGSKLI+ L DFLNTINDL LLED
Sbjct: 241 EYATSDEKNDNYNVRTNSIRHQGSKLIKCLEDFLNTINDLHSLLED 285
BLAST of Tan0009744 vs. ExPASy TrEMBL
Match:
A0A0A0LDW0 (Myb-like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G882960 PE=4 SV=1)
HSP 1 Score: 440.3 bits (1131), Expect = 7.5e-120
Identity = 236/307 (76.87%), Postives = 249/307 (81.11%), Query Frame = 0
Query: 2 KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 61
K+ GNRG GVSGSRRTRSQI P WTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA
Sbjct: 3 KENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62
Query: 62 ENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
ENCTSLDVVR SNQCRRKWDCLLIEHDVIKQWEL+MPDDDSYWCL SGRRKELGLP+NFD
Sbjct: 63 ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPENFD 122
Query: 122 EELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSL 181
EELFKAIDNVA+MR NQSDTEPDSDPEAA+ DE+AEPGPKRQRRRSMSK NQVLEKSL
Sbjct: 123 EELFKAIDNVASMRANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEKSL 182
Query: 182 KCE----------------------EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPK 241
+CE EE EEKP LS PE+EPREC++KSN K T+N EPK
Sbjct: 183 ECERNLGLEISLECKEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNIEPK 242
Query: 242 EQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTIND 285
EQMMAK LLE A KVQAIVSENAEY SD K + +LVR QGSKLIR LGD LNTIND
Sbjct: 243 EQMMAKFLLENAEKVQAIVSENAEYTTSDEKCAKDQTNLVRHQGSKLIRCLGDILNTIND 302
BLAST of Tan0009744 vs. ExPASy TrEMBL
Match:
A0A6J1J3M2 (trihelix transcription factor ASR3 OS=Cucurbita maxima OX=3661 GN=LOC111481039 PE=4 SV=1)
HSP 1 Score: 434.5 bits (1116), Expect = 4.1e-118
Identity = 230/295 (77.97%), Postives = 247/295 (83.73%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MK+ DG RG VSGSRRTRS+I PDWTAA+CLVLVNVIAAVEADC KALSS+QKWKIVAE
Sbjct: 1 MKEEDGKRGTEVSGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
NCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE
Sbjct: 61 NCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
ELFKAIDNV MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+K
Sbjct: 121 ELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVK 180
Query: 181 C-------EEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQ 240
C EEEEEE+P +S PEVE R C++KS+GEKAT+N EP+EQ M K LLETA KVQ
Sbjct: 181 CEEEEEEEEEEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPEEQTMTKKLLETAEKVQ 240
Query: 241 AIVSENAEYVAS----DAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED 285
AIVSENA+Y S D N +R + +RRQGSKLI+ L DFLNTINDL L ED
Sbjct: 241 AIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTINDLHSLPED 295
BLAST of Tan0009744 vs. ExPASy TrEMBL
Match:
A0A6J1IN02 (trihelix transcription factor ASR3-like OS=Cucurbita maxima OX=3661 GN=LOC111476700 PE=4 SV=1)
HSP 1 Score: 428.7 bits (1101), Expect = 2.3e-116
Identity = 219/290 (75.52%), Postives = 250/290 (86.21%), Query Frame = 0
Query: 1 MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAE 60
MKK +GNRG GVSGSRRTRSQI P+WTAA+CLVLVNVI AVEADC+KALSSYQKWKIVAE
Sbjct: 1 MKKENGNRGSGVSGSRRTRSQIAPEWTAAECLVLVNVIGAVEADCMKALSSYQKWKIVAE 60
Query: 61 NCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
+CT+L+V R SNQCR+KW+CLLIEHDVI+QWEL MP+DDSYWCLESGRRKELGLPDNFDE
Sbjct: 61 DCTALNVARTSNQCRKKWECLLIEHDVIRQWELTMPEDDSYWCLESGRRKELGLPDNFDE 120
Query: 121 ELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLK 180
ELFKAI NV++MR NQSDTEPD+DPEAAVE DE++EPGPKRQRR SMSKRNQ LEKSL+
Sbjct: 121 ELFKAIYNVSSMRANQSDTEPDNDPEAAVENADEISEPGPKRQRRGSMSKRNQGLEKSLE 180
Query: 181 C----EEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIV 240
C EEE EE+P LS PE + R+C++K+NG KAT++ EP+EQMM K LLE A VQ IV
Sbjct: 181 CKEDEEEEAEEQPLLSSPEADLRDCYIKNNGAKATDDIEPEEQMMVKKLLENAENVQEIV 240
Query: 241 SENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLEDYK 287
SENAE V SD KN + +L+RRQGSKLIR LGDFLNTINDL LLED++
Sbjct: 241 SENAECVTSDEKNDKDQTNLIRRQGSKLIRCLGDFLNTINDLRDLLEDFE 290
BLAST of Tan0009744 vs. TAIR 10
Match:
AT4G31270.1 (sequence-specific DNA binding transcription factors )
HSP 1 Score: 185.7 bits (470), Expect = 6.4e-47
Identity = 117/283 (41.34%), Postives = 164/283 (57.95%), Query Frame = 0
Query: 11 GVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRN 70
G SGSRRTRSQ+ P+W DCLVLVN IAAVEADC ALSS+QKW ++ ENC +LDV RN
Sbjct: 4 GTSGSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRN 63
Query: 71 SNQCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRRKELGLPDNFDEELFKAIDNV 130
NQCRRKWD L+ +++ IK+WE + SYW L S +RK L LP + D ELF+AI+ V
Sbjct: 64 LNQCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAV 123
Query: 131 ATMRENQSDTEPDSDPEA--AVEIVDEVAEPGPKRQRRRSMSKRNQVLE--KSLKCEEEE 190
+++ ++ TE DSDPEA V++ E+A G KR R+R+M + E ++ + +
Sbjct: 124 VMIQDEKAGTESDSDPEAQDVVDLSAELAFVGSKRSRQRTMVMKETKKEEPRTSRVQVNT 183
Query: 191 EEKP--------PLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVS 250
EKP + E +P E E T N E ++M L + AIV
Sbjct: 184 REKPITTKATHQNKTMGEKKPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKIDLIHAIVG 243
Query: 251 EN--AEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDL 279
N + D ++ ++ VR+QG +LI L + ++T+N L
Sbjct: 244 RNLAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRL 286
BLAST of Tan0009744 vs. TAIR 10
Match:
AT2G35640.1 (Homeodomain-like superfamily protein )
HSP 1 Score: 52.8 bits (125), Expect = 6.5e-07
Identity = 38/143 (26.57%), Postives = 61/143 (42.66%), Query Frame = 0
Query: 25 DWTAADCLVLVNVIAAVEADCLKALSSYQK------------WKIVAENCTSLDVVRNSN 84
+WT ++ LVL I A + D + + +K WK + E C RN N
Sbjct: 21 NWTVSETLVL---IEAKKMDDQRRVRRSEKQPEGRNKPAELRWKWIEEYCWRRGCYRNQN 80
Query: 85 QCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLESGRRKELGLPDNFDEELFKA 144
QC KWD L+ ++ I+++E + SYW ++ RKE LP N +++
Sbjct: 81 QCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNLPSNMLPQIYDV 140
Query: 145 IDNVATMRENQSDTEPDSDPEAA 149
+ + + T P S AA
Sbjct: 141 LSELVDRK-----TLPSSSSAAA 155
BLAST of Tan0009744 vs. TAIR 10
Match:
AT1G31310.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 49.7 bits (117), Expect = 5.5e-06
Identity = 30/113 (26.55%), Postives = 49/113 (43.36%), Query Frame = 0
Query: 54 KWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-------------- 113
+WK + + C +R+ NQC KWD L+ ++ ++++E +
Sbjct: 63 RWKWIEDYCWRKGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAG 122
Query: 114 ---SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAV 150
SYW +E RKE LP N + ++A+ V +S T P S AV
Sbjct: 123 ETASYWKMEKSERKERSLPSNMLPQTYQALFEVV-----ESKTLPSSTAVTAV 170
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038897371.1 | 3.8e-126 | 80.33 | trihelix transcription factor ASR3 [Benincasa hispida] | [more] |
XP_022139752.1 | 9.4e-125 | 81.05 | trihelix transcription factor ASR3 [Momordica charantia] >XP_022139827.1 triheli... | [more] |
KAG7037576.1 | 4.8e-121 | 81.12 | Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. argyr... | [more] |
XP_023524323.1 | 6.3e-121 | 80.21 | trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] >XP_0235243... | [more] |
KAG6608224.1 | 1.4e-120 | 80.42 | Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. soror... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CEU7 | 4.5e-125 | 81.05 | trihelix transcription factor ASR3 OS=Momordica charantia OX=3673 GN=LOC11101055... | [more] |
A0A6J1FMB3 | 1.5e-120 | 80.77 | trihelix transcription factor ASR3-like OS=Cucurbita moschata OX=3662 GN=LOC1114... | [more] |
A0A0A0LDW0 | 7.5e-120 | 76.87 | Myb-like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G882960 PE... | [more] |
A0A6J1J3M2 | 4.1e-118 | 77.97 | trihelix transcription factor ASR3 OS=Cucurbita maxima OX=3661 GN=LOC111481039 P... | [more] |
A0A6J1IN02 | 2.3e-116 | 75.52 | trihelix transcription factor ASR3-like OS=Cucurbita maxima OX=3661 GN=LOC111476... | [more] |
Match Name | E-value | Identity | Description | |
AT4G31270.1 | 6.4e-47 | 41.34 | sequence-specific DNA binding transcription factors | [more] |
AT2G35640.1 | 6.5e-07 | 26.57 | Homeodomain-like superfamily protein | [more] |
AT1G31310.1 | 5.5e-06 | 26.55 | hydroxyproline-rich glycoprotein family protein | [more] |