HG10013507 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10013507
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: trihelix transcription factor ASR3
Location: Chr02: 2185086 .. 2187204 (+)
RNA-Seq Expression: HG10013507
Synteny: HG10013507
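
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr02: 2185086 .. 2187204 (+)'.

    Returns (seqid, start, end, strand). The pattern is an assumption
    based on the single Location value shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr02: 2185086 .. 2187204 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2119 bp on the plus strand of Chr02.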
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGGAAAACGCCGCCAACCGTGGACCGGGGGCTTCAGGCTCTCGTCGGACGCGTTCTCAGATATCACCGGATTGGACGGCGGCGGATTGTCTTGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGATTTGTCTAGCTATCAGAAATGGAAGATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGACTTCGTATCAGTGCAGGAGAAAGTGGGACTGTTTGCTGATTGAACATGATGTTATTAAGCAATGGGAGTTAAAGATGCCGGAGGATGATTCGTTTTGGTGTTTGGAGAGTGGAAGGAGAAAAGAATTGGGACTTCCTGACAACTTTGATGAGGAGCTGTTCAAAGCAATTGATAATGTCGCAACGATGAGAGCGAATCAGTCAGATACGGAGCCTGATAGTGATCTGGAGGCTGCGGTTGAGAACACTGATGAAATTGCAGAGCCTGGTATGGTTCTTGAGTTCATAGAGTGCTTAGAGATCAAGTTTTCATGCGAATTTGTCCTCTCTAGAGTTTCTTTGAATGCACTTGATAATCATTTCTTACTTTTGGTCTAGTGGTCGATAAGGGTCATGAAAATAAAAAAAACGAACAAAGAGCTTAGAGGAAGTGAGTTCAAAGTCATGGTGGATACCTATCTAAGATTTAATATCCAACAACAACCAAATATAATAGAGTCATGCGGTTATCCATAAGAATAGTATGGATATTAAAAGAAAACAAACCATTTTATTTTTTGCTTCACTTCCCTTGTATGCACAAAGCCTTCCTAGAGATTAACTGTATGAGTCATGAGCATGATGCAAGCAGGGGCTTTGATGGTCTGGCTATATGAATTTTTTTTCCTTGATCAACGATTGGGGAAAATTCAGAAACGTTTTTTGTTTTCTTCTTGTTGTTAATGCTGTGAAGTTAATAGAAATAGATACAAATACCTGTATTTCTTGTTCTTGGTAATAAATCTTGAGCGACATGATTGTTTCTATTACACCACATCAATGAGAAAGACGATGTATGTTTCATCTTTTATGAGGTCTTCAAAGTTATGGAAGCGTTCTTTTTATCTGGCCGGTTGTGTGGATGACTGCATTAAACAGCAAATTAAAGTTAAAAACTGTAAGGAGGAGTAGTTAACTACCTCAGAATCTCAGATACAGTGCCTGCCTCAAACATTTCATGAATATATGTCATTATATCTAGAAATTATCTGCATATAAACGTTGAAAGAGTCACCCTTACTCAATCCTGAGCAGTGAATTGAAGAATTGGAGCAGTGAAGCATTATGTTAAATAGAAGAACTCACGGTCGGTATTATAGTCATTCCAGGATCTAGATTGTTGGGTTATTTTCCTTGTAATCCATTAGGTGCAATTTGACATTAGTATTCTGCTAACTTATTATCATATACTAGACCTGATAGGACGGTGTCAGGATTATTTTGGCATCCTAATTTTCTCATGGAGACGAAAGTCATTCGTAAGGCTTAATTGTAAATTTTTTGTAACCAAAAGAGCTGGTTGACAAAAATGTCATTACTGTATAAATTGCTTTGAAGCATCTTCTAATGTGTGGAGATGGATCTTATAATTGCTGTCTTCAAATTAATTCAATAACATTGGTCTTTTTTCCTTTCTGATTTTGTCATTCCAGGGCCTAAAAGGCAAAGACGTCGTTCAATGTCCAAGAGCAATCAGGCCCTTGAGAAATCTTTGGAATGTGAGAGAAATCAGGCCCTTGAGAAATGTTTAGAATGTAAAAAAGAAGTAGAGGAAGAAGAAGAAAAAGAAAAGCCTCTATTAAGCTTTCCAGAAGTAGAACCTCGTGAATGCTACATCAAAAGCAATGGTTCAAAGTTGACCGATAATATCGAACCCAAAGAGCAAATGATGGCTAAGTTTTTGCTTGAAAATGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGCAACTTCTGATGAAA
AGAATGACAAGGACCAAACTAATTTGGTAAGGCATCAAGGGAGCAAGCTTATCAGATGTCTTGGAGATATTCTCAACACTATTGACGATCTCCGTGGCCTGCTCGAAGATTTTGAGTGA

mRNA sequence

ATGAAGAAGGAAAACGCCGCCAACCGTGGACCGGGGGCTTCAGGCTCTCGTCGGACGCGTTCTCAGATATCACCGGATTGGACGGCGGCGGATTGTCTTGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGATTTGTCTAGCTATCAGAAATGGAAGATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGACTTCGTATCAGTGCAGGAGAAAGTGGGACTGTTTGCTGATTGAACATGATGTTATTAAGCAATGGGAGTTAAAGATGCCGGAGGATGATTCGTTTTGGTGTTTGGAGAGTGGAAGGAGAAAAGAATTGGGACTTCCTGACAACTTTGATGAGGAGCTGTTCAAAGCAATTGATAATGTCGCAACGATGAGAGCGAATCAGTCAGATACGGAGCCTGATAGTGATCTGGAGGCTGCGGTTGAGAACACTGATGAAATTGCAGAGCCTGGGCCTAAAAGGCAAAGACGTCGTTCAATGTCCAAGAGCAATCAGGCCCTTGAGAAATCTTTGGAATGTGAGAGAAATCAGGCCCTTGAGAAATGTTTAGAATGTAAAAAAGAAGTAGAGGAAGAAGAAGAAAAAGAAAAGCCTCTATTAAGCTTTCCAGAAGTAGAACCTCGTGAATGCTACATCAAAAGCAATGGTTCAAAGTTGACCGATAATATCGAACCCAAAGAGCAAATGATGGCTAAGTTTTTGCTTGAAAATGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGCAACTTCTGATGAAAAGAATGACAAGGACCAAACTAATTTGGTAAGGCATCAAGGGAGCAAGCTTATCAGATGTCTTGGAGATATTCTCAACACTATTGACGATCTCCGTGGCCTGCTCGAAGATTTTGAGTGA

Coding sequence (CDS)

ATGAAGAAGGAAAACGCCGCCAACCGTGGACCGGGGGCTTCAGGCTCTCGTCGGACGCGTTCTCAGATATCACCGGATTGGACGGCGGCGGATTGTCTTGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGATTTGTCTAGCTATCAGAAATGGAAGATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGACTTCGTATCAGTGCAGGAGAAAGTGGGACTGTTTGCTGATTGAACATGATGTTATTAAGCAATGGGAGTTAAAGATGCCGGAGGATGATTCGTTTTGGTGTTTGGAGAGTGGAAGGAGAAAAGAATTGGGACTTCCTGACAACTTTGATGAGGAGCTGTTCAAAGCAATTGATAATGTCGCAACGATGAGAGCGAATCAGTCAGATACGGAGCCTGATAGTGATCTGGAGGCTGCGGTTGAGAACACTGATGAAATTGCAGAGCCTGGGCCTAAAAGGCAAAGACGTCGTTCAATGTCCAAGAGCAATCAGGCCCTTGAGAAATCTTTGGAATGTGAGAGAAATCAGGCCCTTGAGAAATGTTTAGAATGTAAAAAAGAAGTAGAGGAAGAAGAAGAAAAAGAAAAGCCTCTATTAAGCTTTCCAGAAGTAGAACCTCGTGAATGCTACATCAAAAGCAATGGTTCAAAGTTGACCGATAATATCGAACCCAAAGAGCAAATGATGGCTAAGTTTTTGCTTGAAAATGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGCAACTTCTGATGAAAAGAATGACAAGGACCAAACTAATTTGGTAAGGCATCAAGGGAGCAAGCTTATCAGATGTCTTGGAGATATTCTCAACACTATTGACGATCTCCGTGGCCTGCTCGAAGATTTTGAGTGA

Protein sequence

MKKENAANRGPGASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDNFDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEKSLECERNQALEKCLECKKEVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLLEDFE
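
The relationship between the CDS and the protein sequence above can be checked with a small translation routine. This is a sketch under stated assumptions: it uses the standard genetic code, the function name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAAGGAAAACGCCGCCAACCGTGGA"  # first 30 nt of the CDS above
cds_end = "GAAGATTTTGAGTGA"                   # last 15 nt of the CDS above
print(translate(cds_start))  # MKKENAANRG
print(translate(cds_end))    # EDFE
```

Both fragments agree with the start (MKKENAANRG) and end (EDFE) of the listed protein sequence. The same function can be applied to the full CDS.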
Homology
BLAST of HG10013507 vs. NCBI nr
Match: XP_038897371.1 (trihelix transcription factor ASR3 [Benincasa hispida])

HSP 1 Score: 542.0 bits (1395), Expect = 3.4e-150
Identity = 282/309 (91.26%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +PDWTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGLGVSGSRRTRSQIAVAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMPEDDS+WCLESGRRKELGLPDN
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPEDDSYWCLESGRRKELGLPDN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEK 180
           FDEELFKAIDNVATMRANQSDTEPDSD EAAVEN DEIAEPGPKRQRRRSMSKSNQ LEK
Sbjct: 121 FDEELFKAIDNVATMRANQSDTEPDSDPEAAVENIDEIAEPGPKRQRRRSMSKSNQVLEK 180

Query: 181 SLECERNQALEKCLECKKEVE---EEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPK 240
           SLECERN+ALEK LECK+E E    EEE+EKPLLSFPEVEPRECYIK+NGSK+TDN+EPK
Sbjct: 181 SLECERNRALEKSLECKEEEEVEDGEEEEEKPLLSFPEVEPRECYIKNNGSKVTDNLEPK 240

Query: 241 EQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDD 300
           EQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTI+D
Sbjct: 241 EQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIND 300

Query: 301 LRGLLEDFE 305
           LRGLLED E
Sbjct: 301 LRGLLEDCE 309
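
The percentages on each HSP summary line follow directly from the matched/aligned counts. The sketch below recomputes them for the first hit. The function name `parse_hsp_stats` is hypothetical, and the regular expression is an assumption based on the line layout shown in this record.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    # Accept both 'Positives' and the 'Postives' spelling seen in some outputs.
    pattern = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")
    stats = {}
    for label, matched, total in pattern.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(matched) / int(total), 2)
    return stats

line = "Identity = 282/309 (91.26%), Positives = 291/309 (94.17%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 91.26, 'positives': 94.17}
```

The denominator is the alignment length including gap columns, which is why it can exceed the query length.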

BLAST of HG10013507 vs. NCBI nr
Match: XP_004136441.1 (trihelix transcription factor ASR3 [Cucumis sativus] >XP_011652540.1 trihelix transcription factor ASR3 [Cucumis sativus] >KGN60185.1 hypothetical protein Csa_001069 [Cucumis sativus])

HSP 1 Score: 506.9 bits (1304), Expect = 1.2e-139
Identity = 269/312 (86.22%), Positives = 280/312 (89.74%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+WCL SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEK 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+ N DEIAEPGPKRQRRRSMSKSNQ LEK
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEK 180

Query: 181 SLECERNQALEKCLECKKEVEE------EEEKEKPLLSFPEVEPRECYIKSNGSKLTDNI 240
           SLECERN  LE  LEC KEVE+      EE +EKPLLS PE+EPRECYIKSN SK+TDNI
Sbjct: 181 SLECERNLGLEISLEC-KEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNI 240

Query: 241 EPKEQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNT 300
           EPKEQMMAKFLLENAEKVQAIVSENAEY TSDEK  KDQTNLVRHQGSKLIRCLGDILNT
Sbjct: 241 EPKEQMMAKFLLENAEKVQAIVSENAEYTTSDEKCAKDQTNLVRHQGSKLIRCLGDILNT 300

Query: 301 IDDLRGLLEDFE 305
           I+DLRGLLED E
Sbjct: 301 INDLRGLLEDCE 311

BLAST of HG10013507 vs. NCBI nr
Match: XP_008466281.1 (PREDICTED: trihelix transcription factor ASR3 [Cucumis melo])

HSP 1 Score: 491.5 bits (1264), Expect = 5.2e-135
Identity = 261/309 (84.47%), Positives = 277/309 (89.64%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+W L SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEK 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+EN +EIAEPGPKRQRRRSMSKSNQALE 
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEPGPKRQRRRSMSKSNQALEN 180

Query: 181 SLECERNQALEKCLECKK-----EVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIE 240
           S ECERNQALE  LECK+     E E EE KEKPLLS PE+E +E YIKSN SK+ D++E
Sbjct: 181 SPECERNQALEISLECKEVEDGGEGEGEEVKEKPLLSSPELESQEYYIKSNESKVADDVE 240

Query: 241 PKEQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTI 300
           PKEQMMAKFLLENAEKVQAIVSENAEY TSDEK +KDQTNLVRHQGSKLIRCLGDILNTI
Sbjct: 241 PKEQMMAKFLLENAEKVQAIVSENAEYTTSDEKCNKDQTNLVRHQGSKLIRCLGDILNTI 300

Query: 301 DDLRGLLED 303
           +DLRGLL+D
Sbjct: 301 NDLRGLLKD 309

BLAST of HG10013507 vs. NCBI nr
Match: KAA0038734.1 (trihelix transcription factor ASR3 [Cucumis melo var. makuwa] >TYK31347.1 trihelix transcription factor ASR3 [Cucumis melo var. makuwa])

HSP 1 Score: 473.0 bits (1216), Expect = 1.9e-129
Identity = 261/346 (75.43%), Positives = 277/346 (80.06%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+W L SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEP------------------- 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+EN +EIAEP                   
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEPGMGQFELSILLVLTCIMLD 180

Query: 181 ------------------GPKRQRRRSMSKSNQALEKSLECERNQALEKCLECKK----- 240
                             GPKRQRRRSMSKSNQALE S ECERNQALE  LECK+     
Sbjct: 181 LIRWDFLSGLFWHPNFLMGPKRQRRRSMSKSNQALENSPECERNQALEISLECKEVEDGG 240

Query: 241 EVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMAKFLLENAEKVQAIVSE 300
           E E EE KEKPLLS PE+E +E YIKSN SK+ D++EPKEQMMAKFLLENAEKVQAIVSE
Sbjct: 241 EGEGEEVKEKPLLSSPELESQEYYIKSNESKVADDVEPKEQMMAKFLLENAEKVQAIVSE 300

Query: 301 NAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLLED 303
           NAEY TSDEK +KDQTNLVRHQGSKLIRCLGDILNTI+DLRGLL+D
Sbjct: 301 NAEYTTSDEKCNKDQTNLVRHQGSKLIRCLGDILNTINDLRGLLKD 346

BLAST of HG10013507 vs. NCBI nr
Match: XP_022976249.1 (trihelix transcription factor ASR3-like [Cucurbita maxima] >XP_022976250.1 trihelix transcription factor ASR3-like [Cucurbita maxima] >XP_022976251.1 trihelix transcription factor ASR3-like [Cucurbita maxima] >XP_022976252.1 trihelix transcription factor ASR3-like [Cucurbita maxima])

HSP 1 Score: 457.2 bits (1175), Expect = 1.1e-124
Identity = 237/304 (77.96%), Positives = 262/304 (86.18%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVA 60
           MKKEN  NRG G SGSRRTRSQI+P+WTAA+CLVLVNVI AVEADC+K LSSYQKWKIVA
Sbjct: 1   MKKEN-GNRGSGVSGSRRTRSQIAPEWTAAECLVLVNVIGAVEADCMKALSSYQKWKIVA 60

Query: 61  ENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDNFD 120
           E+CT+L+VARTS QCR+KW+CLLIEHDVI+QWEL MPEDDS+WCLESGRRKELGLPDNFD
Sbjct: 61  EDCTALNVARTSNQCRKKWECLLIEHDVIRQWELTMPEDDSYWCLESGRRKELGLPDNFD 120

Query: 121 EELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEKSL 180
           EELFKAI NV++MRANQSDTEPD+D EAAVEN DEI+EPGPKRQRR SMSK NQ LEKSL
Sbjct: 121 EELFKAIYNVSSMRANQSDTEPDNDPEAAVENADEISEPGPKRQRRGSMSKRNQGLEKSL 180

Query: 181 ECERNQALEKCLECKKEVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMA 240
           EC             KE EEEE +E+PLLS PE + R+CYIK+NG+K TD+IEP+EQMM 
Sbjct: 181 EC-------------KEDEEEEAEEQPLLSSPEADLRDCYIKNNGAKATDDIEPEEQMMV 240

Query: 241 KFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLL 300
           K LLENAE VQ IVSENAE  TSDEKNDKDQTNL+R QGSKLIRCLGD LNTI+DLR LL
Sbjct: 241 KKLLENAENVQEIVSENAECVTSDEKNDKDQTNLIRRQGSKLIRCLGDFLNTINDLRDLL 290

Query: 301 EDFE 305
           EDFE
Sbjct: 301 EDFE 290

BLAST of HG10013507 vs. ExPASy TrEMBL
Match: A0A0A0LDW0 (Myb-like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G882960 PE=4 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 5.8e-140
Identity = 269/312 (86.22%), Positives = 280/312 (89.74%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+WCL SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEK 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+ N DEIAEPGPKRQRRRSMSKSNQ LEK
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEK 180

Query: 181 SLECERNQALEKCLECKKEVEE------EEEKEKPLLSFPEVEPRECYIKSNGSKLTDNI 240
           SLECERN  LE  LEC KEVE+      EE +EKPLLS PE+EPRECYIKSN SK+TDNI
Sbjct: 181 SLECERNLGLEISLEC-KEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNI 240

Query: 241 EPKEQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNT 300
           EPKEQMMAKFLLENAEKVQAIVSENAEY TSDEK  KDQTNLVRHQGSKLIRCLGDILNT
Sbjct: 241 EPKEQMMAKFLLENAEKVQAIVSENAEYTTSDEKCAKDQTNLVRHQGSKLIRCLGDILNT 300

Query: 301 IDDLRGLLEDFE 305
           I+DLRGLLED E
Sbjct: 301 INDLRGLLEDCE 311

BLAST of HG10013507 vs. ExPASy TrEMBL
Match: A0A1S3CQW8 (trihelix transcription factor ASR3 OS=Cucumis melo OX=3656 GN=LOC103503736 PE=4 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 2.5e-135
Identity = 261/309 (84.47%), Positives = 277/309 (89.64%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+W L SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEK 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+EN +EIAEPGPKRQRRRSMSKSNQALE 
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEPGPKRQRRRSMSKSNQALEN 180

Query: 181 SLECERNQALEKCLECKK-----EVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIE 240
           S ECERNQALE  LECK+     E E EE KEKPLLS PE+E +E YIKSN SK+ D++E
Sbjct: 181 SPECERNQALEISLECKEVEDGGEGEGEEVKEKPLLSSPELESQEYYIKSNESKVADDVE 240

Query: 241 PKEQMMAKFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTI 300
           PKEQMMAKFLLENAEKVQAIVSENAEY TSDEK +KDQTNLVRHQGSKLIRCLGDILNTI
Sbjct: 241 PKEQMMAKFLLENAEKVQAIVSENAEYTTSDEKCNKDQTNLVRHQGSKLIRCLGDILNTI 300

Query: 301 DDLRGLLED 303
           +DLRGLL+D
Sbjct: 301 NDLRGLLKD 309

BLAST of HG10013507 vs. ExPASy TrEMBL
Match: A0A5A7T5I6 (Trihelix transcription factor ASR3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006480 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 9.3e-130
Identity = 261/346 (75.43%), Positives = 277/346 (80.06%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQI--SPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKI 60
           MKKENA NRG G SGSRRTRSQI  +P WTAADCLVLVNVIAAVEADCLK LSSYQKWKI
Sbjct: 1   MKKENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKI 60

Query: 61  VAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDN 120
           VAENCTSLDV RTS QCRRKWDCLLIEHDVIKQWELKMP+DDS+W L SGRRKELGLP+N
Sbjct: 61  VAENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPEN 120

Query: 121 FDEELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEP------------------- 180
           FDEELFKAIDNVA+MRANQSDTEPDSD EAA+EN +EIAEP                   
Sbjct: 121 FDEELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEPGMGQFELSILLVLTCIMLD 180

Query: 181 ------------------GPKRQRRRSMSKSNQALEKSLECERNQALEKCLECKK----- 240
                             GPKRQRRRSMSKSNQALE S ECERNQALE  LECK+     
Sbjct: 181 LIRWDFLSGLFWHPNFLMGPKRQRRRSMSKSNQALENSPECERNQALEISLECKEVEDGG 240

Query: 241 EVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMAKFLLENAEKVQAIVSE 300
           E E EE KEKPLLS PE+E +E YIKSN SK+ D++EPKEQMMAKFLLENAEKVQAIVSE
Sbjct: 241 EGEGEEVKEKPLLSSPELESQEYYIKSNESKVADDVEPKEQMMAKFLLENAEKVQAIVSE 300

Query: 301 NAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLLED 303
           NAEY TSDEK +KDQTNLVRHQGSKLIRCLGDILNTI+DLRGLL+D
Sbjct: 301 NAEYTTSDEKCNKDQTNLVRHQGSKLIRCLGDILNTINDLRGLLKD 346

BLAST of HG10013507 vs. ExPASy TrEMBL
Match: A0A6J1IN02 (trihelix transcription factor ASR3-like OS=Cucurbita maxima OX=3661 GN=LOC111476700 PE=4 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 5.3e-125
Identity = 237/304 (77.96%), Positives = 262/304 (86.18%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVA 60
           MKKEN  NRG G SGSRRTRSQI+P+WTAA+CLVLVNVI AVEADC+K LSSYQKWKIVA
Sbjct: 1   MKKEN-GNRGSGVSGSRRTRSQIAPEWTAAECLVLVNVIGAVEADCMKALSSYQKWKIVA 60

Query: 61  ENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDNFD 120
           E+CT+L+VARTS QCR+KW+CLLIEHDVI+QWEL MPEDDS+WCLESGRRKELGLPDNFD
Sbjct: 61  EDCTALNVARTSNQCRKKWECLLIEHDVIRQWELTMPEDDSYWCLESGRRKELGLPDNFD 120

Query: 121 EELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEKSL 180
           EELFKAI NV++MRANQSDTEPD+D EAAVEN DEI+EPGPKRQRR SMSK NQ LEKSL
Sbjct: 121 EELFKAIYNVSSMRANQSDTEPDNDPEAAVENADEISEPGPKRQRRGSMSKRNQGLEKSL 180

Query: 181 ECERNQALEKCLECKKEVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMA 240
           EC             KE EEEE +E+PLLS PE + R+CYIK+NG+K TD+IEP+EQMM 
Sbjct: 181 EC-------------KEDEEEEAEEQPLLSSPEADLRDCYIKNNGAKATDDIEPEEQMMV 240

Query: 241 KFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLL 300
           K LLENAE VQ IVSENAE  TSDEKNDKDQTNL+R QGSKLIRCLGD LNTI+DLR LL
Sbjct: 241 KKLLENAENVQEIVSENAECVTSDEKNDKDQTNLIRRQGSKLIRCLGDFLNTINDLRDLL 290

Query: 301 EDFE 305
           EDFE
Sbjct: 301 EDFE 290

BLAST of HG10013507 vs. ExPASy TrEMBL
Match: A0A6J1FEH7 (trihelix transcription factor ASR3-like OS=Cucurbita moschata OX=3662 GN=LOC111443343 PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 1.3e-123
Identity = 237/304 (77.96%), Positives = 261/304 (85.86%), Query Frame = 0

Query: 1   MKKENAANRGPGASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVA 60
           MKKEN  NRG G SGSRRTRSQI+P+WTAA+CLVLVNVI AVEADCLK LSSYQKWKIVA
Sbjct: 1   MKKEN-GNRGSGVSGSRRTRSQIAPEWTAAECLVLVNVIGAVEADCLKALSSYQKWKIVA 60

Query: 61  ENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDNFD 120
           E+CT+L+VARTS QCR+KW+CLLIEHDVIKQWEL MPEDDS+WCLESGRRKELGLPDNFD
Sbjct: 61  EDCTALNVARTSNQCRKKWECLLIEHDVIKQWELTMPEDDSYWCLESGRRKELGLPDNFD 120

Query: 121 EELFKAIDNVATMRANQSDTEPDSDLEAAVENTDEIAEPGPKRQRRRSMSKSNQALEKSL 180
           EELFKAIDNV++MRANQSDTEPD+D EAAVEN DEI+EPGPKRQRR SMSK NQ LEKSL
Sbjct: 121 EELFKAIDNVSSMRANQSDTEPDNDPEAAVENADEISEPGPKRQRRGSMSKRNQGLEKSL 180

Query: 181 ECERNQALEKCLECKKEVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMA 240
           E              KE EE+E +E+PLLS PE + R+CYIK+NG+  TD+IEP+EQMM 
Sbjct: 181 EF-------------KEDEEDEAEEQPLLSSPESDLRDCYIKNNGATATDDIEPEEQMMV 240

Query: 241 KFLLENAEKVQAIVSENAEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLL 300
           K LLENAE VQ IVSENAE ATSDEKNDKDQTNL+R QGSKLIRCLGD LNTI+DLR LL
Sbjct: 241 KKLLENAENVQEIVSENAECATSDEKNDKDQTNLIRRQGSKLIRCLGDFLNTINDLRDLL 290

Query: 301 EDFE 305
           ED E
Sbjct: 301 EDCE 290

BLAST of HG10013507 vs. TAIR 10
Match: AT4G31270.1 (sequence-specific DNA binding transcription factors )

HSP 1 Score: 172.9 bits (437), Expect = 3.8e-43
Identity = 118/299 (39.46%), Positives = 168/299 (56.19%), Query Frame = 0

Query: 12  GASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVAENCTSLDVART 71
           G SGSRRTRSQ++P+W   DCLVLVN IAAVEADC   LSS+QKW ++ ENC +LDV+R 
Sbjct: 4   GTSGSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRN 63

Query: 72  SYQCRRKWDCLLIEHDVIKQWELK-MPEDDSFWCLESGRRKELGLPDNFDEELFKAIDNV 131
             QCRRKWD L+ +++ IK+WE +      S+W L S +RK L LP + D ELF+AI+ V
Sbjct: 64  LNQCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAV 123

Query: 132 ATMRANQSDTEPDSDLEA--AVENTDEIAEPGPKRQRRRSM-SKSNQALEKSLECERNQA 191
             ++  ++ TE DSD EA   V+ + E+A  G KR R+R+M  K  +  E      +   
Sbjct: 124 VMIQDEKAGTESDSDPEAQDVVDLSAELAFVGSKRSRQRTMVMKETKKEEPRTSRVQVNT 183

Query: 192 LEKCLECKKEVEEEEEKEKPLLSFPEVEPRECYIKSNGSKLTDNIEPKEQMMAKFLLENA 251
            EK +  K   + +   EK        +P E          T NIE   ++M   L    
Sbjct: 184 REKPITTKATHQNKTMGEK--------KPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKI 243

Query: 252 EKVQAIVSEN--AEYATSDEKNDKDQTNLVRHQGSKLIRCLGDILNTIDDLRGLLEDFE 305
           + + AIV  N   +  T D  +  D+   VR QG +LI CL +I++T++ L  + ++ E
Sbjct: 244 DLIHAIVGRNLAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQEIE 294

BLAST of HG10013507 vs. TAIR 10
Match: AT2G35640.1 (Homeodomain-like superfamily protein )

HSP 1 Score: 47.4 bits (111), Expect = 2.4e-05
Identity = 37/146 (25.34%), Positives = 60/146 (41.10%), Query Frame = 0

Query: 26  DWTAADCLVLVNVIAAVEADCLKDLSSYQK------------WKIVAENCTSLDVARTSY 85
           +WT ++ LVL   I A + D  + +   +K            WK + E C      R   
Sbjct: 21  NWTVSETLVL---IEAKKMDDQRRVRRSEKQPEGRNKPAELRWKWIEEYCWRRGCYRNQN 80

Query: 86  QCRRKWDCLLIEHDVIKQWELKMPE-------DDSFWCLESGRRKELGLPDNFDEELFKA 145
           QC  KWD L+ ++  I+++E    E         S+W ++   RKE  LP N   +++  
Sbjct: 81  QCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNLPSNMLPQIYDV 140

Query: 146 IDNVATMRANQSDTEPDSDLEAAVEN 153
           +  +   +   S     S   AAV N
Sbjct: 141 LSELVDRKTLPS----SSSAAAAVGN 159

BLAST of HG10013507 vs. TAIR 10
Match: AT1G31310.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 47.0 bits (110), Expect = 3.2e-05
Identity = 29/113 (25.66%), Positives = 48/113 (42.48%), Query Frame = 0

Query: 55  KWKIVAENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDD-------------- 114
           +WK + + C      R+  QC  KWD L+ ++  ++++E +  E                
Sbjct: 63  RWKWIEDYCWRKGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAG 122

Query: 115 ---SFWCLESGRRKELGLPDNFDEELFKAIDNVATMRANQSDTEPDSDLEAAV 151
              S+W +E   RKE  LP N   + ++A+  V      +S T P S    AV
Sbjct: 123 ETASYWKMEKSERKERSLPSNMLPQTYQALFEVV-----ESKTLPSSTAVTAV 170

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038897371.13.4e-15091.26trihelix transcription factor ASR3 [Benincasa hispida][more]
XP_004136441.11.2e-13986.22trihelix transcription factor ASR3 [Cucumis sativus] >XP_011652540.1 trihelix tr... [more]
XP_008466281.15.2e-13584.47PREDICTED: trihelix transcription factor ASR3 [Cucumis melo][more]
KAA0038734.11.9e-12975.43trihelix transcription factor ASR3 [Cucumis melo var. makuwa] >TYK31347.1 trihel... [more]
XP_022976249.11.1e-12477.96trihelix transcription factor ASR3-like [Cucurbita maxima] >XP_022976250.1 trihe... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LDW05.8e-14086.22Myb-like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G882960 PE... [more]
A0A1S3CQW82.5e-13584.47trihelix transcription factor ASR3 OS=Cucumis melo OX=3656 GN=LOC103503736 PE=4 ... [more]
A0A5A7T5I69.3e-13075.43Trihelix transcription factor ASR3 OS=Cucumis melo var. makuwa OX=1194695 GN=E56... [more]
A0A6J1IN025.3e-12577.96trihelix transcription factor ASR3-like OS=Cucurbita maxima OX=3661 GN=LOC111476... [more]
A0A6J1FEH71.3e-12377.96trihelix transcription factor ASR3-like OS=Cucurbita moschata OX=3662 GN=LOC1114... [more]
Match NameE-valueIdentityDescription
AT4G31270.13.8e-4339.46sequence-specific DNA binding transcription factors [more]
AT2G35640.12.4e-0525.34Homeodomain-like superfamily protein [more]
AT1G31310.13.2e-0525.66hydroxyproline-rich glycoprotein family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 184..204
None | No IPR available | GENE3D | 1.10.10.60 |  | coord: 27..85; e-value: 7.3E-6; score: 28.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 134..179
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..22
None | No IPR available | PANTHER | PTHR33492:SF4 | OS02G0174300 PROTEIN | coord: 4..301
None | No IPR available | PANTHER | PTHR33492 | OSJNBA0043A12.37 PROTEIN-RELATED | coord: 4..301
IPR044822 | Myb/SANT-like DNA-binding domain 4 | PFAM | PF13837 | Myb_DNA-bind_4 | coord: 25..99; e-value: 6.7E-7; score: 29.6
IPR001005 | SANT/Myb domain | PROSITE | PS50090 | MYB_LIKE | coord: 27..83; score: 6.574026
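
The `coord` values above locate each annotation on the protein sequence. As an illustration, the sketch below extracts the Pfam PF13837 region (coord: 25..99). It assumes the coordinates are 1-based and inclusive, which this page does not state. The helper `slice_feature` is hypothetical, and only the first 120 residues of the protein are embedded to keep the example short.

```python
def slice_feature(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return protein[start - 1:end]

# First 120 residues of the HG10013507 protein sequence listed in this record.
protein_prefix = (
    "MKKENAANRGPGASGSRRTRSQISPDWTAADCLVLVNVIAAVEADCLKDLSSYQKWKIVA"
    "ENCTSLDVARTSYQCRRKWDCLLIEHDVIKQWELKMPEDDSFWCLESGRRKELGLPDNFD"
)

pfam_region = slice_feature(protein_prefix, 25, 99)
print(len(pfam_region), pfam_region[:10])  # 75 PDWTAADCLV
```

The same helper applies to the other rows, for example the PROSITE MYB_LIKE profile at 27..83.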

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10013507.1 | HG10013507.1 | mRNA