Cp4.1LG19g01230.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG19g01230.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTranscription factor ORG2-like protein
LocationCp4.1LG19 : 925435 .. 927380 (-)
Sequence length636
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGGGTGGATTCCAGGGAACCCTCAGTCTACAAACGGCCGCTCATCGGAGCTGTTCATTCAGTTCTCATCACCACAGCCATCTCATCATGTCAAGGTTGAGAGCTGTTCTCCACCATCAGCTGGTGACGGTCATCATCGTGGTGGCGGTGGCTGTGGCGGTGATGATCCGCTCGCTGTAGCTAAGAAACTTAAACATAATGCTAGCGAACGCGATCGTCGCAAGAAGATGAACTCTCTCTATACTTCTCTTCGCTGCTTGCTTCCATCGACAAATCGAACGGTAGGTTTTGCTTCTTCGTCAGGCATGAATGTGTTAATATGTTTTTCAAATTTGGATATAAATTCTAACAGGTTTGTGATATCCCACGTTGGGAAAGTTCTTTTATAAGAGTGTGAAAACTTCTCTCTGTCAAACGCATTTTAAAAACCTTGAGGGGAAGCGTGAAGGGGAAAACCCAAAGAGGAAAATATCCGTTAGCGGTGGGCCTGAGCTGTTACAAATGGTATCAGAGCCAGACACCGGGCGATGTGTCAATTAGGAGGTTGAGCCCCGAAGGGGGTGGATACGAGGCGGTATGTCAGCAAGGACGTTGGGCCCCGAAAGGGGGTGGATTATGATATCCCACATCGGTTGGTGAGGAGAACGAAGCATTCTTTATAAGTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACTCGTTTTAAAAACTTTAAGAGGAAGCTCGAAAGGAAAAACTCAAAGAGGACAATATCTACCGGGGCTGAGCTGTTACAAATGGTATTAGAGCAAGGCTTCGGGCAATATGCCAACGAGGAGGTTGAGTCTCGAAGGGGGGTGAACACGAGGCGGTGTGCCAACAAGGACGCTGGGCCCTGAAAGGAGGTGGACTGGGGAGTCCCACGTCGATTGGAGAAGGGAACGAGTGTCAGCAAGGACGTGGGCTCCAAAGGGGGGGGTGAACTATGATATCCCATGTCAGTTGCGGAGGAAAACAAAACATTCTTTATAAGGGTGTGAAAACCTTTCCTATCATACGCGTTTTAAAAACTTTGAAGAGAAGTCCGAAAGAAAAAGTCGTCCAAAGAGGACAATATCTACTAGCGTTAGACTTAGACTATTTAAGGAGTATTCATGGGTTGGATTTAGTCAGGTTAGAACATTTTTTGACGGTCGAAAATCACTTCTTGCAGAAAACAATGAGTAATCCAGCCACAATATCAAGGGCTTTGAAGTATATACCTGAGCTAAGACAGCATGTGGAGGAGCTAAGGAGAAGGAAGCAAGGGCTTGAGACAAAGATTAATGCCATGACTCATGACCAACAAAAGCAAGTGAGGAAAAACAAGGAAGCCCCATGGATGGGTTGTTCTTCATGTGCTGTGAATTGGCTAAGAGAAAATGAAGCATTGCTTCAACTCACTTCTAATGATACCTTCAAACTCAAATTTTCACAAATTCTCCGGTCTCTAGACGAAGATGAACTTCTCGTTAAGACCGTTTCTACCTTTCAATCATTCGATGGGAGGCACTTCTTTACTCTGCTCGTTCAGGTATAATATAATATTTTTTCCATACAAATGTTCGTGGGTTGGATCGGGTCAGGTCGGATTATAATATTTTTCCGACCCAATCTAATTATTTGGGTTGTGAATTTGTGAGATCTAACATCATCAGTTGAAACCTCTTCATAATCGACCCGTTTTAAAACATTGAGGAAAACCTTGATACTTATATACATGTATGTTGATATTTTTTTTTTTCTCGGGGGTTCGTACTAACTAAATATTTAAGAAAACGGAGATATTTTATTTGTTTATTTAAAGTTTTAAGGCTTAATGAAATATGCATCATATAATTTAATTTAATTTTATGTTTAATGCAGGCTAAGCCAAATACTCCCTCAAGAGTGCTGCAAGAGATTTTGAATAGAAAGTTCTAG

mRNA sequence

ATGGTGGGGAACCCTCAGTCTACAAACGGCCGCTCATCGGAGCTGTTCATTCAGTTCTCATCACCACAGCCATCTCATCATGTCAAGGTTGAGAGCTGTTCTCCACCATCAGCTGGTGACGGTCATCATCGTGGTGGCGGTGGCTGTGGCGGTGATGATCCGCTCGCTGTAGCTAAGAAACTTAAACATAATGCTAGCGAACGCGATCGTCGCAAGAAGATGAACTCTCTCTATACTTCTCTTCGCTGCTTGCTTCCATCGACAAATCGAACGAAAACAATGAGTAATCCAGCCACAATATCAAGGGCTTTGAAGTATATACCTGAGCTAAGACAGCATGTGGAGGAGCTAAGGAGAAGGAAGCAAGGGCTTGAGACAAAGATTAATGCCATGACTCATGACCAACAAAAGCAAGTGAGGAAAAACAAGGAAGCCCCATGGATGGGTTGTTCTTCATGTGCTGTGAATTGGCTAAGAGAAAATGAAGCATTGCTTCAACTCACTTCTAATGATACCTTCAAACTCAAATTTTCACAAATTCTCCGGTCTCTAGACGAAGATGAACTTCTCGTTAAGACCGCTAAGCCAAATACTCCCTCAAGAGTGCTGCAAGAGATTTTGAATAGAAAGTTCTAG

Coding sequence (CDS)

ATGGTGGGGAACCCTCAGTCTACAAACGGCCGCTCATCGGAGCTGTTCATTCAGTTCTCATCACCACAGCCATCTCATCATGTCAAGGTTGAGAGCTGTTCTCCACCATCAGCTGGTGACGGTCATCATCGTGGTGGCGGTGGCTGTGGCGGTGATGATCCGCTCGCTGTAGCTAAGAAACTTAAACATAATGCTAGCGAACGCGATCGTCGCAAGAAGATGAACTCTCTCTATACTTCTCTTCGCTGCTTGCTTCCATCGACAAATCGAACGAAAACAATGAGTAATCCAGCCACAATATCAAGGGCTTTGAAGTATATACCTGAGCTAAGACAGCATGTGGAGGAGCTAAGGAGAAGGAAGCAAGGGCTTGAGACAAAGATTAATGCCATGACTCATGACCAACAAAAGCAAGTGAGGAAAAACAAGGAAGCCCCATGGATGGGTTGTTCTTCATGTGCTGTGAATTGGCTAAGAGAAAATGAAGCATTGCTTCAACTCACTTCTAATGATACCTTCAAACTCAAATTTTCACAAATTCTCCGGTCTCTAGACGAAGATGAACTTCTCGTTAAGACCGCTAAGCCAAATACTCCCTCAAGAGTGCTGCAAGAGATTTTGAATAGAAAGTTCTAG

Protein sequence

MVGNPQSTNGRSSELFIQFSSPQPSHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDTFKLKFSQILRSLDEDELLVKTAKPNTPSRVLQEILNRKF
BLAST of Cp4.1LG19g01230.1 vs. Swiss-Prot
Match: BH100_ARATH (Transcription factor bHLH100 OS=Arabidopsis thaliana GN=BHLH100 PE=2 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 3.9e-20
Identity = 59/141 (41.84%), Postives = 93/141 (65.96%), Query Frame = 1

Query: 52  DDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELR 111
           D+P+ V KKL HNASER+RRKK+N++++SLR  LP TN+TK +S  AT+S+ALKYIPEL+
Sbjct: 56  DNPV-VMKKLNHNASERERRKKINTMFSSLRSCLPPTNQTKKLSVSATVSQALKYIPELQ 115

Query: 112 QHVEELRRRKQGLETKINAMTH-DQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSN 171
           + V++L ++K+ L  +I+         Q  K++E      S+ +   L E E ++Q++S 
Sbjct: 116 EQVKKLMKKKEELSFQISGQRDLVYTDQNSKSEEGVTSYASTVSSTRLSETEVMVQISSL 175

Query: 172 DTFKLKFSQILRSLDEDELLV 192
            T K  F  +L  ++ED L++
Sbjct: 176 QTEKCSFGNVLSGVEEDGLVL 195

BLAST of Cp4.1LG19g01230.1 vs. Swiss-Prot
Match: ORG3_ARATH (Transcription factor ORG3 OS=Arabidopsis thaliana GN=ORG3 PE=1 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 8.7e-20
Identity = 55/140 (39.29%), Postives = 93/140 (66.43%), Query Frame = 1

Query: 52  DDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELR 111
           D+   V KKL HNASERDRR+K+NSL++SLR  LP++ ++K +S PAT+SR+LKYIPEL+
Sbjct: 70  DNNPVVVKKLNHNASERDRRRKINSLFSSLRSCLPASGQSKKLSIPATVSRSLKYIPELQ 129

Query: 112 QHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSND 171
           + V++L ++K+ L  +I+    + +  V++  +A     S+ +   L +NE ++Q++S+ 
Sbjct: 130 EQVKKLIKKKEELLVQISGQ-RNTECYVKQPPKAVANYISTVSATRLGDNEVMVQISSSK 189

Query: 172 TFKLKFSQILRSLDEDELLV 192
                 S +L  L+ED  ++
Sbjct: 190 IHNFSISNVLSGLEEDRFVL 208

BLAST of Cp4.1LG19g01230.1 vs. Swiss-Prot
Match: BH101_ARATH (Transcription factor bHLH101 OS=Arabidopsis thaliana GN=BHLH101 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 1.6e-18
Identity = 58/149 (38.93%), Postives = 92/149 (61.74%), Query Frame = 1

Query: 55  LAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHV 114
           + + KKL HNASERDRR+K+N+LY+SLR LLP +++ + +S P T++R +KYIPE +Q +
Sbjct: 62  VVLEKKLNHNASERDRRRKLNALYSSLRALLPLSDQKRKLSIPMTVARVVKYIPEQKQEL 121

Query: 115 EELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSS---CAVNWLRENEALLQLTSND 174
           + L RRK+ L  +I+  TH  Q+Q+R       +  SS    A NWL + E  +Q+ ++ 
Sbjct: 122 QRLSRRKEELLKRISRKTH--QEQLRNKAMMDSIDSSSSQRIAANWLTDTEIAVQIATSK 181

Query: 175 TFKLKFSQILRSLDEDELLVKTAKPNTPS 201
              +  S +L  L+E+ L V +   +  S
Sbjct: 182 WTSV--SDMLLRLEENGLNVISVSSSVSS 206

BLAST of Cp4.1LG19g01230.1 vs. Swiss-Prot
Match: ORG2_ARATH (Transcription factor ORG2 OS=Arabidopsis thaliana GN=ORG2 PE=1 SV=1)

HSP 1 Score: 91.7 bits (226), Expect = 1.1e-17
Identity = 59/175 (33.71%), Postives = 102/175 (58.29%), Query Frame = 1

Query: 16  FIQFSSPQP---SHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASERDRRK 75
           F++ + PQ    +HH      S  S G+           D+   V KKL HNASERDRRK
Sbjct: 35  FLELTVPQTYEVTHHQNSLGVSVSSEGNEI---------DNNPVVVKKLNHNASERDRRK 94

Query: 76  KMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMT 135
           K+N+L++SLR  LP+++++K +S P T+S++LKYIPEL+Q V+ L ++K+ +  +++   
Sbjct: 95  KINTLFSSLRSCLPASDQSKKLSIPETVSKSLKYIPELQQQVKRLIQKKEEILVRVSGQ- 154

Query: 136 HDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDTFKLKFSQILRSLDED 188
            D +   ++  +A     S+ +   L +NE ++Q++S+       S +L  ++ED
Sbjct: 155 RDFELYDKQQPKAVASYLSTVSATRLGDNEVMVQVSSSKIHNFSISNVLGGIEED 199

BLAST of Cp4.1LG19g01230.1 vs. TrEMBL
Match: A0A0A0KYH3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G434980 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 1.8e-24
Identity = 77/139 (55.40%), Postives = 90/139 (64.75%), Query Frame = 1

Query: 92  KTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCS 151
           K MSNP+TIS+ALKYIPEL+Q VE LRRRK+GL TK+N    +  KQ+RKN + PWM  S
Sbjct: 6   KRMSNPSTISKALKYIPELQQQVEGLRRRKEGLVTKLN---EENLKQIRKNNKEPWMS-S 65

Query: 152 SCAVNWLRENEALLQLTSNDT--FKLKFSQILRSLDEDELLVKT---------------- 211
            CAVNWL E EALLQ+   D    +L FSQIL SL+ED LL+ T                
Sbjct: 66  FCAVNWLSETEALLQIALEDQTHTQLPFSQILLSLEEDGLLLLTASSFRSFNGRLFLTLL 125

BLAST of Cp4.1LG19g01230.1 vs. TrEMBL
Match: M5WCF6_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019552mg PE=4 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 7.1e-21
Identity = 69/150 (46.00%), Postives = 98/150 (65.33%), Query Frame = 1

Query: 50  GGDDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPE 109
           GG D L V KKL HNASERD RKK+N+LY++LR LLP++ + K +SNPA ISRA+KYIPE
Sbjct: 36  GGRD-LTVVKKLNHNASERDDRKKINTLYSTLRSLLPASYQMKKLSNPAAISRAVKYIPE 95

Query: 110 LRQHVEELRRRKQGLETKINAMTHD-----QQKQVRKNKEAPWMGCSSCAVNWLRENEAL 169
           L+Q V+ L ++K+ L +++  +         +KQ R +     +  S+ AV+WL + E +
Sbjct: 96  LQQQVKGLIQKKEELLSRLRRLQQQGDPIYNEKQSR-SAALSSLSASAFAVSWLNDREVV 155

Query: 170 LQLTSNDTFKLKFSQILRSLDEDELLVKTA 195
           LQ++S    K   SQIL  L+ED LL+  A
Sbjct: 156 LQISSYVVQKSPLSQILVDLEEDGLLLLNA 183

BLAST of Cp4.1LG19g01230.1 vs. TrEMBL
Match: A0A067DMC3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g025477mg PE=4 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 2.1e-20
Identity = 68/142 (47.89%), Postives = 94/142 (66.20%), Query Frame = 1

Query: 53  DPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQ 112
           DP  V KKL HNASERDRRKK+NSLY+SLR LLP  ++TK +S PAT+SR LKYIPEL+Q
Sbjct: 60  DPTMV-KKLYHNASERDRRKKINSLYSSLRSLLPVADQTKKLSIPATVSRVLKYIPELQQ 119

Query: 113 HVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDT 172
            VE L ++K+ L +KI+       +Q  + K A     +S + + L + E L+Q++S   
Sbjct: 120 QVERLMQKKEELLSKISKPGEISHQQ-HQRKIAIGSSLASISASRLSDMEILIQISSYKV 179

Query: 173 FKLKFSQILRSLDEDELLVKTA 195
            K   S+IL +L+ED L++  A
Sbjct: 180 HKCPLSKILFNLEEDGLVLVNA 199

BLAST of Cp4.1LG19g01230.1 vs. TrEMBL
Match: B9IHR7_POPTR (Basic helix-loop-helix family protein OS=Populus trichocarpa GN=POPTR_0016s03680g PE=4 SV=2)

HSP 1 Score: 107.5 bits (267), Expect = 2.1e-20
Identity = 72/183 (39.34%), Postives = 106/183 (57.92%), Query Frame = 1

Query: 9   NGRSSELFIQFSSPQPSHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASER 68
           +G + E F  F   QP       S S  +     H G G     DP ++AKKL HNASER
Sbjct: 32  DGETPESFTHFPPSQPDVRQLDRSTSFTA-----HSGSG-----DP-SMAKKLNHNASER 91

Query: 69  DRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKI 128
           DRRKK+NSLY+SLR LLP+ ++ K +S P T+SR L YIP+L+Q VE L +RK+ L +K+
Sbjct: 92  DRRKKINSLYSSLRSLLPAADQRKKLSIPYTVSRVLVYIPKLQQQVERLIQRKEELLSKL 151

Query: 129 NAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDTFKLKFSQILRSLDEDE 188
           +    D   Q  + K   +   SS + + L + E ++ +++N   +   S+IL +L+E  
Sbjct: 152 SRQADDLTHQENQRKGTMYSSLSSVSASRLSDREVVIHISTNKLHRSSLSEILVNLEEAG 203

Query: 189 LLV 192
           LL+
Sbjct: 212 LLL 203

BLAST of Cp4.1LG19g01230.1 vs. TrEMBL
Match: B9SZX1_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0071670 PE=4 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 4.6e-20
Identity = 62/143 (43.36%), Postives = 89/143 (62.24%), Query Frame = 1

Query: 51  GDDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPEL 110
           G D   + KKL HNASERDRRKKMN+LY+SLR L P+ +  K +S PATISR LKYIPEL
Sbjct: 67  GGDANDMVKKLNHNASERDRRKKMNTLYSSLRSLFPAADEMKKLSIPATISRVLKYIPEL 126

Query: 111 RQHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLT-- 170
           ++ +E L +RK+ +  +I+   H    Q+ + K       S  + N + + EA++Q++  
Sbjct: 127 QEQLERLVQRKEEILLRISKQNHIVNPQINQRKGTSHSSLSVVSANQISDKEAIIQISTY 186

Query: 171 SNDTFKLKFSQILRSLDEDELLV 192
           SN       S+IL  L+E+ LL+
Sbjct: 187 SNTIHTSPLSEILLLLEEEGLLL 209

BLAST of Cp4.1LG19g01230.1 vs. TAIR10
Match: AT2G41240.1 (AT2G41240.1 basic helix-loop-helix protein 100)

HSP 1 Score: 99.8 bits (247), Expect = 2.2e-21
Identity = 59/141 (41.84%), Postives = 93/141 (65.96%), Query Frame = 1

Query: 52  DDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELR 111
           D+P+ V KKL HNASER+RRKK+N++++SLR  LP TN+TK +S  AT+S+ALKYIPEL+
Sbjct: 56  DNPV-VMKKLNHNASERERRKKINTMFSSLRSCLPPTNQTKKLSVSATVSQALKYIPELQ 115

Query: 112 QHVEELRRRKQGLETKINAMTH-DQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSN 171
           + V++L ++K+ L  +I+         Q  K++E      S+ +   L E E ++Q++S 
Sbjct: 116 EQVKKLMKKKEELSFQISGQRDLVYTDQNSKSEEGVTSYASTVSSTRLSETEVMVQISSL 175

Query: 172 DTFKLKFSQILRSLDEDELLV 192
            T K  F  +L  ++ED L++
Sbjct: 176 QTEKCSFGNVLSGVEEDGLVL 195

BLAST of Cp4.1LG19g01230.1 vs. TAIR10
Match: AT3G56980.1 (AT3G56980.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 98.6 bits (244), Expect = 4.9e-21
Identity = 55/140 (39.29%), Postives = 93/140 (66.43%), Query Frame = 1

Query: 52  DDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELR 111
           D+   V KKL HNASERDRR+K+NSL++SLR  LP++ ++K +S PAT+SR+LKYIPEL+
Sbjct: 70  DNNPVVVKKLNHNASERDRRRKINSLFSSLRSCLPASGQSKKLSIPATVSRSLKYIPELQ 129

Query: 112 QHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSND 171
           + V++L ++K+ L  +I+    + +  V++  +A     S+ +   L +NE ++Q++S+ 
Sbjct: 130 EQVKKLIKKKEELLVQISGQ-RNTECYVKQPPKAVANYISTVSATRLGDNEVMVQISSSK 189

Query: 172 TFKLKFSQILRSLDEDELLV 192
                 S +L  L+ED  ++
Sbjct: 190 IHNFSISNVLSGLEEDRFVL 208

BLAST of Cp4.1LG19g01230.1 vs. TAIR10
Match: AT5G04150.1 (AT5G04150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 94.4 bits (233), Expect = 9.2e-20
Identity = 58/149 (38.93%), Postives = 92/149 (61.74%), Query Frame = 1

Query: 55  LAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHV 114
           + + KKL HNASERDRR+K+N+LY+SLR LLP +++ + +S P T++R +KYIPE +Q +
Sbjct: 62  VVLEKKLNHNASERDRRRKLNALYSSLRALLPLSDQKRKLSIPMTVARVVKYIPEQKQEL 121

Query: 115 EELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSS---CAVNWLRENEALLQLTSND 174
           + L RRK+ L  +I+  TH  Q+Q+R       +  SS    A NWL + E  +Q+ ++ 
Sbjct: 122 QRLSRRKEELLKRISRKTH--QEQLRNKAMMDSIDSSSSQRIAANWLTDTEIAVQIATSK 181

Query: 175 TFKLKFSQILRSLDEDELLVKTAKPNTPS 201
              +  S +L  L+E+ L V +   +  S
Sbjct: 182 WTSV--SDMLLRLEENGLNVISVSSSVSS 206

BLAST of Cp4.1LG19g01230.1 vs. TAIR10
Match: AT3G56970.1 (AT3G56970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 91.7 bits (226), Expect = 6.0e-19
Identity = 59/175 (33.71%), Postives = 102/175 (58.29%), Query Frame = 1

Query: 16  FIQFSSPQP---SHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASERDRRK 75
           F++ + PQ    +HH      S  S G+           D+   V KKL HNASERDRRK
Sbjct: 35  FLELTVPQTYEVTHHQNSLGVSVSSEGNEI---------DNNPVVVKKLNHNASERDRRK 94

Query: 76  KMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMT 135
           K+N+L++SLR  LP+++++K +S P T+S++LKYIPEL+Q V+ L ++K+ +  +++   
Sbjct: 95  KINTLFSSLRSCLPASDQSKKLSIPETVSKSLKYIPELQQQVKRLIQKKEEILVRVSGQ- 154

Query: 136 HDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDTFKLKFSQILRSLDED 188
            D +   ++  +A     S+ +   L +NE ++Q++S+       S +L  ++ED
Sbjct: 155 RDFELYDKQQPKAVASYLSTVSATRLGDNEVMVQVSSSKIHNFSISNVLGGIEED 199

BLAST of Cp4.1LG19g01230.1 vs. TAIR10
Match: AT1G71200.1 (AT1G71200.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 48.5 bits (114), Expect = 5.8e-06
Identity = 31/70 (44.29%), Postives = 42/70 (60.00%), Query Frame = 1

Query: 58  AKKLKHNASERDRRKKMNSLYTSLRCLLP---STNRTKTMSNPATISRALKYIPELRQHV 117
           AKK  HNA ER RR ++++ Y +L  LLP   S++  K  S P+ I   + YIP+L+  V
Sbjct: 60  AKKQDHNAKERLRRMRLHASYLTLGTLLPDHSSSSSKKKWSAPSIIDNVITYIPKLQNEV 119

Query: 118 EELRRRKQGL 125
            EL  RKQ L
Sbjct: 120 GELTLRKQKL 129

BLAST of Cp4.1LG19g01230.1 vs. NCBI nr
Match: gi|778697748|ref|XP_004144366.2| (PREDICTED: transcription factor bHLH101-like [Cucumis sativus])

HSP 1 Score: 192.6 bits (488), Expect = 7.1e-46
Identity = 121/217 (55.76%), Postives = 141/217 (64.98%), Query Frame = 1

Query: 14  ELFIQFSSPQPSHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASERDRRKK 73
           E F QFSS QPS  VK+ESC  P +         G   DD   +AKKLKHNA+ERDRR+K
Sbjct: 2   EPFFQFSSLQPSQQVKLESCPTPLSA--------GVVADDITDMAKKLKHNANERDRRRK 61

Query: 74  MNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMTH 133
           +NSLY SLRCLLP T+  K MSNP+TIS+ALKYIPEL+Q VE LRRRK+GL TK+N    
Sbjct: 62  INSLYCSLRCLLPPTDSMKRMSNPSTISKALKYIPELQQQVEGLRRRKEGLVTKLN---E 121

Query: 134 DQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDT--FKLKFSQILRSLDEDELLV 193
           +  KQ+RKN + PWM  S CAVNWL E EALLQ+   D    +L FSQIL SL+ED LL+
Sbjct: 122 ENLKQIRKNNKEPWMS-SFCAVNWLSETEALLQIALEDQTHTQLPFSQILLSLEEDGLLL 181

Query: 194 KT------------------AKPNTPSRVLQEILNRK 211
            T                  AK NT  RVLQEILN+K
Sbjct: 182 LTASSFRSFNGRLFLTLLLQAKANTLPRVLQEILNKK 206

BLAST of Cp4.1LG19g01230.1 vs. NCBI nr
Match: gi|659111343|ref|XP_008455701.1| (PREDICTED: transcription factor bHLH101-like [Cucumis melo])

HSP 1 Score: 186.0 bits (471), Expect = 6.6e-44
Identity = 118/217 (54.38%), Postives = 140/217 (64.52%), Query Frame = 1

Query: 14  ELFIQFSSPQPSHHVKVESCSPPSAGDGHHRGGGGCGGDDPLAVAKKLKHNASERDRRKK 73
           E F QFSS QPS  VK+ESC    +         G  GDD  A+ KKLKHNA+ERDRR+K
Sbjct: 2   EPFYQFSSLQPSQQVKLESCPTLLSA--------GVVGDDITAMDKKLKHNANERDRRRK 61

Query: 74  MNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMTH 133
           +NSLY SLRCLLP T+  K MSNP+TIS+ALKYIPEL+Q VE LRRRK+GL TK+N    
Sbjct: 62  INSLYYSLRCLLPPTDSMKRMSNPSTISKALKYIPELQQQVEGLRRRKEGLVTKLN---E 121

Query: 134 DQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDT--FKLKFSQILRSLDEDELLV 193
           +  KQ+RKN + PWM  S CAVNWL E EALLQ+   +    +L FSQIL SL++D LL+
Sbjct: 122 ENLKQIRKNNKEPWMS-SLCAVNWLSETEALLQIALEEQTHTQLPFSQILLSLEDDGLLL 181

Query: 194 KT------------------AKPNTPSRVLQEILNRK 211
            T                  AK N   RVLQEILN+K
Sbjct: 182 STASSFRSSNGSLFFTLLLQAKANILPRVLQEILNKK 206

BLAST of Cp4.1LG19g01230.1 vs. NCBI nr
Match: gi|700199567|gb|KGN54725.1| (hypothetical protein Csa_4G434980 [Cucumis sativus])

HSP 1 Score: 120.9 bits (302), Expect = 2.6e-24
Identity = 77/139 (55.40%), Postives = 90/139 (64.75%), Query Frame = 1

Query: 92  KTMSNPATISRALKYIPELRQHVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCS 151
           K MSNP+TIS+ALKYIPEL+Q VE LRRRK+GL TK+N    +  KQ+RKN + PWM  S
Sbjct: 6   KRMSNPSTISKALKYIPELQQQVEGLRRRKEGLVTKLN---EENLKQIRKNNKEPWMS-S 65

Query: 152 SCAVNWLRENEALLQLTSNDT--FKLKFSQILRSLDEDELLVKT---------------- 211
            CAVNWL E EALLQ+   D    +L FSQIL SL+ED LL+ T                
Sbjct: 66  FCAVNWLSETEALLQIALEDQTHTQLPFSQILLSLEEDGLLLLTASSFRSFNGRLFLTLL 125

BLAST of Cp4.1LG19g01230.1 vs. NCBI nr
Match: gi|595811539|ref|XP_007203256.1| (hypothetical protein PRUPE_ppa019552mg, partial [Prunus persica])

HSP 1 Score: 109.0 bits (271), Expect = 1.0e-20
Identity = 69/150 (46.00%), Postives = 98/150 (65.33%), Query Frame = 1

Query: 50  GGDDPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPE 109
           GG D L V KKL HNASERD RKK+N+LY++LR LLP++ + K +SNPA ISRA+KYIPE
Sbjct: 36  GGRD-LTVVKKLNHNASERDDRKKINTLYSTLRSLLPASYQMKKLSNPAAISRAVKYIPE 95

Query: 110 LRQHVEELRRRKQGLETKINAMTHD-----QQKQVRKNKEAPWMGCSSCAVNWLRENEAL 169
           L+Q V+ L ++K+ L +++  +         +KQ R +     +  S+ AV+WL + E +
Sbjct: 96  LQQQVKGLIQKKEELLSRLRRLQQQGDPIYNEKQSR-SAALSSLSASAFAVSWLNDREVV 155

Query: 170 LQLTSNDTFKLKFSQILRSLDEDELLVKTA 195
           LQ++S    K   SQIL  L+ED LL+  A
Sbjct: 156 LQISSYVVQKSPLSQILVDLEEDGLLLLNA 183

BLAST of Cp4.1LG19g01230.1 vs. NCBI nr
Match: gi|985472733|ref|XP_006494391.2| (PREDICTED: transcription factor ORG2 [Citrus sinensis])

HSP 1 Score: 108.2 bits (269), Expect = 1.7e-20
Identity = 68/142 (47.89%), Postives = 94/142 (66.20%), Query Frame = 1

Query: 53  DPLAVAKKLKHNASERDRRKKMNSLYTSLRCLLPSTNRTKTMSNPATISRALKYIPELRQ 112
           DP  V KKL HNASERDRRKK+NSLY+SLR LLP  ++TK +S PAT+SR LKYIPEL+Q
Sbjct: 60  DPTMV-KKLYHNASERDRRKKINSLYSSLRSLLPVADQTKKLSIPATVSRVLKYIPELQQ 119

Query: 113 HVEELRRRKQGLETKINAMTHDQQKQVRKNKEAPWMGCSSCAVNWLRENEALLQLTSNDT 172
            VE L ++K+ L +KI+       +Q  + K A     +S + + L + E L+Q++S   
Sbjct: 120 QVERLMQKKEELLSKISKQGEISHQQ-HQRKIAIGSSLASISASRLSDMEILIQISSYKV 179

Query: 173 FKLKFSQILRSLDEDELLVKTA 195
            K   S+IL +L+ED L++  A
Sbjct: 180 HKCPLSKILFNLEEDGLVLVNA 199

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BH100_ARATH3.9e-2041.84Transcription factor bHLH100 OS=Arabidopsis thaliana GN=BHLH100 PE=2 SV=1[more]
ORG3_ARATH8.7e-2039.29Transcription factor ORG3 OS=Arabidopsis thaliana GN=ORG3 PE=1 SV=1[more]
BH101_ARATH1.6e-1838.93Transcription factor bHLH101 OS=Arabidopsis thaliana GN=BHLH101 PE=2 SV=1[more]
ORG2_ARATH1.1e-1733.71Transcription factor ORG2 OS=Arabidopsis thaliana GN=ORG2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KYH3_CUCSA1.8e-2455.40Uncharacterized protein OS=Cucumis sativus GN=Csa_4G434980 PE=4 SV=1[more]
M5WCF6_PRUPE7.1e-2146.00Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019552mg PE=4 S... [more]
A0A067DMC3_CITSI2.1e-2047.89Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g025477mg PE=4 SV=1[more]
B9IHR7_POPTR2.1e-2039.34Basic helix-loop-helix family protein OS=Populus trichocarpa GN=POPTR_0016s03680... [more]
B9SZX1_RICCO4.6e-2043.36DNA binding protein, putative OS=Ricinus communis GN=RCOM_0071670 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G41240.12.2e-2141.84 basic helix-loop-helix protein 100[more]
AT3G56980.14.9e-2139.29 basic helix-loop-helix (bHLH) DNA-binding superfamily protein[more]
AT5G04150.19.2e-2038.93 basic helix-loop-helix (bHLH) DNA-binding superfamily protein[more]
AT3G56970.16.0e-1933.71 basic helix-loop-helix (bHLH) DNA-binding superfamily protein[more]
AT1G71200.15.8e-0644.29 basic helix-loop-helix (bHLH) DNA-binding superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778697748|ref|XP_004144366.2|7.1e-4655.76PREDICTED: transcription factor bHLH101-like [Cucumis sativus][more]
gi|659111343|ref|XP_008455701.1|6.6e-4454.38PREDICTED: transcription factor bHLH101-like [Cucumis melo][more]
gi|700199567|gb|KGN54725.1|2.6e-2455.40hypothetical protein Csa_4G434980 [Cucumis sativus][more]
gi|595811539|ref|XP_007203256.1|1.0e-2046.00hypothetical protein PRUPE_ppa019552mg, partial [Prunus persica][more]
gi|985472733|ref|XP_006494391.2|1.7e-2047.89PREDICTED: transcription factor ORG2 [Citrus sinensis][more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
GO:0046983protein dimerization activity
Vocabulary: INTERPRO
TermDefinition
IPR015660MASH1/Ascl1a-like
IPR011598bHLH_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0046983 protein dimerization activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG19g01230Cp4.1LG19g01230gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG19g01230.1:cds:004Cp4.1LG19g01230.1:cds:004CDS
Cp4.1LG19g01230.1:cds:003Cp4.1LG19g01230.1:cds:003CDS
Cp4.1LG19g01230.1:cds:002Cp4.1LG19g01230.1:cds:002CDS
Cp4.1LG19g01230.1:cds:001Cp4.1LG19g01230.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG19g01230.1Cp4.1LG19g01230.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainGENE3DG3DSA:4.10.280.10coord: 59..131
score: 1.0
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainPFAMPF00010HLHcoord: 59..110
score: 2.9
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainSMARTSM00353finuluscoord: 64..116
score: 1.
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainPROFILEPS50888BHLHcoord: 58..110
score: 1
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainunknownSSF47459HLH, helix-loop-helix DNA-binding domaincoord: 59..131
score: 1.26
IPR015660Achaete-scute transcription factor-relatedPANTHERPTHR13935ACHAETE-SCUTE TRANSCRIPTION FACTOR-RELATEDcoord: 53..144
score: 1.6
NoneNo IPR availableunknownCoilCoilcoord: 107..134
scor
NoneNo IPR availablePANTHERPTHR13935:SF41ACHAETE-SCUTE HOMOLOG 3coord: 53..144
score: 1.6