Cp4.1LG04g11370 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g11370
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: PUR alpha-1 family protein
Location: Cp4.1LG04 : 8926106 .. 8930850 (+)
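
The location field packs a sequence identifier, a start and end coordinate, and a strand into one string. The following is a minimal sketch of splitting it apart, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both endpoints are included in the feature.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG04 : 8926106 .. 8930850 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene span is 4745 bp; with half-open coordinates it would be one base shorter.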
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTTCTTCTTCCTGATTCAGGCGGGTCGAGATTGAGAATCACCCGTTCATGGGATGCTGCCAATGGCGACTGCAATCAGAAAGCCCAGCCTTTGACCGAAGTTGACTACGAAGATTGTCAACATTACGATACAGTGGCAACACAGGTAAACGACTCTGTGAAAATCTTCTTGGAAAATTGTTTCTTCATCATCTTCTTTCATTGTCGTTTGATTTCCAACTGTTCGAAGCAAAAAAATCCCATCACAGATCGTCGATGTGATCCTTTTCCTCAACCCTAATTTTGCAGAACCCTTTTGCCGAAATCATTCATTTTCTCTCTCTATCTCTCACGAACCTTTCTTCTTAGATCTGCCATTTCTAGGTTTCCGCTTCTTTTCTGGTATCGTGTTGGTTTTGAAGTCATTTGGCGGGTTATTTTTGCGTCGATTTCTGTTCAAATCGGGGTTCGTTCGCTTCAATTTCTGTGGGTTTTTGTTTGTTTGTTATTGTTTTTTGTAAAACCAATTGACCTAGGAGGAGATATAAGAGATTCCGGTAATTTGAATCGAAATTATGGTGTAAATTTCGATAATTACCCGCAGATATTTCCATTTTTTAGATGAGTCTCTTTATCTATATTTGTTGTATTTTACACAGTTGCAGACGCTGAAAGGAATTATTAGTAGAATCTGAGACAGAGCTGGAAGTGGGAGATGGAGGGGAATTCCGGCGGAGGTGGAGTGAGTATTGGGGGAACGACGGCGGGCAGCGTGGCGGTGGGTGGAGTGACGGGTGGCGGCGGCGGAGGAGGAAACGACGTGGAGTTGATGTGCAAAACATTGCAGGTGGAGCACAAATTGTTCTACTTCGATCTGAAGGAGAATCCCAGAGGCCGATATCTGAAGATTTCAGAGAAGACATCGGCTACAAGGTCAACGATCATTGTTCCCTTCTCAGGGATTCCATGGTTCTTAGATCTCTTCAATTACTATATCAATTCTGATGATCCGGAGGTTTTTAGCAAGGAATTGCAGCTCGACACCAAGGCACCTTTGTTCCTCTATCTCTTTTCTCTGTTCATCGTCTTCCTCCACTTTTACTTTTTAGTTCATCTCTGTGATCTCCTGTTCAACTTTTTGCTAATTGTACTCATCCTGCCTTTCAGGTGTTCTACTTTGACATTGGTGAAAATCGGAGGGGTCGATTCTTGAAGGTAAATTCCGTTTCCTGCTACCATATCTGTTGTTTTTCTTCTCGAGATTCCAACTTTCTTTAATTTCCCATTTTTGTTTTGTTCTTGAAAGTACTCTCTCTAGCTTTTCTGTTTGATTCTGTCAAATTAGGCCTTGTTGATCTAGAATTTTGCTTAAAATTTGCAACAGTTCTTCAGAAAAAGCTGACAGATCCACATTCTTCGAATGATTCTGTACTTTTCAAATTTTTCTTTGATTGAAGTTCTACTTCAACCCAGTTCGTTTCAGTTTACAGATTCTTTATTGATAGCTTCTGATGCTCAAGAACAATATGAACTCTTATCAAAATGGAAGAATGAAATCAATTGAAATATGGGTTTGTAAATGCAGTCATATCCTTGATCTGTGTCCTTTTTAGGAAAAGATAAAAGAAACAAATTCTGTCTTTATGTTCTTGATGACCCTTTATAGCCTTCCATTTGACAATACTTCCAAATGCGTTTTTCCAAATATGAGCAGTTGCCTAAAATTATGCTCTTATTAATGGAGATTCCCGAACTGTTTGATTTGACCTTTTTTTTGTTCTTTCTTTCTTTCATCTTCCTTCTTTTTAAAGTATTTTTCATTCACCTGTTTCATTGTCAATAGTAATGACAAACCAATGGCTACTTGCAAAATATTCAAGTTTATAAAAGTCTAGGAAACCAAAAGCATTAATATGGCACATTACTATTCAAATCATTAGCATGAACATGTGAAAGACATGGGATATTATAGTATATAATGATTTTCGTAGT
TTTACTAGATTTCGATTATGGCAATGTCTAATTTTTCTCATTTTTATAGTTTCTTGTACATGTGTGTTTTCTCATCTTGATGTCGGATGCATAAAAGGTTACAGATGTGTTCATATTATGCATCATATGCTTCCACTAAAGCCAACTCTGTGCCATGAATTACATCATAAATTATTCATACAGTCATGTGTGCCAATTTGATTTAATTTCGTTTTGAACCGTGGCTCGTAAATCGTGCCTACACCTTGGTTTACTTAGTTTCAGAGAGAATGATGCACATCAGCCATTAGGGTAGTCTGGAGTGAAGTGGCTAGACACTGACACAAAAGACATGGCTGAACTTGTTGGCTTTAGTTATCAAAAACATTTTCCAATTCTGAGATCAAACCTTGAAACTGGATGTGTTGCTTAAGTTAGGTCAAGACTTGTATTTAGTGTTGGAAGTGTTTATCCGTTTTTATTTTTATTTTTTTATAAGAGACGATTTCATTGATGATTGAAATTTACAAAAGAGATGTATTATCCATGGTGTTTACAAAAGACCTTTCCAATTTACACCGAGGGAGGTATAACTGTAGGATATAGCTTGGTAAACAACATTGTCGAAAATTTTTGTGTAGGTCTGTGTCTTTTCTGCAAATATTCTCTGAGTGTTTTCCGTCACCAAGAAGTGTTTTCCGTCATCTCAAAACAGAAACATGTTTCTGCTCAAGAAAAATGCATGTTCTAGCTGTTCTCAGTGTGCTCGTGATAGATATTCTCCAGGAATCTATAAGGAGAGAAGAAAGTGCTGATGTAGAATAGCTCATACCCTTCACGTCTGCAGTAACCAGCCTTTAAAAGTTCTATCTTTTCTATCTCCGTCTTTGGCCATCCATTACCACCTTCTCCATTATCAAGGAGTTGGACCATAATCCCCACAGGCTGCCATTGTGCGGCATCAGATTACAACCACCTCAGGCTCCATAGCCCCTTCATACCAGACAGTTCAGCAACCCCTCTTTTGATCCCTTTGGAGTGTCATCTTCACGTGTCAGTTCACGTAAGTCGAATTTCCAGTTCTCTTATTCCCAGTCTCTCATGCTCTGGCCCTTGTAATATTGCTTGTTGATTCTACAATAAGCTTCCACGTGCGCTACAGATCTCCAATACTCCCACGTTTCTACTTTAGGGTATACTTTTATTTGAAGTTTTTTTAAACATAGACGTGAATATGTCGTGTAGTTTTACGAGCGTAAACGAACTTTCTATATCTTATAATATCAGTCTGGATCTGAACTTTGGCTGATTTCAGCTCTCATTTAGTCTTATTTTGATCATTTTCGGTTCCGTTACAAATTTGAATTAATGCTGAATAATTCAAATTTTGGGATTAGAGGGTTGATAGTTTGACCTCTTCAGCTATAGGCACTATTGAGTTCACTGTTGCCCTCTTCTGGATTGTGTGGCTTAAGATTAGGATTTTGGGGAATTGGTGCACTCATCTACTTCATTAATTCTTTTTTCTTTGATAAATATTGATGTTTTGTTGGAATTCTTCTCCCTTTTTTTTTTTGGTGCCTCAAAGAAAAGAAAAAAAGATTTTGAAGATCTGGAATGTGTGGTTGTAAATAGAGATGGTTTTCGTTGTGTTAACTGGGCAGGTATCTGAAGCTTCTGTCAGTAGAAACCGCAGCACCATTATCGTTCCTGCTGGAAGCAATCGGGACGAGGGATGGACTGCGTTCAGAAACATTTTAGCAGAGATCAATGAAGCGTCTAGGCTTTTCATACTGCCCAATCAGGTTTCTGCTGCTCGTGGTCGTGGCGAGCTTCTCATCATGCCTTTTCACGGGATTACTTCCATTGTTTCTCTTCTAACACAATTTCATAATGTTTTCTTTTGCCCAAACTAGGAAAATTCCGAACATTCAGAACGTCTTGTCGGACTTTCGGATGATGTAGGTGCTGGCTTCATATCAGGTCATAGTAGTCAACCTGGTCCAACCTCTGACTTGAAT
GTAGACAGACAAGTAGACTTATCAGCTCAAGATGAAATGGGCAATCTGGGGGTATCAAAAGTTATCCGAGCTGATCAGAAGCGATTCTTTTTCGATCTTGGAAGTAACAATCGGGGTCATTTTTTGAGGATATCTGAGGTAATTATACAAAAAAACTGAGCTTAAGATCCTTATTTTGATGCTTATTTGATCTGGTTTTGTCTGTCTTGAACCCCATGGTGCCTTTCTTCAGTGTCTTGATATGAAACTCGTCATATCCATATCGGTTGCAGGTTGCAGGGGCAGATCGTTCTTCGATCATTCTCCCATTGTCGGGGCTAAAGCAGTTCTATGAAATAGTTGGACATTTCGTGGAGATCACCAAAGATAGGATTGAAGGAATGACGGGCGTGAATGTTAGGACGGTGGATCCGCCTCATAGATGAACACTATAGGCCCGGGGGGTTTGTGGTGAATAATTTCGGTTAATGTTTGTGTAGATGATGTTCGAGTAGGGTTGAAATTGTGGTGGGTGTGGTTTGATATTGTGTTGTCGTCGTCGTCTTGATTTTCTAGTTCTAAGTCTGAATAATCCATGCTGCTTGATTAGCAGAGGCTATGGAAAATGCTTGAGAAGCTTAATTATTCTTCTTCCATGAATAACACAAAAGAGGGGGGAAAGTAAGAAAGAAAGAGATGCTTGTAATTGTCACTTTATAATAATGGTGAGTTCAGGTGTGTTTTGTTCATTTCACTTCCAATTACT

mRNA sequence

TTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTTCTTCTTCCTGATTCAGGCGGGTCGAGATTGAGAATCACCCGTTCATGGGATGCTGCCAATGGCGACTGCAATCAGAAAGCCCAGCCTTTGACCGAAGTTGACTACGAAGATTGTCAACATTACGATACAGTGGCAACACAGAGCTGGAAGTGGGAGATGGAGGGGAATTCCGGCGGAGGTGGAGTGAGTATTGGGGGAACGACGGCGGGCAGCGTGGCGGTGGGTGGAGTGACGGGTGGCGGCGGCGGAGGAGGAAACGACGTGGAGTTGATGTGCAAAACATTGCAGGTGGAGCACAAATTGTTCTACTTCGATCTGAAGGAGAATCCCAGAGGCCGATATCTGAAGATTTCAGAGAAGACATCGGCTACAAGGTCAACGATCATTGTTCCCTTCTCAGGGATTCCATGGTTCTTAGATCTCTTCAATTACTATATCAATTCTGATGATCCGGAGGTGTTCTACTTTGACATTGGTGAAAATCGGAGGGGTCGATTCTTGAAGGTATCTGAAGCTTCTGTCAGTAGAAACCGCAGCACCATTATCGTTCCTGCTGGAAGCAATCGGGACGAGGGATGGACTGCGTTCAGAAACATTTTAGCAGAGATCAATGAAGCGTCTAGGCTTTTCATACTGCCCAATCAGGAAAATTCCGAACATTCAGAACGTCTTGTCGGACTTTCGGATGATGTAGGTGCTGGCTTCATATCAGGTCATAGTAGTCAACCTGGTCCAACCTCTGACTTGAATGTAGACAGACAAGTAGACTTATCAGCTCAAGATGAAATGGGCAATCTGGGGGTATCAAAAGTTATCCGAGCTGATCAGAAGCGATTCTTTTTCGATCTTGGAAGTAACAATCGGGGTCATTTTTTGAGGATATCTGAGGTTGCAGGGGCAGATCGTTCTTCGATCATTCTCCCATTGTCGGGGCTAAAGCAGTTCTATGAAATAGTTGGACATTTCGTGGAGATCACCAAAGATAGGATTGAAGGAATGACGGGCGTGAATGTTAGGACGGTGGATCCGCCTCATAGATGAACACTATAGGCCCGGGGGGTTTGTGGTGAATAATTTCGGTTAATGTTTGTGTAGATGATGTTCGAGTAGGGTTGAAATTGTGGTGGGTGTGGTTTGATATTGTGTTGTCGTCGTCGTCTTGATTTTCTAGTTCTAAGTCTGAATAATCCATGCTGCTTGATTAGCAGAGGCTATGGAAAATGCTTGAGAAGCTTAATTATTCTTCTTCCATGAATAACACAAAAGAGGGGGGAAAGTAAGAAAGAAAGAGATGCTTGTAATTGTCACTTTATAATAATGGTGAGTTCAGGTGTGTTTTGTTCATTTCACTTCCAATTACT

Coding sequence (CDS)

TTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTTCTTCTTCCTGATTCAGGCGGGTCGAGATTGAGAATCACCCGTTCATGGGATGCTGCCAATGGCGACTGCAATCAGAAAGCCCAGCCTTTGACCGAAGTTGACTACGAAGATTGTCAACATTACGATACAGTGGCAACACAGAGCTGGAAGTGGGAGATGGAGGGGAATTCCGGCGGAGGTGGAGTGAGTATTGGGGGAACGACGGCGGGCAGCGTGGCGGTGGGTGGAGTGACGGGTGGCGGCGGCGGAGGAGGAAACGACGTGGAGTTGATGTGCAAAACATTGCAGGTGGAGCACAAATTGTTCTACTTCGATCTGAAGGAGAATCCCAGAGGCCGATATCTGAAGATTTCAGAGAAGACATCGGCTACAAGGTCAACGATCATTGTTCCCTTCTCAGGGATTCCATGGTTCTTAGATCTCTTCAATTACTATATCAATTCTGATGATCCGGAGGTGTTCTACTTTGACATTGGTGAAAATCGGAGGGGTCGATTCTTGAAGGTATCTGAAGCTTCTGTCAGTAGAAACCGCAGCACCATTATCGTTCCTGCTGGAAGCAATCGGGACGAGGGATGGACTGCGTTCAGAAACATTTTAGCAGAGATCAATGAAGCGTCTAGGCTTTTCATACTGCCCAATCAGGAAAATTCCGAACATTCAGAACGTCTTGTCGGACTTTCGGATGATGTAGGTGCTGGCTTCATATCAGGTCATAGTAGTCAACCTGGTCCAACCTCTGACTTGAATGTAGACAGACAAGTAGACTTATCAGCTCAAGATGAAATGGGCAATCTGGGGGTATCAAAAGTTATCCGAGCTGATCAGAAGCGATTCTTTTTCGATCTTGGAAGTAACAATCGGGGTCATTTTTTGAGGATATCTGAGGTTGCAGGGGCAGATCGTTCTTCGATCATTCTCCCATTGTCGGGGCTAAAGCAGTTCTATGAAATAGTTGGACATTTCGTGGAGATCACCAAAGATAGGATTGAAGGAATGACGGGCGTGAATGTTAGGACGGTGGATCCGCCTCATAGATGA

Protein sequence

FFFFFFFFFWVLLPDSGGSRLRITRSWDAANGDCNQKAQPLTEVDYEDCQHYDTVATQSWKWEMEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHSERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTVDPPHR
BLAST of Cp4.1LG04g11370 vs. Swiss-Prot
Match: PUR_ARATH (Transcription factor Pur-alpha 1 OS=Arabidopsis thaliana GN=PURA1 PE=1 SV=2)

HSP 1 Score: 432.2 bits (1110), Expect = 5.6e-120
Identity = 231/306 (75.49%), Positives = 253/306 (82.68%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           ME NSGGGG + GG      AV G  GGGGGGG+DVEL+ KTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEANSGGGGGAEGGR-----AVTG--GGGGGGGSDVELVSKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSD-----------DPEVFYFDIGEN 183
           RYLKISEKTSATRSTIIVP SGI WFLDLFNYY+NS+           D +VFYFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSEEHELFSKELQLDSKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQ-ENSEH 243
           RRGRFLKVSEASVSRNRSTIIVPAGS+ DEGW AFRNILAEI+EAS LF++PNQ + S+ 
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSSPDEGWAAFRNILAEIHEASGLFVMPNQVKPSDG 180

Query: 244 SERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFF 303
            E LV   DDVGAGFI GH SQ   +S+ NVDR +D   Q+E G  GVSKVIRADQKRFF
Sbjct: 181 QEHLV---DDVGAGFIPGHGSQQPSSSEHNVDRTIDSPGQEETGMTGVSKVIRADQKRFF 240

Query: 304 FDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRT 358
           FDLG+NNRGHFLRISEVAG+DRSSIILPLSGLKQF+E++GHFVEITKD+IEGMTG NVRT
Sbjct: 241 FDLGNNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEVIGHFVEITKDKIEGMTGANVRT 296

BLAST of Cp4.1LG04g11370 vs. Swiss-Prot
Match: PURB_MOUSE (Transcriptional activator protein Pur-beta OS=Mus musculus GN=Purb PE=1 SV=3)

HSP 1 Score: 83.6 bits (205), Expect = 4.9e-15
Identity = 85/300 (28.33%), Positives = 119/300 (39.67%), Query Frame = 1

Query: 66  GNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRGRY 125
           G  GGGG   GG        GG  GG GG     EL  K L +++K FY D+K+N +GR+
Sbjct: 11  GGGGGGGGGPGGFQPAPRGGGGGGGGPGGEQETQELASKRLDIQNKRFYLDVKQNAKGRF 70

Query: 126 LKISE-KTSATRSTIIVPFSGIPWFLDLFNYYI------NSDDPE--------------- 185
           LKI+E     ++S + +  +    F D    +I          PE               
Sbjct: 71  LKIAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRA 130

Query: 186 -----------VFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILA 245
                       +Y D+ EN+RGRFL++ + +V+R         G     G         
Sbjct: 131 LKSEFLVRENRKYYLDLKENQRGRFLRIRQ-TVNRGGGGF----GGGPGPGGL------- 190

Query: 246 EINEASRLFILPNQENSEHSERLVGLSDDVG------AGFISGHSSQPGPTSDLNVDRQV 305
              ++ +   LP Q   E  + L  L DD G      AG   G +  PG           
Sbjct: 191 ---QSGQTIALPAQGLIEFRDALAKLIDDYGGDEDELAGGPGGGAGGPG----------- 250

Query: 306 DLSAQDEMGNLGVSKVIRADQKRFFFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQF 327
                   G L     I  D KRFFFD+G N  G FLR+SEV  + R++I +P     +F
Sbjct: 251 ----GGLYGELPEGTSITVDSKRFFFDVGCNKYGVFLRVSEVKPSYRNAITVPFKAWGKF 280

BLAST of Cp4.1LG04g11370 vs. Swiss-Prot
Match: PURB_DANRE (Transcriptional activator protein Pur-beta OS=Danio rerio GN=purb PE=2 SV=3)

HSP 1 Score: 82.8 bits (203), Expect = 8.3e-15
Identity = 82/295 (27.80%), Positives = 114/295 (38.64%), Query Frame = 1

Query: 86  GGVTGGGGGG-------GNDVELMCKTLQVEHKLFYFDLKENPRGRYLKISE-KTSATRS 145
           G   GG  GG           EL  K L +++K FY D+K+N +GR++KI+E     ++S
Sbjct: 7   GSERGGSSGGLQHFQREQETQELASKRLDIQNKRFYLDVKQNAKGRFIKIAEVGAGGSKS 66

Query: 146 TIIVPFSGIPWFLDLFNYYI------NSDDPE---------------------------V 205
            + +  S    F D    +I          PE                            
Sbjct: 67  RLTLSMSVAAEFRDYLGDFIEHYAQLGPSSPEQIAQSSGGDDGGPRRALKSEFLVRENRK 126

Query: 206 FYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILP 265
           +Y D+ EN+RGRFL++ +   + NR       G     G            +A +   LP
Sbjct: 127 YYLDLKENQRGRFLRIRQ---TVNRGPGFGVGGGGGPGGGV----------QAGQTIALP 186

Query: 266 NQENSEHSERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIR 325
            Q   E  + L  L DD G G     S  PG              A    G L     I 
Sbjct: 187 AQGLIEFRDALAKLIDDYG-GEDEELSGGPG--------------AAGGYGELPEGTSIM 246

Query: 326 ADQKRFFFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKD 340
            D KRFFFD+GSN  G FLR+SEV  + R+SI +P     +F      + E  K+
Sbjct: 247 VDSKRFFFDVGSNKYGVFLRVSEVKPSYRNSITIPFKAWGKFGGAFSRYAEEMKE 273

BLAST of Cp4.1LG04g11370 vs. Swiss-Prot
Match: PURA_MOUSE (Transcriptional activator protein Pur-alpha OS=Mus musculus GN=Pura PE=1 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.2e-13
Identity = 84/300 (28.00%), Positives = 120/300 (40.00%), Query Frame = 1

Query: 63  EMEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGND-------------VELMCKTLQVE 122
           E  G + G G S+G   +GS + GG  GGGGGGG+               EL  K + ++
Sbjct: 9   EQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGGAPGGLQHETQELASKRVDIQ 68

Query: 123 HKLFYFDLKENPRGRYLKISE-KTSATRSTIIVPFSGIPWFLDLFNYYIN-------SDD 182
           +K FY D+K+N +GR+LKI+E      +S + +  S    F D    +I        S  
Sbjct: 69  NKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAVEFRDYLGDFIEHYAQLGPSQP 128

Query: 183 PEVF----------------------YFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSN 242
           P++                       Y D+ EN+RGRFL++        R T+    G  
Sbjct: 129 PDLAQAQDEPRRALKSEFLVRENRKYYMDLKENQRGRFLRI--------RQTVNRGPGLG 188

Query: 243 RDEGWTAFRNILAEINEASRLFILPNQENSEHSERLVGLSDDVGAGFISGHSSQPGPTSD 302
             +G T                 LP Q   E  + L  L DD G   +    ++    + 
Sbjct: 189 STQGQT---------------IALPAQGLIEFRDALAKLIDDYG---VEEEPAELPEGTS 248

Query: 303 LNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFFDLGSNNRGHFLRISEVAGADRSSIILP 320
           L VD                        KRFFFD+GSN  G F+R+SEV    R+SI +P
Sbjct: 249 LTVDN-----------------------KRFFFDVGSNKYGVFMRVSEVKPTYRNSITVP 259

BLAST of Cp4.1LG04g11370 vs. Swiss-Prot
Match: PURA_HUMAN (Transcriptional activator protein Pur-alpha OS=Homo sapiens GN=PURA PE=1 SV=2)

HSP 1 Score: 78.6 bits (192), Expect = 1.6e-13
Identity = 84/301 (27.91%), Positives = 120/301 (39.87%), Query Frame = 1

Query: 63  EMEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGND--------------VELMCKTLQV 122
           E  G + G G S+G   +GS + GG  GGGGGGG+                EL  K + +
Sbjct: 9   EQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGGGAPGGLQHETQELASKRVDI 68

Query: 123 EHKLFYFDLKENPRGRYLKISE-KTSATRSTIIVPFSGIPWFLDLFNYYIN-------SD 182
           ++K FY D+K+N +GR+LKI+E      +S + +  S    F D    +I        S 
Sbjct: 69  QNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAVEFRDYLGDFIEHYAQLGPSQ 128

Query: 183 DPEVF----------------------YFDIGENRRGRFLKVSEASVSRNRSTIIVPAGS 242
            P++                       Y D+ EN+RGRFL++        R T+    G 
Sbjct: 129 PPDLAQAQDEPRRALKSEFLVRENRKYYMDLKENQRGRFLRI--------RQTVNRGPGL 188

Query: 243 NRDEGWTAFRNILAEINEASRLFILPNQENSEHSERLVGLSDDVGAGFISGHSSQPGPTS 302
              +G T                 LP Q   E  + L  L DD G   +    ++    +
Sbjct: 189 GSTQGQT---------------IALPAQGLIEFRDALAKLIDDYG---VEEEPAELPEGT 248

Query: 303 DLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFFDLGSNNRGHFLRISEVAGADRSSIIL 320
            L VD                        KRFFFD+GSN  G F+R+SEV    R+SI +
Sbjct: 249 SLTVDN-----------------------KRFFFDVGSNKYGVFMRVSEVKPTYRNSITV 260

BLAST of Cp4.1LG04g11370 vs. TrEMBL
Match: A0A0A0LZP1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G598870 PE=4 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 7.1e-154
Identity = 284/305 (93.11%), Positives = 285/305 (93.44%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           MEGNSGGGGV IGGTTAG VA GG  G GGGGGNDVELMCKTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEGNSGGGGVGIGGTTAGGVAAGG--GAGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF-----------YFDIGEN 183
           RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF           YFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFSKELQLDTKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHS 243
           RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGW+AFRNILAEINEASRLFILPNQENSEHS
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWSAFRNILAEINEASRLFILPNQENSEHS 180

Query: 244 ERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 303
           ERL GLSDDVGAGFISGHSSQ GPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF
Sbjct: 181 ERLAGLSDDVGAGFISGHSSQSGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 240

Query: 304 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 358
           DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV
Sbjct: 241 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 300

BLAST of Cp4.1LG04g11370 vs. TrEMBL
Match: W9R7X0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_016433 PE=4 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 3.4e-140
Identity = 257/300 (85.67%), Positives = 274/300 (91.33%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAV------GGVTGGGGGGGNDVELMCKTLQVEHKLFYFDL 123
           MEGNSGGGG   GG+  G+VA       GG  GGGGGGGNDVEL+CKTLQVEHKLFYFDL
Sbjct: 1   MEGNSGGGGG--GGSGGGTVAAAAGGGGGGGGGGGGGGGNDVELLCKTLQVEHKLFYFDL 60

Query: 124 KENPRGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFYFDIGENRRGRF 183
           KENPRGRYLKISEKTSATRSTIIVPFSGI WFLDLFNYY+NSDD +VFYFD+GENRRGRF
Sbjct: 61  KENPRGRYLKISEKTSATRSTIIVPFSGISWFLDLFNYYVNSDDQDVFYFDVGENRRGRF 120

Query: 184 LKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHSERLVG 243
           LKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLF+LPNQ++SE SERLVG
Sbjct: 121 LKVSEASVSRNRSTIIVPAGSTRDEGWAAFRNILAEINEASRLFMLPNQQSSEPSERLVG 180

Query: 244 LSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFFDLGSN 303
           LSDDVGAGFISGHSSQP  TS+LN+DR V+  AQDE+GN+GVSKVIRADQKRFFFDLGSN
Sbjct: 181 LSDDVGAGFISGHSSQPATTSELNIDRSVEFPAQDEIGNMGVSKVIRADQKRFFFDLGSN 240

Query: 304 NRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTVDPPHR 358
           NRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITKDRIEGMTG NVRTV+PP R
Sbjct: 241 NRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGANVRTVEPPQR 298

BLAST of Cp4.1LG04g11370 vs. TrEMBL
Match: A0A061GR22_THECC (Transcription factor Pur-alpha 1 OS=Theobroma cacao GN=TCM_039973 PE=4 SV=1)

HSP 1 Score: 496.1 bits (1276), Expect = 3.5e-137
Identity = 260/308 (84.42%), Positives = 271/308 (87.99%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGG---GNDVELMCKTLQVEHKLFYFDLKEN 123
           MEGNSGGGG   GG + G+   GG  GGGGGG   GNDVEL+CKTLQVEHKLFYFDLKEN
Sbjct: 1   MEGNSGGGG---GGGSGGADRGGGGGGGGGGGERGGNDVELVCKTLQVEHKLFYFDLKEN 60

Query: 124 PRGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPE-----------VFYFDI 183
           PRGRYLKISEKTSATRSTIIVP SGI WFLDLFNYY+NSDD +           VFYFDI
Sbjct: 61  PRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHDLFSKELQLDTKVFYFDI 120

Query: 184 GENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENS 243
           GENRRGRFLKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLFILPNQ+ S
Sbjct: 121 GENRRGRFLKVSEASVSRNRSTIIVPAGSTRDEGWAAFRNILAEINEASRLFILPNQQTS 180

Query: 244 EHSERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKR 303
           E SERLVGLSDDVGAGFISGHSSQP  TS+LNVDR VDL AQDE+GN+GVSKVIRADQKR
Sbjct: 181 EPSERLVGLSDDVGAGFISGHSSQPASTSELNVDRSVDLPAQDEIGNMGVSKVIRADQKR 240

Query: 304 FFFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNV 358
           FFFDLGSNNRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITKDRIEGMTG NV
Sbjct: 241 FFFDLGSNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGANV 300

BLAST of Cp4.1LG04g11370 vs. TrEMBL
Match: A0A067KEG7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16210 PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 4.6e-137
Identity = 258/306 (84.31%), Positives = 273/306 (89.22%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           MEGNSGGGG   GG  AG+    GV+GGGGG GNDVELMCKTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEGNSGGGGG--GGVAAGAT---GVSGGGGGAGNDVELMCKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPE-----------VFYFDIGEN 183
           RYLKISEKTSATRSTIIVPFSGI WFLDLFNYY+NSDD +           VFYFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPFSGISWFLDLFNYYVNSDDQDLFSKELQLDTKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQE-NSEH 243
           RRGRFLKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLF+LPNQ+ +SE 
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSARDEGWAAFRNILAEINEASRLFMLPNQQQSSES 180

Query: 244 SERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFF 303
           SERLVGLSDDVGAGFISGHSSQP P S+LNVDR V+L+ Q+E+GNLGVSKVIRADQKRFF
Sbjct: 181 SERLVGLSDDVGAGFISGHSSQPAPASELNVDRSVELAPQEEIGNLGVSKVIRADQKRFF 240

Query: 304 FDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRT 358
           FDLGSNNRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITK+RIEGMTG NVRT
Sbjct: 241 FDLGSNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKERIEGMTGANVRT 300

BLAST of Cp4.1LG04g11370 vs. TrEMBL
Match: V4VZ56_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10016090mg PE=4 SV=1)

HSP 1 Score: 495.4 bits (1274), Expect = 6.0e-137
Identity = 257/307 (83.71%), Positives = 268/307 (87.30%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGG--GGGGNDVELMCKTLQVEHKLFYFDLKENP 123
           MEGNSGGGG   GG+       GGVTGGG  GGGGNDVEL+CKTLQVEHKLFYFDLKENP
Sbjct: 1   MEGNSGGGGAGAGGS-------GGVTGGGAGGGGGNDVELVCKTLQVEHKLFYFDLKENP 60

Query: 124 RGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPE-----------VFYFDIG 183
           RGRYLKISEKTSATRSTIIVP SGI WFLDLFNYY+NSDD E           VFYFDIG
Sbjct: 61  RGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIG 120

Query: 184 ENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSE 243
           ENRRGRFLKVSEASVSRNRSTIIVPAGS+RDEGW AFRNILAEINEASRL ILPNQ+ SE
Sbjct: 121 ENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSE 180

Query: 244 HSERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRF 303
            SE LVGLSDDVGAGFISGH SQP P S+LNVDR VDL AQDE+GN+GVSKVIRADQKRF
Sbjct: 181 QSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRF 240

Query: 304 FFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVR 358
           FFDLGSNNRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITKDRIEGMTG +VR
Sbjct: 241 FFDLGSNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGASVR 300

BLAST of Cp4.1LG04g11370 vs. TAIR10
Match: AT2G32080.1 (AT2G32080.1 purin-rich alpha 1)

HSP 1 Score: 432.2 bits (1110), Expect = 3.2e-121
Identity = 231/306 (75.49%), Positives = 253/306 (82.68%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           ME NSGGGG + GG      AV G  GGGGGGG+DVEL+ KTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEANSGGGGGAEGGR-----AVTG--GGGGGGGSDVELVSKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSD-----------DPEVFYFDIGEN 183
           RYLKISEKTSATRSTIIVP SGI WFLDLFNYY+NS+           D +VFYFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSEEHELFSKELQLDSKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQ-ENSEH 243
           RRGRFLKVSEASVSRNRSTIIVPAGS+ DEGW AFRNILAEI+EAS LF++PNQ + S+ 
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSSPDEGWAAFRNILAEIHEASGLFVMPNQVKPSDG 180

Query: 244 SERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFF 303
            E LV   DDVGAGFI GH SQ   +S+ NVDR +D   Q+E G  GVSKVIRADQKRFF
Sbjct: 181 QEHLV---DDVGAGFIPGHGSQQPSSSEHNVDRTIDSPGQEETGMTGVSKVIRADQKRFF 240

Query: 304 FDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRT 358
           FDLG+NNRGHFLRISEVAG+DRSSIILPLSGLKQF+E++GHFVEITKD+IEGMTG NVRT
Sbjct: 241 FDLGNNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEVIGHFVEITKDKIEGMTGANVRT 296

BLAST of Cp4.1LG04g11370 vs. NCBI nr
Match: gi|778663268|ref|XP_011660046.1| (PREDICTED: transcription factor Pur-alpha 1 [Cucumis sativus])

HSP 1 Score: 551.6 bits (1420), Expect = 1.0e-153
Identity = 284/305 (93.11%), Positives = 285/305 (93.44%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           MEGNSGGGGV IGGTTAG VA GG  G GGGGGNDVELMCKTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEGNSGGGGVGIGGTTAGGVAAGG--GAGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF-----------YFDIGEN 183
           RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF           YFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFSKELQLDTKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHS 243
           RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGW+AFRNILAEINEASRLFILPNQENSEHS
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWSAFRNILAEINEASRLFILPNQENSEHS 180

Query: 244 ERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 303
           ERL GLSDDVGAGFISGHSSQ GPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF
Sbjct: 181 ERLAGLSDDVGAGFISGHSSQSGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 240

Query: 304 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 358
           DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV
Sbjct: 241 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 300

BLAST of Cp4.1LG04g11370 vs. NCBI nr
Match: gi|659100199|ref|XP_008450978.1| (PREDICTED: transcription factor Pur-alpha 1 [Cucumis melo])

HSP 1 Score: 548.1 bits (1411), Expect = 1.1e-152
Identity = 282/305 (92.46%), Positives = 284/305 (93.11%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           MEGNSGGGGV IGGTTAG VA GG  G G GGGNDVELMCKTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEGNSGGGGVGIGGTTAGGVAAGG--GAGSGGGNDVELMCKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF-----------YFDIGEN 183
           RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVF           YFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFSKELQLDTKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHS 243
           RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGW+AFRNILA+INEASRLFILPNQENSEHS
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWSAFRNILADINEASRLFILPNQENSEHS 180

Query: 244 ERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 303
           ERL GLSDDVGAGFISGHSSQ GPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF
Sbjct: 181 ERLAGLSDDVGAGFISGHSSQSGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 240

Query: 304 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 358
           DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV
Sbjct: 241 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 300

BLAST of Cp4.1LG04g11370 vs. NCBI nr
Match: gi|703093202|ref|XP_010094851.1| (hypothetical protein L484_016433 [Morus notabilis])

HSP 1 Score: 506.1 bits (1302), Expect = 4.9e-140
Identity = 257/300 (85.67%), Positives = 274/300 (91.33%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAV------GGVTGGGGGGGNDVELMCKTLQVEHKLFYFDL 123
           MEGNSGGGG   GG+  G+VA       GG  GGGGGGGNDVEL+CKTLQVEHKLFYFDL
Sbjct: 1   MEGNSGGGGG--GGSGGGTVAAAAGGGGGGGGGGGGGGGNDVELLCKTLQVEHKLFYFDL 60

Query: 124 KENPRGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPEVFYFDIGENRRGRF 183
           KENPRGRYLKISEKTSATRSTIIVPFSGI WFLDLFNYY+NSDD +VFYFD+GENRRGRF
Sbjct: 61  KENPRGRYLKISEKTSATRSTIIVPFSGISWFLDLFNYYVNSDDQDVFYFDVGENRRGRF 120

Query: 184 LKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHSERLVG 243
           LKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLF+LPNQ++SE SERLVG
Sbjct: 121 LKVSEASVSRNRSTIIVPAGSTRDEGWAAFRNILAEINEASRLFMLPNQQSSEPSERLVG 180

Query: 244 LSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFFDLGSN 303
           LSDDVGAGFISGHSSQP  TS+LN+DR V+  AQDE+GN+GVSKVIRADQKRFFFDLGSN
Sbjct: 181 LSDDVGAGFISGHSSQPATTSELNIDRSVEFPAQDEIGNMGVSKVIRADQKRFFFDLGSN 240

Query: 304 NRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTVDPPHR 358
           NRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITKDRIEGMTG NVRTV+PP R
Sbjct: 241 NRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGANVRTVEPPQR 298

BLAST of Cp4.1LG04g11370 vs. NCBI nr
Match: gi|802659472|ref|XP_012080845.1| (PREDICTED: transcription factor Pur-alpha 1 isoform X2 [Jatropha curcas])

HSP 1 Score: 500.4 bits (1287), Expect = 2.7e-138
Identity = 258/305 (84.59%), Positives = 273/305 (89.51%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGGGNDVELMCKTLQVEHKLFYFDLKENPRG 123
           MEGNSGGGG   GG  AG+    GV+GGGGG GNDVELMCKTLQVEHKLFYFDLKENPRG
Sbjct: 1   MEGNSGGGGG--GGVAAGAT---GVSGGGGGAGNDVELMCKTLQVEHKLFYFDLKENPRG 60

Query: 124 RYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPE-----------VFYFDIGEN 183
           RYLKISEKTSATRSTIIVPFSGI WFLDLFNYY+NSDD +           VFYFDIGEN
Sbjct: 61  RYLKISEKTSATRSTIIVPFSGISWFLDLFNYYVNSDDQDLFSKELQLDTKVFYFDIGEN 120

Query: 184 RRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENSEHS 243
           RRGRFLKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLF+LPNQ++SE S
Sbjct: 121 RRGRFLKVSEASVSRNRSTIIVPAGSARDEGWAAFRNILAEINEASRLFMLPNQQSSESS 180

Query: 244 ERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKRFFF 303
           ERLVGLSDDVGAGFISGHSSQP P S+LNVDR V+L+ Q+E+GNLGVSKVIRADQKRFFF
Sbjct: 181 ERLVGLSDDVGAGFISGHSSQPAPASELNVDRSVELAPQEEIGNLGVSKVIRADQKRFFF 240

Query: 304 DLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNVRTV 358
           DLGSNNRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITK+RIEGMTG NVRTV
Sbjct: 241 DLGSNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKERIEGMTGANVRTV 300

BLAST of Cp4.1LG04g11370 vs. NCBI nr
Match: gi|590582554|ref|XP_007014656.1| (Transcription factor Pur-alpha 1 [Theobroma cacao])

HSP 1 Score: 496.1 bits (1276), Expect = 5.0e-137
Identity = 260/308 (84.42%), Positives = 271/308 (87.99%), Query Frame = 1

Query: 64  MEGNSGGGGVSIGGTTAGSVAVGGVTGGGGGG---GNDVELMCKTLQVEHKLFYFDLKEN 123
           MEGNSGGGG   GG + G+   GG  GGGGGG   GNDVEL+CKTLQVEHKLFYFDLKEN
Sbjct: 1   MEGNSGGGG---GGGSGGADRGGGGGGGGGGGERGGNDVELVCKTLQVEHKLFYFDLKEN 60

Query: 124 PRGRYLKISEKTSATRSTIIVPFSGIPWFLDLFNYYINSDDPE-----------VFYFDI 183
           PRGRYLKISEKTSATRSTIIVP SGI WFLDLFNYY+NSDD +           VFYFDI
Sbjct: 61  PRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHDLFSKELQLDTKVFYFDI 120

Query: 184 GENRRGRFLKVSEASVSRNRSTIIVPAGSNRDEGWTAFRNILAEINEASRLFILPNQENS 243
           GENRRGRFLKVSEASVSRNRSTIIVPAGS RDEGW AFRNILAEINEASRLFILPNQ+ S
Sbjct: 121 GENRRGRFLKVSEASVSRNRSTIIVPAGSTRDEGWAAFRNILAEINEASRLFILPNQQTS 180

Query: 244 EHSERLVGLSDDVGAGFISGHSSQPGPTSDLNVDRQVDLSAQDEMGNLGVSKVIRADQKR 303
           E SERLVGLSDDVGAGFISGHSSQP  TS+LNVDR VDL AQDE+GN+GVSKVIRADQKR
Sbjct: 181 EPSERLVGLSDDVGAGFISGHSSQPASTSELNVDRSVDLPAQDEIGNMGVSKVIRADQKR 240

Query: 304 FFFDLGSNNRGHFLRISEVAGADRSSIILPLSGLKQFYEIVGHFVEITKDRIEGMTGVNV 358
           FFFDLGSNNRGHFLRISEVAG+DRSSIILPLSGLKQF+EIVGHFVEITKDRIEGMTG NV
Sbjct: 241 FFFDLGSNNRGHFLRISEVAGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGANV 300

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value     Identity (%)    Description
PUR_ARATH     5.6e-120    75.49           Transcription factor Pur-alpha 1 OS=Arabidopsis thaliana GN=PURA1 PE=1 SV=2
PURB_MOUSE    4.9e-15     28.33           Transcriptional activator protein Pur-beta OS=Mus musculus GN=Purb PE=1 SV=3
PURB_DANRE    8.3e-15     27.80           Transcriptional activator protein Pur-beta OS=Danio rerio GN=purb PE=2 SV=3
PURA_MOUSE    1.2e-13     28.00           Transcriptional activator protein Pur-alpha OS=Mus musculus GN=Pura PE=1 SV=1
PURA_HUMAN    1.6e-13     27.91           Transcriptional activator protein Pur-alpha OS=Homo sapiens GN=PURA PE=1 SV=2

TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0LZP1_CUCSA    7.1e-154    93.11           Uncharacterized protein OS=Cucumis sativus GN=Csa_1G598870 PE=4 SV=1
W9R7X0_9ROSA        3.4e-140    85.67           Uncharacterized protein OS=Morus notabilis GN=L484_016433 PE=4 SV=1
A0A061GR22_THECC    3.5e-137    84.42           Transcription factor Pur-alpha 1 OS=Theobroma cacao GN=TCM_039973 PE=4 SV=1
A0A067KEG7_JATCU    4.6e-137    84.31           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16210 PE=4 SV=1
V4VZ56_9ROSI        6.0e-137    83.71           Uncharacterized protein OS=Citrus clementina GN=CICLE_v10016090mg PE=4 SV=1

TAIR10
Match Name     E-value     Identity (%)    Description
AT2G32080.1    3.2e-121    75.49           purin-rich alpha 1

NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|778663268|ref|XP_011660046.1|    1.0e-153    93.11           PREDICTED: transcription factor Pur-alpha 1 [Cucumis sativus]
gi|659100199|ref|XP_008450978.1|    1.1e-152    92.46           PREDICTED: transcription factor Pur-alpha 1 [Cucumis melo]
gi|703093202|ref|XP_010094851.1|    4.9e-140    85.67           hypothetical protein L484_016433 [Morus notabilis]
gi|802659472|ref|XP_012080845.1|    2.7e-138    84.59           PREDICTED: transcription factor Pur-alpha 1 isoform X2 [Jatropha curcas]
gi|590582554|ref|XP_007014656.1|    5.0e-137    84.42           Transcription factor Pur-alpha 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR006628    PUR-bd_fam
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG04g11370.1    Cp4.1LG04g11370.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                               Source     Source Term    Source Description
IPR006628    Purine-rich element binding protein family    PANTHER    PTHR12611      PUR-TRANSCRIPTIONAL ACTIVATOR
    Alignment: coord: 240..253, score: 1.2E-126
               coord: 269..355, score: 1.2E-126
               coord: 66..215, score: 1.2E
IPR006628    Purine-rich element binding protein family    PFAM       PF04845        PurA
    Alignment: coord: 163..216, score: 3.0E-7
               coord: 104..158, score: 9.0E-9
               coord: 271..333, score: 8.6
IPR006628    Purine-rich element binding protein family    SMART      SM00712        pur
    Alignment: coord: 164..219, score: 2.6E-13
               coord: 277..338, score: 4.4E-24
               coord: 100..161, score: 9.7