Cp4.1LG04g02120.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g02120.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: TATA-binding protein-associated factor 2N
Location: Cp4.1LG04 : 2552798 .. 2556215 (+)
Sequence length: 1699
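
The location and sequence length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported sequence length refers to the spliced mRNA; neither convention is stated on this page, so treat the result as an estimate.

```python
# Coordinates copied from the Location field (assumed 1-based, inclusive, + strand).
start, end = 2552798, 2556215
mrna_length = 1699  # from the "Sequence length" field (assumed to be the spliced mRNA)

# Number of genomic bases covered by the feature.
genomic_span = end - start + 1

# Assumption: the span contains only this transcript's exons and introns,
# so the difference is the total intron length.
intron_total = genomic_span - mrna_length

print(f"genomic span: {genomic_span} bp")
print(f"estimated total intron length: {intron_total} bp")
```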
The following sequences are available for this feature:

Gene sequence (with intron)

CATGGACCCGGTTCACTCCCATCAAACACGGGACTGAGAGAAGAAAAAAATAACAGAGTGGCAGAAATCAAAGTCTGAGATTGTCCAAGCCTCAGCACGGAACTCAGGTAACCCTATCGCTCTTCTTCCTCTACGTCTACTTCTCATCCCTTCTCTCCACTACCCATGATTCAATCACCATTCATCCTTCCAACATTTCTGCAATTTGGCTACAATTTCCCTTTCTTCCTTTACTTTCTTACCCTAATCTTTCGCTTTTCTCGCCTGCCCACTGTTTTCATTGTTTTCCTCTCTGCTTTTCCACTTGAGATCTATATTTTTGTAACCCATGTCGCCTCCTGCACGATGATTTATACGTTTCTCATGGGGTTTTCTAGAGATTTTAATTGACTATACAGTTCTCTTCATCTAAACTGGCCCATCTTTTGTGTAGATCTTAATATTGATTTCATGACAATTGGTGAATAATGGCAACCTATGCGGGTAAAGGAGCCCTCTCCAATGGATCTGTTTACATCTGCAACCTGCCCTTTGGGACCGATGAGAATATGCTTGCTGAATATTTTGGTACAATTGGGTTATTGAAGGTTCGGAGTTTAGCCACAATTTTGTTCACCCTTTTTCACTTTCTATACACATTGGATTTTGCTTTTATTGTTCTTATAATAACTGTTGGACATTTTTTGCTTAATCTTCAAAGAAAGACAAGCGAACAGGTAGACCGAAAATCTGGTTGTATCGTGATAAATCGACGAATGAGCCTAAAGGAGATGCTACTGTGACATATGAGGATCCACATGCAGCTTTAGCGGCTGTTGAATGGTTTGACAATAAGGATTTTCATGGTAACATCATTGAAGTTCATATAGCAGAGTCCAAAAGTAAAGATGATGTCTCGTTTAATGTGGGGGTCGATCCGATTGTTGTTGGTGATACTCTTGATTTTGAGGAAAATGATGGGGGTGGTATGAATGGGGGCGGTGGAAGGGGGAGAGGTCGCGGTGATGCTCCTCCAGGAAAAGCATGGCAACAAGAGGGAGATTGGCTGTGTCCAAATACAAGGTCTGTGATGTGATATTGGTGAGCAGCTTCTTCTTGCTTCTTTCAATCTTGGCTGGATTCTATGTTCTATTTAGTATTGTTGAAATGGTTTCTCATGCTCTATCCTACTGGCAGTTGTACCAATGTTAATTTTGCATTTCGTGGTGTGTGTAACCGGTGTGGAAGTGCCCGACCTTCTGGTGCTGCTGGCAGTGGTGCAGGCTCTATAGGTCGTGGCAGGGGCCGTGGCACTGGCAACCAGGAATCAGGAAGCAATAGTCGTCCAGTTGGTGGCCCTACTGGTCTCTTTGGTCCAAATGACTGGCCGTGTCCAATGTAAATATAATCTTCCTTTTACTTATATGCCATCGCCATATTATACGTGCATCAGTTTTAGCTTGTAGGATTGTCTTACCTTCGCGTTTTCGTTTCTGCAATTTTATCATTTGGTATATTAGGAATTCGAGTCTGATTTTGTAAAACCTACCAAATTAATTGGTTTCTTTTTTCAATGCAACTATATGGATCAAAGTAGATTATATTGATGTTTTAAAAAGAATAAAAAGATGGATTATATATAGTTAGAAGATATAATCACACATCCAAAATAATAATAAAAAAAAAAGACGAAGAAGATGGTTCGGTGTAGAATCAAACGACAGGAAGTTCTTTTCAATATTTGATTTCACACATCTTTATATACGCCCTTTACTTTTGGTTTAATCTTAAGGAATAGAATAAGCATTAAAGACAGTGACACATCTGAATATATGAAGTTATCTTGCTTTACATGTATACTTTCTTGTTTCTTTCAGAATGTGGTCTTTTAATTAAATATTTTGAATATCATGGGATGGGGAAGGAGGCATCTCTAATGTTAATGTTAATTTGAAGTTTCGGGAGAAGTGTTCTATCGTACGGTTATTTGTTTCTTTAATCTTGTTCTCTGCTTGGATAAC
ACTGCTTGCTTGTGTAACTGGTTGATATTGGTGCTTCCTTCTGCTTTATGTTCCAGGTGTGGTAACATCAATTGGGCAAAGCGCTCGAAATGCAATATTTGTAACACGAACAAACCGGGTCACAATGAGGGTGGCGTGAGGTATAACATGAACTCTTTTGTGTATTTCCCAAGAGAAATAATGCAACTTTTTATTTGATCAAAATATGGTCTAGGCTTTTCTAACAAATTGATGTGAATTTGACTTTTGAGATTGAGTTCTGAGCTTAATCAATTAAATTTAAAGCAAATTGATGGGTTCATCTGCAGTTTCCTGAACTTTAATTTTGGTATATCTTGGAATAGCTTAGAAACTGGAAATCTATACTTGTTGTTACTACCCTGAAATGAATACTTATAGCCTACTATAAAATTTCTCTTATTTTGAATTATAATTGAACAGAGGAGGGCGCGGTGGAGGTTACAAAGAACTTGATGAAGAAGAGTTGGAGGAGACTAAACGACGTCGGCGTGAAGCTGAAGAAGTAAGATGCTGAAACCATGATTTTGAGAAAACGAAAACAGTTTTTGGATTGTGATATGATCGTTTCATTACGATATCAGGATGATGGTGAGATGTATGACGAGTTTGGAAACCTTAAGAAGAAGTTTCGTGCCAAATCCCAGCAAATGGAAGCCGGTCAGATAATACCAGGTGCTGGGCGGGCTGGATGGGAATTCGAAGAACTAGGTACGTGTCGGTTGACTAACACCGTGCTATTGTCCTTGAAAAAGTCGATGATAATTCTTTGTGCTATTGATTACGTATTTACATCTTTGGCAAACTTACATTTTCAGGTGTAGTTGAGAAGGATAGAAGGGAGAGAAGTAGAGACCAAGGAAGGGAATGGAATGACCGAGACAGAGACCGAGACCGAGATGGTAGCAGAAATAAAGAAAGAGAACCTAGGGAAAGACATCGGAGTCGAAGCAGAGAGCGGGACAGGGGTCGAGATCGTGATCGTGATCGAGATCGAGATCGAGACTATGATTTCGAGCGAGATAAAGAACATGGAAGGGAGAAGGAGCACCGGAGCAGGCACCGGTATTGAAGAGCTTGGATTAAAACTTTATGGCCGAACCTGGTGGGTCTCATTTTTTATGAAAGATATTCTATTTTTGGTATTATTGTGATTACATCTTCTTGGCGCTTTTAAGAATTCTATCAGAATACGAATTGAATTTGCAAAGCAATTGGAATTGTTTGGCTGCCTTCTCACCACCAACTCCAAACTTGTAATCAAGTTCACAGATTAGCAAACTTAGGCGATCTTATTTATATGTACATGACTCTCGAAAAGACTCTTGAAAATAGACATGAGAAATCGAAAAGACACTTGTAACTGCTCGACTCTCGAGCTTCTGCCAACGACTGTTATC

mRNA sequence

CATGGACCCGGTTCACTCCCATCAAACACGGGACTGAGAGAAGAAAAAAATAACAGAGTGGCAGAAATCAAAGTCTGAGATTGTCCAAGCCTCAGCACGGAACTCAGATCTTAATATTGATTTCATGACAATTGGTGAATAATGGCAACCTATGCGGGTAAAGGAGCCCTCTCCAATGGATCTGTTTACATCTGCAACCTGCCCTTTGGGACCGATGAGAATATGCTTGCTGAATATTTTGGTACAATTGGGTTATTGAAGAAAGACAAGCGAACAGGTAGACCGAAAATCTGGTTGTATCGTGATAAATCGACGAATGAGCCTAAAGGAGATGCTACTGTGACATATGAGGATCCACATGCAGCTTTAGCGGCTGTTGAATGGTTTGACAATAAGGATTTTCATGGTAACATCATTGAAGTTCATATAGCAGAGTCCAAAAGTAAAGATGATGTCTCGTTTAATGTGGGGGTCGATCCGATTGTTGTTGGTGATACTCTTGATTTTGAGGAAAATGATGGGGGTGGTATGAATGGGGGCGGTGGAAGGGGGAGAGGTCGCGGTGATGCTCCTCCAGGAAAAGCATGGCAACAAGAGGGAGATTGGCTGTGTCCAAATACAAGTTGTACCAATGTTAATTTTGCATTTCGTGGTGTGTGTAACCGGTGTGGAAGTGCCCGACCTTCTGGTGCTGCTGGCAGTGGTGCAGGCTCTATAGGTCGTGGCAGGGGCCGTGGCACTGGCAACCAGGAATCAGGAAGCAATAGTCGTCCAGTTGGTGGCCCTACTGGTCTCTTTGGTCCAAATGACTGGCCGTGTCCAATGTGTGGTAACATCAATTGGGCAAAGCGCTCGAAATGCAATATTTGTAACACGAACAAACCGGGTCACAATGAGGGTGGCGTGAGAGGAGGGCGCGGTGGAGGTTACAAAGAACTTGATGAAGAAGAGTTGGAGGAGACTAAACGACGTCGGCGTGAAGCTGAAGAAGATGATGGTGAGATGTATGACGAGTTTGGAAACCTTAAGAAGAAGTTTCGTGCCAAATCCCAGCAAATGGAAGCCGGTCAGATAATACCAGGTGCTGGGCGGGCTGGATGGGAATTCGAAGAACTAGGTGTAGTTGAGAAGGATAGAAGGGAGAGAAGTAGAGACCAAGGAAGGGAATGGAATGACCGAGACAGAGACCGAGACCGAGATGGTAGCAGAAATAAAGAAAGAGAACCTAGGGAAAGACATCGGAGTCGAAGCAGAGAGCGGGACAGGGGTCGAGATCGTGATCGTGATCGAGATCGAGATCGAGACTATGATTTCGAGCGAGATAAAGAACATGGAAGGGAGAAGGAGCACCGGAGCAGGCACCGGTATTGAAGAGCTTGGATTAAAACTTTATGGCCGAACCTGGTGGGTCTCATTTTTTATGAAAGATATTCTATTTTTGGTATTATTGTGATTACATCTTCTTGGCGCTTTTAAGAATTCTATCAGAATACGAATTGAATTTGCAAAGCAATTGGAATTGTTTGGCTGCCTTCTCACCACCAACTCCAAACTTGTAATCAAGTTCACAGATTAGCAAACTTAGGCGATCTTATTTATATGTACATGACTCTCGAAAAGACTCTTGAAAATAGACATGAGAAATCGAAAAGACACTTGTAACTGCTCGACTCTCGAGCTTCTGCCAACGACTGTTATC

Coding sequence (CDS)

ATGGCAACCTATGCGGGTAAAGGAGCCCTCTCCAATGGATCTGTTTACATCTGCAACCTGCCCTTTGGGACCGATGAGAATATGCTTGCTGAATATTTTGGTACAATTGGGTTATTGAAGAAAGACAAGCGAACAGGTAGACCGAAAATCTGGTTGTATCGTGATAAATCGACGAATGAGCCTAAAGGAGATGCTACTGTGACATATGAGGATCCACATGCAGCTTTAGCGGCTGTTGAATGGTTTGACAATAAGGATTTTCATGGTAACATCATTGAAGTTCATATAGCAGAGTCCAAAAGTAAAGATGATGTCTCGTTTAATGTGGGGGTCGATCCGATTGTTGTTGGTGATACTCTTGATTTTGAGGAAAATGATGGGGGTGGTATGAATGGGGGCGGTGGAAGGGGGAGAGGTCGCGGTGATGCTCCTCCAGGAAAAGCATGGCAACAAGAGGGAGATTGGCTGTGTCCAAATACAAGTTGTACCAATGTTAATTTTGCATTTCGTGGTGTGTGTAACCGGTGTGGAAGTGCCCGACCTTCTGGTGCTGCTGGCAGTGGTGCAGGCTCTATAGGTCGTGGCAGGGGCCGTGGCACTGGCAACCAGGAATCAGGAAGCAATAGTCGTCCAGTTGGTGGCCCTACTGGTCTCTTTGGTCCAAATGACTGGCCGTGTCCAATGTGTGGTAACATCAATTGGGCAAAGCGCTCGAAATGCAATATTTGTAACACGAACAAACCGGGTCACAATGAGGGTGGCGTGAGAGGAGGGCGCGGTGGAGGTTACAAAGAACTTGATGAAGAAGAGTTGGAGGAGACTAAACGACGTCGGCGTGAAGCTGAAGAAGATGATGGTGAGATGTATGACGAGTTTGGAAACCTTAAGAAGAAGTTTCGTGCCAAATCCCAGCAAATGGAAGCCGGTCAGATAATACCAGGTGCTGGGCGGGCTGGATGGGAATTCGAAGAACTAGGTGTAGTTGAGAAGGATAGAAGGGAGAGAAGTAGAGACCAAGGAAGGGAATGGAATGACCGAGACAGAGACCGAGACCGAGATGGTAGCAGAAATAAAGAAAGAGAACCTAGGGAAAGACATCGGAGTCGAAGCAGAGAGCGGGACAGGGGTCGAGATCGTGATCGTGATCGAGATCGAGATCGAGACTATGATTTCGAGCGAGATAAAGAACATGGAAGGGAGAAGGAGCACCGGAGCAGGCACCGGTATTGA
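
Because the CDS is a contiguous substring of the spliced mRNA, the untranslated regions can be recovered by locating it. This is a minimal sketch; `split_utrs` is a hypothetical helper name, and the toy strings stand in for the full sequences listed above.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (five_prime_utr, three_prime_utr) flanking the CDS in the mRNA."""
    start = mrna.find(cds)  # first occurrence of the CDS in the mRNA
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return mrna[:start], mrna[start + len(cds):]


# Toy example (not the real sequences): 4 nt 5' UTR, 12 nt CDS, 4 nt 3' UTR.
toy_cds = "ATGGCAACCTGA"
toy_mrna = "GGCC" + toy_cds + "TTAA"
five, three = split_utrs(toy_mrna, toy_cds)
print(len(five), len(three))
```

To apply it to this record, paste the mRNA and CDS strings from the page in place of the toy values.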

Protein sequence

MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKFRAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKEREPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY
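
The protein sequence can be reproduced from the CDS by translation. The sketch below assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon; `translate` is an illustrative helper, not part of any database tooling.

```python
# Standard genetic code, built in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 12 nucleotides of the CDS shown above.
print(translate("ATGGCAACCTATGCGGGTAAAGGAGCC"))
print(translate("CACCGGTATTGA"))
```

The two fragments correspond to the start (MATYAGKGA) and end (HRY) of the protein sequence listed above.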
BLAST of Cp4.1LG04g02120.1 vs. Swiss-Prot
Match: TAF15_ARATH (Transcription initiation factor TFIID subunit 15 OS=Arabidopsis thaliana GN=TAF15 PE=1 SV=1)

HSP 1 Score: 481.9 bits (1239), Expect = 7.1e-135
Identity = 270/402 (67.16%), Positives = 309/402 (76.87%), Query Frame = 1

Query: 8   GALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATV 67
           G  +NGSVY+ NLP GTDENMLA+YFGTIGLLK+DKRTG PK+WLYRDK T+EPKGDATV
Sbjct: 3   GYPTNGSVYVSNLPLGTDENMLADYFGTIGLLKRDKRTGTPKVWLYRDKETDEPKGDATV 62

Query: 68  TYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDG 127
           TYEDPHAALAAVEWF+NKDFHGN I V +AESK+K+             GD ++F E DG
Sbjct: 63  TYEDPHAALAAVEWFNNKDFHGNTIGVFMAESKNKN------------AGDAVEFVEFDG 122

Query: 128 GG--MNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAA 187
           G    NGG GRGRG+ D+   K WQQ+GDW+CPNTSCTNVNFAFRGVCNRCG+ARP+GA+
Sbjct: 123 GAEETNGGAGRGRGQADSS-AKPWQQDGDWMCPNTSCTNVNFAFRGVCNRCGTARPAGAS 182

Query: 188 GSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSKCNICNT 247
           G   G+ GRGRGRG G        +P G PTGLFGPNDW CPMCGN+NWAKR KCNICNT
Sbjct: 183 GGSMGA-GRGRGRGGGADGGAPGKQPSGAPTGLFGPNDWACPMCGNVNWAKRLKCNICNT 242

Query: 248 NKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKFRAKSQQ 307
           NKPG NEGGVRGGRGGGYKELDE+ELEETKRRRREAEEDDGEMYDEFGNLKKK+R K+ Q
Sbjct: 243 NKPGQNEGGVRGGRGGGYKELDEQELEETKRRRREAEEDDGEMYDEFGNLKKKYRVKTNQ 302

Query: 308 MEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKEREPRER 367
            +    +  AGRAGWE EELG ++KD RERSR       DR RDR RD   +K     +R
Sbjct: 303 ADTRPAV-AAGRAGWEVEELG-IDKDGRERSR-------DRQRDRGRDHHYDK-----DR 362

Query: 368 HRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRH 408
            RSRSRER+RG++RD D D DRD    RD+++GRE+  R R+
Sbjct: 363 RRSRSRERERGKERDYDYDHDRD----RDRDYGRERGSRYRN 372
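
The percentages in each HSP header are the matched count divided by the alignment length. The sketch below parses one header line and recomputes both values; the regular expression and the helper name `recompute` are illustrative assumptions about this page's text layout, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 270/402 (67.16%), Positives = 309/402 (76.87%)".
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    id_n, id_d, pos_n, pos_d = (int(g) for g in m.groups())
    return round(100 * id_n / id_d, 2), round(100 * pos_n / pos_d, 2)


header = "Identity = 270/402 (67.16%), Postives = 309/402 (76.87%), Query Frame = 1"
print(recompute(header))
```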

BLAST of Cp4.1LG04g02120.1 vs. Swiss-Prot
Match: RBP56_HUMAN (TATA-binding protein-associated factor 2N OS=Homo sapiens GN=TAF15 PE=1 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 9.5e-31
Identity = 96/258 (37.21%), Positives = 131/258 (50.78%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTYED 71
           N ++++  L  G   + + E+F  IG++K +K+TG+P I LY DK T +PKG+ATV+++D
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292

Query: 72  PHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDGGGMN 131
           P +A AA++WFD K+FHGNII+V  A  + +    F  G                GGG  
Sbjct: 293 PPSAKAAIDWFDGKEFHGNIIKVSFATRRPE----FMRG-------------GGSGGGRR 352

Query: 132 GGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAAGSGAGS 191
           G GG  RGRG         + GDW+CPN SC N+NFA R  CN+C   RP  +  SG   
Sbjct: 353 GRGGY-RGRGGFQGRGGDPKSGDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGGDF 412

Query: 192 IGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSKCNICNTNKPGHN 251
            GRG G   G +  G      GG  G +G +       G      RS     + ++ G  
Sbjct: 413 RGRGYGGERGYRGRGGR----GGDRGGYGGD-----RSGGGYGGDRSSGGGYSGDRSGGG 463

Query: 252 EGGVRGG------RGGGY 264
            GG R G      RGGGY
Sbjct: 473 YGGDRSGGGYGGDRGGGY 463

BLAST of Cp4.1LG04g02120.1 vs. Swiss-Prot
Match: FUS_MOUSE (RNA-binding protein FUS OS=Mus musculus GN=Fus PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 2.0e-28
Identity = 81/225 (36.00%), Positives = 116/225 (51.56%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTYED 71
           N ++++  L        +A+YF  IG++K +K+TG+P I LY D+ T + KG+ATV+++D
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336

Query: 72  PHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDGGGMN 131
           P +A AA++WFD K+F GN I+V  A  ++  +     G      G  +      GGG +
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMG-RGGYGGGGS 396

Query: 132 GGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAAGS---- 191
           GGGGRG        G   Q+ GDW CPN +C N+NF++R  CN+C + +P G  G     
Sbjct: 397 GGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGS 456

Query: 192 -----------GAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGP 222
                      G G   RG  RG G    G      GG  G FGP
Sbjct: 457 HMGGNYGDDRRGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGP 500

BLAST of Cp4.1LG04g02120.1 vs. Swiss-Prot
Match: FUS_BOVIN (RNA-binding protein FUS OS=Bos taurus GN=FUS PE=2 SV=2)

HSP 1 Score: 127.9 bits (320), Expect = 2.6e-28
Identity = 81/226 (35.84%), Positives = 116/226 (51.33%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTYED 71
           N ++++  L        +A+YF  IG++K +K+TG+P I LY D+ T + KG+ATV+++D
Sbjct: 271 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 330

Query: 72  PHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDGGGMN 131
           P +A AA++WFD K+F GN I+V  A  ++  +     G      G  +      GGG +
Sbjct: 331 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMG-RGGYGGGGS 390

Query: 132 GGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAAG----- 191
           GGGGRG        G   Q+ GDW CPN +C N+NF++R  CN+C + +P G  G     
Sbjct: 391 GGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGS 450

Query: 192 -----------SGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGP 222
                       G G   RG  RG G    G      GG  G FGP
Sbjct: 451 HMGGNYGDDRRGGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGP 495

BLAST of Cp4.1LG04g02120.1 vs. Swiss-Prot
Match: FUS_HUMAN (RNA-binding protein FUS OS=Homo sapiens GN=FUS PE=1 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.6e-28
Identity = 81/226 (35.84%), Positives = 116/226 (51.33%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTYED 71
           N ++++  L        +A+YF  IG++K +K+TG+P I LY D+ T + KG+ATV+++D
Sbjct: 284 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 343

Query: 72  PHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDGGGMN 131
           P +A AA++WFD K+F GN I+V  A  ++  +     G      G  +      GGG +
Sbjct: 344 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMG-RGGYGGGGS 403

Query: 132 GGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAAG----- 191
           GGGGRG        G   Q+ GDW CPN +C N+NF++R  CN+C + +P G  G     
Sbjct: 404 GGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGS 463

Query: 192 -----------SGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGP 222
                       G G   RG  RG G    G      GG  G FGP
Sbjct: 464 HMGGNYGDDRRGGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGP 508

BLAST of Cp4.1LG04g02120.1 vs. TrEMBL
Match: A0A0A0LCQ7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G824830 PE=4 SV=1)

HSP 1 Score: 679.9 bits (1753), Expect = 2.0e-192
Identity = 353/410 (86.10%), Positives = 377/410 (91.95%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA+Y GKGA SNGS+YICNLP+GTDENMLAEYFGTIG+LKKDKRTGRPKIWLYRDKSTNE
Sbjct: 1   MASYVGKGAPSNGSIYICNLPYGTDENMLAEYFGTIGVLKKDKRTGRPKIWLYRDKSTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVV-GDT 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+IIEVHIAESKSKDD+SFNV VDPIV  GD 
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIEVHIAESKSKDDLSFNVVVDPIVAAGDD 120

Query: 121 LDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSA 180
           +  EE    GMNGGGGRGRGRGDAP GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSA
Sbjct: 121 IGSEET-AVGMNGGGGRGRGRGDAP-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSK 240
           RPSGAAGSGAGSIGRGRGRGT NQ+SG NSR VG PTGLFGPNDWPCPMCGNINWAKR+K
Sbjct: 181 RPSGAAGSGAGSIGRGRGRGTSNQDSGGNSRQVGAPTGLFGPNDWPCPMCGNINWAKRTK 240

Query: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300
           CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF
Sbjct: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300

Query: 301 RAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKE 360
           RAKSQQMEAG+I+PGAGRAGWE EELGVVEKDRRERSRD+GR+W+      DRD SRN+E
Sbjct: 301 RAKSQQMEAGRILPGAGRAGWEVEELGVVEKDRRERSRDRGRDWD------DRDSSRNRE 360

Query: 361 REPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY 410
           RE RERHRSRSRERDRG      RDRD DY++ERD+E+GR+K+HR+RHRY
Sbjct: 361 RESRERHRSRSRERDRG------RDRDLDYEYERDREYGRDKDHRNRHRY 396

BLAST of Cp4.1LG04g02120.1 vs. TrEMBL
Match: A0A067JGB5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01663 PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 4.3e-163
Identity = 309/410 (75.37%), Positives = 351/410 (85.61%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MATY GKGA SNGS+Y+CNLP GTDENMLAEYFGTIG+LKKDKR+GRPKIWLYRDK TNE
Sbjct: 1   MATYPGKGAPSNGSIYVCNLPQGTDENMLAEYFGTIGVLKKDKRSGRPKIWLYRDKITNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTL 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+II V +AESKSKDD ++N GVDP V GD  
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIGVFMAESKSKDDNTYNSGVDPNVAGDFG 120

Query: 121 DFEENDGGGMNGG-GGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSA 180
             EE+    MNGG GG+GRGRGDA  GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSA
Sbjct: 121 GLEEST---MNGGDGGKGRGRGDAS-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSK 240
           RPSG +G  AG+ GRGRGRG+  Q +  + R   G TGLFGPNDWPCPMCGNINWAKR+K
Sbjct: 181 RPSGVSGGSAGAGGRGRGRGS--QNTAGHGRSATGSTGLFGPNDWPCPMCGNINWAKRTK 240

Query: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300
           CNICNTNKPGHNEGGVRGGRGGGYKELDEEE+EET+RRR+EAEEDDGE+YDEFGNLKKKF
Sbjct: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEEIEETRRRRKEAEEDDGELYDEFGNLKKKF 300

Query: 301 RAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKE 360
           R K+QQ E+G+++PGAGRAGWE EELGV ++D RERSRD GRE   RD   DR+ S+N+E
Sbjct: 301 RVKTQQAESGRVLPGAGRAGWEVEELGVADRDARERSRDGGRE---RD---DRESSKNRE 360

Query: 361 REPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY 410
              R+RHRSRSRERDRG+DRDR+ D DRD D+ RD++  R+++ R R+RY
Sbjct: 361 YNARDRHRSRSRERDRGKDRDREYDYDRDRDYGRDRDRDRDRD-RDRYRY 397

BLAST of Cp4.1LG04g02120.1 vs. TrEMBL
Match: D7TAD5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g01410 PE=4 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 4.7e-162
Identity = 314/425 (73.88%), Positives = 351/425 (82.59%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA Y+G GA SNGSVY+CNLP GTDE MLAEYFGTIGL+KKDKRTGRPKIWLYRDK TNE
Sbjct: 1   MANYSGTGAPSNGSVYVCNLPHGTDETMLAEYFGTIGLIKKDKRTGRPKIWLYRDKVTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDT- 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+II V IAESK+KDD S+N G     VGD  
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIGVFIAESKNKDDHSYNSGNQVNDVGDPG 120

Query: 121 ----LDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNR 180
               L   E D   MNG  GRGRGRGDA  GKAWQQ+GDWLCPNTSC+NVNFAFRGVCNR
Sbjct: 121 AASDLRGLEEDPLDMNGSAGRGRGRGDAA-GKAWQQDGDWLCPNTSCSNVNFAFRGVCNR 180

Query: 181 CGSARPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWA 240
           CGSARPSG +GSGAG+ GRGRGRG    +S  + R VG PTGLFGPNDWPCPMCGNINWA
Sbjct: 181 CGSARPSGVSGSGAGAGGRGRGRG--GPDSAGHGRSVGAPTGLFGPNDWPCPMCGNINWA 240

Query: 241 KRSKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNL 300
           KR+KCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGE+YDEFGNL
Sbjct: 241 KRTKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGELYDEFGNL 300

Query: 301 KKKFRAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGS 360
           KKKFRAK+QQ EAGQ++PGAGRAGWE EELG+ ++D RERS+D+GRE +DR+        
Sbjct: 301 KKKFRAKTQQAEAGQVLPGAGRAGWEVEELGMADRDGRERSKDRGRERDDRE-------- 360

Query: 361 RNKEREPRERHRSRSRERDRGRDRDRDR----DRDRDYDFERDKEHGREKEH-------R 410
           +N+ER+ R+R RSRSRERDRG+DRDRDR    DRDRDY  +RD++  R+++        R
Sbjct: 361 KNRERDDRDRRRSRSRERDRGKDRDRDRDYDHDRDRDYGRDRDRDRDRDRDRDRDRDRDR 414

BLAST of Cp4.1LG04g02120.1 vs. TrEMBL
Match: B9S3N2_RICCO (RNA binding protein, putative OS=Ricinus communis GN=RCOM_1385570 PE=4 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 1.9e-158
Identity = 302/406 (74.38%), Positives = 340/406 (83.74%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA+Y+GKGA +NGSVY+CNLP GTDE+MLAEYFGTIGLLKKDKRTGRPKIWLYRDK TNE
Sbjct: 2   MASYSGKGASANGSVYVCNLPQGTDEDMLAEYFGTIGLLKKDKRTGRPKIWLYRDKLTNE 61

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTL 120
           PKGDATVTYEDPHAA AA+EWF+NKDFHGN+I V +AESK+KD+ ++N  VDP  VGD  
Sbjct: 62  PKGDATVTYEDPHAAQAAIEWFNNKDFHGNLIGVFMAESKNKDEHAYNSEVDPNAVGDFG 121

Query: 121 DFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSAR 180
             EE   G MN  GGRGRG+GDA  GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSAR
Sbjct: 122 GLEET-AGDMNDDGGRGRGKGDAS-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSAR 181

Query: 181 PSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSKC 240
           PSGA+G   GS G GRGRG G Q SG   R   G TGLFGPNDWPCPMCGNINWAKR+KC
Sbjct: 182 PSGASG---GSAGAGRGRGRGGQNSGGLGRAATGSTGLFGPNDWPCPMCGNINWAKRTKC 241

Query: 241 NICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKFR 300
           NICNTNKPGHNEGGVRGGRGGGYKELDEEE+EET+RRR+EAEEDDGE+YDEFGNLKKKFR
Sbjct: 242 NICNTNKPGHNEGGVRGGRGGGYKELDEEEIEETRRRRKEAEEDDGELYDEFGNLKKKFR 301

Query: 301 AKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKER 360
           AK+QQ E+G+++PGAGRAGWE EELGV+++D RERSR++ RE     RD DR+ S+N+E 
Sbjct: 302 AKTQQAESGRVLPGAGRAGWEVEELGVIDRDGRERSRERVRE-----RD-DRESSKNREH 361

Query: 361 EPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSR 407
             RER RSRSRERDRG+DRD D D DRD D+ RD++  R      R
Sbjct: 362 NGRERRRSRSRERDRGKDRDWDYDYDRDRDYGRDRDRDRNPNPSGR 396

BLAST of Cp4.1LG04g02120.1 vs. TrEMBL
Match: M5XDK0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006646mg PE=4 SV=1)

HSP 1 Score: 565.5 bits (1456), Expect = 5.4e-158
Identity = 312/416 (75.00%), Positives = 345/416 (82.93%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MATY GKGA SNGSVY+CNLP GTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDK TNE
Sbjct: 1   MATYPGKGAPSNGSVYVCNLPEGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKLTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTL 120
           PKGDATVTYEDPHAA AAVEWF++KDFHGN I V IAESKSKDD   N   DPIV G+  
Sbjct: 61  PKGDATVTYEDPHAASAAVEWFNDKDFHGNAIGVFIAESKSKDDQIHNPVGDPIVGGE-Y 120

Query: 121 DFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTS-CTNVNFAFRGVCNRCGSA 180
           D  E      NGG G GRGRGDAP GKAWQQEGDW CPNTS C+NVNFAFRGVCNRCGSA
Sbjct: 121 DGLEEIAQDTNGGVGGGRGRGDAP-GKAWQQEGDWTCPNTSSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTG-LFGPNDWPCPMCGNINWAKRS 240
           RP+GA+G GAG  GRGRGRG    +SG    PVG PTG LFGPNDWPCPMCGNINWAKR+
Sbjct: 181 RPTGASGGGAGGGGRGRGRGL---DSGGRGGPVGAPTGGLFGPNDWPCPMCGNINWAKRT 240

Query: 241 KCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKK 300
           KCNICNTN+PGHNEGGVRGGRGGGYKELDEEELEE KRRRREAEEDDGEMYDEFGNLKKK
Sbjct: 241 KCNICNTNRPGHNEGGVRGGRGGGYKELDEEELEEIKRRRREAEEDDGEMYDEFGNLKKK 300

Query: 301 FRAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNK 360
           FRAK+QQ E G+ +PGAGRAGWE EE+GV+++D RER+R++GRE     RD DR  S+N+
Sbjct: 301 FRAKTQQTETGRALPGAGRAGWEVEEIGVIDRDGRERNRERGRE-----RD-DRPSSKNR 360

Query: 361 EREPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEH-----RSRHRY 410
           ER+ R+R RSRSRE    R+RDR RDR RDYD++RDK++GR+++      R+RHRY
Sbjct: 361 ERDDRDRRRSRSRE----RERDRGRDRPRDYDYDRDKDYGRDRDRDRDRDRNRHRY 401

BLAST of Cp4.1LG04g02120.1 vs. TAIR10
Match: AT1G50300.1 (AT1G50300.1 TBP-associated factor 15)

HSP 1 Score: 481.9 bits (1239), Expect = 4.0e-136
Identity = 270/402 (67.16%), Positives = 309/402 (76.87%), Query Frame = 1

Query: 8   GALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATV 67
           G  +NGSVY+ NLP GTDENMLA+YFGTIGLLK+DKRTG PK+WLYRDK T+EPKGDATV
Sbjct: 3   GYPTNGSVYVSNLPLGTDENMLADYFGTIGLLKRDKRTGTPKVWLYRDKETDEPKGDATV 62

Query: 68  TYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEENDG 127
           TYEDPHAALAAVEWF+NKDFHGN I V +AESK+K+             GD ++F E DG
Sbjct: 63  TYEDPHAALAAVEWFNNKDFHGNTIGVFMAESKNKN------------AGDAVEFVEFDG 122

Query: 128 GG--MNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSARPSGAA 187
           G    NGG GRGRG+ D+   K WQQ+GDW+CPNTSCTNVNFAFRGVCNRCG+ARP+GA+
Sbjct: 123 GAEETNGGAGRGRGQADSS-AKPWQQDGDWMCPNTSCTNVNFAFRGVCNRCGTARPAGAS 182

Query: 188 GSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSKCNICNT 247
           G   G+ GRGRGRG G        +P G PTGLFGPNDW CPMCGN+NWAKR KCNICNT
Sbjct: 183 GGSMGA-GRGRGRGGGADGGAPGKQPSGAPTGLFGPNDWACPMCGNVNWAKRLKCNICNT 242

Query: 248 NKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKFRAKSQQ 307
           NKPG NEGGVRGGRGGGYKELDE+ELEETKRRRREAEEDDGEMYDEFGNLKKK+R K+ Q
Sbjct: 243 NKPGQNEGGVRGGRGGGYKELDEQELEETKRRRREAEEDDGEMYDEFGNLKKKYRVKTNQ 302

Query: 308 MEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKEREPRER 367
            +    +  AGRAGWE EELG ++KD RERSR       DR RDR RD   +K     +R
Sbjct: 303 ADTRPAV-AAGRAGWEVEELG-IDKDGRERSR-------DRQRDRGRDHHYDK-----DR 362

Query: 368 HRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRH 408
            RSRSRER+RG++RD D D DRD    RD+++GRE+  R R+
Sbjct: 363 RRSRSRERERGKERDYDYDHDRD----RDRDYGRERGSRYRN 372

BLAST of Cp4.1LG04g02120.1 vs. TAIR10
Match: AT5G58470.1 (AT5G58470.1 TBP-associated factor 15B)

HSP 1 Score: 59.3 bits (142), Expect = 6.4e-09
Identity = 47/137 (34.31%), Positives = 65/137 (47.45%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDK-RTGRPKIW-----LYRDKSTNEPKGDA 71
           N  +YI NLP     + L + FG IG + + K + G    W     +Y D+  N  KGDA
Sbjct: 279 NARIYISNLPPDVTTDELKDLFGGIGQVGRIKQKRGYKDQWPYNIKIYTDEKGNY-KGDA 338

Query: 72  TVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTLDFEEN 131
            + YEDP AA +A  +F+N +  GN I V +AE  +    +               F++ 
Sbjct: 339 CLAYEDPSAAHSAGGFFNNYEMRGNKISVTMAEKSAPRAPT---------------FDQR 398

Query: 132 DGGGMNGGGGRGRGRGD 143
            GG   GGGG G G GD
Sbjct: 399 GGGRGGGGGGYGGGGGD 399

BLAST of Cp4.1LG04g02120.1 vs. TAIR10
Match: AT5G16260.1 (AT5G16260.1 RNA binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 48.9 bits (115), Expect = 8.6e-06
Identity = 26/67 (38.81%), Positives = 40/67 (59.70%), Query Frame = 1

Query: 12  NGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNEPKGDATVTY-E 71
           N  +Y+  LP       +AE F   G++ K+  TG+P+I LY DK+T + KGDA ++Y +
Sbjct: 275 NPHIYVNGLPDDVTIEEVAEVFSKCGII-KEDDTGKPRIKLYSDKATGKLKGDALISYMK 334

Query: 72  DPHAALA 78
           +P   LA
Sbjct: 335 EPSVDLA 340

BLAST of Cp4.1LG04g02120.1 vs. NCBI nr
Match: gi|659085353|ref|XP_008443374.1| (PREDICTED: transcription initiation factor TFIID subunit 15 [Cucumis melo])

HSP 1 Score: 686.8 bits (1771), Expect = 2.3e-194
Identity = 354/410 (86.34%), Positives = 381/410 (92.93%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA+Y GKGA SNGS+YICNLP+GTDENMLAEYFGTIG+LKKDKRTGRPKIWLYRDKSTNE
Sbjct: 1   MASYVGKGAPSNGSIYICNLPYGTDENMLAEYFGTIGVLKKDKRTGRPKIWLYRDKSTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVV-GDT 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+IIEVHIAESK+KDD+SFNVGVDPIV  GD 
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIEVHIAESKNKDDISFNVGVDPIVAAGDD 120

Query: 121 LDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSA 180
           +  EE+  G MNGGGGRGRGRGDAP GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSA
Sbjct: 121 IGPEESTAG-MNGGGGRGRGRGDAP-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSK 240
           RPSGAAGSGAGSIGRGRGRGT NQ+SG NSR VGGPTGLFGPNDWPCPMCGNINWAKR+K
Sbjct: 181 RPSGAAGSGAGSIGRGRGRGTSNQDSGGNSRQVGGPTGLFGPNDWPCPMCGNINWAKRTK 240

Query: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300
           CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF
Sbjct: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300

Query: 301 RAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKE 360
           RAKSQQMEAG+I+PGAGRAGWE EELGVVEKDRRERSRD+GR+W+      DRDGSRN+E
Sbjct: 301 RAKSQQMEAGRILPGAGRAGWEVEELGVVEKDRRERSRDRGRDWD------DRDGSRNRE 360

Query: 361 REPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY 410
           RE RERHRSRSRERDRG      R+RD DY++ERD+E+GR+K+HR+RHRY
Sbjct: 361 RESRERHRSRSRERDRG------RERDLDYEYERDREYGRDKDHRNRHRY 396

BLAST of Cp4.1LG04g02120.1 vs. NCBI nr
Match: gi|778685475|ref|XP_011652224.1| (PREDICTED: transcription initiation factor TFIID subunit 15 [Cucumis sativus])

HSP 1 Score: 679.9 bits (1753), Expect = 2.8e-192
Identity = 353/410 (86.10%), Positives = 377/410 (91.95%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA+Y GKGA SNGS+YICNLP+GTDENMLAEYFGTIG+LKKDKRTGRPKIWLYRDKSTNE
Sbjct: 1   MASYVGKGAPSNGSIYICNLPYGTDENMLAEYFGTIGVLKKDKRTGRPKIWLYRDKSTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVV-GDT 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+IIEVHIAESKSKDD+SFNV VDPIV  GD 
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIEVHIAESKSKDDLSFNVVVDPIVAAGDD 120

Query: 121 LDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSA 180
           +  EE    GMNGGGGRGRGRGDAP GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSA
Sbjct: 121 IGSEET-AVGMNGGGGRGRGRGDAP-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSK 240
           RPSGAAGSGAGSIGRGRGRGT NQ+SG NSR VG PTGLFGPNDWPCPMCGNINWAKR+K
Sbjct: 181 RPSGAAGSGAGSIGRGRGRGTSNQDSGGNSRQVGAPTGLFGPNDWPCPMCGNINWAKRTK 240

Query: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300
           CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF
Sbjct: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300

Query: 301 RAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKE 360
           RAKSQQMEAG+I+PGAGRAGWE EELGVVEKDRRERSRD+GR+W+      DRD SRN+E
Sbjct: 301 RAKSQQMEAGRILPGAGRAGWEVEELGVVEKDRRERSRDRGRDWD------DRDSSRNRE 360

Query: 361 REPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY 410
           RE RERHRSRSRERDRG      RDRD DY++ERD+E+GR+K+HR+RHRY
Sbjct: 361 RESRERHRSRSRERDRG------RDRDLDYEYERDREYGRDKDHRNRHRY 396

BLAST of Cp4.1LG04g02120.1 vs. NCBI nr
Match: gi|802761034|ref|XP_012089643.1| (PREDICTED: transcription initiation factor TFIID subunit 15 [Jatropha curcas])

HSP 1 Score: 582.4 bits (1500), Expect = 6.1e-163
Identity = 309/410 (75.37%), Positives = 351/410 (85.61%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MATY GKGA SNGS+Y+CNLP GTDENMLAEYFGTIG+LKKDKR+GRPKIWLYRDK TNE
Sbjct: 1   MATYPGKGAPSNGSIYVCNLPQGTDENMLAEYFGTIGVLKKDKRSGRPKIWLYRDKITNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDTL 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+II V +AESKSKDD ++N GVDP V GD  
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIGVFMAESKSKDDNTYNSGVDPNVAGDFG 120

Query: 121 DFEENDGGGMNGG-GGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNRCGSA 180
             EE+    MNGG GG+GRGRGDA  GKAWQQEGDWLCPNTSC+NVNFAFRGVCNRCGSA
Sbjct: 121 GLEEST---MNGGDGGKGRGRGDAS-GKAWQQEGDWLCPNTSCSNVNFAFRGVCNRCGSA 180

Query: 181 RPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWAKRSK 240
           RPSG +G  AG+ GRGRGRG+  Q +  + R   G TGLFGPNDWPCPMCGNINWAKR+K
Sbjct: 181 RPSGVSGGSAGAGGRGRGRGS--QNTAGHGRSATGSTGLFGPNDWPCPMCGNINWAKRTK 240

Query: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNLKKKF 300
           CNICNTNKPGHNEGGVRGGRGGGYKELDEEE+EET+RRR+EAEEDDGE+YDEFGNLKKKF
Sbjct: 241 CNICNTNKPGHNEGGVRGGRGGGYKELDEEEIEETRRRRKEAEEDDGELYDEFGNLKKKF 300

Query: 301 RAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGSRNKE 360
           R K+QQ E+G+++PGAGRAGWE EELGV ++D RERSRD GRE   RD   DR+ S+N+E
Sbjct: 301 RVKTQQAESGRVLPGAGRAGWEVEELGVADRDARERSRDGGRE---RD---DRESSKNRE 360

Query: 361 REPRERHRSRSRERDRGRDRDRDRDRDRDYDFERDKEHGREKEHRSRHRY 410
              R+RHRSRSRERDRG+DRDR+ D DRD D+ RD++  R+++ R R+RY
Sbjct: 361 YNARDRHRSRSRERDRGKDRDREYDYDRDRDYGRDRDRDRDRD-RDRYRY 397

BLAST of Cp4.1LG04g02120.1 vs. NCBI nr
Match: gi|225425084|ref|XP_002273586.1| (PREDICTED: transcription initiation factor TFIID subunit 15 [Vitis vinifera])

HSP 1 Score: 578.9 bits (1491), Expect = 6.8e-162
Identity = 314/425 (73.88%), Positives = 351/425 (82.59%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA Y+G GA SNGSVY+CNLP GTDE MLAEYFGTIGL+KKDKRTGRPKIWLYRDK TNE
Sbjct: 1   MANYSGTGAPSNGSVYVCNLPHGTDETMLAEYFGTIGLIKKDKRTGRPKIWLYRDKVTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVGVDPIVVGDT- 120
           PKGDATVTYEDPHAALAAVEWF+NKDFHG+II V IAESK+KDD S+N G     VGD  
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNKDFHGSIIGVFIAESKNKDDHSYNSGNQVNDVGDPG 120

Query: 121 ----LDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCNR 180
               L   E D   MNG  GRGRGRGDA  GKAWQQ+GDWLCPNTSC+NVNFAFRGVCNR
Sbjct: 121 AASDLRGLEEDPLDMNGSAGRGRGRGDAA-GKAWQQDGDWLCPNTSCSNVNFAFRGVCNR 180

Query: 181 CGSARPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINWA 240
           CGSARPSG +GSGAG+ GRGRGRG    +S  + R VG PTGLFGPNDWPCPMCGNINWA
Sbjct: 181 CGSARPSGVSGSGAGAGGRGRGRG--GPDSAGHGRSVGAPTGLFGPNDWPCPMCGNINWA 240

Query: 241 KRSKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGNL 300
           KR+KCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGE+YDEFGNL
Sbjct: 241 KRTKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGELYDEFGNL 300

Query: 301 KKKFRAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDGS 360
           KKKFRAK+QQ EAGQ++PGAGRAGWE EELG+ ++D RERS+D+GRE +DR+        
Sbjct: 301 KKKFRAKTQQAEAGQVLPGAGRAGWEVEELGMADRDGRERSKDRGRERDDRE-------- 360

Query: 361 RNKEREPRERHRSRSRERDRGRDRDRDR----DRDRDYDFERDKEHGREKEH-------R 410
           +N+ER+ R+R RSRSRERDRG+DRDRDR    DRDRDY  +RD++  R+++        R
Sbjct: 361 KNRERDDRDRRRSRSRERDRGKDRDRDRDYDHDRDRDYGRDRDRDRDRDRDRDRDRDRDR 414
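
The percentages on an HSP summary line are plain ratios: the identity (or positive) count divided by the alignment length. The following is a minimal Python sketch that parses such a line and recomputes both values. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Matches summary lines such as:
#   "Identity = 314/425 (73.88%), Positives = 351/425 (82.59%), Query Frame = 1"
# The optional "i" also accepts the "Postives" spelling seen in some page exports.
_HSP_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line.

    Hypothetical helper for illustration only.
    """
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, ident_len, positives, pos_len = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percent = count / alignment length * 100, rounded to two decimals.
        "pct_identity": round(100.0 * identities / ident_len, 2),
        "pct_positives": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 314/425 (73.88%), Positives = 351/425 (82.59%), Query Frame = 1"
    )
    print(stats["pct_identity"], stats["pct_positives"])
```

Recomputing the ratios this way gives a quick consistency check between the counts and the percentages printed for each hit.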

BLAST of Cp4.1LG04g02120.1 vs. NCBI nr
Match: gi|720069005|ref|XP_010277300.1| (PREDICTED: transcription initiation factor TFIID subunit 15 [Nelumbo nucifera])

HSP 1 Score: 575.9 bits (1483), Expect = 5.7e-161
Identity = 314/422 (74.41%), Positives = 354/422 (83.89%), Query Frame = 1

Query: 1   MATYAGKGALSNGSVYICNLPFGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKSTNE 60
           MA+Y+GKG   NGSVY+CNLP GTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDK TNE
Sbjct: 1   MASYSGKGGPPNGSVYVCNLPHGTDENMLAEYFGTIGLLKKDKRTGRPKIWLYRDKVTNE 60

Query: 61  PKGDATVTYEDPHAALAAVEWFDNKDFHGNIIEVHIAESKSKDDVSFNVG------VDPI 120
           PKGDATVTYEDPHAALAAVEWF+N+DFHG+ I V IAESKSKDD ++N G       DP 
Sbjct: 61  PKGDATVTYEDPHAALAAVEWFNNRDFHGSTIGVFIAESKSKDDHTYNSGNQMNHGEDPN 120

Query: 121 VVGDTLDFEENDGGGMNGGGGRGRGRGDAPPGKAWQQEGDWLCPNTSCTNVNFAFRGVCN 180
           + GD    +  D   +NGGGGRGRGRGDA  GKAWQQEGDWLCPNTSC+NVNFAFRGVCN
Sbjct: 121 LTGD-FGGQGEDARDVNGGGGRGRGRGDAS-GKAWQQEGDWLCPNTSCSNVNFAFRGVCN 180

Query: 181 RCGSARPSGAAGSGAGSIGRGRGRGTGNQESGSNSRPVGGPTGLFGPNDWPCPMCGNINW 240
           RCGSARP+G++G GAG+ GRG+GRG  N +S    R VGGPTGLFGPNDWPCPMCGNINW
Sbjct: 181 RCGSARPAGSSGGGAGAGGRGKGRG--NTDSSGRGRTVGGPTGLFGPNDWPCPMCGNINW 240

Query: 241 AKRSKCNICNTNKPGHNEGGVRGGRGGGYKELDEEELEETKRRRREAEEDDGEMYDEFGN 300
           AKR+KCNICNTNKPGHNEGGVRGGR GGYKELDEEE+EETKRRRREAEEDDGE+YDEFGN
Sbjct: 241 AKRTKCNICNTNKPGHNEGGVRGGRAGGYKELDEEEIEETKRRRREAEEDDGELYDEFGN 300

Query: 301 LKKKFRAKSQQMEAGQIIPGAGRAGWEFEELGVVEKDRRERSRDQGREWNDRDRDRDRDG 360
           LKKKFRAK+QQ EAGQ++PGAGRAGWE EELGV E+D R+RSRD+GR   DRD   +RDG
Sbjct: 301 LKKKFRAKTQQAEAGQVLPGAGRAGWEVEELGVTERDGRDRSRDRGR---DRD---ERDG 360

Query: 361 SRNKERE----PRERHRSRSRERDRGRDR----DRDRDRDRDYDFERDKEHGREKEHRSR 409
           S+++ER+     RER RSRSRER+R R++    DR+RDRDRDYD +R KE GR+++ R R
Sbjct: 361 SKSRERDGYIHERERRRSRSRERERDREKDRGKDRNRDRDRDYDHDRGKELGRDRD-RDR 411

The following BLAST results are available for this feature:
Match Name     E-value     Identity (%)   Description
TAF15_ARATH    7.1e-135    67.16          Transcription initiation factor TFIID subunit 15 OS=Arabidopsis thaliana GN=TAF1...
RBP56_HUMAN    9.5e-31     37.21          TATA-binding protein-associated factor 2N OS=Homo sapiens GN=TAF15 PE=1 SV=1
FUS_MOUSE      2.0e-28     36.00          RNA-binding protein FUS OS=Mus musculus GN=Fus PE=1 SV=1
FUS_BOVIN      2.6e-28     35.84          RNA-binding protein FUS OS=Bos taurus GN=FUS PE=2 SV=2
FUS_HUMAN      2.6e-28     35.84          RNA-binding protein FUS OS=Homo sapiens GN=FUS PE=1 SV=1

Match Name          E-value     Identity (%)   Description
A0A0A0LCQ7_CUCSA    2.0e-192    86.10          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G824830 PE=4 SV=1
A0A067JGB5_JATCU    4.3e-163    75.37          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01663 PE=4 SV=1
D7TAD5_VITVI        4.7e-162    73.88          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g01410 PE=4 SV=...
B9S3N2_RICCO        1.9e-158    74.38          RNA binding protein, putative OS=Ricinus communis GN=RCOM_1385570 PE=4 SV=1
M5XDK0_PRUPE        5.4e-158    75.00          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006646mg PE=4 SV=1

Match Name     E-value     Identity (%)   Description
AT1G50300.1    4.0e-136    67.16          TBP-associated factor 15
AT5G58470.1    6.4e-09     34.31          TBP-associated factor 15B
AT5G16260.1    8.6e-06     38.81          RNA binding (RRM/RBD/RNP motifs) family protein

Match Name                        E-value     Identity (%)   Description
gi|659085353|ref|XP_008443374.1|  2.3e-194    86.34          PREDICTED: transcription initiation factor TFIID subunit 15 [Cucumis melo]
gi|778685475|ref|XP_011652224.1|  2.8e-192    86.10          PREDICTED: transcription initiation factor TFIID subunit 15 [Cucumis sativus]
gi|802761034|ref|XP_012089643.1|  6.1e-163    75.37          PREDICTED: transcription initiation factor TFIID subunit 15 [Jatropha curcas]
gi|225425084|ref|XP_002273586.1|  6.8e-162    73.88          PREDICTED: transcription initiation factor TFIID subunit 15 [Vitis vinifera]
gi|720069005|ref|XP_010277300.1|  5.7e-161    74.41          PREDICTED: transcription initiation factor TFIID subunit 15 [Nelumbo nucifera]

The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term          Definition
GO:0000166    nucleotide binding
GO:0008270    zinc ion binding
GO:0003676    nucleic acid binding
Vocabulary: INTERPRO
Term          Definition
IPR012677     Nucleotide-bd_a/b_plait_sf
IPR001876     Znf_RanBP2
IPR000504     RRM_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0008270 zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name       Unique Name        Type
Cp4.1LG04g02120    Cp4.1LG04g02120    gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name         Unique Name                  Type
Cp4.1LG04g02120.1    Cp4.1LG04g02120.1-protein    polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                            Unique Name                             Type
Cp4.1LG04g02120.1:five_prime_utr:001    Cp4.1LG04g02120.1:five_prime_utr:001    five_prime_UTR
Cp4.1LG04g02120.1:five_prime_utr:002    Cp4.1LG04g02120.1:five_prime_utr:002    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
Cp4.1LG04g02120.1:cds:001    Cp4.1LG04g02120.1:cds:001    CDS
Cp4.1LG04g02120.1:cds:002    Cp4.1LG04g02120.1:cds:002    CDS
Cp4.1LG04g02120.1:cds:003    Cp4.1LG04g02120.1:cds:003    CDS
Cp4.1LG04g02120.1:cds:004    Cp4.1LG04g02120.1:cds:004    CDS
Cp4.1LG04g02120.1:cds:005    Cp4.1LG04g02120.1:cds:005    CDS
Cp4.1LG04g02120.1:cds:006    Cp4.1LG04g02120.1:cds:006    CDS
Cp4.1LG04g02120.1:cds:007    Cp4.1LG04g02120.1:cds:007    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                             Unique Name                              Type
Cp4.1LG04g02120.1:three_prime_utr:001    Cp4.1LG04g02120.1:three_prime_utr:001    three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                              Source    Source Term           Source Description                     Alignment
IPR000504    RNA recognition motif domain                 PFAM      PF00076               RRM_1                                  coord: 15..92, score: 1.
IPR000504    RNA recognition motif domain                 SMART     SM00360               rrm1_1                                 coord: 14..95, score: 5.4
IPR000504    RNA recognition motif domain                 PROFILE   PS50102               RRM                                    coord: 13..99, score: 11
IPR001876    Zinc finger, RanBP2-type                     GENE3D    G3DSA:4.10.1060.10    -                                      coord: 214..249, score: 3.4E-12; coord: 145..184, score: 3.0
IPR001876    Zinc finger, RanBP2-type                     PFAM      PF00641               zf-RanBP                               coord: 152..182, score: 3.1E-7; coord: 223..249, score: 3.
IPR001876    Zinc finger, RanBP2-type                     SMART     SM00547               zf_4                                   coord: 222..246, score: 3.0E-6; coord: 153..179, score: 1.
IPR001876    Zinc finger, RanBP2-type                     PROSITE   PS01358               ZF_RANBP2_1                            coord: 224..243, score: -; coord: 155..176, scor
IPR001876    Zinc finger, RanBP2-type                     PROFILE   PS50199               ZF_RANBP2_2                            coord: 151..182, score: 9.992; coord: 220..249, score: 9
IPR001876    Zinc finger, RanBP2-type                     unknown   SSF90209              Ran binding protein zinc finger-like   coord: 144..183, score: 5.49E-13; coord: 213..250, score: 3.85
IPR012677    Nucleotide-binding alpha-beta plait domain   GENE3D    G3DSA:3.30.70.330     -                                      coord: 11..135, score: 1.4
IPR012677    Nucleotide-binding alpha-beta plait domain   unknown   SSF54928              RNA-binding domain, RBD                coord: 12..102, score: 3.09
None         No IPR available                             unknown   Coil                  Coil                                   coord: 264..284, scor
None         No IPR available                             PANTHER   PTHR12999             FAMILY NOT NAMED                       coord: 14..408, score: 5.1E
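
Each `coord: start..end` entry gives a residue range on the query protein. Assuming the ranges are 1-based and inclusive (the usual InterProScan convention), a domain spans `end - start + 1` residues. The following minimal Python sketch extracts the ranges from an Alignment cell, including cells where a score and the next `coord:` are fused, and checks whether two matches overlap. The helper names `parse_coords`, `span_length`, and `overlaps` are hypothetical and introduced only for illustration.

```python
import re

# Matches "coord: 15..92" anywhere in a cell, even when it directly
# follows a score value with no separator (e.g. "3.1E-7coord: 223..249").
_COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(cell):
    """Return all (start, end) residue ranges found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in _COORD_RE.findall(cell)]


def span_length(start, end):
    """Length of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    rrm = parse_coords("coord: 15..92")[0]
    zinc_fingers = parse_coords("coord: 152..182 score: 3.1E-7coord: 223..249")
    print("RRM length:", span_length(*rrm))
    for zf in zinc_fingers:
        print("zinc finger", zf, "length", span_length(*zf),
              "overlaps RRM:", overlaps(rrm, zf))
```

On the Pfam rows above, this reports the RRM match and the two RanBP2-type zinc finger matches as separate, non-overlapping regions.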