Csa2G074100.1 (mRNA) Cucumber (Chinese Long) v2

NameCsa2G074100.1
TypemRNA
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionTATA box-binding protein-associated factor RNA polymerase I subunit B
LocationChr2 : 5777928 .. 5782287 (-)
Sequence length2208
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGTCAAGGCCTTTGTTTGGTAAGGATTGATGTTTTATGGATTTTTGTTGAGTTTTACCCAATACCCATGAAAAATTAGGCTAAACCCCCTTTATATCCCATGTTTTTAAGTGTTGGATTGAGGTCTTGAGTGAGAATTAGCTTACTGTCTAGCACGTACGGGTTCGTTTACGGCGAGTTGTTTCGTGAGTTTTAAGGCCATTGTGTAGACCCTTAGATACCCAGTAATAATTGGTTTAAGTATTGACTTGATCTTGGCCTTTGAAAGGCTTAGAGCTATTATCCATTGGGATTTAATAGGGTATTTCGGTAAATTAGTGTTTGGTTCTCGTTGATCTATAAGCTTTGGAGTTGGTTTGGGTCAAACCACTTATGGGCGAGAATTTTGATGTTTGGTTTTAATCCTTTTGGTTATTTCGGATATTCGGTTTCTTTGGGTTTAAACTTGGATTTATATTTAGGTCCTAATTAAGACTTCTGCTGGATTTATGTCTTATGATTTTATTCTTTAAATGTTTGGTGTGCTAGCGTGCCTTCGGATTTTATTCATCTATGTCCCATTATTTTATTCTTTTAGACCTTTATTCTATGAAAATGTTTTATTTATTTTAATTGATCTTTTAAGAATTTTAGATTTTCAAGCTTATGAATGCATGCAGTAGAAAGTTGAGTTTTTACAGATAACACCTCACTTTCAACCTTTCTAAACTTGTTTGAAGTGCTGTAAATTGGTTTATAACTCTTGGAGATTATATTGTAATTTGGAACTGAGTTGGAGCGTTTGGTTTAAGGCTTAAGTTGTTTTTGGTTGGTTTATTCATTATGGGCACTCTTTGGGATTCTGATCAGGGTTTCGAACTGAGAGTTGTTTAAATATTTCATTCATATAATAATTTGATTGATAATCATTTTCTGAATTTTGCGTTTCCATTTTACCTTCAAGATCTCAAATGGCAGACCCCAGCAACTTGTCTTGCTACAACTGTGGCAGCATTGGGCTTACAGATGGTTTTGATGGCTTCTTCTACTGCCTTCAATGTGGTTCCCAGGCTGATGACATCATCGACACTGGTGTTGCTGAGGAGGATTTGGTTTTAAGAGATGTTGGTAAATCTGGTGCCCCAATCTACTCACAATCTCACACTCGCCGCCGTAATCCAACCGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAATCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGTCTTAATTTAATGGAAGACCCATCTGGCAATGTAGGTGGAAAGGATGGTGACATTGTGATGTTAAACGATGGTGTCGGGCCTACTGGTCCAGAGGATTTTGGGTCTGGTGATGTGCTATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTGATTATGGAGCTTCAGTGCGAAGTTTTGGTGAAGGAATTTAAGGCAACCCCTATAATTTGTGGGTTGGCTGCGAGCATTTGGTTGAGGTTTGTGACTGCTACACGGGTTTTTGATGAGGATTGGGCCTTTCAGACCGTCCAGGAATCTGAAAGCCAATGCCTAGGTTTGTTAACTCGTTACTTCCACTTAAAGTTCTTGTTTATATTTGATTGTCCATTATAATCATTTTGATGCTTGAAAGTTTTAACTGAAATCTTTAGCTTCAGATTTAAAAGTGGTACTTGTGGGTAGGAGACTTTTAATGTATACCTTTTGTTTGATTATTTTAAACATGGTGAATTCTGATGGTTGTCTTGAAAACTAAACTTTTTTTTTAACTAAAGAAACAAATCTCTCAGATGTATGAAATTTAGAGAAAGAGGCTGCAGCCCAATTCAACAGAGTTACAACTCCAATTGGTCAAAAGAGAGGATAGTCTGTACAAACTATAGAAATGAAAGATATACGTTTGCACCAAGTCATAGCATAAGTTCAGAAAAGTTATCAAAAGTGCTTTCCTTGTCGTTGAAAAGGCAGTTGTTTCTCTCCTTCCATGTAATCCATAAAAAAGGATGAATCAAATTCATCCATATTCTCTTCTCAATGTTCTCATTAGTGTAGGGGTGGCCAATGATATTGCATTGGTCATTTTAGCCTATAATATCGTTTGCATATTCAAATATCATAGTTTTTAGTCTATATGGTCATATGTGATGCCTTTCTCCCAAGATTCTTTGTAACAGTGCTTCATTTTGTCTGACATCCTACAGTTTTGCAAGTACCTTTTTTCACCATCTTAGTGTTGGTTGAGTTTTTTTAATTTATTTTTATGTTTAACTATTTTCACCATCTCAGTATTTTGAGTTCTTGCTGTATTTCTTATTCATTTTAGTTGCAAACTGAATGTTTCTTCACTCAACAGATCCTGAACGCATAAGACGTGTTTGCTCTAGCCATAAAGACGAACCCCACAACTTTTATGGTCAACGAGTAGTAGTTTTATGGGTTAAATCTTTAAGAAAGAAGATACCATTATTTAGTACATTAGCTGTTTCGTTTTTGGCTTGCCATGTTGCTCGGGAAGCAATTTTGCCAACAGACATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTGTTGACATTGAAAGTCGCATTGGGAAAACTTCAAGGGCTTGTCCTATAAGTTCAAAGCTTATGCATAGGCCCTCTCGAATTTCGTCGCTGCAGAAATTAGAATCGCTGGCAGCATCGATAGCTCATACCATAGGGTTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCTTGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTACCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTGTGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGTTTTGGAGAATGGGAAAAGAGTTTGTCTGTTGATTGTGCTTCATGTTTCCCACCACATCAGAAGACGCATTCAAGTCCTGCTAATAATTTTAGTAACATGCAGGCTGATAGTGAAAACAGACCCGGATTTACTTCACACGATGTGGATAATCCATCTGTTTCTCCAGAGAACCCCCATCTCACCACTACAGAGTTTCTTCGCAAAATTGAAGCTCGCTATCATGAGATTGCTGAAACTTATGGTATGATTTCATTATTATTGTATTTTTAAAATTAGACCAACCTTAGTAATACATTTGCTTTTACTTATTACTCTTACCATTACTGTTCGCTTTTGTTTCTTATTAAATTACCATTTGATAGTTCTCAAAATTGTGTTGGCAGAATACTCTAAAGACTTGCCGACCTATCTTCAATACTGTAAGGATGTTGCCTTTGCTGGTTCGGAGTCTTTGTTTATTGATGATCACGATGAACAAAAGATGATCGAAAAATTATGGAATTATTATCAGAATGAAAAGGTAAATGATTAACGACTATCATGCTTAAGTCTTGTGTACCTAGATATTGGGAAAAGATTTTATCTGGCGAAATTAATCTGTGCATAGATGTTGAAAAAATGACTGGGATATAATAGTGTTTCTACTTTGCAATATTACTATCTGTTGTTGGTATGATTCTTACCATATCAATGTTTTTTAGTTGGAAAGCAACAATCATAATGCAAGCATAATTCCTAACTAGAAAAAGAAAAATTGTTAGCAGGATTACGATCAAACAGAAGATGTGGATCAAAATGCTGCTTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCAACGAAAGTAAGAAAGTCAAAGGCGAAGAAGACCGTATTAGCAGAGAATCATTGAATAACAGAACTGGGTCAATTGACTCTCGGCAAAGTCATTCATCAAAAAGTTTAGACAATAGTGATGATGATGAACAATCCTCCGTAGACAAAGCGGCTTCATCCCTAACTTCCATAAATGAAGCGATCAGACAACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGATTAACCCAAAGAGATTCGATTACCTTCACTACTCAAGGAAGATCGACGAGGGTGCACTGACATATGCTGCTCATGCTGATTACTACATATTGCTACGAGCATGTGCTAGAGCCGCACAAGTTGATATTAGGATAATGCATATTGGAGTGTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGATAGAATCCATAAATCCCTTCGTCTAACACCCACAAGTATTACTTGTGAGTTTTGTAGCGATGTGCCTGATCATGTTGGCTCCGTTGGACTCTCAGATTTGGATATTTAAGTGATTTATTTAGTCTAT

mRNA sequence

ATGGCAGACCCCAGCAACTTGTCTTGCTACAACTGTGGCAGCATTGGGCTTACAGATGGTTTTGATGGCTTCTTCTACTGCCTTCAATGTGGTTCCCAGGCTGATGACATCATCGACACTGGTGTTGCTGAGGAGGATTTGGTTTTAAGAGATGTTGGTAAATCTGGTGCCCCAATCTACTCACAATCTCACACTCGCCGCCGTAATCCAACCGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAATCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGTCTTAATTTAATGGAAGACCCATCTGGCAATGTAGGTGGAAAGGATGGTGACATTGTGATGTTAAACGATGGTGTCGGGCCTACTGGTCCAGAGGATTTTGGGTCTGGTGATGTGCTATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTGATTATGGAGCTTCAGTGCGAAGTTTTGGTGAAGGAATTTAAGGCAACCCCTATAATTTGTGGGTTGGCTGCGAGCATTTGGTTGAGGTTTGTGACTGCTACACGGGTTTTTGATGAGGATTGGGCCTTTCAGACCGTCCAGGAATCTGAAAGCCAATGCCTAGATCCTGAACGCATAAGACGTGTTTGCTCTAGCCATAAAGACGAACCCCACAACTTTTATGGTCAACGAGTAGTAGTTTTATGGGTTAAATCTTTAAGAAAGAAGATACCATTATTTAGTACATTAGCTGTTTCGTTTTTGGCTTGCCATGTTGCTCGGGAAGCAATTTTGCCAACAGACATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTGTTGACATTGAAAGTCGCATTGGGAAAACTTCAAGGGCTTGTCCTATAAGTTCAAAGCTTATGCATAGGCCCTCTCGAATTTCGTCGCTGCAGAAATTAGAATCGCTGGCAGCATCGATAGCTCATACCATAGGGTTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCTTGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTACCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTGTGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGTTTTGGAGAATGGGAAAAGAGTTTGTCTGTTGATTGTGCTTCATGTTTCCCACCACATCAGAAGACGCATTCAAGTCCTGCTAATAATTTTAGTAACATGCAGGCTGATAGTGAAAACAGACCCGGATTTACTTCACACGATGTGGATAATCCATCTGTTTCTCCAGAGAACCCCCATCTCACCACTACAGAGTTTCTTCGCAAAATTGAAGCTCGCTATCATGAGATTGCTGAAACTTATGAATACTCTAAAGACTTGCCGACCTATCTTCAATACTGTAAGGATGTTGCCTTTGCTGGTTCGGAGTCTTTGTTTATTGATGATCACGATGAACAAAAGATGATCGAAAAATTATGGAATTATTATCAGAATGAAAAGGATTACGATCAAACAGAAGATGTGGATCAAAATGCTGCTTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCAACGAAAGTAAGAAAGTCAAAGGCGAAGAAGACCGTATTAGCAGAGAATCATTGAATAACAGAACTGGGTCAATTGACTCTCGGCAAAGTCATTCATCAAAAAGTTTAGACAATAGTGATGATGATGAACAATCCTCCGTAGACAAAGCGGCTTCATCCCTAACTTCCATAAATGAAGCGATCAGACAACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGATTAACCCAAAGAGATTCGATTACCTTCACTACTCAAGGAAGATCGACGAGGGTGCACTGACATATGCTGCTCATGCTGATTACTACATATTGCTACGAGCATGTGCTAGAGCCGCACAAGTTGATATTAGGATAATGCATATTGGAGTGTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGATAGAATCCATAAATCCCTTCGTCTAACACCCACAAGTATTACTTGTGAGTTTTGTAGCGATGTGCCTGATCATGTTGGCTCCGTTGGACTCTCAGATTTGGATATTTAA

Coding sequence (CDS)

ATGGCAGACCCCAGCAACTTGTCTTGCTACAACTGTGGCAGCATTGGGCTTACAGATGGTTTTGATGGCTTCTTCTACTGCCTTCAATGTGGTTCCCAGGCTGATGACATCATCGACACTGGTGTTGCTGAGGAGGATTTGGTTTTAAGAGATGTTGGTAAATCTGGTGCCCCAATCTACTCACAATCTCACACTCGCCGCCGTAATCCAACCGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAATCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGTCTTAATTTAATGGAAGACCCATCTGGCAATGTAGGTGGAAAGGATGGTGACATTGTGATGTTAAACGATGGTGTCGGGCCTACTGGTCCAGAGGATTTTGGGTCTGGTGATGTGCTATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTGATTATGGAGCTTCAGTGCGAAGTTTTGGTGAAGGAATTTAAGGCAACCCCTATAATTTGTGGGTTGGCTGCGAGCATTTGGTTGAGGTTTGTGACTGCTACACGGGTTTTTGATGAGGATTGGGCCTTTCAGACCGTCCAGGAATCTGAAAGCCAATGCCTAGATCCTGAACGCATAAGACGTGTTTGCTCTAGCCATAAAGACGAACCCCACAACTTTTATGGTCAACGAGTAGTAGTTTTATGGGTTAAATCTTTAAGAAAGAAGATACCATTATTTAGTACATTAGCTGTTTCGTTTTTGGCTTGCCATGTTGCTCGGGAAGCAATTTTGCCAACAGACATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTGTTGACATTGAAAGTCGCATTGGGAAAACTTCAAGGGCTTGTCCTATAAGTTCAAAGCTTATGCATAGGCCCTCTCGAATTTCGTCGCTGCAGAAATTAGAATCGCTGGCAGCATCGATAGCTCATACCATAGGGTTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCTTGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTACCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTGTGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGTTTTGGAGAATGGGAAAAGAGTTTGTCTGTTGATTGTGCTTCATGTTTCCCACCACATCAGAAGACGCATTCAAGTCCTGCTAATAATTTTAGTAACATGCAGGCTGATAGTGAAAACAGACCCGGATTTACTTCACACGATGTGGATAATCCATCTGTTTCTCCAGAGAACCCCCATCTCACCACTACAGAGTTTCTTCGCAAAATTGAAGCTCGCTATCATGAGATTGCTGAAACTTATGAATACTCTAAAGACTTGCCGACCTATCTTCAATACTGTAAGGATGTTGCCTTTGCTGGTTCGGAGTCTTTGTTTATTGATGATCACGATGAACAAAAGATGATCGAAAAATTATGGAATTATTATCAGAATGAAAAGGATTACGATCAAACAGAAGATGTGGATCAAAATGCTGCTTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCAACGAAAGTAAGAAAGTCAAAGGCGAAGAAGACCGTATTAGCAGAGAATCATTGAATAACAGAACTGGGTCAATTGACTCTCGGCAAAGTCATTCATCAAAAAGTTTAGACAATAGTGATGATGATGAACAATCCTCCGTAGACAAAGCGGCTTCATCCCTAACTTCCATAAATGAAGCGATCAGACAACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGATTAACCCAAAGAGATTCGATTACCTTCACTACTCAAGGAAGATCGACGAGGGTGCACTGACATATGCTGCTCATGCTGATTACTACATATTGCTACGAGCATGTGCTAGAGCCGCACAAGTTGATATTAGGATAATGCATATTGGAGTGTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGATAGAATCCATAAATCCCTTCGTCTAACACCCACAAGTATTACTTGTGAGTTTTGTAGCGATGTGCCTGATCATGTTGGCTCCGTTGGACTCTCAGATTTGGATATTTAA

Protein sequence

MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVDQNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSDVPDHVGSVGLSDLDI*
BLAST of Csa2G074100.1 vs. Swiss-Prot
Match: MEE12_ARATH (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Arabidopsis thaliana GN=MEE12 PE=1 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 2.2e-134
Identity = 292/723 (40.39%), Postives = 404/723 (55.88%), Query Frame = 1

Query: 9   CYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQSHTRRR 68
           C  C +    +  DG++YC +CG Q +++I TGV + DL+    G  GA +Y+  H RR 
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGA-LYNPKH-RRT 62

Query: 69  NPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV---GPTGP 128
            P     +P++ SQ  F    S +    +  E  +GN      ++    D      PT P
Sbjct: 63  EP-----QPITPSQPRFTDDTSRYSQFKSQFESENGNKE-LPREVKRAPDSYVDKEPTEP 122

Query: 129 EDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLAASIWL 188
            DF +  +     S+E Y DE R RYV    +++  QC+ LV +F  TP+I GL   I L
Sbjct: 123 VDFAAETL-----SYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLVGPISL 182

Query: 189 RFVTATRVFDEDWAFQTVQESESQCLDPE-RIRRVCSSHKDEPHNFYGQRVVVLWVKSLR 248
           R+V  + V+ +DWA   +++SE Q  D E +  +    HK EP N  G+R V +W   L+
Sbjct: 183 RYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIWFGILK 242

Query: 249 KKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRACPIS 308
           K +PL S+L +SFLACH A   +LPTDI++W+ EGKLPY + F+DI  ++G+ S ACP+ 
Sbjct: 243 KTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSAACPVK 302

Query: 309 SKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILPHACR 368
             +M RP ++ S Q LE+ A+ IA TIGL LPPVNF+ IA  Y+ +L++P DKIL  A  
Sbjct: 303 VSIMARPFQVISAQMLEARASVIADTIGLPLPPVNFYGIASNYIKQLSIPEDKILDLARL 362

Query: 369 IYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASCFPPH 428
           I  WS+PP+L+LSTNE +LPSRVCVMSILI+A+R+LYN++G G WE+SL    AS     
Sbjct: 363 IQNWSLPPELYLSTNEQKLPSRVCVMSILIVAIRMLYNINGLGVWERSLGFVNAS----- 422

Query: 429 QKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHEI-AE 488
                           DSE           N   + +     T E L+ +EA+YHE+ AE
Sbjct: 423 --------------DGDSET----------NSGTAEKATEFDTQELLKNLEAKYHEVAAE 482

Query: 489 TYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYD----QTEDV 548
           T E  KDL +YL   K+  FAG E    D  D  ++++ LWN Y  ++D +    +  D 
Sbjct: 483 TLESEKDLVSYLSLGKNEFFAGLEEDSPD--DTYRIVDNLWNGYPKDEDIECLPKRGRDW 542

Query: 549 DQNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHS-SKSLDNS 608
           D + + NQ  L +                    SR S  N   S  SR++ S S  LD S
Sbjct: 543 DDDVSLNQLSLYD--------------------SRFSDGNNPCSSSSRRNESVSIGLDLS 602

Query: 609 DDDEQSSVDKAASSLTSINE-AIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGA 668
                SS  + +SS   + E AI++L  DM +  FCYIPPR+  KR DYL Y RK ++GA
Sbjct: 603 -----SSEHRESSSPEKLKEIAIKRLITDMGDDLFCYIPPRVKVKRLDYLQYVRKKEDGA 656

Query: 669 LTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEF 721
           L YAAHADYYILLR CA+ A++D+R MH GVLS E+RL+W+E RI + L LT   +TC+ 
Sbjct: 663 LIYAAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIEKRIDQVLHLTRPLMTCKH 656

BLAST of Csa2G074100.1 vs. Swiss-Prot
Match: TAF1B_ORYSJ (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza sativa subsp. japonica GN=Os05g0352700 PE=3 SV=2)

HSP 1 Score: 352.8 bits (904), Expect = 8.9e-96
Identity = 237/714 (33.19%), Postives = 341/714 (47.76%), Query Frame = 1

Query: 6   NLSCYNCG---SIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQ 65
           +L C  CG        D  DGFF C QC +      +T     D  +        P +  
Sbjct: 18  HLVCEYCGHGSEYAEDDADDGFFTCRQCSAIHTSTQNTATNPFDFPM-------TPAHLS 77

Query: 66  SHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGVG- 125
           +H R   PT       +   +  G +  +F                         D +G 
Sbjct: 78  AHRRPTQPTPTPKPFPAPRGAATGAAAPDF-------------------------DDLGE 137

Query: 126 PTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLAA 185
           P+ P DF +G    G P  E+ A  VR RYV GLQ+I++ Q E LV+  +   +   LA 
Sbjct: 138 PSEPRDFATGANAWGNP--EDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAASLAG 197

Query: 186 SIWLRFVTATRVFDEDWAFQ------TVQESESQCLDPERIRRVCSSHKDEPHNFYGQRV 245
           +IWLR+V A++VFDE W  +      +V+E  S   D +      +      + F     
Sbjct: 198 TIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYEF----- 257

Query: 246 VVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIG 305
             L+++SLR  +P++STLAV FLACHVARE ILPTDI +W++EGKLPY AAF  ++  +G
Sbjct: 258 --LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLG 317

Query: 306 KTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPV 365
            +   CP+SS+ + RP+R+    +LE+ A SIA  IGL LP VNF+ IA R+L +L+LP+
Sbjct: 318 SSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPI 377

Query: 366 DKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSV 425
           +KILPHACRIYEW+MP +LWLS+N  R+PSRVCVM+ILI+A+R+LY ++G G WE     
Sbjct: 378 EKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWESI--- 437

Query: 426 DCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPH---LTTTEFLR 485
                                   A +EN  G         S+ P++ +       E L 
Sbjct: 438 ------------------------AQTENAVGSDPEASAPHSIEPDSNNSEEFDARELLC 497

Query: 486 KIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKD 545
            + A Y +I   ++YSK++                                 +Y +  KD
Sbjct: 498 TLAASYDKINVGHDYSKEVH--------------------------------SYLKYCKD 557

Query: 546 YDQTEDVDQNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSS 605
              T       +  ++ L +   D    +   +  E  ++ +E L    G          
Sbjct: 558 VVFT---GMTFSLEEEHLIDIFWDMYKGKEVMLLDENAKLCQEKLRTTNGV--------- 615

Query: 606 KSLDNSDDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRK 665
               N    +    D    S    N A++ +K  MEE  FCY+ PR       YL Y+R+
Sbjct: 618 ----NKRCRDGRFADTKCCSTPLGNCALQSIKSKMEENGFCYVSPRKRLVSDGYLLYTRR 615

Query: 666 IDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSL 707
              G+L Y AHADYYILLR  A+ A+VD+R++H  VL LE+RL W+E+R+ +SL
Sbjct: 678 ESSGSLIYVAHADYYILLRPFAKLAEVDVRVLHSSVLKLERRLGWIEERVGRSL 615

BLAST of Csa2G074100.1 vs. Swiss-Prot
Match: TAF1B_ORYSI (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza sativa subsp. indica GN=OsI_19584 PE=3 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 4.1e-85
Identity = 202/564 (35.82%), Postives = 293/564 (51.95%), Query Frame = 1

Query: 6   NLSCYNCG---SIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQ 65
           +L C  CG        D  +GFF C QC +      +T     D  +        P +  
Sbjct: 18  HLVCEYCGHGSEYAEDDADNGFFTCRQCSAIHTSTQNTATNPFDFPM-------TPAHLS 77

Query: 66  SHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGVG- 125
           +H R   PT       +   +  G +  +F                         D +G 
Sbjct: 78  AHRRPTQPTPTPKPFPAPRGAATGAAAPDF-------------------------DDLGE 137

Query: 126 PTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLAA 185
           P+ P DF +G    G P  E+ A  VR RYV GLQ+I++ Q E LV+  +   +   LA 
Sbjct: 138 PSEPRDFATGANAWGNP--EDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAASLAG 197

Query: 186 SIWLRFVTATRVFDEDWAFQ------TVQESESQCLDPERIRRVCSSHKDEPHNFYGQRV 245
           +IWLR+V A++VFDE W  +      +V+E  S   D +      +      + F     
Sbjct: 198 TIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYEF----- 257

Query: 246 VVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIG 305
             L+++SLR  +P++STLAV FLACHVARE ILPTDI +W++EGKLPY AAF  ++  +G
Sbjct: 258 --LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLG 317

Query: 306 KTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPV 365
            +   CP+SS+ + RP+R+    +LE+ A SIA  IGL LP VNF+ IA R+L +L+LP+
Sbjct: 318 SSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPI 377

Query: 366 DKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSV 425
           +KILPHACRIYEW+MP +LWLS+N  R+PSRVCVM+ILI+A+R+LY ++G G WE     
Sbjct: 378 EKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWESIAQT 437

Query: 426 DCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIE 485
           + A    P     S+P     +++ DS N   F +                  E L  + 
Sbjct: 438 ENAVGSDPEA---SAP----HSIEPDSNNSEEFDAR-----------------ELLCTLA 497

Query: 486 ARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQ 545
           A Y +I   ++YSK++ +YL+YCKDV F G      ++H    +I+  W+ Y+ +    +
Sbjct: 498 ASYDKIDVGHDYSKEVHSYLKYCKDVVFTGMTFSLEEEH----LIDIFWDMYKGK----E 508

Query: 546 TEDVDQNAASNQKRLR--EGSNDR 558
              +D+NA   Q++LR   G N R
Sbjct: 558 VMLLDENAKLCQEKLRTTNGVNKR 508


HSP 2 Score: 97.1 bits (240), Expect = 8.8e-19
Identity = 45/90 (50.00%), Postives = 60/90 (66.67%), Query Frame = 1

Query: 617 NEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARA 676
           N A++ +K  MEE  FCY+ PR       YL Y+R+   G+L Y AHADYYILLR  A+ 
Sbjct: 526 NCALQSIKSKMEENGFCYVSPRKRLVSDGYLLYTRRESSGSLIYVAHADYYILLRPFAKL 585

Query: 677 AQVDIRIMHIGVLSLEKRLSWLEDRIHKSL 707
           A+VD+R++H  VL LE+RL W+E+R+ +SL
Sbjct: 586 AEVDVRVLHSSVLKLERRLGWIEERVGRSL 615

BLAST of Csa2G074100.1 vs. TrEMBL
Match: A0A0A0LHI6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074100 PE=4 SV=1)

HSP 1 Score: 1494.6 bits (3868), Expect = 0.0e+00
Identity = 735/735 (100.00%), Postives = 735/735 (100.00%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
           ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA
Sbjct: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480
           FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE
Sbjct: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540
           IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD
Sbjct: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600
           QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD
Sbjct: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600

Query: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660
           DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY
Sbjct: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660

Query: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720
           AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD
Sbjct: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720

Query: 721 VPDHVGSVGLSDLDI 736
           VPDHVGSVGLSDLDI
Sbjct: 721 VPDHVGSVGLSDLDI 735

BLAST of Csa2G074100.1 vs. TrEMBL
Match: W9QI88_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001445 PE=4 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 2.0e-211
Identity = 398/768 (51.82%), Postives = 520/768 (67.71%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M+DP   +C+ CG+ G  DGFDGF+YCL+CGSQA+DII+TGVA+ED   +  G +GAP+Y
Sbjct: 1   MSDPHAWTCHTCGNAGFADGFDGFYYCLRCGSQAEDIIETGVADEDFADKG-GTAGAPLY 60

Query: 61  SQSHTRRRN--PTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLN- 120
           S +H R R    + +K EP+SQ QS   T QS+FW +L L +D  G  G +     +   
Sbjct: 61  SATHRRNRPVAASAIKAEPISQVQS--ATLQSQFWAALTLDDDGEGEGGDRFNRASIKTE 120

Query: 121 ----DGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKAT 180
               DGVGPTGP DFGS  V    PSFEEY  ++R+RYVMGLQL++E QCE LV+EFK  
Sbjct: 121 EIEFDGVGPTGPRDFGS--VGESVPSFEEYYSDIRIRYVMGLQLMIEFQCEALVREFKVN 180

Query: 181 PIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCL-DPERIRRVCSSHKDEPHNFYG 240
           P+ICGLA ++WLRFV  TRVFD+ W  +++ ESESQ   +P +  +  + +  EPHN YG
Sbjct: 181 PLICGLAGTVWLRFVAGTRVFDDGWVDESINESESQQQGEPPQDFKPRAKYGSEPHNMYG 240

Query: 241 QRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIES 300
           QR V++W +SLRKKIPL  TL VSFLACH+ARE +L TDI+KWSLEGK+PY+AAFV+IE 
Sbjct: 241 QRAVMIWFRSLRKKIPLSYTLGVSFLACHLAREPVLTTDIVKWSLEGKVPYFAAFVEIEK 300

Query: 301 RIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLA 360
            +G+ S  CPIS+ LM RP+   + QKLESL+ASIA ++GL LPPVNF+SIA RYL +L+
Sbjct: 301 HMGQRSSGCPISTTLMFRPNESVTAQKLESLSASIADSVGLALPPVNFYSIASRYLRELS 360

Query: 361 LPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKS 420
           +P++KILPHA R+YEWSMPPDLWLSTNELRLP+RVCVMS+LI+A+RILYN+HGFGEWEKS
Sbjct: 361 IPLEKILPHARRMYEWSMPPDLWLSTNELRLPTRVCVMSMLIVAIRILYNIHGFGEWEKS 420

Query: 421 L-SVDCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNP---SVSPENPHLTTT 480
           L      S     + +   P  +           P        NP   +   +   L   
Sbjct: 421 LCRRHTCSTSKGTEDSELKPDLDVKECAGKGSGSPQILDDSGTNPGRDTSHAQKTELDAA 480

Query: 481 EFLRKIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQ 540
           + L  +EARY  I ETYEYSKDLP+YLQ+CKDV FAG E  F +D++E+++IE+LW++Y 
Sbjct: 481 KLLCDLEARYRGINETYEYSKDLPSYLQFCKDVVFAGLEPSF-EDYEEKRLIEELWDFYW 540

Query: 541 NEKDYDQT--EDVDQNAASNQKRLREGSNDRL--SNESKKVKGEEDRISRESLNNRTGSI 600
           +E+D      E+   + A  QKR R     R     E+K+ + ++  +   S  N + S 
Sbjct: 541 SERDSKTAVQEEAHGSEAVTQKRARVDVECRSFPFKENKRFR-DKGCVGDPSPYNGSRSA 600

Query: 601 DSRQSHSSKSLDNSDDD------EQSSVD----------KAASSLTSINEAIRQLKLDME 660
             + S +S   D+S DD      +Q+S D          K  +S    +EAIR LK D E
Sbjct: 601 GDQHSENSDQFDSSQDDQISEQKDQTSADSLKDETADTLKDETSEILKDEAIRLLKSDFE 660

Query: 661 EKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGV 720
           E RF YIPPR+NPKRFDYLHY+RK DEGALTY AHADYYILLRACA+AA++DIR+MHIGV
Sbjct: 661 ENRFYYIPPRVNPKRFDYLHYARKKDEGALTYVAHADYYILLRACAKAAKIDIRLMHIGV 720

Query: 721 LSLEKRLSWLEDRIHKSLRLTPTSITCEFCSDVPD-HVGSVGLSDLDI 736
           LS E+RL+W+E RI+  L L P ++ CEFC+ + +  V S+GLS+L+I
Sbjct: 721 LSFERRLAWIEQRINHCLHLKPATVFCEFCNYLGNASVESLGLSNLNI 761

BLAST of Csa2G074100.1 vs. TrEMBL
Match: F6HPR5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02430 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 1.7e-207
Identity = 384/731 (52.53%), Postives = 493/731 (67.44%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M +  +L+C+ CGS+G +DG DGFFYC +CGSQA+DIIDTGVAEED V +  G +   IY
Sbjct: 1   MPERLDLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           S SH R+R+    K EPLSQSQS F           NL  D    V  ++     + D V
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQSQFLN---------NLTLDDDYRVENEETREETVADEV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GP+GP DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+P+ICG+A
Sbjct: 121 GPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGVA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
            +IWLRFV  TRVFD++WA + +Q+SE Q        +  + +  EPHN YGQR V++W 
Sbjct: 181 GTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIWH 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           +SL+KKIPL  +L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAF++IE +IG  S  
Sbjct: 241 RSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSSP 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KILP
Sbjct: 301 CPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           +ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS    S 
Sbjct: 361 YACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGSS 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSP-------ENPHLTTTEFLRK 480
               Q    + ++N   M    +  P    HD++  +  P       +      TE L  
Sbjct: 421 SSSSQIVKLNASDNIKMMDGAKQGSP---LHDLNGSNEEPVTNSSHAQKSEFDATELLCN 480

Query: 481 IEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDY 540
           ++ARY E+ +TYEYSKDLPTYLQYCKDV FAG E  F +DH+E+K+IE+LW +YQN+KD 
Sbjct: 481 LDARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDS 540

Query: 541 DQTED--VDQNAASNQKRLR--EGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQS 600
           + +ED  V+  +A N+KR R  EG  + +  E KK++ +              S+     
Sbjct: 541 EPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRDD-------------CSVPLGLD 600

Query: 601 HSSKSLDNSDDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHY 660
               SL NS   ++S     AS  T   EAI ++K DMEE RFCYIPPR+N KRFDYLHY
Sbjct: 601 GDDTSL-NSQGGQKSVPTHQASVETLKEEAILRMKADMEENRFCYIPPRVNVKRFDYLHY 660

Query: 661 SRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLT 720
            RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   L   
Sbjct: 661 VRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFK 702

BLAST of Csa2G074100.1 vs. TrEMBL
Match: A5ACR3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027088 PE=4 SV=1)

HSP 1 Score: 728.4 bits (1879), Expect = 8.7e-207
Identity = 384/735 (52.24%), Postives = 495/735 (67.35%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M +  +L+C+ CGS+G +DG DGFFYC +CGSQA+DIIDTGVAEED V +  G +   IY
Sbjct: 1   MPERLBLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           S SH R+R+    K EPLSQSQS F           NL  D    V  ++     + D V
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQSQFLN---------NLTLDDDYRVENEETREETVADEV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GP+GP DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+P+ICG+A
Sbjct: 121 GPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGVA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
            +IWLRFV  TRVFD++WA + +Q+SE Q        +  + +  EPHN YGQR V++W 
Sbjct: 181 GTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRTKYSAEPHNIYGQRAVIIWH 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           +SL+KKIPL  +L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAF++IE +IG  S  
Sbjct: 241 RSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSSP 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KILP
Sbjct: 301 CPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           +ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS    S 
Sbjct: 361 YACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGSS 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSP-------ENPHLTTTEFLRK 480
               Q    + ++N   M    +  P    HD++  +  P       +      TE L  
Sbjct: 421 SSSSQIVKLNASDNIKMMDGAKQGSP---LHDLNGSNEEPVTNSSHAQKSEFDATELLCN 480

Query: 481 IEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDY 540
           ++ARY E+ +TYEYSKDLPTYLQYCKDV FAG E  F +DH+E+K+IE+LW +YQN+KD 
Sbjct: 481 LDARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDS 540

Query: 541 DQTED--VDQNAASNQKRLR--EGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQS 600
           + +ED  V+  +A N+KR R  EG  + +  E KK+        R+  +   G      S
Sbjct: 541 EPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKI--------RDDCSVPLGLDGDDTS 600

Query: 601 HSSKSLDNSDDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHY 660
            +S+    S    Q+SV+      T   EAI ++K DMEE RFCYIP R+N KRFDYLHY
Sbjct: 601 LNSQGGQXSVPTHQASVE------TVKEEAILRMKADMEENRFCYIPXRVNVKRFDYLHY 660

Query: 661 SRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLT 720
            RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   L   
Sbjct: 661 VRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFK 706

Query: 721 PTSITCEFCS-DVPD 724
           P   + + C+ D P+
Sbjct: 721 PPKFSSDXCNXDAPE 706

BLAST of Csa2G074100.1 vs. TrEMBL
Match: B9S8W3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0836380 PE=4 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 1.1e-206
Identity = 392/755 (51.92%), Postives = 511/755 (67.68%), Query Frame = 1

Query: 8   SCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQSHTRR 67
           +C  CG +GL +  DGF+YC +CG+QADDII TGVA+ED + +D G+ G  +YS   TR 
Sbjct: 13  ACRRCGHVGLEES-DGFYYCQECGAQADDIILTGVADEDFIEKD-GEGGGALYSARFTRY 72

Query: 68  RNPT-VLKVEPLSQSQSLFGTSQSEF-WDSLNLMEDPSGNVGGK-----DGDIVMLNDGV 127
             PT  ++  P SQ+   +   + +  + +   +     N+  K     D D  +  DG+
Sbjct: 73  SQPTRTIQTNPSSQAWFRYTQEEEDINFTTTTTLNGTYSNIKIKKEERFDDDEYL--DGL 132

Query: 128 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 187
           GP  PEDFG   +     S+E+Y +EVR+RYVMG+Q +++LQCE LV++F  +P+ICG+A
Sbjct: 133 GPVEPEDFGGKSL-----SYEDYYNEVRIRYVMGMQWMIQLQCESLVEKFNVSPLICGVA 192

Query: 188 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 247
            ++WLRF+ AT VF ++WA   + ESESQ        +  SSH++EPHN YGQR V++W 
Sbjct: 193 GNVWLRFLVATGVFKDNWADDVILESESQVQGEPEDWKPRSSHRNEPHNAYGQRAVMVWF 252

Query: 248 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 307
           K LRK IPL S+LA+SFLACHVAREAILPTDI++WS+EGKLPY+AA V+IE R   +S A
Sbjct: 253 KYLRKTIPLSSSLAISFLACHVAREAILPTDIVRWSIEGKLPYFAAHVEIEKRFEHSSPA 312

Query: 308 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 367
           CPISS LM RPS+    QKLES+AA+ A +IGL+LPPVNF+ IA RYL  LALPV+KILP
Sbjct: 313 CPISSSLMFRPSQAVPAQKLESMAAAFAESIGLHLPPVNFYEIASRYLKNLALPVEKILP 372

Query: 368 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 427
           HACRIYEWSMPPDLWLSTNELRLP+RV VMSILI+A+RILYNL+GFG WE+SLS    +C
Sbjct: 373 HACRIYEWSMPPDLWLSTNELRLPTRVTVMSILIVAIRILYNLNGFGAWERSLS--SLNC 432

Query: 428 FPPHQKTHSSPANNF-----SNMQADSENRPGFTSHD------VDNPSVSPENPHLTTTE 487
            P    ++S PA+       S MQ D+E    F S D      + NPS   + P L + E
Sbjct: 433 SP----SNSHPASRLDSMCRSVMQGDAETGSPFYSLDGSAEKFLRNPS-HMQMPELDSAE 492

Query: 488 FLRKIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQN 547
            L  +E +Y+ IA+ YE++KDLP+YLQYCKDV FAG+    +DD +E++++EKLW++YQN
Sbjct: 493 LLHHLEVKYNFIADAYEFTKDLPSYLQYCKDVVFAGAGPSHMDDLEEEELMEKLWDFYQN 552

Query: 548 EKDYDQTED---VDQNAASNQKRLREGSNDRLSN--ESKKVKGEEDRISRESLNNRTGSI 607
           EKD +  ++      +  SNQKR R        N  E +K+K E        +++     
Sbjct: 553 EKDSELAKEPRTQSSSRLSNQKRSRNDDGSVFVNLSEKEKIKEEWHDSPSADISSHNADN 612

Query: 608 DSRQSHSSKSLDNSDDDEQSSVDKAASSLTSI-NEAIRQLKLDMEEKRFCYIPPRINPKR 667
            S QS  +    N+  ++Q+   K   S  ++   AIR+LKLDMEE RFCYIPPR+N KR
Sbjct: 613 SSHQSFDNGHFSNNSLEDQNVEHKEKDSEKTLEGRAIRRLKLDMEENRFCYIPPRVNLKR 672

Query: 668 FDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIH 727
           FDYLHY RK DEGA TY AHADYYILLRACAR AQVDIRIMHIGVLS E+RL+WLE RI 
Sbjct: 673 FDYLHYVRKKDEGAFTYVAHADYYILLRACARVAQVDIRIMHIGVLSFERRLAWLEKRID 732

Query: 728 KSLRLTPTSITCEFCSDVPDHVGS---VGLSDLDI 736
             L L+P +ITCEFC D+PDH  +   +GLS L++
Sbjct: 733 YCLHLSPPTITCEFCRDMPDHNSNDDVIGLSKLNL 751

BLAST of Csa2G074100.1 vs. TAIR10
Match: AT2G02955.1 (AT2G02955.1 maternal effect embryo arrest 12)

HSP 1 Score: 481.1 bits (1237), Expect = 1.2e-135
Identity = 292/723 (40.39%), Postives = 404/723 (55.88%), Query Frame = 1

Query: 9   CYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYSQSHTRRR 68
           C  C +    +  DG++YC +CG Q +++I TGV + DL+    G  GA +Y+  H RR 
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGA-LYNPKH-RRT 62

Query: 69  NPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV---GPTGP 128
            P     +P++ SQ  F    S +    +  E  +GN      ++    D      PT P
Sbjct: 63  EP-----QPITPSQPRFTDDTSRYSQFKSQFESENGNKE-LPREVKRAPDSYVDKEPTEP 122

Query: 129 EDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLAASIWL 188
            DF +  +     S+E Y DE R RYV    +++  QC+ LV +F  TP+I GL   I L
Sbjct: 123 VDFAAETL-----SYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLVGPISL 182

Query: 189 RFVTATRVFDEDWAFQTVQESESQCLDPE-RIRRVCSSHKDEPHNFYGQRVVVLWVKSLR 248
           R+V  + V+ +DWA   +++SE Q  D E +  +    HK EP N  G+R V +W   L+
Sbjct: 183 RYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIWFGILK 242

Query: 249 KKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRACPIS 308
           K +PL S+L +SFLACH A   +LPTDI++W+ EGKLPY + F+DI  ++G+ S ACP+ 
Sbjct: 243 KTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSAACPVK 302

Query: 309 SKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILPHACR 368
             +M RP ++ S Q LE+ A+ IA TIGL LPPVNF+ IA  Y+ +L++P DKIL  A  
Sbjct: 303 VSIMARPFQVISAQMLEARASVIADTIGLPLPPVNFYGIASNYIKQLSIPEDKILDLARL 362

Query: 369 IYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASCFPPH 428
           I  WS+PP+L+LSTNE +LPSRVCVMSILI+A+R+LYN++G G WE+SL    AS     
Sbjct: 363 IQNWSLPPELYLSTNEQKLPSRVCVMSILIVAIRMLYNINGLGVWERSLGFVNAS----- 422

Query: 429 QKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHEI-AE 488
                           DSE           N   + +     T E L+ +EA+YHE+ AE
Sbjct: 423 --------------DGDSET----------NSGTAEKATEFDTQELLKNLEAKYHEVAAE 482

Query: 489 TYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYD----QTEDV 548
           T E  KDL +YL   K+  FAG E    D  D  ++++ LWN Y  ++D +    +  D 
Sbjct: 483 TLESEKDLVSYLSLGKNEFFAGLEEDSPD--DTYRIVDNLWNGYPKDEDIECLPKRGRDW 542

Query: 549 DQNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHS-SKSLDNS 608
           D + + NQ  L +                    SR S  N   S  SR++ S S  LD S
Sbjct: 543 DDDVSLNQLSLYD--------------------SRFSDGNNPCSSSSRRNESVSIGLDLS 602

Query: 609 DDDEQSSVDKAASSLTSINE-AIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGA 668
                SS  + +SS   + E AI++L  DM +  FCYIPPR+  KR DYL Y RK ++GA
Sbjct: 603 -----SSEHRESSSPEKLKEIAIKRLITDMGDDLFCYIPPRVKVKRLDYLQYVRKKEDGA 656

Query: 669 LTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEF 721
           L YAAHADYYILLR CA+ A++D+R MH GVLS E+RL+W+E RI + L LT   +TC+ 
Sbjct: 663 LIYAAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIEKRIDQVLHLTRPLMTCKH 656

BLAST of Csa2G074100.1 vs. NCBI nr
Match: gi|449470354|ref|XP_004152882.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Cucumis sativus])

HSP 1 Score: 1494.6 bits (3868), Expect = 0.0e+00
Identity = 735/735 (100.00%), Postives = 735/735 (100.00%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
           ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA
Sbjct: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480
           FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE
Sbjct: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540
           IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD
Sbjct: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600
           QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD
Sbjct: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600

Query: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660
           DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY
Sbjct: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660

Query: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720
           AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD
Sbjct: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720

Query: 721 VPDHVGSVGLSDLDI 736
           VPDHVGSVGLSDLDI
Sbjct: 721 VPDHVGSVGLSDLDI 735

BLAST of Csa2G074100.1 vs. NCBI nr
Match: gi|659082497|ref|XP_008441871.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Cucumis melo])

HSP 1 Score: 1417.1 bits (3667), Expect = 0.0e+00
Identity = 699/737 (94.84%), Postives = 714/737 (96.88%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDL LRDVGKSG PIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLALRDVGKSGGPIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           SQSHTRRRNPTVLKVEPLSQSQ LFGTSQSEFWDSL+L EDPSGNVG  D  IVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQPLFGTSQSEFWDSLHLKEDPSGNVGQNDDGIVMLNDGV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
           ASIWLRFVTATRVFDEDWAFQTVQESESQCLD ERIRRVCSSHKDEPHNFYGQRVVVLWV
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDTERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           KSLRKKIPLF TLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA
Sbjct: 241 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMR+LYNLHGFGEWEKSLSV+CAS 
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRVLYNLHGFGEWEKSLSVNCASY 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480
           FPP+QKTHSSPANNFSNMQADSENRPGFTSHD+DNPSVSPENPHLTTTEFLRKIEARYHE
Sbjct: 421 FPPNQKTHSSPANNFSNMQADSENRPGFTSHDLDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540
           IAETYEYSKDLP+YLQYCKDV FAGSESLFIDDH+EQKMIEKLWNYYQNEKD+DQTEDVD
Sbjct: 481 IAETYEYSKDLPSYLQYCKDVVFAGSESLFIDDHEEQKMIEKLWNYYQNEKDHDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNS-- 600
           QN ASN KRLREGSNDRLS+ESKKVKGEED IS ES NNRTGSIDSRQSHSSKSL+NS  
Sbjct: 541 QNVASNLKRLREGSNDRLSSESKKVKGEEDCISGESSNNRTGSIDSRQSHSSKSLENSDD 600

Query: 601 DDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGAL 660
           DDDEQSS DKAASSLTSINEAIRQLKLDMEEKRFCYIPPR NPKRF YLHYSRKIDEGAL
Sbjct: 601 DDDEQSSEDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRANPKRFGYLHYSRKIDEGAL 660

Query: 661 TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFC 720
           TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHK+LRLTP+S+TCEFC
Sbjct: 661 TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKTLRLTPSSVTCEFC 720

Query: 721 SDVPDHVGSVGLSDLDI 736
           SDVP+H+ SVGLSDLDI
Sbjct: 721 SDVPNHIDSVGLSDLDI 737

BLAST of Csa2G074100.1 vs. NCBI nr
Match: gi|1009154268|ref|XP_015895077.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Ziziphus jujuba])

HSP 1 Score: 795.8 bits (2054), Expect = 6.4e-227
Identity = 430/769 (55.92%), Postives = 532/769 (69.18%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M+DP    C  CG++GL DG DGFFYCLQCGSQA+DI+DTGVA+ED V    G++G  +Y
Sbjct: 1   MSDPQAWRCNTCGNLGLADGSDGFFYCLQCGSQAEDIVDTGVADEDFVDAG-GETGGALY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFG-TSQSEFWDSLNLMEDPSGNVGGK----DGDIVM 120
             SH R+RNP+V+K EP+SQS  L   T QS+FW SLNL ++       K    D D+  
Sbjct: 61  IASHRRQRNPSVIKAEPISQSDFLLSSTVQSQFWASLNLNDETPKRDRVKTVEYDYDVGP 120

Query: 121 LNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPI 180
            +DGVGPT PEDFG   V    PSFE+Y +E R+RYVMGLQL++E QCE LV+EFK TP+
Sbjct: 121 FSDGVGPTEPEDFGW--VGESVPSFEDYYNETRIRYVMGLQLMIESQCEALVREFKVTPL 180

Query: 181 ICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIR-RVCSSHKDEPHNFYGQR 240
           ICG A +IWLRFV  TRVFD+ W  +T+ ESE+Q         +  S +K EP N YGQR
Sbjct: 181 ICGFAGTIWLRFVAGTRVFDDAWVEETIHESETQAHGESPTDFKPRSKYKAEPQNIYGQR 240

Query: 241 VVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRI 300
            V++W +SLRK+IPL  +LAVSFLACH+AREAILPTD++KWSLEGKLPY+AAFV+IE+ I
Sbjct: 241 AVMIWFRSLRKRIPLTCSLAVSFLACHLAREAILPTDLVKWSLEGKLPYFAAFVEIENVI 300

Query: 301 GKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALP 360
           GK SRACPISS  M RP    S QKLESLAAS+A +I L+LPPVNF++IA  YL KL+LP
Sbjct: 301 GKPSRACPISSSTMFRPRESVSAQKLESLAASVAESICLDLPPVNFYAIASCYLQKLSLP 360

Query: 361 VDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLS 420
           V+KILPHACRIYEWS PPDLWLST+ELRLP+RVCVMSILI+A+RILYN+HGFG+WE+ LS
Sbjct: 361 VEKILPHACRIYEWSTPPDLWLSTSELRLPTRVCVMSILIVAIRILYNIHGFGDWERRLS 420

Query: 421 VDCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSP-------ENPHLTT 480
            D  +    +    S P+ N   M+ D  N  G     VD+   +P       +   L  
Sbjct: 421 NDGEAPSTSYWTGESDPSCN-PKMRDDLTNGSGSPPDIVDDSGTNPVENTSRAQKSELDA 480

Query: 481 TEFLRKIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYY 540
            E L  +EARY+EI ETYEYSKDLPTYLQ+CKDV F+G E  F +D +E+KMIE+LWNYY
Sbjct: 481 AELLHNLEARYNEIVETYEYSKDLPTYLQFCKDVVFSGLEPSF-EDREEEKMIEQLWNYY 540

Query: 541 QNEKDYDQTEDVDQ---NAASNQKRLR--EGSNDRLSNESKKVKGEEDRISRESLNN--- 600
           QN KD +   + +    + A +QKR R  EG   RL  E KK++  +  +S  S N+   
Sbjct: 541 QNNKDSETASEKEMLCGSGAVSQKRPRTDEGCTSRLPKEKKKIRDRD--VSGSSSNDDDS 600

Query: 601 -RTGSIDSRQS--HSSKSLD---NSDDDEQSSVDKAASSLTSI--NEAIRQLKLDMEEKR 660
             TG+    Q+  HS  SL    NSD ++Q S +      T    NEA+R++KLDMEE  
Sbjct: 601 YHTGNQQWSQNGDHSFDSLQGGRNSDSNDQISAETLVDETTETTKNEAVRRIKLDMEENN 660

Query: 661 FCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSL 720
           FCYIPPR+N KRF YL Y RK DEGA TYAAHADYYILLRACA+ AQV+IR MHIGVL+L
Sbjct: 661 FCYIPPRVNIKRFGYLFYVRKKDEGAFTYAAHADYYILLRACAKTAQVEIRCMHIGVLNL 720

Query: 721 EKRLSWLEDRIHKSLRLTPTSITCEFCS-----DVPDHVGSVGLSDLDI 736
           E+RL+WLE RI+  L LTP  ++CEFCS     +  D  G  G S+L+I
Sbjct: 721 ERRLAWLEHRINHCLHLTPPIVSCEFCSGMGQGNATDEDGLHGFSNLNI 762

BLAST of Csa2G074100.1 vs. NCBI nr
Match: gi|703061724|ref|XP_010086553.1| (hypothetical protein L484_001445 [Morus notabilis])

HSP 1 Score: 743.8 bits (1919), Expect = 2.9e-211
Identity = 398/768 (51.82%), Postives = 520/768 (67.71%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M+DP   +C+ CG+ G  DGFDGF+YCL+CGSQA+DII+TGVA+ED   +  G +GAP+Y
Sbjct: 1   MSDPHAWTCHTCGNAGFADGFDGFYYCLRCGSQAEDIIETGVADEDFADKG-GTAGAPLY 60

Query: 61  SQSHTRRRN--PTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLN- 120
           S +H R R    + +K EP+SQ QS   T QS+FW +L L +D  G  G +     +   
Sbjct: 61  SATHRRNRPVAASAIKAEPISQVQS--ATLQSQFWAALTLDDDGEGEGGDRFNRASIKTE 120

Query: 121 ----DGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKAT 180
               DGVGPTGP DFGS  V    PSFEEY  ++R+RYVMGLQL++E QCE LV+EFK  
Sbjct: 121 EIEFDGVGPTGPRDFGS--VGESVPSFEEYYSDIRIRYVMGLQLMIEFQCEALVREFKVN 180

Query: 181 PIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCL-DPERIRRVCSSHKDEPHNFYG 240
           P+ICGLA ++WLRFV  TRVFD+ W  +++ ESESQ   +P +  +  + +  EPHN YG
Sbjct: 181 PLICGLAGTVWLRFVAGTRVFDDGWVDESINESESQQQGEPPQDFKPRAKYGSEPHNMYG 240

Query: 241 QRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIES 300
           QR V++W +SLRKKIPL  TL VSFLACH+ARE +L TDI+KWSLEGK+PY+AAFV+IE 
Sbjct: 241 QRAVMIWFRSLRKKIPLSYTLGVSFLACHLAREPVLTTDIVKWSLEGKVPYFAAFVEIEK 300

Query: 301 RIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLA 360
            +G+ S  CPIS+ LM RP+   + QKLESL+ASIA ++GL LPPVNF+SIA RYL +L+
Sbjct: 301 HMGQRSSGCPISTTLMFRPNESVTAQKLESLSASIADSVGLALPPVNFYSIASRYLRELS 360

Query: 361 LPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKS 420
           +P++KILPHA R+YEWSMPPDLWLSTNELRLP+RVCVMS+LI+A+RILYN+HGFGEWEKS
Sbjct: 361 IPLEKILPHARRMYEWSMPPDLWLSTNELRLPTRVCVMSMLIVAIRILYNIHGFGEWEKS 420

Query: 421 L-SVDCASCFPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNP---SVSPENPHLTTT 480
           L      S     + +   P  +           P        NP   +   +   L   
Sbjct: 421 LCRRHTCSTSKGTEDSELKPDLDVKECAGKGSGSPQILDDSGTNPGRDTSHAQKTELDAA 480

Query: 481 EFLRKIEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQ 540
           + L  +EARY  I ETYEYSKDLP+YLQ+CKDV FAG E  F +D++E+++IE+LW++Y 
Sbjct: 481 KLLCDLEARYRGINETYEYSKDLPSYLQFCKDVVFAGLEPSF-EDYEEKRLIEELWDFYW 540

Query: 541 NEKDYDQT--EDVDQNAASNQKRLREGSNDRL--SNESKKVKGEEDRISRESLNNRTGSI 600
           +E+D      E+   + A  QKR R     R     E+K+ + ++  +   S  N + S 
Sbjct: 541 SERDSKTAVQEEAHGSEAVTQKRARVDVECRSFPFKENKRFR-DKGCVGDPSPYNGSRSA 600

Query: 601 DSRQSHSSKSLDNSDDD------EQSSVD----------KAASSLTSINEAIRQLKLDME 660
             + S +S   D+S DD      +Q+S D          K  +S    +EAIR LK D E
Sbjct: 601 GDQHSENSDQFDSSQDDQISEQKDQTSADSLKDETADTLKDETSEILKDEAIRLLKSDFE 660

Query: 661 EKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGV 720
           E RF YIPPR+NPKRFDYLHY+RK DEGALTY AHADYYILLRACA+AA++DIR+MHIGV
Sbjct: 661 ENRFYYIPPRVNPKRFDYLHYARKKDEGALTYVAHADYYILLRACAKAAKIDIRLMHIGV 720

Query: 721 LSLEKRLSWLEDRIHKSLRLTPTSITCEFCSDVPD-HVGSVGLSDLDI 736
           LS E+RL+W+E RI+  L L P ++ CEFC+ + +  V S+GLS+L+I
Sbjct: 721 LSFERRLAWIEQRINHCLHLKPATVFCEFCNYLGNASVESLGLSNLNI 761

BLAST of Csa2G074100.1 vs. NCBI nr
Match: gi|225425686|ref|XP_002269865.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Vitis vinifera])

HSP 1 Score: 730.7 bits (1885), Expect = 2.5e-207
Identity = 384/731 (52.53%), Postives = 493/731 (67.44%), Query Frame = 1

Query: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60
           M +  +L+C+ CGS+G +DG DGFFYC +CGSQA+DIIDTGVAEED V +  G +   IY
Sbjct: 1   MPERLDLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120
           S SH R+R+    K EPLSQSQS F           NL  D    V  ++     + D V
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQSQFLN---------NLTLDDDYRVENEETREETVADEV 120

Query: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180
           GP+GP DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+P+ICG+A
Sbjct: 121 GPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGVA 180

Query: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240
            +IWLRFV  TRVFD++WA + +Q+SE Q        +  + +  EPHN YGQR V++W 
Sbjct: 181 GTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIWH 240

Query: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300
           +SL+KKIPL  +L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAF++IE +IG  S  
Sbjct: 241 RSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSSP 300

Query: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KILP
Sbjct: 301 CPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420
           +ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS    S 
Sbjct: 361 YACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGSS 420

Query: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSP-------ENPHLTTTEFLRK 480
               Q    + ++N   M    +  P    HD++  +  P       +      TE L  
Sbjct: 421 SSSSQIVKLNASDNIKMMDGAKQGSP---LHDLNGSNEEPVTNSSHAQKSEFDATELLCN 480

Query: 481 IEARYHEIAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDY 540
           ++ARY E+ +TYEYSKDLPTYLQYCKDV FAG E  F +DH+E+K+IE+LW +YQN+KD 
Sbjct: 481 LDARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDS 540

Query: 541 DQTED--VDQNAASNQKRLR--EGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQS 600
           + +ED  V+  +A N+KR R  EG  + +  E KK++ +              S+     
Sbjct: 541 EPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRDD-------------CSVPLGLD 600

Query: 601 HSSKSLDNSDDDEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHY 660
               SL NS   ++S     AS  T   EAI ++K DMEE RFCYIPPR+N KRFDYLHY
Sbjct: 601 GDDTSL-NSQGGQKSVPTHQASVETLKEEAILRMKADMEENRFCYIPPRVNVKRFDYLHY 660

Query: 661 SRKIDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLT 720
            RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   L   
Sbjct: 661 VRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFK 702

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MEE12_ARATH2.2e-13440.39TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Arabido... [more]
TAF1B_ORYSJ8.9e-9633.19TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza s... [more]
TAF1B_ORYSI4.1e-8535.82TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza s... [more]
Match NameE-valueIdentityDescription
A0A0A0LHI6_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074100 PE=4 SV=1[more]
W9QI88_9ROSA2.0e-21151.82Uncharacterized protein OS=Morus notabilis GN=L484_001445 PE=4 SV=1[more]
F6HPR5_VITVI1.7e-20752.53Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02430 PE=4 SV=... [more]
A5ACR3_VITVI8.7e-20752.24Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027088 PE=4 SV=1[more]
B9S8W3_RICCO1.1e-20651.92Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0836380 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G02955.11.2e-13540.39 maternal effect embryo arrest 12[more]
Match NameE-valueIdentityDescription
gi|449470354|ref|XP_004152882.1|0.0e+00100.00PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B... [more]
gi|659082497|ref|XP_008441871.1|0.0e+0094.84PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B... [more]
gi|1009154268|ref|XP_015895077.1|6.4e-22755.92PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B... [more]
gi|703061724|ref|XP_010086553.1|2.9e-21151.82hypothetical protein L484_001445 [Morus notabilis][more]
gi|225425686|ref|XP_002269865.1|2.5e-20752.53PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
Vocabulary: Molecular Function
TermDefinition
GO:0001164RNA polymerase I CORE element sequence-specific DNA binding
GO:0001187transcription factor activity, RNA polymerase I CORE element binding transcription factor recruiting
Vocabulary: Biological Process
TermDefinition
GO:0001188RNA polymerase I transcriptional preinitiation complex assembly
GO:0006360transcription from RNA polymerase I promoter
Vocabulary: Cellular Component
TermDefinition
GO:0070860RNA polymerase I core factor complex
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0001188 RNA polymerase I transcriptional preinitiation complex assembly
biological_process GO:0006360 transcription from RNA polymerase I promoter
cellular_component GO:0070860 RNA polymerase I core factor complex
molecular_function GO:0001164 RNA polymerase I CORE element sequence-specific DNA binding
molecular_function GO:0001187 transcription factor activity, RNA polymerase I CORE element binding transcription factor recruiting

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Csa2G074100Csa2G074100gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Csa2G074100.1Csa2G074100.1-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa2G074100.1.utr3p1Csa2G074100.1.utr3p1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa2G074100.1.cds5Csa2G074100.1.cds5CDS
Csa2G074100.1.cds4Csa2G074100.1.cds4CDS
Csa2G074100.1.cds3Csa2G074100.1.cds3CDS
Csa2G074100.1.cds2Csa2G074100.1.cds2CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa2G074100.1.utr5p2Csa2G074100.1.utr5p2five_prime_UTR
Csa2G074100.1.utr5p1Csa2G074100.1.utr5p1five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31576FAMILY NOT NAMEDcoord: 1..55
score: 2.5E-108coord: 125..724
score: 2.5E