Cla005738 (gene) Watermelon (97103) v1

Name: Cla005738
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Maternal effect embryo arrest 12 protein (AHRD V1 ***- Q5XVF0_ARATH)
Location: Chr11 : 10941503 .. 10944607 (+)
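The coordinate string in the Location field can be split into chromosome, start, end, and strand. The sketch below is illustrative only: the helper names `parse_location` and `feature_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr11 : 10941503 .. 10944607 (+)'."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognised location string: %r" % text)
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def feature_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr11 : 10941503 .. 10944607 (+)")
print(chrom, strand, feature_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 3105 bp on the plus strand of Chr11.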
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGACCCCTGCAACTTGTCTTGCTTCAACTGTGGCAGCATTGGGCTTACAGACGGTTTTGATGGCTTCTTCTACTGCCTTCAGTGTGGTTCCCAGGCTGACGACATCATTGACACTGGTGTCGCCGACGAGGATTTGGTTATAAGAGATGCTGGTAAATCCGGTGGCCCAATCTACTCACAATCTCACACACGACGCCGTAATCCAGCTGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAACCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGCCTTAATTTAAAGGAAGACCCCTATGGCAATGAAGCTCGAAAGGATGATGACATTGTGATGTTAAATGATGGTGTTGGGCCTACTCGTCCAGAGGATTTTGGGTCTGGTGGTGTGCCATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTAATTATGGAGCTTCAATGCGAAGTTTTGGTGAAGGAATTCAAGGCAACCCCTTTAATTTGTGGGTTGGCAGCGAGCATTTGGTTGAGATTTGTGACTGCTTCACGGGTTTTTGATGAAGATTGGGCCTTTGAGACCGTCCAGGATTCTGAGAGCCAATACCTAGGTTTGATAACTCGTTACTTCCACTATAAGTTCTTGTTTATATATTTCTGCCCATTAATCATTTTGGTACTTGAAACTGAAATAGTGAAATCTTTAGCTTTAGATTTTGTTTGATTTTCGAGTGGCACATGGTGGATTCTGATGGTTGTCTTGAAAAGTTAACTCATTCTTATTGTTGCATTGTTCTTTTTGGGGTATAATATTGTTTGCATATTCAAATATCATAGTTTTTAGCCTATATGGCCATACGCGATGCCCATGTCCCAAGTTTCTTTGTAACAGTGCATCATTTTGTCTGACATCTTACAGTTTTGCAAGTACAGTAAGATAACACCTCATTTATTCCTTTATGTAGTGCTGGTTGAGTTTTCTTAATTTATTTTAATATGTTTAACTGTTTTCACCATCTTAGTATTTTGAATTTTTGTCGTACTTTTTCATTCTAGTTGCAAACTGAATGTTCCTTCATTCAACAGATCCTGAATGCATTAGACGTGTTCGCTTTAGCAATAAAGACGAACCCCACAATTTGTATGGTCAACGGGTAGTAGTTGTATGGATTAAATCTTTAAGAAAGAAGATTCCATTATTTTGTACATTAGCTGTTTCATTTTTGGCTTGCCATGTTGCTCGAGAAGCAATTTTGCCAACAGATATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTATTGACATTGAAAATCGCATTGGGAAAACGTCAAGCGCTTGTCCTATAAGTTCAAAGCATATGCATAGGCCCTCTCGAATTTCCTCGCTGCAGAAGTTAGAATCGTTGGCAGCATCAATAGCTCATACCATAGGGCTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCATGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTGCCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTATGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGGTTTGGAGAATGGGAAAAGAGTTTGTCTGTTAATTGTGCTTCATCTTTTCCGCTGAACCAGAAGACGGATTCAGGTCCTGCTAATAATTTCAGTAACATGGAGGCTGATAGTGAAAACAGATCTGGATTTACTTCACATAATGTGGATAATCCATCTGTTTCTCCAAAGAACTCGCATCCGACTGCTACAGAGTTTCTTCGCAAGCTTGAAGCTCGGTATCATGAGATTGCCGAGACTTACGGTATGATTATGATTTCATCATTACTGTATTTTTAAAGTTAGACCAACCTTAGTTATACATTTGCTTTAC
TAGATAAGAGTACTCTTCCTATTCCTGCTCACTTTTATTTCTTACTAAATTGCCATTTGATAATTTGCAAAATTGTGTTGGCAGAATACTCTAAAGACTTGCCAACCTATCTTCAGTACTGTAAGGATGTCGTGTTTGCTGGTTCAGAATCGTTGTTTGTCGATGATCACGAGGAACAAAAGTTGATCGAACAATTATGGAATTATTACCAAAATGGAAAGGTAATGACTATCAAGCTAAAGTCTTGTGTGCGTAGATATTGCAAAAATATTATCTGATGAAATTAGTCAATGTGTGCATAGATATCAGGAAAAAAAAATGACTGGGAAATAATAGTTATAATAGTTCCACTTTACAATATTACTTGCTTTTGTTGCTATGCCTTAGGGATTTCATAACATTGCTTTAATTGCTACCATATAAATGTTTAGTGAAGCAAAAATCAAAATGCAAGCATAATTCCTAACTAGAAAAAGGAAAATTGTTGCAGGATCTAGAACAAACAGAAGATGTTGATCAAAATGCTGCGTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCATTGAAAGCAAGAAAGTTAAAGGCGAAGAAGAACGTATTAACGGCGAATCATCAAATAACAGACCTGGGTCAATTAATTCTCAGCAAAGTCATTCATCAAAAAGTCTAGAAAATAGTGATGATGATGAACTATCCTCTGAAGACAAAACAGCTTCAACGGTAACTTCCATAGATGGAGCGATCAGACGACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGGTTAACCCGAAGAGATTCGATTACCTTCACTACTCAAGGAAGATGGATGAGGGTGCGCTGACTTATGCCGCGCACGCGGATTACTACATATTGCTTCGTGCTTGTGCTAGAGCCGCACAAGTTGATATTAGGATTATGCACATTGGAGTCTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGACAGAATCCATAAATCCCTCCATCTAACACCCTCTAGTGTTTCTTGTGAGTTTTGCAGTGATGTGCCTGATCATGTTGACTCCATTGGACTGTCAGATTTGGATATTTAA

mRNA sequence

ATGGCAGACCCCTGCAACTTGTCTTGCTTCAACTGTGGCAGCATTGGGCTTACAGACGGTTTTGATGGCTTCTTCTACTGCCTTCAGTGTGGTTCCCAGGCTGACGACATCATTGACACTGGTGTCGCCGACGAGGATTTGGTTATAAGAGATGCTGGTAAATCCGGTGGCCCAATCTACTCACAATCTCACACACGACGCCGTAATCCAGCTGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAACCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGCCTTAATTTAAAGGAAGACCCCTATGGCAATGAAGCTCGAAAGGATGATGACATTGTGATGTTAAATGATGGTGTTGGGCCTACTCGTCCAGAGGATTTTGGGTCTGGTGGTGTGCCATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTAATTATGGAGCTTCAATGCGAAGTTTTGGTGAAGGAATTCAAGGCAACCCCTTTAATTTGTGGGTTGGCAGCGAGCATTTGGTTGAGATTTGTGACTGCTTCACGGGTTTTTGATGAAGATTGGGCCTTTGAGACCGTCCAGGATTCTGAGAGCCAATACCTAGATCCTGAATGCATTAGACGTGTTCGCTTTAGCAATAAAGACGAACCCCACAATTTGTATGGTCAACGGGTAGTAGTTGTATGGATTAAATCTTTAAGAAAGAAGATTCCATTATTTTGTACATTAGCTGTTTCATTTTTGGCTTGCCATGTTGCTCGAGAAGCAATTTTGCCAACAGATATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTATTGACATTGAAAATCGCATTGGGAAAACGTCAAGCGCTTGTCCTATAAGTTCAAAGCATATGCATAGGCCCTCTCGAATTTCCTCGCTGCAGAAGTTAGAATCGTTGGCAGCATCAATAGCTCATACCATAGGGCTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCATGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTGCCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTATGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGGTTTGGAGAATGGGAAAAGAGTTTGTCTGTTAATTGTGCTTCATCTTTTCCGCTGAACCAGAAGACGGATTCAGGTCCTGCTAATAATTTCAGTAACATGGAGGCTGATAGTGAAAACAGATCTGGATTTACTTCACATAATGTGGATAATCCATCTGTTTCTCCAAAGAACTCGCATCCGACTGCTACAGAGTTTCTTCGCAAGCTTGAAGCTCGGTATCATGAGATTGCCGAGACTTACGAATACTCTAAAGACTTGCCAACCTATCTTCAGTACTGTAAGGATGTCGTGTTTGCTGGTTCAGAATCGTTGTTTGTCGATGATCACGAGGAACAAAAGTTGATCGAACAATTATGGAATTATTACCAAAATGGAAAGGATCTAGAACAAACAGAAGATGTTGATCAAAATGCTGCGTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCATTGAAAGCAAGAAAGTTAAAGGCGAAGAAGAACGTATTAACGGCGAATCATCAAATAACAGACCTGGGTCAATTAATTCTCAGCAAAGTCATTCATCAAAAAGTCTAGAAAATAGTGATGATGATGAACTATCCTCTGAAGACAAAACAGCTTCAACGGTAACTTCCATAGATGGAGCGATCAGACGACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGGTTAACCCGAAGAGATTCGATTACCTTCACTACTCAAGGAAGATGGATGAGGGTGCGCTGACTTATGCCGCGCACGCGGATTACTA
CATATTGCTTCGTGCTTGTGCTAGAGCCGCACAAGTTGATATTAGGATTATGCACATTGGAGTCTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGACAGAATCCATAAATCCCTCCATCTAACACCCTCTAGTGTTTCTTGTGAGTTTTGCAGTGATGTGCCTGATCATGTTGACTCCATTGGACTGTCAGATTTGGATATTTAA

Coding sequence (CDS)

ATGGCAGACCCCTGCAACTTGTCTTGCTTCAACTGTGGCAGCATTGGGCTTACAGACGGTTTTGATGGCTTCTTCTACTGCCTTCAGTGTGGTTCCCAGGCTGACGACATCATTGACACTGGTGTCGCCGACGAGGATTTGGTTATAAGAGATGCTGGTAAATCCGGTGGCCCAATCTACTCACAATCTCACACACGACGCCGTAATCCAGCTGTGTTGAAGGTGGAACCCCTATCTCAGTCTCAACCACTTTTTGGTACAAGTCAGTCTGAGTTCTGGGATAGCCTTAATTTAAAGGAAGACCCCTATGGCAATGAAGCTCGAAAGGATGATGACATTGTGATGTTAAATGATGGTGTTGGGCCTACTCGTCCAGAGGATTTTGGGTCTGGTGGTGTGCCATCTGGAAAACCAAGTTTTGAGGAGTATGCTGATGAAGTGAGGATGAGGTATGTAATGGGGTTGCAGCTAATTATGGAGCTTCAATGCGAAGTTTTGGTGAAGGAATTCAAGGCAACCCCTTTAATTTGTGGGTTGGCAGCGAGCATTTGGTTGAGATTTGTGACTGCTTCACGGGTTTTTGATGAAGATTGGGCCTTTGAGACCGTCCAGGATTCTGAGAGCCAATACCTAGATCCTGAATGCATTAGACGTGTTCGCTTTAGCAATAAAGACGAACCCCACAATTTGTATGGTCAACGGGTAGTAGTTGTATGGATTAAATCTTTAAGAAAGAAGATTCCATTATTTTGTACATTAGCTGTTTCATTTTTGGCTTGCCATGTTGCTCGAGAAGCAATTTTGCCAACAGATATAATAAAGTGGTCACTTGAGGGGAAACTTCCATATTATGCTGCTTTTATTGACATTGAAAATCGCATTGGGAAAACGTCAAGCGCTTGTCCTATAAGTTCAAAGCATATGCATAGGCCCTCTCGAATTTCCTCGCTGCAGAAGTTAGAATCGTTGGCAGCATCAATAGCTCATACCATAGGGCTAAATCTTCCTCCAGTTAATTTTCATTCAATAGCATGTCGTTATCTCAACAAGCTGGCCCTTCCTGTTGATAAGATTTTGCCTCATGCTTGCCGCATTTATGAGTGGTCAATGCCTCCTGATTTATGGTTATCAACCAATGAATTGAGACTTCCCTCTCGTGTTTGTGTAATGTCAATTCTAATTATAGCAATGAGAATTCTGTATAATCTCCATGGGTTTGGAGAATGGGAAAAGAGTTTGTCTGTTAATTGTGCTTCATCTTTTCCGCTGAACCAGAAGACGGATTCAGGTCCTGCTAATAATTTCAGTAACATGGAGGCTGATAGTGAAAACAGATCTGGATTTACTTCACATAATGTGGATAATCCATCTGTTTCTCCAAAGAACTCGCATCCGACTGCTACAGAGTTTCTTCGCAAGCTTGAAGCTCGGTATCATGAGATTGCCGAGACTTACGAATACTCTAAAGACTTGCCAACCTATCTTCAGTACTGTAAGGATGTCGTGTTTGCTGGTTCAGAATCGTTGTTTGTCGATGATCACGAGGAACAAAAGTTGATCGAACAATTATGGAATTATTACCAAAATGGAAAGGATCTAGAACAAACAGAAGATGTTGATCAAAATGCTGCGTCCAATCAAAAGAGATTGAGGGAAGGTTCCAATGATCGATTATCCATTGAAAGCAAGAAAGTTAAAGGCGAAGAAGAACGTATTAACGGCGAATCATCAAATAACAGACCTGGGTCAATTAATTCTCAGCAAAGTCATTCATCAAAAAGTCTAGAAAATAGTGATGATGATGAACTATCCTCTGAAGACAAAACAGCTTCAACGGTAACTTCCATAGATGGAGCGATCAGACGACTGAAACTCGACATGGAGGAGAAAAGGTTCTGCTACATTCCCCCAAGGGTTAACCCGAAGAGATTCGATTACCTTCACTACTCAAGGAAGATGGATGAGGGTGCGCTGACTTATGCCGCGCACGCGGATTACTA
CATATTGCTTCGTGCTTGTGCTAGAGCCGCACAAGTTGATATTAGGATTATGCACATTGGAGTCTTGAGTCTTGAGAAAAGATTGTCTTGGTTAGAAGACAGAATCCATAAATCCCTCCATCTAACACCCTCTAGTGTTTCTTGTGAGTTTTGCAGTGATGTGCCTGATCATGTTGACTCCATTGGACTGTCAGATTTGGATATTTAA

Protein sequence

MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIYSQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGNEARKDDDIVMLNDGVGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLAASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLEARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLEQTEDVDQNAASNQKRLREGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSHSSKSLENSDDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFCSDVPDHVDSIGLSDLDI
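The protein sequence should follow from the coding sequence under the standard genetic code. The sketch below is one way to check this, not the database's own procedure. The names `CODON_TABLE` and `translate` are hypothetical, and the standard nuclear codon table is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGCAGACCCCTGCAACTTG"))
```

Applied to the first 21 bases of the CDS, this yields MADPCNL, which matches the start of the protein sequence above. The last 15 bases (GATTTGGATATTTAA) yield DLDI followed by a stop codon, which matches its end.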
BLAST of Cla005738 vs. Swiss-Prot
Match: MEE12_ARATH (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Arabidopsis thaliana GN=MEE12 PE=1 SV=1)

HSP 1 Score: 496.9 bits (1278), Expect = 3.8e-139
Identity = 298/726 (41.05%), Positives = 409/726 (56.34%), Query Frame = 1

Query: 9   CFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIYSQSHTRRR 68
           C  C +    +  DG++YC +CG Q +++I TGV D DL I + G + G +Y+  H RR 
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDL-IGEGGGTQGALYNPKH-RRT 62

Query: 69  NPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGN-----EARKDDDIVMLNDGVGPT 128
            P     +P++ SQP F    S +    +  E   GN     E ++  D  +  +   PT
Sbjct: 63  EP-----QPITPSQPRFTDDTSRYSQFKSQFESENGNKELPREVKRAPDSYVDKE---PT 122

Query: 129 RPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLAASI 188
            P DF +  +     S+E Y DE R RYV    +++  QC+ LV +F  TPLI GL   I
Sbjct: 123 EPVDFAAETL-----SYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLVGPI 182

Query: 189 WLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFS-NKDEPHNLYGQRVVVVWIKS 248
            LR+V  S V+ +DWA   ++DSE Q  D E     R   +K EP N+ G+R V +W   
Sbjct: 183 SLRYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIWFGI 242

Query: 249 LRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSACP 308
           L+K +PL  +L +SFLACH A   +LPTDI++W+ EGKLPY + F+DI  ++G+ S+ACP
Sbjct: 243 LKKTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSAACP 302

Query: 309 ISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILPHA 368
           +    M RP ++ S Q LE+ A+ IA TIGL LPPVNF+ IA  Y+ +L++P DKIL  A
Sbjct: 303 VKVSIMARPFQVISAQMLEARASVIADTIGLPLPPVNFYGIASNYIKQLSIPEDKILDLA 362

Query: 369 CRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCASSFP 428
             I  WS+PP+L+LSTNE +LPSRVCVMSILI+A+R+LYN++G G WE+SL    AS   
Sbjct: 363 RLIQNWSLPPELYLSTNEQKLPSRVCVMSILIVAIRMLYNINGLGVWERSLGFVNAS--- 422

Query: 429 LNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLEARYHEI- 488
                           + DSE  SG           + K +     E L+ LEA+YHE+ 
Sbjct: 423 ----------------DGDSETNSG----------TAEKATEFDTQELLKNLEAKYHEVA 482

Query: 489 AETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLE----QTE 548
           AET E  KDL +YL   K+  FAG E    DD    ++++ LWN Y   +D+E    +  
Sbjct: 483 AETLESEKDLVSYLSLGKNEFFAGLEEDSPDD--TYRIVDNLWNGYPKDEDIECLPKRGR 542

Query: 549 DVDQNAASNQKRLREGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSHS---SKS 608
           D D + + NQ          LS+   +           S  N P S +S+++ S      
Sbjct: 543 DWDDDVSLNQ----------LSLYDSRF----------SDGNNPCSSSSRRNESVSIGLD 602

Query: 609 LENSDDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMD 668
           L +S+  E SS +K          AI+RL  DM +  FCYIPPRV  KR DYL Y RK +
Sbjct: 603 LSSSEHRESSSPEKLKEI------AIKRLITDMGDDLFCYIPPRVKVKRLDYLQYVRKKE 656

Query: 669 EGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVS 721
           +GAL YAAHADYYILLR CA+ A++D+R MH GVLS E+RL+W+E RI + LHLT   ++
Sbjct: 663 DGALIYAAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIEKRIDQVLHLTRPLMT 656
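The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line. The helper names `parse_hsp_counts` and `percent` are hypothetical. The counts are read by position (identity first, then positives), so the code does not depend on how the field labels are spelled.

```python
import re

def parse_hsp_counts(line):
    """Return every 'matches/aligned' pair on an HSP header line, in order."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", line)]

def percent(matches, aligned):
    # Share of aligned columns, rounded to two decimals as in the report.
    return round(100.0 * matches / aligned, 2)

header = "Identity = 298/726 (41.05%), Positives = 409/726 (56.34%), Query Frame = 1"
(identical, length), (positive, _) = parse_hsp_counts(header)
print(percent(identical, length), percent(positive, length))
```

For the MEE12_ARATH hit, 298 of 726 aligned columns are identical (41.05%), and 409 of 726 are scored as positive (56.34%).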

BLAST of Cla005738 vs. Swiss-Prot
Match: TAF1B_ORYSJ (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza sativa subsp. japonica GN=Os05g0352700 PE=3 SV=2)

HSP 1 Score: 322.8 bits (826), Expect = 9.9e-87
Identity = 210/564 (37.23%), Positives = 295/564 (52.30%), Query Frame = 1

Query: 6   NLSCFNCG---SIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIYSQ 65
           +L C  CG        D  DGFF C QC +      +T     D  +        P +  
Sbjct: 18  HLVCEYCGHGSEYAEDDADDGFFTCRQCSAIHTSTQNTATNPFDFPMT-------PAHLS 77

Query: 66  SHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGNEARKDDDIVMLNDGVGP 125
           +H R   P        +      G +  +F        D  G                 P
Sbjct: 78  AHRRPTQPTPTPKPFPAPRGAATGAAAPDF--------DDLGE----------------P 137

Query: 126 TRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLAAS 185
           + P DF +G    G P  E+ A  VR RYV GLQ+I++ Q E LV+  +   L   LA +
Sbjct: 138 SEPRDFATGANAWGNP--EDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAASLAGT 197

Query: 186 IWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQR------VV 245
           IWLR+V AS+VFDE W  + +  + S       +     ++KD+   L G          
Sbjct: 198 IWLRWVAASKVFDEMWVHKMLAIAAS-------VEEGHSASKDKQSELEGDAQKSQSSYE 257

Query: 246 VVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGK 305
            ++++SLR  +P++ TLAV FLACHVARE ILPTDI +W++EGKLPY AAF  ++  +G 
Sbjct: 258 FLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLGS 317

Query: 306 TSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVD 365
           + + CP+SS+ + RP+R+    +LE+ A SIA  IGL LP VNF+ IA R+L +L+LP++
Sbjct: 318 SLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPIE 377

Query: 366 KILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEK-SLSV 425
           KILPHACRIYEW+MP +LWLS+N  R+PSRVCVM+ILI+A+R+LY ++G G WE  + + 
Sbjct: 378 KILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWESIAQTE 437

Query: 426 NCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLE 485
           N   S P         A+   ++E DS N   F                  A E L  L 
Sbjct: 438 NAVGSDP--------EASAPHSIEPDSNNSEEF-----------------DARELLCTLA 497

Query: 486 ARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLEQ 545
           A Y +I   ++YSK++ +YL+YCKDVVF G         EE+ LI+  W+ Y+ GK++  
Sbjct: 498 ASYDKINVGHDYSKEVHSYLKYCKDVVFTG----MTFSLEEEHLIDIFWDMYK-GKEVML 508

Query: 546 TEDVDQNAASNQKRLR--EGSNDR 558
              +D+NA   Q++LR   G N R
Sbjct: 558 ---LDENAKLCQEKLRTTNGVNKR 508


HSP 2 Score: 97.1 bits (240), Expect = 8.8e-19
Identity = 44/89 (49.44%), Positives = 60/89 (67.42%), Query Frame = 1

Query: 619 AIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQ 678
           A++ +K  MEE  FCY+ PR       YL Y+R+   G+L Y AHADYYILLR  A+ A+
Sbjct: 528 ALQSIKSKMEENGFCYVSPRKRLVSDGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAE 587

Query: 679 VDIRIMHIGVLSLEKRLSWLEDRIHKSLH 708
           VD+R++H  VL LE+RL W+E+R+ +SL+
Sbjct: 588 VDVRVLHSSVLKLERRLGWIEERVGRSLN 616

BLAST of Cla005738 vs. Swiss-Prot
Match: TAF1B_ORYSI (TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza sativa subsp. indica GN=OsI_19584 PE=3 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 3.7e-86
Identity = 188/445 (42.25%), Positives = 264/445 (59.33%), Query Frame = 1

Query: 122 PTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLAA 181
           P+ P DF +G    G P  E+ A  VR RYV GLQ+I++ Q E LV+  +   L   LA 
Sbjct: 106 PSEPRDFATGANAWGNP--EDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAASLAG 165

Query: 182 SIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQR------V 241
           +IWLR+V AS+VFDE W  + +  + S       +     ++KD+   L G         
Sbjct: 166 TIWLRWVAASKVFDEMWVHKMLAIAAS-------VEEGHSASKDKQSELEGDAQKSQSSY 225

Query: 242 VVVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIG 301
             ++++SLR  +P++ TLAV FLACHVARE ILPTDI +W++EGKLPY AAF  ++  +G
Sbjct: 226 EFLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLG 285

Query: 302 KTSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPV 361
            + + CP+SS+ + RP+R+    +LE+ A SIA  IGL LP VNF+ IA R+L +L+LP+
Sbjct: 286 SSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPI 345

Query: 362 DKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEK-SLS 421
           +KILPHACRIYEW+MP +LWLS+N  R+PSRVCVM+ILI+A+R+LY ++G G WE  + +
Sbjct: 346 EKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWESIAQT 405

Query: 422 VNCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKL 481
            N   S P         A+   ++E DS N   F                  A E L  L
Sbjct: 406 ENAVGSDP--------EASAPHSIEPDSNNSEEF-----------------DARELLCTL 465

Query: 482 EARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLE 541
            A Y +I   ++YSK++ +YL+YCKDVVF G         EE+ LI+  W+ Y+ GK++ 
Sbjct: 466 AASYDKIDVGHDYSKEVHSYLKYCKDVVFTG----MTFSLEEEHLIDIFWDMYK-GKEVM 508

Query: 542 QTEDVDQNAASNQKRLR--EGSNDR 558
               +D+NA   Q++LR   G N R
Sbjct: 526 L---LDENAKLCQEKLRTTNGVNKR 508


HSP 2 Score: 97.1 bits (240), Expect = 8.8e-19
Identity = 44/89 (49.44%), Positives = 60/89 (67.42%), Query Frame = 1

Query: 619 AIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQ 678
           A++ +K  MEE  FCY+ PR       YL Y+R+   G+L Y AHADYYILLR  A+ A+
Sbjct: 528 ALQSIKSKMEENGFCYVSPRKRLVSDGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAE 587

Query: 679 VDIRIMHIGVLSLEKRLSWLEDRIHKSLH 708
           VD+R++H  VL LE+RL W+E+R+ +SL+
Sbjct: 588 VDVRVLHSSVLKLERRLGWIEERVGRSLN 616

BLAST of Cla005738 vs. TrEMBL
Match: A0A0A0LHI6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074100 PE=4 SV=1)

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 654/735 (88.98%), Positives = 692/735 (94.15%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           MADP NLSC+NCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVA+EDLV+RD GKSG PIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGNEARKDDDIVMLNDGV 120
           SQSHTRRRNP VLKVEPLSQSQ LFGTSQSEFWDSLNL EDP GN   KD DIVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120

Query: 121 GPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLA 180
           GPT PEDFGSG V SGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATP+ICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVWI 240
           ASIWLRFVTA+RVFDEDWAF+TVQ+SESQ LDPE IRRV  S+KDEPHN YGQRVVV+W+
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSA 300
           KSLRKKIPLF TLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAF+DIE+RIGKTS A
Sbjct: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSK MHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCASS 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSV+CAS 
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420

Query: 421 FPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLEARYHE 480
           FP +QKT S PANNFSNM+ADSENR GFTSH+VDNPSVSP+N H T TEFLRK+EARYHE
Sbjct: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLEQTEDVD 540
           IAETYEYSKDLPTYLQYCKDV FAGSESLF+DDH+EQK+IE+LWNYYQN KD +QTEDVD
Sbjct: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSHSSKSLENSDD 600
           QNAASNQKRLREGSNDRLS ESKKVKGEE+RI+ ES NNR GSI+S+QSHSSKSL+NSDD
Sbjct: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600

Query: 601 DELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTY 660
           DE SS DK AS++TSI+ AIR+LKLDMEEKRFCYIPPR+NPKRFDYLHYSRK+DEGALTY
Sbjct: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660

Query: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFCSD 720
           AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSL LTP+S++CEFCSD
Sbjct: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720

Query: 721 VPDHVDSIGLSDLDI 736
           VPDHV S+GLSDLDI
Sbjct: 721 VPDHVGSVGLSDLDI 735

BLAST of Cla005738 vs. TrEMBL
Match: W9QI88_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001445 PE=4 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 1.9e-214
Identity = 411/775 (53.03%), Positives = 528/775 (68.13%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           M+DP   +C  CG+ G  DGFDGF+YCL+CGSQA+DII+TGVADED   +  G +G P+Y
Sbjct: 1   MSDPHAWTCHTCGNAGFADGFDGFYYCLRCGSQAEDIIETGVADEDFADK-GGTAGAPLY 60

Query: 61  SQSHTRRRNPAV--LKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYG------NEAR-KDD 120
           S +H R R  A   +K EP+SQ Q    T QS+FW +L L +D  G      N A  K +
Sbjct: 61  SATHRRNRPVAASAIKAEPISQVQS--ATLQSQFWAALTLDDDGEGEGGDRFNRASIKTE 120

Query: 121 DIVMLNDGVGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFK 180
           +I    DGVGPT P DFGS G     PSFEEY  ++R+RYVMGLQL++E QCE LV+EFK
Sbjct: 121 EIEF--DGVGPTGPRDFGSVG--ESVPSFEEYYSDIRIRYVMGLQLMIEFQCEALVREFK 180

Query: 181 ATPLICGLAASIWLRFVTASRVFDEDWAFETVQDSESQYL-DPECIRRVRFSNKDEPHNL 240
             PLICGLA ++WLRFV  +RVFD+ W  E++ +SESQ   +P    + R     EPHN+
Sbjct: 181 VNPLICGLAGTVWLRFVAGTRVFDDGWVDESINESESQQQGEPPQDFKPRAKYGSEPHNM 240

Query: 241 YGQRVVVVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDI 300
           YGQR V++W +SLRKKIPL  TL VSFLACH+ARE +L TDI+KWSLEGK+PY+AAF++I
Sbjct: 241 YGQRAVMIWFRSLRKKIPLSYTLGVSFLACHLAREPVLTTDIVKWSLEGKVPYFAAFVEI 300

Query: 301 ENRIGKTSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 360
           E  +G+ SS CPIS+  M RP+   + QKLESL+ASIA ++GL LPPVNF+SIA RYL +
Sbjct: 301 EKHMGQRSSGCPISTTLMFRPNESVTAQKLESLSASIADSVGLALPPVNFYSIASRYLRE 360

Query: 361 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 420
           L++P++KILPHA R+YEWSMPPDLWLSTNELRLP+RVCVMS+LI+A+RILYN+HGFGEWE
Sbjct: 361 LSIPLEKILPHARRMYEWSMPPDLWLSTNELRLPTRVCVMSMLIVAIRILYNIHGFGEWE 420

Query: 421 KSLSVNCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHN---VDNPSVSP--KNSHP 480
           KSL      S      T  G  ++    + D +  +G  S +   +D+   +P    SH 
Sbjct: 421 KSLCRRHTCS------TSKGTEDSELKPDLDVKECAGKGSGSPQILDDSGTNPGRDTSHA 480

Query: 481 TATE-----FLRKLEARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLI 540
             TE      L  LEARY  I ETYEYSKDLP+YLQ+CKDVVFAG E  F +D+EE++LI
Sbjct: 481 QKTELDAAKLLCDLEARYRGINETYEYSKDLPSYLQFCKDVVFAGLEPSF-EDYEEKRLI 540

Query: 541 EQLWNYYQNGKDLEQT--EDVDQNAASNQKRLREGSNDR-LSIESKKVKGEEERINGESS 600
           E+LW++Y + +D +    E+   + A  QKR R     R    +  K   ++  +   S 
Sbjct: 541 EELWDFYWSERDSKTAVQEEAHGSEAVTQKRARVDVECRSFPFKENKRFRDKGCVGDPSP 600

Query: 601 NNRPGSINSQQSHSSKSLENSDDDELSSE----------DKTASTVTS------IDGAIR 660
            N   S   Q S +S   ++S DD++S +          D+TA T+         D AIR
Sbjct: 601 YNGSRSAGDQHSENSDQFDSSQDDQISEQKDQTSADSLKDETADTLKDETSEILKDEAIR 660

Query: 661 RLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQVDI 720
            LK D EE RF YIPPRVNPKRFDYLHY+RK DEGALTY AHADYYILLRACA+AA++DI
Sbjct: 661 LLKSDFEENRFYYIPPRVNPKRFDYLHYARKKDEGALTYVAHADYYILLRACAKAAKIDI 720

Query: 721 RIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFCSDVPD-HVDSIGLSDLDI 736
           R+MHIGVLS E+RL+W+E RI+  LHL P++V CEFC+ + +  V+S+GLS+L+I
Sbjct: 721 RLMHIGVLSFERRLAWIEQRINHCLHLKPATVFCEFCNYLGNASVESLGLSNLNI 761

BLAST of Cla005738 vs. TrEMBL
Match: F6HPR5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02430 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 7.3e-214
Identity = 400/730 (54.79%), Positives = 506/730 (69.32%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           M +  +L+C  CGS+G +DG DGFFYC +CGSQA+DIIDTGVA+ED V +  G + G IY
Sbjct: 1   MPERLDLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKED-PYGNEARKDDDIVMLNDG 120
           S SH R+R+    K EPLSQSQ       S+F ++L L +D    NE  +++ +    D 
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQ-------SQFLNNLTLDDDYRVENEETREETVA---DE 120

Query: 121 VGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGL 180
           VGP+ P DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+PLICG+
Sbjct: 121 VGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGV 180

Query: 181 AASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVW 240
           A +IWLRFV  +RVFD++WA + +QDSE Q        + R     EPHN+YGQR V++W
Sbjct: 181 AGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIW 240

Query: 241 IKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSS 300
            +SL+KKIPL C+L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAFI+IE +IG  SS
Sbjct: 241 HRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSS 300

Query: 301 ACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKIL 360
            CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KIL
Sbjct: 301 PCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKIL 360

Query: 361 PHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCAS 420
           P+ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS +  S
Sbjct: 361 PYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGS 420

Query: 421 SFPLNQKTDSGPANNFSNMEADSE-----NRSGFTSHNVDNPSVSPKNSHPTATEFLRKL 480
           S   +Q      ++N   M+   +     + +G     V N S + K S   ATE L  L
Sbjct: 421 SSSSSQIVKLNASDNIKMMDGAKQGSPLHDLNGSNEEPVTNSSHAQK-SEFDATELLCNL 480

Query: 481 EARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLE 540
           +ARY E+ +TYEYSKDLPTYLQYCKDVVFAG E  F +DHEE+K+IEQLW +YQN KD E
Sbjct: 481 DARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDSE 540

Query: 541 QTED--VDQNAASNQKRLR--EGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSH 600
            +ED  V+  +A N+KR R  EG  + +  E KK++ +     G   ++   S+NSQ   
Sbjct: 541 PSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRDDCSVPLGLDGDDT--SLNSQGGQ 600

Query: 601 SSKSLENSDDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYS 660
            S     +  + L  E            AI R+K DMEE RFCYIPPRVN KRFDYLHY 
Sbjct: 601 KSVPTHQASVETLKEE------------AILRMKADMEENRFCYIPPRVNVKRFDYLHYV 660

Query: 661 RKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTP 720
           RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   LH  P
Sbjct: 661 RKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFKP 702

BLAST of Cla005738 vs. TrEMBL
Match: A5ACR3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027088 PE=4 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 6.2e-213
Identity = 401/734 (54.63%), Positives = 508/734 (69.21%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           M +  +L+C  CGS+G +DG DGFFYC +CGSQA+DIIDTGVA+ED V +  G + G IY
Sbjct: 1   MPERLBLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKED-PYGNEARKDDDIVMLNDG 120
           S SH R+R+    K EPLSQSQ       S+F ++L L +D    NE  +++ +    D 
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQ-------SQFLNNLTLDDDYRVENEETREETVA---DE 120

Query: 121 VGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGL 180
           VGP+ P DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+PLICG+
Sbjct: 121 VGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGV 180

Query: 181 AASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVW 240
           A +IWLRFV  +RVFD++WA + +QDSE Q        + R     EPHN+YGQR V++W
Sbjct: 181 AGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRTKYSAEPHNIYGQRAVIIW 240

Query: 241 IKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSS 300
            +SL+KKIPL C+L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAFI+IE +IG  SS
Sbjct: 241 HRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSS 300

Query: 301 ACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKIL 360
            CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KIL
Sbjct: 301 PCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKIL 360

Query: 361 PHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCAS 420
           P+ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS +  S
Sbjct: 361 PYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGS 420

Query: 421 SFPLNQKTDSGPANNFSNMEADSE-----NRSGFTSHNVDNPSVSPKNSHPTATEFLRKL 480
           S   +Q      ++N   M+   +     + +G     V N S + K S   ATE L  L
Sbjct: 421 SSSSSQIVKLNASDNIKMMDGAKQGSPLHDLNGSNEEPVTNSSHAQK-SEFDATELLCNL 480

Query: 481 EARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLE 540
           +ARY E+ +TYEYSKDLPTYLQYCKDVVFAG E  F +DHEE+K+IEQLW +YQN KD E
Sbjct: 481 DARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDSE 540

Query: 541 QTED--VDQNAASNQKRLR--EGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSH 600
            +ED  V+  +A N+KR R  EG  + +  E KK++ +     G   ++   S+NSQ   
Sbjct: 541 PSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRDDCSVPLGLDGDDT--SLNSQ--- 600

Query: 601 SSKSLENSDDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYS 660
                      + S     AS  T  + AI R+K DMEE RFCYIP RVN KRFDYLHY 
Sbjct: 601 ---------GGQXSVPTHQASVETVKEEAILRMKADMEENRFCYIPXRVNVKRFDYLHYV 660

Query: 661 RKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTP 720
           RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   LH  P
Sbjct: 661 RKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFKP 706

Query: 721 SSVSCEFCS-DVPD 724
              S + C+ D P+
Sbjct: 721 PKFSSDXCNXDAPE 706

BLAST of Cla005738 vs. TrEMBL
Match: B9S8W3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0836380 PE=4 SV=1)

HSP 1 Score: 729.2 bits (1881), Expect = 5.1e-207
Identity = 392/755 (51.92%), Positives = 511/755 (67.68%), Query Frame = 1

Query: 8   SCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIYSQSHTRR 67
           +C  CG +GL +  DGF+YC +CG+QADDII TGVADED + +D G+ GG +YS   TR 
Sbjct: 13  ACRRCGHVGLEES-DGFYYCQECGAQADDIILTGVADEDFIEKD-GEGGGALYSARFTRY 72

Query: 68  RNPA-VLKVEPLSQSQPLFGTSQSEF-WDSLNLKEDPYGN-----EARKDDDIVMLNDGV 127
             P   ++  P SQ+   +   + +  + +       Y N     E R DDD  +  DG+
Sbjct: 73  SQPTRTIQTNPSSQAWFRYTQEEEDINFTTTTTLNGTYSNIKIKKEERFDDDEYL--DGL 132

Query: 128 GPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLA 187
           GP  PEDFG   +     S+E+Y +EVR+RYVMG+Q +++LQCE LV++F  +PLICG+A
Sbjct: 133 GPVEPEDFGGKSL-----SYEDYYNEVRIRYVMGMQWMIQLQCESLVEKFNVSPLICGVA 192

Query: 188 ASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVWI 247
            ++WLRF+ A+ VF ++WA + + +SESQ        + R S+++EPHN YGQR V+VW 
Sbjct: 193 GNVWLRFLVATGVFKDNWADDVILESESQVQGEPEDWKPRSSHRNEPHNAYGQRAVMVWF 252

Query: 248 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSA 307
           K LRK IPL  +LA+SFLACHVAREAILPTDI++WS+EGKLPY+AA ++IE R   +S A
Sbjct: 253 KYLRKTIPLSSSLAISFLACHVAREAILPTDIVRWSIEGKLPYFAAHVEIEKRFEHSSPA 312

Query: 308 CPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 367
           CPISS  M RPS+    QKLES+AA+ A +IGL+LPPVNF+ IA RYL  LALPV+KILP
Sbjct: 313 CPISSSLMFRPSQAVPAQKLESMAAAFAESIGLHLPPVNFYEIASRYLKNLALPVEKILP 372

Query: 368 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSL-SVNCAS 427
           HACRIYEWSMPPDLWLSTNELRLP+RV VMSILI+A+RILYNL+GFG WE+SL S+NC+ 
Sbjct: 373 HACRIYEWSMPPDLWLSTNELRLPTRVTVMSILIVAIRILYNLNGFGAWERSLSSLNCSP 432

Query: 428 SFPLNQKTDSGPANNF-----SNMEADSENRSGFTSHNVDNPSVSPKNSHP-----TATE 487
           S       +S PA+       S M+ D+E  S F S +          SH       + E
Sbjct: 433 S-------NSHPASRLDSMCRSVMQGDAETGSPFYSLDGSAEKFLRNPSHMQMPELDSAE 492

Query: 488 FLRKLEARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQN 547
            L  LE +Y+ IA+ YE++KDLP+YLQYCKDVVFAG+    +DD EE++L+E+LW++YQN
Sbjct: 493 LLHHLEVKYNFIADAYEFTKDLPSYLQYCKDVVFAGAGPSHMDDLEEEELMEKLWDFYQN 552

Query: 548 GKDLEQTEDVDQNAASNQKRLREGSNDRLSI-----ESKKVKGEEERINGESSNNRPGSI 607
            KD E  ++    ++S     +   ND  S+     E +K+K E         ++     
Sbjct: 553 EKDSELAKEPRTQSSSRLSNQKRSRNDDGSVFVNLSEKEKIKEEWHDSPSADISSHNADN 612

Query: 608 NSQQSHSSKSLENSDDDELSSEDKTASTVTSIDG-AIRRLKLDMEEKRFCYIPPRVNPKR 667
           +S QS  +    N+  ++ + E K   +  +++G AIRRLKLDMEE RFCYIPPRVN KR
Sbjct: 613 SSHQSFDNGHFSNNSLEDQNVEHKEKDSEKTLEGRAIRRLKLDMEENRFCYIPPRVNLKR 672

Query: 668 FDYLHYSRKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIH 727
           FDYLHY RK DEGA TY AHADYYILLRACAR AQVDIRIMHIGVLS E+RL+WLE RI 
Sbjct: 673 FDYLHYVRKKDEGAFTYVAHADYYILLRACARVAQVDIRIMHIGVLSFERRLAWLEKRID 732

Query: 728 KSLHLTPSSVSCEFCSDVPDH---VDSIGLSDLDI 736
             LHL+P +++CEFC D+PDH    D IGLS L++
Sbjct: 733 YCLHLSPPTITCEFCRDMPDHNSNDDVIGLSKLNL 751

BLAST of Cla005738 vs. NCBI nr
Match: gi|659082497|ref|XP_008441871.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Cucumis melo])

HSP 1 Score: 1335.1 bits (3454), Expect = 0.0e+00
Identity = 655/737 (88.87%), Positives = 694/737 (94.17%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           MADP NLSC+NCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVA+EDL +RD GKSGGPIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLALRDVGKSGGPIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGNEARKDDDIVMLNDGV 120
           SQSHTRRRNP VLKVEPLSQSQPLFGTSQSEFWDSL+LKEDP GN  + DD IVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQPLFGTSQSEFWDSLHLKEDPSGNVGQNDDGIVMLNDGV 120

Query: 121 GPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLA 180
           GPT PEDFGSG V SGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATP+ICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVWI 240
           ASIWLRFVTA+RVFDEDWAF+TVQ+SESQ LD E IRRV  S+KDEPHN YGQRVVV+W+
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDTERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSA 300
           KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAF+DIE+RIGKTS A
Sbjct: 241 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSK MHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCASS 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMR+LYNLHGFGEWEKSLSVNCAS 
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRVLYNLHGFGEWEKSLSVNCASY 420

Query: 421 FPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLEARYHE 480
           FP NQKT S PANNFSNM+ADSENR GFTSH++DNPSVSP+N H T TEFLRK+EARYHE
Sbjct: 421 FPPNQKTHSSPANNFSNMQADSENRPGFTSHDLDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLEQTEDVD 540
           IAETYEYSKDLP+YLQYCKDVVFAGSESLF+DDHEEQK+IE+LWNYYQN KD +QTEDVD
Sbjct: 481 IAETYEYSKDLPSYLQYCKDVVFAGSESLFIDDHEEQKMIEKLWNYYQNEKDHDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSHSSKSLENS-- 600
           QN ASN KRLREGSNDRLS ESKKVKGEE+ I+GESSNNR GSI+S+QSHSSKSLENS  
Sbjct: 541 QNVASNLKRLREGSNDRLSSESKKVKGEEDCISGESSNNRTGSIDSRQSHSSKSLENSDD 600

Query: 601 DDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGAL 660
           DDDE SSEDK AS++TSI+ AIR+LKLDMEEKRFCYIPPR NPKRF YLHYSRK+DEGAL
Sbjct: 601 DDDEQSSEDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRANPKRFGYLHYSRKIDEGAL 660

Query: 661 TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFC 720
           TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHK+L LTPSSV+CEFC
Sbjct: 661 TYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKTLRLTPSSVTCEFC 720

Query: 721 SDVPDHVDSIGLSDLDI 736
           SDVP+H+DS+GLSDLDI
Sbjct: 721 SDVPNHIDSVGLSDLDI 737
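
The percentages on an HSP summary line are the match counts divided by the alignment length. The following is a minimal Python sketch of that arithmetic, assuming the summary line has the layout shown on this page. The helper name `parse_hsp_summary` is hypothetical and is not part of BLAST or of any database tooling.

```python
import re

# Hypothetical helper (not part of BLAST): parses a summary line such as
# "Identity = 655/737 (88.87%), Positives = 694/737 (94.17%), Query Frame = 1".
# The pattern also accepts the "Postives" spelling seen in some page exports.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_summary(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, m.groups())
    # Percentage = matching positions / aligned positions, to two decimals.
    return (round(100.0 * ident / ident_len, 2),
            round(100.0 * pos / pos_len, 2))
```

For the Cucumis melo hit above (655 of 737 positions identical, 694 of 737 positive), the recomputed values are 88.87 and 94.17, which agree with the reported percentages.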

BLAST of Cla005738 vs. NCBI nr
Match: gi|449470354|ref|XP_004152882.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Cucumis sativus])

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 654/735 (88.98%), Positives = 692/735 (94.15%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           MADP NLSC+NCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVA+EDLV+RD GKSG PIY
Sbjct: 1   MADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYGNEARKDDDIVMLNDGV 120
           SQSHTRRRNP VLKVEPLSQSQ LFGTSQSEFWDSLNL EDP GN   KD DIVMLNDGV
Sbjct: 61  SQSHTRRRNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVMLNDGV 120

Query: 121 GPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGLA 180
           GPT PEDFGSG V SGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATP+ICGLA
Sbjct: 121 GPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPIICGLA 180

Query: 181 ASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVWI 240
           ASIWLRFVTA+RVFDEDWAF+TVQ+SESQ LDPE IRRV  S+KDEPHN YGQRVVV+W+
Sbjct: 181 ASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNFYGQRVVVLWV 240

Query: 241 KSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSSA 300
           KSLRKKIPLF TLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAF+DIE+RIGKTS A
Sbjct: 241 KSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDIESRIGKTSRA 300

Query: 301 CPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360
           CPISSK MHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP
Sbjct: 301 CPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKILP 360

Query: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCASS 420
           HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSV+CAS 
Sbjct: 361 HACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVDCASC 420

Query: 421 FPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSPKNSHPTATEFLRKLEARYHE 480
           FP +QKT S PANNFSNM+ADSENR GFTSH+VDNPSVSP+N H T TEFLRK+EARYHE
Sbjct: 421 FPPHQKTHSSPANNFSNMQADSENRPGFTSHDVDNPSVSPENPHLTTTEFLRKIEARYHE 480

Query: 481 IAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLEQTEDVD 540
           IAETYEYSKDLPTYLQYCKDV FAGSESLF+DDH+EQK+IE+LWNYYQN KD +QTEDVD
Sbjct: 481 IAETYEYSKDLPTYLQYCKDVAFAGSESLFIDDHDEQKMIEKLWNYYQNEKDYDQTEDVD 540

Query: 541 QNAASNQKRLREGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSHSSKSLENSDD 600
           QNAASNQKRLREGSNDRLS ESKKVKGEE+RI+ ES NNR GSI+S+QSHSSKSL+NSDD
Sbjct: 541 QNAASNQKRLREGSNDRLSNESKKVKGEEDRISRESLNNRTGSIDSRQSHSSKSLDNSDD 600

Query: 601 DELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTY 660
           DE SS DK AS++TSI+ AIR+LKLDMEEKRFCYIPPR+NPKRFDYLHYSRK+DEGALTY
Sbjct: 601 DEQSSVDKAASSLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTY 660

Query: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFCSD 720
           AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSL LTP+S++CEFCSD
Sbjct: 661 AAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720

Query: 721 VPDHVDSIGLSDLDI 736
           VPDHV S+GLSDLDI
Sbjct: 721 VPDHVGSVGLSDLDI 735

BLAST of Cla005738 vs. NCBI nr
Match: gi|1009154268|ref|XP_015895077.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Ziziphus jujuba])

HSP 1 Score: 812.8 bits (2098), Expect = 5.0e-232
Identity = 442/770 (57.40%), Positives = 543/770 (70.52%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAG-KSGGPI 60
           M+DP    C  CG++GL DG DGFFYCLQCGSQA+DI+DTGVADED V  DAG ++GG +
Sbjct: 1   MSDPQAWRCNTCGNLGLADGSDGFFYCLQCGSQAEDIVDTGVADEDFV--DAGGETGGAL 60

Query: 61  YSQSHTRRRNPAVLKVEPLSQSQPLFG-TSQSEFWDSLNLKEDPYGNEARK----DDDIV 120
           Y  SH R+RNP+V+K EP+SQS  L   T QS+FW SLNL ++    +  K    D D+ 
Sbjct: 61  YIASHRRQRNPSVIKAEPISQSDFLLSSTVQSQFWASLNLNDETPKRDRVKTVEYDYDVG 120

Query: 121 MLNDGVGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATP 180
             +DGVGPT PEDFG  G     PSFE+Y +E R+RYVMGLQL++E QCE LV+EFK TP
Sbjct: 121 PFSDGVGPTEPEDFGWVG--ESVPSFEDYYNETRIRYVMGLQLMIESQCEALVREFKVTP 180

Query: 181 LICGLAASIWLRFVTASRVFDEDWAFETVQDSESQ-YLDPECIRRVRFSNKDEPHNLYGQ 240
           LICG A +IWLRFV  +RVFD+ W  ET+ +SE+Q + +     + R   K EP N+YGQ
Sbjct: 181 LICGFAGTIWLRFVAGTRVFDDAWVEETIHESETQAHGESPTDFKPRSKYKAEPQNIYGQ 240

Query: 241 RVVVVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENR 300
           R V++W +SLRK+IPL C+LAVSFLACH+AREAILPTD++KWSLEGKLPY+AAF++IEN 
Sbjct: 241 RAVMIWFRSLRKRIPLTCSLAVSFLACHLAREAILPTDLVKWSLEGKLPYFAAFVEIENV 300

Query: 301 IGKTSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLAL 360
           IGK S ACPISS  M RP    S QKLESLAAS+A +I L+LPPVNF++IA  YL KL+L
Sbjct: 301 IGKPSRACPISSSTMFRPRESVSAQKLESLAASVAESICLDLPPVNFYAIASCYLQKLSL 360

Query: 361 PVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSL 420
           PV+KILPHACRIYEWS PPDLWLST+ELRLP+RVCVMSILI+A+RILYN+HGFG+WE+ L
Sbjct: 361 PVEKILPHACRIYEWSTPPDLWLSTSELRLPTRVCVMSILIVAIRILYNIHGFGDWERRL 420

Query: 421 SVNCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHNVDNPSVSP-------KNSHPT 480
           S +  +        +S P+ N   M  D  N SG     VD+   +P       + S   
Sbjct: 421 SNDGEAPSTSYWTGESDPSCN-PKMRDDLTNGSGSPPDIVDDSGTNPVENTSRAQKSELD 480

Query: 481 ATEFLRKLEARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNY 540
           A E L  LEARY+EI ETYEYSKDLPTYLQ+CKDVVF+G E  F +D EE+K+IEQLWNY
Sbjct: 481 AAELLHNLEARYNEIVETYEYSKDLPTYLQFCKDVVFSGLEPSF-EDREEEKMIEQLWNY 540

Query: 541 YQNGKDLEQTEDVDQ---NAASNQKRLR--EGSNDRLSIESKKVKGEEERINGESSNNRP 600
           YQN KD E   + +    + A +QKR R  EG   RL  E KK++  +  ++G SSN+  
Sbjct: 541 YQNNKDSETASEKEMLCGSGAVSQKRPRTDEGCTSRLPKEKKKIRDRD--VSGSSSNDDD 600

Query: 601 GSINSQQ------SHSSKSLE---NSD-DDELSSEDKT-ASTVTSIDGAIRRLKLDMEEK 660
                 Q       HS  SL+   NSD +D++S+E     +T T+ + A+RR+KLDMEE 
Sbjct: 601 SYHTGNQQWSQNGDHSFDSLQGGRNSDSNDQISAETLVDETTETTKNEAVRRIKLDMEEN 660

Query: 661 RFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLS 720
            FCYIPPRVN KRF YL Y RK DEGA TYAAHADYYILLRACA+ AQV+IR MHIGVL+
Sbjct: 661 NFCYIPPRVNIKRFGYLFYVRKKDEGAFTYAAHADYYILLRACAKTAQVEIRCMHIGVLN 720

Query: 721 LEKRLSWLEDRIHKSLHLTPSSVSCEFCS-----DVPDHVDSIGLSDLDI 736
           LE+RL+WLE RI+  LHLTP  VSCEFCS     +  D     G S+L+I
Sbjct: 721 LERRLAWLEHRINHCLHLTPPIVSCEFCSGMGQGNATDEDGLHGFSNLNI 762

BLAST of Cla005738 vs. NCBI nr
Match: gi|703061724|ref|XP_010086553.1| (hypothetical protein L484_001445 [Morus notabilis])

HSP 1 Score: 753.8 bits (1945), Expect = 2.8e-214
Identity = 411/775 (53.03%), Positives = 528/775 (68.13%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           M+DP   +C  CG+ G  DGFDGF+YCL+CGSQA+DII+TGVADED   +  G +G P+Y
Sbjct: 1   MSDPHAWTCHTCGNAGFADGFDGFYYCLRCGSQAEDIIETGVADEDFADK-GGTAGAPLY 60

Query: 61  SQSHTRRRNPAV--LKVEPLSQSQPLFGTSQSEFWDSLNLKEDPYG------NEAR-KDD 120
           S +H R R  A   +K EP+SQ Q    T QS+FW +L L +D  G      N A  K +
Sbjct: 61  SATHRRNRPVAASAIKAEPISQVQS--ATLQSQFWAALTLDDDGEGEGGDRFNRASIKTE 120

Query: 121 DIVMLNDGVGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFK 180
           +I    DGVGPT P DFGS G     PSFEEY  ++R+RYVMGLQL++E QCE LV+EFK
Sbjct: 121 EIEF--DGVGPTGPRDFGSVG--ESVPSFEEYYSDIRIRYVMGLQLMIEFQCEALVREFK 180

Query: 181 ATPLICGLAASIWLRFVTASRVFDEDWAFETVQDSESQYL-DPECIRRVRFSNKDEPHNL 240
             PLICGLA ++WLRFV  +RVFD+ W  E++ +SESQ   +P    + R     EPHN+
Sbjct: 181 VNPLICGLAGTVWLRFVAGTRVFDDGWVDESINESESQQQGEPPQDFKPRAKYGSEPHNM 240

Query: 241 YGQRVVVVWIKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDI 300
           YGQR V++W +SLRKKIPL  TL VSFLACH+ARE +L TDI+KWSLEGK+PY+AAF++I
Sbjct: 241 YGQRAVMIWFRSLRKKIPLSYTLGVSFLACHLAREPVLTTDIVKWSLEGKVPYFAAFVEI 300

Query: 301 ENRIGKTSSACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 360
           E  +G+ SS CPIS+  M RP+   + QKLESL+ASIA ++GL LPPVNF+SIA RYL +
Sbjct: 301 EKHMGQRSSGCPISTTLMFRPNESVTAQKLESLSASIADSVGLALPPVNFYSIASRYLRE 360

Query: 361 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 420
           L++P++KILPHA R+YEWSMPPDLWLSTNELRLP+RVCVMS+LI+A+RILYN+HGFGEWE
Sbjct: 361 LSIPLEKILPHARRMYEWSMPPDLWLSTNELRLPTRVCVMSMLIVAIRILYNIHGFGEWE 420

Query: 421 KSLSVNCASSFPLNQKTDSGPANNFSNMEADSENRSGFTSHN---VDNPSVSP--KNSHP 480
           KSL      S      T  G  ++    + D +  +G  S +   +D+   +P    SH 
Sbjct: 421 KSLCRRHTCS------TSKGTEDSELKPDLDVKECAGKGSGSPQILDDSGTNPGRDTSHA 480

Query: 481 TATE-----FLRKLEARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLI 540
             TE      L  LEARY  I ETYEYSKDLP+YLQ+CKDVVFAG E  F +D+EE++LI
Sbjct: 481 QKTELDAAKLLCDLEARYRGINETYEYSKDLPSYLQFCKDVVFAGLEPSF-EDYEEKRLI 540

Query: 541 EQLWNYYQNGKDLEQT--EDVDQNAASNQKRLREGSNDR-LSIESKKVKGEEERINGESS 600
           E+LW++Y + +D +    E+   + A  QKR R     R    +  K   ++  +   S 
Sbjct: 541 EELWDFYWSERDSKTAVQEEAHGSEAVTQKRARVDVECRSFPFKENKRFRDKGCVGDPSP 600

Query: 601 NNRPGSINSQQSHSSKSLENSDDDELSSE----------DKTASTVTS------IDGAIR 660
            N   S   Q S +S   ++S DD++S +          D+TA T+         D AIR
Sbjct: 601 YNGSRSAGDQHSENSDQFDSSQDDQISEQKDQTSADSLKDETADTLKDETSEILKDEAIR 660

Query: 661 RLKLDMEEKRFCYIPPRVNPKRFDYLHYSRKMDEGALTYAAHADYYILLRACARAAQVDI 720
            LK D EE RF YIPPRVNPKRFDYLHY+RK DEGALTY AHADYYILLRACA+AA++DI
Sbjct: 661 LLKSDFEENRFYYIPPRVNPKRFDYLHYARKKDEGALTYVAHADYYILLRACAKAAKIDI 720

Query: 721 RIMHIGVLSLEKRLSWLEDRIHKSLHLTPSSVSCEFCSDVPD-HVDSIGLSDLDI 736
           R+MHIGVLS E+RL+W+E RI+  LHL P++V CEFC+ + +  V+S+GLS+L+I
Sbjct: 721 RLMHIGVLSFERRLAWIEQRINHCLHLKPATVFCEFCNYLGNASVESLGLSNLNI 761

BLAST of Cla005738 vs. NCBI nr
Match: gi|225425686|ref|XP_002269865.1| (PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B [Vitis vinifera])

HSP 1 Score: 751.9 bits (1940), Expect = 1.1e-213
Identity = 400/730 (54.79%), Positives = 506/730 (69.32%), Query Frame = 1

Query: 1   MADPCNLSCFNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVADEDLVIRDAGKSGGPIY 60
           M +  +L+C  CGS+G +DG DGFFYC +CGSQA+DIIDTGVA+ED V +  G + G IY
Sbjct: 1   MPERLDLTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAK--GDARGAIY 60

Query: 61  SQSHTRRRNPAVLKVEPLSQSQPLFGTSQSEFWDSLNLKED-PYGNEARKDDDIVMLNDG 120
           S SH R+R+    K EPLSQSQ       S+F ++L L +D    NE  +++ +    D 
Sbjct: 61  SASHRRQRHSIAPKPEPLSQSQ-------SQFLNNLTLDDDYRVENEETREETVA---DE 120

Query: 121 VGPTRPEDFGSGGVPSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEFKATPLICGL 180
           VGP+ P DFG G   S   SFE+Y  ++R+RYVMG+Q+++ELQC+ LV++FKA+PLICG+
Sbjct: 121 VGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGV 180

Query: 181 AASIWLRFVTASRVFDEDWAFETVQDSESQYLDPECIRRVRFSNKDEPHNLYGQRVVVVW 240
           A +IWLRFV  +RVFD++WA + +QDSE Q        + R     EPHN+YGQR V++W
Sbjct: 181 AGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIW 240

Query: 241 IKSLRKKIPLFCTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFIDIENRIGKTSS 300
            +SL+KKIPL C+L +SFLACH+AREAILPTDI+KWSLEGKLPY+AAFI+IE +IG  SS
Sbjct: 241 HRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSS 300

Query: 301 ACPISSKHMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNKLALPVDKIL 360
            CP+SS  M RPS    LQKLE+ AASIA  IGL+LPPVNF++IA RYL +L LPV+KIL
Sbjct: 301 PCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKIL 360

Query: 361 PHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWEKSLSVNCAS 420
           P+ACR+YEWSMPPDLWLS NELRLP+RVCVMSILI+ +RILYN+HGFG+WE SLS +  S
Sbjct: 361 PYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSSSGS 420

Query: 421 SFPLNQKTDSGPANNFSNMEADSE-----NRSGFTSHNVDNPSVSPKNSHPTATEFLRKL 480
           S   +Q      ++N   M+   +     + +G     V N S + K S   ATE L  L
Sbjct: 421 SSSSSQIVKLNASDNIKMMDGAKQGSPLHDLNGSNEEPVTNSSHAQK-SEFDATELLCNL 480

Query: 481 EARYHEIAETYEYSKDLPTYLQYCKDVVFAGSESLFVDDHEEQKLIEQLWNYYQNGKDLE 540
           +ARY E+ +TYEYSKDLPTYLQYCKDVVFAG E  F +DHEE+K+IEQLW +YQN KD E
Sbjct: 481 DARYDELIDTYEYSKDLPTYLQYCKDVVFAGLELPF-EDHEEEKIIEQLWEFYQNQKDSE 540

Query: 541 QTED--VDQNAASNQKRLR--EGSNDRLSIESKKVKGEEERINGESSNNRPGSINSQQSH 600
            +ED  V+  +A N+KR R  EG  + +  E KK++ +     G   ++   S+NSQ   
Sbjct: 541 PSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRDDCSVPLGLDGDDT--SLNSQGGQ 600

Query: 601 SSKSLENSDDDELSSEDKTASTVTSIDGAIRRLKLDMEEKRFCYIPPRVNPKRFDYLHYS 660
            S     +  + L  E            AI R+K DMEE RFCYIPPRVN KRFDYLHY 
Sbjct: 601 KSVPTHQASVETLKEE------------AILRMKADMEENRFCYIPPRVNVKRFDYLHYV 660

Query: 661 RKMDEGALTYAAHADYYILLRACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLHLTP 720
           RK DEG+  YAAHADYYILLRACAR AQVD+R MH+GV+SLE+RL W+E RI   LH  P
Sbjct: 661 RKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDHCLHFKP 702

The following BLAST results are available for this feature:
Match Name     E-value     Identity   Description
MEE12_ARATH    3.8e-139    41.05      TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Arabido...
TAF1B_ORYSJ    9.9e-87     37.23      TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza s...
TAF1B_ORYSI    3.7e-86     42.25      TATA box-binding protein-associated factor RNA polymerase I subunit B OS=Oryza s...

Match Name          E-value     Identity   Description
A0A0A0LHI6_CUCSA    0.0e+00     88.98      Uncharacterized protein OS=Cucumis sativus GN=Csa_2G074100 PE=4 SV=1
W9QI88_9ROSA        1.9e-214    53.03      Uncharacterized protein OS=Morus notabilis GN=L484_001445 PE=4 SV=1
F6HPR5_VITVI        7.3e-214    54.79      Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02430 PE=4 SV=...
A5ACR3_VITVI        6.2e-213    54.63      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027088 PE=4 SV=1
B9S8W3_RICCO        5.1e-207    51.92      Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0836380 PE=4 SV=1

Match Name                           E-value     Identity   Description
gi|659082497|ref|XP_008441871.1|     0.0e+00     88.87      PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B...
gi|449470354|ref|XP_004152882.1|     0.0e+00     88.98      PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B...
gi|1009154268|ref|XP_015895077.1|    5.0e-232    57.40      PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B...
gi|703061724|ref|XP_010086553.1|     2.8e-214    53.03      hypothetical protein L484_001445 [Morus notabilis]
gi|225425686|ref|XP_002269865.1|     1.1e-213    54.79      PREDICTED: TATA box-binding protein-associated factor RNA polymerase I subunit B...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
biological_process    GO:0001188       RNA polymerase I transcriptional preinitiation complex assembly
biological_process    GO:0001189       RNA polymerase I transcriptional preinitiation complex assembly at the promoter for the nuclear large rRNA transcript
biological_process    GO:0006360       transcription from RNA polymerase I promoter
biological_process    GO:0042790       transcription of nuclear large rRNA transcript from RNA polymerase I promoter
cellular_component    GO:0005575       cellular_component
cellular_component    GO:0070860       RNA polymerase I core factor complex
cellular_component    GO:0000120       RNA polymerase I transcription factor complex
cellular_component    GO:0005668       RNA polymerase transcription factor SL1 complex
molecular_function    GO:0003674       molecular_function
molecular_function    GO:0001164       RNA polymerase I CORE element sequence-specific DNA binding
molecular_function    GO:0001187       transcription factor activity, RNA polymerase I CORE element binding transcription factor recruiting
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                            Sequence type in Unigene
WMU57547        watermelon EST collection version 2.0    transcribed_cluster
WMU64785        watermelon EST collection version 2.0    transcribed_cluster
WMU68266        watermelon EST collection version 2.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla005738       Cla005738.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
WMU57547        WMU57547       transcribed_cluster
WMU68266        WMU68266       transcribed_cluster
WMU64785        WMU64785       transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term    IPR Description     Source     Source Term    Source Description    Alignment
None        No IPR available    PANTHER    PTHR31576      FAMILY NOT NAMED      coord: 1..55, score: 5.6E-110; coord: 125..724, score: 5.6E