CmaCh14G007840 (gene) Cucurbita maxima (Rimu)

NameCmaCh14G007840
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionWD40 repeat protein
LocationCma_Chr14 : 3911963 .. 3913003 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTGCAAACAGCGACACCAATCCGGATGCTTCCGACGAGCAGCAGAAGCGGTCTGAGATCTATACCTACGAGGCGCCATGGCACATCTACGCCATGAACTGGAGCGTCCGCCGTGACAAGAAGTACCGCCTCGCCATTGCTAGCCTTCTCGAGCAGTATCCCAACCGTGTCGAGATTGTCCAACTCGATGATTCCAGTGGTGAGATTCGCTCTGACCCTAATCTCTCCTTCGAGCATCCCTATCCTCCCACCAAGACCATCTTCATCCCGGATAAGGAGTGCCAGCGCCCTGATCTCCTCGCTACTTCCAGCGACTTTCTCCGTGTTTGGCGCATCTCGGATGACTCTTCTTCGGTGGAGCTCAAGAGCCTTCTTAATGGCAACAAGAACAGCGAGTTTTGTGGTCCTCTTACCTCCTTTGATTGGAACGATGCCGAGCCCAAGCGTATTGGAACCTCCAGTATCGATACTACCTGCACAATCTGGGATATTGAGCGGGAGACCGTTGATACGCAACTTATCGCCCATGATAAGGAAGTCTACGATATCGCCTGGGGCGGCGTTGGTGTATTTGCTTCCGTTTCTGCCGACGGTTCCGTCCGGGTCTTCGATTTGCGCGACAAGGAGCACTCTACCATCATCTACGAGAGCTCCGAGCCTGACACTCCCTTGGTTCGACTAGGCTGGAACAAGCAGGACCCTAGATATATGGCTACAATTATCATGGACAGCGCCAAGGTCGTCGTTCTTGACATTCGATTCCCAACACTCCCCGTCGTCGAGTTACAGAGACACCAAGCTAGTGTCAACGCTATTGCTTGGGCGCCGCATAGTTCTTGCCACATCTGCACCGCCGGGGATGATTCTCAGGCCTTGATTTGGGACTTGTCGTCCATGGGGCAACCCGTCGAAGGTGGCCTCGATCCCATTCTTGCATATACAGCTGGAGCAGAAATCGAGCAGCTGCAATGGTCCTCTTCCCAGCCGGACTGGGTTGCGATTGCCTTTTCGACTAAGCTTCAGATTCTAAGGGTATGA

mRNA sequence

ATGGGTGCAAACAGCGACACCAATCCGGATGCTTCCGACGAGCAGCAGAAGCGGTCTGAGATCTATACCTACGAGGCGCCATGGCACATCTACGCCATGAACTGGAGCGTCCGCCGTGACAAGAAGTACCGCCTCGCCATTGCTAGCCTTCTCGAGCAGTATCCCAACCGTGTCGAGATTGTCCAACTCGATGATTCCAGTGGTGAGATTCGCTCTGACCCTAATCTCTCCTTCGAGCATCCCTATCCTCCCACCAAGACCATCTTCATCCCGGATAAGGAGTGCCAGCGCCCTGATCTCCTCGCTACTTCCAGCGACTTTCTCCGTGTTTGGCGCATCTCGGATGACTCTTCTTCGGTGGAGCTCAAGAGCCTTCTTAATGGCAACAAGAACAGCGAGTTTTGTGGTCCTCTTACCTCCTTTGATTGGAACGATGCCGAGCCCAAGCGTATTGGAACCTCCAGTATCGATACTACCTGCACAATCTGGGATATTGAGCGGGAGACCGTTGATACGCAACTTATCGCCCATGATAAGGAAGTCTACGATATCGCCTGGGGCGGCGTTGGTGTATTTGCTTCCGTTTCTGCCGACGGTTCCGTCCGGGTCTTCGATTTGCGCGACAAGGAGCACTCTACCATCATCTACGAGAGCTCCGAGCCTGACACTCCCTTGGTTCGACTAGGCTGGAACAAGCAGGACCCTAGATATATGGCTACAATTATCATGGACAGCGCCAAGGTCGTCGTTCTTGACATTCGATTCCCAACACTCCCCGTCGTCGAGTTACAGAGACACCAAGCTAGTGTCAACGCTATTGCTTGGGCGCCGCATAGTTCTTGCCACATCTGCACCGCCGGGGATGATTCTCAGGCCTTGATTTGGGACTTGTCGTCCATGGGGCAACCCGTCGAAGGTGGCCTCGATCCCATTCTTGCATATACAGCTGGAGCAGAAATCGAGCAGCTGCAATGGTCCTCTTCCCAGCCGGACTGGGTTGCGATTGCCTTTTCGACTAAGCTTCAGATTCTAAGGGTATGA

Coding sequence (CDS)

ATGGGTGCAAACAGCGACACCAATCCGGATGCTTCCGACGAGCAGCAGAAGCGGTCTGAGATCTATACCTACGAGGCGCCATGGCACATCTACGCCATGAACTGGAGCGTCCGCCGTGACAAGAAGTACCGCCTCGCCATTGCTAGCCTTCTCGAGCAGTATCCCAACCGTGTCGAGATTGTCCAACTCGATGATTCCAGTGGTGAGATTCGCTCTGACCCTAATCTCTCCTTCGAGCATCCCTATCCTCCCACCAAGACCATCTTCATCCCGGATAAGGAGTGCCAGCGCCCTGATCTCCTCGCTACTTCCAGCGACTTTCTCCGTGTTTGGCGCATCTCGGATGACTCTTCTTCGGTGGAGCTCAAGAGCCTTCTTAATGGCAACAAGAACAGCGAGTTTTGTGGTCCTCTTACCTCCTTTGATTGGAACGATGCCGAGCCCAAGCGTATTGGAACCTCCAGTATCGATACTACCTGCACAATCTGGGATATTGAGCGGGAGACCGTTGATACGCAACTTATCGCCCATGATAAGGAAGTCTACGATATCGCCTGGGGCGGCGTTGGTGTATTTGCTTCCGTTTCTGCCGACGGTTCCGTCCGGGTCTTCGATTTGCGCGACAAGGAGCACTCTACCATCATCTACGAGAGCTCCGAGCCTGACACTCCCTTGGTTCGACTAGGCTGGAACAAGCAGGACCCTAGATATATGGCTACAATTATCATGGACAGCGCCAAGGTCGTCGTTCTTGACATTCGATTCCCAACACTCCCCGTCGTCGAGTTACAGAGACACCAAGCTAGTGTCAACGCTATTGCTTGGGCGCCGCATAGTTCTTGCCACATCTGCACCGCCGGGGATGATTCTCAGGCCTTGATTTGGGACTTGTCGTCCATGGGGCAACCCGTCGAAGGTGGCCTCGATCCCATTCTTGCATATACAGCTGGAGCAGAAATCGAGCAGCTGCAATGGTCCTCTTCCCAGCCGGACTGGGTTGCGATTGCCTTTTCGACTAAGCTTCAGATTCTAAGGGTATGA

Protein sequence

MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEIVQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSVELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMGQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
BLAST of CmaCh14G007840 vs. Swiss-Prot
Match: LWD1_ARATH (WD repeat-containing protein LWD1 OS=Arabidopsis thaliana GN=LWD1 PE=2 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 1.3e-190
Identity = 323/346 (93.35%), Postives = 332/346 (95.95%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MG +SD   D SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAI SLLEQYPNRVEI
Sbjct: 1   MGTSSDPIQDGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAITSLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLD+S+GEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLR+WRI+DD S V
Sbjct: 61  VQLDESNGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRLWRIADDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKS LN NKNSEFCGPLTSFDWN+AEP+RIGTSS DTTCTIWDIERE VDTQLIAHDKE
Sbjct: 121 ELKSCLNSNKNSEFCGPLTSFDWNEAEPRRIGTSSTDTTCTIWDIEREAVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           V+DIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VFDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFP LPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWD+SSM
Sbjct: 241 IIMDSAKVVVLDIRFPALPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDISSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQ VEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQHVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. Swiss-Prot
Match: LWD2_ARATH (WD repeat-containing protein LWD2 OS=Arabidopsis thaliana GN=LWD2 PE=2 SV=1)

HSP 1 Score: 626.7 bits (1615), Expect = 1.5e-178
Identity = 302/346 (87.28%), Postives = 320/346 (92.49%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           M  +SD   + S+EQ KRSEIYTYEAPW IYAMNWS+RRDKKYRLAI SL+EQYPNRVEI
Sbjct: 1   MVTSSDQIQNGSEEQSKRSEIYTYEAPWQIYAMNWSIRRDKKYRLAITSLIEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLD+S+GEIRSDPNL FEHPYPPTKT FIPDKECQRPDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDESNGEIRSDPNLCFEHPYPPTKTSFIPDKECQRPDLLATSSDFLRLWRISDDESRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKS L+ +KNSEF GP+TSFDWN+AEP+RIGTSSIDTTCTIWDIERE VDTQLIAHDKE
Sbjct: 121 ELKSCLSSDKNSEFSGPITSFDWNEAEPRRIGTSSIDTTCTIWDIEREVVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVS DGSVRVFDLRDKEHSTIIYES EP TPLVRL WNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSEDGSVRVFDLRDKEHSTIIYESGEPSTPLVRLSWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           +IM SAK+VVLDIRFP LPVVELQRHQASVNAIAWAPHSS HIC+AGDDSQALIWD+SSM
Sbjct: 241 VIMGSAKIVVLDIRFPALPVVELQRHQASVNAIAWAPHSSSHICSAGDDSQALIWDISSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQ VEGGLDPILAYTAGAE+EQLQWSSSQPDWVAIAFS KLQILRV
Sbjct: 301 GQHVEGGLDPILAYTAGAEVEQLQWSSSQPDWVAIAFSNKLQILRV 346

BLAST of CmaCh14G007840 vs. Swiss-Prot
Match: TTG1_ARATH (Protein TRANSPARENT TESTA GLABRA 1 OS=Arabidopsis thaliana GN=TTG1 PE=1 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 1.3e-126
Identity = 220/340 (64.71%), Postives = 268/340 (78.82%), Query Frame = 1

Query: 11  ASDEQQKRSEIYTYEAPWHIYAMNWS-VRRDKKYRLAIASLLEQYPNRVEIVQLDDSSGE 70
           A D   +     TY++P+ +YAM +S +R    +R+A+ S LE Y NR++I+  D  S  
Sbjct: 5   APDSLSRSETAVTYDSPYPLYAMAFSSLRSSSGHRIAVGSFLEDYNNRIDILSFDSDSMT 64

Query: 71  IRSDPNLSFEHPYPPTKTIFIPDKECQRP---DLLATSSDFLRVWRISDDSSSVELKSLL 130
           ++  PNLSFEHPYPPTK +F P    +RP   DLLA+S DFLR+W I++DSS+VE  S+L
Sbjct: 65  VKPLPNLSFEHPYPPTKLMFSPPS-LRRPSSGDLLASSGDFLRLWEINEDSSTVEPISVL 124

Query: 131 NGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAW 190
           N +K SEFC PLTSFDWND EPKR+GT SIDTTCTIWDIE+  V+TQLIAHDKEV+DIAW
Sbjct: 125 NNSKTSEFCAPLTSFDWNDVEPKRLGTCSIDTTCTIWDIEKSVVETQLIAHDKEVHDIAW 184

Query: 191 GGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSA 250
           G   VFASVSADGSVR+FDLRDKEHSTIIYES +PDTPL+RL WNKQD RYMATI+MDS 
Sbjct: 185 GEARVFASVSADGSVRIFDLRDKEHSTIIYESPQPDTPLLRLAWNKQDLRYMATILMDSN 244

Query: 251 KVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMGQPVEG 310
           KVV+LDIR PT+PV EL+RHQASVNAIAWAP S  HIC+ GDD+QALIW+L ++  P   
Sbjct: 245 KVVILDIRSPTMPVAELERHQASVNAIAWAPQSCKHICSGGDDTQALIWELPTVAGP--N 304

Query: 311 GLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           G+DP+  Y+AG+EI QLQWSSSQPDW+ IAF+ K+Q+LRV
Sbjct: 305 GIDPMSVYSAGSEINQLQWSSSQPDWIGIAFANKMQLLRV 341

BLAST of CmaCh14G007840 vs. Swiss-Prot
Match: DCAF7_DICDI (DDB1- and CUL4-associated factor 7 homolog OS=Dictyostelium discoideum GN=wdr68 PE=3 SV=2)

HSP 1 Score: 411.0 bits (1055), Expect = 1.3e-113
Identity = 198/331 (59.82%), Postives = 251/331 (75.83%), Query Frame = 1

Query: 17  KRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEIVQLDDSSGEIRSDPNL 76
           ++  IYTY +PW IY ++WS R ++ +RLAI S LE Y NRV+++QL++ + +   +   
Sbjct: 2   EKKRIYTYNSPWVIYGLSWSSRVNRPFRLAIGSFLEDYTNRVDVIQLNEETDQF--EVVC 61

Query: 77  SFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSVELKSLLNGNKNSEFCG 136
            FEHPYPPTK ++IPDK   RPDLLAT+ D+LR+W +  +  S++LKSLL  N  SEFC 
Sbjct: 62  GFEHPYPPTKCMWIPDKNSNRPDLLATTGDYLRLWEVGSNQRSIKLKSLLT-NVISEFCA 121

Query: 137 PLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAWG-GVGVFASV 196
           PL+SFDWN+ +P  + TSSIDTTCTIW+IE     TQLIAHDKEV+DIA+  G  +FASV
Sbjct: 122 PLSSFDWNETDPSLLATSSIDTTCTIWNIETGQAKTQLIAHDKEVFDIAFARGTDLFASV 181

Query: 197 SADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSAKVVVLDIRF 256
            ADGS+R+FDLR+ EHSTIIYE+     PL+RL WNKQDP Y+ATI  DS KV++LDIR 
Sbjct: 182 GADGSLRMFDLRNLEHSTIIYETPS-FVPLLRLCWNKQDPNYLATIQQDSPKVIILDIRV 241

Query: 257 PTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMGQPVEGGLDPILAYT 316
           P++P  EL  H+++VN I+WAPHSSCHICT  DD QALIWDLSSM +P+E   DP+L Y 
Sbjct: 242 PSVPAAELVFHKSAVNGISWAPHSSCHICTVSDDKQALIWDLSSMPKPIE---DPLLTYN 301

Query: 317 AGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           A AEI QL WSSSQPDW+AIAFS+ LQIL+V
Sbjct: 302 ALAEINQLSWSSSQPDWIAIAFSSHLQILKV 325

BLAST of CmaCh14G007840 vs. Swiss-Prot
Match: DCAF7_HUMAN (DDB1- and CUL4-associated factor 7 OS=Homo sapiens GN=DCAF7 PE=1 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 4.9e-113
Identity = 203/345 (58.84%), Postives = 249/345 (72.17%), Query Frame = 1

Query: 17  KRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEIVQLDDSSGEIRSDPNL 76
           KR EIY YEAPW +YAMNWSVR DK++RLA+ S +E+Y N+V++V LD+ S E       
Sbjct: 6   KRKEIYKYEAPWTVYAMNWSVRPDKRFRLALGSFVEEYNNKVQLVGLDEESSEFIC--RN 65

Query: 77  SFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSVELKSLLNGNKNSEFCG 136
           +F+HPYP TK ++IPD +   PDLLATS D+LRVWR+ +  +   L+ LLN NKNS+FC 
Sbjct: 66  TFDHPYPTTKLMWIPDTKGVYPDLLATSGDYLRVWRVGE--TETRLECLLNNNKNSDFCA 125

Query: 137 PLTSFDWNDAEPKRIGTSSIDTTCTIWDIERET-----------VDTQLIAHDKEVYDIA 196
           PLTSFDWN+ +P  +GTSSIDTTCTIW +E              V TQLIAHDKEVYDIA
Sbjct: 126 PLTSFDWNEVDPYLLGTSSIDTTCTIWGLETGQVLGRVNLVSGHVKTQLIAHDKEVYDIA 185

Query: 197 W----GGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATI 256
           +    GG  +FASV ADGSVR+FDLR  EHSTIIYE  +   PL+RL WNKQDP Y+AT+
Sbjct: 186 FSRAGGGRDMFASVGADGSVRMFDLRHLEHSTIIYEDPQ-HHPLLRLCWNKQDPNYLATM 245

Query: 257 IMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMG 316
            MD  +VV+LD+R P  PV  L  H+A VN IAWAPHSSCHICTA DD QALIWD+  M 
Sbjct: 246 AMDGMEVVILDVRVPCTPVARLNNHRACVNGIAWAPHSSCHICTAADDHQALIWDIQQMP 305

Query: 317 QPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           + +E   DPILAYTA  EI  +QW+S+QPDW+AI ++  L+ILRV
Sbjct: 306 RAIE---DPILAYTAEGEINNVQWASTQPDWIAICYNNCLEILRV 342

BLAST of CmaCh14G007840 vs. TrEMBL
Match: A0A0A0LCB1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G253490 PE=4 SV=1)

HSP 1 Score: 700.3 bits (1806), Expect = 1.2e-198
Identity = 342/346 (98.84%), Postives = 343/346 (99.13%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA+SD N DASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGASSDPNQDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDD SSV
Sbjct: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDPSSV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. TrEMBL
Match: A0A061EDG3_THECC (Transducin/WD40 repeat-like superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_017435 PE=4 SV=1)

HSP 1 Score: 688.7 bits (1776), Expect = 3.6e-195
Identity = 333/346 (96.24%), Postives = 341/346 (98.55%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA SD NP+ SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNR+EI
Sbjct: 1   MGAISDPNPEGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRLEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSF+HPYPPTKTIFIPDKECQ+PDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDDSNGEIRSDPNLSFDHPYPPTKTIFIPDKECQKPDLLATSSDFLRIWRISDDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPR+MAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRFMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. TrEMBL
Match: A0A061EL36_THECC (Transducin/WD40 repeat-like superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_017435 PE=4 SV=1)

HSP 1 Score: 687.6 bits (1773), Expect = 8.0e-195
Identity = 332/346 (95.95%), Postives = 341/346 (98.55%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA SD NP+ SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNR+EI
Sbjct: 1   MGAISDPNPEGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRLEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSF+HPYPPTKTIFIPDKECQ+PDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDDSNGEIRSDPNLSFDHPYPPTKTIFIPDKECQKPDLLATSSDFLRIWRISDDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPR+MAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRFMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILR+
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRL 346

BLAST of CmaCh14G007840 vs. TrEMBL
Match: B9IA35_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s13680g PE=4 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 8.8e-194
Identity = 330/346 (95.38%), Postives = 339/346 (97.98%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA+SD N D SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGASSDPNQDGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSFEHPYPPTKTIFIPDKECQ+PDLLATSSDFLRVWRI+D+   V
Sbjct: 61  VQLDDSNGEIRSDPNLSFEHPYPPTKTIFIPDKECQKPDLLATSSDFLRVWRINDEQPRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEP+RIGTSSIDTTCTIWDIE+ETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPRRIGTSSIDTTCTIWDIEKETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRH ASVNA+AWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHHASVNAVAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. TrEMBL
Match: B9GTC9_POPTR (Transducin family protein OS=Populus trichocarpa GN=POPTR_0002s22620g PE=4 SV=1)

HSP 1 Score: 682.2 bits (1759), Expect = 3.4e-193
Identity = 329/346 (95.09%), Postives = 338/346 (97.69%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MG +SD N D SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGGSSDPNQDGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLD+S+GEIRSDPNLSFEHPYPPTKTIFIPDKECQ+PDLLATSSDFLRVWRI+D+   V
Sbjct: 61  VQLDESNGEIRSDPNLSFEHPYPPTKTIFIPDKECQKPDLLATSSDFLRVWRINDEQPRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEP+RIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPRRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRH ASVNA+AWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHHASVNAVAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. TAIR10
Match: AT1G12910.1 (AT1G12910.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 666.8 bits (1719), Expect = 7.4e-192
Identity = 323/346 (93.35%), Postives = 332/346 (95.95%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MG +SD   D SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAI SLLEQYPNRVEI
Sbjct: 1   MGTSSDPIQDGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAITSLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLD+S+GEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLR+WRI+DD S V
Sbjct: 61  VQLDESNGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRLWRIADDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKS LN NKNSEFCGPLTSFDWN+AEP+RIGTSS DTTCTIWDIERE VDTQLIAHDKE
Sbjct: 121 ELKSCLNSNKNSEFCGPLTSFDWNEAEPRRIGTSSTDTTCTIWDIEREAVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           V+DIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VFDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFP LPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWD+SSM
Sbjct: 241 IIMDSAKVVVLDIRFPALPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDISSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQ VEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQHVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. TAIR10
Match: AT3G26640.1 (AT3G26640.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 626.7 bits (1615), Expect = 8.5e-180
Identity = 302/346 (87.28%), Postives = 320/346 (92.49%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           M  +SD   + S+EQ KRSEIYTYEAPW IYAMNWS+RRDKKYRLAI SL+EQYPNRVEI
Sbjct: 1   MVTSSDQIQNGSEEQSKRSEIYTYEAPWQIYAMNWSIRRDKKYRLAITSLIEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLD+S+GEIRSDPNL FEHPYPPTKT FIPDKECQRPDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDESNGEIRSDPNLCFEHPYPPTKTSFIPDKECQRPDLLATSSDFLRLWRISDDESRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKS L+ +KNSEF GP+TSFDWN+AEP+RIGTSSIDTTCTIWDIERE VDTQLIAHDKE
Sbjct: 121 ELKSCLSSDKNSEFSGPITSFDWNEAEPRRIGTSSIDTTCTIWDIEREVVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVS DGSVRVFDLRDKEHSTIIYES EP TPLVRL WNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSEDGSVRVFDLRDKEHSTIIYESGEPSTPLVRLSWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           +IM SAK+VVLDIRFP LPVVELQRHQASVNAIAWAPHSS HIC+AGDDSQALIWD+SSM
Sbjct: 241 VIMGSAKIVVLDIRFPALPVVELQRHQASVNAIAWAPHSSSHICSAGDDSQALIWDISSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQ VEGGLDPILAYTAGAE+EQLQWSSSQPDWVAIAFS KLQILRV
Sbjct: 301 GQHVEGGLDPILAYTAGAEVEQLQWSSSQPDWVAIAFSNKLQILRV 346

BLAST of CmaCh14G007840 vs. TAIR10
Match: AT5G24520.1 (AT5G24520.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 454.1 bits (1167), Expect = 7.5e-128
Identity = 220/340 (64.71%), Postives = 268/340 (78.82%), Query Frame = 1

Query: 11  ASDEQQKRSEIYTYEAPWHIYAMNWS-VRRDKKYRLAIASLLEQYPNRVEIVQLDDSSGE 70
           A D   +     TY++P+ +YAM +S +R    +R+A+ S LE Y NR++I+  D  S  
Sbjct: 5   APDSLSRSETAVTYDSPYPLYAMAFSSLRSSSGHRIAVGSFLEDYNNRIDILSFDSDSMT 64

Query: 71  IRSDPNLSFEHPYPPTKTIFIPDKECQRP---DLLATSSDFLRVWRISDDSSSVELKSLL 130
           ++  PNLSFEHPYPPTK +F P    +RP   DLLA+S DFLR+W I++DSS+VE  S+L
Sbjct: 65  VKPLPNLSFEHPYPPTKLMFSPPS-LRRPSSGDLLASSGDFLRLWEINEDSSTVEPISVL 124

Query: 131 NGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAW 190
           N +K SEFC PLTSFDWND EPKR+GT SIDTTCTIWDIE+  V+TQLIAHDKEV+DIAW
Sbjct: 125 NNSKTSEFCAPLTSFDWNDVEPKRLGTCSIDTTCTIWDIEKSVVETQLIAHDKEVHDIAW 184

Query: 191 GGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSA 250
           G   VFASVSADGSVR+FDLRDKEHSTIIYES +PDTPL+RL WNKQD RYMATI+MDS 
Sbjct: 185 GEARVFASVSADGSVRIFDLRDKEHSTIIYESPQPDTPLLRLAWNKQDLRYMATILMDSN 244

Query: 251 KVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMGQPVEG 310
           KVV+LDIR PT+PV EL+RHQASVNAIAWAP S  HIC+ GDD+QALIW+L ++  P   
Sbjct: 245 KVVILDIRSPTMPVAELERHQASVNAIAWAPQSCKHICSGGDDTQALIWELPTVAGP--N 304

Query: 311 GLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           G+DP+  Y+AG+EI QLQWSSSQPDW+ IAF+ K+Q+LRV
Sbjct: 305 GIDPMSVYSAGSEINQLQWSSSQPDWIGIAFANKMQLLRV 341

BLAST of CmaCh14G007840 vs. TAIR10
Match: AT1G29260.1 (AT1G29260.1 peroxin 7)

HSP 1 Score: 70.5 bits (171), Expect = 2.3e-12
Identity = 48/166 (28.92%), Postives = 73/166 (43.98%), Query Frame = 1

Query: 133 EFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAWGGV--G 192
           E    + S D+N        TSS D T  +W ++R         H   VY   W      
Sbjct: 104 EHAREVQSVDYNPTRRDSFLTSSWDDTVKLWAMDRPASVRTFKEHAYCVYQAVWNPKHGD 163

Query: 193 VFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSAKVVV 252
           VFAS S D ++R++D+R+   + II      D  ++   WNK D   +AT  +D   V V
Sbjct: 164 VFASASGDCTLRIWDVREPGSTMII---PAHDFEILSCDWNKYDDCILATSSVDKT-VKV 223

Query: 253 LDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWD 297
            D+R   +P+  L  H  +V  + ++PH    I +   D    +WD
Sbjct: 224 WDVRSYRVPLAVLNGHGYAVRKVKFSPHRRSLIASCSYDMSVCLWD 265

BLAST of CmaCh14G007840 vs. NCBI nr
Match: gi|449455770|ref|XP_004145624.1| (PREDICTED: WD repeat-containing protein LWD1 [Cucumis sativus])

HSP 1 Score: 700.3 bits (1806), Expect = 1.7e-198
Identity = 342/346 (98.84%), Postives = 343/346 (99.13%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA+SD N DASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGASSDPNQDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDD SSV
Sbjct: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDPSSV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. NCBI nr
Match: gi|590648166|ref|XP_007032099.1| (Transducin/WD40 repeat-like superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 688.7 bits (1776), Expect = 5.1e-195
Identity = 333/346 (96.24%), Postives = 341/346 (98.55%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA SD NP+ SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNR+EI
Sbjct: 1   MGAISDPNPEGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRLEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSF+HPYPPTKTIFIPDKECQ+PDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDDSNGEIRSDPNLSFDHPYPPTKTIFIPDKECQKPDLLATSSDFLRIWRISDDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPR+MAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRFMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. NCBI nr
Match: gi|590648159|ref|XP_007032098.1| (Transducin/WD40 repeat-like superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 687.6 bits (1773), Expect = 1.1e-194
Identity = 332/346 (95.95%), Postives = 341/346 (98.55%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA SD NP+ SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNR+EI
Sbjct: 1   MGAISDPNPEGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRLEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSF+HPYPPTKTIFIPDKECQ+PDLLATSSDFLR+WRISDD S V
Sbjct: 61  VQLDDSNGEIRSDPNLSFDHPYPPTKTIFIPDKECQKPDLLATSSDFLRIWRISDDHSRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPR+MAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRFMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILR+
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRL 346

BLAST of CmaCh14G007840 vs. NCBI nr
Match: gi|224131364|ref|XP_002321066.1| (hypothetical protein POPTR_0014s13680g [Populus trichocarpa])

HSP 1 Score: 684.1 bits (1764), Expect = 1.3e-193
Identity = 330/346 (95.38%), Postives = 339/346 (97.98%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA+SD N D SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGASSDPNQDGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSFEHPYPPTKTIFIPDKECQ+PDLLATSSDFLRVWRI+D+   V
Sbjct: 61  VQLDDSNGEIRSDPNLSFEHPYPPTKTIFIPDKECQKPDLLATSSDFLRVWRINDEQPRV 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           ELKSLLNGNKNSEFCGPLTSFDWN+AEP+RIGTSSIDTTCTIWDIE+ETVDTQLIAHDKE
Sbjct: 121 ELKSLLNGNKNSEFCGPLTSFDWNEAEPRRIGTSSIDTTCTIWDIEKETVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRH ASVNA+AWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHHASVNAVAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 347
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILRV 346

BLAST of CmaCh14G007840 vs. NCBI nr
Match: gi|1009152054|ref|XP_015893885.1| (PREDICTED: WD repeat-containing protein LWD1-like [Ziziphus jujuba])

HSP 1 Score: 682.2 bits (1759), Expect = 4.8e-193
Identity = 332/345 (96.23%), Postives = 339/345 (98.26%), Query Frame = 1

Query: 1   MGANSDTNPDASDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60
           MGA+SD N + SDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI
Sbjct: 1   MGASSDPNQEGSDEQQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLEQYPNRVEI 60

Query: 61  VQLDDSSGEIRSDPNLSFEHPYPPTKTIFIPDKECQRPDLLATSSDFLRVWRISDDSSSV 120
           VQLDDS+GEIRSDPNLSFEHPYPPTKTIFIPDKECQ+PDLLATSSD+LRVWRISDDS  V
Sbjct: 61  VQLDDSNGEIRSDPNLSFEHPYPPTKTIFIPDKECQKPDLLATSSDYLRVWRISDDS--V 120

Query: 121 ELKSLLNGNKNSEFCGPLTSFDWNDAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKE 180
           E+KSLLNGNKNSEFCGPLTSFDWN+AEPKRIGTSSIDTTCTIWDIERE VDTQLIAHDKE
Sbjct: 121 EIKSLLNGNKNSEFCGPLTSFDWNEAEPKRIGTSSIDTTCTIWDIEREAVDTQLIAHDKE 180

Query: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240
           VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT
Sbjct: 181 VYDIAWGGVGVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMAT 240

Query: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300
           IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM
Sbjct: 241 IIMDSAKVVVLDIRFPTLPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSM 300

Query: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILR 346
           GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILR
Sbjct: 301 GQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSTKLQILR 343

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
LWD1_ARATH1.3e-19093.35WD repeat-containing protein LWD1 OS=Arabidopsis thaliana GN=LWD1 PE=2 SV=1[more]
LWD2_ARATH1.5e-17887.28WD repeat-containing protein LWD2 OS=Arabidopsis thaliana GN=LWD2 PE=2 SV=1[more]
TTG1_ARATH1.3e-12664.71Protein TRANSPARENT TESTA GLABRA 1 OS=Arabidopsis thaliana GN=TTG1 PE=1 SV=1[more]
DCAF7_DICDI1.3e-11359.82DDB1- and CUL4-associated factor 7 homolog OS=Dictyostelium discoideum GN=wdr68 ... [more]
DCAF7_HUMAN4.9e-11358.84DDB1- and CUL4-associated factor 7 OS=Homo sapiens GN=DCAF7 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LCB1_CUCSA1.2e-19898.84Uncharacterized protein OS=Cucumis sativus GN=Csa_3G253490 PE=4 SV=1[more]
A0A061EDG3_THECC3.6e-19596.24Transducin/WD40 repeat-like superfamily protein isoform 2 OS=Theobroma cacao GN=... [more]
A0A061EL36_THECC8.0e-19595.95Transducin/WD40 repeat-like superfamily protein isoform 1 OS=Theobroma cacao GN=... [more]
B9IA35_POPTR8.8e-19495.38Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s13680g PE=4 SV=1[more]
B9GTC9_POPTR3.4e-19395.09Transducin family protein OS=Populus trichocarpa GN=POPTR_0002s22620g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G12910.17.4e-19293.35 Transducin/WD40 repeat-like superfamily protein[more]
AT3G26640.18.5e-18087.28 Transducin/WD40 repeat-like superfamily protein[more]
AT5G24520.17.5e-12864.71 Transducin/WD40 repeat-like superfamily protein[more]
AT1G29260.12.3e-1228.92 peroxin 7[more]
Match NameE-valueIdentityDescription
gi|449455770|ref|XP_004145624.1|1.7e-19898.84PREDICTED: WD repeat-containing protein LWD1 [Cucumis sativus][more]
gi|590648166|ref|XP_007032099.1|5.1e-19596.24Transducin/WD40 repeat-like superfamily protein isoform 2 [Theobroma cacao][more]
gi|590648159|ref|XP_007032098.1|1.1e-19495.95Transducin/WD40 repeat-like superfamily protein isoform 1 [Theobroma cacao][more]
gi|224131364|ref|XP_002321066.1|1.3e-19395.38hypothetical protein POPTR_0014s13680g [Populus trichocarpa][more]
gi|1009152054|ref|XP_015893885.1|4.8e-19396.23PREDICTED: WD repeat-containing protein LWD1-like [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001680WD40_repeat
IPR015943WD40/YVTN_repeat-like_dom_sf
IPR017986WD40_repeat_dom
IPR019775WD40_repeat_CS
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh14G007840.1CmaCh14G007840.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001680WD40 repeatPFAMPF00400WD40coord: 174..205
score: 0.14coord: 263..296
score: 0
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 119..164
score: 0.57coord: 256..296
score: 3.5E-6coord: 167..205
score: 2.
IPR001680WD40 repeatPROFILEPS50082WD_REPEATS_2coord: 263..299
score: 11
IPR015943WD40/YVTN repeat-like-containing domainGENE3DG3DSA:2.130.10.10coord: 30..337
score: 3.8
IPR017986WD40-repeat-containing domainPROFILEPS50294WD_REPEATS_REGIONcoord: 131..305
score: 13
IPR017986WD40-repeat-containing domainunknownSSF50978WD40 repeat-likecoord: 56..304
score: 2.17
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 151..165
score: -coord: 283..297
scor
NoneNo IPR availableunknownCoilCoilcoord: 346..346
scor
NoneNo IPR availablePANTHERPTHR19919WD REPEAT CONTAINING PROTEINcoord: 13..346
score: 2.0E