Cp4.1LG20g01070 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g01070
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Transcription factor BIM1-like protein
Location: Cp4.1LG20 : 496764 .. 501808 (-)
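
As an illustration of how the location field can be read programmatically, the sketch below splits it into sequence ID, start, end, and strand, and derives the span length. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG20 : 496764 .. 501808 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

A minus strand means the gene sequence shown is the reverse complement of the reference orientation over that interval.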
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATTTTTTTATTTTATTTATTTTTTGGGATTTGAAGAGAAAAAGAGAGATTCTAACCCCAAAGGAAAAAAGAAAACACAATGTTTTGTTTGTAAGCGGCAATTCGAAGCAGATTCTCTTCTTCTTCTTCTTCTTCTTCATTTTACCAATTCTTCTCTGCAATTCTAGTTTACTATCTTCACTGCTTTTCAATGGCGCTATAGTTTCTGTGTTTGCGTTCAAGTTCTAAATTTAGTTCAGTTTAGAAGAGTCACTTTGTCAAAGCCTGTGAAAATCTTCTCTCTTATCAGATTCTTGGAGAAACTTATGGTACTATTCTCTCCTAATTCATTCGATTTTATTATTATTATTATTATTATTATTGTTATTGTTTTCTGGATTTGTTCTGCTTTATCTTGTTCTTCAACTGTTGTAGTTGAAATTGGAGAATTGTTTTCTCCAGGATGAGAAGCATGTTTTTCTGGAGGAACAGTCGCAATTTTATAGAGTATTATAACACTTTACAGTTCGATTACTGAAAAGGTTAAGTGAATCTGCCTTTTAGATCTTCATATCGACGGCATAAATTCTCTGATTAGCAAGCTTTTGTAGGAATTCAGGTTCTGAGCGCTTCATTTCTGGAAAAACTCTCGATATCCGTAGTCAATTTTAGGGTGAATTACTGAGAATTGAACTTCTTTGAGAGGTTTGAATTCGTAATCCATCAAGAGAAGAAGGCCGAGGGATTCGCGTTTAACCGTTTGGAAAGCTTGATTGTAATTGGCTATGGAGCTTCCTCAGCCTTCCCGTCCCCTCGGAACCGAAGGTATGGTTCGATTTCAACTTTTGTCAAGTTTAGTACTCCTGAACTTGAGTTTTTGGAATTCGATGGGGAAAACCCCCCGAAACGAATGATTTGTGTTGAATGTTCATTTGTGACGGTAATCTGGAAGCTCTTGTTCAGGACGGAAACCAACGCATGACTTCCTATCACTCTATAGTCATTCAACTGCCGACCAACAAGATCTATCTCAGTCCTTTCAAGGTACTATTCCTCTATCAAATCACCAATTTACAGATTTACTTCCTCCTATAAGCCATTTGATAAAGTAAAGCCATGTCTCCGTTCTGATATTTCTTTAGTAAATTACTTTTGTTCGAAAACGAAATGAAATTATGGTTGTCCGTGCACGTGACTTGAATAAAATCATGGATCCGTGGCACGAAGTAAAAGTAGGGCTGAACTAAATTAGGAGCCATACTTGGCTTTCCCTACTGCATCTGTCATTTTGCAGTTCACTCCAAATTCAAAAAAAGAAGAAGAAGAAGAAGAATTTTCTATGACGAACTGTAATTGAATATCGGGATGAACTCGATAAAATATACCCTAATATTTTTCTTTTTCTAAAATTAGCACAGAATATATGATAGAATAGGGTCACTGAATTCCAAAAAGGAAAAGGAAAGGCCGGCAGGTAAGTAATTAGCACCACCAGATTGGTGCGCGCCCTTTCGATGGCGGGACCTGTCTCATGTGTCGACAACTGACTTGACGTATCGCACAATTCAGGATAAGCGATGCAAATTCAGCTCCCTTGCGGAACCCCTTAAAAATTGCAATCTTTTTCTTTTTTCTTTTGGGATGTGCATATGCTTCTGAATAAATTCCATAATAAATAAAGCGAATCTCTCTAAAACTTGCCATGTGGATATATTTACTGCCCCCATCATGCGTATCTCTTACGGAATCGTCAATATTAAGTGCTCTTTCAGTGGGGATTTTGAAATTTTTGCATTCAGCCTGTCATTGTTACTGATAAGAGAATAGTAAAACGTGAACATGGGTTTCAATTTCCACGTTTGTAATCCATTTTTTAGCTGGACAGTGAAAAAGGTTCTATCCTGCTTGTTTCGTGGTTAATTGCATCACTATTTGTTTAGAAGCCATGGGTGCATTTGGATTAGCTATTATAATGTATGTTTTTCTATTTTTGTGAGAACAGGAGGTTTTCTTACAACGCAT
GATTTTCTCCAGCCGTTGGAACGGATTGAGAAGACTTGTGCCAAGAAAGAGCAAACAATGAATGTGTCCACTGTCGAACGAGCAGCGCCATCCTTGGCAACTCTAGCACGTAATTCTCCTGTCGAGCGTCTTTTACCAGGTGGAATTGGGAGTTGCATTATTGGCCACATTTCTTATTTCAACCATAGGGTTCCAAAGACAGAAGGCTCGGTATTTCCAGTCCCTCAGGCGAGCAATACTGACAAAAGTGACGACAACTCCCGCTGTAGCTCTTTGTCTGCAAGTGGTTTCAAGTTGTGGGAAGAATCTGCTGTTAAAAAGGGAAAGACAAGGAAGGAGAATTTGAGTGAAAATCCGGGTCTAAGAGGTAAAGTTTTTTTATCATTACAATCTTTTCAGTTCTAATTATTTGACTACTAATCGATCAACAGAGTTCGCATGGACTTTCTAGTTTCTAATGTTGGTGGGGTCCTACTGACTTCTTAACGTAGCTTTTTCTTTAGTCAGTAAGAACTCATGACCGAAATTTACCACAAAACTTGAAATGTTTGGAGAGAAAACAAACGTATGTGGGTGCATATGTGTGCTTTCTCTTATCATCTTTTTGTTTGTTTTGCTTGCATGTATAAGAATTCTAAGGTGGGGTTTTGGTATTCGCTCATATTTTTTCCCCATCATTTGAGCTTTTTTATTACGTTCCTTCCCATTGAACTGATCTTCCGTTCTTTCTAGCAGAACCACCGGCAAAGATGGAACAGTGGGCCGTTTCAACAGAGCGGCCAATACAATCATCATGCAGCAGCTTCAGCTCTCTCTCGTCCTCTCAGTCAGTCATTCTTGTTTCTAACTTTTCAAAACGTCAACATTATTTTTTTGGTTCCCGTGTATAAATGTTGAGACTGGACTTCCTGGAAATTTACGAACAGGCCTTCAGGCCAGCAGAAACGGAGCTTCGGTGAGATGTTGAAGTCCGCAGCCAATGTTTCCTCTAAGGAAGAGGAACTAGACGATGGCAAAGCATTTGTTATTAAGAAGGAATCATCGCCATCCACTGCCTACAGAGGTTCGTTTCTTTCATGATGCTCTGTTCGTTATTCCCCCCACCAGAAATTTTAAAAAAAAATTAGATAAGTAGTGCAATTTGGCCTTCAATTTGAAAATGTTGCCATTTTTACTCTCTTCTAAGTTTTCCTTTGAAAATTCATTAAAAAAAAAAAAAAAAAAACGTGTTTTGATATACTTTTGATGAAGTGTAAAATATGTATGATACAAAATTGATAGGATTTAAAAGAAAATGACGTTAGAATTAATTTTTTGTCAAAATTTGTGAGCTCTGGCAAAATTAGTCCCCGAAAATCTCTCCTCAATTATTCATCTTTCTAAAATCTTGGATTTACAGGTGACTTGAGGATAAACGTCGGTGGAAAATGCTCTGATCAGAAGGCTAACACGCCGAGATCCAAACATTCTGCAACAGAACAGCGTCGAAGGAGCAAAATTAATGATAGGCAAGATTTCACAGATATATGTTAGTTCAGTAGCAGACATTACCTTTTTCCAAAACATGTGTATATATTTCCTTATAGAAAGCAATTCGTAGAAAAATCACTCTAGAAGTTATTTTATTTGTGTGAGAAGTTACATGCAGGTTTCAGAAGCTGAGAGAACTTCTTCCCTGTAGTGATCAGAAGAGAGACAAAGCATCATTTTTGTTAGAGGTAAATGTGCATGTCACTAGGTGATATTTTACTTCTCTTTTGCATATGGTGTGCAGTAGATTCATTTCTGGACATGTTACCAGGTTATCGAGTACATTCAATTTTTACAAGAAAAGGTTCGCAAATATGAAAGCTCACCACCGCAAGGATGGTACGATGAACCAGCGAAATTACTCCCCTGTGTAAATACTACCAACAAACTTTCTCTCCATTGTTTGTGCATACTATATAACACTGTTCCTTATAAAAATATTTTGACCTTTTGTTCTTGTTCCACTAAAT
TTAATTGATATCAGCTTCCTAATATGCAAACCCTCTGATCCTCTGTTGTGCAGAGAAACAATTGTAATCCAGCACGACGTTACATCGATCAATCTCAAGTTGCCAAGAGCGGCCCTGTGTTTATATTTGCTGGGTCTGACCAGAAAAATACGTCTCACTTCCCTGCATTTCTTCCAAGGTGCTCGCACAACCCAGTCGAATCTAATACTTCCACCACCTTCGGAGAAGCGGATCATCATCTTGGATCAACAAACAAAGCAACATCTTATCCAATGTTGGACCCACGCTATTTCATACCTGTCACAAGTGAAGGTGAAAAAACTAAAATCTATTCTCAAGTGGCACATAGTGCAGACAGAAAACCATATAAAATGCAACGACTATTATGTGAGACGAGAGCATGTACTACCGATATTGCTGCTGTTGATAATAAGCTGAAAGAACCTGAGCAGAGCATTGAAGGTGGTAGAATTAGCATCTCAAGTGCATATTCTCAAGAGTGAGTATATTCATATTCTATATTCTTGTTTAATATGGTACATTCTGATATGCATGATCTTGAGTTTTCATCCAATAAACAATAAATTTAGCCATCTTCAAGACTGACATTATAATATTAGTAGATAGCTCTTCTAGACAGCGAAGTTAACTGTTGGAAGAGGCCAATAGATCAATCTGAGACTTGATTCATCAATTAAGCGACATACATCTGGTTTCTCGATGAGCAGGTTGTTAAAAATTCTCACACAAGCGCTACAGAGTTCTGGAGTGGATATGTCACAGGCCAGTATCGCTGTACAAATAGAACTTGGAAAGAGGACAAATTGTAGAGATACTGCCGCAAATCCTGCTATTGTGGTAAACGAACTGCATACTAGTATGCAATGAATGTCCTTTTACTAGCTGCATCAACTTTTCCACCCTTTTCTTTACTTTTTCATAGGATGATTCAGCTCGCCCAAGTGAGAGAGCGGTCGTCGGTACCAGAGTTGCAGCTGCAGAAGAGGACATGGAGCAGGCATTCAGAAAGAAGCAAAATACTTGA

mRNA sequence

ATTTTTTTATTTTATTTATTTTTTGGGATTTGAAGAGAAAAAGAGAGATTCTAACCCCAAAGGAAAAAAGAAAACACAATGTTTTGTTTGTAAGCGGCAATTCGAAGCAGATTCTCTTCTTCTTCTTCTTCTTCTTCATTTTACCAATTCTTCTCTGCAATTCTAGTTTACTATCTTCACTGCTTTTCAATGGCGCTATAGTTTCTGTGTTTGCGTTCAAGTTCTAAATTTAGTTCAGTTTAGAAGAGTCACTTTGTCAAAGCCTGTGAAAATCTTCTCTCTTATCAGATTCTTGGAGAAACTTATGGATGAGAAGCATGTTTTTCTGGAGGAACAGTCGCAATTTTATAGAGTATTATAACACTTTACAGTTCGATTACTGAAAAGGTTCTGAGCGCTTCATTTCTGGAAAAACTCTCGATATCCGTAGTCAATTTTAGGGTGAATTACTGAGAATTGAACTTCTTTGAGAGGTTTGAATTCGTAATCCATCAAGAGAAGAAGGCCGAGGGATTCGCGTTTAACCGTTTGGAAAGCTTGATTGTAATTGGCTATGGAGCTTCCTCAGCCTTCCCGTCCCCTCGGAACCGAAGGACGGAAACCAACGCATGACTTCCTATCACTCTATAGTCATTCAACTGCCGACCAACAAGATCTATCTCAGTCCTTTCAAGGAGGTTTTCTTACAACGCATGATTTTCTCCAGCCGTTGGAACGGATTGAGAAGACTTGTGCCAAGAAAGAGCAAACAATGAATGTGTCCACTGTCGAACGAGCAGCGCCATCCTTGGCAACTCTAGCACGTAATTCTCCTGTCGAGCGTCTTTTACCAGGTGGAATTGGGAGTTGCATTATTGGCCACATTTCTTATTTCAACCATAGGGTTCCAAAGACAGAAGGCTCGGTATTTCCAGTCCCTCAGGCGAGCAATACTGACAAAAGTGACGACAACTCCCGCTGTAGCTCTTTGTCTGCAAGTGGTTTCAAGTTGTGGGAAGAATCTGCTGTTAAAAAGGGAAAGACAAGGAAGGAGAATTTGAGTGAAAATCCGGGTCTAAGAGAACCACCGGCAAAGATGGAACAGTGGGCCGTTTCAACAGAGCGGCCAATACAATCATCATGCAGCAGCTTCAGCTCTCTCTCGTCCTCTCAGCCTTCAGGCCAGCAGAAACGGAGCTTCGGTGAGATGTTGAAGTCCGCAGCCAATGTTTCCTCTAAGGAAGAGGAACTAGACGATGGCAAAGCATTTGTTATTAAGAAGGAATCATCGCCATCCACTGCCTACAGAGTTACATGCAGGTTTCAGAAGCTGAGAGAACTTCTTCCCTGTAGTGATCAGAAGAGAGACAAAGCATCATTTTTGTTAGAGGTTATCGAGTACATTCAATTTTTACAAGAAAAGGTTCGCAAATATGAAAGCTCACCACCGCAAGGATGGTACGATGAACCAGCGAAATTACTCCCCTGTAGAAACAATTGTAATCCAGCACGACGTTACATCGATCAATCTCAAGTTGCCAAGAGCGGCCCTGTGTTTATATTTGCTGGGTCTGACCAGAAAAATACGTCTCACTTCCCTGCATTTCTTCCAAGGTGCTCGCACAACCCAGTCGAATCTAATACTTCCACCACCTTCGGAGAAGCGGATCATCATCTTGGATCAACAAACAAAGCAACATCTTATCCAATGTTGGACCCACGCTATTTCATACCTGTCACAAGTGAAGGTGAAAAAACTAAAATCTATTCTCAAGTGGCACATAGTGCAGACAGAAAACCATATAAAATGCAACGACTATTATGTGAGACGAGAGCATGTACTACCGATATTGCTGCTGTTGATAATAAGCTGAAAGAACCTGAGCAGAGCATTGAAGGTGGTAGAATTAGCATCTCAAGTGCATATTCTCAAGAGTTGTTAAAAATTCTCACACAAGCGCTACAGAGTTCTGGAGTGGATATGTCACAGGCCAGTATCGCTGTACAAATAGAACTTGGAA
AGAGGACAAATTGTAGAGATACTGCCGCAAATCCTGCTATTGTGGATGATTCAGCTCGCCCAAGTGAGAGAGCGGTCGTCGGTACCAGAGTTGCAGCTGCAGAAGAGGACATGGAGCAGGCATTCAGAAAGAAGCAAAATACTTGA

Coding sequence (CDS)

ATGGAGCTTCCTCAGCCTTCCCGTCCCCTCGGAACCGAAGGACGGAAACCAACGCATGACTTCCTATCACTCTATAGTCATTCAACTGCCGACCAACAAGATCTATCTCAGTCCTTTCAAGGAGGTTTTCTTACAACGCATGATTTTCTCCAGCCGTTGGAACGGATTGAGAAGACTTGTGCCAAGAAAGAGCAAACAATGAATGTGTCCACTGTCGAACGAGCAGCGCCATCCTTGGCAACTCTAGCACGTAATTCTCCTGTCGAGCGTCTTTTACCAGGTGGAATTGGGAGTTGCATTATTGGCCACATTTCTTATTTCAACCATAGGGTTCCAAAGACAGAAGGCTCGGTATTTCCAGTCCCTCAGGCGAGCAATACTGACAAAAGTGACGACAACTCCCGCTGTAGCTCTTTGTCTGCAAGTGGTTTCAAGTTGTGGGAAGAATCTGCTGTTAAAAAGGGAAAGACAAGGAAGGAGAATTTGAGTGAAAATCCGGGTCTAAGAGAACCACCGGCAAAGATGGAACAGTGGGCCGTTTCAACAGAGCGGCCAATACAATCATCATGCAGCAGCTTCAGCTCTCTCTCGTCCTCTCAGCCTTCAGGCCAGCAGAAACGGAGCTTCGGTGAGATGTTGAAGTCCGCAGCCAATGTTTCCTCTAAGGAAGAGGAACTAGACGATGGCAAAGCATTTGTTATTAAGAAGGAATCATCGCCATCCACTGCCTACAGAGTTACATGCAGGTTTCAGAAGCTGAGAGAACTTCTTCCCTGTAGTGATCAGAAGAGAGACAAAGCATCATTTTTGTTAGAGGTTATCGAGTACATTCAATTTTTACAAGAAAAGGTTCGCAAATATGAAAGCTCACCACCGCAAGGATGGTACGATGAACCAGCGAAATTACTCCCCTGTAGAAACAATTGTAATCCAGCACGACGTTACATCGATCAATCTCAAGTTGCCAAGAGCGGCCCTGTGTTTATATTTGCTGGGTCTGACCAGAAAAATACGTCTCACTTCCCTGCATTTCTTCCAAGGTGCTCGCACAACCCAGTCGAATCTAATACTTCCACCACCTTCGGAGAAGCGGATCATCATCTTGGATCAACAAACAAAGCAACATCTTATCCAATGTTGGACCCACGCTATTTCATACCTGTCACAAGTGAAGGTGAAAAAACTAAAATCTATTCTCAAGTGGCACATAGTGCAGACAGAAAACCATATAAAATGCAACGACTATTATGTGAGACGAGAGCATGTACTACCGATATTGCTGCTGTTGATAATAAGCTGAAAGAACCTGAGCAGAGCATTGAAGGTGGTAGAATTAGCATCTCAAGTGCATATTCTCAAGAGTTGTTAAAAATTCTCACACAAGCGCTACAGAGTTCTGGAGTGGATATGTCACAGGCCAGTATCGCTGTACAAATAGAACTTGGAAAGAGGACAAATTGTAGAGATACTGCCGCAAATCCTGCTATTGTGGATGATTCAGCTCGCCCAAGTGAGAGAGCGGTCGTCGGTACCAGAGTTGCAGCTGCAGAAGAGGACATGGAGCAGGCATTCAGAAAGAAGCAAAATACTTGA

Protein sequence

MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTCAKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFPVPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAVSTERPIQSSCSSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKKESSPSTAYRVTCRFQKLRELLPCSDQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVAKSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVESNTSTTFGEADHHLGSTNKATSYPMLDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSARPSERAVVGTRVAAAEEDMEQAFRKKQNT
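
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code; the function name `translate` and the stop-at-first-stop-codon behaviour are illustrative choices, not something specified by this page.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the CDS listed above.
print(translate("ATGGAGCTTCCTCAGCCTTCCCGT"))
print(translate("AATACTTGA"))
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the standard code applies.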
BLAST of Cp4.1LG20g01070 vs. Swiss-Prot
Match: BIM1_ARATH (Transcription factor BIM1 OS=Arabidopsis thaliana GN=BIM1 PE=1 SV=2)

HSP 1 Score: 245.4 bits (625), Expect = 1.4e-63
Identity = 209/538 (38.85%), Positives = 281/538 (52.23%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTA---DQQDLSQSFQGGFLTTHDFLQPLERIE 60
           MELPQP RP  T+GRKPTHDFLSL SHST     +     S QG  L THDFLQPLE + 
Sbjct: 1   MELPQP-RPFKTQGRKPTHDFLSLCSHSTVHPDPKPTPPPSSQGSHLKTHDFLQPLECV- 60

Query: 61  KTCAKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNH---RVPKT 120
                KE    +++   A+      A   P++ +LPGGIG+  I  I YF+H   R+PK 
Sbjct: 61  ---GAKEDVSRINSTTTASEKPPPPAPPPPLQHVLPGGIGTYTISPIPYFHHHHQRIPKP 120

Query: 121 EGSVFPVPQASNTDKSDD--NSRCSSLSA--SGFKLWEESAV-KKGKTRKEN-LSENPGL 180
           E S   +  A+  +  D+  NS CSS +A  SGF LW+ESA  KKG+TRKEN + E   +
Sbjct: 121 ELSPPMMFNANERNVLDENSNSNCSSYAAASSGFTLWDESASGKKGQTRKENSVGERVNM 180

Query: 181 R-EPPAKMEQWAVSTERP---IQSSCSSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEE 240
           R +  A + QW V+  R      +  S FSSLSSSQ S  + +SF +M++SA   SS+E+
Sbjct: 181 RADVAATVGQWPVAERRSQSLTNNHMSGFSSLSSSQGSVLKSQSFMDMIRSAKG-SSQED 240

Query: 241 ELDDGKAFVIKKE-SSPSTAYRVTCRF----------QKLREL----------------- 300
           +LDD + F++KKE SS S ++RV  R           QKL                    
Sbjct: 241 DLDDEEDFIMKKESSSTSQSHRVDLRVKADVRGSPNDQKLNTPRSKHSATEQRRRSKIND 300

Query: 301 --------LPCSDQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNN 360
                   +P SDQKRDKASFLLEVIEYIQFLQEK  KY +S  QGW  EPAKLL  ++N
Sbjct: 301 RFQMLRQLIPNSDQKRDKASFLLEVIEYIQFLQEKADKYVTS-YQGWNHEPAKLLNWQSN 360

Query: 361 CNPARRYIDQSQVAKSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVESNTSTTFGEADHHL 420
            N         Q+   G  F     ++KN            + PV    +      DH  
Sbjct: 361 NN--------QQLVPEGVAFAPKLEEEKN------------NIPVSVLATAQGVVIDHPT 420

Query: 421 GSTNKATSYPMLDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAA 480
            +T       +    +F PV +     + +++VA S   +P    R              
Sbjct: 421 TATTSPFPLSIQSNSFFSPVIAGNPVPQFHARVASSEAVEPSPSSR-------------- 480

Query: 481 VDNKLKEPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTN 487
              K +E E+ +E G I ISS YSQ L+K L +AL++SGVD+++ASI+V+IEL K+++
Sbjct: 481 -SQKEEEDEEVLE-GNIRISSVYSQGLVKTLREALENSGVDLTKASISVEIELAKQSS 495
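
The percentages in an HSP summary line follow directly from the counts: matches divided by alignment length, including gap columns. The sketch below recomputes them for the BIM1_ARATH hit. The helper name `parse_hsp_stats` is hypothetical, and the pattern also accepts the alternative spelling of the positives label that appears in some exports of this page.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP summary line (hypothetical helper)."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "aligned_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 209/538 (38.85%), Positives = 281/538 (52.23%), Query Frame = 1"
)
print(stats)
```

Because the denominator is the alignment length rather than the query length, percentages from hits with different coverage are not directly comparable.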

BLAST of Cp4.1LG20g01070 vs. Swiss-Prot
Match: BIM3_ARATH (Transcription factor BIM3 OS=Arabidopsis thaliana GN=BIM3 PE=1 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 1.0e-16
Identity = 84/246 (34.15%), Positives = 115/246 (46.75%), Query Frame = 1

Query: 245 RVTCRFQKLRELLPCS--DQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKL 304
           ++  RFQ L +++P +  DQKRDKASFLLEVIEYI FLQEKV  YE S  Q WY  P KL
Sbjct: 48  KINERFQSLMDIIPQNQNDQKRDKASFLLEVIEYIHFLQEKVHMYEDSH-QMWYQSPTKL 107

Query: 305 LPCRNNCNPARRYIDQSQVAKSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVESNTS---- 364
           +P RN+                       GS  +   H P  +   S N   + +S    
Sbjct: 108 IPWRNS----------------------HGSVAEENDH-PQIVKSFSSNDKVAASSGFLL 167

Query: 365 TTFGEADHHLGSTNKATSYPMLDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCE 424
            T+   +  + S   +T  P   P   +   S   +T+   Q       +P    +  C 
Sbjct: 168 DTYNSVNPDIDSA-VSTKIPEHSP---VSAVSSYLRTEPSLQFVQHDFWQP----KTSCG 227

Query: 425 TRACTTDIAAVDNKLKEPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQ 484
           T  C T+         E   S E    S+S+  SQ +L  LT+AL+SSGV+MS+  I+VQ
Sbjct: 228 TINCFTN---------ELLTSDEKTSASLSTVCSQRVLNTLTEALKSSGVNMSETMISVQ 252

BLAST of Cp4.1LG20g01070 vs. Swiss-Prot
Match: BIM2_ARATH (Transcription factor BIM2 OS=Arabidopsis thaliana GN=BIM2 PE=1 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 5.0e-16
Identity = 96/319 (30.09%), Positives = 140/319 (43.89%), Query Frame = 1

Query: 215 SAANVSSKEEELDDGKAFVIKKESSPSTAYR---VTCRFQKLRELLPCSDQKRDKASFLL 274
           S   V S  +  ++ KA  I+ + S +   R   +  RFQ LREL+P S+QKRD ASFLL
Sbjct: 27  SNTTVHSNRDSKENDKASAIRSKHSVTEQRRRSKINERFQILRELIPNSEQKRDTASFLL 86

Query: 275 EVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVA---KSGPVF 334
           EVI+Y+Q+LQEKV+KYE S P GW  EP KL P RNN +   + +    VA    SGP  
Sbjct: 87  EVIDYVQYLQEKVQKYEGSYP-GWSQEPTKLTPWRNN-HWRVQSLGNHPVAINNGSGPGI 146

Query: 335 IFAGSDQKNT-SHFPAFLPRCSHNPVESNTSTTFGEADHHLGSTNKATSYPMLDPRYFIP 394
            F G  + NT +  PA +      P+ES+ +                   P L P   +P
Sbjct: 147 PFPGKFEDNTVTSTPAIIAE-PQIPIESDKARAITGISIESQPELDDKGLPPLQP--ILP 206

Query: 395 VTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIEGGRISI 454
           +  +GE+       +    +       L+ E    +   A     L    Q+++   I +
Sbjct: 207 MV-QGEQANECPATSDGLGQS----NDLVIEGGTISISSAYSHELLSSLTQALQNAGIDL 266

Query: 455 SSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSARPSERA 514
           S                        A ++VQI+LGKR N   T   P+    S  P    
Sbjct: 267 SQ-----------------------AKLSVQIDLGKRANQGLTHEEPS----SKNPLSYD 307

Query: 515 VVGTRVAAAEEDMEQAFRK 527
             G R ++ EE+ E + ++
Sbjct: 327 TQG-RDSSVEEESEHSHKR 307

BLAST of Cp4.1LG20g01070 vs. TrEMBL
Match: A0A0A0LSX0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G039980 PE=4 SV=1)

HSP 1 Score: 760.4 bits (1962), Expect = 1.5e-216
Identity = 417/569 (73.29%), Positives = 453/569 (79.61%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RPLGTEGRKPTHDFLSLY HSTA QQD SQSFQGGFL THDFL+PLER  KTC
Sbjct: 1   MELPQPPRPLGTEGRKPTHDFLSLYGHSTALQQDPSQSFQGGFLKTHDFLRPLERTGKTC 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E+T+NVSTVERAAP+L T A NS  ERLLPGGIG+  I HISYFN RVPK EGSVFP
Sbjct: 61  AKEEKTINVSTVERAAPTLGTQAHNSAAERLLPGGIGTYSISHISYFNQRVPKPEGSVFP 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
           VPQASNTDKSDDNSRCSSLS +GF LWEESAVKKGKTRKENL E P L+E PAK+EQW V
Sbjct: 121 VPQASNTDKSDDNSRCSSLSGNGFTLWEESAVKKGKTRKENLGEKPALKESPAKIEQWTV 180

Query: 181 STERPIQSSC----SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKK 240
           +TERP+QSS     SSFSSLS SQPSGQQ RSFGEMLKS  NVSS EEELDD KAFVIKK
Sbjct: 181 TTERPMQSSSSNHRSSFSSLSPSQPSGQQSRSFGEMLKSTVNVSSMEEELDDDKAFVIKK 240

Query: 241 ESSPSTAYRVTCRF----------------------QKLRELL-----------PCSDQK 300
           ESSPSTAY+   +                       Q+ R  +           P SDQK
Sbjct: 241 ESSPSTAYKGDLKINICGKSSDQKANTPRSKHSATEQRRRSKINDRFQKLRELIPRSDQK 300

Query: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVAK 360
           RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWY EPAKL+PCRNNCNPA+ YIDQSQ+AK
Sbjct: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYHEPAKLIPCRNNCNPAQCYIDQSQIAK 360

Query: 361 SGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPMLD 420
           SGPVFIFAGSD+KN  H PAF PRCSHNPVES  +TSTTF EAD H G+ NK T YPMLD
Sbjct: 361 SGPVFIFAGSDEKNMCHSPAF-PRCSHNPVESEVSTSTTFREADQHPGTNNK-TCYPMLD 420

Query: 421 PRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIE 480
           PR+F PV SEG KT+++SQV H+AD KP ++Q L CE R+CTT+I   +NKLKEPEQ I+
Sbjct: 421 PRHFTPVISEGAKTRLHSQVGHNADNKPCEIQPLSCEMRSCTTNIVDGNNKLKEPEQRID 480

Query: 481 GGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSA 531
           GGRISIS AYSQ LLKILTQALQSSGVDMSQAS+AVQIELGKRTN R+   +P IVDDSA
Sbjct: 481 GGRISISGAYSQGLLKILTQALQSSGVDMSQASVAVQIELGKRTNYREIVPSP-IVDDSA 540

BLAST of Cp4.1LG20g01070 vs. TrEMBL
Match: D7SIP8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g04790 PE=4 SV=1)

HSP 1 Score: 438.3 bits (1126), Expect = 1.3e-119
Identity = 283/567 (49.91%), Positives = 352/567 (62.08%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RP GTEGRKPTHDFLSL SHST  QQD   S Q G+L THDFLQPLER EK  
Sbjct: 1   MELPQP-RPFGTEGRKPTHDFLSLCSHSTV-QQDPRPS-QAGYLKTHDFLQPLERGEKNS 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
            K+E  + + TV++  P       NS VE +LPGGIG+  I HISYFN R+PK EGS+F 
Sbjct: 61  IKEENAIEIRTVDKPPPPAPP--PNSSVEHILPGGIGTYSISHISYFNQRLPKPEGSIFT 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
             QAS +D++++NS CSS + SGF LWEE+AVKKGK  KEN  E     EP  K+ QW  
Sbjct: 121 AAQASTSDRNEENSNCSSYTGSGFTLWEETAVKKGKAGKENAVERSIGIEPAVKLGQW-- 180

Query: 181 STERPIQSSC---SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKKE 240
           ++ERP QS     SSFSSLSSSQ SGQ+ +SF EM++SA+   ++EEE +D + FV+KKE
Sbjct: 181 TSERPSQSPSNHRSSFSSLSSSQSSGQKNQSFMEMIQSASAKGTQEEEEEDEEEFVLKKE 240

Query: 241 SS---------------------PSTAYRVTCR-----------FQKLRELLPCSDQKRD 300
           SS                     P + +  T +           FQ LR+L+P SDQKRD
Sbjct: 241 SSSNKGDLTVKVDGKSSDQKAVTPRSKHSATEQRRRSKINDRRVFQMLRDLIPHSDQKRD 300

Query: 301 KASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQ--VAK 360
           KASFLLEVIEYIQFLQEKV KYE S  QGW  E AKL+P RN+  PA  + DQS+   + 
Sbjct: 301 KASFLLEVIEYIQFLQEKVHKYEGS-FQGWNHESAKLMPWRNSHRPAESFADQSRGINSG 360

Query: 361 SGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPM-L 420
           SGP  +F+    +N       + R + NPVES  + STTF   D H G TNKA    M L
Sbjct: 361 SGPALMFSAKFDENNVAVSPNISRNTQNPVESDLSASTTFKAMDRHPGLTNKAVPIHMQL 420

Query: 421 DPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETR-ACTTDIAAVDNKLKEPEQS 480
            P  F PV   G   ++  ++A  A+      Q  L ++R + TT+     +KLKE E +
Sbjct: 421 QPNIFTPVVGGGGLAQLPPRLAPDAENMASLPQSQLWQSRSSVTTECTVASDKLKEQELT 480

Query: 481 IEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDD 527
           IEGG ISISSAYSQ LL  LTQALQSSGVD+S+ASI+VQI+LG + N R TA  P I D+
Sbjct: 481 IEGGTISISSAYSQGLLNTLTQALQSSGVDLSKASISVQIDLGNKANSRPTAPTPIIKDN 540

BLAST of Cp4.1LG20g01070 vs. TrEMBL
Match: A0A061G597_THECC (Transcription factor BIM1, putative isoform 2 OS=Theobroma cacao GN=TCM_014392 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.6e-117
Identity = 278/572 (48.60%), Positives = 352/572 (61.54%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQ SRP G EGRK THDFLSLYSH +  QQD     QGG+L THDFLQ LER+ KT 
Sbjct: 1   MELPQ-SRPFGAEGRKSTHDFLSLYSHPSV-QQDPRPPAQGGYLKTHDFLQ-LERLGKTS 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E  + V+T E+  P     A    VE +LPGGIG+  I HISYFN RVPK EG+++ 
Sbjct: 61  AKEETPVEVATAEKPPPP----APPPSVEHILPGGIGTYSISHISYFNPRVPKAEGAIYN 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
           V Q SNT+++D+NS CSS + SGF LWEESA KKGKT KEN  E P +RE   K+ QWA 
Sbjct: 121 VAQGSNTERNDENSNCSSYAGSGFTLWEESAGKKGKTGKENAGETPVVREAAGKVGQWAT 180

Query: 181 ST-ERPIQSSCS----SFSSLSSSQPSGQQK-RSFGEMLKSAANVSSKEEELDDGKAFVI 240
           S+ ER  QSS +    SFSSLSSSQPS +QK +SF EM+KSA   S+++++ ++ + FV+
Sbjct: 181 SSLERASQSSTNNHRNSFSSLSSSQPSSKQKSQSFMEMIKSAKG-SAQDDDFEEDEDFVL 240

Query: 241 KKESSPST----------------------------------AYRVTCRFQKLRELLPCS 300
           KKESS +T                                    ++  RFQ LR+L+P S
Sbjct: 241 KKESSTTTHSKGELRVKVDGKSAPDQKANTPRSKHSATEQRRRSKINDRFQMLRDLIPHS 300

Query: 301 DQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQ 360
           DQKRDKASFLLEVIEYIQFLQEKV KYE +  QGW  EP+KL+P RNN  P   Y DQSQ
Sbjct: 301 DQKRDKASFLLEVIEYIQFLQEKVHKYEGTY-QGWSHEPSKLMPWRNNHRPTENYADQSQ 360

Query: 361 VAK--SGPVFIFAGS-DQKNTSHFPAFLPRCSHNPVESN--TSTTFGEADHHLGSTNKAT 420
                S P  +F+   D+KN +  P  +P  +HNP+ES+  T+TTF   D   G  NK  
Sbjct: 361 AINGVSAPALVFSAKFDEKNITVAPT-IPGSAHNPIESDMSTATTFRAIDLSPGMMNKTM 420

Query: 421 SYPM-LDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLK 480
            +P+ L P +F    S G   ++  ++         + Q + C + + TTD A    KLK
Sbjct: 421 PFPVSLQPNFFASAQSTGAAAQLVPRLPSDVANCASQPQSIACHSGSFTTDGALPSEKLK 480

Query: 481 EPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANP 527
           E E +IEGG ISISS YSQ LL  LTQALQ+SGVD+S ASI+VQIELGKR++ R TA+  
Sbjct: 481 EQELTIEGGTISISSVYSQGLLNTLTQALQTSGVDLSHASISVQIELGKRSSSRPTASAS 540

BLAST of Cp4.1LG20g01070 vs. TrEMBL
Match: B9IE21_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s03550g PE=4 SV=2)

HSP 1 Score: 430.3 bits (1105), Expect = 3.5e-117
Identity = 281/572 (49.13%), Positives = 356/572 (62.24%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQ  RP GTEGRKPTHDFLSLYSHS+  QQD     QGGFL THDFLQPLE++ K  
Sbjct: 1   MELPQ-QRPFGTEGRKPTHDFLSLYSHSSTVQQDPRPPSQGGFLQTHDFLQPLEQVSKAT 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           A++E  + + T+E+  P     A    VE  LPGGIG+  I H+SYFN RVPK E ++F 
Sbjct: 61  AREETNVEILTIEKPPPP----APPPSVEHTLPGGIGTYSISHVSYFNQRVPKPENTIFS 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGL-REPPAKMEQWA 180
           V QAS+TDK+D+NS CSS SASGF LWEES +KKGKT KEN+ E   + RE  AK +QW 
Sbjct: 121 VAQASSTDKNDENSNCSSYSASGFTLWEESTLKKGKTGKENVGERSNIIREAAAKTDQWT 180

Query: 181 VSTERPIQSSCS----SFSSLSSSQPSGQQ-KRSFGEMLKSAA------NVSSKEEEL-- 240
            S ERP QSS +    SFSSLSSSQP G +  +SF EM+KSA       ++  +E  L  
Sbjct: 181 TS-ERPSQSSSNNHRNSFSSLSSSQPPGLKCTQSFIEMIKSAKGSNLDDDLDDEETFLLK 240

Query: 241 ---------------DDGKAFVIKKESSPSTAY---------RVTCRFQKLRELLPCSDQ 300
                           DGK+   +K ++P + +         ++  RFQ LR L+P  DQ
Sbjct: 241 KETPSPIHKGELRVKVDGKSND-QKPNTPRSKHSATEQRRRSKINDRFQMLRALIPHGDQ 300

Query: 301 KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVA 360
           KRDKASFLLEVIE++QFLQEKV+KYE S  QGW  E AKL P RNN  P    +DQS+  
Sbjct: 301 KRDKASFLLEVIEHVQFLQEKVQKYEGSY-QGWNHEHAKLGPWRNNSRPVESSVDQSRGV 360

Query: 361 KSG--PVFIFAGS-DQKNTSHFPAFLPRCSHNPVESNTST--TFGEADHH--LGSTNKAT 420
            SG  P  +FA + D+KN +  P+  P  + N VESN S+  TF   DHH  LG TNKA 
Sbjct: 361 NSGVGPALLFAANLDEKNITISPSINPGGARNAVESNMSSASTFNAMDHHPNLGITNKAM 420

Query: 421 SYPM-LDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLK 480
            +P+ L P  F P    G   +   ++A  A+    + Q   C   +CT+D A   +KLK
Sbjct: 421 PFPISLQPNLFHPGRIAGAAAQFPPRLAFDAENTATQPQP--CHAISCTSDGAVASDKLK 480

Query: 481 EPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANP 527
           +   ++EGG ISISSAYSQ LL  LTQALQSSGVD+SQA+I+VQIELGK+ N R TA   
Sbjct: 481 QQNLTVEGGTISISSAYSQGLLNTLTQALQSSGVDLSQATISVQIELGKKGNSRQTAPTS 540

BLAST of Cp4.1LG20g01070 vs. TrEMBL
Match: A5AMI1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_038183 PE=4 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 6.6e-116
Identity = 274/537 (51.02%), Positives = 336/537 (62.57%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RP GTEGRKPTHDFLSL S ST  QQD   S Q G+L THDFLQPLER EK  
Sbjct: 1   MELPQP-RPFGTEGRKPTHDFLSLXSXSTV-QQDPRPS-QAGYLKTHDFLQPLERGEKNS 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
            K+E  + + TV++  P       NS VE +LPGGIG+  I HISYFN R+PK EGS+F 
Sbjct: 61  IKEENAIEIRTVDKPPPPAPP--PNSSVEHILPGGIGTYSISHISYFNQRLPKPEGSIFT 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
             QAS +D++++NS CSS + SGF LWEE+AVKKGK  KEN  E P   EP  K+ QW  
Sbjct: 121 AAQASTSDRNEENSNCSSYTGSGFTLWEETAVKKGKAGKENAVERPIGIEPAVKLGQW-- 180

Query: 181 STERPIQSSC---SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKKE 240
           ++ERP QS     SSFSSLSSSQ SGQ+ +SF EM++SA+   ++EEE +D + FV+KKE
Sbjct: 181 TSERPSQSPSNHRSSFSSLSSSQXSGQKNQSFMEMIQSASAKGTQEEEEEDEEEFVLKKE 240

Query: 241 SS---------------------PSTAYRVTCR-----------FQKLRELLPCSDQKRD 300
           SS                     P + +  T +           FQ LR+L+P SDQKRD
Sbjct: 241 SSSNKGDLTVKVDGKSSDQKAVTPRSKHSATEQRRRSKINDRRVFQMLRDLIPHSDQKRD 300

Query: 301 KASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQ--VAK 360
           KASFLLEVIEYIQFLQEKV KYE S  QGW  E AKL+P RN+  PA  + DQS+   + 
Sbjct: 301 KASFLLEVIEYIQFLQEKVHKYEGS-FQGWNHESAKLMPWRNSHRPAESFADQSRGINSG 360

Query: 361 SGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPM-L 420
           SGP  +F+    +N       + R + NPVES  + STTF   D H G TNKA    M L
Sbjct: 361 SGPALMFSAKFDENNVAVSPNISRNTQNPVESDLSASTTFKAMDRHPGLTNKAVPIHMQL 420

Query: 421 DPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETR-ACTTDIAAVDNKLKEPEQS 480
            P  F PV   G   ++  ++A  A+      Q  L ++R + TT+     +KLKE E +
Sbjct: 421 QPNIFTPVVGGGGLAQLPPRLAPDAENMASLPQSQLWQSRSSVTTECTVASDKLKEQELT 480

Query: 481 IEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAI 497
           IEGG ISISSAYSQ LL  LTQALQSSGVD+S+ASI+VQI+LG + N R TA  P I
Sbjct: 481 IEGGTISISSAYSQGLLNTLTQALQSSGVDLSKASISVQIDLGNKANSRPTAPTPII 529

BLAST of Cp4.1LG20g01070 vs. TAIR10
Match: AT5G08130.5 (AT5G08130.5 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 238.8 bits (608), Expect = 7.6e-63
Identity = 209/542 (38.56%), Positives = 278/542 (51.29%), Query Frame = 1

Query: 1   MELPQPSRPLGTE----GRKPTHDFLSLYSHSTADQQDLSQ---SFQGGFLTTHDFLQPL 60
           MELPQP RP  T+    GRKPTHDFLSL SHST           S QG  L THDFLQPL
Sbjct: 1   MELPQP-RPFKTQEFRTGRKPTHDFLSLCSHSTVHPDPKPTPPPSSQGSHLKTHDFLQPL 60

Query: 61  ERIEKTCAKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNH---R 120
           E +      KE    +++   A+      A   P++ +LPGGIG+  I  I YF+H   R
Sbjct: 61  ECV----GAKEDVSRINSTTTASEKPPPPAPPPPLQHVLPGGIGTYTISPIPYFHHHHQR 120

Query: 121 VPKTEGSVFPVPQASNTDKSDDNSR--CSSLSA--SGFKLWEESAV-KKGKTRKEN-LSE 180
           +PK E S   +  A+  +  D+NS   CSS +A  SGF LW+ESA  KKG+TRKEN + E
Sbjct: 121 IPKPELSPPMMFNANERNVLDENSNSNCSSYAAASSGFTLWDESASGKKGQTRKENSVGE 180

Query: 181 NPGLR-EPPAKMEQWAVSTERP---IQSSCSSFSSLSSSQPSGQQKRSFGEMLKSAANVS 240
              +R +  A + QW V+  R      +  S FSSLSSSQ S  + +SF +M++SA   S
Sbjct: 181 RVNMRADVAATVGQWPVAERRSQSLTNNHMSGFSSLSSSQGSVLKSQSFMDMIRSAKG-S 240

Query: 241 SKEEELDDGKAFVIKKE-SSPSTAYRVTCRF----------QKLREL------------- 300
           S+E++LDD + F++KKE SS S ++RV  R           QKL                
Sbjct: 241 SQEDDLDDEEDFIMKKESSSTSQSHRVDLRVKADVRGSPNDQKLNTPRSKHSATEQRRRS 300

Query: 301 ------------LPCSDQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLP 360
                       +P SDQKRDKASFLLEVIEYIQFLQEK  KY +S  QGW  EPAKLL 
Sbjct: 301 KINDRFQMLRQLIPNSDQKRDKASFLLEVIEYIQFLQEKADKYVTS-YQGWNHEPAKLLN 360

Query: 361 CRNNCNPARRYIDQSQVAKSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVESNTSTTFGEA 420
             NN           Q+   G  F     ++KN            + PV    +      
Sbjct: 361 WSNN---------NQQLVPEGVAFAPKLEEEKN------------NIPVSVLATAQGVVI 420

Query: 421 DHHLGSTNKATSYPMLDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTT 480
           DH   +T       +    +F PV +     + +++VA S   +P    R          
Sbjct: 421 DHPTTATTSPFPLSIQSNSFFSPVIAGNPVPQFHARVASSEAVEPSPSSR---------- 480

Query: 481 DIAAVDNKLKEPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKR 487
                  K +E E+ +E G I ISS YSQ L+K L +AL++SGVD+++ASI+V+IEL K+
Sbjct: 481 -----SQKEEEDEEVLE-GNIRISSVYSQGLVKTLREALENSGVDLTKASISVEIELAKQ 498

BLAST of Cp4.1LG20g01070 vs. TAIR10
Match: AT5G38860.1 (AT5G38860.1 BES1-interacting Myc-like protein 3)

HSP 1 Score: 89.7 bits (221), Expect = 5.7e-18
Identity = 84/246 (34.15%), Positives = 115/246 (46.75%), Query Frame = 1

Query: 245 RVTCRFQKLRELLPCS--DQKRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKL 304
           ++  RFQ L +++P +  DQKRDKASFLLEVIEYI FLQEKV  YE S  Q WY  P KL
Sbjct: 48  KINERFQSLMDIIPQNQNDQKRDKASFLLEVIEYIHFLQEKVHMYEDSH-QMWYQSPTKL 107

Query: 305 LPCRNNCNPARRYIDQSQVAKSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVESNTS---- 364
           +P RN+                       GS  +   H P  +   S N   + +S    
Sbjct: 108 IPWRNS----------------------HGSVAEENDH-PQIVKSFSSNDKVAASSGFLL 167

Query: 365 TTFGEADHHLGSTNKATSYPMLDPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCE 424
            T+   +  + S   +T  P   P   +   S   +T+   Q       +P    +  C 
Sbjct: 168 DTYNSVNPDIDSA-VSTKIPEHSP---VSAVSSYLRTEPSLQFVQHDFWQP----KTSCG 227

Query: 425 TRACTTDIAAVDNKLKEPEQSIEGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQ 484
           T  C T+         E   S E    S+S+  SQ +L  LT+AL+SSGV+MS+  I+VQ
Sbjct: 228 TINCFTN---------ELLTSDEKTSASLSTVCSQRVLNTLTEALKSSGVNMSETMISVQ 252

BLAST of Cp4.1LG20g01070 vs. TAIR10
Match: AT1G69010.1 (AT1G69010.1 BES1-interacting Myc-like protein 2)

HSP 1 Score: 87.4 bits (215), Expect = 2.8e-17
Identity = 96/319 (30.09%), Positives = 140/319 (43.89%), Query Frame = 1

Query: 215 SAANVSSKEEELDDGKAFVIKKESSPSTAYR---VTCRFQKLRELLPCSDQKRDKASFLL 274
           S   V S  +  ++ KA  I+ + S +   R   +  RFQ LREL+P S+QKRD ASFLL
Sbjct: 27  SNTTVHSNRDSKENDKASAIRSKHSVTEQRRRSKINERFQILRELIPNSEQKRDTASFLL 86

Query: 275 EVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVA---KSGPVF 334
           EVI+Y+Q+LQEKV+KYE S P GW  EP KL P RNN +   + +    VA    SGP  
Sbjct: 87  EVIDYVQYLQEKVQKYEGSYP-GWSQEPTKLTPWRNN-HWRVQSLGNHPVAINNGSGPGI 146

Query: 335 IFAGSDQKNT-SHFPAFLPRCSHNPVESNTSTTFGEADHHLGSTNKATSYPMLDPRYFIP 394
            F G  + NT +  PA +      P+ES+ +                   P L P   +P
Sbjct: 147 PFPGKFEDNTVTSTPAIIAE-PQIPIESDKARAITGISIESQPELDDKGLPPLQP--ILP 206

Query: 395 VTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIEGGRISI 454
           +  +GE+       +    +       L+ E    +   A     L    Q+++   I +
Sbjct: 207 MV-QGEQANECPATSDGLGQS----NDLVIEGGTISISSAYSHELLSSLTQALQNAGIDL 266

Query: 455 SSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSARPSERA 514
           S                        A ++VQI+LGKR N   T   P+    S  P    
Sbjct: 267 SQ-----------------------AKLSVQIDLGKRANQGLTHEEPS----SKNPLSYD 307

Query: 515 VVGTRVAAAEEDMEQAFRK 527
             G R ++ EE+ E + ++
Sbjct: 327 TQG-RDSSVEEESEHSHKR 307

BLAST of Cp4.1LG20g01070 vs. NCBI nr
Match: gi|659066651|ref|XP_008454530.1| (PREDICTED: transcription factor BIM1-like isoform X1 [Cucumis melo])

HSP 1 Score: 765.8 bits (1976), Expect = 5.1e-218
Identity = 420/577 (72.79%), Positives = 455/577 (78.86%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RPLGTEGRKPTHDFLSLYSHSTA QQD SQSFQGGFL THDFL+PLER  KTC
Sbjct: 1   MELPQPPRPLGTEGRKPTHDFLSLYSHSTAHQQDPSQSFQGGFLKTHDFLRPLERTGKTC 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E+T+NVSTVERAAP+  T A NSP ERLLPGGIG+  I HISYFN RVPK EGSVFP
Sbjct: 61  AKEEKTINVSTVERAAPTSGTQAHNSPAERLLPGGIGTYSISHISYFNQRVPKPEGSVFP 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
           VPQASNTDKSDDNS CSSLS +GF LWEESAVKKGKTRKENL E PGL+E PAK+EQWAV
Sbjct: 121 VPQASNTDKSDDNSHCSSLSGNGFTLWEESAVKKGKTRKENLGEKPGLKESPAKIEQWAV 180

Query: 181 STERPIQSSC----SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKK 240
           +TERPIQSS     SSFSSLS SQPSGQQ RSFGEMLKS  NVSS EEELDD KAFVIKK
Sbjct: 181 TTERPIQSSSSNHRSSFSSLSPSQPSGQQSRSFGEMLKSTVNVSSTEEELDDDKAFVIKK 240

Query: 241 ESSPSTAYRVTCRF----------------------QKLRELL-----------PCSDQK 300
           ESSPSTAY+   +                       Q+ R  +           P SDQK
Sbjct: 241 ESSPSTAYKGDLKINIGGKCSDQKANTPRSKHSATEQRRRSKINDRFQKLRELIPRSDQK 300

Query: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVAK 360
           RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWY EPAKL+PCRNNCNPA+ YIDQSQ+AK
Sbjct: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYHEPAKLIPCRNNCNPAQCYIDQSQIAK 360

Query: 361 SGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPMLD 420
           SGPVFIFAGSD+KN  H PAF PRCSHNPVES  +T+TTF E D H G+ NK TSYPML+
Sbjct: 361 SGPVFIFAGSDEKNMCHSPAF-PRCSHNPVESEVSTNTTFREVDQHPGTNNK-TSYPMLE 420

Query: 421 PRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIE 480
           PR+F  V SEG KT+++SQVAH+AD KP +MQ L CE R+CTT+I  V+NKLKEPEQ I+
Sbjct: 421 PRHFTSVISEGAKTRLHSQVAHNADNKPCEMQPLSCEVRSCTTNIVDVNNKLKEPEQRID 480

Query: 481 GGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIV---- 531
           GGRISIS AYSQ LLKILTQALQSSGVDMSQAS+AVQIELGKRTN R+T   P +V    
Sbjct: 481 GGRISISGAYSQGLLKILTQALQSSGVDMSQASVAVQIELGKRTNYRETVPTPIVVNKNC 540

BLAST of Cp4.1LG20g01070 vs. NCBI nr
Match: gi|659066653|ref|XP_008454608.1| (PREDICTED: transcription factor BIM1-like isoform X2 [Cucumis melo])

HSP 1 Score: 765.0 bits (1974), Expect = 8.6e-218
Identity = 421/570 (73.86%), Positives = 455/570 (79.82%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RPLGTEGRKPTHDFLSLYSHSTA QQD SQSFQGGFL THDFL+PLER  KTC
Sbjct: 1   MELPQPPRPLGTEGRKPTHDFLSLYSHSTAHQQDPSQSFQGGFLKTHDFLRPLERTGKTC 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E+T+NVSTVERAAP+  T A NSP ERLLPGGIG+  I HISYFN RVPK EGSVFP
Sbjct: 61  AKEEKTINVSTVERAAPTSGTQAHNSPAERLLPGGIGTYSISHISYFNQRVPKPEGSVFP 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLR-EPPAKMEQWA 180
           VPQASNTDKSDDNS CSSLS +GF LWEESAVKKGKTRKENL E PGL+ E PAK+EQWA
Sbjct: 121 VPQASNTDKSDDNSHCSSLSGNGFTLWEESAVKKGKTRKENLGEKPGLKAESPAKIEQWA 180

Query: 181 VSTERPIQSSC----SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIK 240
           V+TERPIQSS     SSFSSLS SQPSGQQ RSFGEMLKS  NVSS EEELDD KAFVIK
Sbjct: 181 VTTERPIQSSSSNHRSSFSSLSPSQPSGQQSRSFGEMLKSTVNVSSTEEELDDDKAFVIK 240

Query: 241 KESSPSTAYRVTCRF----------------------QKLRELL-----------PCSDQ 300
           KESSPSTAY+   +                       Q+ R  +           P SDQ
Sbjct: 241 KESSPSTAYKGDLKINIGGKCSDQKANTPRSKHSATEQRRRSKINDRFQKLRELIPRSDQ 300

Query: 301 KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVA 360
           KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWY EPAKL+PCRNNCNPA+ YIDQSQ+A
Sbjct: 301 KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYHEPAKLIPCRNNCNPAQCYIDQSQIA 360

Query: 361 KSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPML 420
           KSGPVFIFAGSD+KN  H PAF PRCSHNPVES  +T+TTF E D H G+ NK TSYPML
Sbjct: 361 KSGPVFIFAGSDEKNMCHSPAF-PRCSHNPVESEVSTNTTFREVDQHPGTNNK-TSYPML 420

Query: 421 DPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSI 480
           +PR+F  V SEG KT+++SQVAH+AD KP +MQ L CE R+CTT+I  V+NKLKEPEQ I
Sbjct: 421 EPRHFTSVISEGAKTRLHSQVAHNADNKPCEMQPLSCEVRSCTTNIVDVNNKLKEPEQRI 480

Query: 481 EGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDS 531
           +GGRISIS AYSQ LLKILTQALQSSGVDMSQAS+AVQIELGKRTN R+T   P IVDDS
Sbjct: 481 DGGRISISGAYSQGLLKILTQALQSSGVDMSQASVAVQIELGKRTNYRETVPTP-IVDDS 540

BLAST of Cp4.1LG20g01070 vs. NCBI nr
Match: gi|778657165|ref|XP_011650371.1| (PREDICTED: transcription factor BIM1 isoform X2 [Cucumis sativus])

HSP 1 Score: 760.4 bits (1962), Expect = 2.1e-216
Identity = 417/569 (73.29%), Positives = 453/569 (79.61%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RPLGTEGRKPTHDFLSLY HSTA QQD SQSFQGGFL THDFL+PLER  KTC
Sbjct: 1   MELPQPPRPLGTEGRKPTHDFLSLYGHSTALQQDPSQSFQGGFLKTHDFLRPLERTGKTC 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E+T+NVSTVERAAP+L T A NS  ERLLPGGIG+  I HISYFN RVPK EGSVFP
Sbjct: 61  AKEEKTINVSTVERAAPTLGTQAHNSAAERLLPGGIGTYSISHISYFNQRVPKPEGSVFP 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
           VPQASNTDKSDDNSRCSSLS +GF LWEESAVKKGKTRKENL E P L+E PAK+EQW V
Sbjct: 121 VPQASNTDKSDDNSRCSSLSGNGFTLWEESAVKKGKTRKENLGEKPALKESPAKIEQWTV 180

Query: 181 STERPIQSSC----SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKK 240
           +TERP+QSS     SSFSSLS SQPSGQQ RSFGEMLKS  NVSS EEELDD KAFVIKK
Sbjct: 181 TTERPMQSSSSNHRSSFSSLSPSQPSGQQSRSFGEMLKSTVNVSSMEEELDDDKAFVIKK 240

Query: 241 ESSPSTAYRVTCRF----------------------QKLRELL-----------PCSDQK 300
           ESSPSTAY+   +                       Q+ R  +           P SDQK
Sbjct: 241 ESSPSTAYKGDLKINICGKSSDQKANTPRSKHSATEQRRRSKINDRFQKLRELIPRSDQK 300

Query: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVAK 360
           RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWY EPAKL+PCRNNCNPA+ YIDQSQ+AK
Sbjct: 301 RDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYHEPAKLIPCRNNCNPAQCYIDQSQIAK 360

Query: 361 SGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPMLD 420
           SGPVFIFAGSD+KN  H PAF PRCSHNPVES  +TSTTF EAD H G+ NK T YPMLD
Sbjct: 361 SGPVFIFAGSDEKNMCHSPAF-PRCSHNPVESEVSTSTTFREADQHPGTNNK-TCYPMLD 420

Query: 421 PRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSIE 480
           PR+F PV SEG KT+++SQV H+AD KP ++Q L CE R+CTT+I   +NKLKEPEQ I+
Sbjct: 421 PRHFTPVISEGAKTRLHSQVGHNADNKPCEIQPLSCEMRSCTTNIVDGNNKLKEPEQRID 480

Query: 481 GGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSA 531
           GGRISIS AYSQ LLKILTQALQSSGVDMSQAS+AVQIELGKRTN R+   +P IVDDSA
Sbjct: 481 GGRISISGAYSQGLLKILTQALQSSGVDMSQASVAVQIELGKRTNYREIVPSP-IVDDSA 540

BLAST of Cp4.1LG20g01070 vs. NCBI nr
Match: gi|778657162|ref|XP_011650369.1| (PREDICTED: transcription factor BIM1 isoform X1 [Cucumis sativus])

HSP 1 Score: 755.7 bits (1950), Expect = 5.2e-215
Identity = 417/570 (73.16%), Positives = 453/570 (79.47%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RPLGTEGRKPTHDFLSLY HSTA QQD SQSFQGGFL THDFL+PLER  KTC
Sbjct: 1   MELPQPPRPLGTEGRKPTHDFLSLYGHSTALQQDPSQSFQGGFLKTHDFLRPLERTGKTC 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
           AK+E+T+NVSTVERAAP+L T A NS  ERLLPGGIG+  I HISYFN RVPK EGSVFP
Sbjct: 61  AKEEKTINVSTVERAAPTLGTQAHNSAAERLLPGGIGTYSISHISYFNQRVPKPEGSVFP 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLR-EPPAKMEQWA 180
           VPQASNTDKSDDNSRCSSLS +GF LWEESAVKKGKTRKENL E P L+ E PAK+EQW 
Sbjct: 121 VPQASNTDKSDDNSRCSSLSGNGFTLWEESAVKKGKTRKENLGEKPALKAESPAKIEQWT 180

Query: 181 VSTERPIQSSC----SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIK 240
           V+TERP+QSS     SSFSSLS SQPSGQQ RSFGEMLKS  NVSS EEELDD KAFVIK
Sbjct: 181 VTTERPMQSSSSNHRSSFSSLSPSQPSGQQSRSFGEMLKSTVNVSSMEEELDDDKAFVIK 240

Query: 241 KESSPSTAYRVTCRF----------------------QKLRELL-----------PCSDQ 300
           KESSPSTAY+   +                       Q+ R  +           P SDQ
Sbjct: 241 KESSPSTAYKGDLKINICGKSSDQKANTPRSKHSATEQRRRSKINDRFQKLRELIPRSDQ 300

Query: 301 KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQVA 360
           KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWY EPAKL+PCRNNCNPA+ YIDQSQ+A
Sbjct: 301 KRDKASFLLEVIEYIQFLQEKVRKYESSPPQGWYHEPAKLIPCRNNCNPAQCYIDQSQIA 360

Query: 361 KSGPVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPML 420
           KSGPVFIFAGSD+KN  H PAF PRCSHNPVES  +TSTTF EAD H G+ NK T YPML
Sbjct: 361 KSGPVFIFAGSDEKNMCHSPAF-PRCSHNPVESEVSTSTTFREADQHPGTNNK-TCYPML 420

Query: 421 DPRYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETRACTTDIAAVDNKLKEPEQSI 480
           DPR+F PV SEG KT+++SQV H+AD KP ++Q L CE R+CTT+I   +NKLKEPEQ I
Sbjct: 421 DPRHFTPVISEGAKTRLHSQVGHNADNKPCEIQPLSCEMRSCTTNIVDGNNKLKEPEQRI 480

Query: 481 EGGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDS 531
           +GGRISIS AYSQ LLKILTQALQSSGVDMSQAS+AVQIELGKRTN R+   +P IVDDS
Sbjct: 481 DGGRISISGAYSQGLLKILTQALQSSGVDMSQASVAVQIELGKRTNYREIVPSP-IVDDS 540

BLAST of Cp4.1LG20g01070 vs. NCBI nr
Match: gi|731426138|ref|XP_010663503.1| (PREDICTED: transcription factor BIM1 isoform X1 [Vitis vinifera])

HSP 1 Score: 441.4 bits (1134), Expect = 2.2e-120
Identity = 282/565 (49.91%), Positives = 351/565 (62.12%), Query Frame = 1

Query: 1   MELPQPSRPLGTEGRKPTHDFLSLYSHSTADQQDLSQSFQGGFLTTHDFLQPLERIEKTC 60
           MELPQP RP GTEGRKPTHDFLSL SHST  QQD   S Q G+L THDFLQPLER EK  
Sbjct: 1   MELPQP-RPFGTEGRKPTHDFLSLCSHSTV-QQDPRPS-QAGYLKTHDFLQPLERGEKNS 60

Query: 61  AKKEQTMNVSTVERAAPSLATLARNSPVERLLPGGIGSCIIGHISYFNHRVPKTEGSVFP 120
            K+E  + + TV++  P       NS VE +LPGGIG+  I HISYFN R+PK EGS+F 
Sbjct: 61  IKEENAIEIRTVDKPPPPAPP--PNSSVEHILPGGIGTYSISHISYFNQRLPKPEGSIFT 120

Query: 121 VPQASNTDKSDDNSRCSSLSASGFKLWEESAVKKGKTRKENLSENPGLREPPAKMEQWAV 180
             QAS +D++++NS CSS + SGF LWEE+AVKKGK  KEN  E     EP  K+ QW  
Sbjct: 121 AAQASTSDRNEENSNCSSYTGSGFTLWEETAVKKGKAGKENAVERSIGIEPAVKLGQW-- 180

Query: 181 STERPIQSSC---SSFSSLSSSQPSGQQKRSFGEMLKSAANVSSKEEELDDGKAFVIKKE 240
           ++ERP QS     SSFSSLSSSQ SGQ+ +SF EM++SA+   ++EEE +D + FV+KKE
Sbjct: 181 TSERPSQSPSNHRSSFSSLSSSQSSGQKNQSFMEMIQSASAKGTQEEEEEDEEEFVLKKE 240

Query: 241 SSPSTA------------------------------YRVTCRFQKLRELLPCSDQKRDKA 300
           SS +                                 ++  RFQ LR+L+P SDQKRDKA
Sbjct: 241 SSSNKGDLTVKVDGKSSDQKAVTPRSKHSATEQRRRSKINDRFQMLRDLIPHSDQKRDKA 300

Query: 301 SFLLEVIEYIQFLQEKVRKYESSPPQGWYDEPAKLLPCRNNCNPARRYIDQSQ--VAKSG 360
           SFLLEVIEYIQFLQEKV KYE S  QGW  E AKL+P RN+  PA  + DQS+   + SG
Sbjct: 301 SFLLEVIEYIQFLQEKVHKYEGS-FQGWNHESAKLMPWRNSHRPAESFADQSRGINSGSG 360

Query: 361 PVFIFAGSDQKNTSHFPAFLPRCSHNPVES--NTSTTFGEADHHLGSTNKATSYPM-LDP 420
           P  +F+    +N       + R + NPVES  + STTF   D H G TNKA    M L P
Sbjct: 361 PALMFSAKFDENNVAVSPNISRNTQNPVESDLSASTTFKAMDRHPGLTNKAVPIHMQLQP 420

Query: 421 RYFIPVTSEGEKTKIYSQVAHSADRKPYKMQRLLCETR-ACTTDIAAVDNKLKEPEQSIE 480
             F PV   G   ++  ++A  A+      Q  L ++R + TT+     +KLKE E +IE
Sbjct: 421 NIFTPVVGGGGLAQLPPRLAPDAENMASLPQSQLWQSRSSVTTECTVASDKLKEQELTIE 480

Query: 481 GGRISISSAYSQELLKILTQALQSSGVDMSQASIAVQIELGKRTNCRDTAANPAIVDDSA 527
           GG ISISSAYSQ LL  LTQALQSSGVD+S+ASI+VQI+LG + N R TA  P I D+  
Sbjct: 481 GGTISISSAYSQGLLNTLTQALQSSGVDLSKASISVQIDLGNKANSRPTAPTPIIKDNQV 540
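
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. The following is a minimal Python sketch for re-deriving them from a header line. The function name `check_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Matches fields such as "Identity = 420/577 (72.79%)".
# The label is matched generically, so any word before " = " is accepted.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_percentages(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    results = {}
    for label, num, den, reported in HSP_FIELD.findall(line):
        matches, length = int(num), int(den)
        # Percentage = matching positions / alignment length, to two decimals.
        recomputed = round(100 * matches / length, 2)
        results[label] = (matches, length, float(reported), recomputed)
    return results


if __name__ == "__main__":
    header = "Identity = 420/577 (72.79%), Positives = 455/577 (78.86%), Query Frame = 1"
    for label, fields in check_hsp_percentages(header).items():
        print(label, fields)
```

Fields without a `matches/length` fraction, such as `Query Frame = 1`, are skipped by the pattern.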

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
BIM1_ARATH  1.4e-63   38.85         Transcription factor BIM1 OS=Arabidopsis thaliana GN=BIM1 PE=1 SV=2
BIM3_ARATH  1.0e-16   34.15         Transcription factor BIM3 OS=Arabidopsis thaliana GN=BIM3 PE=1 SV=1
BIM2_ARATH  5.0e-16   30.09         Transcription factor BIM2 OS=Arabidopsis thaliana GN=BIM2 PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LSX0_CUCSA  1.5e-216  73.29         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G039980 PE=4 SV=1
D7SIP8_VITVI      1.3e-119  49.91         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g04790 PE=4 SV=...
A0A061G597_THECC  1.6e-117  48.60         Transcription factor BIM1, putative isoform 2 OS=Theobroma cacao GN=TCM_014392 P...
B9IE21_POPTR      3.5e-117  49.13         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s03550g PE=4 SV=2
A5AMI1_VITVI      6.6e-116  51.02         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_038183 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G08130.5  7.6e-63   38.56         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G38860.1  5.7e-18   34.15         BES1-interacting Myc-like protein 3
AT1G69010.1  2.8e-17   30.09         BES1-interacting Myc-like protein 2

Match Name                        E-value   Identity (%)  Description
gi|659066651|ref|XP_008454530.1|  5.1e-218  72.79         PREDICTED: transcription factor BIM1-like isoform X1 [Cucumis melo]
gi|659066653|ref|XP_008454608.1|  8.6e-218  73.86         PREDICTED: transcription factor BIM1-like isoform X2 [Cucumis melo]
gi|778657165|ref|XP_011650371.1|  2.1e-216  73.29         PREDICTED: transcription factor BIM1 isoform X2 [Cucumis sativus]
gi|778657162|ref|XP_011650369.1|  5.2e-215  73.16         PREDICTED: transcription factor BIM1 isoform X1 [Cucumis sativus]
gi|731426138|ref|XP_010663503.1|  2.2e-120  49.91         PREDICTED: transcription factor BIM1 isoform X1 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0046983      protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g01070.1  Cp4.1LG20g01070.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term        Source Description                         Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10                                             coord: 249..288, score: 1.5
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                        coord: 249..281, score: 7.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                       coord: 230..280, score: 9
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain   coord: 247..289, score: 4.0
None       No IPR available                                unknown  Coil               Coil                                       coord: 513..530, scor
None       No IPR available                                PANTHER  PTHR12565          STEROL REGULATORY ELEMENT-BINDING PROTEIN  coord: 1..342, score: 1.4E-131; coord: 379..490, score: 1.4E
None       No IPR available                                PANTHER  PTHR12565:SF153    TRANSCRIPTION FACTOR BIM1                  coord: 379..490, score: 1.4E-131; coord: 1..342, score: 1.4E

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                      Block
Cp4.1LG20g01070  Cucsa.113490    Cucumber (Gy14) v1            cgycpeB0257
Cp4.1LG20g01070  CmaCh11G002330  Cucurbita maxima (Rimu)       cmacpeB139
Cp4.1LG20g01070  CmaCh13G010540  Cucurbita maxima (Rimu)       cmacpeB234
Cp4.1LG20g01070  CmaCh18G000150  Cucurbita maxima (Rimu)       cmacpeB446
Cp4.1LG20g01070  CmaCh10G002770  Cucurbita maxima (Rimu)       cmacpeB084
Cp4.1LG20g01070  CmoCh10G002940  Cucurbita moschata (Rifu)     cmocpeB059
Cp4.1LG20g01070  CmoCh18G000160  Cucurbita moschata (Rifu)     cmocpeB406
Cp4.1LG20g01070  CmoCh13G010970  Cucurbita moschata (Rifu)     cmocpeB201
Cp4.1LG20g01070  CmoCh11G002360  Cucurbita moschata (Rifu)     cmocpeB116
Cp4.1LG20g01070  Cla018537       Watermelon (97103) v1         cpewmB523
Cp4.1LG20g01070  Cla019030       Watermelon (97103) v1         cpewmB529
Cp4.1LG20g01070  Csa1G039980     Cucumber (Chinese Long) v2    cpecuB505
Cp4.1LG20g01070  Csa1G015080     Cucumber (Chinese Long) v2    cpecuB509
Cp4.1LG20g01070  MELO3C017257    Melon (DHL92) v3.5.1          cpemeB483
Cp4.1LG20g01070  MELO3C002057    Melon (DHL92) v3.5.1          cpemeB478
Cp4.1LG20g01070  ClCG04G011530   Watermelon (Charleston Gray)  cpewcgB471
Cp4.1LG20g01070  CSPI01G06420    Wild cucumber (PI 183967)     cpecpiB505
Cp4.1LG20g01070  Lsi02G001450    Bottle gourd (USVL1VR-Ls)     cpelsiB418
Cp4.1LG20g01070  Lsi06G013840    Bottle gourd (USVL1VR-Ls)     cpelsiB432
Cp4.1LG20g01070  MELO3C018898.2  Melon (DHL92) v3.6.1          cpemedB570
Cp4.1LG20g01070  MELO3C002057.2  Melon (DHL92) v3.6.1          cpemedB560
Cp4.1LG20g01070  MELO3C017257.2  Melon (DHL92) v3.6.1          cpemedB568
Cp4.1LG20g01070  CsaV3_1G006280  Cucumber (Chinese Long) v3    cpecucB0623
Cp4.1LG20g01070  CsaV3_1G003270  Cucumber (Chinese Long) v3    cpecucB0627
Cp4.1LG20g01070  CsaV3_4G010010  Cucumber (Chinese Long) v3    cpecucB0646
Cp4.1LG20g01070  Bhi02G000036    Wax gourd                     cpewgoB0667
Cp4.1LG20g01070  Bhi08G000882    Wax gourd                     cpewgoB0652
Cp4.1LG20g01070  CsGy4G009480    Cucumber (Gy14) v2            cgybcpeB538
Cp4.1LG20g01070  CsGy1G003190    Cucumber (Gy14) v2            cgybcpeB089
Cp4.1LG20g01070  CsGy1G006120    Cucumber (Gy14) v2            cgybcpeB069
Cp4.1LG20g01070  Carg22774       Silver-seed gourd             carcpeB0724
Cp4.1LG20g01070  Carg17884       Silver-seed gourd             carcpeB0647
Cp4.1LG20g01070  Carg09179       Silver-seed gourd             carcpeB0846
Cp4.1LG20g01070  Carg04782       Silver-seed gourd             carcpeB0544
The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG20g01070  Cp4.1LG09g11620  Cucurbita pepo (Zucchini)  cpecpeB047
Cp4.1LG20g01070  Cp4.1LG18g07390  Cucurbita pepo (Zucchini)  cpecpeB349
Cp4.1LG20g01070  Cp4.1LG04g14350  Cucurbita pepo (Zucchini)  cpecpeB442
The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG20g01070  Silver-seed gourd             carcpeB0725
Cp4.1LG20g01070  Cucumber (Chinese Long) v3    cpecucB0642
Cp4.1LG20g01070  Cucurbita pepo (Zucchini)     cpecpeB049
Cp4.1LG20g01070  Cucumber (Gy14) v1            cgycpeB0558
Cp4.1LG20g01070  Cucurbita maxima (Rimu)       cmacpeB439
Cp4.1LG20g01070  Cucurbita moschata (Rifu)     cmocpeB404
Cp4.1LG20g01070  Wild cucumber (PI 183967)     cpecpiB509
Cp4.1LG20g01070  Wild cucumber (PI 183967)     cpecpiB512
Cp4.1LG20g01070  Watermelon (Charleston Gray)  cpewcgB475
Cp4.1LG20g01070  Cucumber (Gy14) v2            cgybcpeB073