Cp4.1LG14g04460 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g04460
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Transcriptional adapter 1
Location: Cp4.1LG14 : 1576208 .. 1579445 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGAACTTCCAACTCGTTCAAATGAACCAGACAAAAACAGAAAACGAAACTACTGTTCCAGATCCTTCTTCCATCAATTTTCTTGGAAACCAGAGGAGTGAGAAGACATTTGCTTACCAGGTGCGAGAACCAGGGCCATAGCTCTTCGGATGAGAGTTCCACACATTGGAGTGCCCCATGATTGTGAGGTATTGAAGCGCTTCGGAAATCACAACGTGAAGCAGAGATTTTGCGGGGAAGAAGAAGTCGGCAAAAAGGATTGCTTTTATATGAGAAATAAACCCTAATTGTGGTGGCGTTTTTACGATTTTAGTCCCATAAAAAAAACCCGACCCGATTGAAGGAGACGGTTTATGGTCCAGCCCATTATAATATTTTACAATATTACCTGCCAAATAATTTAAAAATAAAAAATATATATGTATATATAACCGTAAGCAAAAATATATATTTTAAAATTAATAATAAATGTAAGCTTGTTTGTCGAGCTCATGAACCAAACAATCCATAAAATTTCATTTATATGAACTATAGCTCAACCCAAATCGTAGGAGTTGGGTTGAGTTGTGTTGATCGTGTTTTCGGATCATTTGATTATTTTTAACGTACTTAATTGAAAGTATAAAAATTTGAACTTTTATCCTCAAGTCGCAATCTTTTTCTCTTTAAAAAAAATTAATTTCTCAGTACGACAAAGTTTTAAGAAGATATTTACTAAGAACACACAACTCGTCATCGAATGAATGCAAGATAAACCATATCGTTGAAATGCAACAATGATGTCTCGAGACGTGTGACATGCAGTACCAATGAAATGTCCGATAAATCATTGTATAGAACTGCGAAGACTGCATCAAAGATGCACTTGCTAGTATTAGTACATTTGAGAATTTTTAATTGGCTAGTTGTAAATAGACACAGACTGTGGAACAAAGGATTATCCCGAACCTATACCGAGGTATTGACAATGATTCTCAAATATCGCAGAACAGAACAAAGGCACAGGGAACGAAACGGGTTACCTACTCTTAATGATCGCAACAAGGAACGGAACGTGTTACCTACTCTTAACAGTTAAAGCGAACCCTTTCATTCCGAATTCTCGAGCCAAGAGAAAAATTAATCCATAGGAGGAAAGAAAAAAAGGAAAAAAAAAAAAACATGTTTGGAATTGTAAATAAATAAATAAATAAATAAAGAGAAAAAAACATGTCTTGGACCAAAAATATAAAATAGAGGGCGCACGTGCGGAATGGAATTGCGATGCGATTGGCCGAGGAGACACACATTCGCAAGAACTCCCTTCGAAGGTCGCTGTTTTGATTTTGGTCTCCAATCTGCAAATCGCCAAAATCAATCCCACCATAAACAATTCAAAGAAACGGCAAAAAGTTGCGAATCTAGATTCAGAAGACGGCGGCGCTAAATCATAGAGCGTTGATCCCAATTAGGCAACACTTCGGTCAATTTTTGTAAGTATCTGAATTGTTTGCTTGCTAATTAGGGTTTTCACTTCCTGTTCCAATTTGGAGTTGTAATTGAATGCTTCATGTTTGCGACACTTCTTTGCATCATTGATTGCTTTAGGATTTTGGTTTTTATTGTTCCGACAATGATAAGAAAAAACCCTTGTCGGGGTACGATTTTCCTGCAATCGATGTTTTGTTTCGAAATGAAATGTAATATGGTTTGTTTCGAGCTGTTCTTCAGTTTGGGGTTTGTTCCTCATGATTTCAAAGTTCAGCAATCGATAATCAAATGTTTTGAAACTTAAGCTGTGAATTTAGGGTTTCTGGCTTTCTGATGAATCATTTCTTGCACTTTCAGTCTACCGATTTGTGGGGGATTCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGAAATTGCGGCCCTGAAAATGGTTCCCAGGAAAGACAATTCTCGTATAGATACATCTGAGCTGAAAGCTATGATCTATCGAAAGCTTGGACATCAGAGATCGGAGAAATACTTTGATCAG
CTCAAAAAACTGTTGAGTTTAAAGATTAACAAAAGAGAATTCGACAAGTTTTGCATTCAGATTATTGGGAGGGAGATCATACCTCTTCATAATCGGTTTATCAAGGCAATTCTTCAAAATGCATGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACTAGGAAAGTAGATAGCAATCTCAGTGTAAAGGTTGTGAATGGGTATCAGAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTTCTTTCCTCCCCTCGAAAGGGAAGGTCTCCGGTCAGTAGAGACCGTAAGATTCGAGATCGTCCGAGTCCTCTAGGACCATGTGGAAAGCCCCAGAATATTGCACTTGAAGAACTTGCTTTCAAGGCACAAGAACAGCAAAGTGCAACAGAGTTGCATTCTCTTGGCAGCCGTCCTCCGGTGGATATGGCATCCGTAGAAGATGGAGAAGAGGTTGAGCAGGTAGCTGGAAGTCCAGGAGTTCAGAGTAGAAGCCCGGTTACCGCTCCGCTTGGAATATCCATGAACTTTGTTGGGTCTGGTAAAACTCTGTCTAATATATCTGTAGGAAGAAGAAACTGTCATGTAACAACATGTCAAAACGGCGGCGAGCTACCCGACACGAGGTTGCTAAGGACTCATTTGAAGCAAAAGTTGGAAATGGAGCAGATTGATATATCTGTTGATGGTGTAAACCTTCTTAACAATGCACTGGATGTTTACTTAAAGAGGTTGATTGAACCATGTTTGAGTTTCTCTCGGTCGAGGTGCGAGCGACCGAGATTTACAGACAATCAACCAATAACTGGCTCGAGAATCGCATTGAAGGAACAATATAGGCACAGAGCTCAACGATTGAATGCATCGTTGTTGGACTTTCGTGTTGCAATGCAACTGAATCCTGAAGTTCTTGGGAGAGACTGGACGTCACAGCTCGAGAAGATCAGTTTACGAGTTTCGGAAGAGTGAAGTGGCCGAGTTGATCGGATCTAAATGAGAAAGAGTATAGTAGGATGGATATAGGTAAAAGGTGATCATATATTGTGAATAATTTGATGGGGTTCGGTGTGTTTGAAAGTTGGTTGCTAATTTTTGCGATTCTTGTGAGGAGGAGGATAGGTTTTGTGATCGATTTGGAGCCTTGAATCGAGTTGGTTTAGGTTACATGTAGATAAAGGCTCCAATGATCATTGTCCTGAGGAATGTGTTTATGATATAATAAAAGCATTCAACTTCTG

mRNA sequence

ATGAACTTCCAACTCGTTCAAATGAACCAGACAAAAACAGAAAACGAAACTACTGTTCCAGATCCTTCTTCCATCAATTTTCTTGGAAACCAGAGGAGTGAGAAGACATTTGCTTACCAGAAGACGGCGGCGCTAAATCATAGAGCGTTGATCCCAATTAGGCAACACTTCGTCTACCGATTTGTGGGGGATTCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGAAATTGCGGCCCTGAAAATGGTTCCCAGGAAAGACAATTCTCGTATAGATACATCTGAGCTGAAAGCTATGATCTATCGAAAGCTTGGACATCAGAGATCGGAGAAATACTTTGATCAGCTCAAAAAACTGTTGAGTTTAAAGATTAACAAAAGAGAATTCGACAAGTTTTGCATTCAGATTATTGGGAGGGAGATCATACCTCTTCATAATCGGTTTATCAAGGCAATTCTTCAAAATGCATGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACTAGGAAAGTAGATAGCAATCTCAGTGTAAAGGTTGTGAATGGGTATCAGAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTTCTTTCCTCCCCTCGAAAGGGAAGGTCTCCGGTCAGTAGAGACCGTAAGATTCGAGATCGTCCGAGTCCTCTAGGACCATGTGGAAAGCCCCAGAATATTGCACTTGAAGAACTTGCTTTCAAGGCACAAGAACAGCAAAGTGCAACAGAGTTGCATTCTCTTGGCAGCCGTCCTCCGGTGGATATGGCATCCGTAGAAGATGGAGAAGAGGTTGAGCAGGTAGCTGGAAGTCCAGGAGTTCAGAGTAGAAGCCCGGTTACCGCTCCGCTTGGAATATCCATGAACTTTGTTGGGTCTGGTAAAACTCTGTCTAATATATCTGTAGGAAGAAGAAACTGTCATGTAACAACATGTCAAAACGGCGGCGAGCTACCCGACACGAGGTTGCTAAGGACTCATTTGAAGCAAAAGTTGGAAATGGAGCAGATTGATATATCTGTTGATGGTGTAAACCTTCTTAACAATGCACTGGATGTTTACTTAAAGAGGTTGATTGAACCATGTTTGAGTTTCTCTCGGTCGAGGTGCGAGCGACCGAGATTTACAGACAATCAACCAATAACTGGCTCGAGAATCGCATTGAAGGAACAATATAGGCACAGAGCTCAACGATTGAATGCATCGTTGTTGGACTTTCGTGTTGCAATGCAACTGAATCCTGAAGTTCTTGGGAGAGACTGGACGTCACAGCTCGAGAAGATCAGTTTACGAGTTTCGGAAGAGTGAAGTGGCCGAGTTGATCGGATCTAAATGAGAAAGAGTATAGTAGGATGGATATAGGTAAAAGGTGATCATATATTGTGAATAATTTGATGGGGTTCGGTGTGTTTGAAAGTTGGTTGCTAATTTTTGCGATTCTTGTGAGGAGGAGGATAGGTTTTGTGATCGATTTGGAGCCTTGAATCGAGTTGGTTTAGGTTACATGTAGATAAAGGCTCCAATGATCATTGTCCTGAGGAATGTGTTTATGATATAATAAAAGCATTCAACTTCTG

Coding sequence (CDS)

ATGAACTTCCAACTCGTTCAAATGAACCAGACAAAAACAGAAAACGAAACTACTGTTCCAGATCCTTCTTCCATCAATTTTCTTGGAAACCAGAGGAGTGAGAAGACATTTGCTTACCAGAAGACGGCGGCGCTAAATCATAGAGCGTTGATCCCAATTAGGCAACACTTCGTCTACCGATTTGTGGGGGATTCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGAAATTGCGGCCCTGAAAATGGTTCCCAGGAAAGACAATTCTCGTATAGATACATCTGAGCTGAAAGCTATGATCTATCGAAAGCTTGGACATCAGAGATCGGAGAAATACTTTGATCAGCTCAAAAAACTGTTGAGTTTAAAGATTAACAAAAGAGAATTCGACAAGTTTTGCATTCAGATTATTGGGAGGGAGATCATACCTCTTCATAATCGGTTTATCAAGGCAATTCTTCAAAATGCATGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACTAGGAAAGTAGATAGCAATCTCAGTGTAAAGGTTGTGAATGGGTATCAGAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTTCTTTCCTCCCCTCGAAAGGGAAGGTCTCCGGTCAGTAGAGACCGTAAGATTCGAGATCGTCCGAGTCCTCTAGGACCATGTGGAAAGCCCCAGAATATTGCACTTGAAGAACTTGCTTTCAAGGCACAAGAACAGCAAAGTGCAACAGAGTTGCATTCTCTTGGCAGCCGTCCTCCGGTGGATATGGCATCCGTAGAAGATGGAGAAGAGGTTGAGCAGGTAGCTGGAAGTCCAGGAGTTCAGAGTAGAAGCCCGGTTACCGCTCCGCTTGGAATATCCATGAACTTTGTTGGGTCTGGTAAAACTCTGTCTAATATATCTGTAGGAAGAAGAAACTGTCATGTAACAACATGTCAAAACGGCGGCGAGCTACCCGACACGAGGTTGCTAAGGACTCATTTGAAGCAAAAGTTGGAAATGGAGCAGATTGATATATCTGTTGATGGTGTAAACCTTCTTAACAATGCACTGGATGTTTACTTAAAGAGGTTGATTGAACCATGTTTGAGTTTCTCTCGGTCGAGGTGCGAGCGACCGAGATTTACAGACAATCAACCAATAACTGGCTCGAGAATCGCATTGAAGGAACAATATAGGCACAGAGCTCAACGATTGAATGCATCGTTGTTGGACTTTCGTGTTGCAATGCAACTGAATCCTGAAGTTCTTGGGAGAGACTGGACGTCACAGCTCGAGAAGATCAGTTTACGAGTTTCGGAAGAGTGA

Protein sequence

MNFQLVQMNQTKTENETTVPDPSSINFLGNQRSEKTFAYQKTAALNHRALIPIRQHFVYRFVGDSEGFVSNSAIAEIAALKMVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQPITGSRIALKEQYRHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE
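
The protein sequence above is consistent with a standard-code translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first ten codons of the CDS can be translated in Python as shown here. The helper name `translate` and the table construction are illustrative assumptions, not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built by enumerating
# codons in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above (hypothetical variable name).
cds_start = "ATGAACTTCCAACTCGTTCAAATGAACCAG"
print(translate(cds_start))  # MNFQLVQMNQ
```

Passing the full CDS string to `translate` in the same way should reproduce the protein sequence, ending at the terminal TGA stop codon.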
BLAST of Cp4.1LG14g04460 vs. TrEMBL
Match: A0A0A0KWF9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000960 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 5.2e-183
Identity = 334/373 (89.54%), Positives = 351/373 (94.10%), Query Frame = 1

Query: 66  EGFVSNSAIAEIAALKMVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKIN 125
           EGFVSNSAIA IAALKM+PRKD SRIDTSELKAMIYRKLGHQRS+KYFDQLKKLLSLK N
Sbjct: 91  EGFVSNSAIAGIAALKMLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTN 150

Query: 126 KREFDKFCIQIIGREIIPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQR 185
           KREFDKFCIQIIGREIIPLHNR I+AILQNACVAKTPPVLSSTRKV  NLSVKVVNGYQR
Sbjct: 151 KREFDKFCIQIIGREIIPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQR 210

Query: 186 SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSAT 245
           SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSAT
Sbjct: 211 SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSAT 270

Query: 246 ELHSLGSRPPVDMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVG 305
           ELHSLGSRPPV+MASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSN+ VG
Sbjct: 271 ELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVG 330

Query: 306 RRNCHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCL 365
             N HVTTCQ+ GELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCL
Sbjct: 331 -SNYHVTTCQDVGELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCL 390

Query: 366 SFSRSRCERPRFTDNQPITGSRIALKEQYRHRAQRL-NASLLDFRVAMQLNPEVLGRDWT 425
           +FSRSRCER +FT NQPITGSRI  +EQ+RHRAQ+L N SLLDFRVAMQLNP+VLGR+WT
Sbjct: 391 NFSRSRCERLKFTGNQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWT 450

Query: 426 SQLEKISLRVSEE 438
            QLEKISLR SEE
Sbjct: 451 MQLEKISLRASEE 462
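
The identity and positives percentages in each HSP header are simply the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, assuming the statistics line has the layout shown for this hit. The pattern `FIELD` and the function `hsp_percentages` are hypothetical names introduced for illustration.

```python
import re

# Matches fields of the form "<label> = <matches>/<length> (<percent>%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_percentages(line: str) -> dict:
    """Recompute each percentage in an HSP statistics line from its counts."""
    return {
        label: f"{100 * int(n) / int(total):.2f}"
        for label, n, total, _reported in FIELD.findall(line)
    }

# Statistics line for the Cucumis sativus hit (A0A0A0KWF9_CUCSA).
line = "Identity = 334/373 (89.54%), Positives = 351/373 (94.10%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': '89.54', 'Positives': '94.10'}
```

The recomputed values agree with the reported ones: 334 of 373 aligned positions are identical, and 351 of 373 are identical or conservative substitutions.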

BLAST of Cp4.1LG14g04460 vs. TrEMBL
Match: A0A067LJK3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16307 PE=4 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 1.0e-114
Identity = 220/359 (61.28%), Positives = 276/359 (76.88%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M P +  +RI+T ELKA+I +K+GH+R+EKYFDQL +L S KI K EFDKFC++IIGRE 
Sbjct: 1   MSPNQSYTRINTLELKALIVKKIGHERAEKYFDQLTRLFSFKITKSEFDKFCVRIIGREN 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHN  I++I++NAC++K PP  +  R+  S+L+VK  NGY ++CLQSL+GDAF  SPR
Sbjct: 61  IPLHNHLIRSIVKNACLSKVPPQKAIKRQA-SSLNVKTANGYHKNCLQSLYGDAFPPSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRSPV+R RK RDRPSPLGP GKPQ++  EEL+ +AQEQQSATELHSLGSRPP ++ASV
Sbjct: 121 KGRSPVNRYRKFRDRPSPLGPLGKPQSLVCEELSSRAQEQQSATELHSLGSRPPAEVASV 180

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           E+GEEVEQVAGSPGVQSRSPVTAPLG+SMN  G+ K LS+ +V   + H  TC N GELP
Sbjct: 181 EEGEEVEQVAGSPGVQSRSPVTAPLGVSMNLGGARKALSSFTVCGSH-HQETCVNSGELP 240

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERP--RFTD 381
           DTR LR+ L+QKL ME I++S+D VNLLNN LD YLKRLIEPC+  + SRC     +  +
Sbjct: 241 DTRSLRSRLEQKLGMEGINVSMDCVNLLNNGLDTYLKRLIEPCMGLASSRCGNGHLKMVN 300

Query: 382 NQPITGSRIALKEQY-RHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
            Q + G    L  +Y + R + + AS+LDF VAM++NP++LG DW   LEKISLR SEE
Sbjct: 301 GQLLPGLDGRLPGRYMQRRTESVYASMLDFHVAMEVNPQILGEDWIILLEKISLRASEE 357

BLAST of Cp4.1LG14g04460 vs. TrEMBL
Match: B9HMX5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s09370g PE=4 SV=2)

HSP 1 Score: 416.8 bits (1070), Expect = 3.3e-113
Identity = 221/349 (63.32%), Positives = 272/349 (77.94%), Query Frame = 1

Query: 89  SRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPLHNRF 148
           SRIDT ELK++I +K+GHQR++KYFD+L +L SLKI K EFDK CI+IIGRE IPLHNR 
Sbjct: 8   SRIDTLELKSLILKKIGHQRADKYFDELTQLFSLKITKCEFDKLCIRIIGRENIPLHNRL 67

Query: 149 IKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVS 208
           I++IL+NAC+ K PP     R+  SNL+VK  NG+QR+ LQSL+ DAF SSPRKGRSPV+
Sbjct: 68  IRSILKNACLGKVPPP-KGVRRAGSNLTVKTTNGHQRNYLQSLYRDAFPSSPRKGRSPVN 127

Query: 209 RDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASVEDGEEVE 268
           RDRK RDRPSPLGP GKPQ++A EEL  +AQEQQSATELHSLGSRPP+++ASVE+GEEVE
Sbjct: 128 RDRKFRDRPSPLGPLGKPQSMACEELNSRAQEQQSATELHSLGSRPPIEVASVEEGEEVE 187

Query: 269 QVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELPDTRLLRT 328
           Q+A SPGVQSRSPVTAP GIS+N  GS K LSNIS+G  N    TC N GELPDTR LR+
Sbjct: 188 QMAVSPGVQSRSPVTAPFGISLNPGGSRKALSNISIGS-NYIPETCLNSGELPDTRSLRS 247

Query: 329 HLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQPITGSRI 388
            L++KLEME I +S+D VN+LN  LD YLKRLIEPC++ + +RC      D++ + G   
Sbjct: 248 RLERKLEMEGIGVSLDCVNVLNIGLDAYLKRLIEPCMALAGARC------DSEQLKG--- 307

Query: 389 ALKEQYRHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
           A  +  + + + +NAS+LDFRVAM+ NP++LG DW  QLEKISL   EE
Sbjct: 308 ANGQYVKRQTESVNASMLDFRVAMESNPQILGEDWPVQLEKISLSGFEE 345

BLAST of Cp4.1LG14g04460 vs. TrEMBL
Match: A5C224_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_008500 PE=4 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 5.7e-113
Identity = 227/359 (63.23%), Positives = 273/359 (76.04%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           MVP ++ +RIDT ELKA+I R++G Q++EKYFDQLK+L SLK++KREFDKFC+  IGRE 
Sbjct: 1   MVPNQNFTRIDTLELKALIVRRIGRQKAEKYFDQLKRLFSLKLSKREFDKFCLWTIGREN 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHNR I +IL+NAC++K PP+    R V     VKV NGYQR+CLQSL+GDAF  SPR
Sbjct: 61  IPLHNRLIGSILKNACLSKVPPLKG--RNVGVPKDVKVANGYQRNCLQSLYGDAFPPSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRS V+RDR+ RDR SPLGP GKPQ++A EELA KAQEQQSATEL SLGSRPP ++ SV
Sbjct: 121 KGRSQVNRDRRFRDRLSPLGPLGKPQSVACEELASKAQEQQSATELLSLGSRPPGEVVSV 180

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           EDGEEVEQ+AGSP VQSRSPV AP GISMN +G  K+L N SV   N H  TC N GELP
Sbjct: 181 EDGEEVEQLAGSPSVQSRSPVRAPFGISMN-MGGRKSLCNGSV--CNYHPETCHNSGELP 240

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRC--ERPRFTD 381
           DT  LR+HL++KLEME   +S+D VNLLNN+LDV+LKRLIEPCL    SRC  E  R  +
Sbjct: 241 DTGSLRSHLERKLEMEGFSVSMDCVNLLNNSLDVFLKRLIEPCLQLVGSRCGNEHLRQLN 300

Query: 382 NQPITGSRIALKEQYRHRAQRLN-ASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
            Q + G    L  +Y  ++++   AS+LDFRVAM+LNP++LG DW  QLEKI L  SEE
Sbjct: 301 AQTLPGMNRILPGRYIQKSRKPKYASVLDFRVAMELNPQILGEDWPVQLEKICLHASEE 354

BLAST of Cp4.1LG14g04460 vs. TrEMBL
Match: A0A061E832_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_007200 PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 9.7e-113
Identity = 229/359 (63.79%), Positives = 276/359 (76.88%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M+  ++ +R+DT ELKA+I RK+GHQR+EKYFDQL++L SLKI K +FDK CI+ IGRE 
Sbjct: 1   MMLNQNYARVDTLELKALIVRKVGHQRAEKYFDQLRRLFSLKIGKCDFDKSCIKTIGREN 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHNR I++I++NAC+AK PP L + +K  SNL  ++ NGYQR+ LQSL+GDAF  SPR
Sbjct: 61  IPLHNRLIRSIIKNACIAKVPP-LKTIKKGGSNL--QIGNGYQRNRLQSLYGDAFPPSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRSPV+RDRK RDRPSPLGP GKPQ+I  EE   KAQE QSATEL SLGSRPP ++ASV
Sbjct: 121 KGRSPVNRDRKFRDRPSPLGPLGKPQSIVCEESVSKAQE-QSATELLSLGSRPPAEVASV 180

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           EDGEEVEQVAGSPGVQSRSPVTAPLGIS+NF G+ K LSN  V   N H+ TCQN GELP
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISINFGGARKALSNAFVS-NNYHLETCQNRGELP 240

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFS--RSRCERPRFTD 381
           DTR LR+ L+QKLEME I +SVD VNLLNN LD +LKRLIEPC++ +  RS     + ++
Sbjct: 241 DTRSLRSRLQQKLEMEGISVSVDCVNLLNNGLDAFLKRLIEPCVALAGLRSGDGNLKQSN 300

Query: 382 NQPITGSRIALKEQY-RHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
            Q I      L   Y +H A+  +AS+LDFR AM+LNP+VLG DW  QLEKISL   E+
Sbjct: 301 GQFIPRLNGMLHRNYLQHSAKSCHASMLDFRAAMELNPQVLGEDWAMQLEKISLSSFED 354

BLAST of Cp4.1LG14g04460 vs. TAIR10
Match: AT4G33890.1 (AT4G33890.1 unknown protein)

HSP 1 Score: 318.5 bits (815), Expect = 6.2e-87
Identity = 182/363 (50.14%), Positives = 250/363 (68.87%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M   + +SR+DT E+KA+IYR++G+QR+E YF+QL +  +LKI K EFDK CI+ IGR+ 
Sbjct: 1   MGSNQGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQN 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGD-AFLSSP 201
           I LHNR I++I++NAC+AK+PP +   +K  S +     +  + S +Q LHGD AF  S 
Sbjct: 61  IHLHNRLIRSIIKNACIAKSPPFI---KKGGSFVRFGNGDSKKNSQIQPLHGDSAFSPST 120

Query: 202 RKGRSPVSRDRKIRDRPSPLGPCGKPQNIAL--EELAFKAQEQQSATELHSLGSRPPVDM 261
           RK RS     RK+RDRPSPLGP GKP ++    EE   KA   QSATEL SLGSRPPV++
Sbjct: 121 RKCRS-----RKLRDRPSPLGPLGKPHSLTTTNEESMSKA---QSATELLSLGSRPPVEV 180

Query: 262 ASVEDGEEVEQVA-GSPGVQSRSPVTAPLGISMNFVGSG--KTLSNISVGRRNCHVTTCQ 321
            SVE+GEEVEQ+A GSP VQSR P+TAPLG+SM+       K++SN+S+  R+ +  TCQ
Sbjct: 181 VSVEEGEEVEQIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQ 240

Query: 322 NGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERP 381
           N GELPDTR LR+ L+++LEME + I++D V+LLN+ LDV+++RLIEPCLS + +RC   
Sbjct: 241 NNGELPDTRTLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD 300

Query: 382 RFTDNQPITGSRIALKEQYRHRAQRLN-ASLLDFRVAMQLNPEVLGRDWTSQLEKISLRV 438
           R  +          +  QY  +++RL+  S+ DFR  M+LN E+LG DW   +EKI  R 
Sbjct: 301 RVRE----------MNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRA 342

BLAST of Cp4.1LG14g04460 vs. TAIR10
Match: AT2G14850.1 (AT2G14850.1 unknown protein)

HSP 1 Score: 260.0 bits (663), Expect = 2.6e-69
Identity = 157/350 (44.86%), Positives = 203/350 (58.00%), Query Frame = 1

Query: 89  SRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPLHNRF 148
           SR+++ E+KA+IY+K+GHQR++ YFDQL K L+ +I+K EFDK C + +GRE I LHNR 
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 149 IKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGD-AFLSSPRKGRSPV 208
           +++IL+NA VAK+PP                     R   +SL+GD  F  SPRK RS  
Sbjct: 68  VRSILKNASVAKSPP--------------------PRYPKKSLYGDPVFPPSPRKCRS-- 127

Query: 209 SRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASVEDGEEV 268
              RK RDRPSPLGP GKPQ++                E  S   R P+++ SVEDGEEV
Sbjct: 128 ---RKFRDRPSPLGPLGKPQSLTTTN-----------DESMSKAQRLPMEVVSVEDGEEV 187

Query: 269 EQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELPDTRLLR 328
           EQ+ GSP VQSRSP+TAPLG+S +     +  +   + R      TCQ+ GELPD   LR
Sbjct: 188 EQMTGSPSVQSRSPLTAPLGVSFHLKSKARFSTYNGINRE-----TCQSSGELPDMITLR 247

Query: 329 THLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQPITGSR 388
             L++KLEME I +S+D  NLLN  L+ Y++RLIEPCLS                     
Sbjct: 248 ARLEKKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLS--------------------- 291

Query: 389 IALKEQYRHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
             L  Q +      N S+LDF  AM++NP VLG +W  QLEKI  R SEE
Sbjct: 308 --LASQQKRAVS--NVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

BLAST of Cp4.1LG14g04460 vs. TAIR10
Match: AT2G24530.1 (AT2G24530.1 unknown protein)

HSP 1 Score: 155.6 bits (392), Expect = 7.0e-38
Identity = 131/404 (32.43%), Positives = 195/404 (48.27%), Query Frame = 1

Query: 85  RKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPL 144
           R  + RI   ELK  I +K G +RS +YF  L + LS K+ K EFDK C++++GRE + L
Sbjct: 3   RSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSL 62

Query: 145 HNRFIKAILQNACVAKTPPV-LSSTRKVDSNLSVKVVNGYQRSCL----QSLHGDAFLS- 204
           HN+ I++IL+NA VAK+PP    +     +N      +G ++S       S H   + + 
Sbjct: 63  HNQLIRSILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTLIPNHSQHEPVWSNG 122

Query: 205 ----SPRKGRSPVSRDRKIRDRPSPLGPCGKPQ--------------NIALEELAFKAQE 264
               SPRK RS + ++RK RDRPSPLG  GK +              ++ +E   ++   
Sbjct: 123 VLPISPRKVRSGM-QNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSG 182

Query: 265 QQSATELHSLGSRP----------PVDMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISM 324
           +  A E      RP           +   S+ D +  E+ A      S SP+ APLGI  
Sbjct: 183 RYVADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVN--LSMSPLIAPLGIPF 242

Query: 325 NFVGSGKTLSNISVGRRNCHVTTCQNGGELPDTRLLRTHLKQKLEMEQID-ISVDGVNLL 384
                G +   I V   N  + +C + G LPD  +LR  ++     + ++ +S++    L
Sbjct: 243 CSASVGGSPRTIPVS-TNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTL 302

Query: 385 NNALDVYLKRLIEPCLSFSRSRCER---------PRFTDNQPITG--SRIALKEQYRH-- 438
           NN LDVYLK+LI  C     +R             + + N+ + G     +LK Q  +  
Sbjct: 303 NNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGS 362

BLAST of Cp4.1LG14g04460 vs. TAIR10
Match: AT5G67410.1 (AT5G67410.1 unknown protein)

HSP 1 Score: 152.5 bits (384), Expect = 5.9e-37
Identity = 114/344 (33.14%), Positives = 173/344 (50.29%), Query Frame = 1

Query: 90  RIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPLHNRFI 149
           R D SELK+ I +++G  ++E Y + L K LSLKI+K +FDK  I  + RE I LHN  +
Sbjct: 10  RTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRENISLHNALL 69

Query: 150 KAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSR 209
           + IL+N C++KT P          N   K +NG  +S  + L       SPRKGR+    
Sbjct: 70  RGILKNICLSKTLPPFVKNGVESDNKKKKQLNGAFQSLCKELP-----RSPRKGRT---- 129

Query: 210 DRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASVEDGEEVEQ 269
               + R +  G   K +++  E ++   ++Q                  S+E+ EEV+Q
Sbjct: 130 ----QRRLNKDGNISKGKSLVTEVVSSSGRQQW-----------------SMENVEEVDQ 189

Query: 270 VAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELPDTRLLRTH 329
           +   P  +S+ P+ AP G+++  V          + +++   T C + GELPD+  L+  
Sbjct: 190 LI--PCWRSQ-PIEAPFGVNLRDV----------IKKQHRIDTCCYSSGELPDSVSLKKK 249

Query: 330 LKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQPITGSRIA 389
           L+  LE E +++SV   N LN  LDV+LKRLI+PCL  + SR                  
Sbjct: 250 LEDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASSA----------- 285

Query: 390 LKEQYRHRAQRLNASLLDFRVAMQLNPEVLGRDWTSQLEKISLR 434
                        +SL+DF+VAM LNP +LG DW ++LEKI+ R
Sbjct: 310 -------------SSLVDFQVAMALNPSILGEDWPTKLEKIACR 285

BLAST of Cp4.1LG14g04460 vs. TAIR10
Match: AT4G31440.1 (AT4G31440.1 unknown protein)

HSP 1 Score: 149.1 bits (375), Expect = 6.5e-36
Identity = 120/380 (31.58%), Positives = 186/380 (48.95%), Query Frame = 1

Query: 85  RKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREIIPL 144
           R  + RID +ELK  I +K+G +RS +YF  L + LS K+ K EFDK C +++GRE + L
Sbjct: 3   RLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSL 62

Query: 145 HNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCL---QSLHGDAFLSSP- 204
           HN+ I++IL+NA +AK+PP +  +     +L +   +G + S       +  D  LS+  
Sbjct: 63  HNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGV 122

Query: 205 -RKGRSPVSRDRKIRDRPSPLGPCGKPQN-IALEELAFKAQEQQSA----TELHSLGSRP 264
             K R     DR IRD+P PLG  GK     A         E+ SA     E  ++  + 
Sbjct: 123 LAKVRPGTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKD 182

Query: 265 PVDMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTC 324
            V      D E   ++  +P      PV APLGI       G     + V   +    +C
Sbjct: 183 QVAAPISRDDEAQVRILSTP------PVMAPLGIPFCSASVGGDRRTVPVS-TSAAAISC 242

Query: 325 QNGGELPDTRLLRTHLKQKLEMEQI-DISVDGVNLLNNALDVYLKRLIEPCLSFSRSR-- 384
            + G L DT +LR  ++     + +  +S +   +LNN LD+YLK+L++ C+  + +R  
Sbjct: 243 YDSGGLSDTEMLRKRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSM 302

Query: 385 --------CERPRFTD---NQPITGSRIALKEQYRH---RAQRLNASLLDFRVAMQLNPE 438
                    E+ +  D   N   T +   ++   +      ++ + SLLDFRVAM+LNP 
Sbjct: 303 NGTPGKHSLEKQQSRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNPH 362

BLAST of Cp4.1LG14g04460 vs. NCBI nr
Match: gi|700197619|gb|KGN52777.1| (hypothetical protein Csa_4G000960 [Cucumis sativus])

HSP 1 Score: 648.7 bits (1672), Expect = 7.4e-183
Identity = 334/373 (89.54%), Positives = 351/373 (94.10%), Query Frame = 1

Query: 66  EGFVSNSAIAEIAALKMVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKIN 125
           EGFVSNSAIA IAALKM+PRKD SRIDTSELKAMIYRKLGHQRS+KYFDQLKKLLSLK N
Sbjct: 91  EGFVSNSAIAGIAALKMLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTN 150

Query: 126 KREFDKFCIQIIGREIIPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQR 185
           KREFDKFCIQIIGREIIPLHNR I+AILQNACVAKTPPVLSSTRKV  NLSVKVVNGYQR
Sbjct: 151 KREFDKFCIQIIGREIIPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQR 210

Query: 186 SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSAT 245
           SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSAT
Sbjct: 211 SCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSAT 270

Query: 246 ELHSLGSRPPVDMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVG 305
           ELHSLGSRPPV+MASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSN+ VG
Sbjct: 271 ELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVG 330

Query: 306 RRNCHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCL 365
             N HVTTCQ+ GELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCL
Sbjct: 331 -SNYHVTTCQDVGELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCL 390

Query: 366 SFSRSRCERPRFTDNQPITGSRIALKEQYRHRAQRL-NASLLDFRVAMQLNPEVLGRDWT 425
           +FSRSRCER +FT NQPITGSRI  +EQ+RHRAQ+L N SLLDFRVAMQLNP+VLGR+WT
Sbjct: 391 NFSRSRCERLKFTGNQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWT 450

Query: 426 SQLEKISLRVSEE 438
            QLEKISLR SEE
Sbjct: 451 MQLEKISLRASEE 462

BLAST of Cp4.1LG14g04460 vs. NCBI nr
Match: gi|659108776|ref|XP_008454383.1| (PREDICTED: uncharacterized protein LOC103494799 [Cucumis melo])

HSP 1 Score: 627.1 bits (1616), Expect = 2.3e-176
Identity = 319/357 (89.36%), Positives = 335/357 (93.84%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M PRKD SRIDTSELKAMIYRKLGHQRS+KYFDQLKKLLSLK NKREFDKFCIQIIGREI
Sbjct: 1   MFPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHNR I+AILQNACVAKTPPVLSSTRKV  NLSVKVVNGYQRSCLQSLHGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRSPVSRDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSATELHSLGSRPPV+MASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 180

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           EDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GS KTLSN+ VG RN HVTTCQ+GGELP
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSSKTLSNVPVGGRNYHVTTCQDGGELP 240

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQ 381
           DTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCL+FSRSRCER +FT NQ
Sbjct: 241 DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ 300

Query: 382 PITGSRIALKEQYRHRAQRL-NASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
           PITGSRI  +EQ RHRAQ++ N SLLDFRVAMQLNP+VLGR+WT QLEKISLR SEE
Sbjct: 301 PITGSRITFQEQNRHRAQQINNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 357

BLAST of Cp4.1LG14g04460 vs. NCBI nr
Match: gi|449469122|ref|XP_004152270.1| (PREDICTED: uncharacterized protein LOC101211126 [Cucumis sativus])

HSP 1 Score: 622.9 bits (1605), Expect = 4.4e-175
Identity = 319/357 (89.36%), Positives = 336/357 (94.12%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M+PRKD SRIDTSELKAMIYRKLGHQRS+KYFDQLKKLLSLK NKREFDKFCIQIIGREI
Sbjct: 1   MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHNR I+AILQNACVAKTPPVLSSTRKV  NLSVKVVNGYQRSCLQSLHGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRSPVSRDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSATELHSLGSRPPV+MASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 180

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           EDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSN+ VG  N HVTTCQ+ GELP
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVG-SNYHVTTCQDVGELP 240

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTDNQ 381
           DTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCL+FSRSRCER +FT NQ
Sbjct: 241 DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ 300

Query: 382 PITGSRIALKEQYRHRAQRL-NASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
           PITGSRI  +EQ+RHRAQ+L N SLLDFRVAMQLNP+VLGR+WT QLEKISLR SEE
Sbjct: 301 PITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 356

BLAST of Cp4.1LG14g04460 vs. NCBI nr
Match: gi|1009171750|ref|XP_015866909.1| (PREDICTED: uncharacterized protein LOC107404471 [Ziziphus jujuba])

HSP 1 Score: 436.4 bits (1121), Expect = 5.8e-119
Identity = 231/359 (64.35%), Positives = 279/359 (77.72%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M+P +  SRIDT ELKA+I +K+GHQR+EKYFDQL++L S KI+K EF+KFC + +GRE 
Sbjct: 2   MLPNQSYSRIDTLELKALIIQKIGHQRAEKYFDQLQRLFSFKISKCEFNKFCCRTLGREN 61

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHN+ I++I++NACVAK PPV +S +K+ +  +VKV NGYQR+CLQSL+GD F  SPR
Sbjct: 62  IPLHNQLIRSIVKNACVAKVPPVKAS-KKLGNTPNVKVTNGYQRNCLQSLYGDVFPPSPR 121

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVDMASV 261
           KGRSPV+RDRK RDRPSPLGP GKPQ++  EEL  KAQEQQSATEL SLGSRPPV++ASV
Sbjct: 122 KGRSPVNRDRKFRDRPSPLGPLGKPQSVTCEELVSKAQEQQSATELLSLGSRPPVEVASV 181

Query: 262 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQNGGELP 321
           EDGEEVEQ+AGSPGVQSRSPVTAPLG+SMN  G+ K LSN+S+   N H  TCQN GELP
Sbjct: 182 EDGEEVEQIAGSPGVQSRSPVTAPLGVSMNLGGARKALSNVSIS-GNYHPETCQNCGELP 241

Query: 322 DTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRCERPRFTD-- 381
           DTR LR+ L++KLE+E  +ISVD VNLLNN LDVYLKRL+EPC+  + SRC      +  
Sbjct: 242 DTRSLRSRLQRKLEIEGFNISVDCVNLLNNGLDVYLKRLLEPCMRLAASRCGNEHLIELN 301

Query: 382 NQPITGSRIALKEQYRHRAQRLN-ASLLDFRVAMQLNPEVLGRDWTSQLEKISLRVSEE 438
            Q   G    L  +Y  RA++   ASLLDF  AM+LNP +LG DW  QLEKI LR SEE
Sbjct: 302 AQCNPGLNGMLPGRYMERAKKSTYASLLDFHAAMELNPCILGEDWAIQLEKIMLRSSEE 358

BLAST of Cp4.1LG14g04460 vs. NCBI nr
Match: gi|1009171746|ref|XP_015866907.1| (PREDICTED: uncharacterized protein LOC107404470 [Ziziphus jujuba])

HSP 1 Score: 423.7 bits (1088), Expect = 3.9e-115
Identity = 232/364 (63.74%), Positives = 276/364 (75.82%), Query Frame = 1

Query: 82  MVPRKDNSRIDTSELKAMIYRKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREI 141
           M+P +  SRIDT ELKA+I +K+GHQR+EKYFDQL++L SLKI+K EF+KFC + +GRE 
Sbjct: 1   MLPNQSYSRIDTLELKALIIQKIGHQRAEKYFDQLQRLFSLKISKCEFNKFCFRTLGREN 60

Query: 142 IPLHNRFIKAILQNACVAKTPPVLSSTRKVDSNLSVKVVNGYQRSCLQSLHGDAFLSSPR 201
           IPLHN+ I++I++NACVAK PPV +S +K  + L+VKV NGYQR+ LQSL+GD F  SPR
Sbjct: 61  IPLHNQLIRSIVKNACVAKVPPVKAS-KKFGNTLNVKVTNGYQRNRLQSLYGDVFPPSPR 120

Query: 202 KGRSPVSRDRKIRDRPSPLGPCGKPQNIAL-----EELAFKAQEQQSATELHSLGSRPPV 261
           KGRSPV+RDRK RDRPSPLGP GKPQ++        EL   AQEQQSATEL SLGSRPPV
Sbjct: 121 KGRSPVNRDRKFRDRPSPLGPLGKPQSLTCGDFPSGELVSMAQEQQSATELLSLGSRPPV 180

Query: 262 DMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNISVGRRNCHVTTCQN 321
           ++ASVEDGEEVEQVAGSPGVQSRSPVTAPLG+SMN  G+ K LSN+S+   N H  TCQN
Sbjct: 181 EVASVEDGEEVEQVAGSPGVQSRSPVTAPLGVSMNLGGARKALSNVSIS-GNYHPETCQN 240

Query: 322 GGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLSFSRSRC--ER 381
            GELPDTR LR+ L+QKLEME  +IS+D VNLLNN LD YLKRL+EPC+  + SRC  E 
Sbjct: 241 CGELPDTRSLRSRLQQKLEMEGFNISIDCVNLLNNGLDAYLKRLLEPCMRLAVSRCGSEH 300

Query: 382 PRFTDNQPITGSRIALKEQYRHRAQRLN-ASLLDFRVAMQLNPEVLGRDWTSQLEKISLR 438
               + Q   G    L  +Y  RA+    ASLLDF  AM+LNP +LG DW  QLEKI LR
Sbjct: 301 LNQLNAQFNPGLNGMLPGRYMERAKSSTYASLLDFHAAMELNPCILGEDWAIQLEKIMLR 360

The following BLAST results are available for this feature:
BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KWF9_CUCSA  5.2e-183  89.54         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000960 PE=4 SV=1
A0A067LJK3_JATCU  1.0e-114  61.28         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16307 PE=4 SV=1
B9HMX5_POPTR      3.3e-113  63.32         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s09370g PE=4 SV=2
A5C224_VITVI      5.7e-113  63.23         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_008500 PE=4 SV=1
A0A061E832_THECC  9.7e-113  63.79         Uncharacterized protein OS=Theobroma cacao GN=TCM_007200 PE=4 SV=1

BLAST vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT4G33890.1  6.2e-87  50.14         unknown protein
AT2G14850.1  2.6e-69  44.86         unknown protein
AT2G24530.1  7.0e-38  32.43         unknown protein
AT5G67410.1  5.9e-37  33.14         unknown protein
AT4G31440.1  6.5e-36  31.58         unknown protein

BLAST vs. NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|700197619|gb|KGN52777.1|        7.4e-183  89.54         hypothetical protein Csa_4G000960 [Cucumis sativus]
gi|659108776|ref|XP_008454383.1|   2.3e-176  89.36         PREDICTED: uncharacterized protein LOC103494799 [Cucumis melo]
gi|449469122|ref|XP_004152270.1|   4.4e-175  89.36         PREDICTED: uncharacterized protein LOC101211126 [Cucumis sativus]
gi|1009171750|ref|XP_015866909.1|  5.8e-119  64.35         PREDICTED: uncharacterized protein LOC107404471 [Ziziphus jujuba]
gi|1009171746|ref|XP_015866907.1|  3.9e-115  63.74         PREDICTED: uncharacterized protein LOC107404470 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0070461  SAGA-type complex
Vocabulary: INTERPRO
Term        Definition
IPR024738   Hfi1/Tada1
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0070461      SAGA-type complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g04460.1  Cp4.1LG14g04460.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                             Source   Source Term     Source Description   Alignment
IPR024738  Transcriptional coactivator Hfi1/Transcriptional adapter 1  PFAM     PF12767         SAGA-Tad1            coord: 87..366, score: 2.3
None       No IPR available                                            PANTHER  PTHR21277       FAMILY NOT NAMED     coord: 245..437, score: 1.4E-157; coord: 69..211, score: 1.4E
None       No IPR available                                            PANTHER  PTHR21277:SF12  SUBFAMILY NOT NAMED  coord: 69..211, score: 1.4E-157; coord: 245..437, score: 1.4E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG14g04460  Cp4.1LG01g02660  Cucurbita pepo (Zucchini)  cpecpeB234