Cp4.1LG02g17070 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g17070
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: LuxR family transcriptional regulator, putative
Location: Cp4.1LG02 : 12336242 .. 12339280 (-)
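
The location gives a start and end coordinate on the minus strand of Cp4.1LG02. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention; this page does not state it explicitly), the genomic span of the feature can be computed as follows. The function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

# Coordinates copied from the Location field above.
span = feature_length(12336242, 12339280)
print(span)  # 3039
```

Under that assumption the span is 3039 bp, and the strand sign does not affect the length.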
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGAAGAACAGTGGCCATTCCCAAAGCACAAAGCATTTGAGTAGTGAAAAGTTTGAAGGTCTGGTAAATTTGTTAAGCTCAAAATTCCACTTCCATGGAGCAACCTCCATTCATTTCTCACCGGCGAGACGAACCGGAGTTCAACCTCCGTGAATGGGTGGCAAAGGCTAAAATTGGCCGCGATCCCGCCATTTCCAGGCGATTCTCCGGGTCCTACATCAGAAGCTTTCGAGAAGACGCGAGGTCGTTTCGATCAAATGTCACCACCGTCACTAGCACCGCCTCCTCTCCTGGATACCCTTTCGGAGGTTTTTGTTTTCTTTTTCTCTCTGTGGAGTTTTGATTCTTGCTATGGATTCCTCTGTTCTTTAACTTATCTGGGGTCTGTTTGCAGACGAAATTGACCCTGCTACTTATTCGTTCACTAATGCTATCAAGGGTGAGTTTATTTGGATTCGGATTTTACTTTGCCCACAAGGTGTTCTTGAAAATGCGTGTTGTGGAAATTAGAGTTTTGAGGTTGTTTTTTGTGATTGGGTGCTGCTTCGTGTGCAGCACTGCAAGCCAGGTCGCTTAACAGTTGGGAATGCTTTTCTCTTGATGGGTTTACTTTGAATTCGAAGTGGAATGAAGCCGAGAAGTATATATGTAATCCACTTTCTGGGGAAGTACCCATGGAGTGTTTGTCTGCAAAATCGCTTAGTGGGAGGTCATTCAAGAACTTAGCGAACAGAATTGCCATATCTGCTCCTTTAGTTTATTCCAGTCATTCACAAATTGAAACAAAGCCATATTCTATTTCACAATTAGTTCAGAAACTCCCAATTCCAGGTCGAATCATCTGTGTTCTTTAGAATTTCGAGTTCGATCGAATTTGGGATTGGAAAATTTTGAATTCCTGTTGTGATTTTCAGAGAACAAAGTGGAAGCCAATGGCATGACCAGAGATATGGGAACTCAAAGCACACCAACAGTCGTTGGTTCAAACAGTCCTAGTCCTGCTTCCACGCCTCCTATCGTGGAGAGAGCATTAAAGAGATGCGAATTAGAAGAAGACTCGCCCAATTCCAATTCTAAAAGTACTCCCGAGACAGAGGTATGCCACCTCCAATCTTTCATAGGCTATAGCTTTGTTATATTCGTCTGATGCTTTTAGTTTGATTTCAACAATGTCTAAATGGAAAATTTGAAAGGAGATCTTAAATTAACCATTGAGAGGATAGGTTATGATTCATATAATGTTTATAAATATAGTTTTAACACACTGATAGCTCATAAGTTAACACGGCAGCCAACAAATGAGTTGTTATTGATGTTAATTTTTCAGAATGAGACTTAGTTTTAGTGAACGTTCGAAACAATTTGGCTTGGGATCCCAACATTATGGCCTTTGTCCATAGCTTGAGAGAAGCTTAATGCAAAATTGAACTGTCAGCCTAACCCTCAAAGAATATAATCTACAAGAAGAAACAAGTGGATCATAAGTGAAGTTCAGCTTTCTCAGACATAGAAGCTTTCTCTCCAATCTATTTCCCTCTAAGGATGAGCCAACCAAATCAGGATTGTCCCCATACATGTCATGAAACAGAGTGCTATAACCATCCAAGGTGGGGGACTGTTTTTGTTAGCCACAAGTTGTTATGTTTTGGTCCCTTTTTGATACTATTTTTTTCCTCATATTGGCCTACTTTCCAGCTCGTACGGTTGTGATTGTCTCTATACCATGGAAACATATTTTTCTAGAAAATAAAGTGATAATATCAGATGTCAGAAAACCCAGCCTTAAGTTTCAGAAGCAAACAGAAAGTTCAATTCTTGGTGTTAGAGAGATTTGTCTCAACACTTGAATTACTGCAGAAATTTCAAATTTCCCTCCTTGAAATACGTATCTGTTCGTTTTCGTTCTTCGGTGTCTATTTCATTTTCTTTCCGTCTACCTTAGGGGGGGACCCATCAAAATTCTCCTGTTCATACATATTCCTTTGTTTTTACATCAAGATA
TCGAGCTGTATTAATATTTCTCATTGAAAATGCTGTTGTAGATCTTTTTTTCATGTATATTCCCTTATGTTGAGCATTGTCGGGAGGGAGTCCCACATTAAGGGGATGATCATGAGTTTATAAGTAAGAAATACATCTCCATTTGTATGAAACATTTTCAGGAATCCAAAAGCAAAACCATGAGAGCTAATGCTCAAAGTGGACAATATCATACCATTGTGGAGGTCCGTGGTTCGTAACACCTTAGTGGTCCATTAAGCAACAATAAGGGTATTACCGGAATGAGCATCCAGGCCTTAAAAAGCCTTTTTAGATGACACTGATATAAAAAGCAACGTTATCTGAATGCAGGTGATAAGAAGAGAATTGAATGAGGAGAGAGCAAAAGAAGAAGAAAAGGAAGTAGTACATAGAGAAACAATAGCAGAAGAGAAGTTCAAGCAAGGAGGATGCTTGTCATGGATGATGAAGAAGAAGAAACAGAAAGAAGAAGAGAGATCAAGAAGAAAGAGGTTAATTTGTCATCTAAAACTCAAAGGTTGTTGAAAAACAGTGTGAAAAAGCTAAGAACAAATTAAAAAACATGGAAAAGCAGAGATTTGGATTATGTGTTATGCAGCAAAATGGAAGAACAGGGGCAGGTGAGAATGTGGGTTGGTATTGGAGATGATGAGAGGGGTGGGTATCATAAAGGATTGTGTGTTGTGTTGGTGTTAGTGGTCCTATTTGCCAGTGCCCCTCGGCCATTTCATAATCTGTTCTTTGTGGCTTCACGTGCATGCTTTTGTCTTTGTGTTGCTGAATAGTGTAAGGTAAGGGCTCCAAAGGGCCTCCCCTTTTTGTTTTGAACTTTCAAATCTCTCCTCTCTCATAGTTATTATTTTGACCTTTGTTGTAGAATTTTTATACTGTGTTTAACAAAAATCAATGTGGATACTGCTACTTAAACTTCTGTCCCTACTTTCATGGATACCGGGTCGTTTAGAATCTAAGTTTGAATGATGCTATAATAAACTTTTATGTTACAAAGTTGAGGACT

mRNA sequence

AGAAGAACAGTGGCCATTCCCAAAGCACAAAGCATTTGAGTAGTGAAAAGTTTGAAGGTCTGGTAAATTTGTTAAGCTCAAAATTCCACTTCCATGGAGCAACCTCCATTCATTTCTCACCGGCGAGACGAACCGGAGTTCAACCTCCGTGAATGGGTGGCAAAGGCTAAAATTGGCCGCGATCCCGCCATTTCCAGGCGATTCTCCGGGTCCTACATCAGAAGCTTTCGAGAAGACGCGAGGTCGTTTCGATCAAATGTCACCACCGTCACTAGCACCGCCTCCTCTCCTGGATACCCTTTCGGAGACGAAATTGACCCTGCTACTTATTCGTTCACTAATGCTATCAAGGCACTGCAAGCCAGGTCGCTTAACAGTTGGGAATGCTTTTCTCTTGATGGGTTTACTTTGAATTCGAAGTGGAATGAAGCCGAGAAGTATATATGTAATCCACTTTCTGGGGAAGTACCCATGGAGTGTTTGTCTGCAAAATCGCTTAGTGGGAGGTCATTCAAGAACTTAGCGAACAGAATTGCCATATCTGCTCCTTTAGTTTATTCCAGTCATTCACAAATTGAAACAAAGCCATATTCTATTTCACAATTAGTTCAGAAACTCCCAATTCCAGAGAACAAAGTGGAAGCCAATGGCATGACCAGAGATATGGGAACTCAAAGCACACCAACAGTCGTTGGTTCAAACAGTCCTAGTCCTGCTTCCACGCCTCCTATCGTGGAGAGAGCATTAAAGAGATGCGAATTAGAAGAAGACTCGCCCAATTCCAATTCTAAAAGTACTCCCGAGACAGAGGTGATAAGAAGAGAATTGAATGAGGAGAGAGCAAAAGAAGAAGAAAAGGAAGTAGTACATAGAGAAACAATAGCAGAAGAGAAGTTCAAGCAAGGAGGATGCTTGTCATGGATGATGAAGAAGAAGAAACAGAAAGAAGAAGAGAGATCAAGAAGAAAGAGGTTAATTTGTCATCTAAAACTCAAAGGTTGTTGAAAAACAGTGTGAAAAAGCTAAGAACAAATTAAAAAACATGGAAAAGCAGAGATTTGGATTATGTGTTATGCAGCAAAATGGAAGAACAGGGGCAGGTGAGAATGTGGGTTGGTATTGGAGATGATGAGAGGGGTGGGTATCATAAAGGATTGTGTGTTGTGTTGGTGTTAGTGGTCCTATTTGCCAGTGCCCCTCGGCCATTTCATAATCTGTTCTTTGTGGCTTCACGTGCATGCTTTTGTCTTTGTGTTGCTGAATAGTGTAAGGTAAGGGCTCCAAAGGGCCTCCCCTTTTTGTTTTGAACTTTCAAATCTCTCCTCTCTCATAGTTATTATTTTGACCTTTGTTGTAGAATTTTTATACTGTGTTTAACAAAAATCAATGTGGATACTGCTACTTAAACTTCTGTCCCTACTTTCATGGATACCGGGTCGTTTAGAATCTAAGTTTGAATGATGCTATAATAAACTTTTATGTTACAAAGTTGAGGACT

Coding sequence (CDS)

ATGGAGCAACCTCCATTCATTTCTCACCGGCGAGACGAACCGGAGTTCAACCTCCGTGAATGGGTGGCAAAGGCTAAAATTGGCCGCGATCCCGCCATTTCCAGGCGATTCTCCGGGTCCTACATCAGAAGCTTTCGAGAAGACGCGAGGTCGTTTCGATCAAATGTCACCACCGTCACTAGCACCGCCTCCTCTCCTGGATACCCTTTCGGAGACGAAATTGACCCTGCTACTTATTCGTTCACTAATGCTATCAAGGCACTGCAAGCCAGGTCGCTTAACAGTTGGGAATGCTTTTCTCTTGATGGGTTTACTTTGAATTCGAAGTGGAATGAAGCCGAGAAGTATATATGTAATCCACTTTCTGGGGAAGTACCCATGGAGTGTTTGTCTGCAAAATCGCTTAGTGGGAGGTCATTCAAGAACTTAGCGAACAGAATTGCCATATCTGCTCCTTTAGTTTATTCCAGTCATTCACAAATTGAAACAAAGCCATATTCTATTTCACAATTAGTTCAGAAACTCCCAATTCCAGAGAACAAAGTGGAAGCCAATGGCATGACCAGAGATATGGGAACTCAAAGCACACCAACAGTCGTTGGTTCAAACAGTCCTAGTCCTGCTTCCACGCCTCCTATCGTGGAGAGAGCATTAAAGAGATGCGAATTAGAAGAAGACTCGCCCAATTCCAATTCTAAAAGTACTCCCGAGACAGAGGTGATAAGAAGAGAATTGAATGAGGAGAGAGCAAAAGAAGAAGAAAAGGAAGTAGTACATAGAGAAACAATAGCAGAAGAGAAGTTCAAGCAAGGAGGATGCTTGTCATGGATGATGAAGAAGAAGAAACAGAAAGAAGAAGAGAGATCAAGAAGAAAGAGGTTAATTTGTCATCTAAAACTCAAAGGTTGTTGA

Protein sequence

MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVTSTASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNPLSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKPYSISQLVQKLPIPENKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETEVIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLICHLKLKGC
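
The protein sequence above can be related back to the CDS by codon-wise translation. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the helper name `translate` and the two short excerpts (the first 36 and last 12 nucleotides of the CDS listed above) are chosen only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGCAACCTCCATTCATTTCTCACCGGCGAGAC"  # first 36 nt of the CDS
cds_end = "AAAGGTTGTTGA"                            # last 12 nt of the CDS
print(translate(cds_start))  # MEQPPFISHRRD
print(translate(cds_end))    # KGC
```

Both excerpts agree with the listed protein, which begins MEQPPFISHRRD and ends KGC.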
BLAST of Cp4.1LG02g17070 vs. TrEMBL
Match: A0A0A0L6V9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G259700 PE=4 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 3.2e-139
Identity = 259/304 (85.20%), Positives = 280/304 (92.11%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           MEQPPFIS RRDEPEF+LREW AKAKI RDPA SRRFSGSYIRSFREDARSFRSN+TT+T
Sbjct: 1   MEQPPFISQRRDEPEFSLREWAAKAKITRDPATSRRFSGSYIRSFREDARSFRSNITTIT 60

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120
           STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP
Sbjct: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120

Query: 121 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHS-QIETKPYSISQLVQKLPIPE 180
           LSGEVPMECLSAKSLSGRSF+N  NRIAISAPLVYS+HS Q +TKP SI+Q+VQKLPIPE
Sbjct: 121 LSGEVPMECLSAKSLSGRSFRNFTNRIAISAPLVYSNHSQQTQTKPCSIAQVVQKLPIPE 180

Query: 181 NKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETE 240
            +++AN +TRD+GTQSTPT VGS SPSPASTPPIV+RALKRCELEEDSPNSNSK TP TE
Sbjct: 181 KQLDANALTRDVGTQSTPTNVGSKSPSPASTPPIVDRALKRCELEEDSPNSNSKITPVTE 240

Query: 241 VIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLICHLK 300
           VI+RE+ EERAKEE+   VH+E IAEEK+KQGGCLSWM  KKKQKEE+RSRRKR + HLK
Sbjct: 241 VIKREMKEERAKEEK---VHKEIIAEEKYKQGGCLSWM--KKKQKEEQRSRRKRFLSHLK 299

Query: 301 LKGC 304
           LKGC
Sbjct: 301 LKGC 299
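
The identity and positives percentages in each HSP header are simply the match counts divided by the alignment length. A minimal sketch, using the counts reported for the hit above; the function name `percent` is hypothetical and the two-decimal formatting is chosen to match the report:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

print(percent(259, 304))  # identity  -> 85.20
print(percent(280, 304))  # positives -> 92.11
```

The denominator is the alignment length including gap columns, so it can differ from the length of either sequence.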

BLAST of Cp4.1LG02g17070 vs. TrEMBL
Match: K7MV79_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_18G281000 PE=4 SV=1)

HSP 1 Score: 314.7 bits (805), Expect = 1.2e-82
Identity = 179/300 (59.67%), Positives = 212/300 (70.67%), Query Frame = 1

Query: 2   EQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVTS 61
           E  P+ S RRDE EFNLREW  KA+I R+   SRR+SGSY+RSFRED RSFRSN+T ++S
Sbjct: 6   EPLPYSSRRRDESEFNLREWAVKARISREGTNSRRYSGSYMRSFREDTRSFRSNIT-ISS 65

Query: 62  TASSPGYPFGDEIDPATYSFTNAIKALQARS-LNSWECFSLDGFTLNSKWNEAEKYICNP 121
           TASSPGY   DEIDP+TYSFT A+KALQARS   SWEC S DGF LNSKWNEAE+YICNP
Sbjct: 66  TASSPGYLLKDEIDPSTYSFTTALKALQARSSYYSWECLSPDGFALNSKWNEAERYICNP 125

Query: 122 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKPYSISQLVQKLPIPEN 181
           LSGEVP+ECLSAK+LSGRSF+N  NRIA+SAPLVYSS   I TKP + +Q    L  P  
Sbjct: 126 LSGEVPLECLSAKTLSGRSFRNSINRIAMSAPLVYSS-KHIPTKPATFTQEEVALQFPNP 185

Query: 182 KVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETEV 241
           + +  GMTRD+GTQSTP  + S SPSPASTP I ER+     L  DSPNSN+K+  E EV
Sbjct: 186 EKKKEGMTRDVGTQSTPPYISSTSPSPASTPSITERSK---PLVSDSPNSNAKTKSEEEV 245

Query: 242 IRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRL-ICHLK 300
             ++      KE E+E        E+  K  GC SWM KKK ++E+ER RR  + + H K
Sbjct: 246 EEKDKETWETKETEREKKVWRKQEEQLCKLSGCFSWMRKKKAEREKERQRRNNIFLTHFK 300

BLAST of Cp4.1LG02g17070 vs. TrEMBL
Match: A0A061EHB0_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_019503 PE=4 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 2.1e-82
Identity = 183/307 (59.61%), Positives = 223/307 (72.64%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           +EQ  + S R DEPEFNLREW  KA+I R+   SRR+S SYIRSFREDARSFRSN+T ++
Sbjct: 5   VEQSSYSSRRHDEPEFNLREWGLKARISRENTTSRRYSASYIRSFREDARSFRSNIT-IS 64

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNS-WECFSLDGFTLNSKWNEAEKYICN 120
           STASSPGY   DEIDP+TYSFT A+KALQAR++ S WEC S DGF LNSKWNEAEKYICN
Sbjct: 65  STASSPGYSLKDEIDPSTYSFTTALKALQARTVCSGWECLSPDGFALNSKWNEAEKYICN 124

Query: 121 PLSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKP-YSISQLVQKLPIP 180
           PLSGEVPMECLSAK+LSGRSF+NL NRI +SAPLVYS    I+T P  ++ + V + P P
Sbjct: 125 PLSGEVPMECLSAKTLSGRSFRNLTNRITMSAPLVYSHSCHIQTNPSRTVPEDVAQFPTP 184

Query: 181 ENKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEE-DSPNSNSKSTPE 240
           E K E+  MTRD+GTQSTP  + S S SPASTP I+ERALKRC  E  DSPN+N+K   E
Sbjct: 185 EKKAES--MTRDVGTQSTPPDLSSGSLSPASTPSILERALKRCGTENGDSPNTNTKPRAE 244

Query: 241 TEVIRRELNE-ERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLIC 300
            +V  +E  E E    ++ E   ++ +     +Q GCLSWM  +++Q+E+ +S RKR I 
Sbjct: 245 EQVEVKETGEREETIIDKAERRRKDELMCRCSRQPGCLSWM--RRRQREKHKS-RKRSIF 304

Query: 301 HLKLKGC 304
               KGC
Sbjct: 305 FPHFKGC 305

BLAST of Cp4.1LG02g17070 vs. TrEMBL
Match: A0A0S3RRS5_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G023900 PE=4 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 2.3e-81
Identity = 178/295 (60.34%), Positives = 216/295 (73.22%), Query Frame = 1

Query: 2   EQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVTS 61
           E  P+ S RRDE EFNLREW  KA+I R+   SRR+SGSY+RSFRED RSFRSN+  V+S
Sbjct: 5   EPLPYSSRRRDESEFNLREWNVKARISRENTNSRRYSGSYMRSFREDTRSFRSNIA-VSS 64

Query: 62  TASSPGYPFGDEIDPATYSFTNAIKALQARSL-NSWECFSLDGFTLNSKWNEAEKYICNP 121
           TASSPGYP  DEIDP+TYSFT A+KALQAR+  NSWEC S DGF LNSKWNEAE+YICNP
Sbjct: 65  TASSPGYPLKDEIDPSTYSFTTALKALQARAAYNSWECSSPDGFALNSKWNEAERYICNP 124

Query: 122 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKPYSISQLVQKLPIPEN 181
           LSGEVP+ECLSAK+LSGRSF+N  NRIA+SAPLVYSS   I TKP + +Q  + L  P +
Sbjct: 125 LSGEVPLECLSAKTLSGRSFRNSVNRIAMSAPLVYSS-KHIPTKPSAYTQEEEALQFPNS 184

Query: 182 KVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETEV 241
           + +  GMTRD+GTQSTP  + S+SPSPASTP I+ER+  +    EDSPNSN+K+  E EV
Sbjct: 185 EKKKEGMTRDVGTQSTPPYLSSSSPSPASTPSIMERSKPQ---PEDSPNSNAKTKSEEEV 244

Query: 242 IRRELNEERAKEEEKEVVHRETIAEEKFK-QGGCLSWMMKKKK-QKEEERSRRKR 294
             ++      KE E+E        E+  K Q GC  WM KK+K ++E ER R++R
Sbjct: 245 EVKDEEIWETKETEREKKEWRKREEQLCKQQSGCFWWMRKKEKAERERERERQRR 294

BLAST of Cp4.1LG02g17070 vs. TrEMBL
Match: A0A067K9Y6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12963 PE=4 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 4.0e-81
Identity = 184/329 (55.93%), Positives = 230/329 (69.91%), Query Frame = 1

Query: 3   QPPFISHRRDE-----PEFNLREWVAKAKIGRDPAISRRFSGSYIR-SFREDARSFRSNV 62
           Q P+ S RRD+     P+FNLREW  +A+I R+   SRRFSGS+IR SFREDARSFRSN+
Sbjct: 9   QSPYGSGRRDQHQHPQPDFNLREWALRAQISRENTKSRRFSGSHIRTSFREDARSFRSNI 68

Query: 63  TTVTSTASSPGYPFGDEIDPATYSFTNAIKALQARS-LNSWECFSLDGFTLNSKWNEAEK 122
           T ++ST SSPGYPF +EIDP+TYSFT A+KALQAR+  NSWEC S DGF LNSKWNEAEK
Sbjct: 69  T-ISSTPSSPGYPFNEEIDPSTYSFTTALKALQARAGYNSWECLSPDGFALNSKWNEAEK 128

Query: 123 YICNPLSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSH-SQIETKP-YSISQLVQ 182
           YICNPLSGEVP ECLSAK+LSGRSF+N  NRI +SAPL+YS+H  +++TKP ++++    
Sbjct: 129 YICNPLSGEVPRECLSAKTLSGRSFRNPTNRITMSAPLMYSTHLKKVQTKPSHNVTPDHD 188

Query: 183 KLPIPENKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVER-ALKRCELE-EDSPNSN 242
               P  + +  G TRD+GTQSTP  + S+SPSPASTPPI+ER  LKRCE E  DSPN N
Sbjct: 189 SFHFPIQEKKMEGSTRDVGTQSTPFDLSSSSPSPASTPPIMERLTLKRCEEEGGDSPNCN 248

Query: 243 SKSTPETEVI-----------------RRELNEERAKEEEKEVVHRETIAEEKFKQGGCL 302
            K   E +VI                 + E  +E +K++E E + R + +     QGGCL
Sbjct: 249 GKLGAEGKVIEEEETIRSSPSRKEEATKGEKEKEESKKKENEQMWRCSNSSSNSMQGGCL 308

Query: 303 SWMMKKKKQKEEERSRRKRLICHLKLKGC 304
           SWM  +K+Q+E+ + R KR IC L  KGC
Sbjct: 309 SWM--RKRQREKHKPRNKRNICLLNPKGC 334

BLAST of Cp4.1LG02g17070 vs. TAIR10
Match: AT5G16030.3 (AT5G16030.3 unknown protein)

HSP 1 Score: 205.7 bits (522), Expect = 4.1e-53
Identity = 146/316 (46.20%), Positives = 204/316 (64.56%), Query Frame = 1

Query: 10  RRDEPEF-NLREWVAKAKIGRDPAISRRFSGSYIRSFREDAR--SFRS-NVTTVTSTASS 69
           R DE EF NLREW  +A++ R+   SRRFS SYI SFRED    SFR+ N   ++STASS
Sbjct: 4   RGDEHEFMNLREWDRRARLIRENPSSRRFSASYIGSFREDHHKSSFRTTNFNNISSTASS 63

Query: 70  PGYPFGDEIDPATYSFTNAIKALQARSL-NSWECFSLDGFTLNSKWNEAEKYICNPLSGE 129
           PGY   +EIDP+TYSFTNA+KALQA+++ N+ E  + +GF LNSKWNEAEKYICNPLSGE
Sbjct: 64  PGYTLKEEIDPSTYSFTNALKALQAKTMYNNREWLAQEGFALNSKWNEAEKYICNPLSGE 123

Query: 130 VPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQI-----ETKPY---SISQLVQKLP 189
           VPMECLSAK+LS RSF+NL+    +SAPL + S + +     + KP    ++  + + L 
Sbjct: 124 VPMECLSAKTLSARSFRNLS---TMSAPLHFPSPNPLMNNIAQNKPNNNPNVRVIHEDLY 183

Query: 190 IPENKV--------------EANGMTRDMGTQSTPTV-VGSNSPSPASTPPIVERALKRC 249
            P+ ++              +  GM RD+G QST +V + S SPSPA TPPI+ER+LKR 
Sbjct: 184 APDPELLALVNYGGVFLAEKKVVGMKRDVGIQSTTSVDLSSGSPSPAKTPPIMERSLKRH 243

Query: 250 ELEEDSP-NSNSKSTPETEVIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKK 294
              +D P + N K   + + ++    EE+ KEEEK+ +  E   EE+ ++   +S    K
Sbjct: 244 VEADDWPVDINLKVKGQQQDVKL---EEKEKEEEKQDMSNEEDEEEEEEEKQDMSEEDDK 303

BLAST of Cp4.1LG02g17070 vs. TAIR10
Match: AT3G02500.1 (AT3G02500.1 unknown protein)

HSP 1 Score: 178.7 bits (452), Expect = 5.3e-45
Identity = 123/298 (41.28%), Positives = 176/298 (59.06%), Query Frame = 1

Query: 10  RRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVTSTASSPGYP 69
           R +E EFNLREW  +  + R+   SRRFS S IRSFRED +S  +NVT ++STASSPGY 
Sbjct: 4   RGEELEFNLREWARQGHLTREDQSSRRFSASCIRSFREDHKSCTTNVT-ISSTASSPGYS 63

Query: 70  FGDEIDPATYSFTNAIKALQARSL--NSWECFSLDGFTLNSKWNEAEKYICNPLSGEVPM 129
             DEIDP+ YSF++A+KALQA+S+   +W+    +G  LNSKWNEAEKYICNPLSGEVP+
Sbjct: 64  LKDEIDPSNYSFSSALKALQAKSVYKKNWDWLKPEGVELNSKWNEAEKYICNPLSGEVPL 123

Query: 130 ECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKPYSISQLVQKL--------PIPE 189
           ECLS+K+L+ RSF+NL+ +    APL+    +     P +++  V+ +        P+  
Sbjct: 124 ECLSSKTLNSRSFRNLSTK---HAPLMILPSNYNLNIPRTVNPKVRIIHEDPRSPDPVLI 183

Query: 190 NKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNS-KSTPET 249
              +  G  RD+       V    + S A T PI+ER  KR    +DSP   + K   + 
Sbjct: 184 QDKKVVGSKRDV-------VSAQGNVSAAKTTPIMERLTKRQVGADDSPVEYALKLKAQQ 243

Query: 250 EVIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLIC 297
           E ++ E NE+    +E   +  E   ++K +  G  SW+  +K Q++  +S+   LIC
Sbjct: 244 EDVKLEENEQNMMTKE---IQEEKKEKKKRRGSGFSSWI--RKMQRQPRKSKCIFLIC 285

BLAST of Cp4.1LG02g17070 vs. NCBI nr
Match: gi|659123255|ref|XP_008461568.1| (PREDICTED: uncharacterized protein LOC103500139 [Cucumis melo])

HSP 1 Score: 503.4 bits (1295), Expect = 2.7e-139
Identity = 260/304 (85.53%), Positives = 280/304 (92.11%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           MEQPPFIS RR EPEF+LREW AKAKI RDPA SRRFSGSYIRSFREDARSFRSN+TT+T
Sbjct: 1   MEQPPFISQRRGEPEFSLREWAAKAKITRDPATSRRFSGSYIRSFREDARSFRSNITTIT 60

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120
           STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP
Sbjct: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120

Query: 121 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHS-QIETKPYSISQLVQKLPIPE 180
           LSGEVPMECLSAKSLSGRSF+N  NRIAISAPLVYS+HS Q +TKP SI+Q+VQKLPIPE
Sbjct: 121 LSGEVPMECLSAKSLSGRSFRNFTNRIAISAPLVYSNHSQQTQTKPCSIAQVVQKLPIPE 180

Query: 181 NKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETE 240
            +V+AN +TRD+GTQSTPT VGSNSPSPASTPPIV+RALKRCELEEDSPNSNSK TP TE
Sbjct: 181 KQVDANALTRDVGTQSTPTNVGSNSPSPASTPPIVDRALKRCELEEDSPNSNSKITPVTE 240

Query: 241 VIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLICHLK 300
           VI+RE+ EERAKEE+   VH+E IAEEK+KQGGCLSWM  KKKQKEE+RSRRKR + HLK
Sbjct: 241 VIKREMKEERAKEEK---VHKEIIAEEKYKQGGCLSWM--KKKQKEEQRSRRKRFLSHLK 299

Query: 301 LKGC 304
           LKGC
Sbjct: 301 LKGC 299

BLAST of Cp4.1LG02g17070 vs. NCBI nr
Match: gi|449455789|ref|XP_004145633.1| (PREDICTED: uncharacterized protein LOC101205687 [Cucumis sativus])

HSP 1 Score: 502.7 bits (1293), Expect = 4.6e-139
Identity = 259/304 (85.20%), Positives = 280/304 (92.11%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           MEQPPFIS RRDEPEF+LREW AKAKI RDPA SRRFSGSYIRSFREDARSFRSN+TT+T
Sbjct: 1   MEQPPFISQRRDEPEFSLREWAAKAKITRDPATSRRFSGSYIRSFREDARSFRSNITTIT 60

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120
           STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP
Sbjct: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNSWECFSLDGFTLNSKWNEAEKYICNP 120

Query: 121 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHS-QIETKPYSISQLVQKLPIPE 180
           LSGEVPMECLSAKSLSGRSF+N  NRIAISAPLVYS+HS Q +TKP SI+Q+VQKLPIPE
Sbjct: 121 LSGEVPMECLSAKSLSGRSFRNFTNRIAISAPLVYSNHSQQTQTKPCSIAQVVQKLPIPE 180

Query: 181 NKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETE 240
            +++AN +TRD+GTQSTPT VGS SPSPASTPPIV+RALKRCELEEDSPNSNSK TP TE
Sbjct: 181 KQLDANALTRDVGTQSTPTNVGSKSPSPASTPPIVDRALKRCELEEDSPNSNSKITPVTE 240

Query: 241 VIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLICHLK 300
           VI+RE+ EERAKEE+   VH+E IAEEK+KQGGCLSWM  KKKQKEE+RSRRKR + HLK
Sbjct: 241 VIKREMKEERAKEEK---VHKEIIAEEKYKQGGCLSWM--KKKQKEEQRSRRKRFLSHLK 299

Query: 301 LKGC 304
           LKGC
Sbjct: 301 LKGC 299

BLAST of Cp4.1LG02g17070 vs. NCBI nr
Match: gi|1009127162|ref|XP_015880548.1| (PREDICTED: uncharacterized protein LOC107416553 [Ziziphus jujuba])

HSP 1 Score: 332.4 bits (851), Expect = 8.2e-88
Identity = 190/310 (61.29%), Positives = 236/310 (76.13%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           ME+ P+ S RR+E EFNLREW AKA+I R+   SRR+S SYIRSFREDARSFRS++T ++
Sbjct: 1   MEKSPYTSRRREEAEFNLREWGAKARISRENTNSRRYSASYIRSFREDARSFRSSIT-IS 60

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSL-NSWECFSLDGFTLNSKWNEAEKYICN 120
           STASSPGY   DEIDP+TYSFT A++ALQARS+ NSWEC S DGF LNSKWNEAEKYICN
Sbjct: 61  STASSPGYCLRDEIDPSTYSFTTALQALQARSVYNSWECLSPDGFALNSKWNEAEKYICN 120

Query: 121 PLSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQ-IETKPYSIS----QLVQK 180
           PLSGEVPMECLSAK+LSGRSF+NL NRI +SAPL+Y SHS+  +T+P + +     +V  
Sbjct: 121 PLSGEVPMECLSAKTLSGRSFRNLTNRITMSAPLIYPSHSRHFQTRPPNPNTVHEDVVHP 180

Query: 181 LPIPENKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEE-DSPNSNSK 240
           +PIPE K+    MTRD+GTQSTP  + S+SPSPASTPPIVER+LKR  LE  DSPNS +K
Sbjct: 181 VPIPEKKM--GSMTRDVGTQSTPPDLSSSSPSPASTPPIVERSLKRFGLENGDSPNSYAK 240

Query: 241 STPETEVIRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKR 300
              + EV   ++ E R KEE K    +E   +++  QGGCLSWM  +K+Q+E+ + R+K 
Sbjct: 241 LKSQQEV---KMPETREKEETKREEGKEKDEQKQQSQGGCLSWM--RKRQREKHKPRKKN 300

Query: 301 LICHLKLKGC 304
           +   L+LKGC
Sbjct: 301 IFA-LRLKGC 301

BLAST of Cp4.1LG02g17070 vs. NCBI nr
Match: gi|356567180|ref|XP_003551799.1| (PREDICTED: uncharacterized protein LOC100786993 [Glycine max])

HSP 1 Score: 314.7 bits (805), Expect = 1.8e-82
Identity = 179/300 (59.67%), Positives = 212/300 (70.67%), Query Frame = 1

Query: 2   EQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVTS 61
           E  P+ S RRDE EFNLREW  KA+I R+   SRR+SGSY+RSFRED RSFRSN+T ++S
Sbjct: 6   EPLPYSSRRRDESEFNLREWAVKARISREGTNSRRYSGSYMRSFREDTRSFRSNIT-ISS 65

Query: 62  TASSPGYPFGDEIDPATYSFTNAIKALQARS-LNSWECFSLDGFTLNSKWNEAEKYICNP 121
           TASSPGY   DEIDP+TYSFT A+KALQARS   SWEC S DGF LNSKWNEAE+YICNP
Sbjct: 66  TASSPGYLLKDEIDPSTYSFTTALKALQARSSYYSWECLSPDGFALNSKWNEAERYICNP 125

Query: 122 LSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKPYSISQLVQKLPIPEN 181
           LSGEVP+ECLSAK+LSGRSF+N  NRIA+SAPLVYSS   I TKP + +Q    L  P  
Sbjct: 126 LSGEVPLECLSAKTLSGRSFRNSINRIAMSAPLVYSS-KHIPTKPATFTQEEVALQFPNP 185

Query: 182 KVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEEDSPNSNSKSTPETEV 241
           + +  GMTRD+GTQSTP  + S SPSPASTP I ER+     L  DSPNSN+K+  E EV
Sbjct: 186 EKKKEGMTRDVGTQSTPPYISSTSPSPASTPSITERSK---PLVSDSPNSNAKTKSEEEV 245

Query: 242 IRRELNEERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRL-ICHLK 300
             ++      KE E+E        E+  K  GC SWM KKK ++E+ER RR  + + H K
Sbjct: 246 EEKDKETWETKETEREKKVWRKQEEQLCKLSGCFSWMRKKKAEREKERQRRNNIFLTHFK 300

BLAST of Cp4.1LG02g17070 vs. NCBI nr
Match: gi|590653052|ref|XP_007033315.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 313.9 bits (803), Expect = 3.0e-82
Identity = 183/307 (59.61%), Positives = 223/307 (72.64%), Query Frame = 1

Query: 1   MEQPPFISHRRDEPEFNLREWVAKAKIGRDPAISRRFSGSYIRSFREDARSFRSNVTTVT 60
           +EQ  + S R DEPEFNLREW  KA+I R+   SRR+S SYIRSFREDARSFRSN+T ++
Sbjct: 5   VEQSSYSSRRHDEPEFNLREWGLKARISRENTTSRRYSASYIRSFREDARSFRSNIT-IS 64

Query: 61  STASSPGYPFGDEIDPATYSFTNAIKALQARSLNS-WECFSLDGFTLNSKWNEAEKYICN 120
           STASSPGY   DEIDP+TYSFT A+KALQAR++ S WEC S DGF LNSKWNEAEKYICN
Sbjct: 65  STASSPGYSLKDEIDPSTYSFTTALKALQARTVCSGWECLSPDGFALNSKWNEAEKYICN 124

Query: 121 PLSGEVPMECLSAKSLSGRSFKNLANRIAISAPLVYSSHSQIETKP-YSISQLVQKLPIP 180
           PLSGEVPMECLSAK+LSGRSF+NL NRI +SAPLVYS    I+T P  ++ + V + P P
Sbjct: 125 PLSGEVPMECLSAKTLSGRSFRNLTNRITMSAPLVYSHSCHIQTNPSRTVPEDVAQFPTP 184

Query: 181 ENKVEANGMTRDMGTQSTPTVVGSNSPSPASTPPIVERALKRCELEE-DSPNSNSKSTPE 240
           E K E+  MTRD+GTQSTP  + S S SPASTP I+ERALKRC  E  DSPN+N+K   E
Sbjct: 185 EKKAES--MTRDVGTQSTPPDLSSGSLSPASTPSILERALKRCGTENGDSPNTNTKPRAE 244

Query: 241 TEVIRRELNE-ERAKEEEKEVVHRETIAEEKFKQGGCLSWMMKKKKQKEEERSRRKRLIC 300
            +V  +E  E E    ++ E   ++ +     +Q GCLSWM  +++Q+E+ +S RKR I 
Sbjct: 245 EQVEVKETGEREETIIDKAERRRKDELMCRCSRQPGCLSWM--RRRQREKHKS-RKRSIF 304

Query: 301 HLKLKGC 304
               KGC
Sbjct: 305 FPHFKGC 305

The following BLAST results are available for this feature:
Match Name                         E-value   Identity  Description
A0A0A0L6V9_CUCSA                   3.2e-139  85.20     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G259700 PE=4 SV=1
K7MV79_SOYBN                       1.2e-82   59.67     Uncharacterized protein OS=Glycine max GN=GLYMA_18G281000 PE=4 SV=1
A0A061EHB0_THECC                   2.1e-82   59.61     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_019503 PE=4 SV=1
A0A0S3RRS5_PHAAN                   2.3e-81   60.34     Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G023900 PE=...
A0A067K9Y6_JATCU                   4.0e-81   55.93     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12963 PE=4 SV=1

Match Name                         E-value   Identity  Description
AT5G16030.3                        4.1e-53   46.20     unknown protein
AT3G02500.1                        5.3e-45   41.28     unknown protein

Match Name                         E-value   Identity  Description
gi|659123255|ref|XP_008461568.1|   2.7e-139  85.53     PREDICTED: uncharacterized protein LOC103500139 [Cucumis melo]
gi|449455789|ref|XP_004145633.1|   4.6e-139  85.20     PREDICTED: uncharacterized protein LOC101205687 [Cucumis sativus]
gi|1009127162|ref|XP_015880548.1|  8.2e-88   61.29     PREDICTED: uncharacterized protein LOC107416553 [Ziziphus jujuba]
gi|356567180|ref|XP_003551799.1|   1.8e-82   59.67     PREDICTED: uncharacterized protein LOC100786993 [Glycine max]
gi|590653052|ref|XP_007033315.1|   3.0e-82   59.61     Uncharacterized protein isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g17070.1  Cp4.1LG02g17070.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  PANTHER  PTHR36748    FAMILY NOT NAMED    coord: 1..303; score: 1.1E