Cp4.1LG03g14060.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG03g14060.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTransmembrane protein, putative
LocationCp4.1LG03 : 9148062 .. 9151384 (+)
Sequence length2261
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCCAGAATCGCGTGTCGAGTTTTATAACAGAAATACCAGACGAAGTCTCGCAATAGAGAGAAAGAAAGAGAGATCGGAAAGCGAAATAGGAAGATGGAAATGGCGACGAAGACCGATAGCAGAGCGGAGCGAAGGCAAAGAATTAGGAGCAAAGAAACGGATCGAATGGCTCTCATCACCGGCCGTTTACGAACTCTACCTCCCTCGCCTCCGCCTTCTCCTTCTTCACCGTCTCCATTTCTTCAATATCAAATTCACCAACGCGGCCATTCGCACACCGGAATCTCACCGTCCTTCTTATCCAAGGAGCTCCAAAAGAATCCAGATTCCCTTCCCCTCCGTCCCGTCCACGGTATTCTTCTTGTTTTTTTTTTCCTTCTCTCCATTCCGTATTGCCGCTCCCTGTTTTTCTTCTTCTAAATCGAATCTCGACTATTTTGCATTTTGCGATATGACTCTTAGGTTTCTTTCCTCATTCCTTCTTATTTTATATGTTCCGATTTGTATGTTCCGCCTGATTGTGAATTAGAAGCCAATTTTTACTAATTAACTTGCAGTTTTTCTTTTGGTTATGGATTATTGCTAATTAGTCTTCCTTCAACATCCATGGACGTATTAGAGATCAAATCTTTTTATCTGCGTTGTATAACCTAAGTTTAGGAAATGTTAGGATTCTGATATGCACGCACATGAATTTGTTTGTTATAAGCATTGGATGAATCGATTTCGCAAATGCATGCATTGTATTGCTCATGTTGATTCAAGTTGTATGTGTAACTATTGAGGCATCGAGAGATTTCATAAGATAAGCCTCATATATGCTAACGATGATTACAATTATATCAGGTTGAAATTTATATATCCTTTTTAAATTATTCATCACGTGAATAAGATAATCATCTATCTGACAGTGGAGGTATACCAAGTTCTATATCTACCATGAATTTATTACTTATATTTTAAAGGCTTTATTATTGTTATTATTGTTATTATGATTATTATTTATCTCAATGCAGTTAGAAAGAGGTTACTACATTACAAGTGTGCTAACATTGTTCAGAATTTGGTTGCCTAATATTCGCATATTCGAGCATCTCTGAGGTTGGGAAGAACAAGAGAGATATTGACTGTGAAATTTTTAACAGCAAGTTATTAATGTGACCATTATGCATCGGATTAGAAATAAATCAATAACAATATAATAGCTCTATCTTGGTTATTAGTTGTTGTTTGAATCTTTTCCAACAATAAGTCTGGTTAAGATTCATTGTTCGTTTTTCCTATCTAGGAACTGATAACATGGTTTATGTGGGTTAATTTTTTAGATGAGAAAACCACATTTGATACTAAGTGGTGTCAAAACGAATTGGGATTATGGAGATGGTGGAATGATCTCACATTGATTGTGAAAGAAGTTGAATGTTGGTAAATAAATGGGAATGAGAACTTATATGACTTAAACTAACAATTTTGGAGTGAGGAGTCCTGTTTACTTTCGTTTGTTTTGCAGCTATTCCGAAGCTTAAAGATGGAACGGCTGTCCCTTTACCGAAGCATATGCCCATCAATGAAGTCCAAGAAGAAAAAATTACAGCCACAGGATTCCAAATCAATGATAAAAAAATCGACCCCATTGGAGAAGTATGCAAAGAAATGATATCTCCATCTGCCTTACCAATGGTTCAGAAAGCCACCATTGTTAACGAGCCACTGTCAAAACCACAGCCTTCAAAGCCTAGAATCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCAAACGAGTTTTCTGTTCACTAATCATCGCTTCTTTGGCCATTCTATCACATGTCGATCACCCACTTTTCACGATTAGGAACGTAGTGAGTTTGGAGAGTGTGATGGCCTCAAAGCCTCTCTACATTCTACTGCTCACCAACGTAACAATCGTAATGGCGAGGATGTTGGCTGAGAGGCCGAAACACGGTGGGGAGGCAGAGGAAGAATGCGAGAAGATGAAGGAAGATGGACAAAACTGGGAGTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTGTTTTACCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTCATTTGTGGCCTCTCTTTGCTATAGCTTGTCTCTGGTTTTGATCCAGTGAACCATCTTCAGTTCTTTGGAGCTCTTTGCAGATGGATCCCATTTTCAAAAATTGTTGTCTTATTTGTTTCTTTCTTTTAAAGGTACATAACTACGCAAGGCACCATTTTGGTGTGGCTGCCTATGCCCAATCTTAGTTGTTAATTATTTGAAATGCTTAAAATTCATCTTCTTTTTCTCCTTTTGAATATCGAATCTATCATTTTAGATATTAGTTCTCATTTTCATGTCTTTAAGAACAAGAGATCATTATCGATCTCCATAGAAAAGATAGCAATAGTTCCAAGACTTTTCATATGAAAATTCCATAACAGTTGTTTGAATGTCAAATAAATAACGCAGTGGGAAATCCTACAAAACTTACAGGGAAAGTTTAGACCTGACAAATCAACCTCAAGTTCTTGTACATGTCTGAACCCGCAGAAGAAAATTCTGACAATGCCCTGAACAGTTCTGGCAACTGATTCTTAAGACTCACCAGTGATTTCTCCCTCACATGAAGGCATTGCTTTGCATGAGTTTCCTTTTCCTCCTCCACTCTCTTTTTCAATGACTCTACCACGACCAACTTCTCTGTCACTACGGCGTCCTGCGAGTTTTCTTCAGACTTCTCGGGGTCCAACTCGTCGGGCATTCTCCGTTGCTGGTATTTGTAATGCCAGTCGTCAAATTGCCTCTGCTTGCGCTCGAGCTCTTTCATGCTCTCGTCGCATCTTAACTTCAACTTCATCTCTTCTTCCTGCTGCAGTACAATAGTACTTATCACAGCACTGAAACTGGATATCGCAGTTCTAAGATGCTCGTCCGGGAGTTTTTCGAGTTGGTCGTGCCAAGCAATGAGGAGTCTTTGAATTGGTGGATTTTGAGCTCTTGGTGGAGAAGAAACCTTCTCTTTCAAGCTACTCTCTATAGGAACTAGATTTAGTTTCAACCAACTGCTTAATGCTTTTATGTAGTCTTTCTGACAGAGCGTGAGCTTCTCGAACTGCGAGTGCCATTCTCGCACAACGTTGCAGAGCTGTACCGTGCGCTCATGGTGATGCATACTGGTTTCTTTTGGGGATTGAGAGAGATCCAGATATCTCAATGCGTTCACAATTTTCAATTGCTCTTCATGGTGCATTCGCATTGTGTCCCACATCAACATCATCCT

mRNA sequence

CGCCAGAATCGCGTGTCGAGTTTTATAACAGAAATACCAGACGAAGTCTCGCAATAGAGAGAAAGAAAGAGAGATCGGAAAGCGAAATAGGAAGATGGAAATGGCGACGAAGACCGATAGCAGAGCGGAGCGAAGGCAAAGAATTAGGAGCAAAGAAACGGATCGAATGGCTCTCATCACCGGCCGTTTACGAACTCTACCTCCCTCGCCTCCGCCTTCTCCTTCTTCACCGTCTCCATTTCTTCAATATCAAATTCACCAACGCGGCCATTCGCACACCGGAATCTCACCGTCCTTCTTATCCAAGGAGCTCCAAAAGAATCCAGATTCCCTTCCCCTCCGTCCCGTCCACGATGAGAAAACCACATTTGATACTAAGTGGTGTCAAAACGAATTGGGATTATGGAGATGGTGGAATGATCTCACATTGATTGTGAAAGAAGTTGAATCTATTCCGAAGCTTAAAGATGGAACGGCTGTCCCTTTACCGAAGCATATGCCCATCAATGAAGTCCAAGAAGAAAAAATTACAGCCACAGGATTCCAAATCAATGATAAAAAAATCGACCCCATTGGAGAAGTATGCAAAGAAATGATATCTCCATCTGCCTTACCAATGGTTCAGAAAGCCACCATTGTTAACGAGCCACTGTCAAAACCACAGCCTTCAAAGCCTAGAATCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCAAACGAGTTTTCTGTTCACTAATCATCGCTTCTTTGGCCATTCTATCACATGTCGATCACCCACTTTTCACGATTAGGAACGTAGTGAGTTTGGAGAGTGTGATGGCCTCAAAGCCTCTCTACATTCTACTGCTCACCAACGTAACAATCGTAATGGCGAGGATGTTGGCTGAGAGGCCGAAACACGGTGGGGAGGCAGAGGAAGAATGCGAGAAGATGAAGGAAGATGGACAAAACTGGGAGTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTGTTTTACCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTCATTTGTGGCCTCTCTTTGCTATAGCTTGTCTCTGGTTTTGATCCAGTGAACCATCTTCAGTTCTTTGGAGCTCTTTGCAGATGGATCCCATTTTCAAAAATTGTTGTCTTATTTGTTTCTTTCTTTTAAAGGTACATAACTACGCAAGGCACCATTTTGGTGTGGCTGCCTATGCCCAATCTTAGTTGTTAATTATTTGAAATGCTTAAAATTCATCTTCTTTTTCTCCTTTTGAATATCGAATCTATCATTTTAGATATTAGTTCTCATTTTCATGTCTTTAAGAACAAGAGATCATTATCGATCTCCATAGAAAAGATAGCAATAGTTCCAAGACTTTTCATATGAAAATTCCATAACAGTTGTTTGAATGTCAAATAAATAACGCAGTGGGAAATCCTACAAAACTTACAGGGAAAGTTTAGACCTGACAAATCAACCTCAAGTTCTTGTACATGTCTGAACCCGCAGAAGAAAATTCTGACAATGCCCTGAACAGTTCTGGCAACTGATTCTTAAGACTCACCAGTGATTTCTCCCTCACATGAAGGCATTGCTTTGCATGAGTTTCCTTTTCCTCCTCCACTCTCTTTTTCAATGACTCTACCACGACCAACTTCTCTGTCACTACGGCGTCCTGCGAGTTTTCTTCAGACTTCTCGGGGTCCAACTCGTCGGGCATTCTCCGTTGCTGGTATTTGTAATGCCAGTCGTCAAATTGCCTCTGCTTGCGCTCGAGCTCTTTCATGCTCTCGTCGCATCTTAACTTCAACTTCATCTCTTCTTCCTGCTGCAGTACAATAGTACTTATCACAGCACTGAAACTGGATATCGCAGTTCTAAGATGCTCGTCCGGGAGTTTTTCGAGTTGGTCGTGCCAAGCAATGAGGAGTCTTTGAATTGGTGGATTTTGAGCTCTTGGTGGAGAAGAAACCTTCTCTTTCAAGCTACTCTCTATAGGAACTAGATTTAGTTTCAACCAACTGCTTAATGCTTTTATGTAGTCTTTCTGACAGAGCGTGAGCTTCTCGAACTGCGAGTGCCATTCTCGCACAACGTTGCAGAGCTGTACCGTGCGCTCATGGTGATGCATACTGGTTTCTTTTGGGGATTGAGAGAGATCCAGATATCTCAATGCGTTCACAATTTTCAATTGCTCTTCATGGTGCATTCGCATTGTGTCCCACATCAACATCATCCT

Coding sequence (CDS)

ATGGAAATGGCGACGAAGACCGATAGCAGAGCGGAGCGAAGGCAAAGAATTAGGAGCAAAGAAACGGATCGAATGGCTCTCATCACCGGCCGTTTACGAACTCTACCTCCCTCGCCTCCGCCTTCTCCTTCTTCACCGTCTCCATTTCTTCAATATCAAATTCACCAACGCGGCCATTCGCACACCGGAATCTCACCGTCCTTCTTATCCAAGGAGCTCCAAAAGAATCCAGATTCCCTTCCCCTCCGTCCCGTCCACGATGAGAAAACCACATTTGATACTAAGTGGTGTCAAAACGAATTGGGATTATGGAGATGGTGGAATGATCTCACATTGATTGTGAAAGAAGTTGAATCTATTCCGAAGCTTAAAGATGGAACGGCTGTCCCTTTACCGAAGCATATGCCCATCAATGAAGTCCAAGAAGAAAAAATTACAGCCACAGGATTCCAAATCAATGATAAAAAAATCGACCCCATTGGAGAAGTATGCAAAGAAATGATATCTCCATCTGCCTTACCAATGGTTCAGAAAGCCACCATTGTTAACGAGCCACTGTCAAAACCACAGCCTTCAAAGCCTAGAATCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCAAACGAGTTTTCTGTTCACTAATCATCGCTTCTTTGGCCATTCTATCACATGTCGATCACCCACTTTTCACGATTAGGAACGTAGTGAGTTTGGAGAGTGTGATGGCCTCAAAGCCTCTCTACATTCTACTGCTCACCAACGTAACAATCGTAATGGCGAGGATGTTGGCTGAGAGGCCGAAACACGGTGGGGAGGCAGAGGAAGAATGCGAGAAGATGAAGGAAGATGGACAAAACTGGGAGTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTGTTTTACCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTCATTTGTGGCCTCTCTTTGCTATAG

Protein sequence

MEMATKTDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHSHTGISPSFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVESIPKLKDGTAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMISPSALPMVQKATIVNEPLSKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDGQNWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGLSLL
BLAST of Cp4.1LG03g14060.1 vs. TrEMBL
Match: A0A0A0L815_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G481250 PE=4 SV=1)

HSP 1 Score: 377.5 bits (968), Expect = 1.7e-101
Identity = 216/331 (65.26%), Postives = 243/331 (73.41%), Query Frame = 1

Query: 1   MEMATKTDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHS 60
           MEMATK ++R +RR+RI S+E DRMALITGRLR LPPSPPPSPSSPSPFL +Q HQRGHS
Sbjct: 1   MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60

Query: 61  HTGISPSFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVESI 120
           HTGISPSF SK++  NPDS PL                                   + +
Sbjct: 61  HTGISPSFFSKDIHANPDSPPL--------------------------------PNAQGV 120

Query: 121 PKLKDGTAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMIS-PSALPMVQKA 180
           PK KD  A PL K + ++E +EEKI A GFQIN KK+DPIGE+  E +S PSA  MVQK 
Sbjct: 121 PKPKDAKATPLLKRLSMSEAREEKIAAIGFQINHKKLDPIGEIHTETVSTPSASSMVQKV 180

Query: 181 TIV-NEPLSKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRN 240
           T   NE L K  PSKP++FTSKRLNASILASQT RVFCSLIIASLA+LSHV+HPL  I  
Sbjct: 181 TSTDNEILLKAHPSKPKLFTSKRLNASILASQTTRVFCSLIIASLAVLSHVNHPLSMIWK 240

Query: 241 VVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDGQNWESAVKVL 300
           +V  E V+ASKPLYILLLT+ TIV+ARMLA R K   EAEEE EKMKEDG NW+SAVKVL
Sbjct: 241 MVRSERVVASKPLYILLLTDATIVVARMLAARQKDSREAEEESEKMKEDGHNWDSAVKVL 299

Query: 301 ERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 330
           ERGLVFYQAFRAIFIDFSVYAVVVICG+SLL
Sbjct: 301 ERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299

BLAST of Cp4.1LG03g14060.1 vs. TrEMBL
Match: B9RQR3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1495360 PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 3.2e-36
Identity = 125/326 (38.34%), Postives = 173/326 (53.07%), Query Frame = 1

Query: 7   TDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHSHTGISP 66
           +++R ERR+RI  + +DR+ALITG+++ L  SP  +P+       YQ  QR H+HT  SP
Sbjct: 3   SNARQERRRRIVERGSDRLALITGQIQNLNESPSSTPT-------YQ--QRHHAHTESSP 62

Query: 67  SFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVESIPKLKDG 126
           S +                    + +D      E           L    V  + KL+  
Sbjct: 63  SIMY-------------------SPYDHSQINAE----------GLDGASVAKLTKLRTI 122

Query: 127 TAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMISPSALPMVQKATIVNEPL 186
             +P  K+    +  E +        + +KI P     K  I  S +       + NE  
Sbjct: 123 NGIPESKNFDSPKKPESRFRNNA--TSPEKIRPQVFDTKTEIQESVIVATNSPLVPNE-- 182

Query: 187 SKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVM 246
                   R F+SKR+NA I++S+ +R  CSLI+A L ++SH+ +PLF+  N+V  ESV+
Sbjct: 183 ---SNHHYRFFSSKRINACIISSERRRAMCSLIVAFLVVISHIGYPLFS-ANIVRSESVI 242

Query: 247 ASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKM---KEDGQNWESAVKVLERGLV 306
           AS+PLYI+LLT+V IV+  M  E   +G E E E E+M   KEDG NW+ AVK+LERGLV
Sbjct: 243 ASRPLYIILLTDVAIVLGHMFHESGNNGSE-EAEAERMEPNKEDGDNWDGAVKLLERGLV 281

Query: 307 FYQAFRAIFIDFSVYAVVVICGLSLL 330
            YQA R IFID SVY VVVI GLSLL
Sbjct: 303 LYQAIRGIFIDCSVYLVVVISGLSLL 281

BLAST of Cp4.1LG03g14060.1 vs. TrEMBL
Match: U5GLN1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s05400g PE=4 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 1.6e-35
Identity = 98/198 (49.49%), Postives = 129/198 (65.15%), Query Frame = 1

Query: 136 PINEVQEEKITATGFQINDKKIDPIGEVCKEMISPSAL--PMVQKATIVNEPLSKPQPSK 195
           P  +   E     GF I ++    + +  +E ++P+     M +  T +  P  +    K
Sbjct: 83  PKRKASNEAFEGIGFDIRNQ----VEQHLQERVTPTEAYSKMTEIQTSIATPSIQKASDK 142

Query: 196 PRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVMASKPLYI 255
           P  F+SKR+N+ I+ASQ  RV CSLIIASL ++S++D+PL  I N+VS ES++AS+PLYI
Sbjct: 143 PNFFSSKRINSCIIASQRSRVICSLIIASLVLISYIDYPLLGI-NIVSSESIIASRPLYI 202

Query: 256 LLLTNVTIVMARMLAERPKHGGEAEEECEKM--KEDGQNWESAVKVLERGLVFYQAFRAI 315
           +LLT+VTIV+ R+  ER  HG E E E E+M  KEDG NW  AVK+LERGL  YQA R I
Sbjct: 203 VLLTDVTIVLVRLFRERGNHGSE-ESERERMVSKEDGDNWVGAVKLLERGLTVYQAVRGI 262

Query: 316 FIDFSVYAVVVICGLSLL 330
           FID SVY VVVIC LSLL
Sbjct: 263 FIDCSVYLVVVICALSLL 274

BLAST of Cp4.1LG03g14060.1 vs. TrEMBL
Match: W9RGT9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026179 PE=4 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.7e-32
Identity = 84/142 (59.15%), Postives = 112/142 (78.87%), Query Frame = 1

Query: 193 KPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVMASKPLY 252
           +P+ F+SKRLN+SI+ S+T R+FC+LIIA L +LS+VD+PLF  RN+V+ ESV+AS+PLY
Sbjct: 115 RPKFFSSKRLNSSIIVSETPRIFCALIIAFLVVLSYVDYPLFG-RNIVTTESVVASRPLY 174

Query: 253 ILLLTNVTIVMARMLAERPK----HGGEA-EEECEKMKEDGQNWESAVKVLERGLVFYQA 312
           ILLLT+V+IV+AR+  E  +     G E  EEE  ++++DG +W  A++ LERGLV YQA
Sbjct: 175 ILLLTDVSIVIARLHLENRRAPEDEGAEGQEEEVLRIRDDGHDWTQALRFLERGLVVYQA 234

Query: 313 FRAIFIDFSVYAVVVICGLSLL 330
            R IFID SVYAVVVICGLSL+
Sbjct: 235 IRGIFIDCSVYAVVVICGLSLV 255

BLAST of Cp4.1LG03g14060.1 vs. TrEMBL
Match: A0A067EBJ1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g021998mg PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 2.0e-30
Identity = 118/339 (34.81%), Postives = 181/339 (53.39%), Query Frame = 1

Query: 3   MATKTDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHSHT 62
           MAT + SR ERR+RI  + +DR+ALI+GR+++LP SP  SP           H+   +H 
Sbjct: 1   MATGS-SREERRKRILDRGSDRLALISGRIQSLPSSPISSPHH---------HRAATTHL 60

Query: 63  GISPSFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIV-------- 122
                  +++   +     L+ +   ++   T+      G  +     +L+V        
Sbjct: 61  STQSLVFTRDHHDH-----LQSLISNQSNAGTEGPITPSGPHQLLKHSSLVVPREKTYDI 120

Query: 123 -KEVE-SIPKLKDGTAVPLPKH--MPINEVQEEKITATGFQINDKKIDPIGEVCKEMISP 182
            ++VE  +PK +      +PK+    +NEV                     E  K    P
Sbjct: 121 GRQVEPQVPKHERKVDPQVPKYDAKSVNEVA-----------------AADESTKSQPEP 180

Query: 183 SALPMVQKATIVNEPLSKPQ-PSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHV 242
             +P    A IV +P +  +   KP +F+ K++N+ I+ASQ  R  C+L+IA L +LS++
Sbjct: 181 EPMP---AAPIVEKPSNDTELLPKPILFSCKQINSCIIASQGIRSLCALLIALLVVLSYI 240

Query: 243 DHPLFTIRNVVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDGQ 302
           D  L  I N+VS ESV A +PLYILLLT+VTIV+A++  ++ K   EAE+E  + +EDG 
Sbjct: 241 DDALLGI-NIVSSESVEALRPLYILLLTDVTIVLAQVFLKQQKQSEEAEKENVEPQEDGN 300

Query: 303 NWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGLSL 329
           N   A++++ER LV YQ  RA+FIDFS+Y VVV+CG SL
Sbjct: 301 NMTQAIQLMERSLVVYQTARAVFIDFSIYTVVVVCGFSL 303

BLAST of Cp4.1LG03g14060.1 vs. TAIR10
Match: AT1G52343.1 (AT1G52343.1 unknown protein)

HSP 1 Score: 79.0 bits (193), Expect = 6.3e-15
Identity = 59/146 (40.41%), Postives = 85/146 (58.22%), Query Frame = 1

Query: 188 KPQPSKPR-IFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVM 247
           K Q  +P   F+SK+LNASI++S+  R   SL IA+  +L           N+ S  +++
Sbjct: 110 KSQNQRPICFFSSKKLNASIISSERTRSLSSLTIAAFVVL-------LPRLNITSSNTIL 169

Query: 248 ASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDG---QNWESAVKVLERGLV 307
           A +PL++L+LT+  IVM+ +  E    G   E E +    DG   +NW  A ++LERG+V
Sbjct: 170 ALRPLWLLILTDCAIVMSHLTTEASGGGLSHEMEEDGKGRDGNNGENWSDAERLLERGVV 229

Query: 308 FYQAFRAIFIDFSVY-AVVVICGLSL 329
            YQA R +FID S+Y  VVVI G SL
Sbjct: 230 VYQALRGMFIDCSLYMVVVVIFGASL 248

BLAST of Cp4.1LG03g14060.1 vs. TAIR10
Match: AT4G32680.1 (AT4G32680.1 unknown protein)

HSP 1 Score: 57.8 bits (138), Expect = 1.5e-08
Identity = 48/160 (30.00%), Postives = 83/160 (51.88%), Query Frame = 1

Query: 170 PSALPMVQKATIVNEPLSK---PQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAIL 229
           P     VQ  ++V+   S+   P  S     T K + A+I AS+  R+F +L IA + IL
Sbjct: 126 PPTTSSVQNPSVVDLGASQAFIPVVSFVNAITPKHIGAAIDASEYARMFTALAIALVVIL 185

Query: 230 SHVDHPLFTIRNVVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKE 289
           SH+           SL ++++ +P+++L+LT+ TIV+ R+L     H G++      +  
Sbjct: 186 SHLGFS--------SLGNIVSFRPVFLLVLTDATIVLGRVLLS---HRGDSSSASGTVMS 245

Query: 290 DGQNWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGL 327
                +     LE  ++  +   A+ +DFS+YAV++ICGL
Sbjct: 246 GQGIVDQVGNALETVMMVKKIMDALLMDFSLYAVILICGL 274

BLAST of Cp4.1LG03g14060.1 vs. NCBI nr
Match: gi|449465278|ref|XP_004150355.1| (PREDICTED: uncharacterized protein LOC101203675 [Cucumis sativus])

HSP 1 Score: 377.5 bits (968), Expect = 2.4e-101
Identity = 216/331 (65.26%), Postives = 243/331 (73.41%), Query Frame = 1

Query: 1   MEMATKTDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHS 60
           MEMATK ++R +RR+RI S+E DRMALITGRLR LPPSPPPSPSSPSPFL +Q HQRGHS
Sbjct: 1   MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60

Query: 61  HTGISPSFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVESI 120
           HTGISPSF SK++  NPDS PL                                   + +
Sbjct: 61  HTGISPSFFSKDIHANPDSPPL--------------------------------PNAQGV 120

Query: 121 PKLKDGTAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMIS-PSALPMVQKA 180
           PK KD  A PL K + ++E +EEKI A GFQIN KK+DPIGE+  E +S PSA  MVQK 
Sbjct: 121 PKPKDAKATPLLKRLSMSEAREEKIAAIGFQINHKKLDPIGEIHTETVSTPSASSMVQKV 180

Query: 181 TIV-NEPLSKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRN 240
           T   NE L K  PSKP++FTSKRLNASILASQT RVFCSLIIASLA+LSHV+HPL  I  
Sbjct: 181 TSTDNEILLKAHPSKPKLFTSKRLNASILASQTTRVFCSLIIASLAVLSHVNHPLSMIWK 240

Query: 241 VVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDGQNWESAVKVL 300
           +V  E V+ASKPLYILLLT+ TIV+ARMLA R K   EAEEE EKMKEDG NW+SAVKVL
Sbjct: 241 MVRSERVVASKPLYILLLTDATIVVARMLAARQKDSREAEEESEKMKEDGHNWDSAVKVL 299

Query: 301 ERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 330
           ERGLVFYQAFRAIFIDFSVYAVVVICG+SLL
Sbjct: 301 ERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299

BLAST of Cp4.1LG03g14060.1 vs. NCBI nr
Match: gi|659093364|ref|XP_008447503.1| (PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo])

HSP 1 Score: 377.5 bits (968), Expect = 2.4e-101
Identity = 218/333 (65.47%), Postives = 246/333 (73.87%), Query Frame = 1

Query: 1   MEMATKTDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHS 60
           MEM TKTD+R ERR+RI S+E DRMALITGRL  LPPSPPPSPSSPSPFL +Q HQRGHS
Sbjct: 1   MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60

Query: 61  HTGISPSFLSKELQK--NPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVE 120
           HTGISPSF SK+L    NPDSLP                                    +
Sbjct: 61  HTGISPSFFSKDLHNHNNPDSLPF--------------------------------PNAQ 120

Query: 121 SIPKLKDGTAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMIS-PSALPMVQ 180
            IPK KD  A PL K + ++E +EEKI A GFQ N KK+DPIGEV  E +S PSA  MVQ
Sbjct: 121 GIPKPKDAKATPLLKRLSMSEAREEKIAAIGFQFNHKKLDPIGEVHTETVSTPSASSMVQ 180

Query: 181 KATIVNEP-LSKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTI 240
           K T +++  L K  PSKP++FTSKR+NASILASQT RVFCSLIIASL++LSHV+HPL  I
Sbjct: 181 KITSIDDKILLKTHPSKPKLFTSKRINASILASQTTRVFCSLIIASLSVLSHVNHPLSII 240

Query: 241 RNVVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKMKEDGQNWESAVK 300
            N+V  ESV+ASKPLYILLLT+ TIV+ARMLAER K GG AEEE EKMKEDG+NW+SAVK
Sbjct: 241 WNMVRSESVVASKPLYILLLTDATIVLARMLAERQKDGGVAEEEIEKMKEDGRNWDSAVK 300

Query: 301 VLERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 330
           VLERGLVFYQAFRAIFIDFSVYAVVVICG+ LL
Sbjct: 301 VLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 301

BLAST of Cp4.1LG03g14060.1 vs. NCBI nr
Match: gi|255550062|ref|XP_002516082.1| (PREDICTED: uncharacterized protein LOC8274534 [Ricinus communis])

HSP 1 Score: 160.6 bits (405), Expect = 4.6e-36
Identity = 125/326 (38.34%), Postives = 173/326 (53.07%), Query Frame = 1

Query: 7   TDSRAERRQRIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHSHTGISP 66
           +++R ERR+RI  + +DR+ALITG+++ L  SP  +P+       YQ  QR H+HT  SP
Sbjct: 3   SNARQERRRRIVERGSDRLALITGQIQNLNESPSSTPT-------YQ--QRHHAHTESSP 62

Query: 67  SFLSKELQKNPDSLPLRPVHDEKTTFDTKWCQNELGLWRWWNDLTLIVKEVESIPKLKDG 126
           S +                    + +D      E           L    V  + KL+  
Sbjct: 63  SIMY-------------------SPYDHSQINAE----------GLDGASVAKLTKLRTI 122

Query: 127 TAVPLPKHMPINEVQEEKITATGFQINDKKIDPIGEVCKEMISPSALPMVQKATIVNEPL 186
             +P  K+    +  E +        + +KI P     K  I  S +       + NE  
Sbjct: 123 NGIPESKNFDSPKKPESRFRNNA--TSPEKIRPQVFDTKTEIQESVIVATNSPLVPNE-- 182

Query: 187 SKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVM 246
                   R F+SKR+NA I++S+ +R  CSLI+A L ++SH+ +PLF+  N+V  ESV+
Sbjct: 183 ---SNHHYRFFSSKRINACIISSERRRAMCSLIVAFLVVISHIGYPLFS-ANIVRSESVI 242

Query: 247 ASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECEKM---KEDGQNWESAVKVLERGLV 306
           AS+PLYI+LLT+V IV+  M  E   +G E E E E+M   KEDG NW+ AVK+LERGLV
Sbjct: 243 ASRPLYIILLTDVAIVLGHMFHESGNNGSE-EAEAERMEPNKEDGDNWDGAVKLLERGLV 281

Query: 307 FYQAFRAIFIDFSVYAVVVICGLSLL 330
            YQA R IFID SVY VVVI GLSLL
Sbjct: 303 LYQAIRGIFIDCSVYLVVVISGLSLL 281

BLAST of Cp4.1LG03g14060.1 vs. NCBI nr
Match: gi|566160981|ref|XP_006385474.1| (hypothetical protein POPTR_0003s05400g [Populus trichocarpa])

HSP 1 Score: 158.3 bits (399), Expect = 2.3e-35
Identity = 98/198 (49.49%), Postives = 129/198 (65.15%), Query Frame = 1

Query: 136 PINEVQEEKITATGFQINDKKIDPIGEVCKEMISPSAL--PMVQKATIVNEPLSKPQPSK 195
           P  +   E     GF I ++    + +  +E ++P+     M +  T +  P  +    K
Sbjct: 83  PKRKASNEAFEGIGFDIRNQ----VEQHLQERVTPTEAYSKMTEIQTSIATPSIQKASDK 142

Query: 196 PRIFTSKRLNASILASQTKRVFCSLIIASLAILSHVDHPLFTIRNVVSLESVMASKPLYI 255
           P  F+SKR+N+ I+ASQ  RV CSLIIASL ++S++D+PL  I N+VS ES++AS+PLYI
Sbjct: 143 PNFFSSKRINSCIIASQRSRVICSLIIASLVLISYIDYPLLGI-NIVSSESIIASRPLYI 202

Query: 256 LLLTNVTIVMARMLAERPKHGGEAEEECEKM--KEDGQNWESAVKVLERGLVFYQAFRAI 315
           +LLT+VTIV+ R+  ER  HG E E E E+M  KEDG NW  AVK+LERGL  YQA R I
Sbjct: 203 VLLTDVTIVLVRLFRERGNHGSE-ESERERMVSKEDGDNWVGAVKLLERGLTVYQAVRGI 262

Query: 316 FIDFSVYAVVVICGLSLL 330
           FID SVY VVVIC LSLL
Sbjct: 263 FIDCSVYLVVVICALSLL 274

BLAST of Cp4.1LG03g14060.1 vs. NCBI nr
Match: gi|743915339|ref|XP_011001619.1| (PREDICTED: uncharacterized protein LOC105108848 [Populus euphratica])

HSP 1 Score: 154.8 bits (390), Expect = 2.5e-34
Identity = 93/169 (55.03%), Postives = 120/169 (71.01%), Query Frame = 1

Query: 165 KEMISPSAL--PMVQKATIVNEPLSKPQPSKPRIFTSKRLNASILASQTKRVFCSLIIAS 224
           +E ++P+     M +  T++  P  +    KP  F+SKR+N+ I+ASQ  RV CSLIIAS
Sbjct: 117 QERVAPTEAHNTMTKVQTLIVTPSIRKASDKPNFFSSKRINSCIIASQRSRVICSLIIAS 176

Query: 225 LAILSHVDHPLFTIRNVVSLESVMASKPLYILLLTNVTIVMARMLAERPKHGGEAEEECE 284
           L ++S++D+PL  I N+VS ES++AS+PLYI+LLT+VTIV+ R+  ER  HG E E E E
Sbjct: 177 LVLISYIDYPLLGI-NIVSSESIIASRPLYIVLLTDVTIVLVRLFRERGNHGTE-ESERE 236

Query: 285 KM--KEDGQNWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 330
           +M  KEDG NW  AVK+LERGL  YQA R IFID SVY VVVIC LSLL
Sbjct: 237 RMVSKEDGDNWVGAVKLLERGLAMYQAVRGIFIDCSVYLVVVICTLSLL 283

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L815_CUCSA1.7e-10165.26Uncharacterized protein OS=Cucumis sativus GN=Csa_3G481250 PE=4 SV=1[more]
B9RQR3_RICCO3.2e-3638.34Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1495360 PE=4 SV=1[more]
U5GLN1_POPTR1.6e-3549.49Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s05400g PE=4 SV=1[more]
W9RGT9_9ROSA1.7e-3259.15Uncharacterized protein OS=Morus notabilis GN=L484_026179 PE=4 SV=1[more]
A0A067EBJ1_CITSI2.0e-3034.81Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g021998mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G52343.16.3e-1540.41 unknown protein[more]
AT4G32680.11.5e-0830.00 unknown protein[more]
Match NameE-valueIdentityDescription
gi|449465278|ref|XP_004150355.1|2.4e-10165.26PREDICTED: uncharacterized protein LOC101203675 [Cucumis sativus][more]
gi|659093364|ref|XP_008447503.1|2.4e-10165.47PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo][more]
gi|255550062|ref|XP_002516082.1|4.6e-3638.34PREDICTED: uncharacterized protein LOC8274534 [Ricinus communis][more]
gi|566160981|ref|XP_006385474.1|2.3e-3549.49hypothetical protein POPTR_0003s05400g [Populus trichocarpa][more]
gi|743915339|ref|XP_011001619.1|2.5e-3455.03PREDICTED: uncharacterized protein LOC105108848 [Populus euphratica][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG03g14060Cp4.1LG03g14060gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG03g14060.1:five_prime_utr:001Cp4.1LG03g14060.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG03g14060.1:cds:003Cp4.1LG03g14060.1:cds:003CDS
Cp4.1LG03g14060.1:cds:002Cp4.1LG03g14060.1:cds:002CDS
Cp4.1LG03g14060.1:cds:001Cp4.1LG03g14060.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG03g14060.1:three_prime_utr:001Cp4.1LG03g14060.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG03g14060.1Cp4.1LG03g14060.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35469FAMILY NOT NAMEDcoord: 6..329
score: 7.1
NoneNo IPR availablePANTHERPTHR35469:SF1SUBFAMILY NOT NAMEDcoord: 6..329
score: 7.1