Cp4.1LG17g04500.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG17g04500.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionBnaC04g00790D protein
LocationCp4.1LG17 : 2994287 .. 2996064 (+)
Sequence length1075
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CACCCAATTCCTTCTACATTCTTTTTCATTCTTATTCTCAATGGCCGGTCAAGTTCCTCGTCCATGGTTCCGTTTGGGCTCTATGACCCGACCAACAACCGCCACAACCACCGCTCCGCCCACCGACCAACCGCGACCTGCTCCCCCTCGGATGCGCCCGCCCTTGATCCGAACCACCTCCTTGACGGAGCTTGCAGAACCCACCACCCCACAGCAATCTGCCAAATCGCCGCCGCAGTTACCTCGCCTAGCCCCACTAACCCCTCAGCCGAAAAAGGAAGCGTCGCCACCGCCACCGCCCCCGCCTTTGACCCGACCTGTTGTTACGTCCACCGCAGCGCCAAAACCCAGTAGCCCCAAGGGAAAAAGTCATTCGCTTGGAGGGTCTCCGCCGCGCAATGGCTCGTTTGATAAAGAAAGTAATAAACATCCCACCCCAGCTCCGTCTCCTTCTATTCCCAAGTCTATTCCGGCCGTTCCGTCGCCTTATCAGTCGCCGAAGCCGAAGCCTAAAGCCACCGCTTCGCCCCCGTCGCCGCTTGTTTTACCACCGCCGCAGTTGCAGTTTGTCGCCGAGCCTAGACCTGAGACCATTCCTCAAGAGGTAGGGTTAATTTAACTTCATTGAGATTGGTGTGTTTTGAGATTGGTGTGTTTTTAGCATTGGCAAACTTTTGAAAATCGAGAGTAAAAATAGTAGGTTTAGCCCGTTGAGGATTGTTGGGAGAGAGTACCACGTTAGCTAATTTAGAGAATGATCGTGTGTTTATAAGCCGGGAATATATCTTCATTGATACGAGGCCTTTTAAAAGCCATGAGAGCTTATGCCCAAATTCGACAATATCATACCATTATGGAGAGTAATGATTAGTAACATGGTATCAGAGTCATGCCCTTAACTTAGTCATGTCAATAGAATCTTCAAATGTCGAACAAAGAAGTTGTGAACCTCGAGGGTATAGTCAAAAGTGACTCAAGTGTCGAATAAATGGTGTACTTTCTTCGAGTACTCCAGAGAAGGAGTCGAGCCTCGATTAAGAGGAGGTGTTTCTCGACTCCTTCCTGTAGTCCTCGAACAAAGTACACCCTTTATTCGAGGGCTCGAGAGGAGGATTGTGGAGTCCCACCTTAGCTAATTTAAGAAATAATCAAGGGTTTATAAATAAGGAATACATCTTTATTGGTACGAGGTGTTTTGGATGCTCAAAGCGAACCATATCTTACCATTATGGAGAATCGTGATTTCTAACATAATCAATGAACAGGTAGAACGAAAGACGGTTGTTTTTCAAAAAGTGATGGAGAAGCCTTCACCGCCCGATCACGATATCAATAACTACAATTCCCAAGCATCTGGATTTGGCAAAAATGGGAAGAAACATGATGACAATCAAGACAAGGGAAATGGAGCCAAGAAACCGTTCGTGTCGTCCGGTGATAAAGACTCGAGCACAAGAGTTATAACAATGGCAGGAGAAAACAGAGGAGCTTCCATGGAAGTAGTTCTCTCAGACAAAAACAACAACAGCCGCCATCTTCAAACAAATCAAAGCGACAAGGAAGAGACCACCAACGATTCTAAAGACAACAAAAGTAACAAGAAATCAACGGCGGGTCGTGGTGCATGGCCAATGAAAGCTTTCTTTAACAGCAATGTTCAAGGGATTAACAATTCAATTCTGATGGACTCCAAATTCACCCACCACGACCCTGGGATTCATCTTGTCTTCTCCAAGATGCCGCCGCCATCTGCCCATCAGGAAGACGGCGGACAATAA

mRNA sequence

CACCCAATTCCTTCTACATTCTTTTTCATTCTTATTCTCAATGGCCGGTCAAGTTCCTCGTCCATGGTTCCGTTTGGGCTCTATGACCCGACCAACAACCGCCACAACCACCGCTCCGCCCACCGACCAACCGCGACCTGCTCCCCCTCGGATGCGCCCGCCCTTGATCCGAACCACCTCCTTGACGGAGCTTGCAGAACCCACCACCCCACAGCAATCTGCCAAATCGCCGCCGCAGTTACCTCGCCTAGCCCCACTAACCCCTCAGCCGAAAAAGGAAGCGTCGCCACCGCCACCGCCCCCGCCTTTGACCCGACCTGTTGTTACGTCCACCGCAGCGCCAAAACCCAGTAGCCCCAAGGGAAAAAGTCATTCGCTTGGAGGGTCTCCGCCGCGCAATGGCTCGTTTGATAAAGAAAGTAATAAACATCCCACCCCAGCTCCGTCTCCTTCTATTCCCAAGTCTATTCCGGCCGTTCCGTCGCCTTATCAGTCGCCGAAGCCGAAGCCTAAAGCCACCGCTTCGCCCCCGTCGCCGCTTGTTTTACCACCGCCGCAGTTGCAGTTTGTCGCCGAGCCTAGACCTGAGACCATTCCTCAAGAGCCTTCACCGCCCGATCACGATATCAATAACTACAATTCCCAAGCATCTGGATTTGGCAAAAATGGGAAGAAACATGATGACAATCAAGACAAGGGAAATGGAGCCAAGAAACCGTTCGTGTCGTCCGGTGATAAAGACTCGAGCACAAGAGTTATAACAATGGCAGGAGAAAACAGAGGAGCTTCCATGGAAGTAGTTCTCTCAGACAAAAACAACAACAGCCGCCATCTTCAAACAAATCAAAGCGACAAGGAAGAGACCACCAACGATTCTAAAGACAACAAAAGTAACAAGAAATCAACGGCGGGTCGTGGTGCATGGCCAATGAAAGCTTTCTTTAACAGCAATGTTCAAGGGATTAACAATTCAATTCTGATGGACTCCAAATTCACCCACCACGACCCTGGGATTCATCTTGTCTTCTCCAAGATGCCGCCGCCATCTGCCCATCAGGAAGACGGCGGACAATAA

Coding sequence (CDS)

ATGGCCGGTCAAGTTCCTCGTCCATGGTTCCGTTTGGGCTCTATGACCCGACCAACAACCGCCACAACCACCGCTCCGCCCACCGACCAACCGCGACCTGCTCCCCCTCGGATGCGCCCGCCCTTGATCCGAACCACCTCCTTGACGGAGCTTGCAGAACCCACCACCCCACAGCAATCTGCCAAATCGCCGCCGCAGTTACCTCGCCTAGCCCCACTAACCCCTCAGCCGAAAAAGGAAGCGTCGCCACCGCCACCGCCCCCGCCTTTGACCCGACCTGTTGTTACGTCCACCGCAGCGCCAAAACCCAGTAGCCCCAAGGGAAAAAGTCATTCGCTTGGAGGGTCTCCGCCGCGCAATGGCTCGTTTGATAAAGAAAGTAATAAACATCCCACCCCAGCTCCGTCTCCTTCTATTCCCAAGTCTATTCCGGCCGTTCCGTCGCCTTATCAGTCGCCGAAGCCGAAGCCTAAAGCCACCGCTTCGCCCCCGTCGCCGCTTGTTTTACCACCGCCGCAGTTGCAGTTTGTCGCCGAGCCTAGACCTGAGACCATTCCTCAAGAGCCTTCACCGCCCGATCACGATATCAATAACTACAATTCCCAAGCATCTGGATTTGGCAAAAATGGGAAGAAACATGATGACAATCAAGACAAGGGAAATGGAGCCAAGAAACCGTTCGTGTCGTCCGGTGATAAAGACTCGAGCACAAGAGTTATAACAATGGCAGGAGAAAACAGAGGAGCTTCCATGGAAGTAGTTCTCTCAGACAAAAACAACAACAGCCGCCATCTTCAAACAAATCAAAGCGACAAGGAAGAGACCACCAACGATTCTAAAGACAACAAAAGTAACAAGAAATCAACGGCGGGTCGTGGTGCATGGCCAATGAAAGCTTTCTTTAACAGCAATGTTCAAGGGATTAACAATTCAATTCTGATGGACTCCAAATTCACCCACCACGACCCTGGGATTCATCTTGTCTTCTCCAAGATGCCGCCGCCATCTGCCCATCAGGAAGACGGCGGACAATAA

Protein sequence

MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQSAKSPPQLPRLAPLTPQPKKEASPPPPPPPLTRPVVTSTAAPKPSSPKGKSHSLGGSPPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLPPPQLQFVAEPRPETIPQEPSPPDHDINNYNSQASGFGKNGKKHDDNQDKGNGAKKPFVSSGDKDSSTRVITMAGENRGASMEVVLSDKNNNSRHLQTNQSDKEETTNDSKDNKSNKKSTAGRGAWPMKAFFNSNVQGINNSILMDSKFTHHDPGIHLVFSKMPPPSAHQEDGGQ
BLAST of Cp4.1LG17g04500.1 vs. TrEMBL
Match: A0A0A0KAP9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G107920 PE=4 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.1e-66
Identity = 198/388 (51.03%), Postives = 234/388 (60.31%), Query Frame = 1

Query: 1   MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQS 60
           MA Q+PRPWFRLGSMTRPTTA    P  +Q RP P R+RPP+IR  +LT+ AEPTTPQ+S
Sbjct: 1   MASQLPRPWFRLGSMTRPTTA----PNPEQSRPVPARVRPPIIRPAALTDPAEPTTPQRS 60

Query: 61  AKSPPQLPRLAPLTPQPKKEASP--PPPPPPLTRPVVTSTAAPK--PSSPKGKSHSLGGS 120
            KSPP   R A LTP  K   SP    P P   +   T+ A+P   PSSPK KS S+ GS
Sbjct: 61  -KSPPPFSRPASLTPPVKSSPSPLRALPSPSTGQGGGTAMASPAVIPSSPKEKSSSVVGS 120

Query: 121 PPRN------GSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLP 180
           P         GS    ++K P+P PSPSIPKSIP+VP+PYQS  PKPK   SPPSPLVLP
Sbjct: 121 PKGRSTSSVVGSPKTINSKQPSPFPSPSIPKSIPSVPTPYQS--PKPKTFVSPPSPLVLP 180

Query: 181 PPQLQFVAEPRPETIPQE--------------PSPPD----HDINNYNSQASGFGKNGKK 240
           PPQLQ VAE + ETIPQE              PS  +     +I NY +  S F KNGK+
Sbjct: 181 PPQLQSVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQ 240

Query: 241 HD------DNQDKGNGAKKPFVSSG----------DKDSSTRVITMAGENRGASMEVVLS 300
                   D  +K   +KK   + G          D D++TRVITMAGEN+GA ME+ LS
Sbjct: 241 ESNKGDDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNNTRVITMAGENKGAFMEINLS 300

Query: 301 DKNNNSRHLQTNQSDKEETTNDSKDNKSNKKSTAG---RGAWPMKAFFNSNVQGINNSIL 342
            + NNSRH Q  Q     T    KD+  +   T G   R   PM+AFFNSNVQGINNSIL
Sbjct: 301 SEKNNSRHQQQQQIQDNNTVVSVKDSNKSINKTKGIKTRNVLPMRAFFNSNVQGINNSIL 360

BLAST of Cp4.1LG17g04500.1 vs. TrEMBL
Match: A0A059CJ89_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D02643 PE=4 SV=1)

HSP 1 Score: 104.8 bits (260), Expect = 2.2e-19
Identity = 136/403 (33.75%), Postives = 184/403 (45.66%), Query Frame = 1

Query: 1   MAGQVP--RPWFRLGSMTRPTT------------ATTTAPPTDQPRPA------------ 60
           MA Q P  RPW R+ S+ RP              A     P  QPRP             
Sbjct: 1   MASQPPPSRPWLRMTSIARPVAPPPPPAPQPPPQAPPPPAPAPQPRPTISLPTFRPSAPP 60

Query: 61  --PPRMRPPLIRTTSLTELAEPTTP-QQSAKSPPQLPRL-APLTPQPKKEASPPPPPPPL 120
             PPR RPP + +       EP TP +Q  + PP +  + AP +P P   A  P  PP +
Sbjct: 61  PPPPRPRPPSLPS-------EPRTPPRQPVQPPPDVKTIVAPTSPLPASPAKLPSAPPAV 120

Query: 121 TRPVVTSTAAPKPSSPKGKSHSLGGSPPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPY 180
           T P        K   P   + S   SPP+  + +        P P PS+    P   +P 
Sbjct: 121 TSPPA------KAQLPPPAARSPVPSPPKPATSEPPI---AMPVPVPSLKVVKPGAQTPP 180

Query: 181 QSPKPKPKATASPPSPLVLPPPQLQFVAEPR------------PETIPQEPSPPDHDINN 240
           QSP  KP   A PPSPLVLPPPQL+   +PR             ET+P  P  P   + +
Sbjct: 181 QSPSMKP--VAPPPSPLVLPPPQLRADDQPRFPPEAEQKTVLVQETVPVPP--PSSRLPS 240

Query: 241 YNSQ-------ASGFGKNGKKHDDNQDKGNGAKKPFVSSGDKDSSTRVITMAGENRGASM 300
            N +        +G  ++ K  D  + +G   KK  + SG+     +VIT+AGEN+GA+M
Sbjct: 241 LNGRFPSDEIPKAGIARDTKNGDAERHRGPHPKK-HLDSGEVPGM-KVITIAGENKGATM 300

Query: 301 EVVLSDKNNNSRHLQ---TNQSDKEETTN----------DSKDNKSNKKSTAGRGAW--- 338
           E+  S K+    HL+   T++S+ +E+ N          D  D K  KK  + +  +   
Sbjct: 301 ELTRSPKHPPPHHLKASPTSKSNGQESRNSTLPGSSSSSDEGDGKMKKKDKSHQWKFFTS 360

BLAST of Cp4.1LG17g04500.1 vs. TrEMBL
Match: V4LHP3_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10001731mg PE=4 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 4.9e-19
Identity = 127/376 (33.78%), Postives = 172/376 (45.74%), Query Frame = 1

Query: 7   RPWFRLGSMTRPTT-ATTTAPPTDQPRPAPPR---MRPPLIRTT--------SLTELAEP 66
           RPWFRL S+ RPT+  ++  PP  QPRP P R   +RPP+ + +        S     +P
Sbjct: 5   RPWFRLSSIARPTSQGSSDPPPPPQPRPTPRRQVTVRPPVKQPSPPRQQQPPSPPRQQQP 64

Query: 67  TTPQQSAKSPPQLPRLAPLTPQPKKEASP--PPPPPPLTRPVVTSTAAPKPSSPKGKSHS 126
            +P +  + PP  PR    TP P +E SP   PP   ++ P     A+P P  P+     
Sbjct: 65  PSPPRQQQ-PPSPPRHQ--TPPPPQERSPYHSPPSRHMSPPTPPKAASPSPPPPQ----- 124

Query: 127 LGGSPPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPK----PKATASPP--SP 186
               PPR+      S K    A  P  P S P+     +S K +    P+  ASP   SP
Sbjct: 125 ----PPRSSYMPPPSPKEVQEALPPRKPTSPPSPAHSTRSMKSESSESPRKAASPRVLSP 184

Query: 187 LVLPPPQLQFVAEPRPETI------PQEPSPPDHDINNYNSQASGFGKNGKKHDDNQDKG 246
             LPP QL    E   + I       Q   P  H+  NYN        N  ++ ++  +G
Sbjct: 185 YSLPPSQLHSERETTQKIILTAEKTSQLYEPNHHENQNYNH-------NHNQNQNHNHQG 244

Query: 247 NGAKKPF-----VSSGDKDSSTRVITMAGENRGASMEVVLSDKNNNS------------- 306
           N  KK        S  +    TRVIT+AGEN+GA ME++ S  NN +             
Sbjct: 245 NNTKKTHHQQSSFSDSENIMGTRVITIAGENKGAVMEILRSPSNNKTGGTGPHSSRVLNG 304

Query: 307 -----RHLQTNQSDKEETTNDSKDNKSNKKSTAGRGAWPMKAFFNSNVQGINNSILMDSK 334
                R LQ++ S   +     K    N+ +T+     PMKAF NSNVQ INNSI+ +S 
Sbjct: 305 AGEKGRRLQSSSSSSSDEGEGKKKTTKNRNNTSNNSNLPMKAFMNSNVQMINNSIVYNST 361

BLAST of Cp4.1LG17g04500.1 vs. TrEMBL
Match: D7LF69_ARALL (Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_904132 PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.4e-18
Identity = 129/381 (33.86%), Postives = 177/381 (46.46%), Query Frame = 1

Query: 7   RPWFRLGSMTRPTTATTTAPPTDQPRPAPPR---MRPPLIRTTSLTELA----------- 66
           RPWFRL S+ RPT+  ++ PP  QPRP P R   +RPP  + +   +             
Sbjct: 5   RPWFRLSSIARPTSQGSSEPPPPQPRPTPRRTVVVRPPAKQPSPPRQRQPPSPPRQQQPP 64

Query: 67  -------EPTTP---QQSAKSPPQ--LPRLAPLT-----PQPKKEASPPPPPPPLTRPVV 126
                  +P TP   QQ   SPPQ   P  +P +     P P K A+PPPPPP       
Sbjct: 65  SPPRQQQQPLTPPRQQQQPTSPPQERSPYHSPPSRHMSPPTPPKAATPPPPPP------- 124

Query: 127 TSTAAPKPSSPKGKSHSLGGSPPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKP 186
            S+  P PS PK    +L   PPR  +  + S  H + + S    K+  A  S      P
Sbjct: 125 RSSYTPPPS-PKEVQEAL---PPRKPNSPR-SPAHSSRSTSSESVKTRSASESENHRKAP 184

Query: 187 KPKATASPPSPLVLPPPQLQFVAEPRPETI-PQEPSPPDHDINNYNSQASGFGKNGKKHD 246
            P+      SP  LPP QL    E   + I   E +   H+ +++N Q   + +N   + 
Sbjct: 185 SPRVL----SPYSLPPSQLHSERETTQKNILTAEKTSQTHEPSHHN-QNHNYNQNHNYNQ 244

Query: 247 DNQDKGNGAKK----PFVSSGDKDSSTRVITMAGENRGASMEVVLSDKNNNS-------- 306
           ++  +GN  KK    P  S  +   STRVIT+AGEN+GA ME++ S + N +        
Sbjct: 245 NHNHQGNNPKKMHRQPSTSDSENIMSTRVITIAGENKGAVMEILRSPQGNKTGGSGTHSS 304

Query: 307 ----------RHLQTNQSDKEETTNDSKDNKSNKKSTAGRGAWPMKAFFNSNVQGINNSI 334
                     R LQ++ S   +     K  K+ K    G    PMKAF NSNVQ INNSI
Sbjct: 305 RVSHGTGEKGRRLQSSSSSSSDEGEGKK--KTTKNPNNGNSNLPMKAFMNSNVQMINNSI 364

BLAST of Cp4.1LG17g04500.1 vs. TrEMBL
Match: B9I919_POPTR (Uncharacterized protein (Fragment) OS=Populus trichocarpa GN=POPTR_0014s098101g PE=4 SV=2)

HSP 1 Score: 100.5 bits (249), Expect = 4.1e-18
Identity = 93/243 (38.27%), Postives = 130/243 (53.50%), Query Frame = 1

Query: 133 PAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLPP----------PQLQFVAEPRP 192
           P P+PS+    P V +P QSPKPKP  TA PPSPL  PP          P++  VAE + 
Sbjct: 25  PGPTPSLRIIKPTVQTPPQSPKPKP--TAPPPSPLTRPPSRVKSDADLEPKIPLVAEQKT 84

Query: 193 ---ETIPQEPSPPDHDINNY-NSQASGFGKNGKKH---DDNQDKGNGAKKPFVSSGDKDS 252
              + I  +P      +  + +S +SG  +  K     D  ++KG+G K   +SS  +D 
Sbjct: 85  VLVQKIIDKPKEAGDSLRAFADSLSSGIARLAKPETAKDQTKEKGSGKK---ISSDSEDV 144

Query: 253 STRVITMAGENRGASMEVVLSDKNN----NSRHLQTNQSDKEE-------TTNDSKDNKS 312
             RVIT+AGEN+GA MEV+ S K +    NS  L    + + E       +++  + N  
Sbjct: 145 GMRVITIAGENKGAFMEVIRSPKKHFFEGNSHTLNKKGNPRSEGSDWGSQSSSGEEGNSK 204

Query: 313 NKKSTAGR--GAWPMKAFFNSNVQGINNSILMDSKFTHHDPGIHLVFSKMPPPSA--HQE 344
             K+  GR  G  PM AF NSNVQG+NNSI+ +S  +HHDPG+H+  S+ P  SA  H +
Sbjct: 205 KDKNHKGRSMGPSPMSAFMNSNVQGVNNSIVYNSSCSHHDPGVHVALSRKPSGSAGFHVK 262

BLAST of Cp4.1LG17g04500.1 vs. TAIR10
Match: AT2G46630.1 (AT2G46630.1 unknown protein)

HSP 1 Score: 93.6 bits (231), Expect = 2.6e-19
Identity = 128/388 (32.99%), Postives = 170/388 (43.81%), Query Frame = 1

Query: 7   RPWFRLGSMTRPTTATTTAPPTDQPR---------------PAPPRMR------------ 66
           RPWFRL S+ RPT   ++ PP  QPR               P+PPR R            
Sbjct: 5   RPWFRLSSIARPTAQGSSDPPPPQPRPTSRSSLVVRPPAKQPSPPRQRQPRSPPRQQDPP 64

Query: 67  -PPLIRTTSLT---ELAEPTTPQQSAKSPPQLPRLAPLTPQPKKEASPPPPPPPLTRPVV 126
            PP  +   LT   + A PT+P Q        P      P P K A+PPPPPP   R   
Sbjct: 65  SPPRQQQQPLTPPRQKAPPTSPPQERSPYHSPPSRHMSPPTPPKAATPPPPPP---RSSY 124

Query: 127 TSTAAPKPSSPKGKSHSLGGSPPRNGSFDKESNKHPTPAPSP-SIPKSIPAVPSPYQSPK 186
           TS     P SPK    +L   PPR      + N  P+PA S  S         SP +S  
Sbjct: 125 TS-----PPSPKEVQEAL---PPR------KPNSPPSPAHSSRSTTSESVKTRSPSESEN 184

Query: 187 PKPKATASPPSPLVLPPPQLQFVAEPRPETI-PQEPSPPDHDINNYN-------SQASGF 246
            +   +    SP  LP   L    E   + I   E +   H+ N++N       +Q   +
Sbjct: 185 HRKAPSPRVLSPYSLPASLLHSERETTQKNILTAEKTSQTHETNHHNQNHNHDYNQNHNY 244

Query: 247 GKNGKKHDDNQDKGNGAKK----PFVSSGDKDSSTRVITMAGENRGASMEVVLSDKNNNS 306
            +N   + +   +GN  KK    P  S  +   STRVIT+AGEN+GA ME++ S + N +
Sbjct: 245 NQNHSYNQNQNHQGNNPKKMHRQPSSSDSENIMSTRVITIAGENKGAVMEILRSPQGNKT 304

Query: 307 RHLQTNQS-----------DKEETTNDSKDNKSNKKSTA------GRGAWPMKAFFNSNV 334
               T+ S             + +++ S D    KK T       G    PMKAF NSNV
Sbjct: 305 GGSGTHSSRVSHGTGEKGRRLQSSSSSSSDEGEGKKKTTKNVPNKGNSNLPMKAFMNSNV 364

BLAST of Cp4.1LG17g04500.1 vs. TAIR10
Match: AT1G75260.1 (AT1G75260.1 oxidoreductases, acting on NADH or NADPH)

HSP 1 Score: 57.4 bits (137), Expect = 2.0e-08
Identity = 37/103 (35.92%), Postives = 59/103 (57.28%), Query Frame = 1

Query: 229 SSGDKDSSTRVITMAGENRGASMEVVLS-DKNNNSRHLQTN-QSDKEETTNDSKDNKSNK 288
           S+GD D S  V T+ GEN+GA+M +    DK +   H++   +S+ +E++N +     N 
Sbjct: 336 SNGD-DKSVSVYTLTGENKGATMGIGSEKDKKDGEVHIRRGYRSNPDESSNTTATETENP 395

Query: 289 KSTAGRGAWPMKAFFNSNVQGINNSILMDSKFTHHDPGIHLVF 330
           K           A+ N N QGINNSI+++S  + +DPG+H+ F
Sbjct: 396 KDDEAEEEASFTAYINGNTQGINNSIVVESSVSENDPGVHMSF 437

BLAST of Cp4.1LG17g04500.1 vs. NCBI nr
Match: gi|659119611|ref|XP_008459747.1| (PREDICTED: zyxin isoform X2 [Cucumis melo])

HSP 1 Score: 269.6 bits (688), Expect = 7.4e-69
Identity = 198/382 (51.83%), Postives = 230/382 (60.21%), Query Frame = 1

Query: 1   MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQS 60
           MA Q+PRPWFRLGSMTRP    TT P  +QPRP P R+RPP+IR  +LT+ AEPTTPQ+S
Sbjct: 1   MASQLPRPWFRLGSMTRP----TTTPNPEQPRPVPARVRPPIIRPAALTDPAEPTTPQRS 60

Query: 61  AKSPPQLPRLAPLTPQPKKEASP----PPPPPPLTRPVVTSTAAPKPSSPKGKSHSLGGS 120
            KSPP LPR   LTP  K  ASP    P P          ++ A  P+SPK KS S+ GS
Sbjct: 61  -KSPPPLPRPVTLTPPVKNSASPLRVLPSPSSGQGGGTAMASPAVNPNSPKEKSSSVVGS 120

Query: 121 PPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLPPPQLQF 180
           P    S      K P+P PSPSIPKSIP+VP+PYQS  PKPK   SPPSPLVLPPPQLQ 
Sbjct: 121 PKTINS------KQPSPFPSPSIPKSIPSVPTPYQS--PKPKTVVSPPSPLVLPPPQLQS 180

Query: 181 VAEPRPETIPQE--------------PSPPDH----DINNYNSQASG-FGKNGKKHDDNQ 240
           VAE + ETIPQE              PS  +     +I NY +  SG F KNGK+  +  
Sbjct: 181 VAETKHETIPQEVERKTVVFQKVMDKPSQEEEHHHLNITNYRTHTSGLFDKNGKQESNKG 240

Query: 241 DKGN--------------GAKKPFVSSG-DKDSSTRVITMAGENRGASMEVVLSDKNNNS 300
           D G+              G    F  +  D D+  RVITMAGEN+GA ME+ LS   NNS
Sbjct: 241 DDGDHVENGVSKNKGTIFGGNSNFKRTAFDHDNHARVITMAGENKGAFMEINLSSDKNNS 300

Query: 301 RHLQTNQSDKEETTNDSKDNKSNKKSTAG---RGAWPMKAFFNSNVQGINNSILMDSKFT 342
           RH Q  Q+    T  D KDN  +K  T G   R   PM+AFFNSNVQGINNSILMDSKF+
Sbjct: 301 RHQQQQQNQDNNTVVDLKDNNKSKNKTKGSNTRNVLPMRAFFNSNVQGINNSILMDSKFS 360

BLAST of Cp4.1LG17g04500.1 vs. NCBI nr
Match: gi|659119609|ref|XP_008459746.1| (PREDICTED: uncharacterized protein DDB_G0284459 isoform X1 [Cucumis melo])

HSP 1 Score: 269.6 bits (688), Expect = 7.4e-69
Identity = 199/388 (51.29%), Postives = 233/388 (60.05%), Query Frame = 1

Query: 1   MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQS 60
           MA Q+PRPWFRLGSMTRPTT     P  +QPRP P R+RPP+IR  +LT+ AEPTTPQ+S
Sbjct: 1   MASQLPRPWFRLGSMTRPTTT----PNPEQPRPVPARVRPPIIRPAALTDPAEPTTPQRS 60

Query: 61  AKSPPQLPRLAPLTPQPKKEASP----PPPPPPLTRPVVTSTAAPKPSSPKGKSHSLGGS 120
            KSPP LPR   LTP  K  ASP    P P          ++ A  P+SPK KS S+ GS
Sbjct: 61  -KSPPPLPRPVTLTPPVKNSASPLRVLPSPSSGQGGGTAMASPAVNPNSPKEKSSSVVGS 120

Query: 121 PPRN------GSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLP 180
           P         GS    ++K P+P PSPSIPKSIP+VP+PYQS  PKPK   SPPSPLVLP
Sbjct: 121 PKGRSNSSVVGSPKTINSKQPSPFPSPSIPKSIPSVPTPYQS--PKPKTVVSPPSPLVLP 180

Query: 181 PPQLQFVAEPRPETIPQE--------------PSPPDH----DINNYNSQASG-FGKNGK 240
           PPQLQ VAE + ETIPQE              PS  +     +I NY +  SG F KNGK
Sbjct: 181 PPQLQSVAETKHETIPQEVERKTVVFQKVMDKPSQEEEHHHLNITNYRTHTSGLFDKNGK 240

Query: 241 KHDDNQDKGN--------------GAKKPFVSSG-DKDSSTRVITMAGENRGASMEVVLS 300
           +  +  D G+              G    F  +  D D+  RVITMAGEN+GA ME+ LS
Sbjct: 241 QESNKGDDGDHVENGVSKNKGTIFGGNSNFKRTAFDHDNHARVITMAGENKGAFMEINLS 300

Query: 301 DKNNNSRHLQTNQSDKEETTNDSKDNKSNKKSTAG---RGAWPMKAFFNSNVQGINNSIL 342
              NNSRH Q  Q+    T  D KDN  +K  T G   R   PM+AFFNSNVQGINNSIL
Sbjct: 301 SDKNNSRHQQQQQNQDNNTVVDLKDNNKSKNKTKGSNTRNVLPMRAFFNSNVQGINNSIL 360

BLAST of Cp4.1LG17g04500.1 vs. NCBI nr
Match: gi|778712181|ref|XP_011656858.1| (PREDICTED: probable serine/threonine-protein kinase samkC isoform X2 [Cucumis sativus])

HSP 1 Score: 261.9 bits (668), Expect = 1.5e-66
Identity = 197/382 (51.57%), Postives = 231/382 (60.47%), Query Frame = 1

Query: 1   MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQS 60
           MA Q+PRPWFRLGSMTRP    TTAP  +Q RP P R+RPP+IR  +LT+ AEPTTPQ+S
Sbjct: 1   MASQLPRPWFRLGSMTRP----TTAPNPEQSRPVPARVRPPIIRPAALTDPAEPTTPQRS 60

Query: 61  AKSPPQLPRLAPLTPQPKKEASP--PPPPPPLTRPVVTSTAAPK--PSSPKGKSHSLGGS 120
            KSPP   R A LTP  K   SP    P P   +   T+ A+P   PSSPK KS S+ GS
Sbjct: 61  -KSPPPFSRPASLTPPVKSSPSPLRALPSPSTGQGGGTAMASPAVIPSSPKEKSSSVVGS 120

Query: 121 PPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLPPPQLQF 180
           P    S      K P+P PSPSIPKSIP+VP+PYQS  PKPK   SPPSPLVLPPPQLQ 
Sbjct: 121 PKTINS------KQPSPFPSPSIPKSIPSVPTPYQS--PKPKTFVSPPSPLVLPPPQLQS 180

Query: 181 VAEPRPETIPQE--------------PSPPD----HDINNYNSQASGFGKNGKKHD---- 240
           VAE + ETIPQE              PS  +     +I NY +  S F KNGK+      
Sbjct: 181 VAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQESNKGD 240

Query: 241 --DNQDKGNGAKKPFVSSG----------DKDSSTRVITMAGENRGASMEVVLSDKNNNS 300
             D  +K   +KK   + G          D D++TRVITMAGEN+GA ME+ LS + NNS
Sbjct: 241 DGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNNTRVITMAGENKGAFMEINLSSEKNNS 300

Query: 301 RHLQTNQSDKEETTNDSKDNKSNKKSTAG---RGAWPMKAFFNSNVQGINNSILMDSKFT 342
           RH Q  Q     T    KD+  +   T G   R   PM+AFFNSNVQGINNSILMDSKF+
Sbjct: 301 RHQQQQQIQDNNTVVSVKDSNKSINKTKGIKTRNVLPMRAFFNSNVQGINNSILMDSKFS 360

BLAST of Cp4.1LG17g04500.1 vs. NCBI nr
Match: gi|778712178|ref|XP_004140616.2| (PREDICTED: probable serine/threonine-protein kinase samkC isoform X1 [Cucumis sativus])

HSP 1 Score: 261.9 bits (668), Expect = 1.5e-66
Identity = 198/388 (51.03%), Postives = 234/388 (60.31%), Query Frame = 1

Query: 1   MAGQVPRPWFRLGSMTRPTTATTTAPPTDQPRPAPPRMRPPLIRTTSLTELAEPTTPQQS 60
           MA Q+PRPWFRLGSMTRPTTA    P  +Q RP P R+RPP+IR  +LT+ AEPTTPQ+S
Sbjct: 1   MASQLPRPWFRLGSMTRPTTA----PNPEQSRPVPARVRPPIIRPAALTDPAEPTTPQRS 60

Query: 61  AKSPPQLPRLAPLTPQPKKEASP--PPPPPPLTRPVVTSTAAPK--PSSPKGKSHSLGGS 120
            KSPP   R A LTP  K   SP    P P   +   T+ A+P   PSSPK KS S+ GS
Sbjct: 61  -KSPPPFSRPASLTPPVKSSPSPLRALPSPSTGQGGGTAMASPAVIPSSPKEKSSSVVGS 120

Query: 121 PPRN------GSFDKESNKHPTPAPSPSIPKSIPAVPSPYQSPKPKPKATASPPSPLVLP 180
           P         GS    ++K P+P PSPSIPKSIP+VP+PYQS  PKPK   SPPSPLVLP
Sbjct: 121 PKGRSTSSVVGSPKTINSKQPSPFPSPSIPKSIPSVPTPYQS--PKPKTFVSPPSPLVLP 180

Query: 181 PPQLQFVAEPRPETIPQE--------------PSPPD----HDINNYNSQASGFGKNGKK 240
           PPQLQ VAE + ETIPQE              PS  +     +I NY +  S F KNGK+
Sbjct: 181 PPQLQSVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQ 240

Query: 241 HD------DNQDKGNGAKKPFVSSG----------DKDSSTRVITMAGENRGASMEVVLS 300
                   D  +K   +KK   + G          D D++TRVITMAGEN+GA ME+ LS
Sbjct: 241 ESNKGDDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNNTRVITMAGENKGAFMEINLS 300

Query: 301 DKNNNSRHLQTNQSDKEETTNDSKDNKSNKKSTAG---RGAWPMKAFFNSNVQGINNSIL 342
            + NNSRH Q  Q     T    KD+  +   T G   R   PM+AFFNSNVQGINNSIL
Sbjct: 301 SEKNNSRHQQQQQIQDNNTVVSVKDSNKSINKTKGIKTRNVLPMRAFFNSNVQGINNSIL 360

BLAST of Cp4.1LG17g04500.1 vs. NCBI nr
Match: gi|629113543|gb|KCW78503.1| (hypothetical protein EUGRSUZ_D02643 [Eucalyptus grandis])

HSP 1 Score: 104.8 bits (260), Expect = 3.2e-19
Identity = 136/403 (33.75%), Postives = 184/403 (45.66%), Query Frame = 1

Query: 1   MAGQVP--RPWFRLGSMTRPTT------------ATTTAPPTDQPRPA------------ 60
           MA Q P  RPW R+ S+ RP              A     P  QPRP             
Sbjct: 1   MASQPPPSRPWLRMTSIARPVAPPPPPAPQPPPQAPPPPAPAPQPRPTISLPTFRPSAPP 60

Query: 61  --PPRMRPPLIRTTSLTELAEPTTP-QQSAKSPPQLPRL-APLTPQPKKEASPPPPPPPL 120
             PPR RPP + +       EP TP +Q  + PP +  + AP +P P   A  P  PP +
Sbjct: 61  PPPPRPRPPSLPS-------EPRTPPRQPVQPPPDVKTIVAPTSPLPASPAKLPSAPPAV 120

Query: 121 TRPVVTSTAAPKPSSPKGKSHSLGGSPPRNGSFDKESNKHPTPAPSPSIPKSIPAVPSPY 180
           T P        K   P   + S   SPP+  + +        P P PS+    P   +P 
Sbjct: 121 TSPPA------KAQLPPPAARSPVPSPPKPATSEPPI---AMPVPVPSLKVVKPGAQTPP 180

Query: 181 QSPKPKPKATASPPSPLVLPPPQLQFVAEPR------------PETIPQEPSPPDHDINN 240
           QSP  KP   A PPSPLVLPPPQL+   +PR             ET+P  P  P   + +
Sbjct: 181 QSPSMKP--VAPPPSPLVLPPPQLRADDQPRFPPEAEQKTVLVQETVPVPP--PSSRLPS 240

Query: 241 YNSQ-------ASGFGKNGKKHDDNQDKGNGAKKPFVSSGDKDSSTRVITMAGENRGASM 300
            N +        +G  ++ K  D  + +G   KK  + SG+     +VIT+AGEN+GA+M
Sbjct: 241 LNGRFPSDEIPKAGIARDTKNGDAERHRGPHPKK-HLDSGEVPGM-KVITIAGENKGATM 300

Query: 301 EVVLSDKNNNSRHLQ---TNQSDKEETTN----------DSKDNKSNKKSTAGRGAW--- 338
           E+  S K+    HL+   T++S+ +E+ N          D  D K  KK  + +  +   
Sbjct: 301 ELTRSPKHPPPHHLKASPTSKSNGQESRNSTLPGSSSSSDEGDGKMKKKDKSHQWKFFTS 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KAP9_CUCSA1.1e-6651.03Uncharacterized protein OS=Cucumis sativus GN=Csa_6G107920 PE=4 SV=1[more]
A0A059CJ89_EUCGR2.2e-1933.75Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D02643 PE=4 SV=1[more]
V4LHP3_EUTSA4.9e-1933.78Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10001731mg PE=4 SV=1[more]
D7LF69_ARALL1.4e-1833.86Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA... [more]
B9I919_POPTR4.1e-1838.27Uncharacterized protein (Fragment) OS=Populus trichocarpa GN=POPTR_0014s098101g ... [more]
Match NameE-valueIdentityDescription
AT2G46630.12.6e-1932.99 unknown protein[more]
AT1G75260.12.0e-0835.92 oxidoreductases, acting on NADH or NADPH[more]
Match NameE-valueIdentityDescription
gi|659119611|ref|XP_008459747.1|7.4e-6951.83PREDICTED: zyxin isoform X2 [Cucumis melo][more]
gi|659119609|ref|XP_008459746.1|7.4e-6951.29PREDICTED: uncharacterized protein DDB_G0284459 isoform X1 [Cucumis melo][more]
gi|778712181|ref|XP_011656858.1|1.5e-6651.57PREDICTED: probable serine/threonine-protein kinase samkC isoform X2 [Cucumis sa... [more]
gi|778712178|ref|XP_004140616.2|1.5e-6651.03PREDICTED: probable serine/threonine-protein kinase samkC isoform X1 [Cucumis sa... [more]
gi|629113543|gb|KCW78503.1|3.2e-1933.75hypothetical protein EUGRSUZ_D02643 [Eucalyptus grandis][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG17g04500Cp4.1LG17g04500gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG17g04500.1:five_prime_utr:001Cp4.1LG17g04500.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG17g04500.1:cds:002Cp4.1LG17g04500.1:cds:002CDS
Cp4.1LG17g04500.1:cds:001Cp4.1LG17g04500.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG17g04500.1Cp4.1LG17g04500.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33472FAMILY NOT NAMEDcoord: 17..333
score: 8.2
NoneNo IPR availablePANTHERPTHR33472:SF1EXTENSIN-RELATEDcoord: 17..333
score: 8.2