Cp4.1LG05g10490.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g10490.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 1, putative isoform 1
Location: Cp4.1LG05 : 6949845 .. 6953833 (-)
Sequence length: 1425
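The Location field combines a scaffold name, start and end coordinates, and a strand. The following is a minimal sketch of how such a string could be parsed, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state explicitly). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SCAFFOLD : START .. END (STRAND)' into a dict."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

loc = parse_location("Cp4.1LG05 : 6949845 .. 6953833 (-)")
print(loc["scaffold"], loc["strand"], loc["span"])
```

Under that assumption the interval spans 3989 bp on the minus strand, which is longer than the 1425 listed as the sequence length, as would be expected if the genomic interval includes introns.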
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AATGTTATTCCCAAAAAGAAAAAAGAGCGAGAAACCCCAATAGTCAATCAAAATTGTATTTGGATCCTCCGATCATTGCAGAGCAAAGCTCATTCGGCGGATTTAAAGACGAAGGCGTCTCTTGAGAGTTCGAGAATCGAAAATGGAGATTGGAGATTTTGCAGCCATTTTTGGGGAACCCAAGGTGGAGTGGATAAACAGGGGTTCGCTTACTTCGCACCCATTTTTGTTCCATGTTCATACTCCTAATCCCTCGCATCTTAGATTTTGTGTTACTGATTTCCATTCAAACACTTGGGAATCCACCAAATCGATTCTCCAGCTTCACGACATGGTGGGCTTCTTTAATTCTCTCTGTACGTTCTCTCTTTCTTACTTCCTTCTTTGATTTCTACTCATTTTTTGTTGTGGTTTCTTAGAATTGATTGGTGATCAGAACCCTAGTTTATGTTAATTTGTGTTTGTTCAGAGGGATGAGATTGGAATCGGAGGGGCTATGTCAGAATTTGTTGATTATATCATTACTTCCCTGAAATTTGGAGATGTAAGGCTTAGGTTGGAAGAACAATCGGTGGATGATGGTACTCTTCTTAGAATTATGATTGCTTCAAGTATTCTTAACCATTCAAATATGTTTTGAAAGTTGAAACCCCTTTTTTTCCATTTTGTTTTGCTTCGTCAGGTGCTGCATTTGCCAAATTATTTGCTCAGAAATCTAAAGGAATGCCTGTTTTTTCCCTTTCTCTCACGAAACTCTTTGATTGTGCTGCTTCCGAAGCTATAGCTACTCTGTCCTTGGGGCTCTTTAACTCATTAAAAGCCAAGGAATGTTCACTTCTAAAAGGTTTTCATCGCTTATCTATAGACTGCCTATACTATAATCATATATGTATATATAACAATGAATTTCTTTAATGTGATGGTATCTGGAAAAAAGTTAGAAGGTTGTTATAAAAATATTGTGAGATCCCACGTTAGATGGGGAGAAGAACGATGAATTCTTTATAAGGGTGTGGAAACCTCTCCCTAACATACATGCTTTCTAAAAACCTTGAGGGGAAGTCCGAAAGGGAAAGCTCAAAGAGAAGAATACCTGCTAGTGGTGGGCTTGAGCTGTTACAAATGGTATTAGGGCCAGACATGAGGCGATATGCCAGCGAGGAGGCTAAGTACCGAAGGGGGTGGACACGAGGCGGTGTGCCAGTAAGGGTGCCGGGCCTCGAAAGGGGGTGCATTAGGGGGTCTCACAAGAAGGGAACGAGTGTCAGCGAGGACGCTGGGCTCTGAAGGGGGGTGGATTGTGAGATCCCACCGATTGGGGAGGGGAACGGAACATTCTTTATAAGGGTAGGCTTGGGCTGTTACAAATATATGTTTAAGCTTTTGGTATCCGACTTGCATTACCCTCAAAACTTGTCACAACGTTCTTGAAAATAGTTTGGTATTTGAATCTCATTTTTTTGACTTTCATTGGTGATATTGAACTGGCTCAAATGCTACCATCAGAGAATTTCTTGGAAGACATTTAACTTGGAGATTACATTGTTTATATTCATATTCTACACCAAACTATATAGTTTACACTGTTCTGTTTCTTCTTTGTTTTTCCTTTTTGGTTATGAATCTGTATCTCTCACTTGTCTTGATAACTTTGGTAACATAGTTTGATGTCATTGACTTTCCTTTTGTACTAATTTTTTAGAACAAGAGCGCTCCCTTCAGTTGACGACCATGATATCAGCCGAAAAGGTATATCCATCAACTTCTCACACTGTAAATAGACATTATATCATGATGGGCACTTTTACTCTCGTAGGTACCTTTCTATTGTCTTGAAATACCAATGATCCCTCTTCAAATCATAATTGAAACGGTTTACTCTGGTCGGTATTGTTGTTGTTGCCTGTTTATTTGTTTCGATGTGTTGCGTTATTTCTCTGACTTTGCATTTGAAACTTCATTGAATGAAGAATTTGGATGCCCTTTTATCTGATTCAAGT
CAGTAGGTAATTTATAGAATTAATTCTTATTGATACCAGAGGGGTACATGCCAAGGGAGGATTTGAGAGCTCTTATCTCTTCTAAAGCTTAAAGGACGCTTAAGCCGAAAAGCTACCCCTCTACGATCTATGTATATATATTCATGAAATGACTAGAACATCAATTGTAGAGCTCCAATAATAGAGATTCTTCTCGTTTCCTGCTGCTCTTGTTCTCGTCCTACCGACATTAGTGGAGATGATGGCATTTCAACTTTTGAAGATAATGCTATAACTTAGCCATATTTCTGTTGAAAAGGCCATCATAATGTCAATATATTCCTTATCCACAACGAATTCCAAGTCGTCGGTCTCTCTATCTGGTACCGAACGTTCTTGTTTATATTTATTTTTACTTTGTTCATGTCAGGAGAAGTACGAAAGTATTCAAAGCCAACTCGGGCAATATACAAAGAAACAGAAGTTACAAAATATGAACGCCTCAAATTCTCCAGGTTTGTTCCTATGAAGAACCATCAAATTTATTTCATTATATATTTTCTACATGAACACTAATTGAATTTTCTTATGCTTCATAGACAAATCTATTGGTCATTATATTGGCTCGTCAAAAACCACCAATCGTGCAGTGCCAGCACATCGCAGGTCGGTATCTCAAAGCTACATTTATCACATAACATATTAATTTATTTCCATTAGAAGTAGTTGTATTTAGCATTGTATTTGGGCAGGGCGAAAACAAGAGGTGCCCTTTTGCAAGACTCTGAAGATGACAATGGTAGGTACTTTGAGAAGTTGAATGTCTTTGTTCCACTGTGCATAGTTTGAGTCCTAGAAAGCTGCTCTAGCCAATCACTCGCGGTGTCGAGTCCCTGCAAAAGGCTTAAAATGGGTCGAGGATTCCATGTTTAAACCGGGGTCAGGGTCGAGAGGCTCGTAGTTTTCCGAAATTGACAAACTTTTATGTCTTCCCAGGAAGATTTTTGTTTTTGTATTCTTTTCTTACAAATATATTTCTCTGTTTCTTATCGATTCTTTTTTCCATTTTTACTTTCAGAACAAGAACATTCCCTTCGATCGACATCGGAGGATAAGGTAGATAAATAAGTCGTTGATTACTGTCACGTTATGCTCTTATAGACAGGTTTTTATCTTGTAGTACCTCGAAATTAGTAGCTTCGGGTTCTTTAGAGCGTATTTTTTACTTATTAATTCTGCTCATACTTTATAAATCTTGAATTTATTTTCGAAAATGTCAGGAAAAGAAAAAAGAGTTACAAAATACAAATGCATCAGCCAATGTAGATGGGTTTCAAAAATCTCCTGGTATGCTTTTTATTAGATTCATTGATGTTGTTTGAGTGTACAAAAGGTTCTAAATTTTGAGCTTATTTTTTAACGTAATCGACTATTAAAATGTGTCGATCTTAGCAGACAAACCTGTTATTCATGATATAGGCTCGACAAAAATCACCAATCGTGTCGTGCCAGCGCATCGCAGGTCAGTTTCTACACTAAAATTGACCCGAGTTTATTCGTTATTGTTTGTCAAAACTAGTAAACATACTCTAGTTTTGTGTACTTGGGCAGGGGACGAACAAGAGGTGCCCTTCTGCAAGATGATGAAAATGACAATGACAGGTAAAACCCGACATTTGTGATGACGAATAACCAATCGTAGTTTCAATATCGAGGAGACAACGAAGTAATACCGATGAGCGATCGGTTTCATCTTTCCCAGTAAGGTTTAATTTTCCTGCCTATACCTATGACGAAAGAGCTTGTGCAACCAGATATTTGTTTCTTGTACATATTTATTTGCATTCGTACTAATCATATCGTGGAAATATCTTGATGCACGTGACATACAAAATACATGAGTAACATAAAAAAATACATGAGTTCTCTATAGCTTTAAATTATGACATTTATTTATGGTGTACATATCTCCATTCTTTTCATGACATAGCGTTCTTATGAGACATACAT

mRNA sequence

AATGTTATTCCCAAAAAGAAAAAAGAGCGAGAAACCCCAATAGTCAATCAAAATTGTATTTGGATCCTCCGATCATTGCAGAGCAAAGCTCATTCGGCGGATTTAAAGACGAAGGCGTCTCTTGAGAGTTCGAGAATCGAAAATGGAGATTGGAGATTTTGCAGCCATTTTTGGGGAACCCAAGGTGGAGTGGATAAACAGGGGTTCGCTTACTTCGCACCCATTTTTGTTCCATGTTCATACTCCTAATCCCTCGCATCTTAGATTTTGTGTTACTGATTTCCATTCAAACACTTGGGAATCCACCAAATCGATTCTCCAGCTTCACGACATGAGGGATGAGATTGGAATCGGAGGGGCTATGTCAGAATTTGTTGATTATATCATTACTTCCCTGAAATTTGGAGATGTAAGGCTTAGGTTGGAAGAACAATCGGTGGATGATGGTGCTGCATTTGCCAAATTATTTGCTCAGAAATCTAAAGGAATGCCTGTTTTTTCCCTTTCTCTCACGAAACTCTTTGATTGTGCTGCTTCCGAAGCTATAGCTACTCTGTCCTTGGGGCTCTTTAACTCATTAAAAGCCAAGGAATGTTCACTTCTAAAAGAACAAGAGCGCTCCCTTCAGTTGACGACCATGATATCAGCCGAAAAGGAGAAGTACGAAAGTATTCAAAGCCAACTCGGGCAATATACAAAGAAACAGAAGTTACAAAATATGAACGCCTCAAATTCTCCAGACAAATCTATTGGTCATTATATTGGCTCGTCAAAAACCACCAATCGTGCAGTGCCAGCACATCGCAGGGCGAAAACAAGAGGTGCCCTTTTGCAAGACTCTGAAGATGACAATGAACAAGAACATTCCCTTCGATCGACATCGGAGGATAAGGAAAAGAAAAAAGAGTTACAAAATACAAATGCATCAGCCAATGTAGATGGGTTTCAAAAATCTCCTGACAAACCTGTTATTCATGATATAGGCTCGACAAAAATCACCAATCGTGTCGTGCCAGCGCATCGCAGGGGACGAACAAGAGGTGCCCTTCTGCAAGATGATGAAAATGACAATGACAGGTAAAACCCGACATTTGTGATGACGAATAACCAATCGTAGTTTCAATATCGAGGAGACAACGAAGTAATACCGATGAGCGATCGGTTTCATCTTTCCCAGTAAGGTTTAATTTTCCTGCCTATACCTATGACGAAAGAGCTTGTGCAACCAGATATTTGTTTCTTGTACATATTTATTTGCATTCGTACTAATCATATCGTGGAAATATCTTGATGCACGTGACATACAAAATACATGAGTAACATAAAAAAATACATGAGTTCTCTATAGCTTTAAATTATGACATTTATTTATGGTGTACATATCTCCATTCTTTTCATGACATAGCGTTCTTATGAGACATACAT

Coding sequence (CDS)

ATGGAGATTGGAGATTTTGCAGCCATTTTTGGGGAACCCAAGGTGGAGTGGATAAACAGGGGTTCGCTTACTTCGCACCCATTTTTGTTCCATGTTCATACTCCTAATCCCTCGCATCTTAGATTTTGTGTTACTGATTTCCATTCAAACACTTGGGAATCCACCAAATCGATTCTCCAGCTTCACGACATGAGGGATGAGATTGGAATCGGAGGGGCTATGTCAGAATTTGTTGATTATATCATTACTTCCCTGAAATTTGGAGATGTAAGGCTTAGGTTGGAAGAACAATCGGTGGATGATGGTGCTGCATTTGCCAAATTATTTGCTCAGAAATCTAAAGGAATGCCTGTTTTTTCCCTTTCTCTCACGAAACTCTTTGATTGTGCTGCTTCCGAAGCTATAGCTACTCTGTCCTTGGGGCTCTTTAACTCATTAAAAGCCAAGGAATGTTCACTTCTAAAAGAACAAGAGCGCTCCCTTCAGTTGACGACCATGATATCAGCCGAAAAGGAGAAGTACGAAAGTATTCAAAGCCAACTCGGGCAATATACAAAGAAACAGAAGTTACAAAATATGAACGCCTCAAATTCTCCAGACAAATCTATTGGTCATTATATTGGCTCGTCAAAAACCACCAATCGTGCAGTGCCAGCACATCGCAGGGCGAAAACAAGAGGTGCCCTTTTGCAAGACTCTGAAGATGACAATGAACAAGAACATTCCCTTCGATCGACATCGGAGGATAAGGAAAAGAAAAAAGAGTTACAAAATACAAATGCATCAGCCAATGTAGATGGGTTTCAAAAATCTCCTGACAAACCTGTTATTCATGATATAGGCTCGACAAAAATCACCAATCGTGTCGTGCCAGCGCATCGCAGGGGACGAACAAGAGGTGCCCTTCTGCAAGATGATGAAAATGACAATGACAGGTAA
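The mRNA and CDS sequences above differ only by the untranslated regions that flank the CDS. As a sketch, the UTRs can be recovered by locating the CDS inside the mRNA. The helper `split_transcript` and the toy sequences are hypothetical and used for illustration only; they are not the full sequences of this record.

```python
def split_transcript(mrna, cds):
    """Return (five_prime_utr, cds, three_prime_utr) by locating cds in mrna."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]

# Toy sequences for illustration only (not the record's full sequences).
toy_mrna = "AATGTTCGAAAATGGAGATTTAAGGTTTAA"
toy_cds = "ATGGAGATTTAA"

five_utr, cds, three_utr = split_transcript(toy_mrna, toy_cds)
print(len(five_utr), len(cds), len(three_utr))
```

Searching for the whole CDS rather than the first ATG matters here: like the mRNA above, the toy transcript begins with AATG, so the first ATG lies inside the 5' UTR and is not the annotated start codon.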

Protein sequence

MEIGDFAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFSLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQEHSLRSTSEDKEKKKELQNTNASANVDGFQKSPDKPVIHDIGSTKITNRVVPAHRRGRTRGALLQDDENDNDR
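The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the function name `translate` is hypothetical. It is shown on the first and last few codons of the CDS above rather than on the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGATTGGAGATTTTGCA"))  # first seven codons of the CDS
print(translate("GACAGGTAA"))              # last two codons plus the stop codon
```

The two fragments translate to MEIGDFA and DR, matching the start and end of the protein sequence listed above.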
BLAST of Cp4.1LG05g10490.1 vs. TrEMBL
Match: A0A0A0LK82_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G335570 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 3.8e-103
Identity = 204/273 (74.73%), Positives = 228/273 (83.52%), Query Frame = 1

Query: 1   MEIGDFAAIFGEP-KVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSIL 60
           ME+ DFA IFG+P +VEW+NRGSL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  
Sbjct: 1   MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAF 60

Query: 61  QLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVF 120
           QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS  DGAA  KL AQKSKGMPVF
Sbjct: 61  QLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLIAQKSKGMPVF 120

Query: 121 SLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQS 180
           S+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMIS EKEK E+IQ+
Sbjct: 121 SISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQT 180

Query: 181 QLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQ 240
           QLGQY KKQKLQNMNASNSPDKS  H IG +K TNR VP HRRAK RGALLQDSEDDNE+
Sbjct: 181 QLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEK 240

Query: 241 EHSLRSTSEDKEKKKELQNTNASANVDGFQKSP 273
           E SL+ST E+KEKK+ L NT+  A VD   K P
Sbjct: 241 ERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP 273
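The percentages in each HSP header are the match counts divided by the alignment length. As a sketch, they can be recomputed from the raw counts; the function name `parse_hsp_stats` is hypothetical, and the regular expression assumes the header layout shown in this report.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP header and recompute percentages."""
    # "Posi?tives" also accepts the misspelling "Postives" seen in some report exports.
    m = re.search(
        r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)", line
    )
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

header = "Identity = 204/273 (74.73%), Positives = 228/273 (83.52%), Query Frame = 1"
print(parse_hsp_stats(header))
```

For the alignment above, 204 identical and 228 positive positions over 273 aligned columns give 74.73% and 83.52%, in agreement with the reported values.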

BLAST of Cp4.1LG05g10490.1 vs. TrEMBL
Match: V4T1Y5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10002244mg PE=4 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 8.8e-60
Identity = 135/253 (53.36%), Positives = 172/253 (67.98%), Query Frame = 1

Query: 1   MEIGDFAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQ 60
           M+I  F  IFGEPK EW +  S +   FLFHV  P+ SHL   VTDF SNTWE+ +S+LQ
Sbjct: 1   MKIEGFEPIFGEPKAEWADSRSDSLGRFLFHVSAPDSSHLLIQVTDFRSNTWEAKRSVLQ 60

Query: 61  LHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFS 120
           L DMRDEIGIGG+ SEF+DY++ S+K  DV+L LE  S  DGAA+AK+ AQKSKGMP  S
Sbjct: 61  LDDMRDEIGIGGSWSEFIDYVVASIKSEDVKLILEGHSNADGAAYAKIVAQKSKGMPRIS 120

Query: 121 LSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQ 180
           +SLT+L   AA+EA+A LSL LF + ++ +  +++EQER LQL    +AEKE+ E+IQ+Q
Sbjct: 121 ISLTRLTGSAATEAMAKLSLELFTAFRSMQTLIVQEQERCLQLEKEAAAEKERNENIQNQ 180

Query: 181 LGQYTKKQKLQNMNAS---------------NSPDKSIGHYIGSSKTTNRAVPAHRRAKT 239
              Y+K+QKLQ MN S               +SPDK       +SK  NR +PAHRRAK 
Sbjct: 181 -PLYSKRQKLQKMNFSDKTDISASILSNGSQDSPDKQAAQSPVASKVANRVIPAHRRAKV 240

BLAST of Cp4.1LG05g10490.1 vs. TrEMBL
Match: A0A059AK38_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00189 PE=4 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 1.7e-58
Identity = 127/239 (53.14%), Positives = 164/239 (68.62%), Query Frame = 1

Query: 5   DFAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQLHDM 64
           DF AIFGEPKVEW NRG      FL++VH P+PSHLR  VTDFHSN WE+ +S+ +L DM
Sbjct: 5   DFEAIFGEPKVEWSNRGGHPLRRFLYYVHAPDPSHLRIHVTDFHSNAWEALRSVHELEDM 64

Query: 65  RDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFSLSLT 124
           RD IGIGG+ SEF+DY + S+K  DV+L L  QS   GA  AKL AQK+KGMP+  ++L 
Sbjct: 65  RDSIGIGGSWSEFIDYFVASIKSEDVKLVLHGQSDSGGATSAKLVAQKAKGMPLIFVALV 124

Query: 125 KLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQLGQY 184
           KL D AA E I  LS+ LF + K  + SL +E+ER LQ T +++AEKEK ESI+++L  +
Sbjct: 125 KLVDFAAREVIGNLSMQLFMAFKNTQTSLAEERERYLQFTKLVTAEKEKNESIENKLESF 184

Query: 185 TKKQK------LQNMNASN-SPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDD 237
            K+QK      L   +ASN S +K +     S++ TNR VPAHRR+K RG LLQ+ ED+
Sbjct: 185 PKRQKLPITSALAMSDASNPSTEKEVAQDASSTRVTNRVVPAHRRSKVRGVLLQNPEDE 243

BLAST of Cp4.1LG05g10490.1 vs. TrEMBL
Match: G7LE39_MEDTR (Uncharacterized protein OS=Medicago truncatula GN=MTR_8g093850 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.1e-54
Identity = 121/243 (49.79%), Positives = 160/243 (65.84%), Query Frame = 1

Query: 1   MEIGDFAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQ 60
           M   DF  IF EPK+EW    S    PFLFHVH PN SHL   VT+FHS+TWE+  S+  
Sbjct: 1   MAFQDFEPIFAEPKLEWKPHCSHPLRPFLFHVHPPNSSHLVIHVTNFHSDTWEAHLSVSS 60

Query: 61  LHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFS 120
           L D+ D IGIGG+ SEF +Y + SLK  D++L LE  S  DG + AKL AQKSKGMP+ +
Sbjct: 61  LEDIMDIIGIGGSWSEFANYFVNSLKSEDLKLVLEPNSNSDGVSSAKLIAQKSKGMPLIT 120

Query: 121 LSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQ 180
           + LTKL D +ASEA++ LSL LF + ++ +CSL+  QERS+QLT M+++EKE+ E+I   
Sbjct: 121 IPLTKLVDSSASEAVSNLSLSLFKAFRSTKCSLVDVQERSVQLTNMMASEKERNETIPLD 180

Query: 181 LGQY------TKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSE 238
             Q       ++K  + N  A NSPDK      G++K  NR +PAHRR K RGALL+DS+
Sbjct: 181 RRQKFQKISDSEKAGVSNNGAQNSPDKQKARDTGTTKVKNRVMPAHRRTKVRGALLRDSD 240

BLAST of Cp4.1LG05g10490.1 vs. TrEMBL
Match: I3T5J0_LOTJA (Uncharacterized protein OS=Lotus japonicus PE=2 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.1e-54
Identity = 122/240 (50.83%), Positives = 154/240 (64.17%), Query Frame = 1

Query: 1   MEIGDFAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQ 60
           M   DF  IFGEPKVEW    S    PFLFH   P+ SH+  CVTDFHS+TWE+  S+  
Sbjct: 1   MAFEDFEPIFGEPKVEWAAHSSCPLRPFLFHATAPDSSHIVVCVTDFHSDTWEARLSVSF 60

Query: 61  LHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFS 120
           L D+RD IGIGG+ +EFV+Y +TSLK  D++L LE  S  DG + AKL AQKSKGMP+ +
Sbjct: 61  LEDIRDIIGIGGSWAEFVEYFVTSLKSEDLKLVLEANSNSDGVSHAKLVAQKSKGMPLIT 120

Query: 121 LSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQ 180
           + LTKL D A +EA++ LSL LF + K   CSL+KEQERS+ LT MI+AEK K E++Q++
Sbjct: 121 IPLTKLLDSAVNEAMSNLSLNLFRAFKNITCSLVKEQERSVWLTNMIAAEKAKNETLQTE 180

Query: 181 LGQYTK-----KQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSED 236
             ++ K     K  +       SPDK        +K  NR VP HRR K RGALL  S+D
Sbjct: 181 YQKFQKISDSEKAGVSTNGLKKSPDKQAAR---DTKVKNRVVPVHRRTKVRGALLHGSDD 237

BLAST of Cp4.1LG05g10490.1 vs. TAIR10
Match: AT5G64010.1 (AT5G64010.1 unknown protein)

HSP 1 Score: 156.8 bits (395), Expect = 2.2e-38
Identity = 95/234 (40.60%), Positives = 139/234 (59.40%), Query Frame = 1

Query: 6   FAAIFGEPKVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSILQLHDMR 65
           F  IFGE   E  + GS      LFHV+  +  +L   VTDF S  W +  S+ QL DMR
Sbjct: 7   FEPIFGEVVPERSDPGSGLLRRCLFHVYASDSYNLTVHVTDFISGVWTTILSVSQLDDMR 66

Query: 66  DEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFSLSLTK 125
           D +GIGG+ SEFVDY + SLK  +V+L L E SV +G   A+L +QK+KGMP  ++ LTK
Sbjct: 67  DTVGIGGSWSEFVDYTVASLKSDNVKLLLGETSVSNGVKTARLVSQKAKGMPRINVPLTK 126

Query: 126 LFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQLGQYT 185
           + + +ASEA+A LSL LF + K+K+       +  +  +   + EK+K ++  +QL +Y+
Sbjct: 127 MVESSASEAMANLSLELFRAFKSKQ-----HLQGEVSFSAAATDEKDKRDATYNQLERYS 186

Query: 186 KKQKLQNMNASNSPDKSIGHYIGSSKTTN--RAVPAHRRAKTRGALLQDSEDDN 238
           +K  +   + +N  D         + T N  + VPAHRR + RGALLQDSE+++
Sbjct: 187 RKLDVMAPSTNNRQDSPANQSAREANTKNPVKRVPAHRRTRKRGALLQDSEEED 235

BLAST of Cp4.1LG05g10490.1 vs. NCBI nr
Match: gi|659121917|ref|XP_008460879.1| (PREDICTED: uncharacterized protein LOC103499622 isoform X1 [Cucumis melo])

HSP 1 Score: 453.8 bits (1166), Expect = 2.5e-124
Identity = 243/313 (77.64%), Positives = 267/313 (85.30%), Query Frame = 1

Query: 1   MEIGDFAAIFGEP-KVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSIL 60
           ME+ DFA IFGEP +VEW+NRGSL+ H FLFHV+TP+PS LRF VTDFHSNTWESTKS  
Sbjct: 1   MELQDFAPIFGEPTRVEWVNRGSLSLHQFLFHVYTPSPSQLRFLVTDFHSNTWESTKSAF 60

Query: 61  QLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVF 120
           QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS  DGAA  KL AQKSKGMPVF
Sbjct: 61  QLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAACVKLIAQKSKGMPVF 120

Query: 121 SLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQS 180
           S+SLTKL D AASEA+AT+SLGLFNSLK KECSL+KEQE SLQL TMIS EKEK E+IQ+
Sbjct: 121 SISLTKLIDSAASEAMATMSLGLFNSLKEKECSLVKEQEHSLQLATMISTEKEKNENIQT 180

Query: 181 QLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQ 240
           QLGQY KKQKLQNMNASNSPDKS  H IGS+KTTNR VPAHRRAKTRGALLQDSEDDNEQ
Sbjct: 181 QLGQYRKKQKLQNMNASNSPDKSAVHNIGSTKTTNRVVPAHRRAKTRGALLQDSEDDNEQ 240

Query: 241 EHSLRSTSEDKEKKKELQNTNASANVDGFQKSPDKPVIHDIGSTKITNRVVPAHRRGRTR 300
           E SL+ST E+KEKK+ L NT+  A VD  QKSPDK V HDIGSTKITN VVP HRR RTR
Sbjct: 241 ERSLQSTFEEKEKKEGLLNTDTLAIVDRLQKSPDKSV-HDIGSTKITNHVVPTHRRARTR 300

Query: 301 GALLQDDENDNDR 313
           GALLQD+E+D+ R
Sbjct: 301 GALLQDNEDDDGR 312

BLAST of Cp4.1LG05g10490.1 vs. NCBI nr
Match: gi|778670289|ref|XP_011649429.1| (PREDICTED: uncharacterized protein LOC101217609 isoform X1 [Cucumis sativus])

HSP 1 Score: 432.6 bits (1111), Expect = 6.0e-118
Identity = 232/313 (74.12%), Positives = 260/313 (83.07%), Query Frame = 1

Query: 1   MEIGDFAAIFGEP-KVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSIL 60
           ME+ DFA IFG+P +VEW+NRGSL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  
Sbjct: 1   MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAF 60

Query: 61  QLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVF 120
           QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS  DGAA  KL AQKSKGMPVF
Sbjct: 61  QLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLIAQKSKGMPVF 120

Query: 121 SLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQS 180
           S+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMIS EKEK E+IQ+
Sbjct: 121 SISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQT 180

Query: 181 QLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQ 240
           QLGQY KKQKLQNMNASNSPDKS  H IG +K TNR VP HRRAK RGALLQDSEDDNE+
Sbjct: 181 QLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEK 240

Query: 241 EHSLRSTSEDKEKKKELQNTNASANVDGFQKSPDKPVIHDIGSTKITNRVVPAHRRGRTR 300
           E SL+ST E+KEKK+ L NT+  A VD   K P+K V HDIGSTKIT  VVP H R RTR
Sbjct: 241 ERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPNKSV-HDIGSTKITKHVVPTHHRARTR 300

Query: 301 GALLQDDENDNDR 313
           GALLQD+E+D+ R
Sbjct: 301 GALLQDNEDDDGR 312

BLAST of Cp4.1LG05g10490.1 vs. NCBI nr
Match: gi|700207089|gb|KGN62208.1| (hypothetical protein Csa_2G335570 [Cucumis sativus])

HSP 1 Score: 382.9 bits (982), Expect = 5.4e-103
Identity = 204/273 (74.73%), Positives = 228/273 (83.52%), Query Frame = 1

Query: 1   MEIGDFAAIFGEP-KVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSIL 60
           ME+ DFA IFG+P +VEW+NRGSL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  
Sbjct: 1   MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAF 60

Query: 61  QLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVF 120
           QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS  DGAA  KL AQKSKGMPVF
Sbjct: 61  QLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLIAQKSKGMPVF 120

Query: 121 SLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQS 180
           S+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMIS EKEK E+IQ+
Sbjct: 121 SISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQT 180

Query: 181 QLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQ 240
           QLGQY KKQKLQNMNASNSPDKS  H IG +K TNR VP HRRAK RGALLQDSEDDNE+
Sbjct: 181 QLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEK 240

Query: 241 EHSLRSTSEDKEKKKELQNTNASANVDGFQKSP 273
           E SL+ST E+KEKK+ L NT+  A VD   K P
Sbjct: 241 ERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP 273

BLAST of Cp4.1LG05g10490.1 vs. NCBI nr
Match: gi|778670291|ref|XP_004147362.2| (PREDICTED: uncharacterized protein LOC101217609 isoform X2 [Cucumis sativus])

HSP 1 Score: 367.5 bits (942), Expect = 2.4e-98
Identity = 201/282 (71.28%), Positives = 224/282 (79.43%), Query Frame = 1

Query: 1   MEIGDFAAIFGEP-KVEWINRGSLTSHPFLFHVHTPNPSHLRFCVTDFHSNTWESTKSIL 60
           ME+ DFA IFG+P +VEW+NRGSL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  
Sbjct: 1   MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAF 60

Query: 61  QLHDMRDEIGIGGAMSEFVDYIITSLKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVF 120
           QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS  DGAA  KL AQKSKGMPVF
Sbjct: 61  QLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLIAQKSKGMPVF 120

Query: 121 SLSLTKLFDCAASEAIATLSLGLFNSLKAKECSLLKEQERSLQLTTMISAEKEKYESIQS 180
           S+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMIS EKEK E+IQ+
Sbjct: 121 SISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQT 180

Query: 181 QLGQYTKKQKLQNMNASNSPDKSIGHYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQ 240
           QLGQY KKQKLQNMNASNSPDKS  H IG +K TNR VP HRRAK RGALLQDSEDDNE+
Sbjct: 181 QLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEK 240

Query: 241 EHSLRSTSEDKEKKKELQNTNASANVDGFQKSPDKPVIHDIG 282
           E SL+ST E+K        TN    +   QKSP     H  G
Sbjct: 241 ERSLQSTFEEK--------TNLFM-ISARQKSPSMSCQHITG 273

BLAST of Cp4.1LG05g10490.1 vs. NCBI nr
Match: gi|659121919|ref|XP_008460880.1| (PREDICTED: uncharacterized protein LOC103499622 isoform X2 [Cucumis melo])

HSP 1 Score: 317.0 bits (811), Expect = 3.7e-83
Identity = 178/228 (78.07%), Positives = 194/228 (85.09%), Query Frame = 1

Query: 85  LKFGDVRLRLEEQSVDDGAAFAKLFAQKSKGMPVFSLSLTKLFDCAASEAIATLSLGLFN 144
           +KFGDVRL +E QS  DGAA  KL AQKSKGMPVFS+SLTKL D AASEA+AT+SLGLFN
Sbjct: 1   MKFGDVRLCMEGQSGKDGAACVKLIAQKSKGMPVFSISLTKLIDSAASEAMATMSLGLFN 60

Query: 145 SLKAKECSLLKEQERSLQLTTMISAEKEKYESIQSQLGQYTKKQKLQNMNASNSPDKSIG 204
           SLK KECSL+KEQE SLQL TMIS EKEK E+IQ+QLGQY KKQKLQNMNASNSPDKS  
Sbjct: 61  SLKEKECSLVKEQEHSLQLATMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKSAV 120

Query: 205 HYIGSSKTTNRAVPAHRRAKTRGALLQDSEDDNEQEHSLRSTSEDKEKKKELQNTNASAN 264
           H IGS+KTTNR VPAHRRAKTRGALLQDSEDDNEQE SL+ST E+KEKK+ L NT+  A 
Sbjct: 121 HNIGSTKTTNRVVPAHRRAKTRGALLQDSEDDNEQERSLQSTFEEKEKKEGLLNTDTLAI 180

Query: 265 VDGFQKSPDKPVIHDIGSTKITNRVVPAHRRGRTRGALLQDDENDNDR 313
           VD  QKSPDK V HDIGSTKITN VVP HRR RTRGALLQD+E+D+ R
Sbjct: 181 VDRLQKSPDKSV-HDIGSTKITNHVVPTHRRARTRGALLQDNEDDDGR 227

The following BLAST results are available for this feature:
TrEMBL:
Match Name                        E-value   Identity  Description
A0A0A0LK82_CUCSA                  3.8e-103  74.73     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G335570 PE=4 SV=1
V4T1Y5_9ROSI                      8.8e-60   53.36     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10002244mg PE=4 SV=1
A0A059AK38_EUCGR                  1.7e-58   53.14     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00189 PE=4 SV=1
G7LE39_MEDTR                      1.1e-54   49.79     Uncharacterized protein OS=Medicago truncatula GN=MTR_8g093850 PE=4 SV=1
I3T5J0_LOTJA                      1.1e-54   50.83     Uncharacterized protein OS=Lotus japonicus PE=2 SV=1

TAIR10:
Match Name                        E-value   Identity  Description
AT5G64010.1                       2.2e-38   40.60     unknown protein

NCBI nr:
Match Name                        E-value   Identity  Description
gi|659121917|ref|XP_008460879.1|  2.5e-124  77.64     PREDICTED: uncharacterized protein LOC103499622 isoform X1 [Cucumis melo]
gi|778670289|ref|XP_011649429.1|  6.0e-118  74.12     PREDICTED: uncharacterized protein LOC101217609 isoform X1 [Cucumis sativus]
gi|700207089|gb|KGN62208.1|       5.4e-103  74.73     hypothetical protein Csa_2G335570 [Cucumis sativus]
gi|778670291|ref|XP_004147362.2|  2.4e-98   71.28     PREDICTED: uncharacterized protein LOC101217609 isoform X2 [Cucumis sativus]
gi|659121919|ref|XP_008460880.1|  3.7e-83   78.07     PREDICTED: uncharacterized protein LOC103499622 isoform X2 [Cucumis melo]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG05g10490  Cp4.1LG05g10490  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG05g10490.1  Cp4.1LG05g10490.1-protein  polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                          Unique Name                           Type
Cp4.1LG05g10490.1:five_prime_utr:001  Cp4.1LG05g10490.1:five_prime_utr:001  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG05g10490.1:cds:011  Cp4.1LG05g10490.1:cds:011  CDS
Cp4.1LG05g10490.1:cds:010  Cp4.1LG05g10490.1:cds:010  CDS
Cp4.1LG05g10490.1:cds:009  Cp4.1LG05g10490.1:cds:009  CDS
Cp4.1LG05g10490.1:cds:008  Cp4.1LG05g10490.1:cds:008  CDS
Cp4.1LG05g10490.1:cds:007  Cp4.1LG05g10490.1:cds:007  CDS
Cp4.1LG05g10490.1:cds:006  Cp4.1LG05g10490.1:cds:006  CDS
Cp4.1LG05g10490.1:cds:005  Cp4.1LG05g10490.1:cds:005  CDS
Cp4.1LG05g10490.1:cds:004  Cp4.1LG05g10490.1:cds:004  CDS
Cp4.1LG05g10490.1:cds:003  Cp4.1LG05g10490.1:cds:003  CDS
Cp4.1LG05g10490.1:cds:002  Cp4.1LG05g10490.1:cds:002  CDS
Cp4.1LG05g10490.1:cds:001  Cp4.1LG05g10490.1:cds:001  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                           Unique Name                            Type
Cp4.1LG05g10490.1:three_prime_utr:001  Cp4.1LG05g10490.1:three_prime_utr:001  three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  PANTHER  PTHR35770    FAMILY NOT NAMED    coord: 1..239, score: 3.7