Cp4.1LG04g01740 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g01740
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myelin-associated oligodendrocyte basic protein isoform 1
Location: Cp4.1LG04 : 329580 .. 334029 (-)
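
The location string can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is the usual convention for this kind of genome-browser record but is not stated on the page.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG04 : 329580 .. 334029 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 4450 bases on the minus strand of Cp4.1LG04.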
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
CAAACACGCCGTACTCTTCACTACCACCACAATATTTCCTCCCGGGAGAGAGAATTCATCCTCAGCCTCATCTTCATCCTCATCATTCCCACCAAAAACCAATCAACAACAAGAAAAAAAAAGGAAGAAAATATCTCAATTTTCCGTTTCATCTTCGCTTCAAAATCCCTCCAAATTCCAGTTCCAAATCTCTAAAATCCATGGTTTCACCTTCATTGCTTGAACCCCATCCATTGAAATGGAACCGATTCCCAGTTCATTTCGTCCATTCTCCTTCTCCATCAACGAGTGGTCGCTGTCATCTCTACAGACCCTTTGTTCTACCCAAGAATTTGAGAATGGGTGGTTCTAGCTCTGTCAAATTCGGTGTTAGATGCTTCTTTTCCAAGGAGGGGGGGAAGAGTTTGACAACTTCGAATTCCAGGTTGGAGCGATTGGTCGATGGAGATAAACCCAAAAGGCCAATGGAGGTAATTGCCAATTCGATTATGAATGCTCTCAAGGCGTTGCGGAAGCCTGCGATTGCTGCGGTGTTGTTGGGATTGCTGTTGATGTACGATCCTAATTCTGCCTTAGCCGCTTCTGGTGGACGTGTTGGAGGAAGTTCGTTTTCGTCGCGCTCGTCTTCGTCTTCGAGGAGTTACTCAACTCCTTCGATGATTTCTGGTTATTCATATTCTGCGCCTTATAGTTCGCCGTCGTTGTTTGGCGGCGGCGGGATTTACGTTGGGCCGGCGGTTGGGGTCGGAGTCGGTGCCGGATCAAGTTTTTTCTTCATATTGGCTGGTTTCGCGGCGTTTCTTCTGGTTTCTGGATTTCTCTCCGATCGGTCCGATAGTAGTGTTCTCACTGCTTCTGAGAAAACTAGCGTGCTTAAACTTCAAGTATGACAAACCCTTCCCTCTCTGTATCTTGTTTTCTGATATATGGATTTTGTGTTACTGTTTACTCTCATTCGTCTTTTGAATCAACGGTTGAAATTCAAAACATGGCTGGGTGTTGTGCATATTCATTATGGATAAACAAGTCGTTTGTCAAGCAAGGATGTAGCCAGTGTTCTTGAAGGATAAAATGTAATGGCCCAAGCCCATGGTCCTCCTTGGGCTTTCCTTTCCGGCTTCCCATCAAGGCTTTCAAAACGCTTCGGCTAGGGAGAGGTTTTCACACCCTTATAAATAATGTTTCGTTCTCTTTCCCAAATGTGGGATCTCACATAAACAGTTGTAGGTTTCTATGTTTAAAATTTAGACCCTTTTGATGCAGACTGAATCTTTTTCGAAAACAACAATATGAACGGAGAATATGAAAAATAGAAGTATGGTTTCTTGCAGTTAACCCGAATGCCCTCATCACCTATGCTCATGGAGGTCATTCTTTTGAAAAGAAGATGATGATCTTGATGTCTGATGTTATAGCATTGCTAGTATTTGTTTTTAGGGTGGGGATGTTTCTTGTTGAAAAGAAATAGAACCTTCAAAGACGTTGAGAGCCCTATTTCTCTTTTTAACTATAGCTTGTCCTATCTTTATCTTAATTGGAAGAGATTTTGTAAATTCCATTTCCCTTGATTGATGCTCCCCTCTCCCTTTGTAGGTTTTTTAAATTATTTTAATGGAAAGTTCCTTTGGTTTCTTACGGCACAAAAAAGGTTGTAAAGTTCATTTCACGATTTGAAAAAAAAAATGTAATGATGTACTGGCACAGTGTAGAAAACTTGGTTTCTGTAGTCATGAATTATTGATTTTCTTGTAGGATAGATGAATGCAGATTCTAGAGTTACAATTTACTTTATGAAAAATTTCTACAAGTATGTCATCATTCTTAACTCATGGCACAACTCTCTCTCTCTCTCTCTCTAGGTTGGGTTGTTGGGCATGGGCCGGGGACTCCAAAGAGATCTAAATAGAATTGCTGAAAGTGCTGATACATCAACCCCTGAGGGTTTGAGCTATGTACTAACTGGTAAACTATTGATTTCTTCCTCTCGAGATTAATTACCT
CGCTCACTGATTCCTTTAATAAAACAAGCAGTAAACTGGTACTGTTAGTCCCTGCATGTTAGAAGATACTATAGAAAAATAAAAGACCCTAATAACAATGATGCCATCTTACGTTTTCTGAAACTGTTTCCAGCAGGTGTCGTAAAACAAATTTTCTAAGTCAATATTGTTATTTGTTTCGGAACTGTTTAAAATTCTTCCAGAATGGTTTATAGAGATTTCTTTAAGCAGCAATCACATGCATACCCTTCTGTTGTGCAGTTAAAGTAACTTATTCTCTATGTAATTTTTGTTTTCCCCTTTGGGTTGCAGAAACGATTCTTGCGCTACTTCGACATCCTGATTATTGCATTTCAGGTTATTCATCTGTAAGTCCCACTATGTCCTTGGAGACTGCCTGATCTAGTTGCACAGAGTTAACACATGCATATTAATTATGTATTGAGGGTTGCTGGGTACTAGTGGTGATTGTTCATGCCTGTTTATGAATAGAACACATTGATAAGGCAGGGGAATGTAAAATGAGATACTTCAATAAACGAACATAGTACACAACAAAATCAATAGACTACGAACACCTTTACAATCAAGAACAGCCTGATGAGTTGAATCTTTGAATAGTTATACTAAACTTGCTCAAATGGCAAGAATATGTTATCAAGAAAGTTGTTTCTCAGATTTGTAGTCAACAAATTTTTAACTATATTAGATTCGGATCATGTTTTCACTGAAGAAATTTCATTTTTCAAAGGATCCCACTACCTTTCATTCTCTCTCACATCATGTTGATGATTGTGATGTTGCATTTGAAATTAGTTTGACTAGTATGGAATCTTTTAAGCTTATAAATGTAACTCTTCAGGTTTAGAATTCTGATTGAAAAGCTTGCTAAAAGAAAACTTTTGATATCTCCTAGAAATGGAATTTTAGTGCATGAACTTGCTAGGTCTTCAATGGATTAGTGGGGTGGACTTTGTATAAGCTGCTTAGCTATGTTATGCTCTTAATTAGTGGCATATATAGCCTCCCCATTTTGCCTACACGATATTGTGGTGGTGAAGTTTCTTCTCCTATTCTTGTTTCCAATGAGAATGCTTATTATCACATTCTTCTTTGAATTATTTCTCACAAGGCTTGCTAGCCTTAGTTGGTATTGGAAGCTTAATCTCACTCTTTTTGGAAAAAACTTAGATGGATCTGAAACGTAGTATAGAAGATGGAGAGAAGCGTTTCAATAAACTATCAATTGAAGAGCGAGGGAAATTTGATGAGGAAACTCTTGTCAACGTGAATAGCATCAAAAGACAGAGTACAAGCAGCCAGAGAACAAGTGGATTTAGCAATGAATACATTGTGGTTAGTTTCACTTGGCCTTGAGTTTTATGCAATGTATACTATTCTACATTATTTTTTGTCTTGATTTTTCTTGCTGTGCTGATTTAGACTATGTTTTTTGGTGTATTCTTTCTCTTGATTTTTGTGAGGTGTGTGTGGTAATATTGGAAAACCACTCATTTTAGAGAAAAAAAACGTGATGTTCCGTTTCTCTTATTCATAATTGAAGTTTTTTTCTCGATTAAAGCAGAACACTTCTGGAATGTAATTCAGATTTTATTTTGATGATTGGTTACTAAGAGGTGCATTCGTTTTGCTGGAATGATAATCTGATATTCATGAGAGAAAAGACCACGGTTTCCTTTCTTCTACCCTTTGATCACAATATTTTGAATATTAGATAACAATATTGGTGGCTGCTGAGGGAGTGCACAAGCTACCTGCTATCAATGGTAGTGGGGACTTGAAGGAAGCTTTGCAAAAACTAGCATCTATTCCTTCCAGCAAAATATTGGTAAGTTGGTATTTTAGGCCCAATAATCATTTCTATTTTAGCTTTTTTGTTTTTGAAAATTAAATTTGTTTTTTCCCAGTTTCTTCATAGTTGGTCCTACCTATCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGGAAACCAAAATTCAAAAACAAAAATAAGACTTTAAAAACTACTTTTTTTAGTGTTCAAAAGTTGTTTGAGTTTTGAAAACATTGGTCAAAAGTAGATAACAAATTGAAGAATCTCATCTCATGGGAGTGATTTTGTAAGTTTCATTTTAAATGACTAAAAACCAAAAGCAAAAGTTCTGTTTATTCTTAGATCTCAAAGTCTATTCTGTTTGCATGCAGGCAGTTGAGGTTTTATGGACGCCGCAGAACGAGAACGACACGTTGTCCGAGCGTGAACTCCTGGAAGATTATCCACTTCTAAGGCCTCTATAA

mRNA sequence

CAAACACGCCGTACTCTTCACTACCACCACAATATTTCCTCCCGGGAGAGAGAATTCATCCTCAGCCTCATCTTCATCCTCATCATTCCCACCAAAAACCAATCAACAACAAGAAAAAAAAAGGAAGAAAATATCTCAATTTTCCGTTTCATCTTCGCTTCAAAATCCCTCCAAATTCCAGTTCCAAATCTCTAAAATCCATGGTTTCACCTTCATTGCTTGAACCCCATCCATTGAAATGGAACCGATTCCCAGTTCATTTCGTCCATTCTCCTTCTCCATCAACGAGTGGTCGCTGTCATCTCTACAGACCCTTTGTTCTACCCAAGAATTTGAGAATGGGTGGTTCTAGCTCTGTCAAATTCGGTGTTAGATGCTTCTTTTCCAAGGAGGGGGGGAAGAGTTTGACAACTTCGAATTCCAGGTTGGAGCGATTGGTCGATGGAGATAAACCCAAAAGGCCAATGGAGGTAATTGCCAATTCGATTATGAATGCTCTCAAGGCGTTGCGGAAGCCTGCGATTGCTGCGGTGTTGTTGGGATTGCTGTTGATGTACGATCCTAATTCTGCCTTAGCCGCTTCTGGTGGACGTGTTGGAGGAAGTTCGTTTTCGTCGCGCTCGTCTTCGTCTTCGAGGAGTTACTCAACTCCTTCGATGATTTCTGGTTATTCATATTCTGCGCCTTATAGTTCGCCGTCGTTGTTTGGCGGCGGCGGGATTTACGTTGGGCCGGCGGTTGGGGTCGGAGTCGGTGCCGGATCAAGTTTTTTCTTCATATTGGCTGGTTTCGCGGCGTTTCTTCTGGTTTCTGGATTTCTCTCCGATCGGTCCGATAGTAGTGTTCTCACTGCTTCTGAGAAAACTAGCGTGCTTAAACTTCAAGTTGGGTTGTTGGGCATGGGCCGGGGACTCCAAAGAGATCTAAATAGAATTGCTGAAAGTGCTGATACATCAACCCCTGAGGGTTTGAGCTATGTACTAACTGAAACGATTCTTGCGCTACTTCGACATCCTGATTATTGCATTTCAGGTTATTCATCTATGGATCTGAAACGTAGTATAGAAGATGGAGAGAAGCGTTTCAATAAACTATCAATTGAAGAGCGAGGGAAATTTGATGAGGAAACTCTTGTCAACGTGAATAGCATCAAAAGACAGAGTACAAGCAGCCAGAGAACAAGTGGATTTAGCAATGAATACATTGTGATAACAATATTGGTGGCTGCTGAGGGAGTGCACAAGCTACCTGCTATCAATGGTAGTGGGGACTTGAAGGAAGCTTTGCAAAAACTAGCATCTATTCCTTCCAGCAAAATATTGAACGAGAACGACACGTTGTCCGAGCGTGAACTCCTGGAAGATTATCCACTTCTAAGGCCTCTATAA

Coding sequence (CDS)

ATGGTTTCACCTTCATTGCTTGAACCCCATCCATTGAAATGGAACCGATTCCCAGTTCATTTCGTCCATTCTCCTTCTCCATCAACGAGTGGTCGCTGTCATCTCTACAGACCCTTTGTTCTACCCAAGAATTTGAGAATGGGTGGTTCTAGCTCTGTCAAATTCGGTGTTAGATGCTTCTTTTCCAAGGAGGGGGGGAAGAGTTTGACAACTTCGAATTCCAGGTTGGAGCGATTGGTCGATGGAGATAAACCCAAAAGGCCAATGGAGGTAATTGCCAATTCGATTATGAATGCTCTCAAGGCGTTGCGGAAGCCTGCGATTGCTGCGGTGTTGTTGGGATTGCTGTTGATGTACGATCCTAATTCTGCCTTAGCCGCTTCTGGTGGACGTGTTGGAGGAAGTTCGTTTTCGTCGCGCTCGTCTTCGTCTTCGAGGAGTTACTCAACTCCTTCGATGATTTCTGGTTATTCATATTCTGCGCCTTATAGTTCGCCGTCGTTGTTTGGCGGCGGCGGGATTTACGTTGGGCCGGCGGTTGGGGTCGGAGTCGGTGCCGGATCAAGTTTTTTCTTCATATTGGCTGGTTTCGCGGCGTTTCTTCTGGTTTCTGGATTTCTCTCCGATCGGTCCGATAGTAGTGTTCTCACTGCTTCTGAGAAAACTAGCGTGCTTAAACTTCAAGTTGGGTTGTTGGGCATGGGCCGGGGACTCCAAAGAGATCTAAATAGAATTGCTGAAAGTGCTGATACATCAACCCCTGAGGGTTTGAGCTATGTACTAACTGAAACGATTCTTGCGCTACTTCGACATCCTGATTATTGCATTTCAGGTTATTCATCTATGGATCTGAAACGTAGTATAGAAGATGGAGAGAAGCGTTTCAATAAACTATCAATTGAAGAGCGAGGGAAATTTGATGAGGAAACTCTTGTCAACGTGAATAGCATCAAAAGACAGAGTACAAGCAGCCAGAGAACAAGTGGATTTAGCAATGAATACATTGTGATAACAATATTGGTGGCTGCTGAGGGAGTGCACAAGCTACCTGCTATCAATGGTAGTGGGGACTTGAAGGAAGCTTTGCAAAAACTAGCATCTATTCCTTCCAGCAAAATATTGAACGAGAACGACACGTTGTCCGAGCGTGAACTCCTGGAAGATTATCCACTTCTAAGGCCTCTATAA

Protein sequence

MVSPSLLEPHPLKWNRFPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRCFFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYDPNSALAASGGRVGGSSFSSRSSSSSRSYSTPSMISGYSYSAPYSSPSLFGGGGIYVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGMGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKRFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAINGSGDLKEALQKLASIPSSKILNENDTLSERELLEDYPLLRPL
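
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below assumes the standard genetic code and uses only the first 36 and last 27 bases of the CDS, copied from the record; `translate`, `cds_start`, and `cds_end` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTTTCACCTTCATTGCTTGAACCCCATCCATTG"  # first 12 codons of the CDS
cds_end = "GATTATCCACTTCTAAGGCCTCTATAA"             # last 9 codons, including TAA

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MVSPSLLEPHPL and DYPLLRPL, which match the first and last residues of the protein sequence listed above.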
BLAST of Cp4.1LG04g01740 vs. TrEMBL
Match: W9R6R7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006177 PE=4 SV=1)

HSP 1 Score: 478.8 bits (1231), Expect = 6.4e-132
Identity = 285/409 (69.68%), Positives = 318/409 (77.75%), Query Frame = 1

Query: 6   LLEPHPLKWNR-FPVHFVHSPSPSTSGRCH-LYRP----FVLPKNLRMGGSSSVKFGVRC 65
           LLE +P KW + FP+     P P      H L R     F     LR G S   K  V+C
Sbjct: 7   LLEANPSKWTQSFPIPLSAPPLPRPRSYNHDLARSSSSRFTGIIKLRHGSS---KLSVKC 66

Query: 66  FFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMY 125
           FF+ +  +S+ +        V  +    P EV+A++IM ALKAL+KPAIA VLLGLLLMY
Sbjct: 67  FFAAKKNQSIDSEK------VPEENRANPFEVVASTIMKALKALKKPAIAVVLLGLLLMY 126

Query: 126 DPNSALAASGGRVGGSSFSSRS--SSSSRSYSTPSMIS-GYSYSAPYSSPSLFG-GGGIY 185
           DPN+ALAASGGR+GG SFSSRS  +SSSRSYS P     G+SYS PY +PS FG GGG+Y
Sbjct: 127 DPNTALAASGGRMGGKSFSSRSPAASSSRSYSVPRTSGPGFSYSVPYYAPSPFGFGGGVY 186

Query: 186 VGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGMG 245
           VGPAVGVG  AGSSFF IL GFAAF+LVSGFLSDRS+ SVLTA+EKTSVLK+QVGLLGMG
Sbjct: 187 VGPAVGVG--AGSSFFLILMGFAAFVLVSGFLSDRSEDSVLTATEKTSVLKVQVGLLGMG 246

Query: 246 RGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKRF 305
           R LQRDLNRIAE+ADTSTPEGLSYVLTET LALLRHPDYCISGYSS+D+KRS+EDGEKRF
Sbjct: 247 RALQRDLNRIAETADTSTPEGLSYVLTETTLALLRHPDYCISGYSSVDIKRSMEDGEKRF 306

Query: 306 NKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAINGS 365
           N+LSIEERGKFDEETLVNVN+IKRQST+SQR SGFSNEYIVI+ILVAAEGVHK+P ING 
Sbjct: 307 NQLSIEERGKFDEETLVNVNNIKRQSTTSQRASGFSNEYIVISILVAAEGVHKMPVINGG 366

Query: 366 GDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
            DLKEALQKL SIPS KIL         NENDTLSERELLEDYPLLRPL
Sbjct: 367 RDLKEALQKLGSIPSHKILAVEVLWTPQNENDTLSERELLEDYPLLRPL 404
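
The percentage on the Identity line is the number of identical positions divided by the alignment length. The following sketch recomputes it from the fraction reported for this hit; the variable names are illustrative and the regular expression is an assumption about the line layout shown above, not a general BLAST parser.

```python
import re

line = "Identity = 285/409 (69.68%)"

# Pull out matches, alignment length, and the percentage as reported.
m = re.search(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)", line)
matches = int(m.group(1))
aligned = int(m.group(2))
reported = float(m.group(3))

# Percent identity = identical positions / aligned positions * 100.
recomputed = round(100 * matches / aligned, 2)
print(matches, aligned, reported, recomputed)
```

The recomputed value agrees with the reported 69.68% to two decimal places.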

BLAST of Cp4.1LG04g01740 vs. TrEMBL
Match: M5VZ21_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005402mg PE=4 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 4.2e-131
Identity = 285/413 (69.01%), Positives = 314/413 (76.03%), Query Frame = 1

Query: 1   MVSPSLLEPHPLKWNRFPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRCF 60
           M + SLL+P+PLKW                   H   PF+LP  LR   +  +KF     
Sbjct: 76  MATASLLQPNPLKWK------------------HSSCPFLLPP-LRPLPTKPIKF----- 135

Query: 61  FSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYD 120
            S        + N +   LV  D P  P+E+IA+ ++NALKALRKPA+AAVLLGLLLM D
Sbjct: 136 -SSTFDVEQASCNFKSVPLVIKDNPANPLELIASKLLNALKALRKPAMAAVLLGLLLMSD 195

Query: 121 PNSALAASGGRVGGSSFSSRSSSSS---RSYSTPSMISG---YSYSAPYSSPSLFG---G 180
           PNSALAASGGRVGG+SFSSRSSSSS   RSYS P   S    +SYSAPY +PS FG   G
Sbjct: 196 PNSALAASGGRVGGNSFSSRSSSSSSSSRSYSVPRTSSSRPDFSYSAPYYAPSPFGFSGG 255

Query: 181 GGIYVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGL 240
           GG+YVGPA G GVGAGSSFF IL GFAAF+LVSGFLSDRS+ SVLTA+EKT+VLKLQVGL
Sbjct: 256 GGVYVGPAFGFGVGAGSSFFLILTGFAAFVLVSGFLSDRSEGSVLTATEKTTVLKLQVGL 315

Query: 241 LGMGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDG 300
           LGMGR LQRDLNRIAE+ADTST EGL YVLTET LALLRHPDYCISGYSS+  KR IED 
Sbjct: 316 LGMGRALQRDLNRIAETADTSTSEGLGYVLTETTLALLRHPDYCISGYSSVAQKRGIEDA 375

Query: 301 EKRFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPA 360
           EKRFN+LSIEERGKFDEETLVNVN+IKRQS+SSQR +GF NEYIVITILVAAEGVHKLPA
Sbjct: 376 EKRFNQLSIEERGKFDEETLVNVNNIKRQSSSSQRANGFRNEYIVITILVAAEGVHKLPA 435

Query: 361 INGSGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           INGSGDLKEALQKL SIPS+KIL         NE DTLSERELLEDYPLLRPL
Sbjct: 436 INGSGDLKEALQKLGSIPSNKILAVEVLWTPQNEIDTLSERELLEDYPLLRPL 463

BLAST of Cp4.1LG04g01740 vs. TrEMBL
Match: A0A067JUH0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23463 PE=4 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.2e-130
Identity = 280/410 (68.29%), Positives = 314/410 (76.59%), Query Frame = 1

Query: 1   MVSPSLLEPHPLKWNRFPVHFVHSPSPSTSGRCHLY----RPFVLPKNLRMGGSSSVKFG 60
           M + S LE +PLKWN+  V F   PS      C+ Y    RP        M G    KF 
Sbjct: 1   MATASFLEWNPLKWNQ-TVPFQLPPSRPVP--CNPYSAPLRPVKYGAAFTMKGHGLAKFS 60

Query: 61  VRCFFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLL 120
            +C F  +   +  +S+S     V       P E+I  +++ ALKAL+KPAIAA+L+GLL
Sbjct: 61  GKCSFMDKHLTTSCSSSSSNPISVSRSITTNPFEMIYGAMVRALKALQKPAIAAILVGLL 120

Query: 121 LMYDPNSALAASGGRVGGSSFSSRSSSSSRSYSTPSMISG-YSYSAPYSSPSLFGG-GGI 180
           LMYDPNSA AASGGR+GG SFS  SSSSSRSYS P   S  +SYS PY +PS FGG GGI
Sbjct: 121 LMYDPNSAFAASGGRMGGRSFSRSSSSSSRSYSVPRTSSPEFSYSVPYYAPSPFGGGGGI 180

Query: 181 YVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGM 240
           YVGPAVGVGVGAGSS F ILAGFAAF+LVSGFLSDRS+  VLTA+EKTSV+KLQVGLLGM
Sbjct: 181 YVGPAVGVGVGAGSSLFLILAGFAAFMLVSGFLSDRSEGGVLTATEKTSVVKLQVGLLGM 240

Query: 241 GRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKR 300
           GR LQ DLNRIAE ADTS+ EGLSYVLTET L+LLRHPDYCISGYS +D+KRSIEDGEKR
Sbjct: 241 GRSLQSDLNRIAEIADTSSSEGLSYVLTETTLSLLRHPDYCISGYSYVDVKRSIEDGEKR 300

Query: 301 FNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAING 360
           FN+LSIEERGKFDEETLVNVN+IK+QSTSSQR +GF+NEYIVITILVAAEGVHKLPAIN 
Sbjct: 301 FNQLSIEERGKFDEETLVNVNNIKKQSTSSQRANGFNNEYIVITILVAAEGVHKLPAINS 360

Query: 361 SGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           SGDLKEALQKL SIP+SKIL         NENDTL+ERELLEDYPLLRPL
Sbjct: 361 SGDLKEALQKLGSIPASKILAVEVLWTPQNENDTLTERELLEDYPLLRPL 407

BLAST of Cp4.1LG04g01740 vs. TrEMBL
Match: A0A061F284_THECC (Myelin-associated oligodendrocyte basic protein isoform 1 OS=Theobroma cacao GN=TCM_026403 PE=4 SV=1)

HSP 1 Score: 471.5 bits (1212), Expect = 1.0e-129
Identity = 290/410 (70.73%), Positives = 316/410 (77.07%), Query Frame = 1

Query: 5   SLLEPHPLKWNR-FPVHFVHSPSPSTSGRCHLYRPFVLPK---NLRMGGSSSVKFGVRCF 64
           SLL+ +PLK    FP H    P PS         P  LPK   NL      S K  VRCF
Sbjct: 7   SLLQSNPLKLKHYFPRHL---PPPS---------PPRLPKFSGNLTFQTQGSTKCNVRCF 66

Query: 65  FSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYD 124
              E  KS    ++ L    DG   K P E+IA +I  AL AL+KPAIAAVLLGLLLMYD
Sbjct: 67  LLPEKRKSSKLDSNHLSSSSDG---KHPFEIIAETISKALNALKKPAIAAVLLGLLLMYD 126

Query: 125 PNSA-LAASGGRVGGSSFSSRSSSSSRSYSTP----SMISGYSYSAPYSSPSLFGGGG-I 184
           PN+A LAASGGR+GG SFSS SSSSSRSYS P    S  S  SYS PY +P+ FGGGG  
Sbjct: 127 PNNAALAASGGRMGGRSFSS-SSSSSRSYSVPRNGGSRFS--SYSVPYYAPAPFGGGGGF 186

Query: 185 YVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGM 244
           Y+GPAVGVGVGAGSSFF IL GFAAF+LVSGFLSDRS+SSVLTASE+TSVLKLQVGLLGM
Sbjct: 187 YMGPAVGVGVGAGSSFFLILIGFAAFVLVSGFLSDRSESSVLTASERTSVLKLQVGLLGM 246

Query: 245 GRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKR 304
           GR LQ+DLNRIAE ADTST EGLS+VLTET LALLRHP YCISGYSS+D KRSI+DGEKR
Sbjct: 247 GRSLQKDLNRIAEVADTSTSEGLSFVLTETTLALLRHPHYCISGYSSVDAKRSIDDGEKR 306

Query: 305 FNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAING 364
           FN+LSIEERGKFDEETLVNVN+IKRQST+S++ SGFSNEYIVITILVAAEG+HKLP ING
Sbjct: 307 FNQLSIEERGKFDEETLVNVNNIKRQSTTSRKASGFSNEYIVITILVAAEGLHKLPPING 366

Query: 365 SGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           S DLKEALQKLASIP+SKIL         NENDTLSERELLEDYPLLRPL
Sbjct: 367 SRDLKEALQKLASIPTSKILAVEVLWTPQNENDTLSERELLEDYPLLRPL 398

BLAST of Cp4.1LG04g01740 vs. TrEMBL
Match: A9PJ10_9ROSI (Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 5.6e-128
Identity = 270/421 (64.13%), Positives = 315/421 (74.82%), Query Frame = 1

Query: 1   MVSPSLLEPHPLKWNR-FPV-------------HFVHSPSPSTSGRCHLYRPFVLPKNLR 60
           M + SL+E  PLKW + FP              + +    P  + +C    P + P +  
Sbjct: 1   MATASLIESSPLKWKKAFPFQPQPPLLTPNHYPYLIPLKPPKFTSKCISQLPILNPNSKN 60

Query: 61  MGGSSSVKFGVRCFFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKP 120
             G   +   +    S+   K  T+ N              P+E+I  +++ AL  L+KP
Sbjct: 61  NNGPFPISPSITKPISQNLSKPPTSKN--------------PLEIIYETMLKALDILKKP 120

Query: 121 AIAAVLLGLLLMYDPNSALAASGGRVGGSSFSSRSSS--SSRSYSTP-SMISGYSYSAPY 180
           AIAA+L+G+LL++DPNSA AASGGR+GG+SFS RSSS  SSRSYS P    SG+SYS PY
Sbjct: 121 AIAAILIGVLLLHDPNSAFAASGGRIGGNSFSRRSSSEYSSRSYSVPRGGSSGFSYSVPY 180

Query: 181 SSPSLFGGGGIYVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTS 240
            +PS FGGGG+YVGPAVGVGVGAGSS FFILAGFAAF+LVSGFLSDR++  VLTA+EKTS
Sbjct: 181 YAPSPFGGGGVYVGPAVGVGVGAGSSLFFILAGFAAFMLVSGFLSDRNEGGVLTAAEKTS 240

Query: 241 VLKLQVGLLGMGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMD 300
           VLKLQVGLLGMGR LQRDLNRIAE ADTS+ EGL+YVLTET LALLRHPDYCISG+S +D
Sbjct: 241 VLKLQVGLLGMGRSLQRDLNRIAEVADTSSSEGLNYVLTETSLALLRHPDYCISGHSFVD 300

Query: 301 LKRSIEDGEKRFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAA 360
           +KRS+EDGEKRFN+LSIEERGKFDEETLVNVNSIKRQSTSS+R++GFSNEYIVITILVAA
Sbjct: 301 VKRSMEDGEKRFNQLSIEERGKFDEETLVNVNSIKRQSTSSKRSNGFSNEYIVITILVAA 360

Query: 361 EGVHKLPAINGSGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRP 396
           EGV+KLP INGSGDLKEALQKL SI +SKIL         NENDTLSERELLEDYPLLRP
Sbjct: 361 EGVYKLPTINGSGDLKEALQKLGSISASKILAVEVLWTPQNENDTLSERELLEDYPLLRP 407

BLAST of Cp4.1LG04g01740 vs. TAIR10
Match: AT1G54520.1 (AT1G54520.1 unknown protein)

HSP 1 Score: 404.4 bits (1038), Expect = 7.8e-113
Identity = 252/411 (61.31%), Positives = 289/411 (70.32%), Query Frame = 1

Query: 3   SPSLLEPHPLKWNRFPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRCFFS 62
           S + LE  P +WN+        P P T  R H     +  K  R   S  ++  V+   S
Sbjct: 4   SSTFLELTPFQWNQ--------PLPYTQ-RPHHRTVLLYSKPQRRSNSIRLQISVKYKQS 63

Query: 63  KEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYDPN 122
                    SN              P E IA  +  AL +L+KPAIAAVLLGLLL YDPN
Sbjct: 64  TSSSDPDLRSNFN------------PFEQIAIQVKKALDSLKKPAIAAVLLGLLLFYDPN 123

Query: 123 SALAASGGRVGGSSFSSRS----SSSSRSYSTPSMIS-GYSYSA---PYSSPSLFGGGGI 182
           SALAASGGR+GG+SFSSRS    SSSS+SYS P   +  +SYSA   PY  PS FGGG  
Sbjct: 124 SALAASGGRIGGNSFSSRSRSSSSSSSQSYSVPRTSNPSFSYSARTAPYYGPSPFGGG-- 183

Query: 183 YVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRS-DSSVLTASEKTSVLKLQVGLLG 242
           +VGPAVG G G  SSF  IL GFAAF+LVSGFLSDRS D S+LT ++KTSV+KLQVGLLG
Sbjct: 184 FVGPAVGFGFGGFSSFSLILVGFAAFVLVSGFLSDRSQDDSILTDTQKTSVIKLQVGLLG 243

Query: 243 MGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEK 302
           +GR LQ+D NR+AES+DTSTPEGLSYVLTE  LALLRHPDYCIS YSS+D+K SIE GEK
Sbjct: 244 LGRTLQQDFNRLAESSDTSTPEGLSYVLTEATLALLRHPDYCISCYSSVDVKPSIEKGEK 303

Query: 303 RFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAIN 362
           RFN+LSIEERGKFDEETLVNVNSIKRQS+  ++ SGFSNEYIV+TIL+AAEG+HKLP IN
Sbjct: 304 RFNQLSIEERGKFDEETLVNVNSIKRQSSKIRKASGFSNEYIVVTILMAAEGIHKLPPIN 363

Query: 363 GSGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           G+ DLKEAL KL SIP +KI+         NE D LSERELLEDYPLLRPL
Sbjct: 364 GTTDLKEALLKLGSIPRNKIMAVEVLWTPQNEADALSERELLEDYPLLRPL 391

BLAST of Cp4.1LG04g01740 vs. NCBI nr
Match: gi|449449537|ref|XP_004142521.1| (PREDICTED: uncharacterized protein LOC101210275 [Cucumis sativus])

HSP 1 Score: 637.1 bits (1642), Expect = 2.0e-179
Identity = 347/406 (85.47%), Positives = 371/406 (91.38%), Query Frame = 1

Query: 1   MVSPSLLEPHPLKWNR-FPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRC 60
           MVS SLLEPHPLKWN+ F VH   SPSPSTSGR HL RPF++P+NL++  SSSVK+ VRC
Sbjct: 1   MVSASLLEPHPLKWNKTFRVHLPSSPSPSTSGRYHLCRPFIVPRNLKIHDSSSVKYPVRC 60

Query: 61  FFSKEGGKSLTT-SNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLM 120
           FFS++G  S T+ SNS    LV+GDKPK PMEVI NSI+NALKAL+KPAIAAVLLGLLLM
Sbjct: 61  FFSEKGRSSTTSISNSSSVELVNGDKPKSPMEVIGNSIINALKALQKPAIAAVLLGLLLM 120

Query: 121 YDPNSALAASGGRVGGSSFSSRSSSSSRSYSTPSMISGYSYSAPYSSPSLFGGGGIYVGP 180
           YDPNSALAASGGRVGG++FSSRSSSSSRSYSTP M SG+SYSAPY+SPS+FGGGGIYVGP
Sbjct: 121 YDPNSALAASGGRVGGNAFSSRSSSSSRSYSTPRMSSGFSYSAPYTSPSMFGGGGIYVGP 180

Query: 181 AVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGMGRGL 240
           AVGVG+GAGSSF FILAGFAAFLLVSGFLSDRSD+SVLTAS+KTSVLKLQVGLLGMGRGL
Sbjct: 181 AVGVGLGAGSSFVFILAGFAAFLLVSGFLSDRSDTSVLTASDKTSVLKLQVGLLGMGRGL 240

Query: 241 QRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKRFNKL 300
           QRDLNRIAESADTSTPEGL YVLTETILALLRHPDYCISGYSS+D+KRSIE+GEKRFNKL
Sbjct: 241 QRDLNRIAESADTSTPEGLCYVLTETILALLRHPDYCISGYSSIDVKRSIEEGEKRFNKL 300

Query: 301 SIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAINGSGDL 360
           SIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLP INGSGDL
Sbjct: 301 SIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPTINGSGDL 360

Query: 361 KEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           KEALQKLASIPSSKIL         NENDTLSERELLEDYPLLRPL
Sbjct: 361 KEALQKLASIPSSKILAVEVLWTPQNENDTLSERELLEDYPLLRPL 406

BLAST of Cp4.1LG04g01740 vs. NCBI nr
Match: gi|823196285|ref|XP_012492877.1| (PREDICTED: uncharacterized protein LOC105804719 isoform X1 [Gossypium raimondii])

HSP 1 Score: 486.1 bits (1250), Expect = 5.8e-134
Identity = 287/407 (70.52%), Positives = 318/407 (78.13%), Query Frame = 1

Query: 5   SLLEPHPLKWNR-FPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRCFFSK 64
           SLL+ + L     FP+H      P  S              L   G  S KF V+CFFS 
Sbjct: 7   SLLQSNTLNLKHYFPLHLPLPSPPRLSN---------FTGTLAFNGQRSAKFTVKCFFSS 66

Query: 65  EGGKSLTTSN-SRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYDPN 124
           E  K LT+S+     +L      K P E+IA +++ AL AL+KPAIAAVL+GLLLMYDPN
Sbjct: 67  EQRKHLTSSSLGSNNQLSSSSNDKNPFEIIAQTMLKALNALKKPAIAAVLMGLLLMYDPN 126

Query: 125 S-ALAASGGRVGGSSFSSRSSSSSRSYSTP----SMISGYSYSAPYSSPSLFGGGGIYVG 184
           + ALAASGGR+GG SFSS SSSSSRSYS P    S  S  SYSAPY +P+ FGGGG Y+G
Sbjct: 127 NVALAASGGRMGGRSFSS-SSSSSRSYSVPRNGGSRFS--SYSAPYYAPAPFGGGGFYMG 186

Query: 185 PAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGMGRG 244
           PAVGVGVGAGSSFF IL GFAAF+LVSGFLSDRS+SSVLTASE+TSV+KLQVGLLGMGR 
Sbjct: 187 PAVGVGVGAGSSFFLILVGFAAFVLVSGFLSDRSESSVLTASERTSVIKLQVGLLGMGRS 246

Query: 245 LQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKRFNK 304
           LQRDLNRIAE ADTST EGLS+VLTET LALLRHPDYCISGYSS+D+KRSI+DGE RFN+
Sbjct: 247 LQRDLNRIAEVADTSTSEGLSFVLTETTLALLRHPDYCISGYSSVDVKRSIDDGENRFNQ 306

Query: 305 LSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAINGSGD 364
           LSIEERGKFDEETLVNVN+IKRQSTSSQ+ SGFSNEYIVITILVAAEG+HKLP INGSGD
Sbjct: 307 LSIEERGKFDEETLVNVNNIKRQSTSSQKASGFSNEYIVITILVAAEGMHKLPPINGSGD 366

Query: 365 LKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           LKEALQKLASIPSSKIL         NENDTLSE+ELLEDYPLLRPL
Sbjct: 367 LKEALQKLASIPSSKILAVEVLWTPQNENDTLSEQELLEDYPLLRPL 401

BLAST of Cp4.1LG04g01740 vs. NCBI nr
Match: gi|1009117523|ref|XP_015875363.1| (PREDICTED: uncharacterized protein LOC107412168 [Ziziphus jujuba])

HSP 1 Score: 486.1 bits (1250), Expect = 5.8e-134
Identity = 286/411 (69.59%), Positives = 315/411 (76.64%), Query Frame = 1

Query: 6   LLEPHPLKWNR-FPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSV-----KFGVRC 65
           LLE +PL+W + FP+                 R F L +  ++ G   V     KF VRC
Sbjct: 7   LLEANPLRWKQSFPILLPR-------------RKFCLVQPAKLNGGFKVEQVSGKFSVRC 66

Query: 66  FFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMY 125
           F      K           L      + P E++A++I+ ALKALRKPAIA VLLGLLLMY
Sbjct: 67  F---SASKKHLLGLDNCSGLASKQNRRNPFEIVADTIVKALKALRKPAIAVVLLGLLLMY 126

Query: 126 DPNSALAASGGRVGGSSFSSRSSSSS-RSYSTPSMIS-GYSYSAPYSSPSLFG----GGG 185
           DPNSALAASGGRVGG +FSSRSSSSS RSYS P   S G+SYS PY +PS FG    GGG
Sbjct: 127 DPNSALAASGGRVGGKAFSSRSSSSSSRSYSVPRTSSPGFSYSVPYYAPSPFGFGGGGGG 186

Query: 186 IYVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLG 245
            YVGPA GVGVGAGSSFF IL GFAAF+LVSGFLSDRSD SVLTA+EKTSVLKLQVGLLG
Sbjct: 187 FYVGPAFGVGVGAGSSFFLILVGFAAFVLVSGFLSDRSDGSVLTATEKTSVLKLQVGLLG 246

Query: 246 MGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEK 305
           M R LQRDLNRIA++ADTS+PEGLSYVLTET LALLRHPDYCISGYS++DLKRS+EDGEK
Sbjct: 247 MARVLQRDLNRIADAADTSSPEGLSYVLTETTLALLRHPDYCISGYSAVDLKRSMEDGEK 306

Query: 306 RFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAIN 365
           RFN+LSIEERGKFDEETLVNVN+IK+QSTSSQR SGFSNEYIVITILVAAEGVHKLPAIN
Sbjct: 307 RFNQLSIEERGKFDEETLVNVNNIKKQSTSSQRASGFSNEYIVITILVAAEGVHKLPAIN 366

Query: 366 GSGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           GSGDLKEALQKL SIPS +IL         NEND+LSERELLEDYPLLRPL
Sbjct: 367 GSGDLKEALQKLGSIPSGRILAVEVLWTPQNENDSLSERELLEDYPLLRPL 401

BLAST of Cp4.1LG04g01740 vs. NCBI nr
Match: gi|703076158|ref|XP_010090270.1| (hypothetical protein L484_006177 [Morus notabilis])

HSP 1 Score: 478.8 bits (1231), Expect = 9.2e-132
Identity = 285/409 (69.68%), Positives = 318/409 (77.75%), Query Frame = 1

Query: 6   LLEPHPLKWNR-FPVHFVHSPSPSTSGRCH-LYRP----FVLPKNLRMGGSSSVKFGVRC 65
           LLE +P KW + FP+     P P      H L R     F     LR G S   K  V+C
Sbjct: 7   LLEANPSKWTQSFPIPLSAPPLPRPRSYNHDLARSSSSRFTGIIKLRHGSS---KLSVKC 66

Query: 66  FFSKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMY 125
           FF+ +  +S+ +        V  +    P EV+A++IM ALKAL+KPAIA VLLGLLLMY
Sbjct: 67  FFAAKKNQSIDSEK------VPEENRANPFEVVASTIMKALKALKKPAIAVVLLGLLLMY 126

Query: 126 DPNSALAASGGRVGGSSFSSRS--SSSSRSYSTPSMIS-GYSYSAPYSSPSLFG-GGGIY 185
           DPN+ALAASGGR+GG SFSSRS  +SSSRSYS P     G+SYS PY +PS FG GGG+Y
Sbjct: 127 DPNTALAASGGRMGGKSFSSRSPAASSSRSYSVPRTSGPGFSYSVPYYAPSPFGFGGGVY 186

Query: 186 VGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGLLGMG 245
           VGPAVGVG  AGSSFF IL GFAAF+LVSGFLSDRS+ SVLTA+EKTSVLK+QVGLLGMG
Sbjct: 187 VGPAVGVG--AGSSFFLILMGFAAFVLVSGFLSDRSEDSVLTATEKTSVLKVQVGLLGMG 246

Query: 246 RGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDGEKRF 305
           R LQRDLNRIAE+ADTSTPEGLSYVLTET LALLRHPDYCISGYSS+D+KRS+EDGEKRF
Sbjct: 247 RALQRDLNRIAETADTSTPEGLSYVLTETTLALLRHPDYCISGYSSVDIKRSMEDGEKRF 306

Query: 306 NKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPAINGS 365
           N+LSIEERGKFDEETLVNVN+IKRQST+SQR SGFSNEYIVI+ILVAAEGVHK+P ING 
Sbjct: 307 NQLSIEERGKFDEETLVNVNNIKRQSTTSQRASGFSNEYIVISILVAAEGVHKMPVINGG 366

Query: 366 GDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
            DLKEALQKL SIPS KIL         NENDTLSERELLEDYPLLRPL
Sbjct: 367 RDLKEALQKLGSIPSHKILAVEVLWTPQNENDTLSERELLEDYPLLRPL 404

BLAST of Cp4.1LG04g01740 vs. NCBI nr
Match: gi|470132290|ref|XP_004302018.1| (PREDICTED: uncharacterized protein LOC101293892 [Fragaria vesca subsp. vesca])

HSP 1 Score: 478.8 bits (1231), Expect = 9.2e-132
Identity = 284/413 (68.77%), Positives = 324/413 (78.45%), Query Frame = 1

Query: 2   VSPSLLEPHPLKWNRFPVHFVHSPSPSTSGRCHLYRPFVLPKNLRMGGSSSVKFGVRCFF 61
           ++ SLL+P PLKW+  P+ F    SPS         P + PK+L +   +S +F + CFF
Sbjct: 3   ITASLLQPTPLKWHHHPLLF----SPSRP-------PPLPPKSLHLITPTS-RFKLNCFF 62

Query: 62  SKEGGKSLTTSNSRLERLVDGDKPKRPMEVIANSIMNALKALRKPAIAAVLLGLLLMYDP 121
           S E    LTT            KP  P+EVIA+ ++NA+KAL KPA+ AVLLGL+LM DP
Sbjct: 63  SPESKPELTT------------KPN-PIEVIADKLLNAVKALSKPAMVAVLLGLVLMSDP 122

Query: 122 NSALAASGGRVGGSSFSSRSSSSS-----RSYSTP-SMISGYSYSAPYSSPSLFG----G 181
           N ALAASGGRVGG SFSSRSSSSS     RSYS P +    +SYSAPY +PS FG    G
Sbjct: 123 NPALAASGGRVGGKSFSSRSSSSSSSSSARSYSVPRTSRPDFSYSAPYYAPSPFGFGGGG 182

Query: 182 GGIYVGPAVGVGVGAGSSFFFILAGFAAFLLVSGFLSDRSDSSVLTASEKTSVLKLQVGL 241
           GG YVGPAVGVGVGAGSSFF IL GFAAF+LVSGFLSDRS+ SVLTA+EKT+VLKLQVGL
Sbjct: 183 GGFYVGPAVGVGVGAGSSFFLILTGFAAFVLVSGFLSDRSEGSVLTATEKTTVLKLQVGL 242

Query: 242 LGMGRGLQRDLNRIAESADTSTPEGLSYVLTETILALLRHPDYCISGYSSMDLKRSIEDG 301
           LG+GR LQRDL+RIA++ADTSTPEGLSYV+TET LALLRHPDYCISGYSS+ LKRSIED 
Sbjct: 243 LGLGRELQRDLDRIADTADTSTPEGLSYVMTETTLALLRHPDYCISGYSSVSLKRSIEDA 302

Query: 302 EKRFNKLSIEERGKFDEETLVNVNSIKRQSTSSQRTSGFSNEYIVITILVAAEGVHKLPA 361
           EK FN+LSIEERGKFDEETLVNVNSI+++S++SQR++GF NEYIVITILVAAEGVHKLPA
Sbjct: 303 EKGFNQLSIEERGKFDEETLVNVNSIRKRSSTSQRSNGFRNEYIVITILVAAEGVHKLPA 362

Query: 362 INGSGDLKEALQKLASIPSSKIL---------NENDTLSERELLEDYPLLRPL 396
           INGSGDLKEALQKLASIP SK+L         NENDTLSERELLEDYPLLRPL
Sbjct: 363 INGSGDLKEALQKLASIPPSKLLAVEVLWTPQNENDTLSERELLEDYPLLRPL 390

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name                          E-value    Identity (%)  Description
W9R6R7_9ROSA                        6.4e-132   69.68         Uncharacterized protein OS=Morus notabilis GN=L484_006177 PE=4 SV=1
M5VZ21_PRUPE                        4.2e-131   69.01         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005402mg PE=4 SV=1
A0A067JUH0_JATCU                    1.2e-130   68.29         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23463 PE=4 SV=1
A0A061F284_THECC                    1.0e-129   70.73         Myelin-associated oligodendrocyte basic protein isoform 1 OS=Theobroma cacao GN=...
A9PJ10_9ROSI                        5.6e-128   64.13         Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2...

vs. TAIR10
Match Name                          E-value    Identity (%)  Description
AT1G54520.1                         7.8e-113   61.31         unknown protein

vs. NCBI nr
Match Name                          E-value    Identity (%)  Description
gi|449449537|ref|XP_004142521.1|    2.0e-179   85.47         PREDICTED: uncharacterized protein LOC101210275 [Cucumis sativus]
gi|823196285|ref|XP_012492877.1|    5.8e-134   70.52         PREDICTED: uncharacterized protein LOC105804719 isoform X1 [Gossypium raimondii]
gi|1009117523|ref|XP_015875363.1|   5.8e-134   69.59         PREDICTED: uncharacterized protein LOC107412168 [Ziziphus jujuba]
gi|703076158|ref|XP_010090270.1|    9.2e-132   69.68         hypothetical protein L484_006177 [Morus notabilis]
gi|470132290|ref|XP_004302018.1|    9.2e-132   68.77         PREDICTED: uncharacterized protein LOC101293892 [Fragaria vesca subsp. vesca]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR010903  DUF1517
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g01740.1  Cp4.1LG04g01740.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term    Source Description   Alignment
IPR010903  Protein of unknown function DUF1517  PFAM     PF07466        DUF1517              coord: 126..395; score: 2.9
None       No IPR available                     PANTHER  PTHR33975      FAMILY NOT NAMED     coord: 86..395; score: 2.6E
None       No IPR available                     PANTHER  PTHR33975:SF2  SUBFAMILY NOT NAMED  coord: 86..395; score: 2.6E