Cp4.1LG14g05650 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g05650
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function (DUF668)
Location: Cp4.1LG14 : 785985 .. 787737 (-)
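
The span length implied by the Location field can be checked from its start and end positions. The sketch below assumes the location string always has the form `scaffold : start .. end (strand)` and that coordinates are 1-based and inclusive; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"scaffold": m.group(1), "start": start, "end": end,
            "strand": m.group(4), "length": end - start + 1}

loc = parse_location("Cp4.1LG14 : 785985 .. 787737 (-)")
print(loc["length"])  # 1753
```

The `(-)` strand means the feature is annotated on the reverse strand of the scaffold.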
The following sequences are available for this feature:

Gene sequence (with intron)

CAGCCCACGAGGCTTAACAACGCTAAATTCCCTCTGCCTAAAATCTCCCCCACTGTTATATCCTCATCTATCTATCTATCTATCAATTCCTTCTTCCTCCACATTAACCAAACCAAATCAATATGGCCCTCGAGACTTGGCTAATTAAGGTCAAGACCGCCGTCTCCAACAAATTCGATGTGGTTCGAACTTCAACTATCCTCAACACCAAACCCACCTCGTCTAAGAAATCGCCGAACCTCGGCGTTTTGGCCTTCGAGATTGCTGGCCTCATGTCCAAGCTTTTGCATCTATGGCACTCCCTCTCCGACCACAACATTCTTCGTCTCCGAGAATCTATTTCCCTCGAAGGTGTCCTTAAGATTGTCTCTAACGACGAGACCTTCCTCCTCGGCCTCGCCTGCGCTGAAATGACCGAAAATCTTCGCCTCCTCGCCAACTCTGTCTCCCGCTTGACCATTAAATGTCACCACGCCCATCTTCGATCCTTCCCTCGCTTCTTCCACGACTTCGCCGACTCCGGCCGCGATCTTCACAATTGGGTTCGTTGAATTACAGCTAAATTCTAAATCTTTACTCTAAATCTTCACGTTCTGTTTCTAACAATGGGTTTCATAGTTTTCTCTGTTTTTTTTTTTTTTTTTAAAATTGATGAATGTTGCATTTTGATTTACAGATTATGAGCGAGAAAGAAATGGAGTGTAGGAACAAGAGGATTGAGAGATTAGTTACATTCACTGCAGCCCTTCATAGAGAAATGAATGAGCTTTCAATTATGGAAACTGGGTTAAGAAAAGCAGTGGCAAATTTAGAGTTCTGTGATCAAGAGAAGAGTTCACCATTAGAGATCTCTCTCAAAGAGCAAAAGATCCTCGACCTCCGACACAAGATCTTATGCCAGAGACAAGAAGTAAAGTATCTAAAAGAGAAATCTCTCTGGAACAGAACATTCGACACGGTGATTTTAATTCTAGCAACCTCAATTTTCACAACTCTAGCAAGAATTAAGGTTGTTTTTGGTATTGCTCAAGCCCCATCTTCTCTCCCACGTAGCCTCTCCGCCTCTGCTGCTGTCCACCCGCTCAAGAACATAAACGACAATGGAAAAAACGCATTGTTTGAACCCAATTCGAAACTTTTAAAGCCTCCCCCGACCACATTGGGAGCGGCGGGGTTAGCCCTGCATTACTCCAACCTGATCATAGTGATGGATAAAATGATCAAGGCTCCGCAACTAGTCGGCGTGGATGCGAGAGACGATCTATACTCGATGCTACCGAAGAGCATTCGGTCGTCGTTGAGAGCACGGTTGAGAGGGGTTGGGTTCATAGCGAGCGATGCGTCGTTGGCGGGAGAATGGAGGGAGGCAATGGGGAGGATATTGGGATGGCTGTCGCCATTGTCACAAAACATGGTAAAATGGCAAAGTGAAAGGAGCTTTGAGCAGCAAAATTACATGGCACCAAAGACTAATGTAATGCTTTTGCAGACGCTTTATTTTGCTAACAAAGACAAGACAGAAGCCGCCATTACAGAATTGCTTGTGGGGTTGAATTATATTTGGAGATTTGAAAGAGAAATGACTGCCAAGGCCTTCTTTGCCTCCAACAATTTCACTGCCTCTTGATGTATATTGTATATTTTTATTATACAAAGTATATAAATTATATGATGATTAGGATCAAAATATTATGAACTTTTAATCATAATTATTCACATATTTAGTTTTAATTTGGAAATTATAATTAGATTT

mRNA sequence

CAGCCCACGAGGCTTAACAACGCTAAATTCCCTCTGCCTAAAATCTCCCCCACTGTTATATCCTCATCTATCTATCTATCTATCAATTCCTTCTTCCTCCACATTAACCAAACCAAATCAATATGGCCCTCGAGACTTGGCTAATTAAGGTCAAGACCGCCGTCTCCAACAAATTCGATGTGGTTCGAACTTCAACTATCCTCAACACCAAACCCACCTCGTCTAAGAAATCGCCGAACCTCGGCGTTTTGGCCTTCGAGATTGCTGGCCTCATGTCCAAGCTTTTGCATCTATGGCACTCCCTCTCCGACCACAACATTCTTCGTCTCCGAGAATCTATTTCCCTCGAAGGTGTCCTTAAGATTGTCTCTAACGACGAGACCTTCCTCCTCGGCCTCGCCTGCGCTGAAATGACCGAAAATCTTCGCCTCCTCGCCAACTCTGTCTCCCGCTTGACCATTAAATGTCACCACGCCCATCTTCGATCCTTCCCTCGCTTCTTCCACGACTTCGCCGACTCCGGCCGCGATCTTCACAATTGGATTATGAGCGAGAAAGAAATGGAGTGTAGGAACAAGAGGATTGAGAGATTAGTTACATTCACTGCAGCCCTTCATAGAGAAATGAATGAGCTTTCAATTATGGAAACTGGGTTAAGAAAAGCAGTGGCAAATTTAGAGTTCTGTGATCAAGAGAAGAGTTCACCATTAGAGATCTCTCTCAAAGAGCAAAAGATCCTCGACCTCCGACACAAGATCTTATGCCAGAGACAAGAAGTAAAGTATCTAAAAGAGAAATCTCTCTGGAACAGAACATTCGACACGGTGATTTTAATTCTAGCAACCTCAATTTTCACAACTCTAGCAAGAATTAAGGTTGTTTTTGGTATTGCTCAAGCCCCATCTTCTCTCCCACGTAGCCTCTCCGCCTCTGCTGCTGTCCACCCGCTCAAGAACATAAACGACAATGGAAAAAACGCATTGTTTGAACCCAATTCGAAACTTTTAAAGCCTCCCCCGACCACATTGGGAGCGGCGGGGTTAGCCCTGCATTACTCCAACCTGATCATAGTGATGGATAAAATGATCAAGGCTCCGCAACTAGTCGGCGTGGATGCGAGAGACGATCTATACTCGATGCTACCGAAGAGCATTCGGTCGTCGTTGAGAGCACGGTTGAGAGGGGTTGGGTTCATAGCGAGCGATGCGTCGTTGGCGGGAGAATGGAGGGAGGCAATGGGGAGGATATTGGGATGGCTGTCGCCATTGTCACAAAACATGGTAAAATGGCAAAGTGAAAGGAGCTTTGAGCAGCAAAATTACATGGCACCAAAGACTAATGTAATGCTTTTGCAGACGCTTTATTTTGCTAACAAAGACAAGACAGAAGCCGCCATTACAGAATTGCTTGTGGGGTTGAATTATATTTGGAGATTTGAAAGAGAAATGACTGCCAAGGCCTTCTTTGCCTCCAACAATTTCACTGCCTCTTGATGTATATTGTATATTTTTATTATACAAAGTATATAAATTATATGATGATTAGGATCAAAATATTATGAACTTTTAATCATAATTATTCACATATTTAGTTTTAATTTGGAAATTATAATTAGATTT

Coding sequence (CDS)

ATGGCCCTCGAGACTTGGCTAATTAAGGTCAAGACCGCCGTCTCCAACAAATTCGATGTGGTTCGAACTTCAACTATCCTCAACACCAAACCCACCTCGTCTAAGAAATCGCCGAACCTCGGCGTTTTGGCCTTCGAGATTGCTGGCCTCATGTCCAAGCTTTTGCATCTATGGCACTCCCTCTCCGACCACAACATTCTTCGTCTCCGAGAATCTATTTCCCTCGAAGGTGTCCTTAAGATTGTCTCTAACGACGAGACCTTCCTCCTCGGCCTCGCCTGCGCTGAAATGACCGAAAATCTTCGCCTCCTCGCCAACTCTGTCTCCCGCTTGACCATTAAATGTCACCACGCCCATCTTCGATCCTTCCCTCGCTTCTTCCACGACTTCGCCGACTCCGGCCGCGATCTTCACAATTGGATTATGAGCGAGAAAGAAATGGAGTGTAGGAACAAGAGGATTGAGAGATTAGTTACATTCACTGCAGCCCTTCATAGAGAAATGAATGAGCTTTCAATTATGGAAACTGGGTTAAGAAAAGCAGTGGCAAATTTAGAGTTCTGTGATCAAGAGAAGAGTTCACCATTAGAGATCTCTCTCAAAGAGCAAAAGATCCTCGACCTCCGACACAAGATCTTATGCCAGAGACAAGAAGTAAAGTATCTAAAAGAGAAATCTCTCTGGAACAGAACATTCGACACGGTGATTTTAATTCTAGCAACCTCAATTTTCACAACTCTAGCAAGAATTAAGGTTGTTTTTGGTATTGCTCAAGCCCCATCTTCTCTCCCACGTAGCCTCTCCGCCTCTGCTGCTGTCCACCCGCTCAAGAACATAAACGACAATGGAAAAAACGCATTGTTTGAACCCAATTCGAAACTTTTAAAGCCTCCCCCGACCACATTGGGAGCGGCGGGGTTAGCCCTGCATTACTCCAACCTGATCATAGTGATGGATAAAATGATCAAGGCTCCGCAACTAGTCGGCGTGGATGCGAGAGACGATCTATACTCGATGCTACCGAAGAGCATTCGGTCGTCGTTGAGAGCACGGTTGAGAGGGGTTGGGTTCATAGCGAGCGATGCGTCGTTGGCGGGAGAATGGAGGGAGGCAATGGGGAGGATATTGGGATGGCTGTCGCCATTGTCACAAAACATGGTAAAATGGCAAAGTGAAAGGAGCTTTGAGCAGCAAAATTACATGGCACCAAAGACTAATGTAATGCTTTTGCAGACGCTTTATTTTGCTAACAAAGACAAGACAGAAGCCGCCATTACAGAATTGCTTGTGGGGTTGAATTATATTTGGAGATTTGAAAGAGAAATGACTGCCAAGGCCTTCTTTGCCTCCAACAATTTCACTGCCTCTTGA

Protein sequence

MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHSLSDHNILRLRESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAHLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLRKAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILILATSIFTTLARIKVVFGIAQAPSSLPRSLSASAAVHPLKNINDNGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNFTAS
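
The protein sequence above corresponds to a translation of the coding sequence with the standard genetic code. A minimal sketch of that translation follows, assuming the standard nuclear code and reading from the first base; `translate` is a hypothetical helper, not a function of the database, and only the first 30 bases of the CDS are used as input here.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the coding sequence listed above.
print(translate("ATGGCCCTCGAGACTTGGCTAATTAAGGTC"))  # MALETWLIKV
```

Passing the full coding sequence in place of the 30-base prefix would be the natural way to compare the result with the listed protein.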
BLAST of Cp4.1LG14g05650 vs. TrEMBL
Match: A0A0A0L5N3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177390 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 3.5e-206
Identity = 389/471 (82.59%), Positives = 416/471 (88.32%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVR-TSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWH 60
           MALETWLIKVK AVSNKFDVVR +ST  N KPTSSKKSPN+ VL+FEIAGLMSKLLHLW+
Sbjct: 1   MALETWLIKVKNAVSNKFDVVRASSTAPNFKPTSSKKSPNVAVLSFEIAGLMSKLLHLWN 60

Query: 61  SLSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHA 120
           SLSDHNI RLR +SISLEGV KIVSND+ FLL LACAE+TENLRLLANSVS L IKC H 
Sbjct: 61  SLSDHNITRLRNQSISLEGVHKIVSNDDDFLLALACAEITENLRLLANSVSPLCIKCDHP 120

Query: 121 HLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGL 180
            LRSF R F +FADSGRDLHNW++SEKEMECRNKRIERLVT TA LHREM+ELSIMETGL
Sbjct: 121 DLRSFHRLFLEFADSGRDLHNWLLSEKEMECRNKRIERLVTLTANLHREMDELSIMETGL 180

Query: 181 RKAVANLEFCDQEKSS----PLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDT 240
           RK VA+L+ C QE+S+    PLEISLKEQKILDL+ KIL QRQEVKYLKEKSLWNRTFDT
Sbjct: 181 RKTVASLQLCQQEQSNSSTPPLEISLKEQKILDLQQKILWQRQEVKYLKEKSLWNRTFDT 240

Query: 241 VILILATSIFTTLARIKVVFGIA-QAPSSLPRSLSASAAVHPLKNINDNG--------KN 300
           VI ILA SIFTTLARIK+VFG+A Q PSSLPRSLSASAAVHPLKN+NDN         KN
Sbjct: 241 VISILARSIFTTLARIKLVFGLAHQFPSSLPRSLSASAAVHPLKNLNDNANDSDPTTTKN 300

Query: 301 ALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIR 360
             FE N KLLKPP TTLGAAGLALHY+NLIIVMDKMIK+PQLVGVDARDDLYSMLP S+R
Sbjct: 301 GFFESNLKLLKPPRTTLGAAGLALHYANLIIVMDKMIKSPQLVGVDARDDLYSMLPNSVR 360

Query: 361 SSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKT 420
           +SLRARLRGVGF ASDASLAGEWREAMGRILGW+SPL+QNM+KWQSERSFEQQNYMAPKT
Sbjct: 361 TSLRARLRGVGFTASDASLAGEWREAMGRILGWMSPLAQNMIKWQSERSFEQQNYMAPKT 420

Query: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNFTAS 457
           NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTA A FA +NF  S
Sbjct: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTANALFACSNFITS 471
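
The percentages on an HSP statistics line are the match counts divided by the alignment length (471 columns here, gaps included), not by the 457-residue query. A small sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper that assumes the `Label = matches/length` layout shown in these results.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: percent} for every 'Label = matches/length' pair."""
    result = {}
    # 'Query Frame = 1' has no slash, so it is skipped by this pattern.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 389/471 (82.59%), Positives = 416/471 (88.32%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 82.59, 'Positives': 88.32}
```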

BLAST of Cp4.1LG14g05650 vs. TrEMBL
Match: B9I4J5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0012s14310g PE=4 SV=2)

HSP 1 Score: 584.3 bits (1505), Expect = 1.3e-163
Identity = 322/472 (68.22%), Positives = 372/472 (78.81%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVKTA+S+ FD V TS   N  P  SK++ ++GVLAFEIAGLMSK+ HLW S
Sbjct: 1   MALETWLIKVKTAISHSFDSVITS---NPIPKPSKRA-SVGVLAFEIAGLMSKVFHLWQS 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+R+R +SISLEGV KIVSNDE+FLLGLACAEM ENLRL+A SVSRL+ +C  + 
Sbjct: 61  LSDKNIIRVRNDSISLEGVRKIVSNDESFLLGLACAEMAENLRLIAKSVSRLSKRCEDSG 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LR F R F DF + G D + W++S K+ME + K+++R VT TA L++EM ELS +E GLR
Sbjct: 121 LRRFERLFDDFTNLGNDANCWVLSWKDMETKTKKMDRYVTVTATLYKEMEELSALENGLR 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           KA+         +   LE + KEQK+LDL+ KIL QRQEVKYLKE+SLWNR+FDTV+LIL
Sbjct: 181 KAL---------QCGELEGTSKEQKVLDLQQKILWQRQEVKYLKERSLWNRSFDTVVLIL 240

Query: 241 ATSIFTTLARIKVVFGIAQA-PSSLPRSLSASAAVHPLKNI-----------------ND 300
           A SIFT LARIK+VFGIA   P+SLPRSLSASA VHP +N                  N 
Sbjct: 241 AKSIFTVLARIKLVFGIAHGYPTSLPRSLSASATVHPTENPTTCNIVSGPLKSSKLEGNK 300

Query: 301 NGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLP 360
           +  N  FE NSKLLKPPPTTLGAA LALHY+NLIIVM+KMIK+PQLVGVDARDDLYSMLP
Sbjct: 301 DSSNGFFESNSKLLKPPPTTLGAAALALHYANLIIVMEKMIKSPQLVGVDARDDLYSMLP 360

Query: 361 KSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYM 420
            SIRSSLRARL+GVGF ASD  LAGEWR+A+GRIL WLSPL+ NM+KWQSERSFEQQN +
Sbjct: 361 NSIRSSLRARLKGVGFSASDPVLAGEWRDALGRILAWLSPLAHNMIKWQSERSFEQQN-L 420

Query: 421 APKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNF 454
            PKTNV+LLQTL FANK+KTEAAITELLVGLNYIWRFEREMTAKAFF   NF
Sbjct: 421 LPKTNVLLLQTLSFANKEKTEAAITELLVGLNYIWRFEREMTAKAFFECANF 458

BLAST of Cp4.1LG14g05650 vs. TrEMBL
Match: A0A067JEQ7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26212 PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 4.5e-161
Identity = 317/472 (67.16%), Positives = 374/472 (79.24%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVKTA+S++ D V TS  +   P +SKKS  +GVLAFEIAGLMSKL HLW S
Sbjct: 1   MALETWLIKVKTAISHRLDSVVTSAPI---PKASKKS-TVGVLAFEIAGLMSKLFHLWQS 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+RLR ESISLEGV KIVSNDE+FLL LACAE+ ENLRL+A SVSRL+ +C  A+
Sbjct: 61  LSDKNIIRLRNESISLEGVRKIVSNDESFLLALACAEIAENLRLVAKSVSRLSKRCDDAN 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LR F R F DFA+SG D ++W+++ KEME +NK+++R VT TA L++EM ELSI+E+GL+
Sbjct: 121 LRRFERLFDDFANSGHDPNSWVLNCKEMEAKNKKMDRYVTITATLYKEMEELSILESGLK 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           KA   L++ + E ++      KEQKI+DL+ KI  QRQEVKYLKE+SLWNR+FD V+ +L
Sbjct: 181 KA---LQYSEHESTT------KEQKIMDLQQKIFWQRQEVKYLKERSLWNRSFDGVVSML 240

Query: 241 ATSIFTTLARIKVVFGIAQA-PSSLPRSLSASAAVHPLKNINDNG--------------- 300
             SIFT LARIK+VFGI    P+SLPRSLSASA VHP +N N                  
Sbjct: 241 VRSIFTVLARIKLVFGIGHGYPTSLPRSLSASATVHPTENPNTCSFVSGPLKGSELEENK 300

Query: 301 --KNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLP 360
              +  FE NSKLLKPP TTLGAA LALHY+NLIIVM+KMIK+PQLVGVDARDDLYSMLP
Sbjct: 301 DLSDGFFESNSKLLKPPETTLGAAALALHYANLIIVMEKMIKSPQLVGVDARDDLYSMLP 360

Query: 361 KSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYM 420
            SIRSSLRARL+GVGF ASD  LAGEWR+A+GRILGWLSP++ NM+KWQSERSFEQQN +
Sbjct: 361 NSIRSSLRARLKGVGFSASDPVLAGEWRDALGRILGWLSPIAHNMIKWQSERSFEQQN-L 420

Query: 421 APKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNF 454
            PKTNV+LLQTL+FAN++KTEAAITELLVGLNYIWRFEREMTAKA F   NF
Sbjct: 421 LPKTNVLLLQTLFFANQEKTEAAITELLVGLNYIWRFEREMTAKALFECANF 458

BLAST of Cp4.1LG14g05650 vs. TrEMBL
Match: M5XC82_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004872mg PE=4 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 2.4e-159
Identity = 316/480 (65.83%), Positives = 372/480 (77.50%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALET L+KVKTA+S+ FD VRTST  +  P    +  N+GVLAFEIAGLMSKL+HLW +
Sbjct: 1   MALETLLLKVKTAISHSFDAVRTST--HPHPNKPMRKSNVGVLAFEIAGLMSKLIHLWQA 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD N++RL  +SISLEGV KIVSND+ FLL LACAE+ ENLR+LA ++S L+ KC   +
Sbjct: 61  LSDKNMIRLHNDSISLEGVRKIVSNDDAFLLALACAELVENLRILATAISSLSTKCQDPN 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRN-KRIERLVTFTAALHREMNELSIMETGL 180
           LR+F R F DFADSGRD +NW++  KEM+ +N K++ER VT T+ L+REM+ELS++E+GL
Sbjct: 121 LRAFHRLFLDFADSGRDPYNWVIGFKEMDTKNVKKLERYVTVTSTLYREMDELSVLESGL 180

Query: 181 RKAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILI 240
            KA    E C+  +SS   +S KEQKI+DL+ KI+ QRQEVKYLK++SLW+R+FDTV  +
Sbjct: 181 SKAWKYNE-CETNQSSS-SMSSKEQKIVDLQQKIVWQRQEVKYLKDRSLWSRSFDTVTWV 240

Query: 241 LATSIFTTLARIKVVFGIAQAP-SSLPRSLSASAAVHP---------------------- 300
           LA SIFT LAR K+VFGI Q P SSLPRSLSASA V+P                      
Sbjct: 241 LARSIFTVLARTKLVFGIGQCPPSSLPRSLSASATVYPSDQTTCRFVSGPLKPAKSHHHQ 300

Query: 301 ---LKNINDNGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDA 360
              + N+ D      FE NSKLLKPPP+TLGAA LALHY+NLIIVM+KMIK PQ+VGVDA
Sbjct: 301 ENAIDNLKDLENIGFFESNSKLLKPPPSTLGAAALALHYANLIIVMEKMIKFPQMVGVDA 360

Query: 361 RDDLYSMLPKSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSE 420
           RDDLYSMLP SIRSSLRARLRGVGF ASD  LAGEWREA+GRILGWLSPL+ NM+KWQSE
Sbjct: 361 RDDLYSMLPTSIRSSLRARLRGVGFSASDPVLAGEWREALGRILGWLSPLAHNMIKWQSE 420

Query: 421 RSFEQQNYMAPKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNN 453
           RSFEQQN + PKTNVMLLQTL+FANKDKTEAAITELLVGLNYI RFEREMTAKA F  NN
Sbjct: 421 RSFEQQN-LVPKTNVMLLQTLFFANKDKTEAAITELLVGLNYICRFEREMTAKALFECNN 475

BLAST of Cp4.1LG14g05650 vs. TrEMBL
Match: A0A067FPI9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012421mg PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 2.3e-157
Identity = 302/466 (64.81%), Positives = 366/466 (78.54%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVK+A+S+ FD V    +      S+ K  N+GVLAFE+AG+MSKLLH W +
Sbjct: 1   MALETWLIKVKSAISSSFDSVPKPKL---SVISNNKKSNVGVLAFEVAGIMSKLLHQWQA 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+RLR +SISLEGV KIVSNDE+FLLGLACAE+ EN+RL+A +VS L+ +C  ++
Sbjct: 61  LSDKNIIRLRNDSISLEGVRKIVSNDESFLLGLACAELAENIRLVAKAVSSLSKRCEDSN 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LRSF R F +FA+SG D  +W++S K+ME +NK+++R VT TA L++EM ELS++E  L+
Sbjct: 121 LRSFDRSFEEFANSGHDSFSWVLSSKDMEAKNKKMDRYVTVTATLYKEMEELSVLENALK 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           K +   E+         E S+KEQKILDL+ K++ QRQEVKYLKE+SLWNR+FDTV+ IL
Sbjct: 181 KTLQCNEY---------EASVKEQKILDLQQKVIWQRQEVKYLKERSLWNRSFDTVVSIL 240

Query: 241 ATSIFTTLARIKVVFGIAQA--PSSLPRSLSASAAVHPLKNIND-NG----------KNA 300
              +FT LARIK+VFGI     P+ LPRSLSASA VHP +N N  NG           + 
Sbjct: 241 TRHVFTVLARIKLVFGIGHHGYPAYLPRSLSASATVHPTENPNSCNGFVSGPIMIKSSDG 300

Query: 301 LFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIRS 360
            FE NSK+LKPP TTLGAA L+LHYSNLII+++KMIK+PQLVGVDARDDLY+MLP+SIRS
Sbjct: 301 FFESNSKILKPPATTLGAAALSLHYSNLIIIIEKMIKSPQLVGVDARDDLYAMLPRSIRS 360

Query: 361 SLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKTN 420
           SLRARL+GVGF ASD  LAGEWR A+GRIL WLSPL+ NM+KWQSERSFEQQN + PKTN
Sbjct: 361 SLRARLKGVGFSASDPVLAGEWRNALGRILSWLSPLAHNMIKWQSERSFEQQN-LLPKTN 420

Query: 421 VMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNN 453
           V+LLQTLYFANK+KTEAAITELLVGLNYIWRFEREMTAKA F   N
Sbjct: 421 VLLLQTLYFANKEKTEAAITELLVGLNYIWRFEREMTAKALFECAN 453

BLAST of Cp4.1LG14g05650 vs. TAIR10
Match: AT5G51670.1 (AT5G51670.1 Protein of unknown function (DUF668))

HSP 1 Score: 482.3 bits (1240), Expect = 3.4e-136
Identity = 272/469 (58.00%), Positives = 345/469 (73.56%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALET+LIK+K A+S+K    R      + P  S  + ++GVL+FE+A +M+KLLHL HS
Sbjct: 1   MALETFLIKLKNAISSKPTSRRPH---RSSPPISTTTSSVGVLSFEVARVMTKLLHLTHS 60

Query: 61  LSDHNILRLRE-SISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           L+D N+L  R+ S+SLEG+ KIV+ DETF L L CAE+ ++L   ANSVSRL+ +C  A 
Sbjct: 61  LTDSNLLTPRDHSLSLEGLTKIVNGDETFHLSLVCAELADSLAHAANSVSRLSNRCTTAS 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LRSF R FH+FAD GRD H W+M+ K+ E +NK+IER V+ T AL+REM E++I+E  LR
Sbjct: 121 LRSFHRLFHEFADMGRDPHGWVMNCKDTEAKNKKIERYVSVTTALYREMEEMAILENSLR 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           K    +   + E+    E      K++DL++KI  Q+Q VKYLK++SLWN++FDTV+LIL
Sbjct: 181 KQSLQIGI-EFEEEEDYENKKDVMKVIDLQNKIERQKQHVKYLKDRSLWNKSFDTVVLIL 240

Query: 241 ATSIFTTLARIKVVFGIAQAP--------SSLPRSLSASAA----VHPLKNINDNGK--- 300
           A S+FT LAR+K VF  A A         SSLPRSLS+S++    VHP  N  +  K   
Sbjct: 241 ARSVFTALARLKSVFSSAAATGYMVPTVVSSLPRSLSSSSSSMNLVHPSPNDEERDKTTT 300

Query: 301 -NALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKS 360
            +A  E +S+LLKPP TTLG AG+ALHY+NLI+VM+KMIK PQLVG+DARDDLYSMLP S
Sbjct: 301 SSAFLEESSRLLKPPETTLGGAGVALHYANLIVVMEKMIKQPQLVGLDARDDLYSMLPAS 360

Query: 361 IRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAP 420
           +RSSLR+RL+GVGF A+D  LA EW+ A+GRIL WL PL+QNM++WQSERSFEQQ +MA 
Sbjct: 361 VRSSLRSRLKGVGFTATDGGLATEWKAALGRILRWLLPLAQNMIRWQSERSFEQQ-HMAT 420

Query: 421 KTN----VMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFF 449
            TN    VML+QTL FA+K KTEAAITELLVGLNYIWRFEREMTAKA F
Sbjct: 421 ATNSQNRVMLVQTLVFADKVKTEAAITELLVGLNYIWRFEREMTAKALF 464

BLAST of Cp4.1LG14g05650 vs. TAIR10
Match: AT5G04550.1 (AT5G04550.1 Protein of unknown function (DUF668))

HSP 1 Score: 183.7 bits (465), Expect = 2.5e-46
Identity = 97/185 (52.43%), Positives = 126/185 (68.11%), Query Frame = 1

Query: 270 SAAVHPLKNINDNG---KNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQ 329
           S+A H +   N N    +N       KL    P TLG A LALHY+N+IIV+++ + +P 
Sbjct: 398 SSAEHHILEDNSNSVHVENLTLPSRPKLSDAAPNTLGTACLALHYANVIIVIERFVASPH 457

Query: 330 LVGVDARDDLYSMLPKSIRSSLRARLRGVGFIAS-----DASLAGEWREAMGRILGWLSP 389
           L+G DARDDLY+MLP S+R+SLR RL+      S     D  LA EW +AM  IL WL P
Sbjct: 458 LIGDDARDDLYNMLPASVRTSLRERLKPYSKNLSSSTVYDPGLAREWTDAMAGILEWLGP 517

Query: 390 LSQNMVKWQSERSFEQQNYMAPKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFERE 447
           L+ NM+KWQSERS+E Q+ +  +T+++L QTL+FAN+ KTEA ITELLVGLNY+WRF RE
Sbjct: 518 LAHNMIKWQSERSYEHQS-LVSRTHIVLAQTLFFANQQKTEAIITELLVGLNYVWRFGRE 577

BLAST of Cp4.1LG14g05650 vs. TAIR10
Match: AT3G23160.1 (AT3G23160.1 Protein of unknown function (DUF668))

HSP 1 Score: 171.8 bits (434), Expect = 9.8e-43
Identity = 81/154 (52.60%), Positives = 118/154 (76.62%), Query Frame = 1

Query: 300 TTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIRSSLRARLRGV--GF 359
           +T+G + L+LHY+N++IV++K++K P L+G +ARDDLY MLP S++++L+A LR      
Sbjct: 359 STIGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKTTLKASLRSYLKNI 418

Query: 360 IASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKTNVMLLQTLYFAN 419
              DA LA +W+E +  IL WL+PL+ NM++WQSER+FEQQN +  +TNV+LLQTLYFA+
Sbjct: 419 SIYDAPLAHDWKETIDGILSWLAPLAHNMIRWQSERNFEQQNQIVKRTNVLLLQTLYFAD 478

Query: 420 KDKTEAAITELLVGLNYIWRFEREMTAKAFFASN 452
           ++KTEAAI +LLVGLNYI  +E++  A    AS+
Sbjct: 479 REKTEAAICKLLVGLNYICHYEQQQNALLDCASS 512

BLAST of Cp4.1LG14g05650 vs. TAIR10
Match: AT1G34320.1 (AT1G34320.1 Protein of unknown function (DUF668))

HSP 1 Score: 129.4 bits (324), Expect = 5.6e-30
Identity = 116/416 (27.88%), Positives = 192/416 (46.15%), Query Frame = 1

Query: 36  KSPNLGVLAFEIAGLMSKLLHLWHSLSDHNILRLRESI-SLEGVLKIVSNDETFLLGLAC 95
           K   + +L+FE+A  + K  +L HSLS  +I  L+E +   EGV  ++S D   LL +A 
Sbjct: 151 KGNKISILSFEVANTIVKGANLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELLRIAA 210

Query: 96  AEMTENLRLLANSVSRLTIKCHHAHLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRI 155
           A+  E LR+ +  V R   +C      +  RFF      G +       ++E E    ++
Sbjct: 211 ADKREELRIFSGEVVRFGNRCKDPQYHNLDRFFDRL---GSEFTPQKHLKQEAETIMHQM 270

Query: 156 ERLVTFTAALHREMNELSIMETGLRKAVANLEFCDQEKSSPLEISLKEQKILD----LRH 215
              V FTA L+ E++ L   E   ++ +       QE+ +P   S  ++ + D    LR 
Sbjct: 271 MSFVHFTADLYHELHALDRFEQDYQRKI-------QEEENP---STAQRGVGDTLAILRT 330

Query: 216 KILCQRQEVKYLKEKSLWNRTFDTVILILATSIFTTLARIKVVFGIAQAPSSLPRSLSAS 275
           ++  Q++ V+ LK+KSLW+R  + V+  L   +      +++                A 
Sbjct: 331 ELKSQKKHVRNLKKKSLWSRILEEVMEKLVDVVH--FLHLEIH--------------EAF 390

Query: 276 AAVHPLKNINDNGKNALFEPNSKLLKPPPTT---LGAAGLALHYSNLIIVMDKMIKAPQL 335
               P K  ND                PP     LG+AGLALHY+N+I  +D ++     
Sbjct: 391 GGADPDKPAND----------------PPINHKKLGSAGLALHYANIITQIDTLVSRSST 450

Query: 336 VGVDARDDLYSMLPKSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMV 395
           +    RD LY  LP SI+S+LR+R++   F   +     + +  M + L WL P++ N  
Sbjct: 451 MPASTRDALYQGLPPSIKSALRSRIQ--SFQVKEELTVPQIKAEMEKTLQWLVPVATNTT 510

Query: 396 K------WQSE--RSFEQQNYMAPKTNVMLLQTLYFANKDKTEAAITELLVGLNYI 436
           K      W  E   S  + N       ++ + TL+ A+K+KTEA I +L+V L+++
Sbjct: 511 KAHHGFGWVGEWASSGSEANQRPAGQTILRIDTLHHADKEKTEAYILDLVVWLHHL 519

BLAST of Cp4.1LG14g05650 vs. TAIR10
Match: AT5G08660.1 (AT5G08660.1 Protein of unknown function (DUF668))

HSP 1 Score: 122.9 bits (307), Expect = 5.2e-28
Identity = 111/409 (27.14%), Positives = 191/409 (46.70%), Query Frame = 1

Query: 36  KSPNLGVLAFEIAGLMSKLLHLWHSLSDHNILRLRESISL-EGVLKIVSNDETFLLGLAC 95
           K   LG+LAFE+A  + K  +L  SLS  NI  L+ +I   EGV  +VSND   LL L  
Sbjct: 145 KGNELGILAFEVANTIVKSSNLIESLSKRNIEHLKGTILYSEGVQNLVSNDFDELLRLVA 204

Query: 96  AEMTENLRLLANSVSRLTIKCHHAHLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRI 155
           A+  + L++ +  V R   +       +  R+F   +   ++L      +++      ++
Sbjct: 205 ADKRQELQVFSGEVVRFGNRSKDFQWHNLQRYFDRIS---KELTPQRQLKEDAVLVVDQL 264

Query: 156 ERLVTFTAALHREMNELSIMETGLRKAVANLEFCDQEKSSPLEISLKEQKILDLRHKILC 215
             LV +TA L++E+  L  +E    +        ++E S+    S K   +  L+ ++  
Sbjct: 265 MVLVQYTAELYQELQVLYRLEKDYEQKRR-----EEENSAN---SSKGDGLAILKTELKA 324

Query: 216 QRQEVKYLKEKSLWNRTFDTVILILATSIFTTLARIKVVFGIAQAPSSLPRSLSASAAVH 275
           QR+ VK LK+KSLW+R F+ V+  L   +   L  I  +FG A                 
Sbjct: 325 QRKVVKSLKKKSLWSRGFEEVMEKLVDIVHFLLLEIHNIFGGADD--------------- 384

Query: 276 PLKNINDNGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARD 335
                         +P+ K        LG AGLALHY+N+I+ +D ++     +  +ARD
Sbjct: 385 --------------QPSKKGAAEYDKRLGPAGLALHYANIIVQIDTLVARASSITSNARD 444

Query: 336 DLYSMLPKSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVK------ 395
            LY  LP  I+ +LR++++   F         + ++ M R L WL P++ N  K      
Sbjct: 445 SLYQSLPPGIKLALRSKIK--SFNVDKELSVTQIKDEMERTLHWLVPVAGNTTKAHHGFG 504

Query: 396 WQSERSFEQQNYMAPKT--NVMLLQTLYFANKDKTEAAITELLVGLNYI 436
           W  E +    ++ +  +  +++ ++TLY A+K+KTE  I   ++ L ++
Sbjct: 505 WVGEWANTGTDFTSKPSGGDILRIETLYHASKEKTEIYILGQIIWLQHL 511

BLAST of Cp4.1LG14g05650 vs. NCBI nr
Match: gi|659077271|ref|XP_008439117.1| (PREDICTED: uncharacterized protein LOC103484007 [Cucumis melo])

HSP 1 Score: 733.0 bits (1891), Expect = 3.1e-208
Identity = 392/471 (83.23%), Positives = 418/471 (88.75%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVR-TSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWH 60
           MALETWLIKVK AVSNKFDVVR +ST  N KPTSSKKSPN+ VL+FEIAGLMSKLLHLW+
Sbjct: 1   MALETWLIKVKNAVSNKFDVVRASSTTPNFKPTSSKKSPNVAVLSFEIAGLMSKLLHLWN 60

Query: 61  SLSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHA 120
           SLSDHNI RLR +SISLEGV KIVSND+ FLL LACAE+TENLRLLANSVS L IKC H 
Sbjct: 61  SLSDHNITRLRNQSISLEGVHKIVSNDDDFLLALACAEITENLRLLANSVSPLCIKCDHP 120

Query: 121 HLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGL 180
            LRSF R F +FADSGRDLHNWI+SEKEMECRNKRIERLVT TA LHREM+ELSIMETGL
Sbjct: 121 DLRSFHRLFLEFADSGRDLHNWILSEKEMECRNKRIERLVTMTANLHREMDELSIMETGL 180

Query: 181 RKAVANLEFCDQEKSS----PLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDT 240
           RKAV NL+ C QE+++    PLEISLKEQKILDL+ KIL QRQEVKYLKEKSLWN+TFDT
Sbjct: 181 RKAVTNLQLCQQEQNNSSTPPLEISLKEQKILDLQQKILWQRQEVKYLKEKSLWNKTFDT 240

Query: 241 VILILATSIFTTLARIKVVFGIA-QAPSSLPRSLSASAAVHPLKNINDNGK--------N 300
           VI ILA SIFTTLARIK+VFG+A Q PSSLPRSLSASAAVHPLKN+NDN K        N
Sbjct: 241 VISILARSIFTTLARIKLVFGLAHQFPSSLPRSLSASAAVHPLKNLNDNAKDSDPTTTKN 300

Query: 301 ALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIR 360
             FE N KLLKPPPTTLGAAGLALHY+NLIIVMDKMIK+PQLVGVDARDDLYSMLP S+R
Sbjct: 301 GFFESNLKLLKPPPTTLGAAGLALHYANLIIVMDKMIKSPQLVGVDARDDLYSMLPNSVR 360

Query: 361 SSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKT 420
           +SLRARLRGVGF ASDASLAGEWREAMGRILGW+SPL+QNM+KWQSERSFEQQNYMAPKT
Sbjct: 361 TSLRARLRGVGFTASDASLAGEWREAMGRILGWMSPLAQNMIKWQSERSFEQQNYMAPKT 420

Query: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNFTAS 457
           NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTA A FA NNF AS
Sbjct: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTANALFACNNFIAS 471

BLAST of Cp4.1LG14g05650 vs. NCBI nr
Match: gi|449460852|ref|XP_004148158.1| (PREDICTED: uncharacterized protein LOC101216982 [Cucumis sativus])

HSP 1 Score: 725.7 bits (1872), Expect = 5.0e-206
Identity = 389/471 (82.59%), Positives = 416/471 (88.32%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVR-TSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWH 60
           MALETWLIKVK AVSNKFDVVR +ST  N KPTSSKKSPN+ VL+FEIAGLMSKLLHLW+
Sbjct: 1   MALETWLIKVKNAVSNKFDVVRASSTAPNFKPTSSKKSPNVAVLSFEIAGLMSKLLHLWN 60

Query: 61  SLSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHA 120
           SLSDHNI RLR +SISLEGV KIVSND+ FLL LACAE+TENLRLLANSVS L IKC H 
Sbjct: 61  SLSDHNITRLRNQSISLEGVHKIVSNDDDFLLALACAEITENLRLLANSVSPLCIKCDHP 120

Query: 121 HLRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGL 180
            LRSF R F +FADSGRDLHNW++SEKEMECRNKRIERLVT TA LHREM+ELSIMETGL
Sbjct: 121 DLRSFHRLFLEFADSGRDLHNWLLSEKEMECRNKRIERLVTLTANLHREMDELSIMETGL 180

Query: 181 RKAVANLEFCDQEKSS----PLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDT 240
           RK VA+L+ C QE+S+    PLEISLKEQKILDL+ KIL QRQEVKYLKEKSLWNRTFDT
Sbjct: 181 RKTVASLQLCQQEQSNSSTPPLEISLKEQKILDLQQKILWQRQEVKYLKEKSLWNRTFDT 240

Query: 241 VILILATSIFTTLARIKVVFGIA-QAPSSLPRSLSASAAVHPLKNINDNG--------KN 300
           VI ILA SIFTTLARIK+VFG+A Q PSSLPRSLSASAAVHPLKN+NDN         KN
Sbjct: 241 VISILARSIFTTLARIKLVFGLAHQFPSSLPRSLSASAAVHPLKNLNDNANDSDPTTTKN 300

Query: 301 ALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLPKSIR 360
             FE N KLLKPP TTLGAAGLALHY+NLIIVMDKMIK+PQLVGVDARDDLYSMLP S+R
Sbjct: 301 GFFESNLKLLKPPRTTLGAAGLALHYANLIIVMDKMIKSPQLVGVDARDDLYSMLPNSVR 360

Query: 361 SSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYMAPKT 420
           +SLRARLRGVGF ASDASLAGEWREAMGRILGW+SPL+QNM+KWQSERSFEQQNYMAPKT
Sbjct: 361 TSLRARLRGVGFTASDASLAGEWREAMGRILGWMSPLAQNMIKWQSERSFEQQNYMAPKT 420

Query: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNFTAS 457
           NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTA A FA +NF  S
Sbjct: 421 NVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTANALFACSNFITS 471

BLAST of Cp4.1LG14g05650 vs. NCBI nr
Match: gi|743856723|ref|XP_011029990.1| (PREDICTED: uncharacterized protein LOC105129571 [Populus euphratica])

HSP 1 Score: 588.2 bits (1515), Expect = 1.2e-164
Identity = 323/472 (68.43%), Positives = 374/472 (79.24%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVKTA+S+ FD V TS   N  P  SK++ ++GVLAFEIAGLMSKL HLW S
Sbjct: 1   MALETWLIKVKTAISHSFDSVITS---NPIPKPSKRA-SVGVLAFEIAGLMSKLFHLWQS 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+R+R +SISLEGV KIVSNDE+FLLGLACAEM ENLRL+A SVSRL+ +C  + 
Sbjct: 61  LSDKNIIRVRNDSISLEGVRKIVSNDESFLLGLACAEMAENLRLIAKSVSRLSKRCEDSG 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LR F R F DF + G D + W++S K+ME + K+++R VT TA L++EM ELS +E GLR
Sbjct: 121 LRRFERLFDDFTNLGNDANCWVLSWKDMEAKTKKMDRYVTVTATLYKEMEELSTLENGLR 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           KA+         +   LE + KEQK+LDL+ KIL QRQEVKYLKE+SLWNR+FDTV+LIL
Sbjct: 181 KAL---------QCGELEGTTKEQKVLDLQQKILWQRQEVKYLKERSLWNRSFDTVVLIL 240

Query: 241 ATSIFTTLARIKVVFGIAQA-PSSLPRSLSASAAVHPLKNI-----------------ND 300
           A SIFT LARIK+VFGIA   P+SLPRSLSASA VHP +N                  N 
Sbjct: 241 AKSIFTVLARIKLVFGIAHGYPTSLPRSLSASATVHPTENPTTCNIVSGPLKSSKLEGNK 300

Query: 301 NGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLP 360
           +  N  FE NSKLLKPPPTTLGAA LALHY+NLIIVM+KMIK+PQLVGVDARDDLYSMLP
Sbjct: 301 DPSNGFFESNSKLLKPPPTTLGAAALALHYANLIIVMEKMIKSPQLVGVDARDDLYSMLP 360

Query: 361 KSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYM 420
            SIRSSLRARL+GVGF ASD  LAGEWR+A+GRIL WLSPL+ NM+KWQSERSFEQQN +
Sbjct: 361 NSIRSSLRARLKGVGFSASDPVLAGEWRDALGRILAWLSPLAHNMIKWQSERSFEQQN-L 420

Query: 421 APKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNF 454
           +PKTNV+LLQTL+FANK+KTEAAITELLVGLNYIWRFEREMTAKAFF   NF
Sbjct: 421 SPKTNVLLLQTLFFANKEKTEAAITELLVGLNYIWRFEREMTAKAFFECANF 458

BLAST of Cp4.1LG14g05650 vs. NCBI nr
Match: gi|566198458|ref|XP_002318869.2| (hypothetical protein POPTR_0012s14310g [Populus trichocarpa])

HSP 1 Score: 584.3 bits (1505), Expect = 1.8e-163
Identity = 322/472 (68.22%), Positives = 372/472 (78.81%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVKTA+S+ FD V TS   N  P  SK++ ++GVLAFEIAGLMSK+ HLW S
Sbjct: 1   MALETWLIKVKTAISHSFDSVITS---NPIPKPSKRA-SVGVLAFEIAGLMSKVFHLWQS 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+R+R +SISLEGV KIVSNDE+FLLGLACAEM ENLRL+A SVSRL+ +C  + 
Sbjct: 61  LSDKNIIRVRNDSISLEGVRKIVSNDESFLLGLACAEMAENLRLIAKSVSRLSKRCEDSG 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LR F R F DF + G D + W++S K+ME + K+++R VT TA L++EM ELS +E GLR
Sbjct: 121 LRRFERLFDDFTNLGNDANCWVLSWKDMETKTKKMDRYVTVTATLYKEMEELSALENGLR 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           KA+         +   LE + KEQK+LDL+ KIL QRQEVKYLKE+SLWNR+FDTV+LIL
Sbjct: 181 KAL---------QCGELEGTSKEQKVLDLQQKILWQRQEVKYLKERSLWNRSFDTVVLIL 240

Query: 241 ATSIFTTLARIKVVFGIAQA-PSSLPRSLSASAAVHPLKNI-----------------ND 300
           A SIFT LARIK+VFGIA   P+SLPRSLSASA VHP +N                  N 
Sbjct: 241 AKSIFTVLARIKLVFGIAHGYPTSLPRSLSASATVHPTENPTTCNIVSGPLKSSKLEGNK 300

Query: 301 NGKNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLP 360
           +  N  FE NSKLLKPPPTTLGAA LALHY+NLIIVM+KMIK+PQLVGVDARDDLYSMLP
Sbjct: 301 DSSNGFFESNSKLLKPPPTTLGAAALALHYANLIIVMEKMIKSPQLVGVDARDDLYSMLP 360

Query: 361 KSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYM 420
            SIRSSLRARL+GVGF ASD  LAGEWR+A+GRIL WLSPL+ NM+KWQSERSFEQQN +
Sbjct: 361 NSIRSSLRARLKGVGFSASDPVLAGEWRDALGRILAWLSPLAHNMIKWQSERSFEQQN-L 420

Query: 421 APKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNF 454
            PKTNV+LLQTL FANK+KTEAAITELLVGLNYIWRFEREMTAKAFF   NF
Sbjct: 421 LPKTNVLLLQTLSFANKEKTEAAITELLVGLNYIWRFEREMTAKAFFECANF 458

BLAST of Cp4.1LG14g05650 vs. NCBI nr
Match: gi|802769547|ref|XP_012090387.1| (PREDICTED: uncharacterized protein LOC105648569 [Jatropha curcas])

HSP 1 Score: 575.9 bits (1483), Expect = 6.4e-161
Identity = 317/472 (67.16%), Positives = 374/472 (79.24%), Query Frame = 1

Query: 1   MALETWLIKVKTAVSNKFDVVRTSTILNTKPTSSKKSPNLGVLAFEIAGLMSKLLHLWHS 60
           MALETWLIKVKTA+S++ D V TS  +   P +SKKS  +GVLAFEIAGLMSKL HLW S
Sbjct: 1   MALETWLIKVKTAISHRLDSVVTSAPI---PKASKKS-TVGVLAFEIAGLMSKLFHLWQS 60

Query: 61  LSDHNILRLR-ESISLEGVLKIVSNDETFLLGLACAEMTENLRLLANSVSRLTIKCHHAH 120
           LSD NI+RLR ESISLEGV KIVSNDE+FLL LACAE+ ENLRL+A SVSRL+ +C  A+
Sbjct: 61  LSDKNIIRLRNESISLEGVRKIVSNDESFLLALACAEIAENLRLVAKSVSRLSKRCDDAN 120

Query: 121 LRSFPRFFHDFADSGRDLHNWIMSEKEMECRNKRIERLVTFTAALHREMNELSIMETGLR 180
           LR F R F DFA+SG D ++W+++ KEME +NK+++R VT TA L++EM ELSI+E+GL+
Sbjct: 121 LRRFERLFDDFANSGHDPNSWVLNCKEMEAKNKKMDRYVTITATLYKEMEELSILESGLK 180

Query: 181 KAVANLEFCDQEKSSPLEISLKEQKILDLRHKILCQRQEVKYLKEKSLWNRTFDTVILIL 240
           KA   L++ + E ++      KEQKI+DL+ KI  QRQEVKYLKE+SLWNR+FD V+ +L
Sbjct: 181 KA---LQYSEHESTT------KEQKIMDLQQKIFWQRQEVKYLKERSLWNRSFDGVVSML 240

Query: 241 ATSIFTTLARIKVVFGIAQA-PSSLPRSLSASAAVHPLKNINDNG--------------- 300
             SIFT LARIK+VFGI    P+SLPRSLSASA VHP +N N                  
Sbjct: 241 VRSIFTVLARIKLVFGIGHGYPTSLPRSLSASATVHPTENPNTCSFVSGPLKGSELEENK 300

Query: 301 --KNALFEPNSKLLKPPPTTLGAAGLALHYSNLIIVMDKMIKAPQLVGVDARDDLYSMLP 360
              +  FE NSKLLKPP TTLGAA LALHY+NLIIVM+KMIK+PQLVGVDARDDLYSMLP
Sbjct: 301 DLSDGFFESNSKLLKPPETTLGAAALALHYANLIIVMEKMIKSPQLVGVDARDDLYSMLP 360

Query: 361 KSIRSSLRARLRGVGFIASDASLAGEWREAMGRILGWLSPLSQNMVKWQSERSFEQQNYM 420
            SIRSSLRARL+GVGF ASD  LAGEWR+A+GRILGWLSP++ NM+KWQSERSFEQQN +
Sbjct: 361 NSIRSSLRARLKGVGFSASDPVLAGEWRDALGRILGWLSPIAHNMIKWQSERSFEQQN-L 420

Query: 421 APKTNVMLLQTLYFANKDKTEAAITELLVGLNYIWRFEREMTAKAFFASNNF 454
            PKTNV+LLQTL+FAN++KTEAAITELLVGLNYIWRFEREMTAKA F   NF
Sbjct: 421 LPKTNVLLLQTLFFANQEKTEAAITELLVGLNYIWRFEREMTAKALFECANF 458
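
The identity and positives percentages in the HSP summary for the Jatropha curcas match can be recomputed from the raw counts. Below is a minimal sketch in Python, assuming the summary line has the form shown in this report. The helper name `parse_hsp_stats` is hypothetical and is not part of any BLAST toolkit.

```python
import re

# Hypothetical helper for illustration only; not part of any BLAST toolkit.
# The pattern accepts both "Positives" and the "Postives" spelling that
# some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        # Matching residues divided by alignment length (gaps included).
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        # Identical plus conservatively substituted residues.
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }

line = "Identity = 317/472 (67.16%), Positives = 374/472 (79.24%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats)
```

The denominator (472) is the alignment length including gap columns, which is why it exceeds the 454-residue query.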

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0L5N3_CUCSA  3.5e-206  82.59     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177390 PE=4 SV=1
B9I4J5_POPTR      1.3e-163  68.22     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0012s14310g PE=4 SV=2
A0A067JEQ7_JATCU  4.5e-161  67.16     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26212 PE=4 SV=1
M5XC82_PRUPE      2.4e-159  65.83     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004872mg PE=4 SV=1
A0A067FPI9_CITSI  2.3e-157  64.81     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012421mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G51670.1  3.4e-136  58.00     Protein of unknown function (DUF668)
AT5G04550.1  2.5e-46   52.43     Protein of unknown function (DUF668)
AT3G23160.1  9.8e-43   52.60     Protein of unknown function (DUF668)
AT1G34320.1  5.6e-30   27.88     Protein of unknown function (DUF668)
AT5G08660.1  5.2e-28   27.14     Protein of unknown function (DUF668)

Match Name                        E-value   Identity  Description
gi|659077271|ref|XP_008439117.1|  3.1e-208  83.23     PREDICTED: uncharacterized protein LOC103484007 [Cucumis melo]
gi|449460852|ref|XP_004148158.1|  5.0e-206  82.59     PREDICTED: uncharacterized protein LOC101216982 [Cucumis sativus]
gi|743856723|ref|XP_011029990.1|  1.2e-164  68.43     PREDICTED: uncharacterized protein LOC105129571 [Populus euphratica]
gi|566198458|ref|XP_002318869.2|  1.8e-163  68.22     hypothetical protein POPTR_0012s14310g [Populus trichocarpa]
gi|802769547|ref|XP_012090387.1|  6.4e-161  67.16     PREDICTED: uncharacterized protein LOC105648569 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR021864  DUF3475
IPR007700  DUF668

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g05650.1  Cp4.1LG14g05650.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term    Source Description             Alignment
IPR007700  Protein of unknown function DUF668   PFAM     PF05003        DUF668                         coord: 301..388; score: 8.6
IPR021864  Protein of unknown function DUF3475  PFAM     PF11961        DUF3475                        coord: 42..97; score: 1.9
None       No IPR available                     PANTHER  PTHR31371      FAMILY NOT NAMED               coord: 2..453; score: 8.3E
None       No IPR available                     PANTHER  PTHR31371:SF4  SIMILARITY TO UNKNOWN PROTEIN  coord: 2..453; score: 8.3E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG14g05650  Cp4.1LG09g01490  Cucurbita pepo (Zucchini)  cpecpeB023

The following block(s) cover this gene:
Gene             Organism                    Block
Cp4.1LG14g05650  Silver-seed gourd           carcpeB0998
Cp4.1LG14g05650  Silver-seed gourd           carcpeB1272
Cp4.1LG14g05650  Cucumber (Chinese Long) v3  cpecucB0281
Cp4.1LG14g05650  Cucurbita pepo (Zucchini)   cpecpeB232
Cp4.1LG14g05650  Cucurbita pepo (Zucchini)   cpecpeB245
Cp4.1LG14g05650  Cucurbita moschata (Rifu)   cmocpeB557
Cp4.1LG14g05650  Cucumber (Gy14) v2          cgybcpeB746
Cp4.1LG14g05650  Melon (DHL92) v3.6.1        cpemedB219