Cp4.1LG01g19760 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g19760
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Transmembrane protein, putative
Location: Cp4.1LG01 : 16917418 .. 16922042 (+)
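
As a quick check on the Location field, the genomic span of the gene can be computed from its two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bases of a feature given start and end coordinates.

    Assumption: both coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end coordinate must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
print(feature_length(16917418, 16922042))  # 4625
```

Under that assumption the gene, including its intron sequence, spans 4625 bases on the plus strand of Cp4.1LG01.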
The following sequences are available for this feature:

Gene sequence (with intron)

TATGTATAGTCAAGTAGGCGTGCTTCCTCAAGGTTGGGCCCGCGGGTTAAATGGGTTGCAAAAGGACTAATTTCTTTATGATTACCCTTCATTTTCTCCGATTTCCCTCTTGGAAACCCCTTGTTTAATTGGTCATGAAGTATGAACCCACGAAGTACAAAAGTTCATACTGAGCTGCAATTCGCATTTCTGTTTGGAATTATAATCTGAATCTCTACACCAGATTGTGAAGCAACAGAACAATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTAATATTTCCCTATATGCTTTTCTGATAACCATCTGCCCTCCCCTTGTTGATCGTTTTATAGTTAAATGAATGATTAATAGCTACTGTCAAGCAGCTCTTCTTTTGTATATTAGGACTGTTTAATGGAGTCATGATTCGGCTCACGATCCCTTTTTTGCCATTATAATAATCGTTTGATGTTCCATAGTGAGTGTGTTGGCTGTATCGATTGAATATAAAATACTGATTATTTGGGGGTGTTGAGTAAAATGTCGTAGATCAGAAACTACACTTGTCAACCTCCGATCGATCTACGATCACGAGTAGGTAGGCTGCTAGGGGGAGGGGCGGCTGGGCGGCCTTTTCAATCAATATCAGACTGACCCCTAATTACTAGACCAACCTGGTGCGAGTTGTAAGGAGTTTTGGTCTCGAGTTGAAGGAATGTGGAAGATACAGGTTGTAAACAGGGGAAAGTAGTTGGAATTTGGAGAAGGATGGTGGATGAAACTAGAGAAAACCTTAGGCTTCCAAGATGTAATTTGGATGGTCAAAATTCCCAAAGAGATTCGATATTTTTTCACCTGGCTATAAGTTCATAGGAATATAGTTGATAATGGATTGGTGCAATAGAAATTCTTGGTCGACTGTTCCNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATGTTTCTTTTGCTTAATGATAGATCCAGCTTCTTGTTAAATAATTAAAAAAAGAAAACTGCAGAATGGTGGAATATTATTATTATTGTTCTTTCTCTAGTAGGTCAAAGAGACCATTCCACTTGGTCAGTGGGGATAATAAGGCCAGAACAATAGTAGGATATCATTGTGTGGTTAAAAGTAATCTTGATTGCATTATGGTGCCATATGAGCGTGAAGAATGGTCTCCTTGAGCTTCTTTCATATTAAGAATGAGGATTCGAGGTTCTTCAAGCTGCCAACTCTATAATAAAGAAGATAGTTGTTGGAGTGAATTGAAAATGAACTGCTTGTTGGTTTGGTTTAGCTCAATTCTTTGTAGTCAGTAGTTGTTAATAGTTAAATTCCTGGGTTCTTAAGTAGTCTGCTTTGTTAGTAAAAGAGATTGTTTTGAATTTAATAAAGATTCCACGAGGACATTTTGTGATGCCAAGCAGTTGTATTAAAGACTTCTCGAGTCTTACGAGTTATGGTGGATCTCTTGGCGAAAGAACACATAAAGTGAGAAAGATGAAAACTCACGTATACATATTCGTTCTTTAGAATAGGTTATAGGGAAGTTCATATTGGTTTTTTTTTCTGATGGTGTAGTTTGTGCTCTTCTTCTTATTCAAATAACAATTTTAAAACTAATCTTCTTCATCTTGGCTTTTCCTTTACTGTTTAATCTTTTGTCATAACTGTTTTCTGGTGGAAAACC
TGTTAGCTCTCTGAGATTTATAAAAACTGACATGGTTAATTTCATTTTAGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGTTGGTTCTATATCAAGCCTCTTTATTTCTATTTAAATATTGGTTGTGAATACTATAATGTATCGAGTCGTTAGAAGCATTTTGTGTCTGTGATAGTTCCATTTCACACTCACATTTCAAGTTATAGGCTAAGTAATGATGCCTTGGATCTGGTATATAAAAACTTGTTAGCTTTTCTCAGCTAAAATCTCAATAAATGGATGGAATAACTTCAAAGAAGCCATCAGATAAATTTCAGAAGAAACTATGCAAATCTATGCAATAGGATTATAGTCGTGCATTGATGGAATAACTTTCATAGTTCTTATGTTATTACTTCTGTTTTCAGGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTATAGTTAACTTTCAGTTTACTTTAAACCGGTACTCAATTCGTACTCTGTTTATGCTGACTGATGTGCATCAGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTAGTGTTACTACTTATTTACCGTTATTTCCTTCTCTCTCTACTTCCACTAGCTATAAGAATTGAATCCACAATGTCTATTTCCAATAAATTCCATTTAGTTGATAGAAATCCTGGATATTAGTCACTTTTGTAATCCTTTCCCTGAGTAATGTGCATCTGCATTTCAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTAACTGCCTCATTGTGTGATTCTATTCTTCAAATGTAGTCCTGATTTCAAATACATGCTTAACGGAATAATTGTTCAGTCAAAGACCCCCATAACTTCTGTATGCCTGTAAAACAATTTCGGACCAGTTTCATTTGTGACTCTGTTTTCAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGTATCTTTGAATACTCTATCTATCATTTTACATCATGTAGGAACATAATTTGTTTCTTTGAGGAACATCTGTACAACTGTTCCTAACATAACATTACAAATATGCTCAGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGG
CCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAGTCATCTTATGCTTCTGGTTTGTCCTGTAAGCTTTATAACTAACATAGTTCTTATAACTTTTGCCAAACTCTGCTTATTTTGTCAATTTTGCTCGGTCCAGGAATTTGTTTCTGTTAAATCAAACGTCCCAGTAATGTCTTCTTGTATTTATTGGGCCATTTGATTTACAAAACAAAGTCTTATATGCTGTCAAGAGAGTCCAATTTCATTTTCAGTCGATACCATTGGATCAAATATGCATAAGACGGCATTTTCCCTTTTTGCTCCCCCAGTGAACAGTTTAGTATTGGCTTATAGTTCCAAGGATGTCTTGAATAGTATTAGTGGGATATTGAATGATTGGGTTGTACTTGTTCTTGGTTGTTGATTGCCACCAAAGGTAGGTGGCGGTAGATTTACACT

mRNA sequence

TATGTATAGTCAAGTAGGCGTGCTTCCTCAAGGTTGGGCCCGCGGGTTAAATGGGTTGCAAAAGGACTAATTTCTTTATGATTACCCTTCATTTTCTCCGATTTCCCTCTTGGAAACCCCTTGTTTAATTGGTCATGAAGTATGAACCCACGAAGTACAAAAGTTCATACTGAGCTGCAATTCGCATTTCTGTTTGGAATTATAATCTGAATCTCTACACCAGATTGTGAAGCAACAGAACAATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGCTAAGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGGCCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAGTCATCTTATGCTTCTGGTTTGTCCTGTAAGCTTTATAACTAACATAGTTCTTATAACTTTTGCCAAACTCTGCTTATTTTGTCAATTTTGCTCGGTCCA
GGAATTTGTTTCTGTTAAATCAAACGTCCCAGTAATGTCTTCTTGTATTTATTGGGCCATTTGATTTACAAAACAAAGTCTTATATGCTGTCAAGAGAGTCCAATTTCATTTTCAGTCGATACCATTGGATCAAATATGCATAAGACGGCATTTTCCCTTTTTGCTCCCCCAGTGAACAGTTTAGTATTGGCTTATAGTTCCAAGGATGTCTTGAATAGTATTAGTGGGATATTGAATGATTGGGTTGTACTTGTTCTTGGTTGTTGATTGCCACCAAAGGTAGGTGGCGGTAGATTTACACT

Coding sequence (CDS)

ATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGCTAAGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGGCCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAG

Protein sequence

MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEESRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNAKKNVVRTIPFIL
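
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard nuclear genetic code and uses only the Python standard library; `CODON_TABLE` and `translate` are illustrative names, and the two inputs are the first 30 and last 15 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases such as N are not handled
    and raise a KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS: matches the start of the protein.
print(translate("ATGAGGCATGGTGGATCCAGGAAGAAAAGA"))  # MRHGGSRKKR
# Last 15 nucleotides of the CDS: matches the end of the protein.
print(translate("CCTTTCATATTATAG"))  # PFIL
```

Both fragments agree with the protein sequence, which begins with MRHGGSRKKR and ends with PFIL, followed by the TAG stop codon.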
BLAST of Cp4.1LG01g19760 vs. TrEMBL
Match: A0A0A0KNC7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G429950 PE=4 SV=1)

HSP 1 Score: 988.8 bits (2555), Expect = 2.7e-285
Identity = 486/554 (87.73%), Positives = 512/554 (92.42%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
           MRHGGSR+KRSSSF RY+V+LCAVGA+I FLMLN LMRMEA     SSDQ GNG+  EE 
Sbjct: 1   MRHGGSRRKRSSSFVRYLVLLCAVGAAICFLMLNVLMRMEA-----SSDQYGNGERFEEP 60

Query: 61  RVLSE-MDGRRS-CATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120
              +  M+GRRS CA VEQMG+ FKDGV KESLRVRTIIQNHFYLNGASRVRQLPPEQFC
Sbjct: 61  PAQTTGMEGRRSSCAMVEQMGDPFKDGVRKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120

Query: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTF 180
           KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR   GKFPFGDYISYS+++F
Sbjct: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISF 180

Query: 181 TMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240
           T+KEIKHLWRL GC++KFNR LIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA
Sbjct: 181 TLKEIKHLWRLNGCVKKFNRRLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240

Query: 241 AQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDD 300
           AQFFLKN+HP MRAAASNLFG PEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSG D
Sbjct: 241 AQFFLKNIHPTMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKNVEEAVFSVLKSGAD 300

Query: 301 PDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEF 360
           PDISLHMRMLMNRSVRGLQAA+QCIRK + NLT  SKPRLVLVSDTPNFVKSIVP+L EF
Sbjct: 301 PDISLHMRMLMNRSVRGLQAAVQCIRKAMLNLTGLSKPRLVLVSDTPNFVKSIVPILDEF 360

Query: 361 AEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHR 420
           AEVIHFDYEHFRG ISGT DEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHR
Sbjct: 361 AEVIHFDYEHFRGNISGTDDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHR 420

Query: 421 RVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFA 480
           RVGTTYAQLIAALAAA+NLDNLG  STGSDF FLSSFQSNLLREGLKNQ+GWGHIWNRFA
Sbjct: 421 RVGTTYAQLIAALAAANNLDNLGNKSTGSDFLFLSSFQSNLLREGLKNQIGWGHIWNRFA 480

Query: 481 GPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFC 540
           GPLSCPSQPNQCA+TPLLPPAWWDGLWQSPIPRD+KRMENYGVHL+ FGT+DEDSLRSFC
Sbjct: 481 GPLSCPSQPNQCAVTPLLPPAWWDGLWQSPIPRDVKRMENYGVHLTSFGTVDEDSLRSFC 540

Query: 541 NAKKNVVRTIPFIL 553
           NAKKNV+RTIPFIL
Sbjct: 541 NAKKNVLRTIPFIL 546
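
The percentages in the HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic for this hit follows; `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match the values displayed in the record.

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts from the HSP against A0A0A0KNC7_CUCSA (alignment length 554).
print(percent(486, 554))  # identical positions: 87.73
print(percent(512, 554))  # positive-scoring positions: 92.42
```

The denominator is the alignment length, including gapped columns, so it can differ from the length of either the query or the subject sequence.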

BLAST of Cp4.1LG01g19760 vs. TrEMBL
Match: A0A061EKP0_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_017276 PE=4 SV=1)

HSP 1 Score: 771.9 bits (1992), Expect = 5.1e-220
Identity = 375/566 (66.25%), Positives = 445/566 (78.62%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSD----------- 60
           MR+GGSRKKR+    R+ ++LCA    I +LML  L  ++   + +++            
Sbjct: 1   MRYGGSRKKRA--LVRWFLILCAAFTFISWLMLLTLRSIDTPPTTTTTKTTDVALVDLPG 60

Query: 61  ----QLGNGDDVEESRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNG 120
               QL   D V  S    +    +SCATVE+MG++FK  + KESL VR IIQ HF +NG
Sbjct: 61  KLEHQLFQRDGVLSSAEAPKKASAKSCATVEEMGKSFKGRILKESLGVRRIIQRHFSVNG 120

Query: 121 ASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKF 180
           ASR+R+LPPEQFC+HGFV+GK+SEAGFGNEMYKILTA ALS+MLNRSLIIGQTR   GK+
Sbjct: 121 ASRIRELPPEQFCRHGFVIGKASEAGFGNEMYKILTAAALSVMLNRSLIIGQTR---GKY 180

Query: 181 PFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEH 240
           PFGDYI YSN+TFT++E+KHLWR  GC + + RHL+MRTDDFEKP +TN LC NW++W  
Sbjct: 181 PFGDYILYSNLTFTLREVKHLWRQNGCAKIYGRHLVMRTDDFEKPTKTNALCGNWRKWRQ 240

Query: 241 PIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDV 300
           PIIW+QGTTDAVAAQFFLKN+HP MR AAS LFG PE L SRPNVFGELMR+LISPS+D+
Sbjct: 241 PIIWYQGTTDAVAAQFFLKNIHPDMRNAASELFGKPESLRSRPNVFGELMRILISPSRDI 300

Query: 301 EEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTP 360
           EEAV  VL  G DPDI+LHMRMLMNR VR  QAAL C+R+  RNL   S+PR+V+VSDTP
Sbjct: 301 EEAVNWVLCGGRDPDITLHMRMLMNRPVRAAQAALNCLRRATRNLQQGSRPRVVVVSDTP 360

Query: 361 NFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLA 420
           +FVKSI P + EFAEV+HFDY+ FRG  S        LDFRVKDWGP+PRWVAFVDFFLA
Sbjct: 361 SFVKSITPNISEFAEVLHFDYKLFRGNASHDIKASPNLDFRVKDWGPAPRWVAFVDFFLA 420

Query: 421 SRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLK 480
           S AK AV+SGAHRRVGTTYAQLIAALAAA   +++G+NSTGS F FLSSFQSNLL +GLK
Sbjct: 421 SSAKHAVVSGAHRRVGTTYAQLIAALAAA---NSIGENSTGSSFSFLSSFQSNLLADGLK 480

Query: 481 NQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSG 540
            QVGWGH+WNRFAGPLSC  QPNQCA TPLLPPAWW+G+WQSPIPRDI R+E YGVHLSG
Sbjct: 481 LQVGWGHVWNRFAGPLSCRGQPNQCAYTPLLPPAWWEGIWQSPIPRDIHRLEQYGVHLSG 540

Query: 541 FGTIDEDSLRSFCNAKKNVVRTIPFI 552
           FGT DE+ +RSFC+++KN+V+T+ FI
Sbjct: 541 FGTTDENQIRSFCSSRKNIVKTVTFI 558

BLAST of Cp4.1LG01g19760 vs. TrEMBL
Match: W9SEL0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005663 PE=4 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 5.3e-217
Identity = 378/572 (66.08%), Positives = 443/572 (77.45%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCA-VGASIGFLMLNF-----------LMRMEAQESESSS 60
           MRHGGSR++R+S   R  VV CA +  + G LMLN            L   ++     SS
Sbjct: 1   MRHGGSRRRRAS--IRSFVVACAMIFGATGLLMLNLRAVDPPTGPTILTNRDSDPISRSS 60

Query: 61  DQLGNGDDVEESRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASR 120
             +GNG    +++  +   G R CATVE+MGE F  G WKESLRVR II  HF L+GA+R
Sbjct: 61  GDVGNGT---QTQAQTTKRGTRPCATVEEMGEHFNGGFWKESLRVRRIILRHFSLSGATR 120

Query: 121 VRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRH-------- 180
           VR LPPEQFC+HGFV+ K+S+AGFGNEMYKIL+A ALSIMLNRSLI+GQTRH        
Sbjct: 121 VRNLPPEQFCRHGFVLAKASQAGFGNEMYKILSAAALSIMLNRSLIVGQTRHIGPFSSLS 180

Query: 181 AKGKFPFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNW 240
              +FPFGDYISYSNV+FT+KE+KHLWR   C +KF R L +RTD+FEKP QTNVLC NW
Sbjct: 181 VTERFPFGDYISYSNVSFTLKEVKHLWRQNKCEKKFGRRLTIRTDNFEKPTQTNVLCGNW 240

Query: 241 KEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLIS 300
           K W+ PIIWFQGTTDAVA QFFLKN+HP MR+AAS+LFG  EVL+SRPNVFGELMRVLIS
Sbjct: 241 KAWKQPIIWFQGTTDAVAVQFFLKNIHPEMRSAASDLFGQSEVLQSRPNVFGELMRVLIS 300

Query: 301 PSKDVEEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVL 360
           PSK VEEAV  VL  G DPDISLHMRMLMN+SVR LQAAL CI+K   NL+  SKPR+V+
Sbjct: 301 PSKSVEEAVNWVLAGGADPDISLHMRMLMNKSVRALQAALNCIKKATNNLSKTSKPRVVV 360

Query: 361 VSDTPNFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFV 420
           VSDTP+ V SI P + +FAEV+HF+YEHFRG IS   +     DFRVKDWGP+PRWVAFV
Sbjct: 361 VSDTPSLVTSITPDIIKFAEVLHFNYEHFRGNISVRANSLQGPDFRVKDWGPAPRWVAFV 420

Query: 421 DFFLASRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLL 480
           DFFLASRAK AV+SGAHRRVGTTYAQLIAALAAA   ++LG N+T S F FLSSFQ NLL
Sbjct: 421 DFFLASRAKHAVVSGAHRRVGTTYAQLIAALAAA---NSLGDNATSSSFSFLSSFQRNLL 480

Query: 481 REGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYG 540
           REGL+ Q+GWGH+WNRFAG LSC +QP+QCA TP+LPPAWWDGLWQSP+PRD++R+E++G
Sbjct: 481 REGLRFQIGWGHVWNRFAGLLSCHNQPHQCAFTPVLPPAWWDGLWQSPLPRDVRRLEDFG 540

Query: 541 VHLSGFGTIDEDSLRSFCNAKKNVVRTIPFIL 553
           V LSG GTIDE  L SFCN++K+VV+ IP  L
Sbjct: 541 VQLSGLGTIDESHLHSFCNSRKSVVKAIPIPL 564

BLAST of Cp4.1LG01g19760 vs. TrEMBL
Match: A0A067L042_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26829 PE=4 SV=1)

HSP 1 Score: 760.4 bits (1962), Expect = 1.5e-216
Identity = 365/508 (71.85%), Positives = 425/508 (83.66%), Query Frame = 1

Query: 45  ESSSDQLGNGDDVEESRVLSEMDG-RRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYL 104
           +SSSD L NG  ++E+ V  E +G  +SCATV++MGE+FK  VWKESLRVR IIQ HF +
Sbjct: 65  DSSSDSL-NGVVLDETEVHRERNGGSKSCATVDEMGESFKGSVWKESLRVRRIIQEHFAV 124

Query: 105 NGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKG 164
           NGAS +R LPPEQFCKHGFV+GK+SEAGFGNEMYKIL A ALSIMLNRSLII QTR   G
Sbjct: 125 NGASIIRHLPPEQFCKHGFVLGKASEAGFGNEMYKILNAAALSIMLNRSLIIRQTR---G 184

Query: 165 KFPFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEW 224
           K+PFGD+ISYSN +FT+ E+KHLWR  GC++ + RHL+MR DDFEKPA+TNVLCSNW++W
Sbjct: 185 KYPFGDFISYSNHSFTLNEVKHLWRKNGCVKNYGRHLVMRIDDFEKPAKTNVLCSNWRKW 244

Query: 225 EHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSK 284
           E PIIW Q TTDAVA+QFFLKNV+P MR +ASNLFG PE L+SRPNVFGELMRVLISPS+
Sbjct: 245 EQPIIWLQNTTDAVASQFFLKNVYPEMRVSASNLFGEPEQLQSRPNVFGELMRVLISPSE 304

Query: 285 DVEEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSD 344
           DV EAV  VL  G DPDISLHMRMLMNRSVR  QAAL CI+K + NL   S+P++VLVSD
Sbjct: 305 DVIEAVNWVLGGGADPDISLHMRMLMNRSVRATQAALNCIQKALHNLHQISRPKVVLVSD 364

Query: 345 TPNFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFF 404
           TP FVKS +P L EFAEVI+FDY+HF G +S   +  H LDFRVKDWGP+PRWVAFVDFF
Sbjct: 365 TPAFVKSFLPQLSEFAEVIYFDYKHFEGNVSRNVNASHNLDFRVKDWGPAPRWVAFVDFF 424

Query: 405 LASRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREG 464
           LASRAK  VISGAHRRVGTTYAQL AALAAA   ++LG+NST S+F FLSSFQSNLLR+G
Sbjct: 425 LASRAKNTVISGAHRRVGTTYAQLTAALAAA---NHLGENSTDSNFSFLSSFQSNLLRDG 484

Query: 465 LKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHL 524
           LK Q+GWGH+WNRFAGPLSC +Q NQCA TPLLPPAWWDGLWQSPIPRD++R+  +G+ L
Sbjct: 485 LKLQIGWGHVWNRFAGPLSCQNQSNQCAFTPLLPPAWWDGLWQSPIPRDVRRLMQFGIKL 544

Query: 525 SGFGTIDEDSLRSFCNAKKNVVRTIPFI 552
           SGFGT+DED LRSFC++KK  ++T+  I
Sbjct: 545 SGFGTVDEDHLRSFCSSKKTTMKTVLII 565

BLAST of Cp4.1LG01g19760 vs. TrEMBL
Match: V4W2U9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014761mg PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 3.2e-214
Identity = 371/568 (65.32%), Positives = 445/568 (78.35%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVV-VLCAVGASIGFLMLNFLMR-MEAQESESSSDQLGNGDDVE 60
           M+HGGSR++R S     VV V+C+ G  +   M   ++R ++   + S+   L   +DV+
Sbjct: 1   MKHGGSRRRRLSVQTMVVVFVICSAGVGLLMTMTMLILRPLDTPPNTSADVFLPVENDVD 60

Query: 61  ESRVLSEMDGR-------------------RSCATVEQMGEAFKDGVWKESLRVRTIIQN 120
            S++L E +                     + CATVE+MGE FK  V +ESL+VR +IQ 
Sbjct: 61  SSQLLEETETETETESESESETKTSTVSNPKRCATVEEMGEDFKGSVREESLKVRKLIQR 120

Query: 121 HFYLNGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR 180
           HF LNGASRVR LPPEQFCKHGFV+GK+SEAGFGNEMYKILT  ALS+MLNRSLIIGQTR
Sbjct: 121 HFDLNGASRVRNLPPEQFCKHGFVLGKASEAGFGNEMYKILTGAALSVMLNRSLIIGQTR 180

Query: 181 HAKGKFPFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSN 240
              GK+PFG+YISYSNV+FT++E+KHLWR  GC++K+ RHL+MR DDFEKP QTNVLCSN
Sbjct: 181 ---GKYPFGEYISYSNVSFTLEEVKHLWRRNGCLKKYGRHLVMRIDDFEKPPQTNVLCSN 240

Query: 241 WKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLI 300
           W++WE PIIWFQGTTDAVAAQFFLKNVHP MR AA++LFGHPE L +RPNVFGELMRVLI
Sbjct: 241 WRKWEQPIIWFQGTTDAVAAQFFLKNVHPEMRNAANDLFGHPESLHARPNVFGELMRVLI 300

Query: 301 SPSKDVEEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLV 360
           SPS+DVEEAV  VL +G DPDISLHMRML NRSVR +QAA++CIRK + +L   S+P+ V
Sbjct: 301 SPSEDVEEAVKWVLGNGVDPDISLHMRMLTNRSVRAVQAAVKCIRKVVNSLNLTSRPKTV 360

Query: 361 LVSDTPNFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAF 420
           +VSDTP+F K+I P + EFAEV++FDY+ FRG IS   +    L+FR KDWGP+PRWVAF
Sbjct: 361 IVSDTPSFAKTITPNISEFAEVLYFDYKAFRGNISHDVNRLPSLEFRAKDWGPAPRWVAF 420

Query: 421 VDFFLASRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNL 480
           VDFFLASRAK AV+SGA RRVGTTYAQLIAALAAA++L   G NST   F FLSSFQSNL
Sbjct: 421 VDFFLASRAKHAVVSGAFRRVGTTYAQLIAALAAANSL---GDNSTDLSFSFLSSFQSNL 480

Query: 481 LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENY 540
           L  GL+ QVGWGH+WNRFAGPLSC  Q +QCA TPLLPPAWWDGLW+SPIPRDI R+  +
Sbjct: 481 LTGGLRLQVGWGHVWNRFAGPLSCHHQSHQCAFTPLLPPAWWDGLWESPIPRDINRLAAF 540

Query: 541 GVHLSGFGTIDEDSLRSFCNAKKNVVRT 548
           GVHLSGFGT+DE+ L+SFC++KKN V+T
Sbjct: 541 GVHLSGFGTVDENRLQSFCSSKKNSVKT 562

BLAST of Cp4.1LG01g19760 vs. TAIR10
Match: AT3G26950.1 (AT3G26950.1 unknown protein)

HSP 1 Score: 688.0 bits (1774), Expect = 4.9e-198
Identity = 341/558 (61.11%), Positives = 415/558 (74.37%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
           M+ GG+R+KR   F +  ++L +V   IGF +L   +R     S    D     +  E S
Sbjct: 1   MKRGGTRRKRL--FGK-TILLSSVVFFIGFGLLLLTLRSVDPNSSFIDDDDDESESEEAS 60

Query: 61  RVLSE-------MDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLP 120
           R  +        +DG + CATVE+MG  F  G   +SLRVR +I  HF +NGAS +R+LP
Sbjct: 61  RWSNSSSIGEAMVDGAKLCATVEEMGSEFDGGFVDQSLRVRDVIHRHFQINGASAIRELP 120

Query: 121 PEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISY 180
           PEQFC+HG+V+GK++EAGFGNEMYKILT+ ALSIMLNRSLIIGQTR   GK+PFGDYI+Y
Sbjct: 121 PEQFCRHGYVLGKTAEAGFGNEMYKILTSAALSIMLNRSLIIGQTR---GKYPFGDYIAY 180

Query: 181 SNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGT 240
           SN TFTM E+KHLWR  GC++K+ R L+MR DDFEKPA++NVLCSNWK+WE  IIWFQGT
Sbjct: 181 SNATFTMSEVKHLWRQNGCVKKYKRRLVMRLDDFEKPAKSNVLCSNWKKWEEAIIWFQGT 240

Query: 241 TDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVL 300
           TDAVAAQFFLKNVHP MRAAA  LFG       R NVFGELM  LISP+KDV+EAV  VL
Sbjct: 241 TDAVAAQFFLKNVHPEMRAAAFELFGEQGNSAPRGNVFGELMMSLISPTKDVKEAVDWVL 300

Query: 301 KSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVP 360
               DPDIS+HMRMLM++SVR ++AA+ C+ K I  L   + PR+V+VSDTP+ VK I  
Sbjct: 301 HETGDPDISVHMRMLMSKSVRPMRAAINCLGKAINRLGIPN-PRVVIVSDTPSVVKIIKT 360

Query: 361 MLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVI 420
            +   AEV+HFDY+ FRG I+        LDFR+KDWGP+PRWVAFVDFFLA RAK AVI
Sbjct: 361 NISTIAEVLHFDYKLFRGDIAQRGRGLPMLDFRIKDWGPAPRWVAFVDFFLACRAKHAVI 420

Query: 421 SGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHI 480
           SGA+RRVGTTYAQL+AALAAA++L +    S+ S F FLSSFQSNLL +GLKNQVGWGH+
Sbjct: 421 SGANRRVGTTYAQLVAALAAANSLKD---GSSNSSFAFLSSFQSNLLADGLKNQVGWGHV 480

Query: 481 WNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDS 540
           WNR+AGPLSCP QPNQCA TPL PP WWDG+WQSPIPRD +R+  +G+ LSGFGT++ED 
Sbjct: 481 WNRYAGPLSCPKQPNQCAFTPLAPPGWWDGIWQSPIPRDTRRLAAFGIELSGFGTVNEDR 540

Query: 541 LRSFCNAKKNVVRTIPFI 552
             ++C+AKK  V T+  I
Sbjct: 541 FHAYCSAKKEYVSTVTII 548

BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match: gi|778702675|ref|XP_004140294.2| (PREDICTED: uncharacterized protein LOC101211825 isoform X1 [Cucumis sativus])

HSP 1 Score: 988.8 bits (2555), Expect = 3.8e-285
Identity = 486/554 (87.73%), Positives = 512/554 (92.42%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
           MRHGGSR+KRSSSF RY+V+LCAVGA+I FLMLN LMRMEA     SSDQ GNG+  EE 
Sbjct: 1   MRHGGSRRKRSSSFVRYLVLLCAVGAAICFLMLNVLMRMEA-----SSDQYGNGERFEEP 60

Query: 61  RVLSE-MDGRRS-CATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120
              +  M+GRRS CA VEQMG+ FKDGV KESLRVRTIIQNHFYLNGASRVRQLPPEQFC
Sbjct: 61  PAQTTGMEGRRSSCAMVEQMGDPFKDGVRKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120

Query: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTF 180
           KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR   GKFPFGDYISYS+++F
Sbjct: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISF 180

Query: 181 TMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240
           T+KEIKHLWRL GC++KFNR LIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA
Sbjct: 181 TLKEIKHLWRLNGCVKKFNRRLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240

Query: 241 AQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDD 300
           AQFFLKN+HP MRAAASNLFG PEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSG D
Sbjct: 241 AQFFLKNIHPTMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKNVEEAVFSVLKSGAD 300

Query: 301 PDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEF 360
           PDISLHMRMLMNRSVRGLQAA+QCIRK + NLT  SKPRLVLVSDTPNFVKSIVP+L EF
Sbjct: 301 PDISLHMRMLMNRSVRGLQAAVQCIRKAMLNLTGLSKPRLVLVSDTPNFVKSIVPILDEF 360

Query: 361 AEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHR 420
           AEVIHFDYEHFRG ISGT DEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHR
Sbjct: 361 AEVIHFDYEHFRGNISGTDDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHR 420

Query: 421 RVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFA 480
           RVGTTYAQLIAALAAA+NLDNLG  STGSDF FLSSFQSNLLREGLKNQ+GWGHIWNRFA
Sbjct: 421 RVGTTYAQLIAALAAANNLDNLGNKSTGSDFLFLSSFQSNLLREGLKNQIGWGHIWNRFA 480

Query: 481 GPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFC 540
           GPLSCPSQPNQCA+TPLLPPAWWDGLWQSPIPRD+KRMENYGVHL+ FGT+DEDSLRSFC
Sbjct: 481 GPLSCPSQPNQCAVTPLLPPAWWDGLWQSPIPRDVKRMENYGVHLTSFGTVDEDSLRSFC 540

Query: 541 NAKKNVVRTIPFIL 553
           NAKKNV+RTIPFIL
Sbjct: 541 NAKKNVLRTIPFIL 546

BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match: gi|659126386|ref|XP_008463157.1| (PREDICTED: uncharacterized protein LOC103501366 isoform X1 [Cucumis melo])

HSP 1 Score: 978.8 bits (2529), Expect = 4.0e-282
Identity = 481/554 (86.82%), Positives = 507/554 (91.52%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEE- 60
           MRHGGSR+KRSSSF RY++VLCAVGA+I FLMLN LMRMEA     SSDQ G+G+  EE 
Sbjct: 1   MRHGGSRRKRSSSFVRYLLVLCAVGAAICFLMLNVLMRMEA-----SSDQFGDGEHFEEP 60

Query: 61  -SRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120
            ++      GR SCATVEQMG+ FKDGV KESLRVRTIIQNHFYLNGASRVRQLPPEQFC
Sbjct: 61  PAQTTGMEGGRTSCATVEQMGDPFKDGVRKESLRVRTIIQNHFYLNGASRVRQLPPEQFC 120

Query: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTF 180
           KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR   GKFPFGDYISYS+++F
Sbjct: 121 KHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISF 180

Query: 181 TMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240
           T+KEIKHLWRL GC++KFNR LIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA
Sbjct: 181 TLKEIKHLWRLNGCVKKFNRRLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVA 240

Query: 241 AQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDD 300
           AQFFLKN+HP MRAAASNLFG PEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSG D
Sbjct: 241 AQFFLKNIHPTMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKNVEEAVFSVLKSGAD 300

Query: 301 PDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEF 360
           PDISLHMRMLMNRSVRGLQAA+QCIRK + NLT  SKPRLVLVSDTPNFVKSIVP+L EF
Sbjct: 301 PDISLHMRMLMNRSVRGLQAAVQCIRKAMLNLTGVSKPRLVLVSDTPNFVKSIVPILDEF 360

Query: 361 AEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHR 420
           AEVIHFDYEHFRG ISGT DEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHR
Sbjct: 361 AEVIHFDYEHFRGNISGTDDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHR 420

Query: 421 RVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFA 480
           RVGTTYAQLIAALAAA+NLD LG  STGSDF FLSSFQSNLLREGLKNQVGWGHIWNRFA
Sbjct: 421 RVGTTYAQLIAALAAANNLDYLGNKSTGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFA 480

Query: 481 GPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFC 540
           GPLSC SQPNQCA+TPLLPPAWWDG+WQSPIPRDIKRMENYGVHL+ FGT+DED LRSFC
Sbjct: 481 GPLSCSSQPNQCAITPLLPPAWWDGIWQSPIPRDIKRMENYGVHLTSFGTVDEDGLRSFC 540

Query: 541 NAKKNVVRTIPFIL 553
            AKKNV+RTIPFIL
Sbjct: 541 YAKKNVLRTIPFIL 546

BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match: gi|778702678|ref|XP_011655244.1| (PREDICTED: uncharacterized protein LOC101211825 isoform X2 [Cucumis sativus])

HSP 1 Score: 815.8 bits (2106), Expect = 4.4e-233
Identity = 390/429 (90.91%), Positives = 408/429 (95.10%), Query Frame = 1

Query: 124 MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTMKEI 183
           MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR   GKFPFGDYISYS+++FT+KEI
Sbjct: 1   MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISFTLKEI 60

Query: 184 KHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL 243
           KHLWRL GC++KFNR LIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL
Sbjct: 61  KHLWRLNGCVKKFNRRLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL 120

Query: 244 KNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPDISL 303
           KN+HP MRAAASNLFG PEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSG DPDISL
Sbjct: 121 KNIHPTMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKNVEEAVFSVLKSGADPDISL 180

Query: 304 HMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAEVIH 363
           HMRMLMNRSVRGLQAA+QCIRK + NLT  SKPRLVLVSDTPNFVKSIVP+L EFAEVIH
Sbjct: 181 HMRMLMNRSVRGLQAAVQCIRKAMLNLTGLSKPRLVLVSDTPNFVKSIVPILDEFAEVIH 240

Query: 364 FDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRVGTT 423
           FDYEHFRG ISGT DEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRRVGTT
Sbjct: 241 FDYEHFRGNISGTDDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTT 300

Query: 424 YAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSC 483
           YAQLIAALAAA+NLDNLG  STGSDF FLSSFQSNLLREGLKNQ+GWGHIWNRFAGPLSC
Sbjct: 301 YAQLIAALAAANNLDNLGNKSTGSDFLFLSSFQSNLLREGLKNQIGWGHIWNRFAGPLSC 360

Query: 484 PSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNAKKN 543
           PSQPNQCA+TPLLPPAWWDGLWQSPIPRD+KRMENYGVHL+ FGT+DEDSLRSFCNAKKN
Sbjct: 361 PSQPNQCAVTPLLPPAWWDGLWQSPIPRDVKRMENYGVHLTSFGTVDEDSLRSFCNAKKN 420

Query: 544 VVRTIPFIL 553
           V+RTIPFIL
Sbjct: 421 VLRTIPFIL 426

BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match: gi|659126390|ref|XP_008463159.1| (PREDICTED: uncharacterized protein LOC103501366 isoform X2 [Cucumis melo])

HSP 1 Score: 805.1 bits (2078), Expect = 7.9e-230
Identity = 387/429 (90.21%), Positives = 404/429 (94.17%), Query Frame = 1

Query: 124 MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTMKEI 183
           MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR   GKFPFGDYISYS+++FT+KEI
Sbjct: 1   MGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISFTLKEI 60

Query: 184 KHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL 243
           KHLWRL GC++KFNR LIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL
Sbjct: 61  KHLWRLNGCVKKFNRRLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFL 120

Query: 244 KNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPDISL 303
           KN+HP MRAAASNLFG PEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSG DPDISL
Sbjct: 121 KNIHPTMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKNVEEAVFSVLKSGADPDISL 180

Query: 304 HMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAEVIH 363
           HMRMLMNRSVRGLQAA+QCIRK + NLT  SKPRLVLVSDTPNFVKSIVP+L EFAEVIH
Sbjct: 181 HMRMLMNRSVRGLQAAVQCIRKAMLNLTGVSKPRLVLVSDTPNFVKSIVPILDEFAEVIH 240

Query: 364 FDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRVGTT 423
           FDYEHFRG ISGT DEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRRVGTT
Sbjct: 241 FDYEHFRGNISGTDDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTT 300

Query: 424 YAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSC 483
           YAQLIAALAAA+NLD LG  STGSDF FLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSC
Sbjct: 301 YAQLIAALAAANNLDYLGNKSTGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSC 360

Query: 484 PSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNAKKN 543
            SQPNQCA+TPLLPPAWWDG+WQSPIPRDIKRMENYGVHL+ FGT+DED LRSFC AKKN
Sbjct: 361 SSQPNQCAITPLLPPAWWDGIWQSPIPRDIKRMENYGVHLTSFGTVDEDGLRSFCYAKKN 420

Query: 544 VVRTIPFIL 553
           V+RTIPFIL
Sbjct: 421 VLRTIPFIL 426
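
The Identity and Positives figures in each HSP header can be checked against the alignment itself. The following is a minimal sketch, assuming the usual BLAST midline convention seen in the blocks above: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function names are hypothetical and are not part of any BLAST tool.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int]:
    """Count identities and positives in a BLAST alignment midline.

    Assumes letters mark identical residues and '+' marks a
    conservative (positive-scoring) substitution.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, aligned_length: int) -> float:
    """Express a residue count as a percentage of the alignment length."""
    return round(100.0 * count / aligned_length, 2)


# Midline of the final alignment block (query residues 544-553).
ident, pos = midline_stats("V+RTIPFIL")
print(ident, pos)

# Totals reported in the HSP header for the Cucumis melo isoform X2 match.
print(percent(387, 429), percent(404, 429))
```

Summing the per-block counts over a whole HSP and dividing by the alignment length (429 columns here, gaps included) should give the header percentages.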

BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match: gi|590647591|ref|XP_007031943.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 771.9 bits (1992), Expect = 7.4e-220
Identity = 375/566 (66.25%), Positives = 445/566 (78.62%), Query Frame = 1

Query: 1   MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSD----------- 60
           MR+GGSRKKR+    R+ ++LCA    I +LML  L  ++   + +++            
Sbjct: 1   MRYGGSRKKRA--LVRWFLILCAAFTFISWLMLLTLRSIDTPPTTTTTKTTDVALVDLPG 60

Query: 61  ----QLGNGDDVEESRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNG 120
               QL   D V  S    +    +SCATVE+MG++FK  + KESL VR IIQ HF +NG
Sbjct: 61  KLEHQLFQRDGVLSSAEAPKKASAKSCATVEEMGKSFKGRILKESLGVRRIIQRHFSVNG 120

Query: 121 ASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKF 180
           ASR+R+LPPEQFC+HGFV+GK+SEAGFGNEMYKILTA ALS+MLNRSLIIGQTR   GK+
Sbjct: 121 ASRIRELPPEQFCRHGFVIGKASEAGFGNEMYKILTAAALSVMLNRSLIIGQTR---GKY 180

Query: 181 PFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEH 240
           PFGDYI YSN+TFT++E+KHLWR  GC + + RHL+MRTDDFEKP +TN LC NW++W  
Sbjct: 181 PFGDYILYSNLTFTLREVKHLWRQNGCAKIYGRHLVMRTDDFEKPTKTNALCGNWRKWRQ 240

Query: 241 PIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDV 300
           PIIW+QGTTDAVAAQFFLKN+HP MR AAS LFG PE L SRPNVFGELMR+LISPS+D+
Sbjct: 241 PIIWYQGTTDAVAAQFFLKNIHPDMRNAASELFGKPESLRSRPNVFGELMRILISPSRDI 300

Query: 301 EEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTP 360
           EEAV  VL  G DPDI+LHMRMLMNR VR  QAAL C+R+  RNL   S+PR+V+VSDTP
Sbjct: 301 EEAVNWVLCGGRDPDITLHMRMLMNRPVRAAQAALNCLRRATRNLQQGSRPRVVVVSDTP 360

Query: 361 NFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLA 420
           +FVKSI P + EFAEV+HFDY+ FRG  S        LDFRVKDWGP+PRWVAFVDFFLA
Sbjct: 361 SFVKSITPNISEFAEVLHFDYKLFRGNASHDIKASPNLDFRVKDWGPAPRWVAFVDFFLA 420

Query: 421 SRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLK 480
           S AK AV+SGAHRRVGTTYAQLIAALAAA   +++G+NSTGS F FLSSFQSNLL +GLK
Sbjct: 421 SSAKHAVVSGAHRRVGTTYAQLIAALAAA---NSIGENSTGSSFSFLSSFQSNLLADGLK 480

Query: 481 NQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSG 540
            QVGWGH+WNRFAGPLSC  QPNQCA TPLLPPAWW+G+WQSPIPRDI R+E YGVHLSG
Sbjct: 481 LQVGWGHVWNRFAGPLSCRGQPNQCAYTPLLPPAWWEGIWQSPIPRDIHRLEQYGVHLSG 540

Query: 541 FGTIDEDSLRSFCNAKKNVVRTIPFI 552
           FGT DE+ +RSFC+++KN+V+T+ FI
Sbjct: 541 FGTTDENQIRSFCSSRKNIVKTVTFI 558

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
A0A0A0KNC7_CUCSA	2.7e-285	87.73	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G429950 PE=4 SV=1
A0A061EKP0_THECC	5.1e-220	66.25	Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_017276 PE=4 SV=1
W9SEL0_9ROSA	5.3e-217	66.08	Uncharacterized protein OS=Morus notabilis GN=L484_005663 PE=4 SV=1
A0A067L042_JATCU	1.5e-216	71.85	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26829 PE=4 SV=1
V4W2U9_9ROSI	3.2e-214	65.32	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014761mg PE=4 SV=1

Match Name	E-value	Identity	Description
AT3G26950.1	4.9e-198	61.11	unknown protein

Match Name	E-value	Identity	Description
gi|778702675|ref|XP_004140294.2|	3.8e-285	87.73	PREDICTED: uncharacterized protein LOC101211825 isoform X1 [Cucumis sativus]
gi|659126386|ref|XP_008463157.1|	4.0e-282	86.82	PREDICTED: uncharacterized protein LOC103501366 isoform X1 [Cucumis melo]
gi|778702678|ref|XP_011655244.1|	4.4e-233	90.91	PREDICTED: uncharacterized protein LOC101211825 isoform X2 [Cucumis sativus]
gi|659126390|ref|XP_008463159.1|	7.9e-230	90.21	PREDICTED: uncharacterized protein LOC103501366 isoform X2 [Cucumis melo]
gi|590647591|ref|XP_007031943.1|	7.4e-220	66.25	Uncharacterized protein isoform 1 [Theobroma cacao]
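
The NCBI nr match names above pack two identifiers into one pipe-delimited string: a GI number and a RefSeq accession. A minimal sketch for splitting them, assuming every name follows the gi|number|ref|accession| layout shown in these rows; the function name is hypothetical.

```python
from typing import Dict


def parse_ncbi_id(identifier: str) -> Dict[str, str]:
    """Split a 'gi|<number>|<db>|<accession>|' string into its parts.

    Assumes the four-field layout seen in the NCBI nr rows above;
    raises ValueError for anything else.
    """
    fields = [field for field in identifier.split("|") if field]
    if len(fields) != 4 or fields[0] != "gi":
        raise ValueError(f"unexpected identifier layout: {identifier!r}")
    return {"gi": fields[1], "db": fields[2], "accession": fields[3]}


parsed = parse_ncbi_id("gi|659126390|ref|XP_008463159.1|")
print(parsed["gi"], parsed["accession"])
```

The accession, rather than the GI number, is the part to use when looking a record up, since NCBI has stopped assigning GI numbers to new sequence records.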
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g19760.1	Cp4.1LG01g19760.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	PANTHER	PTHR35736	FAMILY NOT NAMED	coord: 1..552
None	No IPR available	PANTHER	PTHR35736:SF1	SUBFAMILY NOT NAMED	coord: 1..552