Cp4.1LG04g00120 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g00120
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionProtein of unknown function (DUF1644)
LocationCp4.1LG04 : 2033490 .. 2041195 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATGTACGAACAAAACTGAAAGAGAAAGTGGCGTGCAAATGACAAACGATTCGCCGTCTCTCTCTCTCCGCTGTTAGGTAATGGAATGGCTAATCTATCTCTCTTCTGCCTGTAACTCCCTCGATTTCATCGCCTCCCTCTCCCTTTTTCTCCTCTAAAACCCTTACTCTCTTCTCACTTCCGTTTTTAAATCTTTGATTCGTGTTCTACTGTGTTTCCCTGAATTTTGGACGGATTTACATACAGTTCTTCACCCTCTGCGTTTTTCGGTATCGTTTTCACTCTGGGGTTTTTATCGTTTCCGTTTTGGTTCTGTTCGTCTTGTAATAATCGCGGTTTAAATGCTAGATGTTTCGTGTTTCTTCTAATTCTTGTCAATTTTTCTTGCGAGGATCTCCGTTGATTAGGATTTTCCCTTTTTTAAGAGGCCATTGTCTGCTCTTGAACTTATATTTCTCTGAATTATTGTTTTTTTCTCACATGGATTAAAGTCAAGCCCATGTGCCTGTTCGGTTTTTTTTTTTAATCAATTCCATCCTGTTTGTTCTGCTAAGCACGTTCTTGTTGTTAGCTCTCTGCTATTTACACGCATCCTTTAAATGATTTTGTGTTTTTATTTGTGCTAATGCGATTTCAGGTGCAGTTTGATCTCTCCCTTCTGACTTCTCTTGGATCTAGATTTCCTTGAACTTTGAAGCTGGTGAGTATCGGCTTTCTAATGTAGATGTGTTTTGGATTTTGTTGGAATTTATCCCAGAATTTTGTTGATCTAGTAGATTTTTGATTCTTACTTTTGATCTATCTGTTTGCCATTCCTTGTATGTTTTGACGTTTGGAAAACTGAATATGATTCTAGGATCTCTTAAAGATGCTGACCTTCTATCATAGATAATGGTTTTGTTCATTTGTTGTTGATCTTTCCTGCAATTTCTTTGTTGAATAGAAGTTAGTTTCTACTGTTAAACGCTAGGCAGAGGATATTCGATCTGAATTCCCTTCCGATTTTCCGTTAATAATAGTTGGTATTGTGCAAATTACTTGGCTAATTAGCTTCATAGTTCGATTTTGGTTGAGAAACCTGTTTTCCAAAACCGATTGGTGTATATGTTTTCATTAAGAAAAGGTGTAATGTTTAAAATTGTTGTGAACATTGAAATTACCTGTTGGATTCGTGATTTGGAAAGAAAGAATAACTTCATTTGTTCTGAAAATAATAAAGCACATATGAGGTTAAAGTAAGGAAAATGATAGAAAGAAAATTTTATAGGACAACTGTCCAACCATTTTTTTTTACTGAATTCTTGTCCTTCAAAACTTTAAATTATTCTGGTAATAGTTCTTAAACAAACCTTAACTTTTCTTGGATTAAATTACAAACTTAGTCTATTTAATACCTAAACTTTAAAAATGTTCAATTATGATCCTATGTGTTAAAAATGTGGTTCTATGCACGATAATGTGACGCTTATTGAACCAATGGAAGAAGGTAAACAACATGACAAAATAAGGTGTGATCTTGTGTAGAATGCATCCAATATTTCTCTTCATAAAGAGAGTATTTATTAATGGTTCTCTTTTATGATTTAAGACTGATCTATGCCCCAGTTATTGGAAATTGCGTTACAGAGATATAATCCTAGAATGATTTTTTTTTAAGTGGGCTTAATCATATTTCTTTATGTAGATTTGAAAGCATGATTGTCTCCTTCATGGGACTGCATGAGATTAAATTAAGTCAAATGATATTTGATGACGTGCGTTTGTATAAAAGCTATTTTGCCCAGATCCTGGACATGATATAATGTTTCAATTGATCATTATTATTTTAAAATTTTAGATATATTTGGGACACATTTAGAACACAGAGAAAATGAAGATACTAAAGTAAAGGAAAAGAAAAAAGAAAAAGAAGAAAAAAACATATAAATCTTCTAAGCTTTGGGGCTAAACAAATAAAGACACAAAATTAGATCAAATGAAGTAGAAAAAGAGATCGAAAATAAATAATTTTTTTACTCAATAGGACCAAGTGGATGATCTGAGAAGTTATGGGAAGTGGGGAATGAGACTAAACTGTCTGTCTGTTTCTAAGGATTTGTTATGTAGTTTGTAGAATATGTTACTTTATTTTCTTCTCGTTTGATGTTTAATGTTTATCATTGAATATTACAATTGGAAGATAAGATCTCTGAATTTTCAATTTTTGATTTGGCGTCATGGAATTCTTTGATCACTGAAGGCACAATTGGCCTTAAAATTCTTTGTCACCAGCCTCCACATTTTCCTGAATTGGGAATTATCCTCGTTTCATCAATGAAATTACTCTCCTTCCCTTCTCCTACTACTGCTTCGATACTCTCCTAATGTCTTACTGCCAATCCCACCGGATATTAGTGCACCCTGGCCCCTAATTTGATTGGACCAACACACCCATGTTTCTATTTTGCTGTCGTTTCAACTTCTCTTTTTCCACAACTTGCGTATCATGTTTCAGTTTATTCCCTAAAAGAACCTGCAACAATTGTTTGAGAATTATTCTTTTAAGCCTTACATTTTGCATCCGTAGGAAGAAGGTTTGCTTTGAATTTTTTTAGTTGATTCGTTTTTTATGGATAAGTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNGTTGTTATCTCTTGGATGGACAATCTCCACTTCAACGAGTATAAAAACTGACAGAAATCCAATCAATCATGCATGTTCATGGTTCCTATTGCGTCTTGAATGAAACCCGTGTAATTTTCATCCATCCCTTCCATAAAGGTTGTTTTTAACCACATCACGATCAATAATAGTAACTCTTGCTCTTTAATTTTTGTGATCAACATTTTTAAAGTCCTCCATTGATCAATAAAATTACCAATTGGCTGAATTGCTTACTTAGATGCATATTAGCACATAATAATAATTACATTAATAAACTTCTGTGTTTTAAGAATATGACATTTATGTAATTCTTTAGTCCTTTTATGAGGCACAGAGACTTGAAGTCTTAGTGTGTTTTCATGGAGGCATGAGCCTTTTATAATCCAAGGAAGTTAAAATATTTGAGGCTAAAATTTAGCCTAGCTGTGAGGGGAGCCTCAAGGTCTAAGCCACGAAAGTCTTTCATAACATTGTAAAAATCTAGTCACAATTTCCCCGTCTAACACATTTTGATTGCCCAACTTCTTCACGAAGTTGCCCAACATTATTTAACATAGTGGATCACAGGTTGTATATCCTCCTCAATCCTATCACCATCATCAGTCGCCCAATAAGAAAGATTCTTGCTTTGAGAGGCCTTAGACAAGGCGATCCCTTTTCAACCTTCCTTTTTATTATTGGCATGGATTGTTTTAGTAGAATGATGATAGAAGAAAGTTCCATAAATTTGAATAACAACTTTGAGATTGAGAGTTTCCCTATGGATACATCACCTTCTTTCTGCCGATGACACCATTTTTTTCTCATCCATGAATCTGTGTTGGTATTCAACAACCTTTTTGGCCTTGTCGACTATTTTGAACAGGCATCCTAGTTGATCACCAGAAATCAGAGACTATCAGGGTTAATACAGAATATAAGGAGGTTGAAATGATAGCTGAGAAATTCGTTTGTAGACAAGGATACTGGCCTAACATCTATCTAGGGCTTCCATTTTACGGTAAACCTTGAACTTCTTTTTGGTCCCCTGTTGAAAAAGTTGAAAGAAGGAGATCCAGTTATGGAACCACACATGCTTATCCAATGAGGGCAGCCTCACTCTTTTGCAAGCAACACTTCCAAATCTTCCCACATACTTTTTGTCCTCTTCAAGATTCCATCAAAGGTTCTGTGGAATATTGCAAGATTATGTAAAATTTCCTTTGGAAGGGTAGCTCCCCCCAGTCTGAATCACACTTCGTAAGATGGGAATAAGTCACTGGGTCTACCTTTCATGGAGGCTTTGGATTGCAACGATTGGAAGAAAATAATATAGCTTTATTGGCCAAATGATAGCTATCCAGCATGGATTAGGGGACCATGAAGATCCATCATGAAATAATTAAATATCATTAAAGAGAGAGTGAAGCAGAAAGTTGGAAACGGAGCAATGGTTGGTTGAACGGTTGAACGTCTTTATTCTGTCATTACAAAAACACATAATACAGGACTTGTGGAGCTTTCAACATGGAGGATGGAATTTGAATTTGAGGGGAAATCTCACAGATGATGAAATTGAGGAATGTGTGACAATAATGGATTTATCGTATGAGGTGAAATAACTTCTAGAGATCAATGGTCATGGAAATTAAATCAAAAGGATTTTTCTCCACAAAATCTTCACTGCTTGATTTAGATAACAACTTGACAGCGCCATTGTTTCCCTGCAGAAGCAATATGGATGGATTTATAGCCAAAGAAAATTAGAATTGTCTTTTGGAGCTTAGCCATATTTCCATGAATACATTTGATAGATTGCAGCGCCGTTTTCCACACGTAGTTTTATCCCCGCAATGCTGCATTATCTGTAAAACAACCCCTGAATCACAGAATCATTTGGCCACCACCTGTTATTTTGCTGGAAAAAGAGTCCAACCCTTCTTATTAAAACCCTAAATGATACTGATATAAGAAAGAACACTAAAAGATCATTGAAGAATAAGGAGATAAAGCACTAGTTTAGAAAGATGGAAGCCCCTTGCCATACATCCACTTCTTTTACCTGCCGTTGTAGCTATAGTTGAGAGAGAGAGAGATTAGTTGCATCCTTATGTGAGAGTGAAAGCAATATTGTCCTTGCTCTTGAAGAACTATTGTTATTTGTATGTGGGAGAGGGCTGATTTTATTTGTTAATTGCATGTGAGAGAGGACTGAATTTATTTTGGGTCGATAGCTCCATTGCCACATACTGAAGGAAAGCTGAGGTGGAAGTAAACCACCAACAGCAAGGTTGTTCTTTTTCTTTTATCCATATCTTGGTAGTCTCTTTACCTCAATAATAATTATAAGATAGAATTTCCTTCTCGCCAAACAATATGACCATGTGCAAGTAATAATATTAGAATAAGGAAGTTAGATATTGCATAGTATGTCAAACTATGCACACACATGCATCTATATATTCAATTTTCTGTTACCCTTATTCTCTCTTTGCAGTTTTTGTTAGCACCATTTTAAAGCTATTAATAAGAACTTTGATTGGATGACACTTGCAAAGACTGATTTTATTATGCACGTTGAAATCATTATTCACATATTTGTAAGGAAGGACAGCTTCCAAACAAGGCCCACATGTCCAACCGAGGGAAGGACAATAAGTGGTGAATTTTGGACAACGACAGTGTATTTTGTTTTCTAGGATGTTAGAAAACCCCACATAGCTCTCTCTCTTCGATCTGGTGTATTCTTTGCAAAAGGCAAGGAGAAGATGTTGCCCATTTCTTTCTGCTACACCCTTACCTCTTTTGGCTCGTCTTGGAGTAGTAACCCATTCAGGAGATAGCAAAAATGTGGATCAAGGCTATAATTGTTCTAGCTCTTGTTTGAGAATTTTGGATGGAAAGAAATGGAAGAATTTTCCGAGCATCAAACACCTTACCTTCACTACTGATGCAGGATCATGCTATTATCTTATTAGAGAGGGAGCATTGGAGAAACTCCCATTAATGAACGACCAAAGGGCCATAAAGAACATGTTGAGTGCTTGCTTCTTTAATATAGTCTATGAAAGCTCAGAGTTAGCCACTAAGCAGGTGTTGTTTACTAATCTCTGAGGAGCGGTGCTAAATTATGTATTTACTACTCGGTATCAGCTTGTTTACATTTCTATCGTTGTGTTCCTTTGTCCTTTTGTTTCTTTTGTTTTAGGTGGTTGTGTGCTGAAAACTCATTGGCTGTGTTCAAGATGTGTGGTTTACACTGTTTGTGAACTTGAATTCCTTAACGAAATTTATTGTTATTTGTTCATCAATTGAAGCTCTTGAGGTCATTATATGAATGAGGACTTGGTACATCGTTGTAGGAGCATACATAATTTGCAGTGTGATGTTCATTATGTATTTCTTACAATCCACTTGGTTTTGCTGATGTTCATATTACCTGCAAATGTTTGGCTATATATCACAGATTGAAACTCGAATGGCAAAAGGGAGAAGGGTACGTCGCAGGATGGCTTCCCGACAATTCAGGTCAAACCCATATCCACTCCCTTATAAGCGGGATTTCACAGAGGATTTGTGCCCAAAGAATTGTGTCAGCGCTCTGGATAAGAAGTATTGGGAAGATGCAACATGCTCTGTATGCATGGAATACCCACACAATGCTGTTCTTTTACTCTGTTCATCTCATGATAAGGGTTGTCGTCCCTATATGTGTGGCACAAGCCTTCGTTATTCCAACTGCCTTGACCAGTACAAAAAGGCTTACACCAAAGTTATTTCGTCCAACAGTGCACAACCAGTACACACCTCAATCGATTATTCAGCTCTTGTAGAGGATGCAAGTTTTCTTGGTGAAAGTCATGAAGTGACTGAGCTTGCATGCCCCCTCTGTCGCGGACAGGTGAAAGGATGGACCGTGGTTGAGCCTGCCAGAGAATATCTGAATGCAAAAAAGAGGACCTGCATGCAGGACAGCTGCACGTTTTTAGGAAACTACAAGGAACTGAGGAAGCATGTGAGACAGGAGCACCCATTCACAAGGCCCCGCGAGGTGGATCCCGTGCTTGAACAAAAATGGAGAAGACTGGAACGGGAATGCGAAAGGAACGACGTGATGAGCACAATCAGGTCGACAATGCCCGGAGCTGTGGTCTTCGGGGATTATGTGATAGAAGGAAATAGCTTTGGGTTTGATTCTGAGGAAGAGGAAGCGGGGTTGAATGCGAATGCGAATGCGAATGCGAATGCGGCTGAGAGGGATGGGGGCTTTGAAGTGGGCTTTGATAGTAATCTGGTGAATGTGTTTCTTCTGCTACATGCCTTTGGTCCGTCAAGTAGCGACCTCAATAGACGGCTAAGACAGCAGCAGCAGCAGCAGCAGCAGCCAGAGAGGAGTTTCCCTCCCAGACCAAATGAAGGCGGTACAGGTATTATTCCCAACACCACTCCGCTCAATAGTTTCGATTCCATTCATCTAGACAGTGACAATGGCGGAAGTGGTGACAATGAGAACGGGGGCAGCATGTCATTGGTTAGCCGCCTCCGCCGCCACGGAAGGGTGCTGTTGGGACGGTCAGGTAGGAGACGTAGGCATAGAGAGAGTAGTAGTAGATGAATGAATGATGAAAATGGACAGAGAGAAAGGAGGGGTAGGAATAGGAAACATCTCTCAGGTACATAGCATAAGAAAAAGAAGTAGAAATCTTGATAATAAAGGAATACGAGTTTGTGGAATTTTGAGCAAAATTTATGATGACATTTATTTGCGCCCCCGTCCGTCCATCTGATAGGGGAGATGAAAAGCTTTGGCAGGTTAAGAAGCCTGTGATGTAATTTATTGTGAATTTGTATAATTTCCTGATTGCCTTATCCTCCTCTTGATGATATGCTTGGATTTGCTTGAAATTTAGTTGTAACCATCTGAAATTAATGGATCTTCTGGACATTGCTTCCTTGCAACCAAGTCATTAGACTAATGACACCCATTTGTTCACATATTATAACTTCTCTGATGACTCTTTCACTGCCTTTACTCTCTATTTATTCATAA

mRNA sequence

ATGAATATTGAAACTCGAATGGCAAAAGGGAGAAGGGTACGTCGCAGGATGGCTTCCCGACAATTCAGGTCAAACCCATATCCACTCCCTTATAAGCGGGATTTCACAGAGGATTTGTGCCCAAAGAATTGTGTCAGCGCTCTGGATAAGAAGTATTGGGAAGATGCAACATGCTCTGTATGCATGGAATACCCACACAATGCTGTTCTTTTACTCTGTTCATCTCATGATAAGGGTTGTCGTCCCTATATGTGTGGCACAAGCCTTCGTTATTCCAACTGCCTTGACCAGTACAAAAAGGCTTACACCAAAGTTATTTCGTCCAACAGTGCACAACCAGTACACACCTCAATCGATTATTCAGCTCTTGTAGAGGATGCAAGTTTTCTTGGTGAAAGTCATGAAGTGACTGAGCTTGCATGCCCCCTCTGTCGCGGACAGGTGAAAGGATGGACCGTGGTTGAGCCTGCCAGAGAATATCTGAATGCAAAAAAGAGGACCTGCATGCAGGACAGCTGCACGTTTTTAGGAAACTACAAGGAACTGAGGAAGCATGTGAGACAGGAGCACCCATTCACAAGGCCCCGCGAGGTGGATCCCGTGCTTGAACAAAAATGGAGAAGACTGGAACGGGAATGCGAAAGGAACGACGTGATGAGCACAATCAGGTCGACAATGCCCGGAGCTGTGGTCTTCGGGGATTATGTGATAGAAGGAAATAGCTTTGGGTTTGATTCTGAGGAAGAGGAAGCGGGGTTGAATGCGAATGCGAATGCGAATGCGAATGCGGCTGAGAGGGATGGGGGCTTTGAAGTGGGCTTTGATAGTAATCTGGTGAATGTGTTTCTTCTGCTACATGCCTTTGGTCCGTCAAGTAGCGACCTCAATAGACGGCTAAGACAGCAGCAGCAGCAGCAGCAGCAGCCAGAGAGGAGTTTCCCTCCCAGACCAAATGAAGGCGGTACAGGTATTATTCCCAACACCACTCCGCTCAATAGTTTCGATTCCATTCATCTAGACAGTGACAATGGCGGAAGTGGTGACAATGAGAACGGGGGCAGCATGTCATTGGTTAGCCGCCTCCGCCGCCACGGAAGGGTGCTGTTGGGACGGTCAGGTAGGAGACGTAGGCATAGAGAGAGTAGTAGTAGATGAATGAATGATGAAAATGGACAGAGAGAAAGGAGGGGTAGGAATAGGAAACATCTCTCAGGTACATAGCATAAGAAAAAGAAGTAGAAATCTTGATAATAAAGGAATACGAGTTTGTGGAATTTTGAGCAAAATTTATGATGACATTTATTTGCGCCCCCGTCCGTCCATCTGATAGGGGAGATGAAAAGCTTTGGCAGGTTAAGAAGCCTGTGATGTAATTTATTGTGAATTTGTATAATTTCCTGATTGCCTTATCCTCCTCTTGATGATATGCTTGGATTTGCTTGAAATTTAGTTGTAACCATCTGAAATTAATGGATCTTCTGGACATTGCTTCCTTGCAACCAAGTCATTAGACTAATGACACCCATTTGTTCACATATTATAACTTCTCTGATGACTCTTTCACTGCCTTTACTCTCTATTTATTCATAA

Coding sequence (CDS)

ATGAATATTGAAACTCGAATGGCAAAAGGGAGAAGGGTACGTCGCAGGATGGCTTCCCGACAATTCAGGTCAAACCCATATCCACTCCCTTATAAGCGGGATTTCACAGAGGATTTGTGCCCAAAGAATTGTGTCAGCGCTCTGGATAAGAAGTATTGGGAAGATGCAACATGCTCTGTATGCATGGAATACCCACACAATGCTGTTCTTTTACTCTGTTCATCTCATGATAAGGGTTGTCGTCCCTATATGTGTGGCACAAGCCTTCGTTATTCCAACTGCCTTGACCAGTACAAAAAGGCTTACACCAAAGTTATTTCGTCCAACAGTGCACAACCAGTACACACCTCAATCGATTATTCAGCTCTTGTAGAGGATGCAAGTTTTCTTGGTGAAAGTCATGAAGTGACTGAGCTTGCATGCCCCCTCTGTCGCGGACAGGTGAAAGGATGGACCGTGGTTGAGCCTGCCAGAGAATATCTGAATGCAAAAAAGAGGACCTGCATGCAGGACAGCTGCACGTTTTTAGGAAACTACAAGGAACTGAGGAAGCATGTGAGACAGGAGCACCCATTCACAAGGCCCCGCGAGGTGGATCCCGTGCTTGAACAAAAATGGAGAAGACTGGAACGGGAATGCGAAAGGAACGACGTGATGAGCACAATCAGGTCGACAATGCCCGGAGCTGTGGTCTTCGGGGATTATGTGATAGAAGGAAATAGCTTTGGGTTTGATTCTGAGGAAGAGGAAGCGGGGTTGAATGCGAATGCGAATGCGAATGCGAATGCGGCTGAGAGGGATGGGGGCTTTGAAGTGGGCTTTGATAGTAATCTGGTGAATGTGTTTCTTCTGCTACATGCCTTTGGTCCGTCAAGTAGCGACCTCAATAGACGGCTAAGACAGCAGCAGCAGCAGCAGCAGCAGCCAGAGAGGAGTTTCCCTCCCAGACCAAATGAAGGCGGTACAGGTATTATTCCCAACACCACTCCGCTCAATAGTTTCGATTCCATTCATCTAGACAGTGACAATGGCGGAAGTGGTGACAATGAGAACGGGGGCAGCATGTCATTGGTTAGCCGCCTCCGCCGCCACGGAAGGGTGCTGTTGGGACGGTCAGGTAGGAGACGTAGGCATAGAGAGAGTAGTAGTAGATGA

Protein sequence

MNIETRMAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVEDASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHVRQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDSEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGRVLLGRSGRRRRHRESSSR
BLAST of Cp4.1LG04g00120 vs. TrEMBL
Match: A0A0A0K3F0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G232490 PE=4 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 1.2e-170
Identity = 308/377 (81.70%), Postives = 331/377 (87.80%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG + RRR+ASRQFRSNPYPLPYKR F EDLCPK+CV+A+DKKYWED+TCSVCMEYPH
Sbjct: 1   MAKGSKGRRRLASRQFRSNPYPLPYKRVFPEDLCPKDCVNAVDKKYWEDSTCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSN+AQ V  SID   +V+D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNNAQTVSASIDNPGVVQD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
            S LGE+HE TELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTF+GNYKELRKHV
Sbjct: 121 PSLLGENHEATELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFVGNYKELRKHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWR LERE ERNDVMSTIRSTMPGAVVFGDYVIEGN+FGFDS
Sbjct: 181 RSEHPSARPREVDPVLEQKWRSLERERERNDVMSTIRSTMPGAVVFGDYVIEGNNFGFDS 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ GLNAN+      AER+ GFEVGFDSNLVN+FLLLHAFGPSS DLNRRLR      
Sbjct: 241 DEEDGGLNANS------AERNAGFEVGFDSNLVNMFLLLHAFGPSSGDLNRRLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER F PR NEG  G IPNTTPL+SFDSI+  SDNGGS D++NG  MSLVSRLR HGR
Sbjct: 301 -QPERIFSPRSNEGVAG-IPNTTPLSSFDSINEGSDNGGS-DDDNGSGMSLVSRLRHHGR 360

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR+RE+SS
Sbjct: 361 VLLGRSGRRRRNRENSS 362

BLAST of Cp4.1LG04g00120 vs. TrEMBL
Match: W9SBD5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024149 PE=4 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 9.0e-131
Identity = 248/377 (65.78%), Postives = 283/377 (75.07%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R R+R+ SR +RS PYPL + +D  EDL PK C    D K WED TCSVCMEYPH
Sbjct: 1   MAKGGRGRQRLTSRHYRSAPYPLRFDQDIMEDLRPKKCSKTWDNKDWEDVTCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHD GCRPYMCGTS R+SNCLDQYKKAYTKV+S+N  Q +H S D   +++D
Sbjct: 61  NAVLLLCSSHDNGCRPYMCGTSFRHSNCLDQYKKAYTKVVSTNHGQSLHGSADNPIVIQD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           +S   E+ EVTELACPLCRGQVKGWTVV+PAREYLNAKKR+C+ D C+F+GNYKELRKHV
Sbjct: 121 SSSAVENTEVTELACPLCRGQVKGWTVVDPAREYLNAKKRSCVHDDCSFIGNYKELRKHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNS-FGFD 246
           R EHP  RPREVDP +EQKWRRLERE ERNDV+STI STMPGA+ FGDYVIEGN+ +GF 
Sbjct: 181 RAEHPSARPREVDPAVEQKWRRLERERERNDVISTITSTMPGAMFFGDYVIEGNNYYGFI 240

Query: 247 SEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSD-LNRRLRQQQQ 306
           S++E          +A A ER+GGFE+GFDSNLVNVFLLLHAFGPS  D LNRRL     
Sbjct: 241 SDDE-------GGTDAGAGERNGGFEMGFDSNLVNVFLLLHAFGPSGGDELNRRL----- 300

Query: 307 QQQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGG-SGDNENGGSMSLVSRLRR 366
              QP   F    N G  G I +TTP+   +S   D DN     D++ GG MSL SRLRR
Sbjct: 301 --PQPAMPFRQTTN-GSAGGIRHTTPVGGAESSDQDDDNDSHDDDDDGGGDMSLASRLRR 360

Query: 367 HGRVLLGRSGRRRRHRE 381
           HGRVL+GRSGRRRR RE
Sbjct: 361 HGRVLMGRSGRRRRRRE 362

BLAST of Cp4.1LG04g00120 vs. TrEMBL
Match: M5XB06_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007591mg PE=4 SV=1)

HSP 1 Score: 471.5 bits (1212), Expect = 1.0e-129
Identity = 241/377 (63.93%), Postives = 292/377 (77.45%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R RRR+ASR++R  PYPL   +D  EDLCPK C   L+KK W+DATCSVCMEYPH
Sbjct: 1   MAKGSRGRRRIASRRYRPTPYPLRSNQDILEDLCPKKCSRDLEKKDWDDATCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTS R+SNCL QYKKAYTK++SS+  QP+  S +   ++ D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSFRHSNCLGQYKKAYTKMVSSDHGQPLLGSDNNPIVLPD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           + +  +  EV+ELACPLCRG+VKGWTV+EPAR+YLNAKKR+CMQ++C+F+GNYKEL++HV
Sbjct: 121 SEWPAQKCEVSELACPLCRGKVKGWTVLEPARDYLNAKKRSCMQENCSFVGNYKELKRHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWRRLE E E +DV+STI+S+MPGA+VFGDYVIEGN++GFD+
Sbjct: 181 RAEHPSARPREVDPVLEQKWRRLEHERETDDVISTIQSSMPGAMVFGDYVIEGNNYGFDT 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ G       +A A ER+GGF +GFD NLVNVF LLHAFG S +    RLR      
Sbjct: 241 DEEDGGF------DAEAGERNGGFGLGFDGNLVNVFFLLHAFGSSGTG---RLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER+    P++G    I ++TP+   DS   D +N  +GDN  GG MSLVSRLRRHGR
Sbjct: 301 -QPERAL-HHPSDGSAVGIRHSTPIGGSDSSDQDDENDSNGDNA-GGGMSLVSRLRRHGR 359

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR RE +S
Sbjct: 361 VLLGRSGRRRRRREGNS 359

BLAST of Cp4.1LG04g00120 vs. TrEMBL
Match: Q2Z1Y7_PRUMU (Pm27 protein OS=Prunus mume GN=Pm27 PE=2 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 2.9e-129
Identity = 240/377 (63.66%), Postives = 292/377 (77.45%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R RRR+ASR++R  PYPL   +D  EDLCPK C   L+KK W+DATCSVCMEYPH
Sbjct: 1   MAKGSRGRRRIASRRYRPTPYPLRSNQDILEDLCPKKCSRDLEKKDWDDATCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTS R+SNCL QYKKAYTK++SS+  QP+  S +   ++ D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSFRHSNCLGQYKKAYTKMVSSDHGQPLLGSDNNPIVLPD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           + +  +  EV+ELACPLCRG+VKGWTV+EPAR+YLNAKKR+CMQ++C+F+GNYKEL++HV
Sbjct: 121 SEWPAQKCEVSELACPLCRGKVKGWTVLEPARDYLNAKKRSCMQENCSFVGNYKELKRHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWRRLE E E +DV+STI+S+MPGA+VFGDYVIEGN++GFD+
Sbjct: 181 RAEHPSARPREVDPVLEQKWRRLEHERETDDVISTIQSSMPGAMVFGDYVIEGNNYGFDT 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ G       +A A ER+GGF +GFD NLVNVF LLHAFG S +    RLR      
Sbjct: 241 DEEDGGF------DAEAGERNGGFGLGFDGNLVNVFFLLHAFGSSGTG---RLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER+    P++G    I +++P+   DS   D +N  +GDN  GG MSLVSRLRRHGR
Sbjct: 301 -QPERAL-HHPSDGSAVGIRHSSPIGGSDSSDQDDENDSNGDNA-GGGMSLVSRLRRHGR 359

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR RE +S
Sbjct: 361 VLLGRSGRRRRRREGNS 359

BLAST of Cp4.1LG04g00120 vs. TrEMBL
Match: B9SRK5_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1412330 PE=4 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 4.2e-128
Identity = 252/378 (66.67%), Postives = 285/378 (75.40%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPY-KRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYP 66
           MAKG R RR  ASRQ R  PYPLP   RD +EDL  K      DKK WED TCSVCME P
Sbjct: 1   MAKGSRARRGTASRQCRLAPYPLPSCDRDVSEDLYLKKSSKMFDKKDWEDVTCSVCMECP 60

Query: 67  HNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVE 126
           HNAVLLLCSSHDKGCRPYMCGTS RYSNCLDQYKKAYTKV SSN       + D S L+ 
Sbjct: 61  HNAVLLLCSSHDKGCRPYMCGTSFRYSNCLDQYKKAYTKVTSSNG------TADNSILLS 120

Query: 127 DASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKH 186
           D+ +  +  EVTELACPLCRGQVKGWTVVEPAR+YLNAK+R+CMQD C+F+G +KELRKH
Sbjct: 121 DSGWPVDKCEVTELACPLCRGQVKGWTVVEPARDYLNAKRRSCMQDDCSFVGTFKELRKH 180

Query: 187 VRQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFD 246
           +R  HP  RPREVDP+LEQKWRRLERE E +DV+STIRSTMPGA+VFGDYVIEGN+ GFD
Sbjct: 181 MRTAHPSARPREVDPMLEQKWRRLEREREHDDVISTIRSTMPGAMVFGDYVIEGNNHGFD 240

Query: 247 SEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQ 306
           S+EE  G +A+A      AER+GG +VGFD NLVNVFLLLHAFGPS   L+RRLRQ    
Sbjct: 241 SDEENGGFDADA------AERNGGLDVGFDRNLVNVFLLLHAFGPSGDGLSRRLRQ---- 300

Query: 307 QQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENG-GSMSLVSRLRRH 366
              PERS     NE  T    + +P+N  +S    SD+   GDN++G G +SLVSRLRRH
Sbjct: 301 ---PERSSYRASNESATDT-HHVSPVNGLNS----SDDDNDGDNDHGDGGLSLVSRLRRH 354

Query: 367 GRVLLGRSGRRRRHRESS 383
           GRVLLGRSGRRRRHRE+S
Sbjct: 361 GRVLLGRSGRRRRHRETS 354

BLAST of Cp4.1LG04g00120 vs. TAIR10
Match: AT1G68140.1 (AT1G68140.1 Protein of unknown function (DUF1644))

HSP 1 Score: 328.2 bits (840), Expect = 6.9e-90
Identity = 187/379 (49.34%), Postives = 234/379 (61.74%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSN--PYPLPY-KRDFTEDLCPKNCVSALDKKYWEDATCSVCME 66
           MAK R+  RR+ SR+FR+   PY  P  KR    ++  ++C   L+K+ WE+  CSVCME
Sbjct: 1   MAKARKAGRRVPSRRFRARAKPYKFPSSKRLVARNMFAEDCSKCLEKRDWENVICSVCME 60

Query: 67  YPHNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSAL 126
            PHNAVLLLCSSHDKGCRPYMCGTS RYSNCLDQYKKA  K+ +S      H  I+ S  
Sbjct: 61  CPHNAVLLLCSSHDKGCRPYMCGTSFRYSNCLDQYKKASAKLKTSG-----HQQINKS-- 120

Query: 127 VEDASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELR 186
                      E+  L CPLCRGQVKGWT+V+PAR++LN KKR CMQ++C + G +KELR
Sbjct: 121 -----------ELGNLTCPLCRGQVKGWTIVQPARDFLNLKKRICMQENCVYAGTFKELR 180

Query: 187 KHVRQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFG 246
           KH++ +HP  +PREVDP +EQ WRRLE E +R+DVMSTIRSTMPG VV+GDYVIE N   
Sbjct: 181 KHMKVDHPSAKPREVDPDVEQNWRRLEIEHDRDDVMSTIRSTMPGTVVYGDYVIERN--- 240

Query: 247 FDSEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQ 306
                     NAN + +    + D G +  F  NLVNVFLLLHAFG S + + R      
Sbjct: 241 ----------NANGSDSDEGGD-DDGIDAAFGRNLVNVFLLLHAFGASGNQIRR------ 300

Query: 307 QQQQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRR 366
                         +   T I   T+ LN  +    + +     +  +  S SL SR+RR
Sbjct: 301 ----------SDSDSNDSTTINRGTSELNFSEEEEEEEE-----EERHSNSNSLASRMRR 326

Query: 367 HGRVLLGRSGRRRRHRESS 383
            GRVLLGRSGRRRR RE++
Sbjct: 361 QGRVLLGRSGRRRRDREAN 326

BLAST of Cp4.1LG04g00120 vs. TAIR10
Match: AT1G77770.1 (AT1G77770.1 Protein of unknown function (DUF1644))

HSP 1 Score: 238.4 bits (607), Expect = 7.2e-63
Identity = 114/202 (56.44%), Postives = 145/202 (71.78%), Query Frame = 1

Query: 50  KKYWEDATCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSN 109
           KK W  +TC VC+E PHNAVLLLCSS+ KGCRPYMC TS R++NCLDQY+K+Y    + N
Sbjct: 23  KKEWAGSTCPVCLESPHNAVLLLCSSYHKGCRPYMCATSSRFANCLDQYRKSYG---NEN 82

Query: 110 SAQPVHTSIDYSALVEDASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCM 169
           S QP                        EL CPLCRGQVKGWTVV+ AR + N+K+RTCM
Sbjct: 83  SGQP------------------------ELLCPLCRGQVKGWTVVKDARMHFNSKRRTCM 142

Query: 170 QDSCTFLGNYKELRKHVRQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGA 229
           QD+C+FLGN+++L+KH++++HP   PR +DP LE KW+RLERE +R DV+STI S+ PGA
Sbjct: 143 QDNCSFLGNFRKLKKHMKEKHPHACPRAIDPALETKWKRLERERDRRDVISTIMSSTPGA 197

Query: 230 VVFGDYVIEGNSFG-FDSEEEE 251
           VV GDYVIE ++ G +D E+EE
Sbjct: 203 VVLGDYVIEPHNRGVYDEEDEE 197

BLAST of Cp4.1LG04g00120 vs. TAIR10
Match: AT4G08460.1 (AT4G08460.1 Protein of unknown function (DUF1644))

HSP 1 Score: 234.2 bits (596), Expect = 1.4e-61
Identity = 119/241 (49.38%), Postives = 159/241 (65.98%), Query Frame = 1

Query: 34  DFTEDLCPKNCVSALDKKYWEDATCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSLRYSN 93
           D  + L  K    AL +K W   TC VC+E PHN+V+LLCSS+ KGCRPYMC T  R+SN
Sbjct: 25  DINKALQEKGYGKALKRKPWTGVTCPVCLEVPHNSVVLLCSSYHKGCRPYMCATGNRFSN 84

Query: 94  CLDQYKKAYTKVISSNSAQPVHTSIDYSALVEDASFLGESHEVTELACPLCRGQVKGWTV 153
           CL+QYKKAY K     S +P                        EL CPLCRGQVKGWTV
Sbjct: 85  CLEQYKKAYAK--DEKSDKP-----------------------PELLCPLCRGQVKGWTV 144

Query: 154 VEPAREYLNAKKRTCMQDSCTFLGNYKELRKHVRQEHPFTRPREVDPVLEQKWRRLEREC 213
           VE  R+YLN+KKR+CM D C F G+Y++L+KHV++ HP  +PR +DPVLE KW++LE E 
Sbjct: 145 VEKERKYLNSKKRSCMNDECLFYGSYRQLKKHVKENHPRAKPRAIDPVLEAKWKKLEVER 204

Query: 214 ERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDSEEEEAGLNANANANANAAERDGG-FEV 273
           ER+DV+ST+ S+ PGA+VFGDYVIE  + G+D +++     ++  ++++  E +GG FE+
Sbjct: 205 ERSDVISTVMSSTPGAMVFGDYVIEPYN-GYDHQDD-----SDDYSDSSDDEMEGGVFEL 234

BLAST of Cp4.1LG04g00120 vs. TAIR10
Match: AT3G24740.1 (AT3G24740.1 Protein of unknown function (DUF1644))

HSP 1 Score: 202.2 bits (513), Expect = 5.7e-52
Identity = 133/361 (36.84%), Postives = 184/361 (50.97%), Query Frame = 1

Query: 39  LCPKNCVSALDKKYWEDATCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQY 98
           L  ++ V AL K+  ++ +C VCM++PHNAVLLLCSSHDKGCR Y+C TS R+SNCLD++
Sbjct: 8   LSTESDVHALHKEL-DEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRF 67

Query: 99  KKAYTKVIS-------------SNSAQPVHTSIDYSALVEDASFLG-------------- 158
           KK +++  +             +N +   H +   S+   ++   G              
Sbjct: 68  KKLHSESANDPTPEANLASREHNNESLYEHGTASRSSFHRESGNRGSSWDSESLRRRRRV 127

Query: 159 ----ESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHVR 218
               ES ++T L CPLCRG V GW VVE  R YL+ K R+C ++SC+F GNY++LR+H R
Sbjct: 128 EEEVESEDITNLKCPLCRGTVLGWKVVEEVRTYLDHKNRSCSRESCSFTGNYQDLRRHAR 187

Query: 219 QEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDSE 278
           + HP TRP + DP  E+ WRRLE + E  D++S IRS MPGAVV GDYVIE N   F  E
Sbjct: 188 RTHPTTRPSDTDPSRERAWRRLENQREYGDIVSAIRSAMPGAVVVGDYVIE-NGDRFAGE 247

Query: 279 EEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQQ 338
            E     ++        +  G  + G  S          A G      + R R  +  + 
Sbjct: 248 RETGNGGSDLWTTLLLFQMIGSLDNGGSS----------ASGSGGGSRSHRSRAWRNHR- 307

Query: 339 QPERSFPPRPNEGGTGIIPNTTPLNSFD--SIHLDSDNGGSGDNENGGSMSLVSRLRRHG 367
              RS   RP   G  ++      N+ D     L +D GG+       S  +  R RR G
Sbjct: 308 ---RSSSDRPYLWGENLLGLQDERNNNDDEEFRLQNDAGGA-------STPVPRRRRRFG 345

BLAST of Cp4.1LG04g00120 vs. TAIR10
Match: AT4G31410.1 (AT4G31410.1 Protein of unknown function (DUF1644))

HSP 1 Score: 162.9 bits (411), Expect = 3.8e-40
Identity = 105/311 (33.76%), Postives = 148/311 (47.59%), Query Frame = 1

Query: 53  WEDATCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQ 112
           W+D TC +C+++PHN VLL CSS+  GCR ++C T   +SNCLD++      + +  +  
Sbjct: 29  WDDLTCPICLDFPHNGVLLQCSSYGNGCRAFVCNTDHLHSNCLDRF------ISACGTES 88

Query: 113 PVHTSIDYSALVEDASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDS 172
           P       S ++E++          +  CPLCRG+V GW VVE AR  L+ KKR C ++ 
Sbjct: 89  PPAPDEPRSKVLEESC---------KPVCPLCRGEVTGWLVVEEARLRLDEKKRCCEEER 148

Query: 173 CTFLGNYKELRKHVRQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVF 232
           C F+G Y ELRKH + EHP +RP E+DP  +  W   ++  E  DV+STI S +P  VV 
Sbjct: 149 CRFMGTYLELRKHAQSEHPDSRPSEIDPARKLDWENFQQSSEIIDVLSTIHSEVPRGVVL 208

Query: 233 GDYVIEGNSFGFDSEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSS 292
           GDYVIE        E E+   N                    + N     +L   F    
Sbjct: 209 GDYVIEYGDDDTGDEFEDVPNN--------------------EGNWWTSCILYQMFDNIR 268

Query: 293 SDLNRRLRQQQQQQQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENG 352
           +  NRR  +  + ++   RS     N   + +     P    D I  D     SG N + 
Sbjct: 269 NARNRRRSRMSESRRGSRRSSYENSNSDDSSVASIEFPEYRVDEID-DEFISTSGANRS- 302

Query: 353 GSMSLVSRLRR 364
            SM   SR RR
Sbjct: 329 SSMHQSSRRRR 302

BLAST of Cp4.1LG04g00120 vs. NCBI nr
Match: gi|659092678|ref|XP_008447168.1| (PREDICTED: uncharacterized protein LOC103489685 [Cucumis melo])

HSP 1 Score: 608.6 bits (1568), Expect = 7.5e-171
Identity = 308/377 (81.70%), Postives = 332/377 (88.06%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG + RRR+ASRQFRSNPYPLPYKR F EDLCP++CV+A+DKKYWED+TCSVCMEYPH
Sbjct: 1   MAKGSKGRRRLASRQFRSNPYPLPYKRVFPEDLCPRDCVNAVDKKYWEDSTCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSN+ Q V  SID   +V+D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNNVQAVSASIDNPGVVQD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
            S LGE+HEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTF+GNYKELRKHV
Sbjct: 121 PSLLGENHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFVGNYKELRKHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWR LERE ERNDVMSTIRSTMPGAVVFGDYVIEGN+FGFDS
Sbjct: 181 RSEHPSARPREVDPVLEQKWRSLERERERNDVMSTIRSTMPGAVVFGDYVIEGNNFGFDS 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ GLNAN+      AER+GGFEVGFDSNLVN+FLLLHAFGPSS DLNRRLRQ     
Sbjct: 241 DEEDGGLNANS------AERNGGFEVGFDSNLVNMFLLLHAFGPSSGDLNRRLRQ----- 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
             PER F PR NEG  GI PNTTPL+SFDSI+  SDNGGS D++NG  MSLVSRLR HGR
Sbjct: 301 --PERIFSPRSNEGVAGI-PNTTPLSSFDSINEGSDNGGS-DDDNGSGMSLVSRLRHHGR 360

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR+RE+SS
Sbjct: 361 VLLGRSGRRRRNRENSS 362

BLAST of Cp4.1LG04g00120 vs. NCBI nr
Match: gi|449444094|ref|XP_004139810.1| (PREDICTED: uncharacterized protein LOC101208946 [Cucumis sativus])

HSP 1 Score: 607.4 bits (1565), Expect = 1.7e-170
Identity = 308/377 (81.70%), Postives = 331/377 (87.80%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG + RRR+ASRQFRSNPYPLPYKR F EDLCPK+CV+A+DKKYWED+TCSVCMEYPH
Sbjct: 1   MAKGSKGRRRLASRQFRSNPYPLPYKRVFPEDLCPKDCVNAVDKKYWEDSTCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSN+AQ V  SID   +V+D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNNAQTVSASIDNPGVVQD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
            S LGE+HE TELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTF+GNYKELRKHV
Sbjct: 121 PSLLGENHEATELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFVGNYKELRKHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWR LERE ERNDVMSTIRSTMPGAVVFGDYVIEGN+FGFDS
Sbjct: 181 RSEHPSARPREVDPVLEQKWRSLERERERNDVMSTIRSTMPGAVVFGDYVIEGNNFGFDS 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ GLNAN+      AER+ GFEVGFDSNLVN+FLLLHAFGPSS DLNRRLR      
Sbjct: 241 DEEDGGLNANS------AERNAGFEVGFDSNLVNMFLLLHAFGPSSGDLNRRLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER F PR NEG  G IPNTTPL+SFDSI+  SDNGGS D++NG  MSLVSRLR HGR
Sbjct: 301 -QPERIFSPRSNEGVAG-IPNTTPLSSFDSINEGSDNGGS-DDDNGSGMSLVSRLRHHGR 360

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR+RE+SS
Sbjct: 361 VLLGRSGRRRRNRENSS 362

BLAST of Cp4.1LG04g00120 vs. NCBI nr
Match: gi|703127504|ref|XP_010103847.1| (hypothetical protein L484_024149 [Morus notabilis])

HSP 1 Score: 474.9 bits (1221), Expect = 1.3e-130
Identity = 248/377 (65.78%), Postives = 283/377 (75.07%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R R+R+ SR +RS PYPL + +D  EDL PK C    D K WED TCSVCMEYPH
Sbjct: 1   MAKGGRGRQRLTSRHYRSAPYPLRFDQDIMEDLRPKKCSKTWDNKDWEDVTCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHD GCRPYMCGTS R+SNCLDQYKKAYTKV+S+N  Q +H S D   +++D
Sbjct: 61  NAVLLLCSSHDNGCRPYMCGTSFRHSNCLDQYKKAYTKVVSTNHGQSLHGSADNPIVIQD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           +S   E+ EVTELACPLCRGQVKGWTVV+PAREYLNAKKR+C+ D C+F+GNYKELRKHV
Sbjct: 121 SSSAVENTEVTELACPLCRGQVKGWTVVDPAREYLNAKKRSCVHDDCSFIGNYKELRKHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNS-FGFD 246
           R EHP  RPREVDP +EQKWRRLERE ERNDV+STI STMPGA+ FGDYVIEGN+ +GF 
Sbjct: 181 RAEHPSARPREVDPAVEQKWRRLERERERNDVISTITSTMPGAMFFGDYVIEGNNYYGFI 240

Query: 247 SEEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSD-LNRRLRQQQQ 306
           S++E          +A A ER+GGFE+GFDSNLVNVFLLLHAFGPS  D LNRRL     
Sbjct: 241 SDDE-------GGTDAGAGERNGGFEMGFDSNLVNVFLLLHAFGPSGGDELNRRL----- 300

Query: 307 QQQQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGG-SGDNENGGSMSLVSRLRR 366
              QP   F    N G  G I +TTP+   +S   D DN     D++ GG MSL SRLRR
Sbjct: 301 --PQPAMPFRQTTN-GSAGGIRHTTPVGGAESSDQDDDNDSHDDDDDGGGDMSLASRLRR 360

Query: 367 HGRVLLGRSGRRRRHRE 381
           HGRVL+GRSGRRRR RE
Sbjct: 361 HGRVLMGRSGRRRRRRE 362

BLAST of Cp4.1LG04g00120 vs. NCBI nr
Match: gi|596128229|ref|XP_007222088.1| (hypothetical protein PRUPE_ppa007591mg [Prunus persica])

HSP 1 Score: 471.5 bits (1212), Expect = 1.4e-129
Identity = 241/377 (63.93%), Postives = 292/377 (77.45%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R RRR+ASR++R  PYPL   +D  EDLCPK C   L+KK W+DATCSVCMEYPH
Sbjct: 1   MAKGSRGRRRIASRRYRPTPYPLRSNQDILEDLCPKKCSRDLEKKDWDDATCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTS R+SNCL QYKKAYTK++SS+  QP+  S +   ++ D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSFRHSNCLGQYKKAYTKMVSSDHGQPLLGSDNNPIVLPD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           + +  +  EV+ELACPLCRG+VKGWTV+EPAR+YLNAKKR+CMQ++C+F+GNYKEL++HV
Sbjct: 121 SEWPAQKCEVSELACPLCRGKVKGWTVLEPARDYLNAKKRSCMQENCSFVGNYKELKRHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           R EHP  RPREVDPVLEQKWRRLE E E +DV+STI+S+MPGA+VFGDYVIEGN++GFD+
Sbjct: 181 RAEHPSARPREVDPVLEQKWRRLEHERETDDVISTIQSSMPGAMVFGDYVIEGNNYGFDT 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ G       +A A ER+GGF +GFD NLVNVF LLHAFG S +    RLR      
Sbjct: 241 DEEDGGF------DAEAGERNGGFGLGFDGNLVNVFFLLHAFGSSGTG---RLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER+    P++G    I ++TP+   DS   D +N  +GDN  GG MSLVSRLRRHGR
Sbjct: 301 -QPERAL-HHPSDGSAVGIRHSTPIGGSDSSDQDDENDSNGDNA-GGGMSLVSRLRRHGR 359

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRRRR RE +S
Sbjct: 361 VLLGRSGRRRRRREGNS 359

BLAST of Cp4.1LG04g00120 vs. NCBI nr
Match: gi|658008162|ref|XP_008339269.1| (PREDICTED: uncharacterized protein LOC103402311 [Malus domestica])

HSP 1 Score: 469.9 bits (1208), Expect = 4.2e-129
Identity = 243/377 (64.46%), Postives = 288/377 (76.39%), Query Frame = 1

Query: 7   MAKGRRVRRRMASRQFRSNPYPLPYKRDFTEDLCPKNCVSALDKKYWEDATCSVCMEYPH 66
           MAKG R RRR+ASR++R+ PYPL   +D  EDLCP+ C   L+KK WEDATCSVCMEYPH
Sbjct: 1   MAKGSRGRRRIASRRYRATPYPLQNNQDLLEDLCPRKCRD-LEKKDWEDATCSVCMEYPH 60

Query: 67  NAVLLLCSSHDKGCRPYMCGTSLRYSNCLDQYKKAYTKVISSNSAQPVHTSIDYSALVED 126
           NAVLLLCSSHDKGCRPYMCGTS R+SNCL QYKKAYTK+ SS+  QP+  S +    V D
Sbjct: 61  NAVLLLCSSHDKGCRPYMCGTSFRHSNCLGQYKKAYTKMGSSDHGQPLLGSNNNPTAVPD 120

Query: 127 ASFLGESHEVTELACPLCRGQVKGWTVVEPAREYLNAKKRTCMQDSCTFLGNYKELRKHV 186
           A +  +  EVTELACPLCRGQVKGWTV+EPAR+YLN K+R+CMQ++C F+GNYKELR+HV
Sbjct: 121 AGWPVQRCEVTELACPLCRGQVKGWTVLEPARDYLNTKRRSCMQENCXFVGNYKELRRHV 180

Query: 187 RQEHPFTRPREVDPVLEQKWRRLERECERNDVMSTIRSTMPGAVVFGDYVIEGNSFGFDS 246
           ++EHP  RPREVDP LEQKWR LE E E ND+MSTI+STMPGA+VFGDYVIE N++G D+
Sbjct: 181 KEEHPSARPREVDPDLEQKWRSLEHERETNDLMSTIQSTMPGAMVFGDYVIERNNYGLDT 240

Query: 247 EEEEAGLNANANANANAAERDGGFEVGFDSNLVNVFLLLHAFGPSSSDLNRRLRQQQQQQ 306
           +EE+ G       +A+A ER+GGF +GFD NLVNVF LLHAFGPS +DL RRLR      
Sbjct: 241 DEEDGGF------DADAPERNGGFGLGFDGNLVNVFFLLHAFGPSGTDLTRRLR------ 300

Query: 307 QQPERSFPPRPNEGGTGIIPNTTPLNSFDSIHLDSDNGGSGDNENGGSMSLVSRLRRHGR 366
            QPER+F     +G    I +TTP+   DS   D +N   GD ++GG MSLVSRLRRHGR
Sbjct: 301 -QPERAF-HHSADGSAAGIRHTTPIGGLDSSDQDEENESDGD-DDGGGMSLVSRLRRHGR 360

Query: 367 VLLGRSGRRRRHRESSS 384
           VLLGRSGRR   RE +S
Sbjct: 361 VLLGRSGRRGARREGNS 361

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0K3F0_CUCSA1.2e-17081.70Uncharacterized protein OS=Cucumis sativus GN=Csa_7G232490 PE=4 SV=1[more]
W9SBD5_9ROSA9.0e-13165.78Uncharacterized protein OS=Morus notabilis GN=L484_024149 PE=4 SV=1[more]
M5XB06_PRUPE1.0e-12963.93Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007591mg PE=4 SV=1[more]
Q2Z1Y7_PRUMU2.9e-12963.66Pm27 protein OS=Prunus mume GN=Pm27 PE=2 SV=1[more]
B9SRK5_RICCO4.2e-12866.67Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1412330 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G68140.16.9e-9049.34 Protein of unknown function (DUF1644)[more]
AT1G77770.17.2e-6356.44 Protein of unknown function (DUF1644)[more]
AT4G08460.11.4e-6149.38 Protein of unknown function (DUF1644)[more]
AT3G24740.15.7e-5236.84 Protein of unknown function (DUF1644)[more]
AT4G31410.13.8e-4033.76 Protein of unknown function (DUF1644)[more]
Match NameE-valueIdentityDescription
gi|659092678|ref|XP_008447168.1|7.5e-17181.70PREDICTED: uncharacterized protein LOC103489685 [Cucumis melo][more]
gi|449444094|ref|XP_004139810.1|1.7e-17081.70PREDICTED: uncharacterized protein LOC101208946 [Cucumis sativus][more]
gi|703127504|ref|XP_010103847.1|1.3e-13065.78hypothetical protein L484_024149 [Morus notabilis][more]
gi|596128229|ref|XP_007222088.1|1.4e-12963.93hypothetical protein PRUPE_ppa007591mg [Prunus persica][more]
gi|658008162|ref|XP_008339269.1|4.2e-12964.46PREDICTED: uncharacterized protein LOC103402311 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013083Znf_RING/FYVE/PHD
IPR012866DUF1644
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g00120.1Cp4.1LG04g00120.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012866Protein of unknown function DUF1644PFAMPF07800DUF1644coord: 54..220
score: 9.7
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3DG3DSA:3.30.40.10coord: 47..99
score: 1.5E-4coord: 135..149
score: 1.
NoneNo IPR availablePANTHERPTHR31197FAMILY NOT NAMEDcoord: 8..384
score: 5.1E
NoneNo IPR availablePANTHERPTHR31197:SF12SUBFAMILY NOT NAMEDcoord: 8..384
score: 5.1E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG04g00120CmaCh11G011800Cucurbita maxima (Rimu)cmacpeB147
Cp4.1LG04g00120CmoCh11G012600Cucurbita moschata (Rifu)cmocpeB130
Cp4.1LG04g00120Carg17006Silver-seed gourdcarcpeB0657
The following gene(s) are paralogous to this gene:

None