Cp4.1LG13g04110 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g04110
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Ribosome maturation factor rimP
Location: Cp4.1LG13 : 6263771 .. 6270716 (-)
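The location field packs the linkage group, start, end, and strand into one string. A minimal Python sketch of how it could be parsed is given here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the GFF3 convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, as in GFF3.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG13 : 6263771 .. 6270716 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG13 - 6946
```

Under that assumption the gene spans 6946 bp on the minus strand of Cp4.1LG13.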
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATTATTCTACCAATTACCAAATTTCACATTCCAAATATAATTTTATCAATAATTTTGAAAATAATGAAGTATTAATCTTCTGGTATGTAATGAATCGTGAGTTTTAATATTTAAGGATTTAAATTAAAAATAAAAATTATGGACGAGAAGTATAATCGAAACATATAGTTGAATTGACATGTAATTATCCCCATTTGTTGGATTGCGTCTCTGCGGAGCGGCCGCCATAGCAGCTGTGTGTTGTGAGCTTCATCATTCCCAGTTTTCCAGTTTCCTCCGTTAAGAGAAGTGATGGTCATGATTACTGTCCGTAATTTCAGACTCTCCGCAGTTCCAATTTCATCAATCCTTCCACAGAGTGGCTTCCGTTCGTCTTTAATCTCGTCCAGAAATCTTCCCTTCCCATTTGTTGGCCATCGGTTTCCTTCTACGTCGAACAATCCCTTATTACTTCATGCCAGAAAGAAAAACTCAGAATCAGAGCCAGTTATCAAACGAAACATCGTTGAGGAAGTGTCGGAGGATGAAGAAGACGATGAACTTTTCGACGAACTTGAAGACGGTAAGATCCATCAATAAAACCTTCGCTCAATTTATTCCTCTAGTTTCTATTGAATCCTAGTTTAGACTGGTCTTGGAATGAGTAAATTTCGGGGAACTTTCTTGATTGCGTAGATGAGATAATGGACGATGATGGCGAAGATTACTTTGAAGAAGAGTTTATGGAGGATAATGCTGAAGTCTATGTAAGATTCTTGTCTCTAAACATGGAGATATTGTGTTGGAAATCTTAACTGAAATTCTGGAGTTGATAATTTGATTTGAAGGTAGGGGATGGCGGGGAAGGAGGTGGAATTTCCCTTGCTGGGACATGGTGGGATAAAGAAGCACTGGCTATGGCTGAAGAGGTTATTCTTTCATTTCATGGCGACTTGAAGATTTATGCGTTCAAAACAGTCTCTAATTCCACTGTTCAAGTGCGTATTGAAAAGCTTTCTAACAAGTAAGTTTTTAGATTCTATGAACACATATTAGACCGTGGTTTTGGTTTTCTTGTATGATAATAATTTCTTGTTGCTGCATTCTTAAGATTGAAATGCCGTCTGCTAGCTCCTTCTTTCTCAAACGGATTGTTTGAGGGATGATGATATTAGAAAATAGTATGATTACTAATCGTAGAGAGTTCACGTTAGAGATTTAGGACGTGTCTGGAATGACTTTCTAAGAGCTTAAAAACACGTTTGTAAACACTTGAAAAATCATTTCAAACAAGCTCTTATACGAGTTCACCCGCTTCATATGGGTTAGAACCAATACGAGTTCAACCTGTACCTATCTTTTAAGCTGGAAATGGAGATGCCTTTTGCTTTCAAGTAAATCAAGAGAAAGCCATTAGTTACCCTTTGGCTTCAAGTTTTCCTTCTGATATTAGTGTTCTCATTGAACCCGTATTCACAACTCGACTCAGAGAAGAATCGGATTTCTTCGAACTCCTTATTGCTTGAATTAAATCACATTAATTCGAAGGATTTCCAGATATTAAGGACGTTTCGAGTTGAAGTCTACTTTAAGCATATGAAAATGTGTTACATTTTACATGTTGTTATCCCAAGTTTACTGCGTTCTTGGTAAGACTGAGTACCATTTCCTAGGTTCTAATTCCTAACAATGCACCTCAGATCGGGTTCCCCCAGGATGGAAGATATCGAGGCTTTCTCTTCAAGATATCGAGCACGATTGGATGAAGCAGAGCTTTCTAAAACCGTGCCAGAGAACATATCTTTAGAGGTATGCTGCCTGTTTCATTGCTCAAATTCCCTCGTTTCTTTTTTCTATATCATCAACCAATCTATCTTTAGCAATTAGAAGAAGATTTTCACTTCCATTATTCAATTTAGTTTCCCACTTGACAAGACCAAAAGAGAATTGCTCTACTCTGTTAGCTTTTGGGTCTAAATGGATTTTAATCCTTCCAACATTATAAGGTCTCATCTC
CTGGTGTTGAACGTGTCGTTCGGATTCCCGACGAGCTGGATCGGTTCAAAGAAAGAGCAATGTATGTGAAATACACGAACGATGTAGTTACACCCAGTTCGTCCTCTGAGAGTGATGGCATTTTCAAGCTTGTGTCATTCGACATAGAAACAAGGTGCTGCATTTGGGGTTTAGCAGATGTGAGGATAAATAGAGAGAAGGCAGGAAAAGGAAGACCACTAAGCAAAAAACAAAGAGAGTGGCGTTTAGAGACTCCTTTCGATTCATTACGCTTGGTTAGACTGTATTCTGATTACTGATAATGCAATCAAATTTTGGTCGAAATTCAAGGTGTCGGTGTATAGTAAACTAGCAATTATTATCTTATGAACGTGTGATTCGTTTTTAATGCTATTCTCTATCTGAACACATTTTGAATACAAATTTTGCAAGATAAACGCTTAATAGTTTCTGGATTGAGTTGGTTTAGTCAACATGAACATTGATATGCATGAATTATACTTGTTTGTTGTCAAATTGTTCTGATTGTATTGGTTGAATTATGAAACTTTATCTGTACTGTAATGGCATTGACTGAGGCACTCCTGGTTTGGTACATGCTAGAAAACGACCAATCAGTCTCTTTACCGGCGTGATGAATGAAAAACGGCTCCTTTTGTCGACACCTTTAGCTTTCTAACAATATGCTGTTTCTATACTATCTTCGTGACTTCACTTATTCATTACATGCATTGTACTTAGTTACCAAAAATATGCATCATTTTACTCATTTATTTCTTGTTGTTTTTCACTTAACTTTTCTGTTTTTGGTGTGTAAGGAAGTCATACAATTGTAAGAGGATTGGAGGCCGTAGCTGAAGGAAAAAGCCTAAAAAGATAGCTTTTTACTTTCTGCTTTATTGACCCATTTCATCATCTTTACGAATTTGAAGGTATGGTGATTAAATAAATGGTGTATTAGGTGCAAATGAAAATTTTAAGTTAAAAGAAGGGCAGGGGTTGGTGCGTGGGGTGATCGAAAGCAGTGTCTGCATGCACCTATTGTTGTCATTATATCCTTATAAAAATCCCATGTGTATTACCTTTTTCATAGCCAGCTCCATTAGAAAATTATCTATAACTCTCTTTTGGAGTACAATATATACTCAACACAGGGTAGTAGGTTTTGAGCACTAATGAGGTTGCTGTTTGAAGGGATGGAGCGGGGGAGAAGAACTCTTGGGGCTGCTAGGGGGTGCTTTTTCATTGCCTTGTGGATGGCCACACAGGGGTTCCCGGTCGAGGATCTCGTCGGGAGGCTCCCCGGCCAGCCAACGGTCAATTTTAGACAATTTGCAGGTTATATCGATGTCGATGTCGACGCCGGAAGAAGTTTGTTTTACTACTTTGCTGAAGGGCAGCAAGATCCCCATCTCCTGCCCCTCACTCTCTGGCTCAATGGAGGTCTGGTTTGGTTTAGAACTGTTTTAGTTTATGATATGATTAGATCTTGAAAGGCCATTCTGTTATACTGATAGAATCAAACTTGGGGGAGGATATTAAATTGTTGAAAGGTCAATGCAGGTCCAGGTTGCTCGTCTGTTGGTGGGGGTGCCTTTACAGAGCTGGGGCCCTTTTATCCCAGAGGTGATGGCCGAGGCCTTCGAAGAAACTCCATGTCCTGGAATAAAGGTAGTGCTTTAGTTTTCTTCTTCTTCTTCTTCTTCTTTTAAATGTTGAAAGAATCTGAATGATATGGTGAATTGAGTTTGATAGCCTCGAATCTGCTGTTTGTGGAGTCGCCTGTGGGGGTAGGATGGTCATACTCAAACACAACGTCAGACTACACTTGTGGTGATGACTCCACAGGTTCTAATTCCTATACTTTAGAGGTTAATTTAATTGAAATTTTGTTTGATTAAAGATTAAATAATGGAGAGAGATTGGATCTGTGCAGCCAGAGACATGCTCACCTTTATGTTGAAATGGTATGACAAGTTTCCAGCTTTCAGAGATAGCTC
ATTTTTCCTCACAGGAGAAAGTTATGCAGGTCATTTTTTATACTTTTTTTTTTATTATTATTAAATTATAAATTTACTTCGTTTTTTGGAACGGTCCAAGCCCACTACTAGTAAATATTATCTTTTTGGGACCTTTCCTTTCGAGTTTTCCCTCAAGGTTTTTAAAACGCGTCTGCCAGGATAGGTTTTCACACCCTTATAAAAAATGTTTCGTTCCTCTTTTCAACCAATGTGGGATCTCGCAATCCACCTTCTTTTGGGCCCAGTGTCTTCGCTAGCACTCGTTTCCCTCTCCAACCGATGTGGGATCTCATAATCTCTACCCCTTTGGAGCCTAGCATCCTCGCTGCCACTCGTTACTCTCTCCAATCGATGTGGGATCTCACAAACATCGGTTAGAGAAAAGGGAATGAAACATTTCTTATAAGAATATAGAAACCTCTCTCTAGCATATGCATTTTAAAAGTTTGTGGGGGAAGTCCGGAAAGCCCAAAAATGACAATATCAACTACTTGTGGGTTTGGGCTATTCCATTTTCGTTCGTATTTATATGATATCTTTATTTTTTTCAAAAAAAAATTAAAATTAAATTTTTTTTTTTTTTTTTTTTTGGAGAAATTACTCTTTTTAAGCCTTGAGATTTGAGTTTTTGGTCCGTAGATTTCAAAATATTACATTTTTCCTTTGAAATCTAAGTTTATACTCACTTCTATTATTTGAACTTAACTTTTTAGTTAAGTAATTTAAAATTGAATTTTTTTATTTATTAATTTTAATAGTGAAAGAAAATTAATAAAAACTAATTAGTAATTTTTATAGACATTAACACCAAGGATCCAAATGAGTTTCTAGTAAAAAATAATAATAATAAAGATAAGAGTTTCGAGCGTTAATCTTCAAATTTAGAAACCAAATAAAAATAAAATTAAGAACTTCAAGAATATTTTGAAATATAACAATCAAATGAAAATTAAACTCAAAATTTAATGATAAAAATGTAACATTTTGACATAGGAATCAAAATGAAAATGAAAACTAGCGATTAAAACCACGATCATGAATTATTAAGCAATATAGTTAATATGTTCAATACCAGTTCAAAATTCTATCTTTAACTTTATAAGTATTGAAGACTAAATCGACAATCGGAGAATGAACGTTATACTAAATTCGTAACTTAATTTTAATTTATTCTGAATGTTAGAAAGAAATGGGAAAAAATGTGAAAGAGAAAAGATATTGATTTGATGCAGGCCATTACATTCCTCAGCTAGCTGAAGCAATGCTGGACTATAACATTCACTCAAAGGGCTTCAAGTTCAATATCAAGGGAGTTGCTGTAAGAACAATGAACTCTAAAACCATTACTTTTTGAATTCAACTTACTTTTGTAATCTAATGCTGCCTGCAGCTTGGGAACCCATTACTTAATCTGGACAGGGATGTCCCAGCAACCTACGAGTTCTTTTGGTCTCATGGCATGATTTCCGACGAGGTTTGGTTTGCAATCAATAGAGACTGTGATTTTGATGATTATCTGTTGACTACTCCACACAATGTAACCAAATCCTGCATCCAAGCCATCGCGGACGCCAACGGTATCGTTAGCGAGTACATAAACACCTACGATGTTCTTTTGGATGTCTGCTACCCCTCCATTGTCCAGCAAGAGCTGAGATTGAAGAAACTGGTTCGTTTATGCTGTTCCTGTTTTCTTTTAGTGCATCGATACGTGGGTCTAAGTGATTCGAACCTCTAATGTTTTAATCAGTTGAGACTCTAAGTTGGTGTCGAGATTGTTATGAGTGGTTCGGTTTGGTTCGTTTTGCAGGCGACTAAAATAAGCATGGGAGTTGACGTGTGCATGACCCAAGAAAGGCGTTTCTATTTCAATTTACCAGAAGTTCAAAAGGCTCTTCATGCAAATCGTACTAATTTGCCTCATCAATGGTTCATGTGCAGTGAGTAAGTAAGCCCACATTCTCTTTTTTGGCATCCAT
ATTACATCATAATAGTTATGCCATGAACACAGCCTAAATCTTGATTGTTGGTGCTTTTGAGCAGCTTAGTAGATTATAATTACAATGATACGAACATCAACATGCTTCCCTCGCTGAAGAGAATAATCCAAAACCACATCCCAATTTGGATCTTCAGGTACGTTTCGAACTTTCTCTTTAAACTTTGAACGTTAAGAATGGTGAATGAATAAATGAATGAATACACGCACAGTGGGGATGAGGATTCAGTGGTGCCATTGATGGGGTCTAGGACGCTAGTCCGAGAGTTAGCTCATGACCTTAAATTCAAGATCACAGTACCCTACGGAGCTTGGTTTCACAAGGGCCAAGTCGGGGGCTGGGCCATCGAGTACGGTAATATGCTGACATACGCAACGGTGCGTGGGGCAGCTCACATGGTGCCCTACGCTCAGCCCTCAAGAGCTCTGCATTTGTTCTCTTCATTTGTAAGAGGGAGGAGGCTGCCCAATTCTACCCGCCCTTCCATCGATGATTGAAACAAAACAAATTAAACGTGAATGGGTTTGGTTTGGTTTGATTGAGTCGGGTTAATGATAATCGAAAGAGTGGAATGTAATGCAAGAATTTGTTTAGAAATAATGAAAAGCTCCCAACTCATTCATTTAGCTTTTGATACACACTCAGAAATATGCTTGGCGAGCAGAGAAACAGAGGGTGGGAAGCAAGAACAGAACAAGTTTATAATGCACACAATCATTCAGGCTTAGCATCAAAGTTAAGTCTTTTTTTGTTTGTATAAAAACGCATCCCATTTCTTACCAACTTCTTCATATCCTTTCTTTAACCAATACTTCTTCAGATATTAATCCCAATGGCCTCTCTCAATACTTCCGAGTACCCATACTTTCCTTTTCCGCCTTATCATCCTTTCTGGCCGCCTCTCCCACCGCCGCACAACCCTATT

mRNA sequence

ATGAGATTATTCTACCAATTACCAAATTTCACATTCCAAATATAATTTTATCAATAATTTTGAAAATAATGAAGTATTAATCTTCTGGTATGTAATGAATCGTGAGTTTTAATATTTAAGGATTTAAATTAAAAATAAAAATTATGGACGAGAAGTATAATCGAAACATATAGTTGAATTGACATGTAATTATCCCCATTTGTTGGATTGCGTCTCTGCGGAGCGGCCGCCATAGCAGCTGTGTGTTGTGAGCTTCATCATTCCCAGTTTTCCAGTTTCCTCCGTTAAGAGAAGTGATGGTCATGATTACTGTCCGTAATTTCAGACTCTCCGCAGTTCCAATTTCATCAATCCTTCCACAGAGTGGCTTCCGTTCGTCTTTAATCTCGTCCAGAAATCTTCCCTTCCCATTTGTTGGCCATCGGTTTCCTTCTACGTCGAACAATCCCTTATTACTTCATGCCAGAAAGAAAAACTCAGAATCAGAGCCAGTTATCAAACGAAACATCGTTGAGGAAGTGTCGGAGGATGAAGAAGACGATGAACTTTTCGACGAACTTGAAGACGATGAGATAATGGACGATGATGGCGAAGATTACTTTGAAGAAGAGTTTATGGAGGATAATGCTGAAGTCTATGTAGGGGATGGCGGGGAAGGAGGTGGAATTTCCCTTGCTGGGACATGGTGGGATAAAGAAGCACTGGCTATGGCTGAAGAGGTTATTCTTTCATTTCATGGCGACTTGAAGATTTATGCGTTCAAAACAGTCTCTAATTCCACTGTTCAAGTGCGTATTGAAAAGCTTTCTAACAAATCGGGTTCCCCCAGGATGGAAGATATCGAGGCTTTCTCTTCAAGATATCGAGCACGATTGGATGAAGCAGAGCTTTCTAAAACCGTGCCAGAGAACATATCTTTAGAGGTCTCATCTCCTGGTGTTGAACGTGTCGTTCGGATTCCCGACGAGCTGGATCGGTTCAAAGAAAGAGCAATGTATGTGAAATACACGAACGATGTAGTTACACCCAGTTCGTCCTCTGAGAGTGATGGCATTTTCAAGCTTGTGTCATTCGACATAGAAACAAGGTGCTGCATTTGGGGTTTAGCAGATGTGAGGATAAATAGAGAGAAGGCAGGAAAAGGAAGACCACTAAGCAAAAAACAAAGAGAGTGGCGTTTAGAGACTCCTTTCGATTCATTACGCTTGGTTAGACTGTATTCTGATTACTGATAATGCAATCAAATTTTGGTCGAAATTCAAGGTGTCGGTGTATAGTAAACTAGCAATTATTATCTTATGAACGTGTGATTCGTTTTTAATGCTATTCTCTATCTGAACACATTTTGAATACAAATTTTGCAAGATAAACGCTTAATAGTTTCTGGATTGAGTTGGTTTAGTCAACATGAACATTGATATGCATGAATTATACTTGTTTGTTGTCAAATTGTTCTGATTGTATTGGTTGAATTATGAAACTTTATCTGTACTGTAATGGCATTGACTGAGGCACTCCTGGTTTGGTACATGCTAGAAAACGACCAATCAGTCTCTTTACCGGCGTGATGAATGAAAAACGGCTCCTTTTGTCGACACCTTTAGCTTTCTAACAATATGCTGTTTCTATACTATCTTCGTGACTTCACTTATTCATTACATGCATTGTACTTAGTTACCAAAAATATGCATCATTTTACTCATTTATTTCTTGTTGTTTTTCACTTAACTTTTCTGTTTTTGGTGTGTAAGGAAGTCATACAATTGTAAGAGGATTGGAGGCCGTAGCTGAAGGAAAAAGCCTAAAAAGATAGCTTTTTACTTTCTGCTTTATTGACCCATTTCATCATCTTTACGAATTTGAAGGGATGGAGCGGGGGAGAAGAACTCTTGGGGCTGCTAGGGGGTGCTTTTTCATTGCCTTGTGGATGGCCACACAGGGGTTCCCGGTCGAGGATCTCGTCGGGAGGCTCCCCGGCCAGCCAACGGTCAATTTTAGAC
AATTTGCAGGTTATATCGATGTCGATGTCGACGCCGGAAGAAGTTTGTTTTACTACTTTGCTGAAGGGCAGCAAGATCCCCATCTCCTGCCCCTCACTCTCTGGCTCAATGGAGGTCCAGGTTGCTCGTCTGTTGGTGGGGGTGCCTTTACAGAGCTGGGGCCCTTTTATCCCAGAGGTGATGGCCGAGGCCTTCGAAGAAACTCCATGTCCTGGAATAAAGCCTCGAATCTGCTGTTTGTGGAGTCGCCTGTGGGGGTAGGATGGTCATACTCAAACACAACGTCAGACTACACTTGTGGTGATGACTCCACAGCCAGAGACATGCTCACCTTTATGTTGAAATGGTATGACAAGTTTCCAGCTTTCAGAGATAGCTCATTTTTCCTCACAGGAGAAAGTTATGCAGGCCATTACATTCCTCAGCTAGCTGAAGCAATGCTGGACTATAACATTCACTCAAAGGGCTTCAAGTTCAATATCAAGGGAGTTGCTCTTGGGAACCCATTACTTAATCTGGACAGGGATGTCCCAGCAACCTACGAGTTCTTTTGGTCTCATGGCATGATTTCCGACGAGGTTTGGTTTGCAATCAATAGAGACTGTGATTTTGATGATTATCTGTTGACTACTCCACACAATGTAACCAAATCCTGCATCCAAGCCATCGCGGACGCCAACGGTATCGTTAGCGAGTACATAAACACCTACGATGTTCTTTTGGATGTCTGCTACCCCTCCATTGTCCAGCAAGAGCTGAGATTGAAGAAACTGGCGACTAAAATAAGCATGGGAGTTGACGTGTGCATGACCCAAGAAAGGCGTTTCTATTTCAATTTACCAGAAGTTCAAAAGGCTCTTCATGCAAATCGTACTAATTTGCCTCATCAATGGTTCATGTGCAGTGACTTAGTAGATTATAATTACAATGATACGAACATCAACATGCTTCCCTCGCTGAAGAGAATAATCCAAAACCACATCCCAATTTGGATCTTCAGTGGGGATGAGGATTCAGTGGTGCCATTGATGGGGTCTAGGACGCTAGTCCGAGAGTTAGCTCATGACCTTAAATTCAAGATCACAGTACCCTACGGAGCTTGGTTTCACAAGGGCCAAGTCGGGGGCTGGGCCATCGAGTACGGTAATATGCTGACATACGCAACGGTGCGTGGGGCAGCTCACATGGTGCCCTACGCTCAGCCCTCAAGAGCTCTGCATTTGTTCTCTTCATTTGTAAGAGGGAGGAGGCTGCCCAATTCTACCCGCCCTTCCATCGATGATTGAAACAAAACAAATTAAACGTGAATGGGTTTGGTTTGGTTTGATTGAGTCGGGTTAATGATAATCGAAAGAGTGGAATGTAATGCAAGAATTTGTTTAGAAATAATGAAAAGCTCCCAACTCATTCATTTAGCTTTTGATACACACTCAGAAATATGCTTGGCGAGCAGAGAAACAGAGGGTGGGAAGCAAGAACAGAACAAGTTTATAATGCACACAATCATTCAGGCTTAGCATCAAAGTTAAGTCTTTTTTTGTTTGTATAAAAACGCATCCCATTTCTTACCAACTTCTTCATATCCTTTCTTTAACCAATACTTCTTCAGATATTAATCCCAATGGCCTCTCTCAATACTTCCGAGTACCCATACTTTCCTTTTCCGCCTTATCATCCTTTCTGGCCGCCTCTCCCACCGCCGCACAACCCTATT

Coding sequence (CDS)

ATGGTCATGATTACTGTCCGTAATTTCAGACTCTCCGCAGTTCCAATTTCATCAATCCTTCCACAGAGTGGCTTCCGTTCGTCTTTAATCTCGTCCAGAAATCTTCCCTTCCCATTTGTTGGCCATCGGTTTCCTTCTACGTCGAACAATCCCTTATTACTTCATGCCAGAAAGAAAAACTCAGAATCAGAGCCAGTTATCAAACGAAACATCGTTGAGGAAGTGTCGGAGGATGAAGAAGACGATGAACTTTTCGACGAACTTGAAGACGATGAGATAATGGACGATGATGGCGAAGATTACTTTGAAGAAGAGTTTATGGAGGATAATGCTGAAGTCTATGTAGGGGATGGCGGGGAAGGAGGTGGAATTTCCCTTGCTGGGACATGGTGGGATAAAGAAGCACTGGCTATGGCTGAAGAGGTTATTCTTTCATTTCATGGCGACTTGAAGATTTATGCGTTCAAAACAGTCTCTAATTCCACTGTTCAAGTGCGTATTGAAAAGCTTTCTAACAAATCGGGTTCCCCCAGGATGGAAGATATCGAGGCTTTCTCTTCAAGATATCGAGCACGATTGGATGAAGCAGAGCTTTCTAAAACCGTGCCAGAGAACATATCTTTAGAGGTCTCATCTCCTGGTGTTGAACGTGTCGTTCGGATTCCCGACGAGCTGGATCGGTTCAAAGAAAGAGCAATGTATGTGAAATACACGAACGATGTAGTTACACCCAGTTCGTCCTCTGAGAGTGATGGCATTTTCAAGCTTGTGTCATTCGACATAGAAACAAGGTGCTGCATTTGGGGTTTAGCAGATGTGAGGATAAATAGAGAGAAGGCAGGAAAAGGAAGACCACTAAGCAAAAAACAAAGAGAGTGGCGTTTAGAGACTCCTTTCGATTCATTACGCTTGGTTAGACTGTATTCTGATTACTGA

Protein sequence

MVMITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKNSESEPVIKRNIVEEVSEDEEDDELFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPFDSLRLVRLYSDY
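The protein sequence above corresponds to the CDS read codon by codon. The following Python sketch illustrates that relationship on the first and last codons of the CDS only; it assumes the standard genetic code (NCBI translation table 1), and `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3].upper()]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 27 nt and last 12 nt of the CDS listed above.
print(translate("ATGGTCATGATTACTGTCCGTAATTTC"))  # MVMITVRNF
print(translate("TCTGATTACTGA"))                 # SDY
```

Both fragments agree with the start (MVMITVRNF) and end (SDY) of the listed protein sequence; the final TGA is the stop codon.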
BLAST of Cp4.1LG13g04110 vs. TrEMBL
Match: A0A0A0KDA0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G338150 PE=3 SV=1)

HSP 1 Score: 500.7 bits (1288), Expect = 1.2e-138
Identity = 261/311 (83.92%), Positives = 284/311 (91.32%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPIS-SILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKK 60
           M++IT  N  LSA+  S  ILP   FR S IS  NLPFPF+ HRFPSTSNN LLL ARK+
Sbjct: 1   MLLITAPNSALSALSSSLPILPHILFRCSSISPANLPFPFLDHRFPSTSNNSLLLRARKR 60

Query: 61  NSESEPVIKRNIVEEVSEDEEDDELFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGG 120
           NSES+PV+K+NIV+EVSEDEEDD LFDE E DEIM+DDGEDYFEEE+MEDNAEVY+GDGG
Sbjct: 61  NSESQPVLKQNIVQEVSEDEEDDVLFDEFEQDEIMEDDGEDYFEEEYMEDNAEVYLGDGG 120

Query: 121 EGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRM 180
           EGGGISLAGTWWDK+ALA+AEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLS KSGSP M
Sbjct: 121 EGGGISLAGTWWDKQALAIAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSTKSGSPNM 180

Query: 181 EDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN 240
           EDIEAFS+ YRARLD+AEL+K+VPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN
Sbjct: 181 EDIEAFSTTYRARLDDAELAKSVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN 240

Query: 241 DVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPF 300
           +VVT SSSSESDG+FKLVSFDIE +CC WG+ADV+INREKAGKGRPLSKKQREWRLETPF
Sbjct: 241 EVVTASSSSESDGVFKLVSFDIEAKCCTWGIADVKINREKAGKGRPLSKKQREWRLETPF 300

Query: 301 DSLRLVRLYSD 311
           DSLRLVRLYSD
Sbjct: 301 DSLRLVRLYSD 311
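The identity and positives percentages in each HSP header are the reported match counts divided by the alignment length. A small Python sketch of that arithmetic follows, using the counts from the HSP above; `percent` is a hypothetical helper, and the positives count itself depends on the scoring matrix BLAST used, so it is taken from the report rather than recomputed.

```python
def percent(matches, aligned_length):
    """Format matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"


# Counts reported for the A0A0A0KDA0_CUCSA hit (alignment length 311).
print(percent(261, 311))  # 83.92 (identity)
print(percent(284, 311))  # 91.32 (positives)
```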

BLAST of Cp4.1LG13g04110 vs. TrEMBL
Match: E0CQP0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g15560 PE=4 SV=1)

HSP 1 Score: 391.3 bits (1004), Expect = 1.1e-105
Identity = 206/314 (65.61%), Positives = 251/314 (79.94%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKN 60
           M +IT  N R SA+  S++  +S F +    + NL FPF  +  P  S      HA+K++
Sbjct: 1   MDLITTWNTRASAISFSALRSRSSFHNPSRQNHNLLFPFWPYPLPRISYKSFTAHAKKRS 60

Query: 61  SESEPVIKRNIVEEVS-EDEEDDELFDELEDDEIMDDDGE---DYFEEEFMEDNAEVYVG 120
           S+S+P++K+ IVE++S   ++DD L D+ ED+ +MDDD +   + +E+E++ D+AEVYVG
Sbjct: 61  SQSQPLVKQTIVEQISTSQQQDDLLLDDFEDEALMDDDDDNDDEDWEDEYLADDAEVYVG 120

Query: 121 DGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGS 180
           DGGEGGGISLAGTWWDKEAL MAEEV +SF GDLKIYAFKT++NST+QVRIEKLSNKSGS
Sbjct: 121 DGGEGGGISLAGTWWDKEALLMAEEVSMSFEGDLKIYAFKTLANSTIQVRIEKLSNKSGS 180

Query: 181 PRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVK 240
           P M DIEAFSS YRA+LDEAE++ +VPEN+SLEVSSPGVERVV+IP ELDRFKER MYVK
Sbjct: 181 PSMTDIEAFSSIYRAKLDEAEIAGSVPENLSLEVSSPGVERVVQIPQELDRFKERPMYVK 240

Query: 241 YTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLE 300
           Y  + V P S+ ESDGIF+LVSFD+ET CC WGLADVRINR KAGKGRPLSKKQREWRL 
Sbjct: 241 YVTEGVAPGSTIESDGIFRLVSFDLETNCCTWGLADVRINRAKAGKGRPLSKKQREWRLN 300

Query: 301 TPFDSLRLVRLYSD 311
           TPFDSL LVRLYS+
Sbjct: 301 TPFDSLCLVRLYSE 314

BLAST of Cp4.1LG13g04110 vs. TrEMBL
Match: A0A067L3N7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25228 PE=3 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 2.2e-103
Identity = 203/305 (66.56%), Positives = 244/305 (80.00%), Query Frame = 1

Query: 11  LSAVPISSILP--QSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKNSESEPVIK 70
           L  VPI S +P  +S      +S+ N PFP +   FP+       L A+K+NS+SEPV+K
Sbjct: 3   LIKVPIISTIPPTKSFLHRPSLSNHNFPFPILAQPFPAIQIKSYPLQAKKRNSQSEPVLK 62

Query: 71  RNIVEEVSEDEEDDE---LFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGGEGGGIS 130
             I+EEVSE++ED+E     DELED+ +MD + ED  E+EF+ED AE+YVGDG  GGGI+
Sbjct: 63  PTIIEEVSEEDEDEEEQLFLDELEDEALMDTEDED-LEDEFLEDEAELYVGDGTAGGGIA 122

Query: 131 LAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRMEDIEAF 190
           LAGTWWDKEAL +AEEV  SF G+LKIYAFKT+SN T+QVRIE+L+NKSGSP MEDIEAF
Sbjct: 123 LAGTWWDKEALRIAEEVCESFDGELKIYAFKTLSNLTIQVRIERLTNKSGSPNMEDIEAF 182

Query: 191 SSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTNDVVTPS 250
           S+ YR+RLDEAE++KT+  NI+LEVSSPGVERVVRIP+ELDRF +R MYVKY +D  T  
Sbjct: 183 STTYRSRLDEAEVAKTIANNIALEVSSPGVERVVRIPEELDRFNDRPMYVKYVSDAATLD 242

Query: 251 SSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPFDSLRLV 310
           SSSESDGIF+L+SFD+ET+CC WGLADVRINREKAGKGRPLSKKQREWRL TPF SL LV
Sbjct: 243 SSSESDGIFRLISFDMETKCCTWGLADVRINREKAGKGRPLSKKQREWRLNTPFHSLLLV 302

BLAST of Cp4.1LG13g04110 vs. TrEMBL
Match: A0A061FH06_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_035449 PE=3 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 2.2e-103
Identity = 206/323 (63.78%), Positives = 255/323 (78.95%), Query Frame = 1

Query: 2   VMITVRNFRLSAVPISSILPQS-------GFRSSLISSRNLPFPFVGH---RFPSTSNNP 61
           ++++  NF+  AVPIS++LP          F     S+    FPF  +   R P+ S + 
Sbjct: 3   LLVSAWNFKHLAVPISALLPSPTAITCSCSFYKPPGSANKFSFPFWTYPFARIPNKSPSS 62

Query: 62  LLLHARKKNSESEPVIKRNIVEEVSEDEEDDE----LFDELEDDEIMDDDGEDYFEEEFM 121
           + +HARKKNS+SEP++K  IVEEVS D+ED E    LFD+ EDDE M D+ +DYFEEE++
Sbjct: 63  IAIHARKKNSKSEPLLKPTIVEEVSMDDEDKEEEQILFDDSEDDESMIDNDDDYFEEEYL 122

Query: 122 EDNAEVYVGDGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRI 181
           ++  E+YVGDG  GGGISLAGTWWDKEALA+A++V LSF+GDL IYAFKT+SNS++QVRI
Sbjct: 123 DNETELYVGDGAGGGGISLAGTWWDKEALALAQDVCLSFNGDLGIYAFKTLSNSSIQVRI 182

Query: 182 EKLSNKSGSPRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDR 241
           E+L+NKSGSP MED+EAFS  YRARLDEAEL+++VP+NI+LEVSSPGVERVVR+P +LDR
Sbjct: 183 ERLTNKSGSPSMEDVEAFSVSYRARLDEAELARSVPQNITLEVSSPGVERVVRMPQDLDR 242

Query: 242 FKERAMYVKYTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLS 301
           FK+R MYVKY   V    S SE DG+F+LV+FD+ET+CC WGLADVRINREKAGKGRPLS
Sbjct: 243 FKDRPMYVKYVT-VAESGSLSEGDGVFRLVTFDMETKCCTWGLADVRINREKAGKGRPLS 302

Query: 302 KKQREWRLETPFDSLRLVRLYSD 311
           KKQREW LET FDSLRLVRLYS+
Sbjct: 303 KKQREWCLETTFDSLRLVRLYSE 324

BLAST of Cp4.1LG13g04110 vs. TrEMBL
Match: M5W4C9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008923mg PE=3 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 1.4e-102
Identity = 206/324 (63.58%), Positives = 252/324 (77.78%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPISSILPQSGF----RSSLISSRNL-------PFPFVGHRFPSTSN 60
           M + T  N R SAV +S I+ +       R SL     L       PFPF+ ++FP+   
Sbjct: 1   MDLTTTWNLRASAVSLSPIVTRRSRTHLQRPSLSPPPKLSCQFWAYPFPFIPNKFPA--- 60

Query: 61  NPLLLHARKKNSESEPVIKRNIVEEVSEDEEDDE--LFDELEDDEI-MDDDGEDYFEEEF 120
               LHAR +NSE EP++K  I+EEVSED++DD+  + D+ E+DE+ MDD+G+DY+EEE 
Sbjct: 61  ----LHARNRNSEPEPLLKPTIIEEVSEDDDDDDDVILDDFEEDEVSMDDEGDDYYEEE- 120

Query: 121 MEDNAEVYVGDGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVR 180
              +AE+Y GDGG GGGISLAG WWD +AL +AEEVILSF GDLKIYAFKT+ N T+QVR
Sbjct: 121 ---SAELYAGDGGGGGGISLAGVWWDNKALEIAEEVILSFDGDLKIYAFKTLPNFTIQVR 180

Query: 181 IEKLSNKSGSPRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELD 240
           IEKLSN+SGSP MEDIEAFS  YRARLDEAEL+K++PEN+SLEVSSPGVER+VR+P ELD
Sbjct: 181 IEKLSNRSGSPSMEDIEAFSRTYRARLDEAELAKSLPENLSLEVSSPGVERIVRVPHELD 240

Query: 241 RFKERAMYVKYTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPL 300
           RFK+R MYVKY ++       SE+DG+F+LVSFD+ET+CC+WGLADVRINREKAGKGRPL
Sbjct: 241 RFKDRPMYVKYFSETAETGVISENDGVFRLVSFDVETKCCVWGLADVRINREKAGKGRPL 300

Query: 301 SKKQREWRLETPFDSLRLVRLYSD 311
           SKKQR+W+L TPFDSLRLVRLYSD
Sbjct: 301 SKKQRQWQLNTPFDSLRLVRLYSD 313

BLAST of Cp4.1LG13g04110 vs. TAIR10
Match: AT1G77122.1 (AT1G77122.1 Uncharacterised protein family UPF0090)

HSP 1 Score: 311.2 bits (796), Expect = 7.1e-85
Identity = 181/314 (57.64%), Positives = 227/314 (72.29%), Query Frame = 1

Query: 11  LSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKNSES--EPV-- 70
           L A  IS++   + FRS+     +L F F    F  T N     HA++KN  S  EP   
Sbjct: 19  LPAFSISNLSAVNRFRSTT----SLRFGFPATPFRRTPNFTFKTHAKRKNKTSTFEPKPN 78

Query: 71  -IKRNIVEEVSEDEEDDELF---------DELEDDEIMDDDGEDYFEEEFMEDNAEVYVG 130
            ++  I+EE  E+EE++E+          DEL  D+  D+D +D FE  F E   E+Y G
Sbjct: 79  KVEELIIEEEEEEEEEEEIVLPEEIQENQDELLLDDEYDEDDDDDFE--FDESEEELYAG 138

Query: 131 DGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGS 190
           DGG GGGI LAGT WDK ALA+A +V  SF GDL IYAFKT+ NST+QVRIE+L+NK GS
Sbjct: 139 DGGGGGGIKLAGTLWDKVALALAVKVCESFDGDLGIYAFKTLPNSTIQVRIERLTNKFGS 198

Query: 191 PRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVK 250
           P MEDIEAFS+ YRA+L EAEL+K++P+NISLEVSSPGVERVVRIP +LDR+K+R MYV+
Sbjct: 199 PTMEDIEAFSTIYRAKLAEAELAKSIPDNISLEVSSPGVERVVRIPQDLDRYKDRPMYVR 258

Query: 251 YTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLE 310
           YTN+     + +E DGIF+LVSFD+E + CIWG+AD+R+NREKAGKGRPLSKKQREWRLE
Sbjct: 259 YTNE----DTETEGDGIFRLVSFDVEAKICIWGIADIRVNREKAGKGRPLSKKQREWRLE 318

BLAST of Cp4.1LG13g04110 vs. TAIR10
Match: AT1G69210.1 (AT1G69210.1 Uncharacterised protein family UPF0090)

HSP 1 Score: 178.7 bits (452), Expect = 5.5e-45
Identity = 115/307 (37.46%), Positives = 175/307 (57.00%), Query Frame = 1

Query: 4   ITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKNSES 63
           I+VR F  S      +LP++  R +L   R  PF      F S+S++P       K + S
Sbjct: 26  ISVRTFSSSL-----LLPKTTTRLTL--PRYFPFSSSISTFSSSSSSPSPSARPPKTAGS 85

Query: 64  EPVIKRNIVEEVSEDEEDDELFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGGEGGG 123
                       + DEED   +   +D E+++D  ED       +++ E  +GDGG+GGG
Sbjct: 86  ------------NGDEEDTFEYKATDDVEVIEDWEEDE------DEDVESQLGDGGDGGG 145

Query: 124 ISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRMEDIE 183
           I L G  W +  L++A +V+     DL+++AFKT     + VR++KLS + G P M+++E
Sbjct: 146 IVLKGVAWGERVLSIAAQVLKQSEKDLELFAFKTSPRGYIYVRLDKLSTEYGCPTMDELE 205

Query: 184 AFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTNDVVT 243
            FS  ++ RLD+A   K +PE+++LEVSSPG ER++R+P++L RFK+  M V Y  +  T
Sbjct: 206 EFSREFKKRLDDAGAEKVIPEDLALEVSSPGAERLLRVPEDLPRFKDMPMTVSYVEE--T 265

Query: 244 PSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPFDSLR 303
            S  +   G+F L S D E+  C+W LADVR NR+   KGRPLS+KQ++ R+  PF   +
Sbjct: 266 NSRKAVKSGVFLLESIDAESDNCVWKLADVRENRDPESKGRPLSRKQKDLRITLPFADHK 305

Query: 304 LVRLYSD 311
            + LY D
Sbjct: 326 KINLYLD 305

BLAST of Cp4.1LG13g04110 vs. NCBI nr
Match: gi|659081884|ref|XP_008441560.1| (PREDICTED: uncharacterized protein LOC103485652 isoform X3 [Cucumis melo])

HSP 1 Score: 505.0 bits (1299), Expect = 9.5e-140
Identity = 263/310 (84.84%), Positives = 281/310 (90.65%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKN 60
           M+ IT  N  LSAV     LP S FR S IS  NLPFPF+ HRFPSTSNN LLLHARK+N
Sbjct: 1   MLFITAPNSALSAVSTPPSLPHSVFRWSSISPTNLPFPFLDHRFPSTSNNSLLLHARKRN 60

Query: 61  SESEPVIKRNIVEEVSEDEEDDELFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGGE 120
           SES+PV+K NIV+EVSEDEEDD L DE E+DEIM+DDGEDYFEEE+MEDNAEVY GDGGE
Sbjct: 61  SESQPVLKPNIVQEVSEDEEDDVLVDEFEEDEIMEDDGEDYFEEEYMEDNAEVYEGDGGE 120

Query: 121 GGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRME 180
           GGGISLAG WWDK+ALA+AEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSP ME
Sbjct: 121 GGGISLAGIWWDKQALAIAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPSME 180

Query: 181 DIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTND 240
           DIEAFS+ YRARLDEAEL+K+VPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTND
Sbjct: 181 DIEAFSTTYRARLDEAELAKSVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTND 240

Query: 241 VVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPFD 300
           VVT SSSSESDG+FKLVSFDIE +CC WG+ADV+INREKAGKGRPLSKKQREWRLETPFD
Sbjct: 241 VVTASSSSESDGVFKLVSFDIEAKCCTWGIADVKINREKAGKGRPLSKKQREWRLETPFD 300

Query: 301 SLRLVRLYSD 311
           SLRLVRLYSD
Sbjct: 301 SLRLVRLYSD 310

BLAST of Cp4.1LG13g04110 vs. NCBI nr
Match: gi|449462174|ref|XP_004148816.1| (PREDICTED: uncharacterized protein LOC101204078 [Cucumis sativus])

HSP 1 Score: 500.7 bits (1288), Expect = 1.8e-138
Identity = 261/311 (83.92%), Positives = 284/311 (91.32%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPIS-SILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKK 60
           M++IT  N  LSA+  S  ILP   FR S IS  NLPFPF+ HRFPSTSNN LLL ARK+
Sbjct: 1   MLLITAPNSALSALSSSLPILPHILFRCSSISPANLPFPFLDHRFPSTSNNSLLLRARKR 60

Query: 61  NSESEPVIKRNIVEEVSEDEEDDELFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGG 120
           NSES+PV+K+NIV+EVSEDEEDD LFDE E DEIM+DDGEDYFEEE+MEDNAEVY+GDGG
Sbjct: 61  NSESQPVLKQNIVQEVSEDEEDDVLFDEFEQDEIMEDDGEDYFEEEYMEDNAEVYLGDGG 120

Query: 121 EGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRM 180
           EGGGISLAGTWWDK+ALA+AEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLS KSGSP M
Sbjct: 121 EGGGISLAGTWWDKQALAIAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSTKSGSPNM 180

Query: 181 EDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN 240
           EDIEAFS+ YRARLD+AEL+K+VPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN
Sbjct: 181 EDIEAFSTTYRARLDDAELAKSVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTN 240

Query: 241 DVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPF 300
           +VVT SSSSESDG+FKLVSFDIE +CC WG+ADV+INREKAGKGRPLSKKQREWRLETPF
Sbjct: 241 EVVTASSSSESDGVFKLVSFDIEAKCCTWGIADVKINREKAGKGRPLSKKQREWRLETPF 300

Query: 301 DSLRLVRLYSD 311
           DSLRLVRLYSD
Sbjct: 301 DSLRLVRLYSD 311

BLAST of Cp4.1LG13g04110 vs. NCBI nr
Match: gi|659081880|ref|XP_008441558.1| (PREDICTED: uncharacterized protein LOC103485652 isoform X1 [Cucumis melo])

HSP 1 Score: 498.0 bits (1281), Expect = 1.2e-137
Identity = 263/317 (82.97%), Positives = 281/317 (88.64%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKN 60
           M+ IT  N  LSAV     LP S FR S IS  NLPFPF+ HRFPSTSNN LLLHARK+N
Sbjct: 1   MLFITAPNSALSAVSTPPSLPHSVFRWSSISPTNLPFPFLDHRFPSTSNNSLLLHARKRN 60

Query: 61  SESEPVIKRNIVEEVSEDEEDDELFDELED-------DEIMDDDGEDYFEEEFMEDNAEV 120
           SES+PV+K NIV+EVSEDEEDD L DE E+       DEIM+DDGEDYFEEE+MEDNAEV
Sbjct: 61  SESQPVLKPNIVQEVSEDEEDDVLVDEFEEGHVLDSVDEIMEDDGEDYFEEEYMEDNAEV 120

Query: 121 YVGDGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNK 180
           Y GDGGEGGGISLAG WWDK+ALA+AEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNK
Sbjct: 121 YEGDGGEGGGISLAGIWWDKQALAIAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNK 180

Query: 181 SGSPRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAM 240
           SGSP MEDIEAFS+ YRARLDEAEL+K+VPENISLEVSSPGVERVVRIPDELDRFKERAM
Sbjct: 181 SGSPSMEDIEAFSTTYRARLDEAELAKSVPENISLEVSSPGVERVVRIPDELDRFKERAM 240

Query: 241 YVKYTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREW 300
           YVKYTNDVVT SSSSESDG+FKLVSFDIE +CC WG+ADV+INREKAGKGRPLSKKQREW
Sbjct: 241 YVKYTNDVVTASSSSESDGVFKLVSFDIEAKCCTWGIADVKINREKAGKGRPLSKKQREW 300

Query: 301 RLETPFDSLRLVRLYSD 311
           RLETPFDSLRLVRLYSD
Sbjct: 301 RLETPFDSLRLVRLYSD 317

BLAST of Cp4.1LG13g04110 vs. NCBI nr
Match: gi|225457765|ref|XP_002264003.1| (PREDICTED: uncharacterized protein LOC100266148 [Vitis vinifera])

HSP 1 Score: 391.3 bits (1004), Expect = 1.5e-105
Identity = 206/314 (65.61%), Positives = 251/314 (79.94%), Query Frame = 1

Query: 1   MVMITVRNFRLSAVPISSILPQSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKN 60
           M +IT  N R SA+  S++  +S F +    + NL FPF  +  P  S      HA+K++
Sbjct: 1   MDLITTWNTRASAISFSALRSRSSFHNPSRQNHNLLFPFWPYPLPRISYKSFTAHAKKRS 60

Query: 61  SESEPVIKRNIVEEVS-EDEEDDELFDELEDDEIMDDDGE---DYFEEEFMEDNAEVYVG 120
           S+S+P++K+ IVE++S   ++DD L D+ ED+ +MDDD +   + +E+E++ D+AEVYVG
Sbjct: 61  SQSQPLVKQTIVEQISTSQQQDDLLLDDFEDEALMDDDDDNDDEDWEDEYLADDAEVYVG 120

Query: 121 DGGEGGGISLAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGS 180
           DGGEGGGISLAGTWWDKEAL MAEEV +SF GDLKIYAFKT++NST+QVRIEKLSNKSGS
Sbjct: 121 DGGEGGGISLAGTWWDKEALLMAEEVSMSFEGDLKIYAFKTLANSTIQVRIEKLSNKSGS 180

Query: 181 PRMEDIEAFSSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVK 240
           P M DIEAFSS YRA+LDEAE++ +VPEN+SLEVSSPGVERVV+IP ELDRFKER MYVK
Sbjct: 181 PSMTDIEAFSSIYRAKLDEAEIAGSVPENLSLEVSSPGVERVVQIPQELDRFKERPMYVK 240

Query: 241 YTNDVVTPSSSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLE 300
           Y  + V P S+ ESDGIF+LVSFD+ET CC WGLADVRINR KAGKGRPLSKKQREWRL 
Sbjct: 241 YVTEGVAPGSTIESDGIFRLVSFDLETNCCTWGLADVRINRAKAGKGRPLSKKQREWRLN 300

Query: 301 TPFDSLRLVRLYSD 311
           TPFDSL LVRLYS+
Sbjct: 301 TPFDSLCLVRLYSE 314

BLAST of Cp4.1LG13g04110 vs. NCBI nr
Match: gi|802559243|ref|XP_012066077.1| (PREDICTED: uncharacterized protein LOC105629153 [Jatropha curcas])

HSP 1 Score: 383.6 bits (984), Expect = 3.2e-103
Identity = 203/305 (66.56%), Positives = 244/305 (80.00%), Query Frame = 1

Query: 11  LSAVPISSILP--QSGFRSSLISSRNLPFPFVGHRFPSTSNNPLLLHARKKNSESEPVIK 70
           L  VPI S +P  +S      +S+ N PFP +   FP+       L A+K+NS+SEPV+K
Sbjct: 3   LIKVPIISTIPPTKSFLHRPSLSNHNFPFPILAQPFPAIQIKSYPLQAKKRNSQSEPVLK 62

Query: 71  RNIVEEVSEDEEDDE---LFDELEDDEIMDDDGEDYFEEEFMEDNAEVYVGDGGEGGGIS 130
             I+EEVSE++ED+E     DELED+ +MD + ED  E+EF+ED AE+YVGDG  GGGI+
Sbjct: 63  PTIIEEVSEEDEDEEEQLFLDELEDEALMDTEDED-LEDEFLEDEAELYVGDGTAGGGIA 122

Query: 131 LAGTWWDKEALAMAEEVILSFHGDLKIYAFKTVSNSTVQVRIEKLSNKSGSPRMEDIEAF 190
           LAGTWWDKEAL +AEEV  SF G+LKIYAFKT+SN T+QVRIE+L+NKSGSP MEDIEAF
Sbjct: 123 LAGTWWDKEALRIAEEVCESFDGELKIYAFKTLSNLTIQVRIERLTNKSGSPNMEDIEAF 182

Query: 191 SSRYRARLDEAELSKTVPENISLEVSSPGVERVVRIPDELDRFKERAMYVKYTNDVVTPS 250
           S+ YR+RLDEAE++KT+  NI+LEVSSPGVERVVRIP+ELDRF +R MYVKY +D  T  
Sbjct: 183 STTYRSRLDEAEVAKTIANNIALEVSSPGVERVVRIPEELDRFNDRPMYVKYVSDAATLD 242

Query: 251 SSSESDGIFKLVSFDIETRCCIWGLADVRINREKAGKGRPLSKKQREWRLETPFDSLRLV 310
           SSSESDGIF+L+SFD+ET+CC WGLADVRINREKAGKGRPLSKKQREWRL TPF SL LV
Sbjct: 243 SSSESDGIFRLISFDMETKCCTWGLADVRINREKAGKGRPLSKKQREWRLNTPFHSLLLV 302

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KDA0_CUCSA  1.2e-138  83.92         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G338150 PE=3 SV=1
E0CQP0_VITVI      1.1e-105  65.61         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g15560 PE=4 SV=...
A0A067L3N7_JATCU  2.2e-103  66.56         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25228 PE=3 SV=1
A0A061FH06_THECC  2.2e-103  63.78         Uncharacterized protein OS=Theobroma cacao GN=TCM_035449 PE=3 SV=1
M5W4C9_PRUPE      1.4e-102  63.58         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008923mg PE=3 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT1G77122.1  7.1e-85  57.64         Uncharacterised protein family UPF0090
AT1G69210.1  5.5e-45  37.46         Uncharacterised protein family UPF0090

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659081884|ref|XP_008441560.1|  9.5e-140  84.84         PREDICTED: uncharacterized protein LOC103485652 isoform X3 [Cucumis melo]
gi|449462174|ref|XP_004148816.1|  1.8e-138  83.92         PREDICTED: uncharacterized protein LOC101204078 [Cucumis sativus]
gi|659081880|ref|XP_008441558.1|  1.2e-137  82.97         PREDICTED: uncharacterized protein LOC103485652 isoform X1 [Cucumis melo]
gi|225457765|ref|XP_002264003.1|  1.5e-105  65.61         PREDICTED: uncharacterized protein LOC100266148 [Vitis vinifera]
gi|802559243|ref|XP_012066077.1|  3.2e-103  66.56         PREDICTED: uncharacterized protein LOC105629153 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0042274  ribosomal small subunit biogenesis

Vocabulary: INTERPRO
Term       Definition
IPR003728  Ribosome_maturation_RimP

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0042274      ribosomal small subunit biogenesis
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004185      serine-type carboxypeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG13g04110.1  Cp4.1LG13g04110.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term    Source Description   Alignment
IPR003728  Ribosome maturation factor RimP  HAMAP    MF_01077       RimP                 coord: 133..311; score: 11
IPR003728  Ribosome maturation factor RimP  PFAM     PF02576        DUF150               coord: 155..241; score: 9.8
None       No IPR available                 PANTHER  PTHR34544      FAMILY NOT NAMED     coord: 1..311; score: 4.5E
None       No IPR available                 PANTHER  PTHR34544:SF2  SUBFAMILY NOT NAMED  coord: 1..311; score: 4.5E