Cp4.1LG03g04420 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG03g04420
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCp4.1LG03: 5565405 .. 5573908 (-)
RNA-Seq ExpressionCp4.1LG03g04420
SyntenyCp4.1LG03g04420
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACCTGTCTGTGATCGATCTGGCCCCGTATCTTACGGCCTCGTCGGAACTCGCCAGCGGCTCTCCGATCGACTTTGGGCCCCAACTCACTGCTCTGTGCAAGGAAGTCAGTCGGACTCTGAAAGAGACCGGCGCACTCCTGGTTAAGGATCCGAGATGCTCTGCCGAAGATAACGATCGATTTATTGATATGATGGAAAGCTTCTTTGAGAAGCCAACTGAGTTCAAGCGTCTGCACGCAAGGCCTAATTTACATTACCAGGTAAATTTGCAATTGTATTTGCGTATGAGATGTAGTTTGATCACTGGGAATGTGTAGGTCGATTGGAAGCTAGATTTTCCGTTCTTCTTGGAAAGTGTGGGTGATTTTTGTTTCTGTTTTGGATTCTTTAAAAGGATTTGAGACGAGTTTTATAAAATTTCTGCGGATTAGAAAATATGAATTCATTAATTGGTGTACAGTACTTCTTTTCAGTGTCAATCTCCCGCAACTTTGTTATTTTTCAAAGTGAAAAAAGATCGATTGATGCNTTATTGATATGATGGAAAGCTTCTTTGAGAAGCCAACTGAGTTCAAGCGTCTGCACGCAAGGCCTAATTTACATTACCAGGTAAATTTGCAATTGTATTTGCGTATGAGATGTAGTTTGATCACTGGGAATGTGTAGGTCGATTGGAAGCTAGATTTTCCGTTCTTCTTGGAAAGTGTGGGTGATTTTTGTTTCTGTTTTGGATTCTTTAAAAGGATTTGAGACGAGTTTTATAAAATTTCTGCGGATTAGAAAATATGAATTCATTAATTGGTGTACAGTACTTCTTTTCAGTGTCAATCTCCCGCAACTTTGTTATTTTTCAAAGTGAAAAAAGATCGATTGATGCTTCACCACCCACAATAATTGACTTCTCACTGTGAAGAATTGAATAAAAGAACTTAATACTTGAAAAAAAGATANACACTGTGAAGAATTGAATAAAAAAACTTAATACTCGAAAAAAAGATAAGAGAAGAAAGGAGGAAAGTGGGAAATAAAGTAAATGTGAACAGGAAAGAATAGAGAACAACTTCTCGAATATGTCCTTGCTTGATGTCATCAGTTTGAACAATTCCAGATGTTGAAAGTTAACTATGTGATGAAAATTAATGCTGTCAGAAGGAAAACAAGCTTCTTAAATGAAGAAATCGATCTTTAGGTTCTTTCGTGACTGAGCTAAAGTATCAATTTTGGTTCTGATTCTAAACTTTCAAATGGAGACGCCAGGTTGGGGTGACGCCAGAAGGTGTGGAAGTCCCAAAAAGCCTGGTCGATGAAGAAATGCAGGAAAAGATAAGAGCAATGCCCAAGGAATTCCAACCATTGATTCCCAGGGGGCCGGATCCCAAGTGGCGATACATGTGGAGAGTGGGTCCTCGCCCTTCAAACACCCGATTTAAGGTATTGTCATCTCTTTTACCCTTTCTTTTCTTGTTAGTTGTGCATTTCCCCTTGGATGTTTTATTCCTGTTGTGCAGGAACTCAATGCTGAGCCGGTTATCCCTGAGGGGTTTCCTGAATGGAAGGATACCATGGATTCTTGGGGTTTCAAAATGATATCAGCCATAGAGGTAAGTAATTGGATGAGGTTTCTCCAATTTACTAGAATGCTTTTGTGCATATTATTTAAAATTTTCTTATCTCAGGCTGTTGCTGAGATGGCAGCAATTGGGTTTGGCTTGCCCAAGGATGCATTCACTTCTTTGATGAAGCAGGTATGGTCTTTTTTGAGTATGACACGTTCTATCAGTAATCTGTTGCTATCTTTTAGATCTCCTTCCCGCCGTTATTCTCTTATTATTAAACATTTATATTTTATATTAGATAAATGGAATCTGTTCACTCTTTAATCATGTTTTTATTTAAGAAAGAGGAGTGTTGTTCTGATTAGAAAGAATGATCCTACAAATACTTTTTTTATCTGATAAAATATGAAAGGAGCGTTGTAATTTATACGTGGCTTCTAGTTATCGTGAGTTTCTCCAATTGCAATCAAGAAGCAAGGGTACTTCTGTAGGTTGCGGCAATTTCTTCTAAAGGAATTTCTTTACACAGGGTGTTAGACGGTTAGCTCATCCAGTTCTTTGTTACTGAAGGCACGTTGACACTGGTGCCAAGTATTACTAGATCTGGTTCAATAAAGAATTTAGAGTACTTCTGTAATGACCCCTCGAATCACTGGTCGCCACGTACACTCAAGGGTTAAAAGAGTAGTATTTATTTTTTTTTTTTTTTTTTTAAGCGGCATCGCGAGCAGAGACTGTGGCACGTGCATAGGGTAGATTCATATTTAAATTTCTAGTCTTAAGTTAGAATAGGTTGTTTTGCATTTCATTGTTTTAAATTATTTTGTATTTGTTTTTGTTTTCTCTAAATTTCAGTATTTCATATTTAAACACGTAAGACCATGGCTGTGTTTTCCAGAGAAGAAGTTGTTTTAAACGTTTTTTGAAGTTTACGTTATCAATTATGTTCATGTAAGAGTTATATTTCCGCATTATAGATGAGCATGAGACGTTAGTAGCGACTCTGAGTATGTGGAAATTTAGGGTCGTTACACGACACGTGTCATCGCCTCGCTGGAGCTGCGTTGCTCCAGCCGGCGGTCGACGGTGTCTGGCACCTATCGCCGCTTGTCGACCGACCCTTTTTTTTTCTTCTCTATCCTCCGAAATCAGTAACCCAGTCCACTTTTTTGATTTTCATATCTTTCGATCCGTAGATCGGATCGATATGATTTCTTCGGCAAAGTTCTAGATTGTGTTACGGGCTACGTCGTAGTATTTTTTTTTTTTTTTTTTTTTGTAAAACGGGAGACACAAGAAATTTAAAATTTAAATATGTATTCAAGATTGACATAAAAGTAGTTTAAAGGTAATATCCGCGGAGTCAAATCAAAACGGTTTACAAAAAATACATAAACATTTAAAATAATAATAAAATGACAAGAAGGAAAACAACTCAATCTAAAGGCCCCTTTTGCCGTTCGCACGGCTCCACACGCTCCTGCCATCAACAATGGTCTACACTCCATCTAAAAAAAATAGAATAATGTAGAATGAGTATCAAAATACTCAATAAGTAACCTACTTATAGGCTCTTGTTGGACTTAATCCTATACACGAAAACTTAGGCTATGTGCTCGTATCTATCCCTAGCGGTAGTCAAAGTGTGGCTTAACATATGTACCAAACTTTGTACGTCATCATGTAACTTCAGCATATCTCCTTTAAGGAGTACTCAATCCCTCCTAGGTTCACAACCAAGGGGTTTTTTGTTTTCTCTTGGCTCTAGGTTTCACGCATGCCTAGGCCCTTTGTCAGTCTTGCGCAAAGTGCACCTCGAGTCTTGGCCCATGAAAAGGCCCAGGGTAGTACTCATAAGTCTGGAGGAGCTCTTTTGCTCTATAATGATGACCGTACTCCATGCATGTCTACCTACTGAGCTCTAATCGGTGAGTTCTCCCTAGTCGTGTCGGGCAACATCTACATGTTTCTAAGATCAGTAAACTACTTGTCACTATATCTTTCCAGGCTAGAATAGTAAACTACCTGCTCATGTACCCTCCTTGAGTTAGAACAGAAAAAGTACCTGTCTTAGGAACAGTAAAACTACATGTTACTTTCCCTTTCCAGGCTGGAACAGTAAACTGCCTACTCATGTAACCTCATCGAGTTAGAATAGAAAAAGTACCTATCTTAGGAACAGTAAAACTACATGTTACTTCACCTTTTTCAAGCTAGGACAGTAAACTATCTGTCACTATGCCACTTGGGCCGGATCAGTAAAAAAGTCATTGCTCTCCCCACTAGACACCACTGATAGTACTTCAGTAGTATATTCTCTATGACCCATGAAACTCCTCTATGGGTAACAACTAAGTCATGTGTAACAAGTAGACCCTAACCCTTGTTGGTTAGTTCACGAATACGGGTTGCACCCCATCTGTTCCTACAAGGTACCGGGTCAACCCAGGAGGATATAGAATTCATGACTATGTACTAGGATCATAATATGCGAAACAATAACTCATAGTCTATGAAAATCACAACTAGTACTCTACGGCACATAGTGATAGACCAACTCATAACTGGGAAGTAACAGGCTATCCAAGCCCTAAATCATGCATAATAAACGTATAAAGGCATATAACATGTTTATCATGCCCATTAAGTGATAATAATCCAACCTCAATATACTGGAAAGCATAAAAGTCTATTGATCTATTAGAGTTCACAAATCATGCATGCGAAGCTATATAACTATAAATAGAACTAAGTTGTAAACTGTTCACAACATTATCTAAGCACAATTATCATAAGACTAGGTCAAATTAGGCTCCTAAACATAATACTCAATCATTGTCCTCAATCCTAAACGACTCCTTATAAGTATTTTTTAGAAACTTGCTCCATATGGTTACTTACTTGATTGTAGACTTCTTAAGCTTGTTTCGGGTCTTCAAGAATACTCCAAATTTCTCCAAACTTCCTCAATATGGCATAAATTAAGTCAAAAGAATAAAACTGATGAAAATGGCTTGCTTCAAATTCGATAAAAAAAAAAAAAGGAATGTAGGTCGAGAAAATAATTGTTGACTATATGGTCATCTTCTTCAACTTGCGTGCGTATCATTCTGCTTTCCATCCTTCTGTTTTCTTTTTCTTTTTCTTTTGGATCTTAAAATTCCAGGTGTTACAACTTCCACCATCTAACTTTGTACTTAAGAACGTCTTTACTGTTGAAACACCTGCCTAACCTCCTCCAATATCTGGTGTTTCTTGGGTTTTCTTTTTTTCATATTATCTTACTAGGCAGGAGTTAGACACCTTTTTTATTTCCTTAATGAAAAAATCTTCTTTGTTTCATGAGGATCCTTGGCTGTAGACAGTAGGAATATTTTCCTGGATATTTTATTATTTTTCTGGATGTTGGTTATAAATGCTATCATGCATCTTCTTTTTAGTTTGGTTTGGGTGTGCCAGAGAAATGTGTCGGAGGAATAACCTTAATTTGAGGAATATGGTGTAATCCCATTTCATAAAAAAATTAGCCCATTTTCTGTTTCATTTTTCTGTATTAGTTAGCTTTACTTTTCTTTGTTGTTTTCCATTCCCGAGAGATTCAGTTAGTTTTTTTTTTGGGTATTATAAATAGGCTTACGCATTCCTCTGATGAATAATATAAGGAGAGTAATATTACTCTGGGAACAACGCTTGTCTCTGTTAGGATTTGATTTGATTACAGATTTAGTCATGTCTGATTTGATTATGTAACGATATATAATAGCCAATTTTCTTTTTCAATAGAATGAGAGATTTCTCCGTCCATTCCTAACAGTCTTCCTTAATAATCATACCCACTAGTGACTCCATTTCATCTAAAAAGTTCTTTTCTTGAGAGCTCAGGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGGTAATATAGTCATGGTGAAATATTAGGACCTATAACCGATGTTGTAGTTTACAGGTTTGTGACTCTGTTGATATACTATCTACCATTACTAAGACTAATTGTTATAGCAAATGCACTGTTGACTTTCAAAGAAGAAAGTTTTTAGTCAAAAGCTTGTTGGTTCTTTCTCAATGGTTAATACTCAATCAGCTAGTGTGTTAAAATTGACATTATGCAGCCTGAATAATCTAGCTTTTATCACAGAGTTGGACGGCTTCATTTATTACGGAGAATAATAGGACATAACATGATAAAAAGAGAGTATCATGTCATTTACTTGACTGAATTCTGATAAAATCTGAATTATGATTAATCTTCTGAGAGAAGTAAGTTGAATTTTGAGAATGGTTTCATTGGAGGAGCATGCGCTGTTTTCTTATTACTAAATGTTTTAGAAATGTGCAATTTCCAGCAAAGATATGTACTCTTTAACTCCATTACGCTTGAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTGTAAGTGAATCGGGTTTCATTATCATTGTTTCTTTTATATAGTGAGCAAACTGAACTTAAATTCATACTTATTCTGGAGGATCTAAGTATTTGTTTCATTGGATGTAGTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGTGATTAATCTGAAAGGACAAAAAGGAGAGCCTTTATAATCAAACTTCGTATCATTGTTCTACCACAGATGCCAATAGCCAACCATTACAGATTGACCGAAAGGAACCAAGAGAAATAAACTTCAGCTGAACATTTTTACAGTGCTCTGTACATTGGTCTAATTTTTATGGATCCTCCATTCATTCAAGCGTATTTTATATTTAAACACCCCCTCCCTCGCTAAGCATGTTTGAAATCAAGAACATTCTTCATTGATAATGTGCGATAACAAAAAACTCGTAATGCCAGTATAGTTGGAATACTCCATTGTTTCATGAGCTTTACATTTACAGCCAAATTGATTCTTGAGATCTGAAACCATTAGCAGTGTTTCACTTGGTGCATATTTAAACTGTACTTCGCTAGTAGAAGAATTTTATTTCTTCCATTCATTTGTAACTGTCATAAAGCTGGCAGATTTGAGTATTATGTAGGATCACAGTGGATGTAAAGATCACCAGCATCCTTCCTAATCCTTGCAGGTTTAGAGGAGGCACAAGTGATTGATGTTTTATAACAATTTAAGTGCAGATAAATCATCACTTCTGTATGTAGTTTTTTTTTTTTTTTTTGGTATTATAAATAGGCTTACGTATTCCTCTGAGGAGAGTAATATTACTCTGGGAACAAGGCTTGTCTTGGTTAGGATTAGATTTGATTGGGTTATTTAGATTATATTTGATTTAATTATTTGAATTAGATTTAGTCATACAGATTTAGTCATGTCTGATTTGATTATGTAACGATATATAATAGCCAATTTTCTTCTTCAATAGAATGAGAGATTTTTCCTTCCATTCCTAACCGTCTTCCTTAATAATCATACCCATACCCACTAGTGACTCCATTTCATCTAAAAAGTTCTTTTCTTGAGATCTCAGGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGGTAATATAGTCATGGTGAAATATTAGGACCTATAACCGATGTTGTAGTTTACAGGTTTGTGACTCTGTTGATATACTATCTACCATTACTAAGACTAATTGTTATAGCAAATGCACTGTTGACTTTCAAAGAAGAAAGTTTTTAGTCAAAAGCTTGTTGGTTCTTTCTCAATGGTTAATACTCAATCAGCTAGTGTGTTAAAATTGACATTATGCAGCCTGAATAATCTAGCTTTTATCACAGAGTTGGACGGCTTCATTTATTACGGAGAATAATAGGACATAACATGATAAAAAGAGAGTATCATGTCATTTACTTGACTGAATTCTGATAAAATCTGAATTATGATTAATCTTCTGAGAGAAGTAAGTTGAATTTTGAGAATGGTTTCATTGGAGGAGCATGCGCTGTTTTCTTATTACTAAATGTTTTAGAAATGTGCAATTTCCAGCAAAGATATGTACTCTTTAACTCCATTACGCTTGAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTGTAAGTGAATCGGGTTTCATTATCATTGTTTCTTTTATATAGTGAGCAAACTGAACTTAAATTCATACTTATTCTGGAGGATCTAAGTATTTGTTTCATTGGATGTAGTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGTGATTAATCTGAAAGGACAAAAAGGAGAGCCTTTATAA

mRNA sequence

ATGGACCTGTCTGTGATCGATCTGGCCCCGTATCTTACGGCCTCGTCGGAACTCGCCAGCGGCTCTCCGATCGACTTTGGGCCCCAACTCACTGCTCTGTGCAAGGAAGTCAGTCGGACTCTGAAAGAGACCGGCGCACTCCTGGTTAAGGATCCGAGATGCTCTGCCGAAGATAACGATCGATTTATTGATATGATGGAAAGCTTCTTTGAGAAGCCAACTGAGTTCAAGCGTCTGCACGCAAGGCCTAATTTACATTACCAGGTTGGGGTGACGCCAGAAGGTGTGGAAGTCCCAAAAAGCCTGGTCGATGAAGAAATGCAGGAAAAGATAAGAGCAATGCCCAAGGAATTCCAACCATTGATTCCCAGGGGGCCGGATCCCAAGTGGCGATACATGTGGAGAGTGGGTCCTCGCCCTTCAAACACCCGATTTAAGGAACTCAATGCTGAGCCGGTTATCCCTGAGGGGTTTCCTGAATGGAAGGATACCATGGATTCTTGGGGTTTCAAAATGATATCAGCCATAGAGGCTGTTGCTGAGATGGCAGCAATTGGGTTTGGCTTGCCCAAGGATGCATTCACTTCTTTGATGAAGCAGGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGTGATTAATCTGAAAGGACAAAAAGGAGAGCCTTTATAA

Coding sequence (CDS)

ATGGACCTGTCTGTGATCGATCTGGCCCCGTATCTTACGGCCTCGTCGGAACTCGCCAGCGGCTCTCCGATCGACTTTGGGCCCCAACTCACTGCTCTGTGCAAGGAAGTCAGTCGGACTCTGAAAGAGACCGGCGCACTCCTGGTTAAGGATCCGAGATGCTCTGCCGAAGATAACGATCGATTTATTGATATGATGGAAAGCTTCTTTGAGAAGCCAACTGAGTTCAAGCGTCTGCACGCAAGGCCTAATTTACATTACCAGGTTGGGGTGACGCCAGAAGGTGTGGAAGTCCCAAAAAGCCTGGTCGATGAAGAAATGCAGGAAAAGATAAGAGCAATGCCCAAGGAATTCCAACCATTGATTCCCAGGGGGCCGGATCCCAAGTGGCGATACATGTGGAGAGTGGGTCCTCGCCCTTCAAACACCCGATTTAAGGAACTCAATGCTGAGCCGGTTATCCCTGAGGGGTTTCCTGAATGGAAGGATACCATGGATTCTTGGGGTTTCAAAATGATATCAGCCATAGAGGCTGTTGCTGAGATGGCAGCAATTGGGTTTGGCTTGCCCAAGGATGCATTCACTTCTTTGATGAAGCAGGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGGACCTCATCTTCTTGCTCCAACAGGAAGTGACCTTCATCGTTATGGTCAGGAGGGCACTGTCTTTGCTGGGTATCACTATGATCTTAATTTTCTAACAATCCATGGCAGAAGCAGATTCCCTGGTCTATATATTTGGCTTAGAAATGGGCAAAAAGTTGAAGTCAAAGTACCGATTGGATGTCTTCTCATTCAGACTGGAAAGCAGATTGAATGGCTGACTGCTGGTGACTGCATAGCGGGCATGCATGAAGTAGTTGTCACAGAAAGGACAAGGGATGCAATTAAGCTTGCATCAGAGCAAAATCGCAGTCTCTGGAGAGTCTCTTCAACTTTATTTGCTCATATAGCATCTGATGCTGTCTTGAATCCTCTTGGCCACTTTGCTGAATCCCCACATGCCCACAAGTATCCACCTATGTTGGCAGGAGAATATGTCGAGAAAGAGCTTTCAGTGATTAATCTGAAAGGACAAAAAGGAGAGCCTTTATAA

Protein sequence

MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDNDRFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQPLIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVAEMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRFPGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASEQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRFPGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASEQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKGEPL
Homology
BLAST of Cp4.1LG03g04420 vs. ExPASy Swiss-Prot
Match: I1R9B5 (2-oxoglutarate-dependent dioxygenase FGSG_00048 OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) OX=229533 GN=FG00048 PE=3 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 7.5e-04
Identity = 58/264 (21.97%), Postives = 105/264 (39.77%), Query Frame = 0

Query: 32  ALCKEVSRTLKETGALLVKDPRCSAEDNDRFIDMMESFFEKPTEFKRLHARPNLHYQVGV 91
           A   E+   L + G   V+DP    +     + +   FF+ PTE K             +
Sbjct: 24  AFLAELRDALVKVGFFQVRDPPIPLKLQQDALRLSAQFFDLPTEKK-------------L 83

Query: 92  TPEGVEVPKSLVDEEMQEKIRAMPKEFQPLIPRGPDPKWRYMWRVGPRPSNTRFKELNAE 151
             E V   + L    +  +  A   ++   I  GP+     +  +G  P    +  L   
Sbjct: 84  DIENVHSKRFLGYSRINSESTASGTDYLESILLGPN-----LPELG--PEEPVYLHLQGP 143

Query: 152 PVIPE--GFPEWKDTMDSWGFKMISAIEAVAEMAAIGFGLPKDAFTSLMKQGP-HLLAPT 211
              P+    P ++D ++S+  ++       A + A    +P D  T L+ Q     L PT
Sbjct: 144 SQWPDEVSVPGFRDVLESYHSQIQDFSIEFARLIAEALEMPLDTLTKLLGQPLFSRLKPT 203

Query: 212 ---GSDLHRYGQEGTVFAGYHYDLNFLT--IHGRSRFPGLYIWLRNGQKVEVKVPIGCLL 271
                 ++   ++G+   G H D+ F+T  + G +    L +  + G  V V    G L+
Sbjct: 204 RYLPPSMNPAAEDGSHGIGPHKDIAFMTYLLQGGTH-NCLEVQNKLGHWVPVPPVPGALV 263

Query: 272 IQTGKQIEWLTAGDCIAGMHEVVV 288
           +  G+ +E +T G C+A  H V++
Sbjct: 264 VNIGRLLEVITGGVCVATTHRVIL 266

BLAST of Cp4.1LG03g04420 vs. NCBI nr
Match: XP_023527893.1 (uncharacterized protein LOC111790976 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 726 bits (1875), Expect = 1.15e-261
Identity = 354/363 (97.52%), Postives = 355/363 (97.80%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. NCBI nr
Match: KAG6581591.1 (hypothetical protein SDJN03_21593, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 726 bits (1873), Expect = 2.31e-261
Identity = 353/363 (97.25%), Postives = 355/363 (97.80%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIK+ASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKIASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. NCBI nr
Match: XP_022934893.1 (uncharacterized protein LOC111441933 [Cucurbita moschata])

HSP 1 Score: 725 bits (1872), Expect = 3.28e-261
Identity = 352/363 (96.97%), Postives = 355/363 (97.80%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSL+DEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLIDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIK+ASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKIASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. NCBI nr
Match: XP_022983264.1 (uncharacterized protein LOC111481897 [Cucurbita maxima])

HSP 1 Score: 721 bits (1860), Expect = 2.21e-259
Identity = 350/363 (96.42%), Postives = 352/363 (96.97%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELA GSPIDFGPQLT LCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELAGGSPIDFGPQLTVLCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISA EAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISATEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDC+AGMHEVVVTERTRDAIKLASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCLAGMHEVVVTERTRDAIKLASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. NCBI nr
Match: KAG7018092.1 (hypothetical protein SDJN02_19959 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 716 bits (1848), Expect = 2.42e-257
Identity = 352/376 (93.62%), Postives = 355/376 (94.41%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSL+DEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLIDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQ-------------IEWLTAGDCIAGMHEVVV 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQ             IEWLTAGDCIAGMHEVVV
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQQRYVLFNSITLEQIEWLTAGDCIAGMHEVVV 300

Query: 301 TERTRDAIKLASEQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVE 360
           TERTRDAIK+ASEQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVE
Sbjct: 301 TERTRDAIKIASEQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVE 360

Query: 361 KELSGPHLLAPTGSDL 363
           KELS  +L    G  L
Sbjct: 361 KELSVINLKGQKGEPL 376

BLAST of Cp4.1LG03g04420 vs. ExPASy TrEMBL
Match: A0A6J1F905 (uncharacterized protein LOC111441933 OS=Cucurbita moschata OX=3662 GN=LOC111441933 PE=4 SV=1)

HSP 1 Score: 725 bits (1872), Expect = 1.59e-261
Identity = 352/363 (96.97%), Postives = 355/363 (97.80%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSL+DEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLIDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIK+ASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKIASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. ExPASy TrEMBL
Match: A0A6J1J6U8 (uncharacterized protein LOC111481897 OS=Cucurbita maxima OX=3661 GN=LOC111481897 PE=4 SV=1)

HSP 1 Score: 721 bits (1860), Expect = 1.07e-259
Identity = 350/363 (96.42%), Postives = 352/363 (96.97%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDLSVIDLAPYLTASSELA GSPIDFGPQLT LCKEVSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLSVIDLAPYLTASSELAGGSPIDFGPQLTVLCKEVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISA EAVA
Sbjct: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISATEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDC+AGMHEVVVTERTRDAIKLASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCLAGMHEVVVTERTRDAIKLASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSVINLKGQKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. ExPASy TrEMBL
Match: A0A6J1DGY0 (uncharacterized protein LOC111020339 OS=Momordica charantia OX=3673 GN=LOC111020339 PE=4 SV=1)

HSP 1 Score: 676 bits (1745), Expect = 3.42e-242
Identity = 328/363 (90.36%), Postives = 341/363 (93.94%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDL VIDLAP+LTASSELA  SPI+  P LTALC+EVSRTLKETGALLVKDPRCS EDND
Sbjct: 1   MDLPVIDLAPFLTASSELAGDSPIELAPPLTALCEEVSRTLKETGALLVKDPRCSVEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMME FFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP
Sbjct: 61  RFIDMMEKFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           L+P+GPD KWRYMWRVGPRPSNTRFKELNAEPVIP+GFPEWKDTMDSWGFKMISAIEAVA
Sbjct: 121 LLPKGPDCKWRYMWRVGPRPSNTRFKELNAEPVIPDGFPEWKDTMDSWGFKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLP+DAFTSLMKQGPHLLAPTGSDLH YGQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPRDAFTSLMKQGPHLLAPTGSDLHSYGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVT+RT DAIK+ASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTKRTIDAIKIASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           Q RSLWRVSSTLFAHIASDAVL PLGHFAESPHA+KYP +LAGEYVEKEL+  +L    G
Sbjct: 301 QKRSLWRVSSTLFAHIASDAVLKPLGHFAESPHANKYPVILAGEYVEKELAVINLKGKKG 360

Query: 361 SDL 363
             L
Sbjct: 361 EPL 363

BLAST of Cp4.1LG03g04420 vs. ExPASy TrEMBL
Match: A0A6L2L8I5 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein OS=Tanacetum cinerariifolium OX=118510 GN=Tci_028920 PE=4 SV=1)

HSP 1 Score: 686 bits (1771), Expect = 4.13e-242
Identity = 347/616 (56.33%), Postives = 420/616 (68.18%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASS-ELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDN 60
           MD+ VIDL PY+ A+S E      +   P+L  +C EVSR L+ETGALLV+DPRCSAED+
Sbjct: 1   MDIPVIDLTPYVDATSGEFCLDGVLS--PELGKVCLEVSRILRETGALLVRDPRCSAEDD 60

Query: 61  DRFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQ 120
           DRFI MME +FE P EFK L ARP++HYQ G TP G+EVP+SL  ++M EK +A+PKE Q
Sbjct: 61  DRFISMMEKYFEMPDEFKLLQARPDMHYQNGATPGGLEVPRSLAVKDMLEKAKALPKEHQ 120

Query: 121 PLIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAV 180
           PLIP G D KWRYMWR+GPRPS TRFK+LN+E +IPEGFPEW++TMDSWG+K++SA+EAV
Sbjct: 121 PLIPTGADLKWRYMWRIGPRPSTTRFKDLNSEHIIPEGFPEWEETMDSWGYKLMSAVEAV 180

Query: 181 AEMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSR 240
           AEMAAIGFGLPKDAFT L+K GPHLL+PTG DL  +G+EGT+FAGYHYDLNFLTIH RS+
Sbjct: 181 AEMAAIGFGLPKDAFTGLLKNGPHLLSPTGGDLGSHGKEGTIFAGYHYDLNFLTIHYRSK 240

Query: 241 FPGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLAS 300
           FPGLYIWLRNG+KVEVKVP GCLLIQ+GKQ+EW+TAGDC+AG+HEV+VT++T DAIK AS
Sbjct: 241 FPGLYIWLRNGKKVEVKVPEGCLLIQSGKQLEWVTAGDCMAGLHEVIVTKKTVDAIKSAS 300

Query: 301 EQNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS-------- 360
           E NRSLWRVSSTLF+ +ASDA++ PLGH A+SP A  YPP+ AGEY +KEL+        
Sbjct: 301 EANRSLWRVSSTLFSAVASDAIMKPLGHHAQSPLADNYPPVYAGEYFQKELARRYAWSMM 360

Query: 361 ------------------------------------------------------------ 420
                                                                       
Sbjct: 361 EKYFEMPDELKLLQARPDMNYQNGATPGGIQVPRCLVVEDMIKKARALPKEHQPIIPTRA 420

Query: 421 ---------------------------------------------------GPHLLAPTG 480
                                                              GPHLLAPTG
Sbjct: 421 DLKWRYMWRIGPRPSTTRSQNMNSEHIIPEGFPEWEQTMNSWGYKLMAAVEGPHLLAPTG 480

Query: 481 SDLHRYGQEGTVFAGYHYDLNFLTIHGRSRFPGLYIWLRNGQKVEVKVPIGCLLIQTGKQ 496
            DL  +G+EG+VFAGYHYDLNFLTIH R++FPGL IWLR G+K+EV VP GCLLI     
Sbjct: 481 GDLGTHGKEGSVFAGYHYDLNFLTIHYRNKFPGLNIWLRIGKKLEVNVPEGCLLIHA--- 540

BLAST of Cp4.1LG03g04420 vs. ExPASy TrEMBL
Match: A0A0A0LD35 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G698530 PE=4 SV=1)

HSP 1 Score: 675 bits (1741), Expect = 1.39e-241
Identity = 324/351 (92.31%), Postives = 339/351 (96.58%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           MDL VIDLA YLTASSELA+GSPIDF PQLT+LC+ VSRTLKETGALLVKDPRCSAEDND
Sbjct: 1   MDLPVIDLASYLTASSELAAGSPIDFSPQLTSLCEVVSRTLKETGALLVKDPRCSAEDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMME FFEKPTEFKRL ARP+LHYQVGVTPEGVE+PKSLVD+EMQE IRAMPKEFQP
Sbjct: 61  RFIDMMERFFEKPTEFKRLQARPHLHYQVGVTPEGVEIPKSLVDDEMQENIRAMPKEFQP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
           L+P+GPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMD+WG KMISAIEAVA
Sbjct: 121 LLPKGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDAWGVKMISAIEAVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLP+DAFTSLMKQGPHLLAPTGSDL R+GQEGTVFAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPRDAFTSLMKQGPHLLAPTGSDLDRHGQEGTVFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNGQKVEVKVPIGCLLIQ GKQIEWLTAGDCIAGMHEVVVT+RTRDA+KLASE
Sbjct: 241 PGLYIWLRNGQKVEVKVPIGCLLIQIGKQIEWLTAGDCIAGMHEVVVTKRTRDAVKLASE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELS 351
           QNRSLWRVSSTLFAHIASDAVL PLGHFAESPHA+KYP MLAGEYVEKEL+
Sbjct: 301 QNRSLWRVSSTLFAHIASDAVLKPLGHFAESPHANKYPSMLAGEYVEKELA 351

BLAST of Cp4.1LG03g04420 vs. TAIR 10
Match: AT5G48020.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 576.6 bits (1485), Expect = 1.9e-164
Identity = 275/361 (76.18%), Postives = 308/361 (85.32%), Query Frame = 0

Query: 1   MDLSVIDLAPYLTASSELASGSPIDFGPQLTALCKEVSRTLKETGALLVKDPRCSAEDND 60
           M+L V+DL+ YL  S +       + G  L   C++VSR LKETGAL+VKDPRC A+DND
Sbjct: 1   MELPVVDLSSYLDFSGD-------ELGSDLLESCRQVSRILKETGALIVKDPRCCAQDND 60

Query: 61  RFIDMMESFFEKPTEFKRLHARPNLHYQVGVTPEGVEVPKSLVDEEMQEKIRAMPKEFQP 120
           RFIDMME++FEKP +FKRL  RPNLHYQVG TPEGVEVP+SLVDEEMQEK   MP E++P
Sbjct: 61  RFIDMMENYFEKPDDFKRLQQRPNLHYQVGATPEGVEVPRSLVDEEMQEKFNTMPNEYKP 120

Query: 121 LIPRGPDPKWRYMWRVGPRPSNTRFKELNAEPVIPEGFPEWKDTMDSWGFKMISAIEAVA 180
            IP+GPD KWRYMWRVGPRPSNTRFKELN+EPV+PEGFP W++ MDSWG+KMISA+E VA
Sbjct: 121 HIPKGPDHKWRYMWRVGPRPSNTRFKELNSEPVVPEGFPGWEEVMDSWGYKMISAVEVVA 180

Query: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLHRYGQEGTVFAGYHYDLNFLTIHGRSRF 240
           EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDL+ Y +EGT+FAGYHYDLNFLTIHGRSRF
Sbjct: 181 EMAAIGFGLPKDAFTSLMKQGPHLLAPTGSDLNCYNEEGTIFAGYHYDLNFLTIHGRSRF 240

Query: 241 PGLYIWLRNGQKVEVKVPIGCLLIQTGKQIEWLTAGDCIAGMHEVVVTERTRDAIKLASE 300
           PGLYIWLRNG+KV VKVP+GCLLIQ GKQIEWLTAG+CIAGMHEVVVT +T+DAI LA E
Sbjct: 241 PGLYIWLRNGEKVAVKVPVGCLLIQAGKQIEWLTAGECIAGMHEVVVTSKTKDAITLAKE 300

Query: 301 QNRSLWRVSSTLFAHIASDAVLNPLGHFAESPHAHKYPPMLAGEYVEKELSGPHLLAPTG 360
           QNRSLWRVSSTLFAHIASDA L PLGHFAES  A KYP + AGEYVE+ELS  +L    G
Sbjct: 301 QNRSLWRVSSTLFAHIASDAELKPLGHFAESSLASKYPAIPAGEYVEQELSVINLKGNKG 354

Query: 361 S 362
           S
Sbjct: 361 S 354

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
I1R9B57.5e-0421.972-oxoglutarate-dependent dioxygenase FGSG_00048 OS=Gibberella zeae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
XP_023527893.11.15e-26197.52uncharacterized protein LOC111790976 [Cucurbita pepo subsp. pepo][more]
KAG6581591.12.31e-26197.25hypothetical protein SDJN03_21593, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022934893.13.28e-26196.97uncharacterized protein LOC111441933 [Cucurbita moschata][more]
XP_022983264.12.21e-25996.42uncharacterized protein LOC111481897 [Cucurbita maxima][more]
KAG7018092.12.42e-25793.62hypothetical protein SDJN02_19959 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1F9051.59e-26196.97uncharacterized protein LOC111441933 OS=Cucurbita moschata OX=3662 GN=LOC1114419... [more]
A0A6J1J6U81.07e-25996.42uncharacterized protein LOC111481897 OS=Cucurbita maxima OX=3661 GN=LOC111481897... [more]
A0A6J1DGY03.42e-24290.36uncharacterized protein LOC111020339 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
A0A6L2L8I54.13e-24256.332-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein OS=Tanac... [more]
A0A0A0LD351.39e-24192.31Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G698530 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48020.11.9e-16476.182-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027443Isopenicillin N synthase-like superfamilyGENE3D2.60.120.330coord: 1..354
e-value: 6.3E-55
score: 188.9
coord: 355..505
e-value: 2.1E-28
score: 101.7
NoneNo IPR availablePANTHERPTHR10209:SF7652-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN-RELATEDcoord: 352..503
NoneNo IPR availablePANTHERPTHR10209OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEINcoord: 38..350
NoneNo IPR availablePANTHERPTHR10209:SF7652-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN-RELATEDcoord: 38..350
NoneNo IPR availablePANTHERPTHR10209OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEINcoord: 352..503
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 368..501
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 19..349

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g04420.1Cp4.1LG03g04420.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016706 2-oxoglutarate-dependent dioxygenase activity