Cp4.1LG18g05360 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG18g05360
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Location: Cp4.1LG18: 5947070 .. 5951103 (-)
RNA-Seq Expression: Cp4.1LG18g05360
Synteny: Cp4.1LG18g05360
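
The location field above packs the linkage group, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cp4.1LG18: 5947070 .. 5951103 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the feature
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG18: 5947070 .. 5951103 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG18 - 4034
```

Under that assumption the gene spans 4034 bp on the minus strand of Cp4.1LG18.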
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAACACTCATTCAAGTTTGAAAGTTCAAGGTTTAAATATGAGTTCATATCATAAAAATCTACCAAATTCATCCTGAAATTAAAAATTTTAAAGTTGAAATTAATACAATAACAAGTTCAGAGTCAAGAAAATAATTTTTTAGTGTCTTCTCGAGATTGCAGTTGACAAATCCAATTTCAGTTCTAAACCGATTAAAATGCTCTGTATTGATCTCTGTTTTGCAACCGGAACAGCAGAACTCCGTCGCTTCTGCACCACCAGCCATGGCCGGAACAAACCCTTCCGGCACCGTCCAAGGTGTAGCCTCTAAAGGAGAAGTCCCGGAAAGATACATCCATAAAGAAAGCGATCGAGGAGCTCGAGATGCTCCTTTAATGGCAGCTCCTGTAATCGATATGGCTCTCCTCTCCTCTTCCTCCAAATCCGGACCAGAACTGGAGAAACTCCGTCATGGACTTCAATCATGGGGCTGCTTTCAGGTTCGATTTCTTAAACCCTAATCGGTCGACCTAGTCCATTGACTCTGCATCTAGATCTCATCAAAACGAGCGTAAGTCGATTTAAATCAGATTAAAAAAAATCGCTAGGTTTCTGGTATTCAAACAGGTTGTAAATCATGGAATGTCAGCGGAATTTTTGGATGAAGTTCGTCGATTAGCGAAACAGTTCTTTGATCTTCCAATGGAAGAGAAATCGAAATACTCGAGGGAAGAAAATGAGATTGAAGGATATGGAAACGACATGATTCTATCAAATCAACAAATTCTCGATTGGACTGATCGATTATACCTTACTGTATATCCAAAAGAAAGCCATCGATTCAAGTACTGGCCAACAAATCCTGAAAGATTCAGGTACTTTTCTCAAACCTTAACGATTTTCCGAAGAATCTGCCGAAACGAGTTGATTTTTGTGATGAACAATGCAGGGAAGTTCTTCACGAGTACACTGCAAATGTGAAGCTGTTAAGCGAGAAAATCCTTAAAGCTATGGCGATTTCATTGGATTTAAACGAAGATAGCTTCATCAAACAGTATGGTGAGGAAGTTAAACTGGACGCACGGTTCAATTTCTACCCTCGATGTCGAAATCCGGATCTTGTTCTTGGCGTGAAACCGCATGCGGATGGATCGGCCATCACCATTTTGTTGCAGGATGAGGAAGTAGAAGGTCTTCAGTTCTTGAACGGCAATGAGTGGTTCAACGCCCCAATCGTTCCTGGCGCTCTTCTCGTCAATGTTGGGGATCAAGCAGAGGTAAACAAACACTCTCTCGTTCATCATCGCCAGTAGATATTGTACTCTTTCCCTCAAAAATTTTCACAAACATGTTCATGGCAGATCACGAGTAATGGGATATTCAAGAGTCCAGTTCATAGGGTGTTGACGAACTCGGAGAGGGAGAGGATATCGTTGGCAGTGTTTTACCTTCCGGATTGGAAGAAAGAAATCGAACCATTGGAGAAGCTCATCGATGAAACTCGGCCAAGGTTGTACAAGACTGTGAAGAACTTCGTTGGCCTTTACTTTCAGTACTACCAGCAGGGCCAGAGGCCCATGGAGGCCGCACGAATCTAAATTTGGTATCAATCTTAATCAATCGAATCAAATGGAATCATAATTTATTTTGTTTGTTTATGTTTTTCTTCCAATTTGTTCTATGAGGTTCTTGTTTGGACGAGTTTTGTATGGACTTATATCCACGCCATATCTTAGGAATAAGATCGTCATAACTTGTTTGGTTATGGTCACCAAGAACAAACCTATGTGTCGTTTACATCATTGGGACATCTCGAACATCCATCCATTCGAATTATATCGAAGCATTGGACACAGGATGTCATTGGAACACTTTGAATATCCATTTATCTGAGTTACATCAAGGCACTGGACGTCATCAGGGGATCTCGAATATCCAGGTTACATCGCGGCACAGAACATCTCGAACATCCATTCATCCAGGTTATATCGATGTATTCAACTCTCCTGTGTCCATGTTAG
TTCATACCCTTTTCTTATTGGTCATATCGACAACGGAGCAAAACAAATTGAATGTCTCAATCAAATAATATAAACAAAAATAAAGACTATGTAACAATCTAAATCCACCTCATTGGGAGGCAGCATCCTCACTGGCACACTAGGTAGTGTGCCTTTAATACTATTTATAACAGTTCAAACCGACCGCTAGTAGTTATCTCATAGTCATTGAATAGCTTGGTGTGACTTCTTATCCATTTTATCAAACTCAACATATTTTCATTAAATGGCGTGACAAATTTTTAATACTATTCTCGCCAAGTGTCTTAAAAGTTGTCGACGCCTTCAATTTAATGTCGCCACGTCTACGTGCCTCGTCTTCAATCCAGGAGGAGAAGCCATTACAGAGCTGTCCATCATTGCTGCGCCGTCGTTGTCTTGTCTTGTTGCTGCCCCATCCTCCATGGCCGGAACCAACCCTTCCGGCAGCGTCCAAGAAATGGGTTCTAAAGCGGAAGTGCCGGAAGGCTACATCCATAAACAAAGGGATCGGGGAGCTCCGGATGCTCCATTAATGCAAGCTTCTGTCATTGATATCGCCCTCCTCTCAGCTTCCCCCAATTCCGGGCCGGAGCTGGAGAAACTCCGACATGGACTCCAGACATGGGGCTGCATTCTGGTCCGATTTCACCCCAACTTTCTGGAACATTTAGATTTTCTTAACCGAATGTTTCAAAAAATGATGCTCTGTTTCAGGGAATAAACCATGGAATGTCTCCTGAATTCCTGGAGGATGTTCGCCACGTAATCAAACAATTCTTCGCTTGTCCAATGGAAGAGAAGCTGAAATACTCCATGGAAGAAGCTGAGATTGAAGGATACGGCAACGACAAGATTCTATCCGACACCCAAATTCTTGATTGGAATCATCGTTTGTTCCTCACTCTGTTCCCAGAAGAAAGCCGTCGTGTTAAGCACTGGCCATCAAATCCTCACAGATTTAGGTAATTTTCTGGAATCTCTTTGAAATCATACAGTTTTGAATGGAATTGAAGAACATTGAGCATTGATTTCAGGGAAGTTATTGATGAGTATAGTGCTAATATGAGAGTGGTATGTGAGAAAATCTTGAAGGCCATGGGAAGATCATTGGATTTGGATGAGAATAGCTTTGTGGAACAGTTTGGTGAGCGATTTGAATTGGCTGCACGCTTCAATTTCTACCCTCCATGTCCGAATCCTGATCTTGTTCTTGGTGCCAAGCCTCATGCTGATGTATCGGCCATCACTGTTTTGCTGCAGGACGAGCAAGTTGAAGGGCTTCAGTTCTTGAAAGGGGATGAGTGGTTCAATGCTCCGATACTTCCCGACGCGCTTCTCGTCCTTGTCGGAGATCAAGTCGAGGTAAACCCTAATTATAATTAGGTTAGCTTTCAAAGTTTAGCTCCTAACAGTTGGTCCTATCTATTTAGTCTTTAAATGGAAACCTTTTCTCTAGAATACACGTTTTATTTTTAAAGGGAAGTCAGAAGGGAAAGCCCAAGAAGGACAATTTCTGGTAGCAGTAGAGTGTGGAAACCTTTCTCTAGAATATGCGTTTTAAAATTTTGAAGGGAAGTCGGGAGGGAAAGCCCAAGAAGGACAATTTCTGGTAGCAGTAGGTTTGGGCTGTTAGTGAGGATGCTAGGCCCCCAAGGGGATGGATTGTGAAATCTCACTTCTGGAGAGGATACAAAGCTTCTAAGAGAGTGAACGCGTTTTAAAGTCACAAAAACATTTATTTTAAAATCATTATGAAACATGCTCGTAATTTGGCAGATTGCAAGTAACGGGTTATTCAAGAGTCCAGTTCATAGAGTATTGACAAATTCAAAGAGCGAGAGGACATCTTTAGCCGTGTTTTACGTCCCAGATCCCAAGAAAGAAGTTGGACCATTAGAGACGCTCATCGATGAAACTCAACCAAGATCGTACAAGCCTGTTAAGAACTTAGTTGACCTTTACTTCGAGTACTACCAACG
AGGCCGAAGACCAATCGAGACAGCAAGAATCTAA

mRNA sequence

AAACACTCATTCAAGTTTGAAAGTTCAAGGTTTAAATATGAGTTCATATCATAAAAATCTACCAAATTCATCCTGAAATTAAAAATTTTAAAGTTGAAATTAATACAATAACAAGTTCAGAGTCAAGAAAATAATTTTTTAGTGTCTTCTCGAGATTGCAGTTGACAAATCCAATTTCAGTTCTAAACCGATTAAAATGCTCTGTATTGATCTCTGTTTTGCAACCGGAACAGCAGAACTCCGTCGCTTCTGCACCACCAGCCATGGCCGGAACAAACCCTTCCGGCACCGTCCAAGGTGTAGCCTCTAAAGGAGAAGTCCCGGAAAGATACATCCATAAAGAAAGCGATCGAGGAGCTCGAGATGCTCCTTTAATGGCAGCTCCTGTAATCGATATGGCTCTCCTCTCCTCTTCCTCCAAATCCGGACCAGAACTGGAGAAACTCCGTCATGGACTTCAATCATGGGGCTGCTTTCAGGTTGTAAATCATGGAATGTCAGCGGAATTTTTGGATGAAGTTCGTCGATTAGCGAAACAGTTCTTTGATCTTCCAATGGAAGAGAAATCGAAATACTCGAGGGAAGAAAATGAGATTGAAGGATATGGAAACGACATGATTCTATCAAATCAACAAATTCTCGATTGGACTGATCGATTATACCTTACTGTATATCCAAAAGAAAGCCATCGATTCAAGTACTGGCCAACAAATCCTGAAAGATTCAGGGAAGTTCTTCACGAGTACACTGCAAATGTGAAGCTGTTAAGCGAGAAAATCCTTAAAGCTATGGCGATTTCATTGGATTTAAACGAAGATAGCTTCATCAAACAGTATGGTGAGGAAGTTAAACTGGACGCACGGTTCAATTTCTACCCTCGATGTCGAAATCCGGATCTTGTTCTTGGCGTGAAACCGCATGCGGATGGATCGGCCATCACCATTTTGTTGCAGGATGAGGAAGTAGAAGGTCTTCAGTTCTTGAACGGCAATGAGTGGTTCAACGCCCCAATCGTTCCTGGCGCTCTTCTCGTCAATGTTGGGGATCAAGCAGAGATCACGAGTAATGGGATATTCAAGAGTCCAGTTCATAGGGTGTTGACGAACTCGGAGAGGGAGAGGATATCGTTGGCAGTGTTTTACCTTCCGGATTGGAAGAAAGAAATCGAACCATTGGAGAAGCTCATCGATGAAACTCGGCCAAGTACTACCAGCAGGGCCAGAGGCCCATGGAGGCCGCACGAATCTAAATTTGGAGGAGAAGCCATTACAGAGCTGTCCATCATTGCTGCGCCGTCGTTGTCTTGTCTTGTTGCTGCCCCATCCTCCATGGCCGGAACCAACCCTTCCGGCAGCGTCCAAGAAATGGGTTCTAAAGCGGAAGTGCCGGAAGGCTACATCCATAAACAAAGGGATCGGGGAGCTCCGGATGCTCCATTAATGCAAGCTTCTGTCATTGATATCGCCCTCCTCTCAGCTTCCCCCAATTCCGGGCCGGAGCTGGAGAAACTCCGACATGGACTCCAGACATGGGGCTGCATTCTGGGAATAAACCATGGAATGTCTCCTGAATTCCTGGAGGATGTTCGCCACGTAATCAAACAATTCTTCGCTTGTCCAATGGAAGAGAAGCTGAAATACTCCATGGAAGAAGCTGAGATTGAAGGATACGGCAACGACAAGATTCTATCCGACACCCAAATTCTTGATTGGAATCATCGTTTGTTCCTCACTCTGTTCCCAGAAGAAAGCCGTCGTGTTAAGCACTGGCCATCAAATCCTCACAGATTTAGGGAAGTTATTGATGAGTATAGTGCTAATATGAGAGTGGTATGTGAGAAAATCTTGAAGGCCATGGGAAGATCATTGGATTTGGATGAGAATAGCTTTGTGGAACAGTTTGGTGAGCGATTTGAATTGGCTGCACGCTTCAATTTCTACCCTCCATGTCCGAATCCTGATCTTGTTCTTGGTGCCAAGCCTCATGCTGATGTATCGGCC
ATCACTGTTTTGCTGCAGGACGAGCAAGTTGAAGGGCTTCAGTTCTTGAAAGGGGATGAGTGGTTCAATGCTCCGATACTTCCCGACGCGCTTCTCGTCCTTGTCGGAGATCAAGTCGAGAGTCCAGTTCATAGAGTATTGACAAATTCAAAGAGCGAGAGGACATCTTTAGCCGTGTTTTACGTCCCAGATCCCAAGAAAGAAGTTGGACCATTAGAGACGCTCATCGATGAAACTCAACCAAGATCGTACAAGCCTGTTAAGAACTTAGTTGACCTTTACTTCGAGTACTACCAACGAGGCCGAAGACCAATCGAGACAGCAAGAATCTAA

Coding sequence (CDS)

ATGGCCGGAACAAACCCTTCCGGCACCGTCCAAGGTGTAGCCTCTAAAGGAGAAGTCCCGGAAAGATACATCCATAAAGAAAGCGATCGAGGAGCTCGAGATGCTCCTTTAATGGCAGCTCCTGTAATCGATATGGCTCTCCTCTCCTCTTCCTCCAAATCCGGACCAGAACTGGAGAAACTCCGTCATGGACTTCAATCATGGGGCTGCTTTCAGGTTGTAAATCATGGAATGTCAGCGGAATTTTTGGATGAAGTTCGTCGATTAGCGAAACAGTTCTTTGATCTTCCAATGGAAGAGAAATCGAAATACTCGAGGGAAGAAAATGAGATTGAAGGATATGGAAACGACATGATTCTATCAAATCAACAAATTCTCGATTGGACTGATCGATTATACCTTACTGTATATCCAAAAGAAAGCCATCGATTCAAGTACTGGCCAACAAATCCTGAAAGATTCAGGGAAGTTCTTCACGAGTACACTGCAAATGTGAAGCTGTTAAGCGAGAAAATCCTTAAAGCTATGGCGATTTCATTGGATTTAAACGAAGATAGCTTCATCAAACAGTATGGTGAGGAAGTTAAACTGGACGCACGGTTCAATTTCTACCCTCGATGTCGAAATCCGGATCTTGTTCTTGGCGTGAAACCGCATGCGGATGGATCGGCCATCACCATTTTGTTGCAGGATGAGGAAGTAGAAGGTCTTCAGTTCTTGAACGGCAATGAGTGGTTCAACGCCCCAATCGTTCCTGGCGCTCTTCTCGTCAATGTTGGGGATCAAGCAGAGATCACGAGTAATGGGATATTCAAGAGTCCAGTTCATAGGGTGTTGACGAACTCGGAGAGGGAGAGGATATCGTTGGCAGTGTTTTACCTTCCGGATTGGAAGAAAGAAATCGAACCATTGGAGAAGCTCATCGATGAAACTCGGCCAAGTACTACCAGCAGGGCCAGAGGCCCATGGAGGCCGCACGAATCTAAATTTGGAGGAGAAGCCATTACAGAGCTGTCCATCATTGCTGCGCCGTCGTTGTCTTGTCTTGTTGCTGCCCCATCCTCCATGGCCGGAACCAACCCTTCCGGCAGCGTCCAAGAAATGGGTTCTAAAGCGGAAGTGCCGGAAGGCTACATCCATAAACAAAGGGATCGGGGAGCTCCGGATGCTCCATTAATGCAAGCTTCTGTCATTGATATCGCCCTCCTCTCAGCTTCCCCCAATTCCGGGCCGGAGCTGGAGAAACTCCGACATGGACTCCAGACATGGGGCTGCATTCTGGGAATAAACCATGGAATGTCTCCTGAATTCCTGGAGGATGTTCGCCACGTAATCAAACAATTCTTCGCTTGTCCAATGGAAGAGAAGCTGAAATACTCCATGGAAGAAGCTGAGATTGAAGGATACGGCAACGACAAGATTCTATCCGACACCCAAATTCTTGATTGGAATCATCGTTTGTTCCTCACTCTGTTCCCAGAAGAAAGCCGTCGTGTTAAGCACTGGCCATCAAATCCTCACAGATTTAGGGAAGTTATTGATGAGTATAGTGCTAATATGAGAGTGGTATGTGAGAAAATCTTGAAGGCCATGGGAAGATCATTGGATTTGGATGAGAATAGCTTTGTGGAACAGTTTGGTGAGCGATTTGAATTGGCTGCACGCTTCAATTTCTACCCTCCATGTCCGAATCCTGATCTTGTTCTTGGTGCCAAGCCTCATGCTGATGTATCGGCCATCACTGTTTTGCTGCAGGACGAGCAAGTTGAAGGGCTTCAGTTCTTGAAAGGGGATGAGTGGTTCAATGCTCCGATACTTCCCGACGCGCTTCTCGTCCTTGTCGGAGATCAAGTCGAGAGTCCAGTTCATAGAGTATTGACAAATTCAAAGAGCGAGAGGACATCTTTAGCCGTGTTTTACGTCCCAGATCCCAAGAAAGAAGTTGGACCATTAGAGACGCTCATCGATGAAACTCAACCAAGATCGTACAAGCCTGTTAA
GAACTTAGTTGACCTTTACTTCGAGTACTACCAACGAGGCCGAAGACCAATCGAGACAGCAAGAATCTAA

Protein sequence

MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEKLRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILSNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLDLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLNGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEIEPLEKLIDETRPSTTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMAGTNPSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIALLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKGDEWFNAPILPDALLVLVGDQVESPVHRVLTNSKSERTSLAVFYVPDPKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI
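
The protein sequence above is the conceptual translation of the coding sequence. As a rough sketch of that relationship, the snippet below translates a DNA string with the standard genetic code; the function name `translate` is hypothetical, and it is assumed (not stated on this page) that the standard nuclear code applies. Whitespace is removed first because the sequences on this page are wrapped across lines.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing incomplete codon is ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGCCGGAACAAACCCTTCC"))  # MAGTNPS
```

The first seven codons of the CDS give MAGTNPS, matching the start of the protein sequence.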
Homology
BLAST of Cp4.1LG18g05360 vs. ExPASy Swiss-Prot
Match: Q39224 (Protein SRG1 OS=Arabidopsis thaliana OX=3702 GN=SRG1 PE=2 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 1.2e-57
Identity = 111/304 (36.51%), Positives = 178/304 (58.55%), Query Frame = 0

Query: 19  VPERYIHKESDRGARDAPL---MAAPVIDMALLSSSSKSGPELEKLRHGLQSWGCFQVVN 78
           VP RY+  + D+   D      +  P+IDM  L SS+    E+EKL    + WG FQ+VN
Sbjct: 29  VPPRYVRSDQDKTEVDDDFDVKIEIPIIDMKRLCSSTTMDSEVEKLDFACKEWGFFQLVN 88

Query: 79  HGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILSNQQILDWTDRLYLT 138
           HG+ + FLD+V+   + FF+LPMEEK K+ +  +EIEG+G   ++S  Q LDW D  + T
Sbjct: 89  HGIDSSFLDKVKSEIQDFFNLPMEEKKKFWQRPDEIEGFGQAFVVSEDQKLDWADLFFHT 148

Query: 139 VYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLDLNEDSFIKQYGEEV 198
           V P E  +   +P  P  FR+ L  Y++ V+ +++ ++  MA +L++  +   K + +  
Sbjct: 149 VQPVELRKPHLFPKLPLPFRDTLEMYSSEVQSVAKILIAKMARALEIKPEELEKLFDDVD 208

Query: 199 KLDA-RFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLNGNEWFNAPIVPGA 258
            + + R N+YP C  PD V+G+ PH+D   +T+L+Q  +VEGLQ     +W     +P A
Sbjct: 209 SVQSMRMNYYPPCPQPDQVIGLTPHSDSVGLTVLMQVNDVEGLQIKKDGKWVPVKPLPNA 268

Query: 259 LLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEIEPLEKLIDETRPS 318
            +VN+GD  EI +NG ++S  HR + NSE+ER+S+A F+     KE+ P + L++  + +
Sbjct: 269 FIVNIGDVLEIITNGTYRSIEHRGVVNSEKERLSIATFHNVGMYKEVGPAKSLVERQKVA 328
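
The identity and similarity percentages in each HSP header are ratios of matching alignment columns to the total alignment length. A small sketch, using the counts from the SRG1 hit above (the helper name `percent` is hypothetical):

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP header of the Q39224 (SRG1) hit.
print(percent(111, 304))  # 36.51  (identical residues)
print(percent(178, 304))  # 58.55  (identical or conservatively substituted)
```

The denominator is the alignment length including gap columns, not the length of either protein.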

BLAST of Cp4.1LG18g05360 vs. ExPASy Swiss-Prot
Match: O80449 (Jasmonate-induced oxygenase 4 OS=Arabidopsis thaliana OX=3702 GN=JOX4 PE=1 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 2.3e-56
Identity = 119/318 (37.42%), Positives = 177/318 (55.66%), Query Frame = 0

Query: 6   PSGTVQGVASKG--EVPERYIHKESDRGARDAPLMAA----PVIDMALLSSSSKSGPE-L 65
           P  +VQ ++  G   VP RY+     R   +     A    PV+DM    +     PE L
Sbjct: 8   PIVSVQSLSQTGVPTVPNRYVKPAHQRPVFNTTQSDAGIEIPVLDM----NDVWGKPEGL 67

Query: 66  EKLRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDM 125
             +R   + WG FQ+VNHG++   ++ VR   ++FF+LP+EEK KY+   +  EGYG+ +
Sbjct: 68  RLVRSACEEWGFFQMVNHGVTHSLMERVRGAWREFFELPLEEKRKYANSPDTYEGYGSRL 127

Query: 126 ILSNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAI 185
            +     LDW+D  +L   P        WP+ P + RE++ +Y   V+ L E++ + ++ 
Sbjct: 128 GVVKDAKLDWSDYFFLNYLPSSIRNPSKWPSQPPKIRELIEKYGEEVRKLCERLTETLSE 187

Query: 186 SLDLNEDSFIKQY--GEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEG 245
           SL L  +  ++    G++V    R NFYP+C  P L LG+  H+D   ITILL DE+V G
Sbjct: 188 SLGLKPNKLMQALGGGDKVGASLRTNFYPKCPQPQLTLGLSSHSDPGGITILLPDEKVAG 247

Query: 246 LQFLNGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPD 305
           LQ   G+ W     VP AL+VN+GDQ +I SNGI+KS  H+V+ NS  ER+SLA FY P 
Sbjct: 248 LQVRRGDGWVTIKSVPNALIVNIGDQLQILSNGIYKSVEHQVIVNSGMERVSLAFFYNPR 307

Query: 306 WKKEIEPLEKLIDETRPS 315
               + P+E+L+   RP+
Sbjct: 308 SDIPVGPIEELVTANRPA 321

BLAST of Cp4.1LG18g05360 vs. ExPASy Swiss-Prot
Match: Q9FFF6 (Jasmonate-induced oxygenase 2 OS=Arabidopsis thaliana OX=3702 GN=JOX2 PE=1 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.9e-55
Identity = 117/316 (37.03%), Positives = 172/316 (54.43%), Query Frame = 0

Query: 6   PSGTVQGVASK--GEVPERYIHKESDR--GARDAPLMA-APVIDMALLSSSSKSGPE--L 65
           P   VQ +A      +P+RYI   S R     DAP     P+ID+  L S      +  +
Sbjct: 23  PIVRVQSLAESNLSSLPDRYIKPASLRPTTTEDAPTATNIPIIDLEGLFSEEGLSDDVIM 82

Query: 66  EKLRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDM 125
            ++    + WG FQVVNHG+  E +D  R   ++FF +P+  K  YS      EGYG+ +
Sbjct: 83  ARISEACRGWGFFQVVNHGVKPELMDAARENWREFFHMPVNAKETYSNSPRTYEGYGSRL 142

Query: 126 ILSNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAI 185
            +     LDW+D  +L + P     F  WP+ P   REV+ EY   +  LS +I++ ++ 
Sbjct: 143 GVEKGASLDWSDYYFLHLLPHHLKDFNKWPSFPPTIREVIDEYGEELVKLSGRIMRVLST 202

Query: 186 SLDLNEDSFIKQY-GEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGL 245
           +L L ED F + + GE +    R N+YP+C  P+L LG+ PH+D   +TILL D++V GL
Sbjct: 203 NLGLKEDKFQEAFGGENIGACLRVNYYPKCPRPELALGLSPHSDPGGMTILLPDDQVFGL 262

Query: 246 QFLNGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDW 305
           Q    + W      P A +VN+GDQ +I SN  +KS  HRV+ NS++ER+SLA FY P  
Sbjct: 263 QVRKDDTWITVKPHPHAFIVNIGDQIQILSNSTYKSVEHRVIVNSDKERVSLAFFYNPKS 322

Query: 306 KKEIEPLEKLIDETRP 314
              I+PL++L+    P
Sbjct: 323 DIPIQPLQELVSTHNP 338

BLAST of Cp4.1LG18g05360 vs. ExPASy Swiss-Prot
Match: Q94LP4 (2-oxoglutarate-dependent dioxygenase 11 OS=Oryza sativa subsp. japonica OX=39947 GN=2ODD11 PE=1 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 5.7e-55
Identity = 113/316 (35.76%), Positives = 169/316 (53.48%), Query Frame = 0

Query: 3   GTNPSGTVQGVAS-----KGEVPERYIHKESDRGA---RDAPLMAAPVIDMALLSSSSKS 62
           G+ P   VQ +A         +PERYI  E+            MA P+ID+  L     S
Sbjct: 8   GSLPVPNVQALAEICNDPDEHIPERYIRPEASSEEVINNYQGDMAIPIIDLKKLLCPQSS 67

Query: 63  GPELEKLRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGY 122
             E  KLR   Q WG F ++NHG+  E +  ++R    FF  P++ K +Y++  N +EGY
Sbjct: 68  EEECVKLRSACQYWGFFLLINHGVPDEVIANLKRDIVDFFSQPLDTKKEYTQLPNSLEGY 127

Query: 123 GNDMILSNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILK 182
           G   + S  Q LDW D LYL V+P +S   ++WPT+P  FR+ +  Y++  K L+  + +
Sbjct: 128 GQSFVFSEDQKLDWADMLYLHVHPSDSRDLRFWPTSPASFRQSIDAYSSETKSLALCLFE 187

Query: 183 AMAISLDLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEV 242
            MA ++    +S +  + EE     R  +YP CR  D V+G+ PH+D   +T+LL+   V
Sbjct: 188 FMAKAVGAKPESLLDLF-EEQPRGLRMAYYPPCRQADKVMGLSPHSDAGGLTLLLEINNV 247

Query: 243 EGLQFLNGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYL 302
           +GLQ     +WF+     GAL+ N+GD  EI SNG F+S  HR + N  +ERIS A+F+ 
Sbjct: 248 QGLQIKKDGKWFSIDAPNGALIANIGDTLEILSNGKFRSVEHRAVINPNKERISAALFHY 307

Query: 303 PDWKKEIEPLEKLIDE 311
           P     I PL + + +
Sbjct: 308 PSENMVISPLPEFVKD 322

BLAST of Cp4.1LG18g05360 vs. ExPASy Swiss-Prot
Match: Q9SRM3 (Jasmonate-induced oxygenase 1 OS=Arabidopsis thaliana OX=3702 GN=JOX1 PE=1 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 3.7e-54
Identity = 114/318 (35.85%), Positives = 172/318 (54.09%), Query Frame = 0

Query: 6   PSGTVQGVASKG--EVPERYIHKESDRGAR-------DAPLMAAPVIDMALLSSSSKSGP 65
           P   VQ +A      +P+RYI   S R          +   +  P+ID+  L S ++   
Sbjct: 52  PIVRVQSLAESNLTSLPDRYIKPPSQRPQTTIIDHQPEVADINIPIIDLDSLFSGNED-- 111

Query: 66  ELEKLRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGN 125
           + +++    + WG FQV+NHG+  E +D  R   K FF+LP+E K  YS      EGYG+
Sbjct: 112 DKKRISEACREWGFFQVINHGVKPELMDAARETWKSFFNLPVEAKEVYSNSPRTYEGYGS 171

Query: 126 DMILSNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAM 185
            + +    ILDW D  YL   P     F  WP+ P   RE+  EY   +  L  +++  +
Sbjct: 172 RLGVEKGAILDWNDYYYLHFLPLALKDFNKWPSLPSNIREMNDEYGKELVKLGGRLMTIL 231

Query: 186 AISLDLNEDSFIKQY-GEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVE 245
           + +L L  +   + + GE+V    R N+YP+C  P+L LG+ PH+D   +TILL D++V 
Sbjct: 232 SSNLGLRAEQLQEAFGGEDVGACLRVNYYPKCPQPELALGLSPHSDPGGMTILLPDDQVV 291

Query: 246 GLQFLNGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLP 305
           GLQ  +G+ W     +  A +VN+GDQ +I SN  +KS  HRV+ NSE+ER+SLA FY P
Sbjct: 292 GLQVRHGDTWITVNPLRHAFIVNIGDQIQILSNSKYKSVEHRVIVNSEKERVSLAFFYNP 351

Query: 306 DWKKEIEPLEKLIDETRP 314
                I+P+++L+  T P
Sbjct: 352 KSDIPIQPMQQLVTSTMP 367

BLAST of Cp4.1LG18g05360 vs. NCBI nr
Match: KAG6589813.1 (Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1308 bits (3384), Expect = 0.0
Identity = 654/697 (93.83%), Positives = 662/697 (94.98%), Query Frame = 0

Query: 1   MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEK 60
           MAGTNPSGTVQ VASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSS+SGPELEK
Sbjct: 20  MAGTNPSGTVQDVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSESGPELEK 79

Query: 61  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 120
           LRHGLQSWGCFQVVNHGMSAEFLDE+RRLAKQFFDLPMEEKSKYSREE+EIEGYGNDMIL
Sbjct: 80  LRHGLQSWGCFQVVNHGMSAEFLDEIRRLAKQFFDLPMEEKSKYSREEDEIEGYGNDMIL 139

Query: 121 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 180
           SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFR VLHEYTANVKLLSEKILKAMAISL
Sbjct: 140 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFRAVLHEYTANVKLLSEKILKAMAISL 199

Query: 181 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 240
           DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL
Sbjct: 200 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 259

Query: 241 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 300
           NGNEWFNAPIVPGALL+NVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDW+KE
Sbjct: 260 NGNEWFNAPIVPGALLINVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWEKE 319

Query: 301 IEPLEKLIDETRPSTTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMAGTN 360
           IEPLEKLIDETRP                   EAITEL IIAAPSL CLVAAPSSMAG+N
Sbjct: 320 IEPLEKLIDETRPRR-----------------EAITELLIIAAPSLPCLVAAPSSMAGSN 379

Query: 361 PSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIALLSASPNSGPELEKLRHGL 420
           PS SVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDI LLSASPNSGPELEKLRHGL
Sbjct: 380 PSSSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIGLLSASPNSGPELEKLRHGL 439

Query: 421 QTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILSDTQI 480
           QTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDK+LSDTQI
Sbjct: 440 QTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKVLSDTQI 499

Query: 481 LDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDLDEN 540
           LDWNHRLFLTL PEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDL+EN
Sbjct: 500 LDWNHRLFLTLLPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDLEEN 559

Query: 541 SFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKGDEW 600
           SFVEQFGERFELAARFN YPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKGDEW
Sbjct: 560 SFVEQFGERFELAARFNLYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKGDEW 619

Query: 601 FNAPILPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAVFYVPDPKKEVGPLE 660
           FNAPILPDALLVLVGDQVE        SPVHRVLTNSKSERTSLAVFYVPDPKKEVGPLE
Sbjct: 620 FNAPILPDALLVLVGDQVEIASNGIFKSPVHRVLTNSKSERTSLAVFYVPDPKKEVGPLE 679

Query: 661 TLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
           TLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI
Sbjct: 680 TLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 699

BLAST of Cp4.1LG18g05360 vs. NCBI nr
Match: KAG7023484.1 (putative 2-oxoglutarate-dependent dioxygenase ANS [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1179 bits (3050), Expect = 0.0
Identity = 588/627 (93.78%), Positives = 599/627 (95.53%), Query Frame = 0

Query: 1   MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEK 60
           MAGTNPSGTVQ VASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSS+SGPELEK
Sbjct: 20  MAGTNPSGTVQDVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSESGPELEK 79

Query: 61  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 120
           LRHGLQSWGCFQVVNHGMSAEFLDE+RRLAKQFFDLPMEEKSKYSREE+EIEGYGNDMIL
Sbjct: 80  LRHGLQSWGCFQVVNHGMSAEFLDEIRRLAKQFFDLPMEEKSKYSREEDEIEGYGNDMIL 139

Query: 121 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 180
           SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFR VLHEYTANVKLLSEKILKAMAISL
Sbjct: 140 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFRAVLHEYTANVKLLSEKILKAMAISL 199

Query: 181 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 240
           DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL
Sbjct: 200 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 259

Query: 241 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 300
           NGNEWFNAPIVPGALL+NVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDW+KE
Sbjct: 260 NGNEWFNAPIVPGALLINVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWEKE 319

Query: 301 IEPLEKLIDETRP---STTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMA 360
           IEPLEKLIDETRP    T     G +      F  EAITEL IIAAPSL CLVAAPSSMA
Sbjct: 320 IEPLEKLIDETRPRLYKTVKNFVGLY------FQREAITELLIIAAPSLPCLVAAPSSMA 379

Query: 361 GTNPSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIALLSASPNSGPELEKLR 420
           G+NPS SVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDI LLSASPNSGPELEKLR
Sbjct: 380 GSNPSSSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIGLLSASPNSGPELEKLR 439

Query: 421 HGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILSD 480
           HGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDK+LSD
Sbjct: 440 HGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKVLSD 499

Query: 481 TQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDL 540
           TQILDWNHRLFLTL PEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDL
Sbjct: 500 TQILDWNHRLFLTLLPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDL 559

Query: 541 DENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKG 600
           +ENSFVEQFGERFELAARFN YPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKG
Sbjct: 560 EENSFVEQFGERFELAARFNLYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKG 619

Query: 601 DEWFNAPILPDALLVLVGDQVESPVHR 624
           DEWFNAPILPDALLVLVGDQVE+ + R
Sbjct: 620 DEWFNAPILPDALLVLVGDQVEADIER 640

BLAST of Cp4.1LG18g05360 vs. NCBI nr
Match: XP_022134811.1 (uncharacterized protein LOC111006992 [Momordica charantia])

HSP 1 Score: 829 bits (2141), Expect = 9.53e-294
Identity = 428/711 (60.20%), Positives = 521/711 (73.28%), Query Frame = 0

Query: 1   MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEK 60
           MAGTNP+G+VQ VASKGEVPERYIHKE DRGA DAPLM APVID+ LLSS S +GPELEK
Sbjct: 1   MAGTNPTGSVQDVASKGEVPERYIHKECDRGALDAPLMEAPVIDIGLLSSPSNTGPELEK 60

Query: 61  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 120
           LRHGL SWGCFQ +NHGMS EFL+EVR++AK FF LPME+K K+SREE+++EGYGNDMI 
Sbjct: 61  LRHGLHSWGCFQAINHGMSPEFLEEVRQVAKLFFALPMEDKLKHSREEDKMEGYGNDMIF 120

Query: 121 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 180
           SNQQILDWTDRLYLTV P+ES RFKYWPTNPERFREVLHEYTANVKLLSEKILKAMA SL
Sbjct: 121 SNQQILDWTDRLYLTVCPEESRRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMARSL 180

Query: 181 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 240
           DL+E+SF+ QYG+ V+LDARFNFY RCRNP+LVLGVKPHADGSAITILLQD+EVEGLQFL
Sbjct: 181 DLDENSFLNQYGKRVQLDARFNFYLRCRNPNLVLGVKPHADGSAITILLQDKEVEGLQFL 240

Query: 241 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 300
            GNEW+NAPI+P ALLVNVGDQ EITSNGIFKS VHRVLTNSERERISLAVFYLPD +KE
Sbjct: 241 KGNEWWNAPIIPDALLVNVGDQGEITSNGIFKSRVHRVLTNSERERISLAVFYLPDPQKE 300

Query: 301 IEPLEKLIDETRPS---TTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMA 360
           IEPLEKLI+ET P    T     G +  +  +  G+   E    AA  +S  +  PS+MA
Sbjct: 301 IEPLEKLINETHPRLYRTVKNFVGLFFQYYQQ--GQRPRE----AAKIISVHLRKPSNMA 360

Query: 361 GTNPSGSVQEMGSK-----AEVPEGYIHKQRDRGAPD----APLMQASVIDIALLSASPN 420
                   +    K      + PE YI+K    G        PL +  V+D+A LS+SP 
Sbjct: 361 ELPDYFHTKTFQQKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPP 420

Query: 421 SGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEG 480
           S   LE LR  L +WGC   INH +S  FL  +  +  QFF+ PMEEK K   E   +EG
Sbjct: 421 STAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEG 480

Query: 481 YGNDKILSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKIL 540
           YG D + S+ QILDW  RL+L + PE+ R++K+WP NP  FRE + E++  ++ + E +L
Sbjct: 481 YGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFREDLHEFTIKLKQIIETVL 540

Query: 541 KAMGRSLDLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQ 600
            AM RS++++ NSF EQ G+R EL  RFNFYPPC  PDLVLG K H+D SAIT++L D +
Sbjct: 541 MAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDRE 600

Query: 601 VEGLQFLKGDEWFNAPI--LPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAV 660
           VEGLQ+ K D+WF  P+  + D+LL+ +G+Q E        S VHR +TNS+ +R S+A 
Sbjct: 601 VEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVAC 660

Query: 661 FYVPDPKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
           F  P+  +E+ P+E LIDE +PR Y+ VKN V  YF+ YQ+G+RP++  +I
Sbjct: 661 FCCPEKDREIEPIEGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI 705

BLAST of Cp4.1LG18g05360 vs. NCBI nr
Match: XP_022960722.1 (probable 2-oxoglutarate-dependent dioxygenase ANS [Cucurbita moschata])

HSP 1 Score: 696 bits (1796), Expect = 2.34e-245
Identity = 346/365 (94.79%), Positives = 350/365 (95.89%), Query Frame = 0

Query: 333 EAITELSIIAAPSLSCLVAAPSSMAGTNPSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPL 392
           EAITEL IIAAPSL CLVAAPSSMAG+NPS SVQEMGSKAEVPEGYIHKQRDRGAPDAPL
Sbjct: 89  EAITELLIIAAPSLPCLVAAPSSMAGSNPSSSVQEMGSKAEVPEGYIHKQRDRGAPDAPL 148

Query: 393 MQASVIDIALLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACP 452
           MQASVIDI LLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFF+CP
Sbjct: 149 MQASVIDIGLLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFSCP 208

Query: 453 MEEKLKYSMEEAEIEGYGNDKILSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV 512
           MEEKLKYSMEEAEIEGYGNDK+LSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV
Sbjct: 209 MEEKLKYSMEEAEIEGYGNDKVLSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV 268

Query: 513 IDEYSANMRVVCEKILKAMGRSLDLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK 572
           IDEYSANMRVVCEKILKAMGRSLDL+ENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK
Sbjct: 269 IDEYSANMRVVCEKILKAMGRSLDLEENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK 328

Query: 573 PHADVSAITVLLQDEQVEGLQFLKGDEWFNAPILPDALLVLVGDQVE--------SPVHR 632
           PHADVSAITVLLQDEQVEGLQFLKGDEWFNA ILPDALLVLVGDQVE        SPVHR
Sbjct: 329 PHADVSAITVLLQDEQVEGLQFLKGDEWFNASILPDALLVLVGDQVEIASNGIFKSPVHR 388

Query: 633 VLTNSKSERTSLAVFYVPDPKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI 689
           VLTNSKSER SLAVFYVPD KKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI
Sbjct: 389 VLTNSKSERISLAVFYVPDSKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI 448

BLAST of Cp4.1LG18g05360 vs. NCBI nr
Match: KAF9662833.1 (hypothetical protein SADUNF_Sadunf18G0095400 [Salix dunnii])

HSP 1 Score: 701 bits (1810), Expect = 1.44e-243
Identity = 367/701 (52.35%), Positives = 482/701 (68.76%), Query Frame = 0

Query: 9   TVQGVASKG-EVPERYIHKESDRGARDA--PLMAAPVIDMALLSSSSKSGPELEKLRHGL 68
           +VQ +A+ G E P +Y +K +D G  DA  PL+  PV+D+ LL+S S S  ELEKL+  L
Sbjct: 11  SVQEMAASGQEPPVKYFYKGNDGGVLDASVPLIEIPVVDLGLLTSPSTSAQELEKLKLAL 70

Query: 69  QSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILSNQQI 128
            +WGCFQV+NHGM++ FLD++R ++KQFF  PMEEK KYSRE + IEGYGNDMILS+ Q 
Sbjct: 71  TTWGCFQVINHGMTSSFLDKIREVSKQFFASPMEEKQKYSREGDTIEGYGNDMILSDHQT 130

Query: 129 LDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLDLNED 188
           +DWTDRLYLT+ P++  + K WP NP+ FRE LHEYT  ++  ++ +L+AMAISL+L E 
Sbjct: 131 VDWTDRLYLTISPEDQRKIKNWPENPKDFRETLHEYTVKLQETNDFLLRAMAISLNLEES 190

Query: 189 SFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLNGNEW 248
            F+ QYGE+  + ARFNFYP C  PD +LGVKPHAD SA+T LLQD+EVEGLQFL  NEW
Sbjct: 191 CFLDQYGEQPLVTARFNFYPPCPKPDRILGVKPHADASAVTFLLQDKEVEGLQFLKDNEW 250

Query: 249 FNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEIEPLE 308
           F  PI+P ALLVNVGDQ  I SNGIFKSPVHRV+TN++RER +LAVF +P+   EI+P +
Sbjct: 251 FRVPIIPHALLVNVGDQ--IMSNGIFKSPVHRVVTNTKRERNTLAVFCIPESDNEIKPAD 310

Query: 309 KLIDETRPSTTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMAGT---NPS 368
            LI ETRP    + +     + S +          I A +LS   +   ++  T   + S
Sbjct: 311 GLISETRPGLYRKVKD----YVSIYFQHYQQGKRPIEARNLSVWKSMDDNLFPTKIKSKS 370

Query: 369 GSVQE--MGSKAEVPEGYIHKQRDRGAPDA--PLMQASVIDIALLSASPNSGPELEKLRH 428
            SVQE  M ++ E P  Y ++    G  D+  PL++  VIDI  L++   S  E EKL  
Sbjct: 371 KSVQELVMNNELEPPGNYFYEDGVNGVLDSSLPLLEMPVIDIDRLTSPSTSREETEKLHS 430

Query: 429 GLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILSDT 488
            L + GC + INHG++  FL+ VR +  QFFA PMEEKLKYS      EGYGND ILS+ 
Sbjct: 431 ALISCGCFMSINHGITGVFLDQVRSLTAQFFALPMEEKLKYSRASDSTEGYGNDMILSED 490

Query: 489 QILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDLD 548
           QILDW  RL   + PE+ R++K WP  P  FRE++ EY+  ++V+ E +LKAM RSL+L+
Sbjct: 491 QILDWTDRLCHIVSPEDKRQLKLWPEKPEIFREILQEYTTKLKVIVEVVLKAMARSLNLE 550

Query: 549 ENSFVEQFGERFELA-ARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKG 608
           +N F++++GE+  L  ARFNF+PPCP PD  LG KPHAD SAITV+LQD +VEGLQFL  
Sbjct: 551 DNCFLDKYGEKRALMMARFNFFPPCPRPDRSLGQKPHADGSAITVVLQDREVEGLQFLND 610

Query: 609 DEWFNAPI-LPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAVFYVPDPKKEV 668
           D+WF  PI LP ALL+ VGD  E        SPVHRV+TNS+ ERTS+A+F  PDP  ++
Sbjct: 611 DQWFRVPIQLPHALLINVGDHTEVMSNGFFKSPVHRVVTNSERERTSVALFCTPDPDNDI 670

Query: 669 GPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
            P++ ++ ET+PR YK +++    YF+YYQ+GRRPIE+ RI
Sbjct: 671 EPVDGVVSETRPRLYKKMQDYQSKYFQYYQKGRRPIESVRI 705

BLAST of Cp4.1LG18g05360 vs. ExPASy TrEMBL
Match: A0A6J1BYU1 (uncharacterized protein LOC111006992 OS=Momordica charantia OX=3673 GN=LOC111006992 PE=4 SV=1)

HSP 1 Score: 829 bits (2141), Expect = 4.62e-294
Identity = 428/711 (60.20%), Positives = 521/711 (73.28%), Query Frame = 0

Query: 1   MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEK 60
           MAGTNP+G+VQ VASKGEVPERYIHKE DRGA DAPLM APVID+ LLSS S +GPELEK
Sbjct: 1   MAGTNPTGSVQDVASKGEVPERYIHKECDRGALDAPLMEAPVIDIGLLSSPSNTGPELEK 60

Query: 61  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 120
           LRHGL SWGCFQ +NHGMS EFL+EVR++AK FF LPME+K K+SREE+++EGYGNDMI 
Sbjct: 61  LRHGLHSWGCFQAINHGMSPEFLEEVRQVAKLFFALPMEDKLKHSREEDKMEGYGNDMIF 120

Query: 121 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 180
           SNQQILDWTDRLYLTV P+ES RFKYWPTNPERFREVLHEYTANVKLLSEKILKAMA SL
Sbjct: 121 SNQQILDWTDRLYLTVCPEESRRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMARSL 180

Query: 181 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 240
           DL+E+SF+ QYG+ V+LDARFNFY RCRNP+LVLGVKPHADGSAITILLQD+EVEGLQFL
Sbjct: 181 DLDENSFLNQYGKRVQLDARFNFYLRCRNPNLVLGVKPHADGSAITILLQDKEVEGLQFL 240

Query: 241 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 300
            GNEW+NAPI+P ALLVNVGDQ EITSNGIFKS VHRVLTNSERERISLAVFYLPD +KE
Sbjct: 241 KGNEWWNAPIIPDALLVNVGDQGEITSNGIFKSRVHRVLTNSERERISLAVFYLPDPQKE 300

Query: 301 IEPLEKLIDETRPS---TTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMA 360
           IEPLEKLI+ET P    T     G +  +  +  G+   E    AA  +S  +  PS+MA
Sbjct: 301 IEPLEKLINETHPRLYRTVKNFVGLFFQYYQQ--GQRPRE----AAKIISVHLRKPSNMA 360

Query: 361 GTNPSGSVQEMGSK-----AEVPEGYIHKQRDRGAPD----APLMQASVIDIALLSASPN 420
                   +    K      + PE YI+K    G        PL +  V+D+A LS+SP 
Sbjct: 361 ELPDYFHTKTFQQKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPP 420

Query: 421 SGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEG 480
           S   LE LR  L +WGC   INH +S  FL  +  +  QFF+ PMEEK K   E   +EG
Sbjct: 421 STAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEG 480

Query: 481 YGNDKILSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKIL 540
           YG D + S+ QILDW  RL+L + PE+ R++K+WP NP  FRE + E++  ++ + E +L
Sbjct: 481 YGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFREDLHEFTIKLKQIIETVL 540

Query: 541 KAMGRSLDLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQ 600
            AM RS++++ NSF EQ G+R EL  RFNFYPPC  PDLVLG K H+D SAIT++L D +
Sbjct: 541 MAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDRE 600

Query: 601 VEGLQFLKGDEWFNAPI--LPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAV 660
           VEGLQ+ K D+WF  P+  + D+LL+ +G+Q E        S VHR +TNS+ +R S+A 
Sbjct: 601 VEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVAC 660

Query: 661 FYVPDPKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
           F  P+  +E+ P+E LIDE +PR Y+ VKN V  YF+ YQ+G+RP++  +I
Sbjct: 661 FCCPEKDREIEPIEGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI 705

BLAST of Cp4.1LG18g05360 vs. ExPASy TrEMBL
Match: A0A6J1H881 (probable 2-oxoglutarate-dependent dioxygenase ANS OS=Cucurbita moschata OX=3662 GN=LOC111461439 PE=3 SV=1)

HSP 1 Score: 696 bits (1796), Expect = 1.13e-245
Identity = 346/365 (94.79%), Positives = 350/365 (95.89%), Query Frame = 0

Query: 333 EAITELSIIAAPSLSCLVAAPSSMAGTNPSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPL 392
           EAITEL IIAAPSL CLVAAPSSMAG+NPS SVQEMGSKAEVPEGYIHKQRDRGAPDAPL
Sbjct: 89  EAITELLIIAAPSLPCLVAAPSSMAGSNPSSSVQEMGSKAEVPEGYIHKQRDRGAPDAPL 148

Query: 393 MQASVIDIALLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACP 452
           MQASVIDI LLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFF+CP
Sbjct: 149 MQASVIDIGLLSASPNSGPELEKLRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFSCP 208

Query: 453 MEEKLKYSMEEAEIEGYGNDKILSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV 512
           MEEKLKYSMEEAEIEGYGNDK+LSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV
Sbjct: 209 MEEKLKYSMEEAEIEGYGNDKVLSDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREV 268

Query: 513 IDEYSANMRVVCEKILKAMGRSLDLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK 572
           IDEYSANMRVVCEKILKAMGRSLDL+ENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK
Sbjct: 269 IDEYSANMRVVCEKILKAMGRSLDLEENSFVEQFGERFELAARFNFYPPCPNPDLVLGAK 328

Query: 573 PHADVSAITVLLQDEQVEGLQFLKGDEWFNAPILPDALLVLVGDQVE--------SPVHR 632
           PHADVSAITVLLQDEQVEGLQFLKGDEWFNA ILPDALLVLVGDQVE        SPVHR
Sbjct: 329 PHADVSAITVLLQDEQVEGLQFLKGDEWFNASILPDALLVLVGDQVEIASNGIFKSPVHR 388

Query: 633 VLTNSKSERTSLAVFYVPDPKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI 689
           VLTNSKSER SLAVFYVPD KKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI
Sbjct: 389 VLTNSKSERISLAVFYVPDSKKEVGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPI 448

BLAST of Cp4.1LG18g05360 vs. ExPASy TrEMBL
Match: A0A6J1JBA9 (probable 2-oxoglutarate-dependent dioxygenase ANS OS=Cucurbita maxima OX=3661 GN=LOC111485225 PE=3 SV=1)

HSP 1 Score: 666 bits (1719), Expect = 8.92e-236
Identity = 326/342 (95.32%), Positives = 332/342 (97.08%), Query Frame = 0

Query: 356 MAGTNPSGSVQEMGSKAEVPEGYIHKQRDRGAPDAPLMQASVIDIALLSASPNSGPELEK 415
           MAGTNPSGSVQEMGSKAEVPEGYIHK+RDRGAPDAPLMQASVIDIALLSASPNSGPELEK
Sbjct: 1   MAGTNPSGSVQEMGSKAEVPEGYIHKERDRGAPDAPLMQASVIDIALLSASPNSGPELEK 60

Query: 416 LRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKIL 475
           LRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFA PMEEKLKYSMEEAEIEGYGNDK+L
Sbjct: 61  LRHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFARPMEEKLKYSMEEAEIEGYGNDKVL 120

Query: 476 SDTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSL 535
           SDTQILDWNHRLFLTL+PEESRRVKHWPSNPHRFREVIDEYSANMRVVCE+ILKAMGRSL
Sbjct: 121 SDTQILDWNHRLFLTLYPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEEILKAMGRSL 180

Query: 536 DLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFL 595
           DLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFL
Sbjct: 181 DLDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFL 240

Query: 596 KGDEWFNAPILPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAVFYVPDPKKE 655
           KGDEWFNAPILPDALLVLVGDQVE        SPVHRVLTN KSERTSLAVFYVPDPKKE
Sbjct: 241 KGDEWFNAPILPDALLVLVGDQVEIASNGIFKSPVHRVLTNLKSERTSLAVFYVPDPKKE 300

Query: 656 VGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
           +GPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETA+I
Sbjct: 301 IGPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETAKI 342

BLAST of Cp4.1LG18g05360 vs. ExPASy TrEMBL
Match: A0A6N2N704 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS447360 PE=4 SV=1)

HSP 1 Score: 645 bits (1663), Expect = 1.31e-222
Identity = 334/690 (48.41%), Positives = 450/690 (65.22%), Query Frame = 0

Query: 7   SGTVQGVASKG-EVPERYIHKESDRGARDA--PLMAAPVIDMALLSSSSKSGPELEKLRH 66
           S +VQ +AS G E P +Y +K +D G  D+  PL+  PV+D+ LL S S S  EL+KL+ 
Sbjct: 9   SNSVQEMASSGQEPPVKYFYKGNDGGVLDSSVPLIEIPVVDLGLLPSPSTSAQELDKLKL 68

Query: 67  GLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILSNQ 126
            L +WGCFQV+NHGM++ FLD++R ++KQFF  PMEEK KYSRE + IEGYGNDMILS+ 
Sbjct: 69  ALSTWGCFQVINHGMTSSFLDKIREVSKQFFASPMEEKQKYSREADSIEGYGNDMILSDH 128

Query: 127 QILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLDLN 186
           Q +DWTDRLYLT+ P++  + K+WP NPE FRE LHEYT  ++  ++ +L+AMA SL+L 
Sbjct: 129 QTVDWTDRLYLTISPEDQRKIKFWPENPEDFRETLHEYTVKLQETNDFLLRAMARSLNLE 188

Query: 187 EDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLNGN 246
           E  F+ QYGE+  + ARFNFYP C  P  +LGVKPHAD SA+T LLQD+EVEGLQFL  N
Sbjct: 189 ESCFLDQYGEQPLVTARFNFYPPCPKPGRILGVKPHADASAVTFLLQDKEVEGLQFLKDN 248

Query: 247 EWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEIEP 306
           EWF                  I SNGIFKSPVHRV+TN++RER +LAVF +P+   EI+P
Sbjct: 249 EWFR-----------------IMSNGIFKSPVHRVVTNTKRERNTLAVFCIPESDNEIKP 308

Query: 307 LEKLIDETRPSTTSRARGPWRPHESKFGGEAITELSIIAAPSLSCLVAAPSSMAGTNPSG 366
            + LI E+RPS   + +     +   +                           G  P  
Sbjct: 309 ADGLISESRPSLYRKVKDYVSIYFQHY-------------------------QQGKRPIE 368

Query: 367 SVQEMGSKAEVPEGYIHKQRDRGAPDA--PLMQASVIDIALLSASPNSGPELEKLRHGLQ 426
           ++  M ++ E P  Y ++    G  D+  PL++  +IDI  L++   S  E+EKL   L 
Sbjct: 369 ALV-MNNELEPPGNYFYEDGVNGVLDSSLPLLEMPIIDIGRLTSPSTSRVEIEKLHSALI 428

Query: 427 TWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILSDTQIL 486
           + GC + ++                QFFA PMEEKLKYS      EGYGND ILS+ QIL
Sbjct: 429 SCGCFMLLD---------------AQFFAFPMEEKLKYSRSADSTEGYGNDTILSEDQIL 488

Query: 487 DWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLDLDENS 546
           DW  RL+  + PE+ R+++ WP  P  FRE++ EY+  ++V+ E +LKAM RSL+L+EN 
Sbjct: 489 DWTDRLYHIVSPEDKRQLQLWPEKPEIFREILQEYTTKLKVIVEVVLKAMARSLNLEENC 548

Query: 547 FVEQFGE-RFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLKGDEW 606
           F++++GE R  + ARFNF+PPCP PD  LG KPHAD SAIT++LQD++VEGLQFL  D+W
Sbjct: 549 FLDKYGEERALMMARFNFFPPCPRPDRSLGQKPHADGSAITIVLQDKEVEGLQFLNDDQW 608

Query: 607 FNAPI-LPDALLVLVGDQVESPVHRVLTNSKSERTSLAVFYVPDPKKEVGPLETLIDETQ 666
           F  PI LP ALL+ VGD +ESPVHRV+TNS+ ERTS+AVF  PDP  ++ P++ ++ ET+
Sbjct: 609 FRVPIQLPHALLINVGDHIESPVHRVVTNSERERTSVAVFCTPDPDNDIEPVDGVVSETR 640

Query: 667 PRSYKPVKNLVDLYFEYYQRGRRPIETARI 689
           PR YK +++    Y +YYQ+GRRPIE+ +I
Sbjct: 669 PRLYKKMQDYQSKYLQYYQKGRRPIESVKI 640

BLAST of Cp4.1LG18g05360 vs. ExPASy TrEMBL
Match: A0A6J1H9T6 (protein SRG1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461440 PE=3 SV=1)

HSP 1 Score: 622 bits (1605), Expect = 3.64e-218
Identity = 306/313 (97.76%), Positives = 311/313 (99.36%), Query Frame = 0

Query: 1   MAGTNPSGTVQGVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSKSGPELEK 60
           MAGTNPSGTVQ VASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSS+SGPELEK
Sbjct: 20  MAGTNPSGTVQDVASKGEVPERYIHKESDRGARDAPLMAAPVIDMALLSSSSESGPELEK 79

Query: 61  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 120
           LRHGLQSWGCFQVVNHGMSAEFLDE+RRLAKQFFDLPMEEKSKYSREE+EIEGYGNDMIL
Sbjct: 80  LRHGLQSWGCFQVVNHGMSAEFLDEIRRLAKQFFDLPMEEKSKYSREEDEIEGYGNDMIL 139

Query: 121 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 180
           SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFR VLHEYTANVKLLSEKILKAMAISL
Sbjct: 140 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFRAVLHEYTANVKLLSEKILKAMAISL 199

Query: 181 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 240
           DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL
Sbjct: 200 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 259

Query: 241 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 300
           NGNEWFNAPIVPGALL+NVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDW+KE
Sbjct: 260 NGNEWFNAPIVPGALLINVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWEKE 319

Query: 301 IEPLEKLIDETRP 313
           IEPLEKLIDETRP
Sbjct: 320 IEPLEKLIDETRP 332

BLAST of Cp4.1LG18g05360 vs. TAIR 10
Match: AT5G20400.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 347.1 bits (889), Expect = 3.4e-95
Identity = 169/312 (54.17%), Positives = 219/312 (70.19%), Query Frame = 0

Query: 9   TVQGVASKGE-VPERYIHKESDRGA-----RDAPLMAAPVIDMALLSSSSKSG-PELEKL 68
           TVQ V + GE +PERY+H  +  G         P M  P ID+ LL SSS++G  EL KL
Sbjct: 8   TVQEVVAAGEGLPERYLHAPTGDGEVQPLNAAVPEMDIPAIDLNLLLSSSEAGQQELSKL 67

Query: 69  RHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILS 128
              L +WG  QV+NHG++  FLD++ +L K+FF LP EEK K +RE + I+GYGNDMIL 
Sbjct: 68  HSALSTWGVVQVMNHGITKAFLDKIYKLTKEFFALPTEEKQKCAREIDSIQGYGNDMILW 127

Query: 129 NQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLD 188
           + Q+LDW DRLY+T YP++  +  +WP  P  FRE LHEYT   +++ E+  KAMA SL+
Sbjct: 128 DDQVLDWIDRLYITTYPEDQRQLNFWPEVPLGFRETLHEYTMKQRIVIEQFFKAMARSLE 187

Query: 189 LNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLN 248
           L E+SF+  YGE   LD RFN YP C +PD V+GVKPHADGSAIT+LL D++V GLQF  
Sbjct: 188 LEENSFLDMYGESATLDTRFNMYPPCPSPDKVIGVKPHADGSAITLLLPDKDVGGLQFQK 247

Query: 249 GNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEI 308
             +W+ APIVP  +L+NVGDQ EI SNGI+KSPVHRV+TN E+ERIS+A F +P   KEI
Sbjct: 248 DGKWYKAPIVPDTILINVGDQMEIMSNGIYKSPVHRVVTNREKERISVATFCIPGADKEI 307

Query: 309 EPLEKLIDETRP 314
           +P+ +L+ E RP
Sbjct: 308 QPVNELVSEARP 319

BLAST of Cp4.1LG18g05360 vs. TAIR 10
Match: AT1G49390.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 340.5 bits (872), Expect = 3.1e-93
Identity = 168/341 (49.27%), Positives = 225/341 (65.98%), Query Frame = 0

Query: 364 SVQEMGSKAE-VPEGYIHKQRDRGAPD-----APLMQASVIDIALL-SASPNSGPELEKL 423
           +VQE+ +  + +PE Y+H     G         P M    ID++LL S+S +   E++KL
Sbjct: 8   TVQEVVAAGQGLPERYLHAPTGEGESQPLNGAVPEMDIPAIDLSLLFSSSVDGQEEMKKL 67

Query: 424 RHGLQTWGCILGINHGMSPEFLEDVRHVIKQFFACPMEEKLKYSMEEAEIEGYGNDKILS 483
              L TWG +  +NHG++  FL+ +  + KQFFA P EEK K + E   I+GYGND ILS
Sbjct: 68  HSALSTWGVVQVMNHGITEAFLDKIYKLTKQFFALPTEEKHKCARETGNIQGYGNDMILS 127

Query: 484 DTQILDWNHRLFLTLFPEESRRVKHWPSNPHRFREVIDEYSANMRVVCEKILKAMGRSLD 543
           D Q+LDW  RLFLT +PE+ R++K WP  P  F E +DEY+   RV+ EK  KAM RSL+
Sbjct: 128 DNQVLDWIDRLFLTTYPEDKRQLKFWPQVPVGFSETLDEYTMKQRVLIEKFFKAMARSLE 187

Query: 544 LDENSFVEQFGERFELAARFNFYPPCPNPDLVLGAKPHADVSAITVLLQDEQVEGLQFLK 603
           L+EN F+E +GE   + +RFNF+PPCP PD V+G KPHAD SAIT+LL D+ VEGLQFLK
Sbjct: 188 LEENCFLEMYGENAVMNSRFNFFPPCPRPDKVIGIKPHADGSAITLLLPDKDVEGLQFLK 247

Query: 604 GDEWFNAPILPDALLVLVGDQVE--------SPVHRVLTNSKSERTSLAVFYVPDPKKEV 663
             +W+ API+PD +L+ +GDQ+E        SPVHRV+TN + ER S+A F VP   KE+
Sbjct: 248 DGKWYKAPIVPDTILITLGDQMEIMSNGIYKSPVHRVVTNREKERISVATFCVPGLDKEI 307

Query: 664 GPLETLIDETQPRSYKPVKNLVDLYFEYYQRGRRPIETARI 690
            P + L+ E +PR YK V   VDL+++YYQ+GRR IE A I
Sbjct: 308 HPADGLVTEARPRLYKTVTKYVDLHYKYYQQGRRTIEAALI 348

BLAST of Cp4.1LG18g05360 vs. TAIR 10
Match: AT5G20550.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 330.9 bits (847), Expect = 2.5e-90
Identity = 163/313 (52.08%), Positives = 213/313 (68.05%), Query Frame = 0

Query: 9   TVQGVASKGE-VPERYIHKES--DRGAR---DAPLMAAPVIDMALLSSSSKSG-PELEKL 68
           TVQ V + GE +PERY+   +  D G       P+M  P ID++LL S S  G  EL KL
Sbjct: 8   TVQEVVAAGEGIPERYLQPPAVDDNGQHLNAAVPVMDIPAIDLSLLLSPSDDGREELSKL 67

Query: 69  RHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILS 128
              L +WG  QV+NHG++   LD++ +L K+F  LP EEK KY+RE   I+GYGNDMIL 
Sbjct: 68  HSALSTWGVVQVINHGITKALLDKIYKLTKEFCALPSEEKQKYAREIGSIQGYGNDMILW 127

Query: 129 NQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLD 188
           + Q+LDW DRLY+T YP++  + K+WP  P  FRE LHEYT    L+  ++ KAMAISL+
Sbjct: 128 DDQVLDWIDRLYITTYPEDQRQLKFWPDVPVGFRETLHEYTMKQHLVFNQVFKAMAISLE 187

Query: 189 LNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFLN 248
           L E+ F+   GE   +D RFN YP C  PD V+GV+PHAD SA T+LL D+ VEGLQFL 
Sbjct: 188 LEENCFLDMCGENATMDTRFNMYPPCPRPDKVIGVRPHADKSAFTLLLPDKNVEGLQFLK 247

Query: 249 GNEWFNAPIVPG-ALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 308
             +W+ AP+V    +L+NVGDQ EI SNGI+KSPVHRV+TN+E+ERIS+A F +P   KE
Sbjct: 248 DGKWYKAPVVASDTILINVGDQMEIMSNGIYKSPVHRVVTNTEKERISVATFCIPGADKE 307

Query: 309 IEPLEKLIDETRP 314
           I+P++ L+ E RP
Sbjct: 308 IQPVDGLVSEARP 320

BLAST of Cp4.1LG18g05360 vs. TAIR 10
Match: AT5G54000.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 330.5 bits (846), Expect = 3.3e-90
Identity = 159/313 (50.80%), Positives = 211/313 (67.41%), Query Frame = 0

Query: 9   TVQGVASKGE-VPERYIHKESDRGARDAPL------MAAPVIDMALLSSSSKSG-PELEK 68
           TVQ V + GE +PERY++  +  G  D P       M   +ID+ LL SSS  G  EL K
Sbjct: 8   TVQEVVAAGEKLPERYLYTPTGDGEGDQPFNGLLPEMKISIIDLNLLFSSSDDGREELSK 67

Query: 69  LRHGLQSWGCFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMIL 128
           L   + +WG  QV+NHG+S   LD++  L KQFF LP +EK KY+RE +  +G+GNDMIL
Sbjct: 68  LHSAISTWGVVQVMNHGISEALLDKIHELTKQFFVLPTKEKQKYAREISSFQGFGNDMIL 127

Query: 129 SNQQILDWTDRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISL 188
           S+ Q+LDW DRLYL  YP++  + K+WP NP  FRE LHEYT   +L+ EK  KA+A SL
Sbjct: 128 SDDQVLDWVDRLYLITYPEDQRQLKFWPENPSGFRETLHEYTMKQQLVVEKFFKALARSL 187

Query: 189 DLNEDSFIKQYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDEEVEGLQFL 248
           +L ++ F++ +GE   L+ RFN YP C  PD VLG+KPH+DGSA T++L D+ VEGLQFL
Sbjct: 188 ELEDNCFLEMHGENATLETRFNIYPPCPRPDKVLGLKPHSDGSAFTLILPDKNVEGLQFL 247

Query: 249 NGNEWFNAPIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKE 308
              +W+ A I+P  +L+NVGD  E+ SNGI+KSPVHRV+ N ++ERI +A F   D  KE
Sbjct: 248 KDGKWYKASILPHTILINVGDTMEVMSNGIYKSPVHRVVLNGKKERIYVATFCNADEDKE 307

Query: 309 IEPLEKLIDETRP 314
           I+PL  L+ E RP
Sbjct: 308 IQPLNGLVSEARP 320

BLAST of Cp4.1LG18g05360 vs. TAIR 10
Match: AT3G21420.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 261.2 bits (666), Expect = 2.4e-69
Identity = 129/306 (42.16%), Positives = 189/306 (61.76%), Query Frame = 0

Query: 18  EVPERYIHKESDRGARDAPLMA------APVIDMALLSSSSKSG--PELEKLRHGLQSWG 77
           +VPER+I +E +RG   + L         PVID++ LS         E+ KL    + WG
Sbjct: 27  KVPERFIREEYERGVVVSSLKTHHLHHQIPVIDLSKLSKPDNDDFFFEILKLSQACEDWG 86

Query: 78  CFQVVNHGMSAEFLDEVRRLAKQFFDLPMEEKSKYSREENEIEGYGNDMILSNQQILDWT 137
            FQV+NHG+  E ++++  +A +FFD+P+EEK KY  E   ++GYG   I S  Q LDW 
Sbjct: 87  FFQVINHGIEVEVVEDIEEVASEFFDMPLEEKKKYPMEPGTVQGYGQAFIFSEDQKLDWC 146

Query: 138 DRLYLTVYPKESHRFKYWPTNPERFREVLHEYTANVKLLSEKILKAMAISLDLNEDSFIK 197
           +   L V+P +    K WP+ P RF E L  Y+  ++ L +++LK +AISL L E+ F +
Sbjct: 147 NMFALGVHPPQIRNPKLWPSKPARFSESLEGYSKEIRELCKRLLKYIAISLGLKEERFEE 206

Query: 198 QYGEEVKLDARFNFYPRCRNPDLVLGVKPHADGSAITILLQDE-EVEGLQFLNGNEWFNA 257
            +GE V+   R N+YP C +PDLVLG+ PH+DGSA+T+L Q +    GLQ L  N W   
Sbjct: 207 MFGEAVQA-VRMNYYPPCSSPDLVLGLSPHSDGSALTVLQQSKNSCVGLQILKDNTWVPV 266

Query: 258 PIVPGALLVNVGDQAEITSNGIFKSPVHRVLTNSERERISLAVFYLPDWKKEIEPLEKLI 314
             +P AL++N+GD  E+ SNG +KS  HR +TN E+ER+++  FY P+++ EIEP+ +L+
Sbjct: 267 KPLPNALVINIGDTIEVLSNGKYKSVEHRAVTNREKERLTIVTFYAPNYEVEIEPMSELV 326

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q39224	1.2e-57	36.51	Protein SRG1 OS=Arabidopsis thaliana OX=3702 GN=SRG1 PE=2 SV=1
O80449	2.3e-56	37.42	Jasmonate-induced oxygenase 4 OS=Arabidopsis thaliana OX=3702 GN=JOX4 PE=1 SV=1
Q9FFF6	1.9e-55	37.03	Jasmonate-induced oxygenase 2 OS=Arabidopsis thaliana OX=3702 GN=JOX2 PE=1 SV=1
Q94LP4	5.7e-55	35.76	2-oxoglutarate-dependent dioxygenase 11 OS=Oryza sativa subsp. japonica OX=39947...
Q9SRM3	3.7e-54	35.85	Jasmonate-induced oxygenase 1 OS=Arabidopsis thaliana OX=3702 GN=JOX1 PE=1 SV=1
Match Name	E-value	Identity (%)	Description
KAG6589813.1	0.0	93.83	Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia]
KAG7023484.1	0.0	93.78	putative 2-oxoglutarate-dependent dioxygenase ANS [Cucurbita argyrosperma subsp....
XP_022134811.1	9.53e-294	60.20	uncharacterized protein LOC111006992 [Momordica charantia]
XP_022960722.1	2.34e-245	94.79	probable 2-oxoglutarate-dependent dioxygenase ANS [Cucurbita moschata]
KAF9662833.1	1.44e-243	52.35	hypothetical protein SADUNF_Sadunf18G0095400 [Salix dunnii]
Match Name	E-value	Identity (%)	Description
A0A6J1BYU1	4.62e-294	60.20	uncharacterized protein LOC111006992 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A6J1H881	1.13e-245	94.79	probable 2-oxoglutarate-dependent dioxygenase ANS OS=Cucurbita moschata OX=3662 ...
A0A6J1JBA9	8.92e-236	95.32	probable 2-oxoglutarate-dependent dioxygenase ANS OS=Cucurbita maxima OX=3661 GN...
A0A6N2N704	1.31e-222	48.41	Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS447360 PE=4 SV=...
A0A6J1H9T6	3.64e-218	97.76	protein SRG1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461440 PE=3 ...
Match Name	E-value	Identity (%)	Description
AT5G20400.1	3.4e-95	54.17	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G49390.1	3.1e-93	49.27	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT5G20550.1	2.5e-90	52.08	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT5G54000.1	3.3e-90	50.80	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT3G21420.1	2.4e-69	42.16	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR027443	Isopenicillin N synthase-like superfamily	GENE3D	2.60.120.330	-	coord: 9..320; e-value: 2.0E-92; score: 312.0
IPR027443	Isopenicillin N synthase-like superfamily	GENE3D	2.60.120.330	-	coord: 364..677; e-value: 6.6E-79; score: 267.6
IPR026992	Non-haem dioxygenase N-terminal domain	PFAM	PF14226	DIOX_N	coord: 397..504; e-value: 2.2E-18; score: 67.1
IPR026992	Non-haem dioxygenase N-terminal domain	PFAM	PF14226	DIOX_N	coord: 41..148; e-value: 2.2E-25; score: 89.7
IPR044861	Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain	PFAM	PF03171	2OG-FeII_Oxy	coord: 201..294; e-value: 2.7E-21; score: 75.9
IPR044861	Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain	PFAM	PF03171	2OG-FeII_Oxy	coord: 555..640; e-value: 9.4E-17; score: 61.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..31
None	No IPR available	PANTHER	PTHR47991:SF25	SUBFAMILY NOT NAMED	coord: 363..686
None	No IPR available	PANTHER	PTHR47991:SF25	SUBFAMILY NOT NAMED	coord: 9..313
None	No IPR available	PANTHER	PTHR47991	OXOGLUTARATE/IRON-DEPENDENT DIOXYGENASE	coord: 9..313
None	No IPR available	PANTHER	PTHR47991	OXOGLUTARATE/IRON-DEPENDENT DIOXYGENASE	coord: 363..686
None	No IPR available	SUPERFAMILY	51197	Clavaminate synthase-like	coord: 371..667
None	No IPR available	SUPERFAMILY	51197	Clavaminate synthase-like	coord: 15..314
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROSITE	PS51471	FE2OG_OXY	coord: 549..642; score: 12.743742
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROSITE	PS51471	FE2OG_OXY	coord: 195..295; score: 13.049004

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG18g05360.1	Cp4.1LG18g05360.1	mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0051213	dioxygenase activity
molecular_function	GO:0046872	metal ion binding