Cp4.1LG13g06900 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG13g06900
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCp4.1LG13 : 4220130 .. 4229094 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGGATAATTGGACGAGTAAAAACGTCCACGTAATCGGTTGGGTCCCACAGGTACACGTCAGCAACTAACGAAATAGAGCCTAACCCTAACCTGCAACTTTCACCATCTTCGTCTTCTTCCTCTTCATAACCCTAATTTCAATCCACTTCTCACACTCTCATCACAAAACATGCTTCTGTAATTCTTTCTTCAAAGGCATTTGACCTCTACAGATGCGAAGCTATGTCTCTGGAAGCTTCTCTTGAACGACGAAAGCAGCCCCAAGCTCCCGGGACTGGCAATGGAAATGGCGTCGTCTCGCCCAGTGCACACTCTTTCTCAACCCACAGGCTTCGTCTTCAACCCAAGGAAGATCACAAGTCGGAGAGCTATGAGGACCTTCAATTGGAATTCAGCCCGCTCCTCTTCAGCATGCTGGAAAGGCATTTGCCTCCGAGCATGCTCAATATGGCACGCGACCTTAAGCTTCAGTATATGAGGGACATTCTACTCCGATATGCTCCAGAGGGCGAACGCAACCGCGTAAGTGTTTTAAATTGTTTTATCTCTATATCTATGGTGAATTTCAATACATGTTTGCTTGGTTTGGCTGGTGAATGGAGGAGGAGGAAGAGAAGGTAGGTTATATTTTCTAGAAATTGTTTTATAGGTTAAAATCAATGATAGCAAATGCCGCATGATCTTATGAATTGTTAGTTGGACGTTGTTTGCAGGCTAAGGATTAAGATTTTTATTTTCATTGTTATGTGATGGTGAATCAGAAGGCGGGAACATAGTTATTTGAATATGTAGATTTATTAATAGCATGACTAATTAGACTTTGATGTTAATATGCCAAGTAAAAATGAGGATAGTATTTGTCAAATTTCTCTTATGCTCTTGATTTTTGACGCAATTTTGATGAAAGCAAGGAGGAAGTGTCAATGTTTGTACTTCACTACTCGTTGGGGTTTCTTTCTGTTCTTGAGGAATTAATGCATTTGAGCTTCTGAATTTTGGAGGCTTTGCATTCAAATTGCAGTCTGGGAAGGCAAGGTCTATTCCTGTATCTTGTTTTTATATTTGATTACTTCCATGTTTCTTGGAAGTATTTCCAGAACAGTGGATTAGATTTTATTGGGACAAAAATGGAGAAAGTATGTGAAGTATGAAGAAAGATAAATATTCCTTCCAAGAATCGAAAGCCCCGATAGAGCTTACTAATAGGTGCTCTAGGTAGATGATTAGTAGCAAGGGAATCATTGCTAGAAGTTAAGAAGTGAACATCAAGTGGATGCTGCAAAGCAGAGTCAGTCCTAAAACTCAGTCCAAAGTTCTATTTAACTCCCATACATTCTCTCATTTCTTTCAAGCTAAAGGTTCCAAATCAGAGTGTTAGAAACAATGATAACAGTTTGACATGCACAGTAACCCAGCATTGCCCGGAGAGAACTTGGTGAGGAATTCATTCAGAGTTCTTTGGAAACACCCCAGCTACTATCTTAAGAACAGAGAATGATTTCTCAGTTATAAAATGCTTGGCTTTCTAAGTTTTATAAACAGTTTATGAACCGTCGTTTTTTTAGTTACAACAATTTAATAAAAGCATGAAATTAATGTTAAAATTTACCAAACATACTTTAAAATACAAATTGTTTCTTAGAAATGCAAATAATAGGAAATATGGAATGATTTAAAAAAATATCTTAATTTTAAAAAATATTTACTTTTAAAGTATAGAATGGAAAAGAGGAAATATATATAAAGTGTGTTGTATTTCTTTCTTCTTTTATTTTATCTTAAAAACAGATCACAAGATAAGAATGATTCACAAAAAACAAATAGTGCTACCCCTTTTTGTTGTAATTTTTCTTCCAGATAAGTATCATATTCAACTCGGCTTAATTTTATATCATGATAAATAATGTCCCATGTCCATTTTAGTGCTTTCATGGTTTAAATTTCATACCCTTTTAACTCATATCATTTCAATTTGGAAGATCTTTTTGCTACTTCCAAACTTGGTGCTTGGATTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTTGGTTAATCATACCATCTGCAAATTTTAACTTAAAAGAACTCTACTTGTTCCATATTTTGAATATATGAAACACGACTTTCACTGAGATCCAATAACAAGAAACAGGAAAGACTGCAATTAAAAGTACATCACGAGCGTGAAAATACCAACAGGGTAATTTTTAGCTTGGACTCTAGGTAAACTCCCATGTCCTAGCCAACTATTCTGTCTTAGCTTCTGATACATTTTTATTTGAGTGCGGCATTTTGGTGTGTTGCATGCTGGCAAGCTGATTTCACTTCTGCGTGTGCCCATTAAGGTTCACAATTTTTCAAATACGTTTTCTGTGAACACTCTAACGTGTTTGTTATTTTTCTTGCTTGTAAGAGTCATTCTCTAAGCCTTTTCTTCATTTGTAGCTCCTTTTTCTTCATTTCTTTTGTCAAAAAGCAATAATTCATCTATTATTTTGTTTGTTTACTTTTGTGCCTAAGAAACACAAACTCTAGGTTTTTTCTTGAGTGCATTTAATTGAAATATTATGTTTTCATTCATTATCTCTTGACAATATCAAAGAGCATCAGAGTAGCTTTCCAAATTAAGACATCTATATGCCTACTATTTATAGCTTATTGCATTTGATTTAAAAATCCTGGTCTTTGAAGCTGTAAACTTTGATGCCCTTATGGTCATGAGTGCAAACCTTGGTAGTTATGCACCTTGAATGTAAAATATCCTCAAGTCTTTTCTTCTCTGACTCTAACGTTGTGAGGTTAGGTAGTTATATTGTAAGATAATCAATGTGTGTCTAAATTGGTCTGAACAATGTGTGTCTAAATTGGTCTGAACACTTTTCAATTATGATTGCTATTATTATTTTTATTAAAAGAAATTCCAGTTTCCAATTATCATCTTCATGCTAGCTAGAAGCTTGTTTCCTTAAATTAGATTTTGAGTCTAAAAAACATTGCTTGCTTATGCAGGATAAATTTAATAGTTGACTATACTCCGGCTTAAAATAGCCATCATTTATGAATATAATTTGGAAACTTCTTAATGTATGATGCGTGTCACTCATTCATTGTTGCACCTCGTTTTGAAGCTTAGGAGCTTTTGTAAATTGGTTTCCCTTGTGCTTATCTAAATATTGGCATTGAAAAAGCTGAATGATCGAGTGTCCTTCCAGTTGTTCTTCCACCAGATGGTTTATCATTCTTTATGTGATTATATCCAGTTGCAACAGGTTTCTTGAAGTCAGGGGCTTGAATAGCTATAAGTATTTTCACTGCATTGTAGCCAATTTTAAAAATATAAGCTGCTGGGCGTGATGTGATTGTTGATTATTTATATATTTGTTTCCTTTTTCTTTTCCTTCTGGCTCCTACCATGTGTTGATTGATATTATTGGAATTAGAGAAACATGAAACTTTAAATTTTTAAAAAATAGCCAATGGAAGCTAAATTACTTATTTCATTAAACAATAGAAGTTGTGATGTTTCTTTGTATTTTCCTCTTGTGGTGTCCCAAATACTACTTTTCTCATCTGAGTTATTCTCTTACCACAATTTCATGCATTTGAATTATCAGATTCAGAGGCATAGAGAATACCGACAAAAGATTATATCAAACTATCAGGTAAATGAAAGTCTTAGCTGTATGTATCTGAGTTAGTCATGCTTAACACTGGTGTAAGAATTTATGGAGTCCTTTTTTATATGCTTCAAAATCATCTAATGGCATAATATTTACTTCATAAATTGGAATTTTTGACTTTTGAGTATTTTCAAGGATCACTTTCCTTGGTTTGTCCTAATGTTACTGGTTTGAATGGAAATTGCCAAAGAAAAAAGAAAAAAAGAAAAAAGAAACAAAAAAGAAAACTTGCATTTGTTAATATACTTTTTTAACAATTATCTTTAGTAATAAAATTCATAGAAGTTAGATGAGATCTTGATATTTTGTAATGGATCTCCGTTTACATTTTCTTGCTTCAATGAATAGTTACTTACAGTACATCCACAATATATTCCCTTACTAAGTTGCTATCCTACTCTTTGCAGCCGTTACACAGGGAGCTTTACAGCATGCATGCTGCGAACTTCTTTGTCCCCTCTTTTCTCAAAGCTATCAATGAGAATTCGGAGGAGAGCTTTCGACGCATCATGTCTGAACCCTCTCCAGGAATTTATAAATTTGAAATGCTTCAACCACAATTCTGTGAAAAGCTGTTATCTGAGGTATAATAATTTAAAATTTGTTTGCATGTTTTTTACTTCCCTGAGATTTGATTTCTGCATAATACTATTCGCCTTTTAATTCTTAGGTGGAAAGCTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCTAACACAATGAACAAATATGGTGCTGTTCTTGATGACTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATACGTCCTATATCAAGAGGTAACACTTTTGATTGAGGTACTCCTTGAAAGTTGTTTCATGTAGAATACGAGAGCCATGATTTTTCACATGTTTGTTTTTGGTGTTATGTAGTTTTCTTTCCAGAAGTTGGAGGAGCCACACTGGACTCTCATCATGGTTTTGTTGTTGAATATGGAATTGACAGAGATGTGGAACTTGGTAGGTTTTAGTAGGAACTGGTCATTTTCTTGCTTTCCCTCCTATCAATCCTTTTTTATTAATGTTCATAGGGAGAAAAACATTTTCTTATGAATGGTTTTGTTAAATACAAAATTTTTAGAGATATGGAACATGGTAGTGCTTCCAGTAACTTTTTCTCCACTCGCCAATCTCTTGCTTCTTGNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTAAATTATGCCATCTATCCTTGTGCTTTTATTTGTGTCAATTTAATCCTTGTACTTTTTTGAAAGTTTTAATTTCATTACTGTAATTAATTTATCCCAAAAGTTCTTATTGTTAACTAGTGGTGTCAAATTGAATGGAAATTTGATAAAGCATTAATGTAAAGCTATGTGGACGTACTTCACACTGTACGTATGTTATGTGATAATGAGGCGGTCTACACTGTTTTCAATCACAATTGACAACACATGCCGTGGGAATGAATTATAAGGACAAATTTAAATATGACGTTGAAACTTTTTTAAGGGCAAAGTTGATACAAACGTGAAACTATAGGAACAATGTTATAATTTAACTTACTTTGTTTTTACCTTTAGTTCTCTACTTTTTCCTTTATGGACATATTTTAGTTATTGAAAATGGAGATCCAGTAAATGAATAGATGATATTAAACTTTAAAAGATCAACCGTAATCTGTTCTTTCTCTTTGCTCATTGCTCAAGACGATTGTTTTTGCTGAAATTTCATTTTTTGTCCTTTTGATAAAGTTTCCTATTTATATTTGTCTGTAGTTATTGCAATCTGTAAATTTTTTATGTTTTTTGTTTTGTTGCTCATTTTTTTTTATATTTTTCTGTTGATATCGAAACTATTAATCAATTTTATTTCAAAATGATTTTACTATAGGTTTTCACGTGGATGATTCGGAAGTCACATTGAATGTTTGTTTGGGTAAACAATTTTCTGGTGGTGAACTGTTCTTTCGCGGCATCCGATGTGACAAGCATGTTAATACGGAGACCCAACCAGAGGTATGATAGTCTTGAGAGAACACCTTACACCTTTATTTTGTAATGCTGGTTCCTTCCTTTGGGGTTGGCTTGGCTCATATATTTCAAGCTGCTTATTCATATCTGCAACCGCATAGATATGAGCTCATTAAAATTAATCATTAATATAGGTCATGAGCTGTTGGGATAATAATACGTTCTTGAACGCTTACTATTAATGTTGAAAAATCTGAACATTTAGGTATATGGGTACATGTAACCAAGAGTACTGAATATTTACCAGACGATGCAAATGTGTGAATATACAGTATACGATCAATTAGTAGTCATTTCATTTTTAGTAATACTCCGTGGTGGGTGAATTTTACCATAACAGTCTCTGTATTAGGAAATCTTTGACTATTTGCATGTTCCTGGACACGCGGTTCTTCACCGTGGTCGTCATCGGCATGGTGCTAGAGCTACAACATCTGGTCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGGTATTCTCTCACTAGTACTTCGACCTCTGAGATTGTTTTATTGTCCAAATACGAGTAAAAGGATGTAATATTTTGTTCAACAGCATAAGCACTTATTGGCAGAAAGTGATCATATATATATATATGTATGTATGTATGCATGCATGTGTGTGTGTGTGTGTATATATATATATATATATATATATATGAGGTATCACATTGCACTTTTACATGCAATCACCGTTTGACTGATGTAAGCACAAATTAAGATTAAGAATTAGCTTACATTAGATGAATTGGGTTATGCATATTTTTTTGACCGGAGATGAATTGGTTTTCTGTATGCTGAAGTTGTTCTTTTATATTTATATATTATATAGGAAGGCCTGAATCTGAGCACTAAATAAATGCTCATACGATGCTTACTGTTTGAGGTTGCTTGCATTTTTCAGCCAGCCGAAGACCGTTTATGATTGCCCATAAAATTTCTTTTTGACTTGAATTAATTTTTTCTCTTTTTATTGGCTGGTTTTAAGGTGTGTGAGCTTACTTGTTTTCTTAGTAAAAGCTTGTAGTAAGAACATTTTTACTCGTAATTAATTTAGTAAACAGTGTTTGATACCAAAATCATGTTAACGTTTGGTTTTATGTATTCTTGATACTGCTTTGGGGCTTCCTAGTCAAGTAGTTCGGCCAAGAGCAGCCAATATCACCTTACAAGTAGAATGCTTTCTCATAGAGCACTTTTAAGTGCTTAAGGTGCTGTTGACATTTTCTAAAAACATAATTTTTGTTACTTTGAAACGGTGTTTTTCAGTAGGAAGCATAGTATTATTATGTACATTATAGATGTATATTGGGCTCAGTTGCTTTCCGTGTTTCCTAACCCTTTTTTTTCCTGAAAAATCTTGTTACAGTTCTGTGTTCAGAGAGTTAAAGAAATATCAGAAAGATTTCTCCAGCTGGTGTGGAGAATGCCAACGTGAGAAGAGAGAAAGGCAACTCCTATCAATTGATGCCACAAAGCAGGTGCGTTTCGTTCTTGCTGAATTGAGTACTACTTTTAAGTTTTTACCTCAAATTCAATCTGGTATATCTATTGTCTCCAGGAGTTACTTAGAAGGGACGTAAAATCTCCTCCTTGAGCCTGCAATGGTGAAAACTATTGTTGAGGAGCTGTAAATTTTACCAAATATGAAGCATGTTCATCAAAACGAAGTCTGCTGTTCTAACTTTCGCCATGTGCAGGTAATGGGTGCTGTCCTGTATTATTTATTGTGCTGCATTACATTTTGCAGTTGTAGGTGTACATTTTAAAATGTTGTTGATAGGTGAAAATAGTTGTCAGAAAAATTTAACCAATTAGAAACCCGTTTGTGCCGAATTGGAGTGAAATAACATGTAATCTTAGCTTCTGATACTCCCTATATGTCAAATTGAATATTCTGTTCATAGTGCAGCAAATTTATTACACTGCTCATCAAGTCGGCGACGGCAATGAGATTGGATTAATGTGGCGAAACAAATTTTGGTATTGAGTTTTACTGTTTGGAGGAGTCTTAGTTACTCTTTTTAGCACTCGCACATAAGTATGTGTAGAACTTTTTGAACTTTAGATATCATCTTTGGTTCTTGATTTAGATCCTGATCCCCACCATGATCGTTGGTGGTCGTTGCTCTTCCTTCTCTCCCAATCTTACAACAAAATCGAGGAAAATTCAAGGAAAACCTCTGAAATAATATCGATATTTGGACCGTGAAGGTACAGTAATGGCTTCAGAATAATTTTGTGTTATTTTGGTAGTCCCAGAAGGGCTTCAAATAGATGGGAGGGAGTTAAGAATCTTCCTTAACATTAAAACTAGGACTCTGTTGCTTGTGTTTTAAAAATTGCCTTTTGAATGTTTTGATTTATATAGCTCCTAAACATTTACTTGTTATTATGCTTGTGAAAATCATTCTCTACAAATTTATGATGTTATGAGCATGAAGAAATTCGTAGGCGACATTCATATTATTGAGTGCTAATCATAGATTTATCAACTTGAATTTTCCTTATTAACTAAAATATACCATTAACCTTTTCATATGAAATCAGAAAATAATTAGTAGAATGCATAAGCATATATATTAGAGGACGTAGATTTTCTTCTCCAAATCTCCATTATTGGTGCTTACATTCAAACCGTTTAGCATACTTGTCTGACCTCCATTGCTGATCCAAAAAGCTTCCAGTTCTACCAGAAAAAACAATGTTTGCCTTCATTTTTTCTTTTTAAAGAACAGAACTTCACACGTTGTGGGTTTGCAGGTTGGAGAACGAGGTTTGAGCATAAGGACGCTTCTGGTTGCAGGATCAATCTTCTTAGCCATTGTAGAGAGTAGCTTATGTTATGAGTAGTCGAATAGTAGTCGCTCGAGGAGAGTGCTGCTCGAGGCCTCGTTGCTACTAGAGACCAGGGAATGTCACGCAGGTCAAAGAGTGTTAGAGTTGCTTTACAAGATCTTAGAACAACGTGTAGCCTCTTGCAAGCAACACTTTGTCGGTATCAATGGTGTTTTATATATATATATATATGCGACAATGTCAACTAAATATACCAAGTGAGTGAGTGAGTGAGTGTATATGAGAAATTCATGGAATAGACCTGTCAATAAAAAGTTAAATACTATATTATAAATGTAGGATGATATTATTATTATTATTATTATTATTATTATTATTGTGCTTGTGGCTGTGTATGACAAGAAA

mRNA sequence

AGGATAATTGGACGAGTAAAAACGTCCACGTAATCGGTTGGGTCCCACAGGTACACGTCAGCAACTAACGAAATAGAGCCTAACCCTAACCTGCAACTTTCACCATCTTCGTCTTCTTCCTCTTCATAACCCTAATTTCAATCCACTTCTCACACTCTCATCACAAAACATGCTTCTGTAATTCTTTCTTCAAAGGCATTTGACCTCTACAGATGCGAAGCTATGTCTCTGGAAGCTTCTCTTGAACGACGAAAGCAGCCCCAAGCTCCCGGGACTGGCAATGGAAATGGCGTCGTCTCGCCCAGTGCACACTCTTTCTCAACCCACAGGCTTCGTCTTCAACCCAAGGAAGATCACAAGTCGGAGAGCTATGAGGACCTTCAATTGGAATTCAGCCCGCTCCTCTTCAGCATGCTGGAAAGGCATTTGCCTCCGAGCATGCTCAATATGGCACGCGACCTTAAGCTTCAGTATATGAGGGACATTCTACTCCGATATGCTCCAGAGGGCGAACGCAACCGCATTCAGAGGCATAGAGAATACCGACAAAAGATTATATCAAACTATCAGCCGTTACACAGGGAGCTTTACAGCATGCATGCTGCGAACTTCTTTGTCCCCTCTTTTCTCAAAGCTATCAATGAGAATTCGGAGGAGAGCTTTCGACGCATCATGTCTGAACCCTCTCCAGGAATTTATAAATTTGAAATGCTTCAACCACAATTCTGTGAAAAGCTGTTATCTGAGGTGGAAAGCTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCTAACACAATGAACAAATATGGTGCTGTTCTTGATGACTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATACGTCCTATATCAAGAGTTTTCTTTCCAGAAGTTGGAGGAGCCACACTGGACTCTCATCATGGTTTTGTTGTTGAATATGGAATTGACAGAGATGTGGAACTTGGTTTTCACGTGGATGATTCGGAAGTCACATTGAATGTTTGTTTGGGTAAACAATTTTCTGGTGGTGAACTGTTCTTTCGCGGCATCCGATGTGACAAGCATGTTAATACGGAGACCCAACCAGAGGAAATCTTTGACTATTTGCATGTTCCTGGACACGCGGTTCTTCACCGTGGTCGTCATCGGCATGGTGCTAGAGCTACAACATCTGGTCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGTTCTGTGTTCAGAGAGTTAAAGAAATATCAGAAAGATTTCTCCAGCTGGTGTGGAGAATGCCAACGTGAGAAGAGAGAAAGGCAACTCCTATCAATTGATGCCACAAAGCAGGAGTTACTTAGAAGGGACGTAAAATCTCCTCCTTGAGCCTGCAATGGTGAAAACTATTGTTGAGGAGCTGTAAATTTTACCAAATATGAAGCATGTTCATCAAAACGAAGTCTGCTGTTCTAACTTTCGCCATGTGCAGGTTGGAGAACGAGGTTTGAGCATAAGGACGCTTCTGGTTGCAGGATCAATCTTCTTAGCCATTGTAGAGAGTAGCTTATGTTATGAGTAGTCGAATAGTAGTCGCTCGAGGAGAGTGCTGCTCGAGGCCTCGTTGCTACTAGAGACCAGGGAATGTCACGCAGGTCAAAGAGTGTTAGAGTTGCTTTACAAGATCTTAGAACAACGTGTAGCCTCTTGCAAGCAACACTTTGTCGGTATCAATGGTGTTTTATATATATATATATATGCGACAATGTCAACTAAATATACCAAGTGAGTGAGTGAGTGAGTGTATATGAGAAATTCATGGAATAGACCTGTCAATAAAAAGTTAAATACTATATTATAAATGTAGGATGATATTATTATTATTATTATTATTATTATTATTATTGTGCTTGTGGCTGTGTATGACAAGAAA

Coding sequence (CDS)

ATGTCTCTGGAAGCTTCTCTTGAACGACGAAAGCAGCCCCAAGCTCCCGGGACTGGCAATGGAAATGGCGTCGTCTCGCCCAGTGCACACTCTTTCTCAACCCACAGGCTTCGTCTTCAACCCAAGGAAGATCACAAGTCGGAGAGCTATGAGGACCTTCAATTGGAATTCAGCCCGCTCCTCTTCAGCATGCTGGAAAGGCATTTGCCTCCGAGCATGCTCAATATGGCACGCGACCTTAAGCTTCAGTATATGAGGGACATTCTACTCCGATATGCTCCAGAGGGCGAACGCAACCGCATTCAGAGGCATAGAGAATACCGACAAAAGATTATATCAAACTATCAGCCGTTACACAGGGAGCTTTACAGCATGCATGCTGCGAACTTCTTTGTCCCCTCTTTTCTCAAAGCTATCAATGAGAATTCGGAGGAGAGCTTTCGACGCATCATGTCTGAACCCTCTCCAGGAATTTATAAATTTGAAATGCTTCAACCACAATTCTGTGAAAAGCTGTTATCTGAGGTGGAAAGCTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCTAACACAATGAACAAATATGGTGCTGTTCTTGATGACTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATACGTCCTATATCAAGAGTTTTCTTTCCAGAAGTTGGAGGAGCCACACTGGACTCTCATCATGGTTTTGTTGTTGAATATGGAATTGACAGAGATGTGGAACTTGGTTTTCACGTGGATGATTCGGAAGTCACATTGAATGTTTGTTTGGGTAAACAATTTTCTGGTGGTGAACTGTTCTTTCGCGGCATCCGATGTGACAAGCATGTTAATACGGAGACCCAACCAGAGGAAATCTTTGACTATTTGCATGTTCCTGGACACGCGGTTCTTCACCGTGGTCGTCATCGGCATGGTGCTAGAGCTACAACATCTGGTCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGTTCTGTGTTCAGAGAGTTAAAGAAATATCAGAAAGATTTCTCCAGCTGGTGTGGAGAATGCCAACGTGAGAAGAGAGAAAGGCAACTCCTATCAATTGATGCCACAAAGCAGGAGTTACTTAGAAGGGACGTAAAATCTCCTCCTTGA

Protein sequence

MSLEASLERRKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKRERQLLSIDATKQELLRRDVKSPP
BLAST of Cp4.1LG13g06900 vs. Swiss-Prot
Match: Y1295_ARATH (Uncharacterized PKHD-type hydroxylase At1g22950 OS=Arabidopsis thaliana GN=At1g22950 PE=2 SV=2)

HSP 1 Score: 477.6 bits (1228), Expect = 1.3e-133
Identity = 226/380 (59.47%), Postives = 296/380 (77.89%), Query Frame = 1

Query: 1   MSLEASLER--RKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFS 60
           M+L++S ++  ++Q Q P   +GNG         +  +LR  P E+H+ E+YEDL L++S
Sbjct: 10  MALDSSGKQPEQQQQQQPRASSGNGE--------ARLKLRRTPNEEHEPENYEDLPLDYS 69

Query: 61  PLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPL 120
           P LF+ LER+LP  +LN  R  K  +MRD+LLRY+P+ ER R+ RH+EYR KI+S+YQ L
Sbjct: 70  PSLFTSLERYLPEQLLNSTRIDKASFMRDLLLRYSPDTERVRVLRHKEYRDKIMSSYQRL 129

Query: 121 HRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVES 180
           H E+Y++  ++FF PSFL A +  SE +FR  M E  PGI+ FEM +PQFCE LL+EVE 
Sbjct: 130 HGEIYTLDPSSFFAPSFLGAFSRKSEPNFRSSMVESYPGIFTFEMFKPQFCEMLLAEVEH 189

Query: 181 FERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDS 240
            E+WV++++  IMRPNTMN +G VLDDFG ++ML KL+DDFI PI++V FPEV G +LDS
Sbjct: 190 MEKWVYDSRSTIMRPNTMNNFGVVLDDFGFDSMLQKLVDDFISPIAQVLFPEVCGTSLDS 249

Query: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI 300
           HHG++VEYG DRDV+LGFHVDDSEV+LNVCLGKQFSGGEL+FRG+RCDKHVN+++  +E+
Sbjct: 250 HHGYIVEYGKDRDVDLGFHVDDSEVSLNVCLGKQFSGGELYFRGVRCDKHVNSDSTEKEV 309

Query: 301 FDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQR 360
           +DY HVPGHA+LHRGRHRHGARATTSG R NL+LWCRSS FRE+K YQ+DFS WCG C+ 
Sbjct: 310 YDYSHVPGHAILHRGRHRHGARATTSGHRANLILWCRSSTFREMKNYQRDFSGWCGGCKL 369

Query: 361 EKRERQLLSIDATKQELLRR 379
           +K+ RQ  SI+ATK+ L R+
Sbjct: 370 DKQRRQRDSINATKEILARK 381

BLAST of Cp4.1LG13g06900 vs. Swiss-Prot
Match: OGFD2_XENTR (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Xenopus tropicalis GN=ogfod2 PE=2 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 8.1e-40
Identity = 98/267 (36.70%), Postives = 149/267 (55.81%), Query Frame = 1

Query: 83  QYMRDILLRYAPEGERNRI--QRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAIN 142
           +  R++L     E ER R   +     R++I  +Y+PL+ E+Y +  + F    FL A+ 
Sbjct: 53  EQFRNVLETIKKEVERRRKLGEESLHRRREISLHYKPLYPEVYVLQES-FLAAEFLTAVK 112

Query: 143 ------ENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPN 202
                  N E     + S     IY+  +  P+FC KL+ E+E+FER    +     RPN
Sbjct: 113 YSKSPQANVEGLLHHLHSITDKRIYRLPVFIPEFCAKLVEELENFER----SDLPKGRPN 172

Query: 203 TMNKYGAVLDDFG-LETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVE 262
           TMN YG +L++ G ++ +   L + +I P++ + FP+ GG  LDSH  FVV+Y +  D++
Sbjct: 173 TMNNYGILLNELGFVDALTAPLCEKYIEPLTSLLFPDWGGGCLDSHRAFVVKYALQEDLD 232

Query: 263 LGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRG 322
           L  H D++EVTLNV LGK+F+ G L+F  ++ +  VN  T  E      H+ G  +LHRG
Sbjct: 233 LSCHYDNAEVTLNVSLGKEFTDGNLYFSDMK-EVPVNERTYAE----VEHITGQGILHRG 292

Query: 323 RHRHGARATTSGRRVNLLLWCRSSVFR 341
           +H HGA   +SG R NL+LW R+S  R
Sbjct: 293 QHVHGALPISSGERWNLILWMRASDVR 309

BLAST of Cp4.1LG13g06900 vs. Swiss-Prot
Match: OGFD2_DANRE (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Danio rerio GN=ogfod2 PE=2 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 8.3e-37
Identity = 91/261 (34.87%), Postives = 142/261 (54.41%), Query Frame = 1

Query: 86  RDILLRYAPEGER--NRIQRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAIN--- 145
           RD++ +   E ER  N   +  E    I   Y PLH+ +Y +  + F  P  L+ +    
Sbjct: 52  RDVIGKIQAEIERRQNHKLKSTERAAVIKEIYTPLHQHVYHLQES-FLAPELLEMVKYCA 111

Query: 146 ---ENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMN 205
               N +   + I +E +  +++F++ + +FC+ LL E+E FE    ++     RPNTMN
Sbjct: 112 SSEANVQGLLKLIQTEAASRVFRFQVFRKEFCKDLLEELEHFE----QSDAPKGRPNTMN 171

Query: 206 KYGAVLDDFGL-ETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGF 265
            YG VL++ G  E  +  L + ++RP++ + + + GG  LDSH  FVV+Y +  D+ L +
Sbjct: 172 NYGIVLNELGFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSY 231

Query: 266 HVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHR 325
           H D+SEVTLNV LGK F+ G LFF  +R            E  +  H     +LHRG+H 
Sbjct: 232 HYDNSEVTLNVSLGKDFTEGNLFFGDMR-----QVPLSETECVEVEHRVTEGLLHRGQHM 291

Query: 326 HGARATTSGRRVNLLLWCRSS 338
           HGA + +SG R NL++W R+S
Sbjct: 292 HGALSISSGTRWNLIIWMRAS 302

BLAST of Cp4.1LG13g06900 vs. Swiss-Prot
Match: OGFD2_HUMAN (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Homo sapiens GN=OGFOD2 PE=2 SV=2)

HSP 1 Score: 153.3 bits (386), Expect = 5.4e-36
Identity = 98/272 (36.03%), Postives = 141/272 (51.84%), Query Frame = 1

Query: 98  RNRIQRHREYRQKII-SNYQPLHRELYSMHAANFFVPSFLKAINENS---EESFRRIMSE 157
           R R+ +    R+ +I S+Y P   E+Y         P FL A+ E S   +   + ++  
Sbjct: 71  RQRLGQESAARKALIASSYHPARPEVYDSLQDAALAPEFL-AVTEYSVSPDADLKGLLQR 130

Query: 158 -----PSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGL 217
                    IY+  +    FC+ LL E+E FE    ++     RPNTMN YG +L + GL
Sbjct: 131 LETVSEEKRIYRVPVFTAPFCQALLEELEHFE----QSDMPKGRPNTMNNYGVLLHELGL 190

Query: 218 -ETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNV 277
            E ++  L + F++P+  + +P+ GG  LDSH  FVV+Y   +D+ELG H D++E+TLNV
Sbjct: 191 DEPLMTPLRERFLQPLMALLYPDCGGGRLDSHRAFVVKYAPGQDLELGCHYDNAELTLNV 250

Query: 278 CLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARATTSGRR 337
            LGK F+GG L+F G+         T   E  +  HV G  VLHRG   HGAR   +G R
Sbjct: 251 ALGKVFTGGALYFGGL-----FQAPTALTEPLEVEHVVGQGVLHRGGQLHGARPLGTGER 310

Query: 338 VNLLLWCRSSVFRELKKYQKDFSSWCGECQRE 360
            NL++W R+S  R         +S C  C RE
Sbjct: 311 WNLVVWLRASAVR---------NSLCPMCCRE 323

BLAST of Cp4.1LG13g06900 vs. Swiss-Prot
Match: OGFD2_MOUSE (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Mus musculus GN=Ogfod2 PE=2 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 7.1e-36
Identity = 88/254 (34.65%), Postives = 137/254 (53.94%), Query Frame = 1

Query: 96  GERNRIQRHREYRQKII-SNYQPLHRELYSMHAANFFVPSFLKAINENS------EESFR 155
           G R R+ +    R+ +I S+Y P   E+YS        P F+ A   ++      E   +
Sbjct: 68  GRRRRLGQESAVRKALIASSYHPARPEVYSSLQDAALAPEFMAAAEYSTSPGADLEGLLQ 127

Query: 156 RIMS-EPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFG 215
           R+ +      IY+  +   +FC+ LL E+E FE    ++     RPNTMN +G ++ + G
Sbjct: 128 RLETVSEEKRIYRVPVFSAKFCQTLLEELEHFE----QSDMPKGRPNTMNNHGVLMYELG 187

Query: 216 LET-MLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLN 275
           L+  ++  L + F+ P+  + +P+ GG  LDSH  FVV+Y + +D++LG H D++E+TLN
Sbjct: 188 LDDPLVTPLRERFLLPLMALLYPDYGGGYLDSHRAFVVKYALGQDLDLGCHYDNAELTLN 247

Query: 276 VCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARATTSGR 335
           V LGK F+GG L+F G+            +E  +  HV G  +LHRG   HGAR    G 
Sbjct: 248 VALGKDFTGGALYFGGL-----FQAPAALKETLEVEHVVGSGILHRGGQLHGARPLCKGE 307

Query: 336 RVNLLLWCRSSVFR 341
           R NL++W R+S  R
Sbjct: 308 RWNLVVWLRASAVR 312

BLAST of Cp4.1LG13g06900 vs. TrEMBL
Match: A0A0A0KPJ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576670 PE=4 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 1.3e-217
Identity = 372/384 (96.88%), Postives = 380/384 (98.96%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPL 60
           MSLEASLERRKQPQAPGTGNGNGVVSP+  S STHRLRLQPKEDHKSESYEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHR 120
           LFSMLERHLPP+MLN+AR++KLQYMRDILLRYAPEGERNR+QRHREYRQKIISNYQPLHR
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQPLHR 120

Query: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180
           ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE
Sbjct: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180

Query: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240
           RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH
Sbjct: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240

Query: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFD 300
           GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQ EEIFD
Sbjct: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFD 300

Query: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360
           YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK
Sbjct: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360

Query: 361 RERQLLSIDATKQELLRRDVKSPP 385
           RERQLLSIDATKQELLRR+VKSPP
Sbjct: 361 RERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG13g06900 vs. TrEMBL
Match: A0A061GER7_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_026929 PE=4 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 1.1e-176
Identity = 306/385 (79.48%), Postives = 342/385 (88.83%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTG-NGNGV-VSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFS 60
           MS + + +  +QP  P  G NGNGV V PS    + HRLRL P  +HK E+YE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGVAVLPSMA--TAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPL 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER R+QRHREYRQKIIS+YQPL
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQPL 120

Query: 121 HRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVES 180
           HRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE LLSEVE+
Sbjct: 121 HRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCELLLSEVEN 180

Query: 181 FERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDS 240
           FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFF +VGG+TLDS
Sbjct: 181 FEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSDVGGSTLDS 240

Query: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI 300
           HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVNTETQ +EI
Sbjct: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVNTETQSDEI 300

Query: 301 FDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQR 360
            DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFSSWCGECQR
Sbjct: 301 LDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFSSWCGECQR 360

Query: 361 EKRERQLLSIDATKQELLRRDVKSP 384
           EK+ERQ +SI ATKQELL+R+ K P
Sbjct: 361 EKKERQRVSIAATKQELLKREGKPP 383

BLAST of Cp4.1LG13g06900 vs. TrEMBL
Match: A0A067KFJ1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09294 PE=4 SV=1)

HSP 1 Score: 621.7 bits (1602), Expect = 6.0e-175
Identity = 292/361 (80.89%), Postives = 328/361 (90.86%), Query Frame = 1

Query: 19  GNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPLLFSMLERHLPPSMLNMAR 78
           GNGNGVV P      THRLRL P  DHK +SYEDLQ +FSPLLFS LER+LPPSMLN++R
Sbjct: 11  GNGNGVVLP------THRLRLNPSTDHKPDSYEDLQSDFSPLLFSSLERYLPPSMLNLSR 70

Query: 79  DLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKA 138
           D K+Q+++DIL+RY+PEGER R+ +H+EYRQKIISNYQPLHRELYSMHA NFFVPSFLKA
Sbjct: 71  DAKIQFLKDILVRYSPEGERARVLKHKEYRQKIISNYQPLHRELYSMHAENFFVPSFLKA 130

Query: 139 INENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNK 198
           +NEN+E+SFR I++EP PGIY FEMLQP FCE L+SEVE+FERWVH+TKFRIMRPNTMNK
Sbjct: 131 VNENTEQSFRNILAEPRPGIYVFEMLQPHFCEMLMSEVENFERWVHDTKFRIMRPNTMNK 190

Query: 199 YGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHV 258
           YGAVLDDFGL+TML KLMD++IRP+SR+FFPEVGG+TLDSHHGFVVEYG+DRDVELGFHV
Sbjct: 191 YGAVLDDFGLQTMLQKLMDEYIRPMSRIFFPEVGGSTLDSHHGFVVEYGVDRDVELGFHV 250

Query: 259 DDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHG 318
           DDSEVTLNVCLGKQF GGELFFRG+RCDKHVNTETQPEEIFDY+HVPG AV HRGRHRHG
Sbjct: 251 DDSEVTLNVCLGKQFYGGELFFRGVRCDKHVNTETQPEEIFDYVHVPGRAVFHRGRHRHG 310

Query: 319 ARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKRERQLLSIDATKQELLRR 378
           ARATTSG R NL+LWCRSSVFRELKKYQKDFS+WCGEC  EK++RQ LSI ATK ELL+R
Sbjct: 311 ARATTSGHRCNLILWCRSSVFRELKKYQKDFSNWCGECLHEKKDRQRLSITATKLELLKR 365

Query: 379 D 380
           D
Sbjct: 371 D 365

BLAST of Cp4.1LG13g06900 vs. TrEMBL
Match: B9SJY6_RICCO (Oxidoreductase, putative OS=Ricinus communis GN=RCOM_0577570 PE=4 SV=1)

HSP 1 Score: 620.2 bits (1598), Expect = 1.7e-174
Identity = 298/376 (79.26%), Postives = 334/376 (88.83%), Query Frame = 1

Query: 6   SLERRKQPQAPGTGN--GNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPLLFS 65
           SL+  K+P   G GN  GNGVV P+     THRLRL P  DH  ESY+DLQLEFSPLLFS
Sbjct: 2   SLDTHKRPNGNGNGNSTGNGVVLPTQ---VTHRLRLNPSTDHNPESYDDLQLEFSPLLFS 61

Query: 66  MLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHRELY 125
            LER+LPPSMLN++RD K+Q+M+DIL+RY+PEGER RIQ+HREYRQKIISNYQPLHRELY
Sbjct: 62  SLERYLPPSMLNLSRDSKIQFMKDILVRYSPEGERTRIQKHREYRQKIISNYQPLHRELY 121

Query: 126 SMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWV 185
           +++AA FFVPSFLKAINEN+EE FR I  EP+PG+Y FEMLQP FCE L+SEVE+FERWV
Sbjct: 122 TVNAAKFFVPSFLKAINENTEEGFRNIWVEPTPGVYVFEMLQPNFCEMLMSEVENFERWV 181

Query: 186 HETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFV 245
           HETKFRIMRPNTMN YGAVLDDFGLETMLDKLMD++IRP+S++FFPEVGG+TLDSHHGF+
Sbjct: 182 HETKFRIMRPNTMNNYGAVLDDFGLETMLDKLMDEYIRPMSKLFFPEVGGSTLDSHHGFI 241

Query: 246 VEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFDYLH 305
           VEYG+DRDVELGFHVDDSEVTLNVCL KQF GG+LFFRG+RCDKHVNTETQ EEI DY+H
Sbjct: 242 VEYGVDRDVELGFHVDDSEVTLNVCLSKQFVGGDLFFRGVRCDKHVNTETQAEEILDYVH 301

Query: 306 VPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKRER 365
           V GHAVLH GRHRHGARATTSGRRVNL+LWCRSSVFRELKKYQKD SSWC EC+REK+ER
Sbjct: 302 VQGHAVLHHGRHRHGARATTSGRRVNLILWCRSSVFRELKKYQKDCSSWCRECRREKKER 361

Query: 366 QLLSIDATKQELLRRD 380
           Q LSI ATK ELL+RD
Sbjct: 362 QRLSITATKLELLKRD 374

BLAST of Cp4.1LG13g06900 vs. TrEMBL
Match: A0A061G6N2_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_026929 PE=4 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 1.9e-173
Identity = 300/375 (80.00%), Postives = 334/375 (89.07%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTG-NGNGV-VSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFS 60
           MS + + +  +QP  P  G NGNGV V PS    + HRLRL P  +HK E+YE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGVAVLPSMA--TAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPL 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER R+QRHREYRQKIIS+YQPL
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQPL 120

Query: 121 HRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVES 180
           HRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE LLSEVE+
Sbjct: 121 HRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCELLLSEVEN 180

Query: 181 FERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDS 240
           FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFF +VGG+TLDS
Sbjct: 181 FEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSDVGGSTLDS 240

Query: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI 300
           HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVNTETQ +EI
Sbjct: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVNTETQSDEI 300

Query: 301 FDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQR 360
            DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFSSWCGECQR
Sbjct: 301 LDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFSSWCGECQR 360

Query: 361 EKRERQLLSIDATKQ 374
           EK+ERQ +SI ATKQ
Sbjct: 361 EKKERQRVSIAATKQ 373

BLAST of Cp4.1LG13g06900 vs. TAIR10
Match: AT3G18210.1 (AT3G18210.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 519.6 bits (1337), Expect = 1.6e-147
Identity = 253/387 (65.37%), Postives = 309/387 (79.84%), Query Frame = 1

Query: 6   SLERRK--QPQAPGTGNGNGVVS-PSAHS------------FSTHRLRLQPKEDHKSESY 65
           S E+R+  Q     T  GNG ++  ++HS             S  RLRL P  +H+ +SY
Sbjct: 2   SSEQREGSQETTTTTVEGNGTIAGQNSHSAAPTTLRATSTMVSCQRLRLNPNNEHRPDSY 61

Query: 66  EDLQLEFSPLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQK 125
           EDLQL+F   ++S LE++LPP+ML   RD K+++M DI+LR+ P GER+R QRH +YR K
Sbjct: 62  EDLQLDFPNSVYSSLEKYLPPNMLVSNRDEKIKFMTDIMLRHLPHGERSRAQRHSDYRLK 121

Query: 126 IISNYQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 185
           I +NYQPLH+ELY++     FVP+FLKAINEN+EESFR I+SEPSPG++ F+MLQP FCE
Sbjct: 122 ITTNYQPLHKELYTLVPTVCFVPAFLKAINENTEESFRNIISEPSPGVFVFDMLQPSFCE 181

Query: 186 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPE 245
            +L+E+++FERWV ETKFRIMRPNTMNKYGAVLDDFGL+TMLDKLM+ FIRPIS+VFF +
Sbjct: 182 MMLAEIDNFERWVGETKFRIMRPNTMNKYGAVLDDFGLDTMLDKLMEGFIRPISKVFFSD 241

Query: 246 VGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 305
           VGGATLDSHHGFVVEYG DRDV+LGFHVDDSEVTLNVCLG QF GGELFFRG RC+KHVN
Sbjct: 242 VGGATLDSHHGFVVEYGKDRDVDLGFHVDDSEVTLNVCLGNQFVGGELFFRGTRCEKHVN 301

Query: 306 TETQPEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 365
           T T+ +E +DY H+PG AVLHRGRHRHGARATT G RVN+LLWCRSSVFRELK + KDFS
Sbjct: 302 TATKADETYDYCHIPGQAVLHRGRHRHGARATTCGHRVNMLLWCRSSVFRELKTHHKDFS 361

Query: 366 SWCGECQREKRERQLLSIDATKQELLR 378
           SWCGEC  EKR+ ++ SIDA +++L +
Sbjct: 362 SWCGECFCEKRDEKVRSIDALRKKLFK 388

BLAST of Cp4.1LG13g06900 vs. TAIR10
Match: AT1G22950.1 (AT1G22950.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 477.6 bits (1228), Expect = 7.0e-135
Identity = 226/380 (59.47%), Postives = 296/380 (77.89%), Query Frame = 1

Query: 1   MSLEASLER--RKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFS 60
           M+L++S ++  ++Q Q P   +GNG         +  +LR  P E+H+ E+YEDL L++S
Sbjct: 10  MALDSSGKQPEQQQQQQPRASSGNGE--------ARLKLRRTPNEEHEPENYEDLPLDYS 69

Query: 61  PLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPL 120
           P LF+ LER+LP  +LN  R  K  +MRD+LLRY+P+ ER R+ RH+EYR KI+S+YQ L
Sbjct: 70  PSLFTSLERYLPEQLLNSTRIDKASFMRDLLLRYSPDTERVRVLRHKEYRDKIMSSYQRL 129

Query: 121 HRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVES 180
           H E+Y++  ++FF PSFL A +  SE +FR  M E  PGI+ FEM +PQFCE LL+EVE 
Sbjct: 130 HGEIYTLDPSSFFAPSFLGAFSRKSEPNFRSSMVESYPGIFTFEMFKPQFCEMLLAEVEH 189

Query: 181 FERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDS 240
            E+WV++++  IMRPNTMN +G VLDDFG ++ML KL+DDFI PI++V FPEV G +LDS
Sbjct: 190 MEKWVYDSRSTIMRPNTMNNFGVVLDDFGFDSMLQKLVDDFISPIAQVLFPEVCGTSLDS 249

Query: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI 300
           HHG++VEYG DRDV+LGFHVDDSEV+LNVCLGKQFSGGEL+FRG+RCDKHVN+++  +E+
Sbjct: 250 HHGYIVEYGKDRDVDLGFHVDDSEVSLNVCLGKQFSGGELYFRGVRCDKHVNSDSTEKEV 309

Query: 301 FDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQR 360
           +DY HVPGHA+LHRGRHRHGARATTSG R NL+LWCRSS FRE+K YQ+DFS WCG C+ 
Sbjct: 310 YDYSHVPGHAILHRGRHRHGARATTSGHRANLILWCRSSTFREMKNYQRDFSGWCGGCKL 369

Query: 361 EKRERQLLSIDATKQELLRR 379
           +K+ RQ  SI+ATK+ L R+
Sbjct: 370 DKQRRQRDSINATKEILARK 381

BLAST of Cp4.1LG13g06900 vs. TAIR10
Match: AT1G48740.2 (AT1G48740.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 382.1 bits (980), Expect = 4.0e-106
Identity = 183/344 (53.20%), Postives = 245/344 (71.22%), Query Frame = 1

Query: 36  RLRLQPKEDHKSESYEDLQLEFSPLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPE 95
           RL   P  +H S++Y DL+LE+S  + S LE++LPP ML   R+ K ++M DIL +Y   
Sbjct: 32  RLSPFPNMEHISDNYGDLELEYSSAMLSSLEKYLPPEMLTATREEKAKFMSDILRKYISR 91

Query: 96  GERNRIQRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPS 155
            E ++ +  + Y QKI SNYQPL RELY+     F +PSF KAI+EN++ESFRRI+SEP 
Sbjct: 92  DECSKAKWCKNYWQKIKSNYQPLSRELYNFDPELFLLPSFRKAISENTKESFRRIISEPF 151

Query: 156 PGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKL 215
           PG+  F+M QP F +KL+ EVE+  +WVHET F I RP  M+KYG    DFGL+ ML +L
Sbjct: 152 PGVLVFQMFQPDFIQKLIVEVENIGKWVHETNFPIRRPYHMSKYGVAFVDFGLDIMLQQL 211

Query: 216 MDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSG 275
           M++F+ PI +VFFPE  GA  DSHHG+ +E G DRD  LG+H+DDSE+TLNVC+ KQF G
Sbjct: 212 MEEFLFPICKVFFPEECGAMFDSHHGYYIENGEDRDPPLGYHLDDSEITLNVCVRKQFEG 271

Query: 276 GELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARATT-SGRRVNLLLWC 335
           GE+ F G RC +H  T+ +PEE+F Y H PG A+LHRGRHRHG RA T S  R N++L C
Sbjct: 272 GEISFIGTRCLRHKRTDVKPEEVFHYCHSPGQAILHRGRHRHGPRANTPSCSRANMILCC 331

Query: 336 RSSVFRELKKYQKDFSSWCGECQREKRERQLLSIDATKQELLRR 379
           R+S+FRE++KY+KDF  WC EC  EK+E++  S+DA ++ + +R
Sbjct: 332 RNSLFREMEKYEKDFPEWCNECAHEKKEKESQSLDAKRKVIKKR 375

BLAST of Cp4.1LG13g06900 vs. TAIR10
Match: AT5G43660.1 (AT5G43660.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 374.4 bits (960), Expect = 8.4e-104
Identity = 178/344 (51.74%), Postives = 242/344 (70.35%), Query Frame = 1

Query: 36  RLRLQPKEDHKSESYEDLQLEFSPLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPE 95
           RL L P  +H S++YEDL+LEFS  +   LER+LPP +L   R+ K ++M DIL +Y   
Sbjct: 54  RLSLLPNNEHNSDNYEDLELEFSSSVLRSLERYLPPEILTANREEKAKFMSDILHKYISR 113

Query: 96  GERNRIQRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPS 155
            E  +  R + YR+ I+SNYQP  RELY +   +  +P F KA+ EN+EESFRRIM EP 
Sbjct: 114 EECAKAIRFKNYREWIMSNYQPRFRELYKLDPESLLLPCFRKAVRENTEESFRRIMFEPF 173

Query: 156 PGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKL 215
           PG+Y F+M QP F +KLL EVE+  +W+HE K  I +PN  +KYG VLDDFG++ ML  L
Sbjct: 174 PGVYVFKMFQPDFFQKLLVEVENMRKWLHEAKLMIRKPNNKSKYGVVLDDFGMDIMLKPL 233

Query: 216 MDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSG 275
           ++DFI PI +VFFP+V G   D+ HGFV+E   DRD ELGFHV++S++TLNVCL KQ  G
Sbjct: 234 VEDFIFPICKVFFPQVCGTMFDTQHGFVIENCEDRDAELGFHVENSDITLNVCLSKQSEG 293

Query: 276 GELFFRGIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARAT-TSGRRVNLLLWC 335
           GE+ F G RC+KH+    +PEEIF+Y H PG A+LH G H HGA+A  TS  R N++LWC
Sbjct: 294 GEILFTGTRCNKHLKAGPKPEEIFEYCHEPGQAILHLGCHSHGAKAAITSCSRANMILWC 353

Query: 336 RSSVFRELKKYQKDFSSWCGECQREKRERQLLSIDATKQELLRR 379
            +S+FRE++ Y  +F  WCG+C REK+E++  S+ A+K+++ +R
Sbjct: 354 INSLFREMQTYDNEFRDWCGQCAREKKEKKSQSL-ASKRKVKKR 396

BLAST of Cp4.1LG13g06900 vs. TAIR10
Match: AT1G48700.1 (AT1G48700.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 355.1 bits (910), Expect = 5.3e-98
Identity = 159/283 (56.18%), Postives = 214/283 (75.62%), Query Frame = 1

Query: 102 QRHREYRQKIISNYQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKF 161
           +R + YRQ+IISNYQP  + LY +    F +PSF KAI+EN+EESFRRI+SEP PG++ F
Sbjct: 5   KRRKTYRQEIISNYQPRFKGLYKLDPKLFLLPSFRKAISENTEESFRRIISEPFPGVFVF 64

Query: 162 EMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIR 221
           +M QP F EKLL EVE+F +W +ET F I RP+  +KYG VLDDFGL+ ML +LMDDFI 
Sbjct: 65  KMFQPDFSEKLLLEVENFRKWANETNFTIRRPDNTSKYGVVLDDFGLDIMLKQLMDDFIF 124

Query: 222 PISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFR 281
           PI +VFFPEV G   DSH+GF +E G DRD ++GFHV+DS++TLNVCL KQ  GGE+ F 
Sbjct: 125 PICKVFFPEVCGTMFDSHYGFFIENGEDRDADVGFHVEDSDITLNVCLSKQGEGGEILFA 184

Query: 282 GIRCDKHVNTETQPEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRE 341
           G RC+KH++ + +PEE FDY H+PG A+LHRG H HGARAT SGRR N++LWC++S+FRE
Sbjct: 185 GARCNKHMDIDPKPEEYFDYCHIPGQAILHRGCHVHGARATASGRRANMILWCQNSLFRE 244

Query: 342 LKKYQKDFSSWCGECQREKRERQLLSIDATKQELLRRDVKSPP 385
           ++ Y+ +FS WCG+C  E++E +   +   ++E+ R + ++ P
Sbjct: 245 MQTYEPEFSDWCGQCVHEEKENKSQILAVKRKEMFRIESEAEP 287

BLAST of Cp4.1LG13g06900 vs. NCBI nr
Match: gi|449458771|ref|XP_004147120.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950 [Cucumis sativus])

HSP 1 Score: 763.5 bits (1970), Expect = 1.8e-217
Identity = 372/384 (96.88%), Postives = 380/384 (98.96%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPL 60
           MSLEASLERRKQPQAPGTGNGNGVVSP+  S STHRLRLQPKEDHKSESYEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHR 120
           LFSMLERHLPP+MLN+AR++KLQYMRDILLRYAPEGERNR+QRHREYRQKIISNYQPLHR
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQPLHR 120

Query: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180
           ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE
Sbjct: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180

Query: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240
           RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH
Sbjct: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240

Query: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFD 300
           GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQ EEIFD
Sbjct: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFD 300

Query: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360
           YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK
Sbjct: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360

Query: 361 RERQLLSIDATKQELLRRDVKSPP 385
           RERQLLSIDATKQELLRR+VKSPP
Sbjct: 361 RERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG13g06900 vs. NCBI nr
Match: gi|659072926|ref|XP_008467170.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Cucumis melo])

HSP 1 Score: 763.1 bits (1969), Expect = 2.4e-217
Identity = 371/384 (96.61%), Postives = 380/384 (98.96%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPL 60
           MSLEASLERRKQPQAPGTGNGNGVVSP+  S STHRLRLQPKEDHKSESYEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHR 120
           LFSMLERHLPP+MLN+AR++KLQYMRDILLRYAPEGERNR+QRHREYRQKIISNYQPLHR
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQPLHR 120

Query: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180
           ELYSMHAANFFVPSFLKA+NENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE
Sbjct: 121 ELYSMHAANFFVPSFLKAVNENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180

Query: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240
           RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH
Sbjct: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240

Query: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFD 300
           GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQ EEIFD
Sbjct: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFD 300

Query: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360
           YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK
Sbjct: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360

Query: 361 RERQLLSIDATKQELLRRDVKSPP 385
           RERQLLSIDATKQELLRR+VKSPP
Sbjct: 361 RERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG13g06900 vs. NCBI nr
Match: gi|1009120068|ref|XP_015876721.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Ziziphus jujuba])

HSP 1 Score: 645.2 bits (1663), Expect = 7.2e-182
Identity = 310/383 (80.94%), Postives = 348/383 (90.86%), Query Frame = 1

Query: 1   MSLEASLERRKQPQ-APGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSP 60
           MS++ SL+RRKQP  A   GNGNGVV P    ++ +RLRL P +DHK ++YEDLQLEFSP
Sbjct: 1   MSVDGSLDRRKQPPPAQSAGNGNGVVQPGVGQYAANRLRLNPNKDHKPDNYEDLQLEFSP 60

Query: 61  LLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLH 120
           LLFS LE++LPP+ML ++RD+KLQYMR ILLRY+PEGER R+QRHREYRQKIISNYQPL+
Sbjct: 61  LLFSSLEQYLPPTMLKVSRDVKLQYMRHILLRYSPEGERLRVQRHREYRQKIISNYQPLY 120

Query: 121 RELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESF 180
           RELY+MHAANFFVPSFLKA+++N+EESFR IM EP+PGIY FEMLQP FCE LL+EVE+F
Sbjct: 121 RELYTMHAANFFVPSFLKALSDNTEESFRNIMVEPAPGIYAFEMLQPNFCEMLLTEVENF 180

Query: 181 ERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSH 240
           ERWVHETKFRIMRPNTMNKYGAVLDDFGLETML+KL+DDFIRPISRVFFPEVGG+TLDSH
Sbjct: 181 ERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLEKLLDDFIRPISRVFFPEVGGSTLDSH 240

Query: 241 HGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIF 300
           HGFVVEYGIDRDVELGFHVDDSEVTLNVCLG+QFSGGELFFRG+RCDKHVN+ETQ EEI 
Sbjct: 241 HGFVVEYGIDRDVELGFHVDDSEVTLNVCLGRQFSGGELFFRGVRCDKHVNSETQSEEIL 300

Query: 301 DYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQRE 360
           DY H  G AVLHRGRHRHGARATT+GRRVNLLLWCRSSV+REL+KYQKD SSWCGECQRE
Sbjct: 301 DYSHALGRAVLHRGRHRHGARATTAGRRVNLLLWCRSSVYRELRKYQKDCSSWCGECQRE 360

Query: 361 KRERQLLSIDATKQELLRRDVKS 383
           K+ERQ LSI ATK ELL+RD K+
Sbjct: 361 KKERQRLSIAATKMELLKRDGKA 383

BLAST of Cp4.1LG13g06900 vs. NCBI nr
Match: gi|720075726|ref|XP_010279401.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Nelumbo nucifera])

HSP 1 Score: 627.9 bits (1618), Expect = 1.2e-176
Identity = 301/382 (78.80%), Postives = 340/382 (89.01%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTGNGNGVVSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFSPL 60
           MS + S+ RR++ Q  G GNGNGVV+ S   +++HRLRL P  DHK E+Y+DLQLEFSP 
Sbjct: 1   MSCDGSVGRREESQT-GNGNGNGVVASSRPLYASHRLRLNPNTDHKPENYDDLQLEFSPS 60

Query: 61  LFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPLHR 120
           +FS LER+LPPSMLN++RD K+QYM++IL RY PEGER R+QRHREYRQKIISNYQPLHR
Sbjct: 61  VFSSLERYLPPSMLNVSRDAKVQYMKEILSRYLPEGERTRVQRHREYRQKIISNYQPLHR 120

Query: 121 ELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFE 180
           ELY++H   FFVPSF+KAI+EN+EES R I+SEPSPG+Y FEMLQP+FCE LLSEVE+FE
Sbjct: 121 ELYTIHPTTFFVPSFIKAISENTEESLRSIISEPSPGVYTFEMLQPRFCELLLSEVENFE 180

Query: 181 RWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHH 240
           +WV E KFRIMRPNTMNK+GAVLDDFGLETMLDKLMDDF+RPIS+VFF EVGG+TLDSHH
Sbjct: 181 KWVREAKFRIMRPNTMNKFGAVLDDFGLETMLDKLMDDFLRPISKVFFAEVGGSTLDSHH 240

Query: 241 GFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEIFD 300
           GFVVEYG DRDV+LGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI D
Sbjct: 241 GFVVEYGKDRDVDLGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEILD 300

Query: 301 YLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360
           Y HVPG AVLHRGRHRHGARATTSG R+NLLLWCRSSVFRELKKYQKDFSSWCGECQREK
Sbjct: 301 YSHVPGQAVLHRGRHRHGARATTSGHRINLLLWCRSSVFRELKKYQKDFSSWCGECQREK 360

Query: 361 RERQLLSIDATKQELLRRDVKS 383
           +ERQ  S+ A+K EL RR+ +S
Sbjct: 361 KERQRQSVAASKLELFRREGES 381

BLAST of Cp4.1LG13g06900 vs. NCBI nr
Match: gi|590614320|ref|XP_007022907.1| (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 627.5 bits (1617), Expect = 1.6e-176
Identity = 306/385 (79.48%), Postives = 342/385 (88.83%), Query Frame = 1

Query: 1   MSLEASLERRKQPQAPGTG-NGNGV-VSPSAHSFSTHRLRLQPKEDHKSESYEDLQLEFS 60
           MS + + +  +QP  P  G NGNGV V PS    + HRLRL P  +HK E+YE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGVAVLPSMA--TAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNMARDLKLQYMRDILLRYAPEGERNRIQRHREYRQKIISNYQPL 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER R+QRHREYRQKIIS+YQPL
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQPL 120

Query: 121 HRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVES 180
           HRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE LLSEVE+
Sbjct: 121 HRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCELLLSEVEN 180

Query: 181 FERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDS 240
           FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFF +VGG+TLDS
Sbjct: 181 FEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSDVGGSTLDS 240

Query: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQPEEI 300
           HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVNTETQ +EI
Sbjct: 241 HHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVNTETQSDEI 300

Query: 301 FDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQR 360
            DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFSSWCGECQR
Sbjct: 301 LDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFSSWCGECQR 360

Query: 361 EKRERQLLSIDATKQELLRRDVKSP 384
           EK+ERQ +SI ATKQELL+R+ K P
Sbjct: 361 EKKERQRVSIAATKQELLKREGKPP 383

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1295_ARATH1.3e-13359.47Uncharacterized PKHD-type hydroxylase At1g22950 OS=Arabidopsis thaliana GN=At1g2... [more]
OGFD2_XENTR8.1e-4036.702-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Xenop... [more]
OGFD2_DANRE8.3e-3734.872-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Danio... [more]
OGFD2_HUMAN5.4e-3636.032-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Homo ... [more]
OGFD2_MOUSE7.1e-3634.652-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Mus m... [more]
Match NameE-valueIdentityDescription
A0A0A0KPJ6_CUCSA1.3e-21796.88Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576670 PE=4 SV=1[more]
A0A061GER7_THECC1.1e-17679.482-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=T... [more]
A0A067KFJ1_JATCU6.0e-17580.89Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09294 PE=4 SV=1[more]
B9SJY6_RICCO1.7e-17479.26Oxidoreductase, putative OS=Ricinus communis GN=RCOM_0577570 PE=4 SV=1[more]
A0A061G6N2_THECC1.9e-17380.002-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=T... [more]
Match NameE-valueIdentityDescription
AT3G18210.11.6e-14765.37 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G22950.17.0e-13559.47 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G48740.24.0e-10653.20 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT5G43660.18.4e-10451.74 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G48700.15.3e-9856.18 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
Match NameE-valueIdentityDescription
gi|449458771|ref|XP_004147120.1|1.8e-21796.88PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950 [Cucumis sativus][more]
gi|659072926|ref|XP_008467170.1|2.4e-21796.61PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Cucumis melo][more]
gi|1009120068|ref|XP_015876721.1|7.2e-18280.94PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Ziziphus jujuba... [more]
gi|720075726|ref|XP_010279401.1|1.2e-17678.80PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Nelumbo nucifer... [more]
gi|590614320|ref|XP_007022907.1|1.6e-17679.482-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [The... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0031418L-ascorbic acid binding
GO:0005506iron ion binding
GO:0016705oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0016491oxidoreductase activity
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
Vocabulary: INTERPRO
TermDefinition
IPR006620Pro_4_hyd_alph
IPR005123Oxoglu/Fe-dep_dioxygenase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006554 lysine catabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019538 protein metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0008475 procollagen-lysine 5-dioxygenase activity
molecular_function GO:0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG13g06900.1Cp4.1LG13g06900.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005123Oxoglutarate/iron-dependent dioxygenasePROFILEPS51471FE2OG_OXYcoord: 237..336
score: 10
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 156..335
score: 2.1
NoneNo IPR availablePANTHERPTHR24014FAMILY NOT NAMEDcoord: 33..379
score: 2.7E
NoneNo IPR availablePANTHERPTHR24014:SF42-OXOGLUTARATE AND IRON-DEPENDENT OXYGENASE DOMAIN-CONTAINING PROTEIN 2coord: 33..379
score: 2.7E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG13g06900Cucsa.280630Cucumber (Gy14) v1cgycpeB0735
Cp4.1LG13g06900CmaCh15G009330Cucurbita maxima (Rimu)cmacpeB305
Cp4.1LG13g06900CmoCh15G009730Cucurbita moschata (Rifu)cmocpeB268
Cp4.1LG13g06900CsGy5G019380Cucumber (Gy14) v2cgybcpeB600
Cp4.1LG13g06900Carg24917Silver-seed gourdcarcpeB1111
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG13g06900Cucumber (Chinese Long) v2cpecuB177
Cp4.1LG13g06900Cucumber (Chinese Long) v2cpecuB190
Cp4.1LG13g06900Bottle gourd (USVL1VR-Ls)cpelsiB135
Cp4.1LG13g06900Bottle gourd (USVL1VR-Ls)cpelsiB148
Cp4.1LG13g06900Watermelon (Charleston Gray)cpewcgB172
Cp4.1LG13g06900Watermelon (Charleston Gray)cpewcgB169
Cp4.1LG13g06900Watermelon (97103) v1cpewmB178
Cp4.1LG13g06900Watermelon (97103) v1cpewmB188
Cp4.1LG13g06900Melon (DHL92) v3.5.1cpemeB149
Cp4.1LG13g06900Melon (DHL92) v3.5.1cpemeB160
Cp4.1LG13g06900Cucumber (Gy14) v2cgybcpeB616
Cp4.1LG13g06900Melon (DHL92) v3.6.1cpemedB176
Cp4.1LG13g06900Melon (DHL92) v3.6.1cpemedB187
Cp4.1LG13g06900Silver-seed gourdcarcpeB0458
Cp4.1LG13g06900Silver-seed gourdcarcpeB1175
Cp4.1LG13g06900Silver-seed gourdcarcpeB1465
Cp4.1LG13g06900Cucumber (Chinese Long) v3cpecucB0212
Cp4.1LG13g06900Cucumber (Chinese Long) v3cpecucB0228
Cp4.1LG13g06900Wax gourdcpewgoB0227
Cp4.1LG13g06900Wax gourdcpewgoB0260
Cp4.1LG13g06900Cucurbita pepo (Zucchini)cpecpeB022
Cp4.1LG13g06900Cucurbita pepo (Zucchini)cpecpeB205
Cp4.1LG13g06900Cucumber (Gy14) v1cgycpeB0747
Cp4.1LG13g06900Cucurbita maxima (Rimu)cmacpeB422
Cp4.1LG13g06900Cucurbita maxima (Rimu)cmacpeB710
Cp4.1LG13g06900Cucurbita moschata (Rifu)cmocpeB383
Cp4.1LG13g06900Cucurbita moschata (Rifu)cmocpeB659
Cp4.1LG13g06900Cucurbita moschata (Rifu)cmocpeB663
Cp4.1LG13g06900Wild cucumber (PI 183967)cpecpiB179
Cp4.1LG13g06900Wild cucumber (PI 183967)cpecpiB192