CmaCh04G000360 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G000360
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Oxidoreductase, 2OG-Fe(II) oxygenase family protein
Location: Cma_Chr04 : 202968 .. 207520 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAAGTTCCGTTTCAGAGCAGTGGCGGAGTTGCGGTGAGCGGTGGTGAGATCCATCAGCACCGCCCCCGCCCCTGGTATCCTGATGAGCGTGATGGGCTTATCTCATGGTTCCGAGGCGAATTTGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAATATGATGTTGTTATTGGGTGTATACAGCAACGGCGGTGCAATTGGACGCCGGTGCTTCATATGCAGCAGTACTTTTCAGTGGCAGAAGTGATGTTTGCACTTCAGCAGGTCACCTCTAGAAGGCAGCAGAGGTTTGTTGATCCTATGAAAGTGGGGTCGAAGTTGTTTAGGAGACCTGGGCCAGCATTTAAGCAGCAGCATCAGCAGGATCAGCAGCATGGCCATCGCCTTGAAGCCACAGTCAAGGAAGAGATGGTCACTTGTGCGGAGTCTTGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTTGAGCAAGTAAGTAATACATGCGAGGAAAGTAATGCAACGGGGGAGGATGGTAAATTGAATGATAAGGATTCAGGGTCAGCTGAGGACATAAAAGGTAACCTTTTGTTCCACTATCAACGTTTGATCGTGCATGAACTTTATTTTCGGCTAAATTAGAAGTTTAGTTCATGGACTGTGTCTAATTGGATGGTGAACCTTAAAAGTTCAAGTTTGTAAAGTTTAGGGACCTAGAGACATTGGGTCAAACTTGTTATTCAATTTTAAGGATTGACTTTGAGCGTAACAGATAAGCTTGTTAAAATAGCAAAATACACTTGTTTCTTCATATTATAATCTATTATTAAAGGGAAAGGAACAAATAGTAAAATGTATTTGGTGTTACCTGTACAAGGCATTGCCCTGTTTTGCATTGATTAAGAAAATTTAAAACAAAAAGGAAGAGACATGTTGATTGTATGATCTGTGTGTCTTTTGCGTTACGTTTATGAACTGATTTTATTGTATTAAATGTGTTATAATTTTACATGTTGCTATCAATAACTACGTTTAGGACTAGGGATAAGATTGTAGAAAAGGTGAGATGGTACTTTTCTTTAACCCCAAAAATCTTTGCTGCATGGCACATATGGTTGATATGTTTGGATGTGATAAGCTTCGGAAAGTCCCATCTAAGGAAATGAAAAAGTCCAAGCTTTCATCCACTTCCGATTTTAGTTCTACTTTATATTTTGCAGTGGTTCACTGAAATGTACATGATTCTTGTTATTTCTATTTCATTCCTAGAGGTTTGAATTACTTCAGCTATATATTGTGTTATTGAAAATAGGAGTACATCTTAATTTTGACAAGCTTGTTTGTTGGGTAGACACTCATGGGAAGGACCAAAGTAATAGCAAACCCAAGTGTGCAGAAAACTTAGAAGACAATGCAAGCAATAAAGAATCTCAAGTTGAACCTACTGATGATGGATGTTCTTCAAGTCAAAGAGGCGAGTACTTGCTCCTTAAATCATATTTGTAGTTGATCTAGTTAGAAACTACAAACAATTGTATCAGAAATCATTTAATTTAATTTTGCTGCTTTAGCCCACTTGTAGTGGAATTATCAAAAATATATTTTTTACCAATGAAAGGGCATCATTATAATCTGTTATCAACCAACTCTTTCCCCAGATAAGGGGCTGCAGTCTGTTCAAAGCCGGAACGCAAGGCAGTATGCTGCCACAGCCCCAAGAACCTTTGCTGCCAATGAGATATTTGATGGAAAGACGGTATAACTGTTTAATTTTATCCTGAATTTGGTTCTTGGATTGGTTGTTGTGGGATTCTGTGAAGGAAAGAGTCTTTCACATTTAGCTTGACATGGTAGGCTGATTCACTTCTTATTATTTTATTCTTCTTCTACTATAGGTTAATGTGATGGATGGATTGAAATTGTATGAAGAATTATTGGATGAT
ATTGAGGTTTCAAAGCTTCTTTCATTGGTGAATGATTTGAGGGCTTCGGGAAAGAGAGGGCAACTCCAAGGCAAGTTCTTGTTCTTTTTAGCTCAGCTTGTGACTATATTTATGTTACTTTAAGAAAATCGATATCTTCTTTTTAATAAAGGAAGTTATCAAAATAAATTTCAACTAATATGGGTTTTGTGAAGGTTTATCCAATTGTGAACCAGATGACAATTATATTACATACAGCCTCGAGATAGTGAATTTAAGAATGGGAACAATCACTTAAATGTGGTTGTTGGAAACATTGTAACCTTGTGCTCACTGTTTTCTTTCTTTTTGTTGAGAATCTAATGAACTACTTTCATATCTTAGTACCTCACAATTATGCAAGTGAATGCTACATAAATATGGACAACTTTTGTATATTGCGTAATTTTCTTAAACTGAATAATACTGTATTTTTCTCTGCCACAAAATCAGGTCCGACATATATTGTCTCGAAAAGACCGATGAAGGGTCATGGGAGAGAGATGATCCAGCTAGGCTTCCCGATTGCAGATGCAGCTCATGATGACGCCAATTCTTCAGGGCTCTCAAAAGGCATTTACCATTATCTTTGATTTGGATTGCAATTTTTTAAATTTAGTTGAGCTATATAGCAATAAAAATATATGCTGATGTTAGTTTCTGTTCGATGTAGTATTTTAAATCTCCATTTTTTGTGAATTTTACAGATAGAAGAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATTGCTTGGTTTGGGAGCAAGTGATGACAGTGAAACCCGATTCCTGCATCATTGACTTTTATAACGAGGTCACTAATCGCCTACCTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNCATGCAAAGAGTACTGAAGTTGAAGTGGGAGTGTAAAATGAGAGAGAGAGAGAGAGAGAGAGAGGTGGGCTCAAACATGGTCAAAGCTGAGTAAAAAAGCATAAAAATTCCTCGGATTCTGTCTGATTCTTTGTCTTGTTTTTGTCACTCTCTTTACTTCCATTTTGAATCACCCAACACAATATAAGGCTTTTCCCTATGCCTTTCTACCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGCAAGTTTTCATCAGTGTATGGTGTAGACTAATATTCTTATTGGTTGATATCCTCAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTTGGTGTCCTCCTTTTAACTGAATGTGAAATGAGCTTTGGTAGAGTACTTGGTTCAGACCATTCTGGCAATTATAGAGGGGCTAAGACATTGTCTCTTGCACCGGGGTGGGTAACTATTTTATTCTCAATCTGCATATTTCTCTGAACCTTCTTTCATCCTCTGCCTCGTATTTACATTTTTTCTGTTTACTTTTACTTTCTTATGTTGATCATCCTCTTCCTGATATGTACTGTCTGTGTTTCATTGTAAAACACAGGAGCCTCCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCTGCTATGCGCAAGCAACGGATACTTGTTACCTTGACCAAATCACAACCAAAAAGAGCAGGACCAGCTGATGGGCAACGCACATCTTTGAATTTAGGTTCATATTCCAGTTGGGGCCCTCCATCAGCTAGATCACCCAATGCTCGTCCTTGCCCGGGACAGAAGCATTACCCTATGGGTCCATCGACAGGCGTTCTACCTGTGCCACCCATTCGTCCTCAATTGCCACCACAAAATGGCATCCCACCAATAATGGTGGCTCCTGTAGCACCACCACCTATGCCTTTCCCTCCTTCCGTGCCAATTCCAACGGGTCCACCTGCATGGCCTGCTGCTCATCCAAGGCATCCTCCGCCACGTCTCCCTGTTCCTGGCACTGGAGTATTC
CTTCCTCCAGGTGCTTCCAGTGCTCCATCTCCTCAACAGATGCCAAACTCCGCAGTGGAAACGAGTTCCCTTGCAGAAAAGGAAAATGGTCCGACGGAATCTGATCACAATGGAGGTGCTTCTCCAGGGGAAAAATCTGAAGCAAAACCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGGAGTTGTAAGAAGACGGAGGAAGAACAACCAAAGCAGCAGCAGGAGGAGGAGGAGAAAAGTGAGAATGTGGAGGCCCAAAATGCAGGAGGTGGAGAAGCTTAAAGACGGAGAAAAATGCATTACTTAAAGAGAAAAAGAAAAGAGAGAAGAAAAGCAGGTCGGCTGCAGACTTGAATGAGTTAGTCACAAGCAAAATGTAGATAGCGGCAACATTCAAAGACTGATACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAGGAGAACAAGGGGAGTTCCCTTTCAAAATCCTTTACTTCATTCCTTTTTTGTGTTGTCCAAAACAATTCTTGGTTAG

mRNA sequence

ATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAAGTTCCGTTTCAGAGCAGTGGCGGAGTTGCGGTGAGCGGTGGTGAGATCCATCAGCACCGCCCCCGCCCCTGGTATCCTGATGAGCGTGATGGGCTTATCTCATGGTTCCGAGGCGAATTTGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAATATGATGTTGTTATTGGGTGTATACAGCAACGGCGGTGCAATTGGACGCCGGTGCTTCATATGCAGCAGTACTTTTCAGTGGCAGAAGTGATGTTTGCACTTCAGCAGGTCACCTCTAGAAGGCAGCAGAGGTTTGTTGATCCTATGAAAGTGGGGTCGAAGTTGTTTAGGAGACCTGGGCCAGCATTTAAGCAGCAGCATCAGCAGGATCAGCAGCATGGCCATCGCCTTGAAGCCACAGTCAAGGAAGAGATGGTCACTTGTGCGGAGTCTTGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTTGAGCAAGTAAGTAATACATGCGAGGAAAGTAATGCAACGGGGGAGGATGGTAAATTGAATGATAAGGATTCAGGGTCAGCTGAGGACATAAAAGACACTCATGGGAAGGACCAAAGTAATAGCAAACCCAAGTGTGCAGAAAACTTAGAAGACAATGCAAGCAATAAAGAATCTCAAGTTGAACCTACTGATGATGGATGTTCTTCAAGTCAAAGAGATAAGGGGCTGCAGTCTGTTCAAAGCCGGAACGCAAGGCAGTATGCTGCCACAGCCCCAAGAACCTTTGCTGCCAATGAGATATTTGATGGAAAGACGGTTAATGTGATGGATGGATTGAAATTGTATGAAGAATTATTGGATGATATTGAGGTTTCAAAGCTTCTTTCATTGGTGAATGATTTGAGGGCTTCGGGAAAGAGAGGGCAACTCCAAGGTCCGACATATATTGTCTCGAAAAGACCGATGAAGGGTCATGGGAGAGAGATGATCCAGCTAGGCTTCCCGATTGCAGATGCAGCTCATGATGACGCCAATTCTTCAGGGCTCTCAAAAGATAGAAGAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATTGCTTGGTTTGGGAGCAAGTGATGACAGTGAAACCCGATTCCTGCATCATTGACTTTTATAACGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTTGGTGTCCTCCTTTTAACTGAATGTGAAATGAGCTTTGGTAGAGTACTTGGTTCAGACCATTCTGGCAATTATAGAGGGGCTAAGACATTGTCTCTTGCACCGGGGAGCCTCCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCTGCTATGCGCAAGCAACGGATACTTGTTACCTTGACCAAATCACAACCAAAAAGAGCAGGACCAGCTGATGGGCAACGCACATCTTTGAATTTAGGTTCATATTCCAGTTGGGGCCCTCCATCAGCTAGATCACCCAATGCTCGTCCTTGCCCGGGACAGAAGCATTACCCTATGGGTCCATCGACAGGCGTTCTACCTGTGCCACCCATTCGTCCTCAATTGCCACCACAAAATGGCATCCCACCAATAATGGTGGCTCCTGTAGCACCACCACCTATGCCTTTCCCTCCTTCCGTGCCAATTCCAACGGGTCCACCTGCATGGCCTGCTGCTCATCCAAGGCATCCTCCGCCACGTCTCCCTGTTCCTGGCACTGGAGTATTCCTTCCTCCAGGTGCTTCCAGTGCTCCATCTCCTCAACAGATGCCAAACTCCGCAGTGGAAACGAGTTCCCTTGCAGAAAAGGAAAATGGTCCGACGGAATCTGATCACAATGGAGGTGCTTCTCCAGGGGAAAAATCTGAAGCAAAACCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGGAGTTGTAA
GAAGACGGAGGAAGAACAACCAAAGCAGCAGCAGGAGGAGGAGGAGAAAAGTGAGAATGTGGAGGCCCAAAATGCAGGAGTTAGTCACAAGCAAAATGTAGATAGCGGCAACATTCAAAGACTGATACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAGGAGAACAAGGGGAGTTCCCTTTCAAAATCCTTTACTTCATTCCTTTTTTGTGTTGTCCAAAACAATTCTTGGTTAG

Coding sequence (CDS)

ATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAAGTTCCGTTTCAGAGCAGTGGCGGAGTTGCGGTGAGCGGTGGTGAGATCCATCAGCACCGCCCCCGCCCCTGGTATCCTGATGAGCGTGATGGGCTTATCTCATGGTTCCGAGGCGAATTTGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAATATGATGTTGTTATTGGGTGTATACAGCAACGGCGGTGCAATTGGACGCCGGTGCTTCATATGCAGCAGTACTTTTCAGTGGCAGAAGTGATGTTTGCACTTCAGCAGGTCACCTCTAGAAGGCAGCAGAGGTTTGTTGATCCTATGAAAGTGGGGTCGAAGTTGTTTAGGAGACCTGGGCCAGCATTTAAGCAGCAGCATCAGCAGGATCAGCAGCATGGCCATCGCCTTGAAGCCACAGTCAAGGAAGAGATGGTCACTTGTGCGGAGTCTTGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTTGAGCAAGTAAGTAATACATGCGAGGAAAGTAATGCAACGGGGGAGGATGGTAAATTGAATGATAAGGATTCAGGGTCAGCTGAGGACATAAAAGACACTCATGGGAAGGACCAAAGTAATAGCAAACCCAAGTGTGCAGAAAACTTAGAAGACAATGCAAGCAATAAAGAATCTCAAGTTGAACCTACTGATGATGGATGTTCTTCAAGTCAAAGAGATAAGGGGCTGCAGTCTGTTCAAAGCCGGAACGCAAGGCAGTATGCTGCCACAGCCCCAAGAACCTTTGCTGCCAATGAGATATTTGATGGAAAGACGGTTAATGTGATGGATGGATTGAAATTGTATGAAGAATTATTGGATGATATTGAGGTTTCAAAGCTTCTTTCATTGGTGAATGATTTGAGGGCTTCGGGAAAGAGAGGGCAACTCCAAGGTCCGACATATATTGTCTCGAAAAGACCGATGAAGGGTCATGGGAGAGAGATGATCCAGCTAGGCTTCCCGATTGCAGATGCAGCTCATGATGACGCCAATTCTTCAGGGCTCTCAAAAGATAGAAGAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATTGCTTGGTTTGGGAGCAAGTGATGACAGTGAAACCCGATTCCTGCATCATTGACTTTTATAACGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTTGGTGTCCTCCTTTTAACTGAATGTGAAATGAGCTTTGGTAGAGTACTTGGTTCAGACCATTCTGGCAATTATAGAGGGGCTAAGACATTGTCTCTTGCACCGGGGAGCCTCCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCTGCTATGCGCAAGCAACGGATACTTGTTACCTTGACCAAATCACAACCAAAAAGAGCAGGACCAGCTGATGGGCAACGCACATCTTTGAATTTAGGTTCATATTCCAGTTGGGGCCCTCCATCAGCTAGATCACCCAATGCTCGTCCTTGCCCGGGACAGAAGCATTACCCTATGGGTCCATCGACAGGCGTTCTACCTGTGCCACCCATTCGTCCTCAATTGCCACCACAAAATGGCATCCCACCAATAATGGTGGCTCCTGTAGCACCACCACCTATGCCTTTCCCTCCTTCCGTGCCAATTCCAACGGGTCCACCTGCATGGCCTGCTGCTCATCCAAGGCATCCTCCGCCACGTCTCCCTGTTCCTGGCACTGGAGTATTCCTTCCTCCAGGTGCTTCCAGTGCTCCATCTCCTCAACAGATGCCAAACTCCGCAGTGGAAACGAGTTCCCTTGCAGAAAAGGAAAATGGTCCGACGGAATCTGATCACAATGGAGGTGCTTCTCCAGGGGAAAAATCTGAAGCAAAACCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGGAGTTGTAA
GAAGACGGAGGAAGAACAACCAAAGCAGCAGCAGGAGGAGGAGGAGAAAAGTGAGAATGTGGAGGCCCAAAATGCAGGAGTTAGTCACAAGCAAAATGTAGATAGCGGCAACATTCAAAGACTGATACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAACTGCTACTACCATCAAGGAGAACAAGGGGAGTTCCCTTTCAAAATCCTTTACTTCATTCCTTTTTTGTGTTGTCCAAAACAATTCTTGGTTAG

Protein sequence

MAMPSGNVGVSDKVPFQSSGGVAVSGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCPGQKHYPMGPSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGASSAPSPQQMPNSAVETSSLAEKENGPTESDHNGGASPGEKSEAKPQRQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSENVEAQNAGVSHKQNVDSGNIQRLILPSNCYYHQTATTIKLLLPSRRTRGVPFQNPLLHSFFVLSKTILG
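
The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing N or other ambiguity symbols become "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS listed above.
cds_start = "ATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAA"
print(translate(cds_start))  # MAMPSGNVGVSDK
```

Passing the full CDS string to `translate` should yield the complete protein, provided the sequence is first joined into a single line.
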
BLAST of CmaCh04G000360 vs. TrEMBL
Match: W9S2C1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_019288 PE=4 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 2.1e-195
Identity = 395/702 (56.27%), Positives = 469/702 (66.81%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGVAVSGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAII 60
           MAMPSGNV  SDK+ F S        GEI  H  R W+PDERDG ISW RGEFAA+NA+I
Sbjct: 1   MAMPSGNVVSSDKMQFPSG---TAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMI 60

Query: 61  DALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRF 120
           D+LCHHLRAVGEPGEYD VI CIQ RRCNW PVLHMQQYFSVAEVMFALQQV  RRQQRF
Sbjct: 61  DSLCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRF 120

Query: 121 VDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAES-CNGGNSSSFVGSR 180
            DP+K+G+K F+R G  FKQ  + D         + K+   + AES C  GNSS      
Sbjct: 121 YDPVKMGNKEFKRSGVGFKQWQRND---------SFKDGRNSAAESHCLDGNSS------ 180

Query: 181 KVEQVSNTCEESNATGEDGKL--NDKDSGS---AEDIKDTHGKDQSNSKPKCAENLEDNA 240
                 N   E   + + G    N  D GS   A++  D+  K Q +   K   N E   
Sbjct: 181 ----FGNAASEKGGSDKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVV 240

Query: 241 SNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLK 300
           S  E +V   DDGC+SS ++    S   +N     A  P+TF+ NE+FDGK VNV++GLK
Sbjct: 241 SGSEPEVHAVDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLK 300

Query: 301 LYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAA 360
           LYEE   D EVSKL++LVNDLR++G+RG  Q  TY+VSKRPMKGHGRE IQLG PIADA 
Sbjct: 301 LYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAP 360

Query: 361 HDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPP 420
            +D  S+G  KDRR E+IP LLQD+ + LV  QV TVKPDSCIIDFYNEGDHSQPH+WP 
Sbjct: 361 VEDEISAGTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPS 420

Query: 421 WFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPA 480
           WFGRPV VL LTEC+M+FGRV   DH G+YRGA  LSL PGSLL +QGKSADFAKHAIP+
Sbjct: 421 WFGRPVCVLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPS 480

Query: 481 MRKQRILVTLTKSQPKRAGPADGQR-TSLNLGSYSSWGPPSARSPNARPCPGQKHYPMGP 540
           +R+QRILVT TKSQPK++ P+DGQR  S  +   S WGP  +RSPN    PG KHY   P
Sbjct: 481 LRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGPQPSRSPNHIRHPGPKHYAPVP 540

Query: 541 STGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRL 600
           +TGVL   P+RPQ+PP NGI P+ V     P MPFP  VPIP     W AA PRHPPPRL
Sbjct: 541 TTGVLQASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRL 600

Query: 601 PVPGTGVFLPP---GASSAPSPQQM---PNSAVETSSLAEKENGPTESDHNGGASPGEKS 660
           PVPGTGVFLPP   G +S+ S Q +    N  VET++  EKENG  + +H   ASP  K 
Sbjct: 601 PVPGTGVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKV 660

Query: 661 EAKPQRQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSENVEA 690
           ++K Q+QECNGS+DGSGS     +E+ +Q  +    S++  A
Sbjct: 661 DSKTQKQECNGSLDGSGSVISVTKEERQQSSDNTATSKSAAA 680

BLAST of CmaCh04G000360 vs. TrEMBL
Match: F6HFA9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06270 PE=4 SV=1)

HSP 1 Score: 690.3 bits (1780), Expect = 2.7e-195
Identity = 393/701 (56.06%), Positives = 477/701 (68.05%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGVAVSGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAII 60
           MAMPSGNV +SDK+ F   GG    GG    H  R W+PDERDG ISW RGEFAA+NAII
Sbjct: 1   MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 61  DALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRF 120
           D+LC+HLR +GEPGEYD VIGCIQQRR NW+ VLHMQQYFSVAEV++ALQQV  RRQQR 
Sbjct: 61  DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 121 VDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGS-R 180
           +DP+K   K ++R G A++Q        G R E              +  NSS  +    
Sbjct: 121 LDPVKGAGKEYKRYGVAYRQ--------GQRGETAKDSHNSNFENHSHDANSSGTLEKGE 180

Query: 181 KVEQVSNTCEESNATGEDGKLNDKDSGSAEDIK---DTHGKDQSNSKPKCAENLEDNASN 240
           +V ++ +  +  +     GKL DKD  +AE+ K   D   K  +NS  K +EN E +   
Sbjct: 181 RVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCG 240

Query: 241 -KESQVEPTDDGCSSSQR-------DKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVN 300
             E++    DDG + + +       +     VQ++N +    T+P+TF   EIFDGK VN
Sbjct: 241 ISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVN 300

Query: 301 VMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGF 360
           V+DGLKLYEEL DD EVSK +SLVNDLRA+GKRGQLQG T++VSKRPMKGHGREMIQLG 
Sbjct: 301 VVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGV 360

Query: 361 PIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQ 420
           PIADA  +D +  G SKDRR ESIPSLLQD+I  LV  QV+TVKPD+CIIDFYNEGDHSQ
Sbjct: 361 PIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQ 420

Query: 421 PHVWPPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFA 480
           PH+WP WFGRPV +L LTEC+M+FGRV+G+DH G+YRG+  LSL PGSLLV+QGKSADFA
Sbjct: 421 PHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFA 480

Query: 481 KHAIPAMRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCP-GQK 540
           KHAIP++RKQRILVT TKSQPK+   +DGQR        S W PP +RSPN    P G K
Sbjct: 481 KHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPK 540

Query: 541 HYPMGPSTGVL--PVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAH 600
           HY   P+TGVL  P PP+RPQLPP NG+ P+ V     P MPFP  VP+PTG P WPAA 
Sbjct: 541 HYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAP 600

Query: 601 PRHPPPRLPVPGTGVFL-PPGASSAPSPQQMPNSA----VETSSLAEKENGPTESDHNGG 660
           PRHPPPRLPVPGTGVFL PPG+ ++ SPQ +   A    VET++  EKENG  +S  N  
Sbjct: 601 PRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSN 660

Query: 661 -ASPGEKSEAKPQRQECNGSMDGSGSCKKTEEEQPKQQQEE 681
             SP  K + K  RQECNGSMD +G  ++   ++ +Q  +E
Sbjct: 661 TVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDE 693

BLAST of CmaCh04G000360 vs. TrEMBL
Match: A0A061EA95_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_011235 PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 8.6e-194
Identity = 397/696 (57.04%), Positives = 480/696 (68.97%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSS------GGVAVS--------GGEIHQHRPRPWYPDERDGLI 60
           MAMPSGNV +SDK+ F ++      GG AV         GGEIHQH  R W PDERDG I
Sbjct: 1   MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 61  SWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVM 120
            W RGEFAASNAIID+LCHHLR VGE GEY+ VI CIQQRRCNW PVLHMQQYFSVAEV 
Sbjct: 61  YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 121 FALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAES 180
           +ALQQV  RR+QR  +  KVG K F+R G  FK         G R+E   KE   +  +S
Sbjct: 121 YALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK---------GQRMEVA-KEGQNSGVDS 180

Query: 181 CNGGNSSSFVGSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKC 240
              GNS+    S + E+ S   EE  + GE GK+ DK S   ED KDT       SKP  
Sbjct: 181 --DGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDT------GSKP-- 240

Query: 241 AENLEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKT 300
                 +A + ES  E  + GC+SS ++  L S+Q++N +Q  A  P+TF  NE+FDGK 
Sbjct: 241 ------HAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKM 300

Query: 301 VNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQL 360
           VNV+DGLKLYEEL DD EV  L+SLVNDLRA+GKRGQLQG TY+ +KRPMKGHGREMIQL
Sbjct: 301 VNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQL 360

Query: 361 GFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDH 420
           G PIADA  DD N++G SKDRRIE IP LLQD I+ LV  QVMTVKPDSCIID YNEGDH
Sbjct: 361 GLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDH 420

Query: 421 SQPHVWPPWFGRPVGVLLLTECEMSFGR-VLGSDHSGNYRGAKTLSLAPGSLLVVQGKSA 480
           SQP +WPPWFG+PV ++ LTEC+++FGR V+ +DH G+YRG+  LSLAPGSLLV+QGKSA
Sbjct: 421 SQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSA 480

Query: 481 DFAKHAIPAMRKQRILVTLTK-SQPKRAGPADGQRTSLNLGSYSSWGPPSARSPN-ARPC 540
           DFAKHA+P++RKQRILVT TK  QPK++   + + +S ++   S WGPP +RSPN  R  
Sbjct: 481 DFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHS 540

Query: 541 PGQKHYPMGPSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPA 600
            G KHY + P+TGVLP PPIRPQ+PP +G+ P+ V     P + FP  VPIP G   WPA
Sbjct: 541 AGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPA 600

Query: 601 AHPRHPPPRLPVPGTGVFLPPGASSAPSPQQMPNSA------VETSSLAEKENGPTESDH 660
           A PRHPPPRLPVPGTGVFLPP  S   S QQ+  +A      VET+S  EKENG  + +H
Sbjct: 601 A-PRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNH 660

Query: 661 NGGASPGEKSEAKPQRQECNGSMDGSGSCKKTEEEQ 674
           +   SP  + + K  +Q+CNGS+DG+GS +   +E+
Sbjct: 661 H-TTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEE 668

BLAST of CmaCh04G000360 vs. TrEMBL
Match: A0A061E8L7_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011235 PE=4 SV=1)

HSP 1 Score: 680.6 bits (1755), Expect = 2.1e-192
Identity = 397/697 (56.96%), Positives = 480/697 (68.87%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSS------GGVAVS--------GGEIHQHRPRPWYPDERDGLI 60
           MAMPSGNV +SDK+ F ++      GG AV         GGEIHQH  R W PDERDG I
Sbjct: 1   MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 61  SWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVM 120
            W RGEFAASNAIID+LCHHLR VGE GEY+ VI CIQQRRCNW PVLHMQQYFSVAEV 
Sbjct: 61  YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 121 FALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAES 180
           +ALQQV  RR+QR  +  KVG K F+R G  FK         G R+E   KE   +  +S
Sbjct: 121 YALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK---------GQRME-VAKEGQNSGVDS 180

Query: 181 CNGGNSSSFVGSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKC 240
              GNS+    S + E+ S   EE  + GE GK+ DK S   ED KDT       SKP  
Sbjct: 181 --DGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDT------GSKP-- 240

Query: 241 AENLEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKT 300
                 +A + ES  E  + GC+SS ++  L S+Q++N +Q  A  P+TF  NE+FDGK 
Sbjct: 241 ------HAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKM 300

Query: 301 VNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQ-GPTYIVSKRPMKGHGREMIQ 360
           VNV+DGLKLYEEL DD EV  L+SLVNDLRA+GKRGQLQ G TY+ +KRPMKGHGREMIQ
Sbjct: 301 VNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQ 360

Query: 361 LGFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGD 420
           LG PIADA  DD N++G SKDRRIE IP LLQD I+ LV  QVMTVKPDSCIID YNEGD
Sbjct: 361 LGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGD 420

Query: 421 HSQPHVWPPWFGRPVGVLLLTECEMSFGR-VLGSDHSGNYRGAKTLSLAPGSLLVVQGKS 480
           HSQP +WPPWFG+PV ++ LTEC+++FGR V+ +DH G+YRG+  LSLAPGSLLV+QGKS
Sbjct: 421 HSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKS 480

Query: 481 ADFAKHAIPAMRKQRILVTLTK-SQPKRAGPADGQRTSLNLGSYSSWGPPSARSPN-ARP 540
           ADFAKHA+P++RKQRILVT TK  QPK++   + + +S ++   S WGPP +RSPN  R 
Sbjct: 481 ADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRH 540

Query: 541 CPGQKHYPMGPSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWP 600
             G KHY + P+TGVLP PPIRPQ+PP +G+ P+ V     P + FP  VPIP G   WP
Sbjct: 541 SAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWP 600

Query: 601 AAHPRHPPPRLPVPGTGVFLPPGASSAPSPQQMPNSA------VETSSLAEKENGPTESD 660
           AA PRHPPPRLPVPGTGVFLPP  S   S QQ+  +A      VET+S  EKENG  + +
Sbjct: 601 AA-PRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPN 660

Query: 661 HNGGASPGEKSEAKPQRQECNGSMDGSGSCKKTEEEQ 674
           H+   SP  + + K  +Q+CNGS+DG+GS +   +E+
Sbjct: 661 HH-TTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEE 669

BLAST of CmaCh04G000360 vs. TrEMBL
Match: A0A067L112_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04813 PE=4 SV=1)

HSP 1 Score: 670.2 bits (1728), Expect = 2.9e-189
Identity = 395/698 (56.59%), Positives = 479/698 (68.62%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGVAVSGG---EIHQ--HRPRPWYP-DERDGLISWFRGEFA 60
           MAMP GNV +SDK+ F + GG    GG   EIHQ  H  + W+P DERDG ISW RGEFA
Sbjct: 1   MAMPPGNVVISDKMQFPAGGGGVGGGGVGNEIHQQHHHRQQWFPVDERDGFISWLRGEFA 60

Query: 61  ASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTS 120
           A+NAIID+LCHHLRAVGEPGEYD+V+GCIQQRRCNW  VLHMQQYFSV EV+ ALQQV  
Sbjct: 61  AANAIIDSLCHHLRAVGEPGEYDLVVGCIQQRRCNWNAVLHMQQYFSVGEVILALQQVAL 120

Query: 121 RRQQR------FVDPMKVGSKLFRR-PGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESC 180
           R+QQ+      + D  KVG K F+R  G  F +      Q G   E  VKE + +  ES 
Sbjct: 121 RKQQQQQQQRYYYDQNKVGGKEFKRFSGAGFNK-----GQKGGGGE-VVKEAVNSRVESH 180

Query: 181 N-GGNSSSFVGSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKC 240
           +  GNSS   GS K E++     +S A G  GKL DK    AED KD   K   ++  K 
Sbjct: 181 SFDGNSSGNGGSEKFEEI-----KSGADG--GKLEDKSVALAEDKKDAAAKPHVDNPLKT 240

Query: 241 AENLEDNAS-NKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGK 300
           + N E+  S N E+  E  D+   SS ++    S  +++ +Q  A  P+TF   EI DGK
Sbjct: 241 SGNSEETLSGNLEADAEAVDE--QSSLKENDSHSSHNQSVKQTLAITPKTFVGGEIVDGK 300

Query: 301 TVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQ 360
            VNV+DGLKLYE+LLDD+EVSKL+SLVNDLRASG+RGQ  G TY+VSKRPMKGHGREMIQ
Sbjct: 301 MVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRASGRRGQFSGQTYVVSKRPMKGHGREMIQ 360

Query: 361 LGFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGD 420
           LG PIADA  +D N++G SKDRR+ESIP+LLQD+I+  V  Q+M VKPDSCIID YNEGD
Sbjct: 361 LGLPIADAPAEDENAAGTSKDRRVESIPTLLQDVIERFVNMQIMAVKPDSCIIDLYNEGD 420

Query: 421 HSQPHVWPPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSA 480
           HSQP++WPPWFG+P+ VL LTEC+++FGRV+ +D  G+Y+G+  L LAPGSLLV+QGKS 
Sbjct: 421 HSQPNMWPPWFGKPISVLFLTECDLTFGRVITADQPGDYKGSLKLPLAPGSLLVMQGKST 480

Query: 481 DFAKHAIPAMRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYS-SWGPPSARSPNARPCP 540
           D+AKHAIPA+RKQR++VT TKSQPK+   +DGQR   +  + S  WGP  +RSPN    P
Sbjct: 481 DYAKHAIPAIRKQRMIVTFTKSQPKKYAQSDGQRLVSSAAAPSPHWGPAPSRSPNHIRHP 540

Query: 541 GQKHYPMGPSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAA 600
             KHYP  P+TGVLP P IRPQ+PP NG+ P+ V      PMPFP  VPIP     WPAA
Sbjct: 541 VPKHYPAVPTTGVLPAPAIRPQIPPPNGVQPLFVTATVAAPMPFPAPVPIPPVSTGWPAA 600

Query: 601 HPRHPPPRL--PVPGTGVFL-PPGASSAPSPQQMPNSAVETS------SLAEKENGPTES 660
            PRHPP RL  PVPGTGVFL PPG+ +A S  Q+  +A+E +      SL +KENGP  S
Sbjct: 601 APRHPPNRLPVPVPGTGVFLPPPGSGNASSSPQISTAAIEANFPVEAVSLTDKENGPGIS 660

Query: 661 DHNGGASPGEKSEAKPQRQECNGSMDGSGSCKKTEEEQ 674
           +H   ASP EK + K QRQ+CNG  DG      TEEEQ
Sbjct: 661 NHVSCASPKEKLDGKTQRQDCNGIADGRA---VTEEEQ 680

BLAST of CmaCh04G000360 vs. TAIR10
Match: AT1G14710.1 (AT1G14710.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 438.7 bits (1127), Expect = 7.1e-123
Identity = 282/651 (43.32%), Positives = 379/651 (58.22%), Query Frame = 1

Query: 20  GGVAVSGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVV 79
           G V     ++    P  W PDERDG ISW R EFAA+NAIID+LC HL+AVG+  EY+ V
Sbjct: 7   GNVTTPSEKLQFPPPANWIPDERDGFISWLRAEFAAANAIIDSLCQHLQAVGDHNEYESV 66

Query: 80  IGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQ-----RFVDPMKVGSKLFRRP 139
           IG I  RR  W+ VL MQQ+F VA+V + LQQ+  +RQQ     R  +  +VG    RR 
Sbjct: 67  IGSIHHRRLAWSQVLTMQQFFPVADVSYNLQQIAWKRQQQMPPQRHYNSDQVGKFGARRS 126

Query: 140 GPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAT 199
           GP F + H                          GG      G R  + ++      N  
Sbjct: 127 GPGFNKHH-------------------------GGGG-----GYRGADSMARNGHNFNGV 186

Query: 200 GEDGKLNDKDSGSAEDIKD---THGKDQSNSKPKCAENLEDNASNKESQVEPTDDG-CSS 259
             D   + +++  A D+K       K   + KP+    +E      E+Q E   +  C+S
Sbjct: 187 NSDRVEHREEAKLASDVKALSVAEEKRDGSEKPRSDSKVEKKLEESETQEEIVKNHKCNS 246

Query: 260 SQRDKGLQSVQSR--NARQYAATAPRTFAANEIFDGKTVNVMDGLKLYEELLDDIEVSKL 319
             +D  L S Q +  N ++  A+  +TF   E++D K VNV++GLKLY+++LD  EVS+L
Sbjct: 247 GSKDNSLISEQKQEENDKECPASMAKTFVVQEMYDAKMVNVVEGLKLYDKMLDANEVSQL 306

Query: 320 LSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHDDANSSGLSKDRR 379
           +SLV +LR +G+RGQLQ   Y+  KRP +GHGREMIQLG PIAD   DD +     KDRR
Sbjct: 307 VSLVTNLRLAGRRGQLQSEAYVGYKRPNRGHGREMIQLGLPIADTPPDDDSI----KDRR 366

Query: 380 IESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTEC 439
           IE IPS L D+I+ LV +Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC
Sbjct: 367 IEPIPSALSDIIERLVSKQIIPVKPDACIIDFFSEGDHSQPHMFVPWFGRPISVLSLSEC 426

Query: 440 EMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQ 499
           + +FGRV+ S++ G+Y+G+  LSL PGS+L+V+GKSA+ AK+AI A RKQRIL++  KS+
Sbjct: 427 DYTFGRVIVSENPGDYKGSLKLSLTPGSVLLVEGKSANLAKYAIHATRKQRILISFIKSK 486

Query: 500 PKRAGPADGQRTSLNLGSYSSWGPPSARSPN---ARPCPGQKHYPMG-PSTGVLPVPPIR 559
           P+                 S+WGPP +RSPN     P    KHYP+  PSTGVLP P  R
Sbjct: 487 PRN----------------SNWGPPPSRSPNQHIRHPTGPPKHYPVVIPSTGVLPTPSHR 546

Query: 560 PQLPPQNGIPPIMVAPVAP--PPMPFPPSVPIPTGPPAWP--AAHPRH---PPPRLPVPG 619
              PP   + PI + P  P   PMPFP  V  PTGPP WP    HPRH   P PR+P+PG
Sbjct: 547 ---PPNGAVQPIFIPPSPPLASPMPFPGGV--PTGPPVWPLLPPHPRHQTAPQPRMPIPG 600

Query: 620 TGVFLPPGASSAPSPQQM-PNSAVETSSLAEKENGPTESDHNGGASPGEKS 648
           TGVFLPPG++   +         ++  +  E  NG  E + +G  S G++S
Sbjct: 607 TGVFLPPGSNQELADNSNGTEGKLDLKAKEEARNGFGEGECDG--SNGKQS 600

BLAST of CmaCh04G000360 vs. TAIR10
Match: AT4G02940.1 (AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 246.5 bits (628), Expect = 5.2e-65
Identity = 198/587 (33.73%), Positives = 284/587 (48.38%), Query Frame = 1

Query: 42  RDGLISWFRGEFAASNAIIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQ 101
           +D LISWFRGEFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+
Sbjct: 47  KDALISWFRGEFAAANAIIDAMCSHLRIAEEAVSGSEYEAVFAAIHRRRLNWIPVLQMQK 106

Query: 102 YFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKE 161
           Y S+AEV   LQ+V +++ +                    KQ+  +++      E    E
Sbjct: 107 YHSIAEVAIELQKVAAKKAE------------------DLKQKKTEEEAEEDLKEVVATE 166

Query: 162 EMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKD 221
           E     E  NG   +    +  VE V +    S+ T         DSGS +D+  T   D
Sbjct: 167 EEEVKKECFNGEKVTENDVNGDVEDVEDDSPTSDIT---------DSGSHQDVHQTVVAD 226

Query: 222 QSNSKPKCAENLEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAA 281
            ++   +   +  ++   +  +++P                              + F A
Sbjct: 227 TAH---QIICHSHEDCDARSCEIKPI-----------------------------KGFQA 286

Query: 282 NEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKG 341
            E   G TVNV+ GLKLYEELL + E+SKLL  V +LR +G  G+L G ++I+  + +KG
Sbjct: 287 KEQVKGHTVNVVKGLKLYEELLKEDEISKLLDFVAELREAGINGKLAGESFILFNKQIKG 346

Query: 342 HGREMIQLGFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLV-WEQVMTVK-PDSC 401
           + RE+IQLG PI      D NS+  +    IE IP LL+ +ID  V W  +   K P+ C
Sbjct: 347 NKRELIQLGVPIFGHVKADENSNDTNNSVNIEPIPPLLESVIDHFVTWRLIPEYKRPNGC 406

Query: 402 IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGS 461
           +I+F+ EG++SQP + PP   +P+  L+L+E  M++GR+L SD+ GN+RG  TLSL  GS
Sbjct: 407 VINFFEEGEYSQPFLKPPHLEQPISTLVLSESTMAYGRILSSDNEGNFRGPLTLSLKQGS 466

Query: 462 LLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSAR 521
           LLV++G SAD A+H +   + +R+ +T  + +P          +  N G  + W P    
Sbjct: 467 LLVMRGNSADMARHVMCPSQNKRVSITFFRIRPDTYHNHSQPNSPRNDGVMTMWQP---- 526

Query: 522 SPNARPCP---GQKH-YPMGPSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFP--- 581
                P P   G  H   M P  GVL  PP+    PP          PV P  +P P   
Sbjct: 527 -YQMTPTPFLNGYDHSIDMMPKLGVLR-PPMVMMAPP----------PVQPMILPSPNVM 557

Query: 582 -----PSVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGASSAPS 612
                  V +P         H +H PPR       + LPP ASS+P+
Sbjct: 587 GTGGGTGVFLPWASVNSSRKHVKHLPPRAQKKRL-LPLPPAASSSPA 557

BLAST of CmaCh04G000360 vs. TAIR10
Match: AT2G48080.1 (AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 162.5 bits (410), Expect = 9.9e-40
Identity = 105/316 (33.23%), Positives = 161/316 (50.95%), Query Frame = 1

Query: 229 NLEDNASNKESQVEPTDDGCSSSQR----DKGLQSVQSRNARQYAATAPRTFAANEIFDG 288
           +L+D+  +     + TD G    +      K     +SR A     +  + F+A E   G
Sbjct: 102 HLDDDHDDDSPSSDITDGGSREEETLSICCKHEDECESRGASLLKQS--KRFSAKEHVRG 161

Query: 289 KTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMI 348
            T NV+ GLKLY+++    ++SKLL  +N LR +G+  QL G T+++  +  KG  RE++
Sbjct: 162 HTANVVKGLKLYQDVFTRPQLSKLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKRELL 221

Query: 349 QLGFPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLV-WEQVMTVK-PDSCIIDFYN 408
           QLG PI     D         +  +E IP+L+Q +ID L+ W  +   K P+ C+I+F++
Sbjct: 222 QLGVPIFGNTTD---------EHSVEPIPTLVQSVIDHLLQWRLIPEYKRPNGCVINFFD 281

Query: 409 EGDHSQPHVWPPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQG 468
           E +HSQP   PP   +P+  L+L+E  M FG  LG D+ GN+RG+ TL L  GSLLV++G
Sbjct: 282 EDEHSQPFQKPPHVDQPISTLVLSESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRG 341

Query: 469 KSADFAKHAIPAMRKQRILVTLTKSQPKRA--------------------GPADGQRTSL 519
            SAD A+H +     +R+ +T  K +P                        PA  +R   
Sbjct: 342 NSADMARHVMCPSPNKRVAITFFKLKPDSGKVQPPPTLWRPGTPSPLVMLAPAP-KRLDA 401

BLAST of CmaCh04G000360 vs. TAIR10
Match: AT2G17970.1 (AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 134.0 bits (336), Expect = 3.8e-31
Identity = 83/262 (31.68%), Positives = 133/262 (50.76%), Query Frame = 1

Query: 228 ENLEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTV 287
           E  E+    ++S  +  D     +     L   Q  N R       + F   E   GK V
Sbjct: 151 EEEEEEEEERDSSRKGFDASSMKTPEKPKLSRDQRENLRLINVKRKKDFICLERVKGKIV 210

Query: 288 NVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLG 347
           NV+DGL+L+  +   +E  +++  V  L+  G+RG+L+  T+    + M+G GRE IQ G
Sbjct: 211 NVLDGLELHTGVFSAVEQKRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFG 270

Query: 348 FPIADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVM--TVKPDSCIIDFYNEGD 407
                A     N  G+ +   ++ +P L + +I  L+   V+  T  PDSCI++ Y+EGD
Sbjct: 271 CCYNYAPDRAGNPPGILQREEVDPLPHLFKVIIRKLIKWHVLPPTCVPDSCIVNIYDEGD 330

Query: 408 HSQPHVWPPWFGRP-VGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKS 467
              PH+    F RP   +  L+EC++ FG  L  +  G++ G+ ++ L  GS+LV+ G  
Sbjct: 331 CIPPHIDNHDFLRPFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNG 390

Query: 468 ADFAKHAIPAMRKQRILVTLTK 487
           AD AKH +PA+  +RI +T  K
Sbjct: 391 ADVAKHCVPAVPTKRISITFRK 412

BLAST of CmaCh04G000360 vs. TAIR10
Match: AT1G48980.1 (AT1G48980.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 118.6 bits (296), Expect = 1.6e-26
Identity = 80/261 (30.65%), Positives = 129/261 (49.43%), Query Frame = 1

Query: 230 LEDNASNKESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNV 289
           LED  S +E   + + +   SS  +  L   Q  + R       R F   E  +G+ VN+
Sbjct: 41  LEDELSEEEDHKDSSREAFGSSLENHKLSRKQRTHIRAINVKRKRDFVCLEKVNGELVNI 100

Query: 290 MDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFP 349
           ++GL+L+ E+ +  E  +++  V +L+   ++G+L+           +G GR  IQ G  
Sbjct: 101 LEGLELHTEVFNAAEQRRIVDKVCELQEKVQKGELKRAF------TAQGKGRSTIQFGCC 160

Query: 350 IADAAHDDANSSGLSKDRRIESIPSLLQDLIDCLVWEQVM--TVKPDSCIIDFYNEGDHS 409
                    N +G+ K   ++ +P L + +I  LV   V+  T  PD C+++ Y+EGD  
Sbjct: 161 FNYRTSKTGNLAGILKHETVDPLPHLFKVIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCI 220

Query: 410 QPHVWPPWFGRPV-GVLLLTECEMSFGRVLGSDHSGNYRGAK-TLSLAPGSLLVVQGKSA 469
            PH+    F RP   V  L+EC + FG  L  + +G Y G   +L L  GS+LV+ G  A
Sbjct: 221 PPHIDHHDFLRPFCTVSFLSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGA 280

Query: 470 DFAKHAIPAMRKQRILVTLTK 487
           D AKH +P +  +RI +T  K
Sbjct: 281 DVAKHCVPEVPTKRISITFRK 295

BLAST of CmaCh04G000360 vs. NCBI nr
Match: gi|449449076|ref|XP_004142291.1| (PREDICTED: uncharacterized protein LOC101210274 isoform X2 [Cucumis sativus])

HSP 1 Score: 1065.1 bits (2753), Expect = 5.7e-308
Identity = 559/692 (80.78%), Positives = 608/692 (87.86%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGVAVSGG--EIHQHRPRPWYPDERDGLISWFRGEFAASNA 60
           MAMPSGNVGV DKV FQS GGVAVSGG  EIHQH PRPW+PDERDG ISW RGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQQYFSVAEVM+ALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGS 180
           R++DP+KVG KL+RRPGP FKQQ       GHR EATVKEE +TCAESCNGGNSS+FV S
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQ------GHRAEATVKEETITCAESCNGGNSSTFVSS 180

Query: 181 RKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASNKE 240
           RKVEQVSNTC+ES A+GED KL++KDSGSA D KDTHGKDQSN K K AENLEDNA NK+
Sbjct: 181 RKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKD 240

Query: 241 SQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLYEE 300
           SQVEP DDGCSSS RDK LQSVQS+N +QYAAT PRTF A+E+FDGK VNVMDGLKL+EE
Sbjct: 241 SQVEP-DDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEE 300

Query: 301 LLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHDDA 360
           LLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA H+D 
Sbjct: 301 LLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHEDD 360

Query: 361 NSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGR 420
           NS GLSKDRRIE IPSLLQDLID LV +QVMTVKPDSCIIDFYNEGDHSQPHVWP WFGR
Sbjct: 361 NSLGLSKDRRIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGR 420

Query: 421 PVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQ 480
           PVGVLLLTECE++FGRV+G+DHSGNYRGA  LSL PG+LLVVQGKSADFAKHA+PA+RKQ
Sbjct: 421 PVGVLLLTECEITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQ 480

Query: 481 RILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCPGQKHYPMGPSTGVL 540
           RILVTLTKSQPKRA PADGQRTSLN+G++S WGPPSARSPN R  PGQK YP  PSTGVL
Sbjct: 481 RILVTLTKSQPKRAAPADGQRTSLNVGTFSGWGPPSARSPNPRLSPGQKPYPTVPSTGVL 540

Query: 541 PVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRLPVPGT 600
           PVPPIRPQ+ P NGIPP++V PVA  PMPF P VPIPTGP AWP AH RHPPPRLPVPGT
Sbjct: 541 PVPPIRPQMAPPNGIPPLIVPPVA-SPMPFTP-VPIPTGPSAWPTAHTRHPPPRLPVPGT 600

Query: 601 GVFL-PPGASSAPSP---QQMPNSAVETSSLAEKENGPTESDHNGGASPGEKSEAKPQRQ 660
           GVFL PPG+SSAP+P   QQ+P S +ET SL+EKENG T+SDH+ G  PGEK +AK QRQ
Sbjct: 601 GVFLPPPGSSSAPTPSPQQQLPISNIETGSLSEKENGLTKSDHSSGTFPGEKPDAKAQRQ 660

Query: 661 ECNGSMDGSGSCKKTEEEQPKQQQEEEEKSEN 687
           ECNGS+DGSG+ K  EEEQ +QQQEEE+ ++N
Sbjct: 661 ECNGSIDGSGNDKVKEEEQ-QQQQEEEQSAQN 682

BLAST of CmaCh04G000360 vs. NCBI nr
Match: gi|778698245|ref|XP_011654491.1| (PREDICTED: uncharacterized protein LOC101210274 isoform X1 [Cucumis sativus])

HSP 1 Score: 1059.3 bits (2738), Expect = 3.2e-306
Identity = 559/696 (80.32%), Positives = 608/696 (87.36%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGVAVSGG--EIHQHRPRPWYPDERDGLISWFRGEFAASNA 60
           MAMPSGNVGV DKV FQS GGVAVSGG  EIHQH PRPW+PDERDG ISW RGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQQYFSVAEVM+ALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGS 180
           R++DP+KVG KL+RRPGP FKQQ       GHR EATVKEE +TCAESCNGGNSS+FV S
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQ------GHRAEATVKEETITCAESCNGGNSSTFVSS 180

Query: 181 RKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASNKE 240
           RKVEQVSNTC+ES A+GED KL++KDSGSA D KDTHGKDQSN K K AENLEDNA NK+
Sbjct: 181 RKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKD 240

Query: 241 SQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLYEE 300
           SQVEP DDGCSSS RDK LQSVQS+N +QYAAT PRTF A+E+FDGK VNVMDGLKL+EE
Sbjct: 241 SQVEP-DDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEE 300

Query: 301 LLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHDDA 360
           LLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA H+D 
Sbjct: 301 LLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHEDD 360

Query: 361 NSSGLSK----DRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPP 420
           NS GLSK    DRRIE IPSLLQDLID LV +QVMTVKPDSCIIDFYNEGDHSQPHVWP 
Sbjct: 361 NSLGLSKVNFTDRRIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPS 420

Query: 421 WFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPA 480
           WFGRPVGVLLLTECE++FGRV+G+DHSGNYRGA  LSL PG+LLVVQGKSADFAKHA+PA
Sbjct: 421 WFGRPVGVLLLTECEITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKSADFAKHALPA 480

Query: 481 MRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCPGQKHYPMGPS 540
           +RKQRILVTLTKSQPKRA PADGQRTSLN+G++S WGPPSARSPN R  PGQK YP  PS
Sbjct: 481 IRKQRILVTLTKSQPKRAAPADGQRTSLNVGTFSGWGPPSARSPNPRLSPGQKPYPTVPS 540

Query: 541 TGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRLP 600
           TGVLPVPPIRPQ+ P NGIPP++V PVA  PMPF P VPIPTGP AWP AH RHPPPRLP
Sbjct: 541 TGVLPVPPIRPQMAPPNGIPPLIVPPVA-SPMPFTP-VPIPTGPSAWPTAHTRHPPPRLP 600

Query: 601 VPGTGVFL-PPGASSAPSP---QQMPNSAVETSSLAEKENGPTESDHNGGASPGEKSEAK 660
           VPGTGVFL PPG+SSAP+P   QQ+P S +ET SL+EKENG T+SDH+ G  PGEK +AK
Sbjct: 601 VPGTGVFLPPPGSSSAPTPSPQQQLPISNIETGSLSEKENGLTKSDHSSGTFPGEKPDAK 660

Query: 661 PQRQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSEN 687
            QRQECNGS+DGSG+ K  EEEQ +QQQEEE+ ++N
Sbjct: 661 AQRQECNGSIDGSGNDKVKEEEQ-QQQQEEEQSAQN 686

BLAST of CmaCh04G000360 vs. NCBI nr
Match: gi|659109443|ref|XP_008454723.1| (PREDICTED: uncharacterized protein LOC103495063 isoform X2 [Cucumis melo])

HSP 1 Score: 1055.8 bits (2729), Expect = 3.5e-305
Identity = 559/694 (80.55%), Positives = 607/694 (87.46%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGG-VAVSGG--EIHQHR-PRPWYPDERDGLISWFRGEFAAS 60
           MA+PSGNVGV DKV FQS GG VAVSGG  EIHQH  PRPW+PDERDG ISW RGEFAAS
Sbjct: 1   MALPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHHPRPWFPDERDGFISWLRGEFAAS 60

Query: 61  NAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRR 120
           NA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVM+ALQQVTSRR
Sbjct: 61  NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120

Query: 121 QQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFV 180
           QQR++DP+KVG KL+RRPGP FKQQ       GHR EATVKEE +TCAESCNGGNSSSFV
Sbjct: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQ------GHRAEATVKEETITCAESCNGGNSSSFV 180

Query: 181 GSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASN 240
            SRKVEQVSNTC+ES A+GED KL++KDSGSAED KDTHGKDQSNSK KCAENLEDNA N
Sbjct: 181 SSRKVEQVSNTCDESKASGEDEKLSEKDSGSAEDNKDTHGKDQSNSKTKCAENLEDNAGN 240

Query: 241 KESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLY 300
           K+SQVEP DDGCSSS RDK LQSVQS+N +Q+AAT PRTF ANE+FDGK VNVMDGLKL+
Sbjct: 241 KDSQVEP-DDGCSSSHRDKELQSVQSQNGKQHAATTPRTFVANEMFDGKMVNVMDGLKLF 300

Query: 301 EELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHD 360
           EELLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRP KGHGREMIQLGFPIADA ++
Sbjct: 301 EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPTKGHGREMIQLGFPIADAPYE 360

Query: 361 DANSSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWF 420
           D NS  LSKDRRIE IPSLLQDLID LV +QVMTVKPDSCIIDFYNEGDHSQPHVWP WF
Sbjct: 361 DDNSLALSKDRRIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPSWF 420

Query: 421 GRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMR 480
           GRPVGVLLLTECE++FGRV+G+DHSGNYRGA  LSL PG+LLVVQGKSADFAKHAIPA+R
Sbjct: 421 GRPVGVLLLTECEITFGRVIGTDHSGNYRGAIKLSLTPGNLLVVQGKSADFAKHAIPAIR 480

Query: 481 KQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCPGQKHYPMGPSTG 540
           KQRILVTLTKSQPKRA PADGQR+SLN+G++S WGPPSARSPN R  PGQK Y   PSTG
Sbjct: 481 KQRILVTLTKSQPKRASPADGQRSSLNVGTFSGWGPPSARSPNPRLSPGQKPYSNVPSTG 540

Query: 541 VLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRLPVP 600
           VLPVPPIRPQ+ P NGIPP++V  VA PPMPF P VPIPTGP  WP AH RHPPPRLPVP
Sbjct: 541 VLPVPPIRPQMAPPNGIPPLIVPSVA-PPMPFTP-VPIPTGPSTWPTAHTRHPPPRLPVP 600

Query: 601 GTGVFL-PPGASSAPSP---QQMPNSAVETSSLAEKENGPTESDHNGGASPGEKSEAKPQ 660
           GTGVFL PPG+SSAPSP   QQ+PNS +E  SL+EKENG T+SDHN G  PGEK EAK Q
Sbjct: 601 GTGVFLPPPGSSSAPSPSPQQQLPNSNIEMGSLSEKENGLTKSDHNSGTFPGEKPEAKTQ 660

Query: 661 RQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSEN 687
           RQECNG++DGSG+ K  EEEQ +QQQEEE+ ++N
Sbjct: 661 RQECNGTIDGSGNDKVKEEEQ-QQQQEEEQSAQN 684

BLAST of CmaCh04G000360 vs. NCBI nr
Match: gi|659109441|ref|XP_008454722.1| (PREDICTED: uncharacterized protein LOC103495063 isoform X1 [Cucumis melo])

HSP 1 Score: 1050.0 bits (2714), Expect = 1.9e-303
Identity = 559/698 (80.09%), Positives = 607/698 (86.96%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGG-VAVSGG--EIHQHR-PRPWYPDERDGLISWFRGEFAAS 60
           MA+PSGNVGV DKV FQS GG VAVSGG  EIHQH  PRPW+PDERDG ISW RGEFAAS
Sbjct: 1   MALPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHHPRPWFPDERDGFISWLRGEFAAS 60

Query: 61  NAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRR 120
           NA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVM+ALQQVTSRR
Sbjct: 61  NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120

Query: 121 QQRFVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFV 180
           QQR++DP+KVG KL+RRPGP FKQQ       GHR EATVKEE +TCAESCNGGNSSSFV
Sbjct: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQ------GHRAEATVKEETITCAESCNGGNSSSFV 180

Query: 181 GSRKVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASN 240
            SRKVEQVSNTC+ES A+GED KL++KDSGSAED KDTHGKDQSNSK KCAENLEDNA N
Sbjct: 181 SSRKVEQVSNTCDESKASGEDEKLSEKDSGSAEDNKDTHGKDQSNSKTKCAENLEDNAGN 240

Query: 241 KESQVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLY 300
           K+SQVEP DDGCSSS RDK LQSVQS+N +Q+AAT PRTF ANE+FDGK VNVMDGLKL+
Sbjct: 241 KDSQVEP-DDGCSSSHRDKELQSVQSQNGKQHAATTPRTFVANEMFDGKMVNVMDGLKLF 300

Query: 301 EELLDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHD 360
           EELLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRP KGHGREMIQLGFPIADA ++
Sbjct: 301 EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPTKGHGREMIQLGFPIADAPYE 360

Query: 361 DANSSGLSK----DRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVW 420
           D NS  LSK    DRRIE IPSLLQDLID LV +QVMTVKPDSCIIDFYNEGDHSQPHVW
Sbjct: 361 DDNSLALSKVNFTDRRIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVW 420

Query: 421 PPWFGRPVGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAI 480
           P WFGRPVGVLLLTECE++FGRV+G+DHSGNYRGA  LSL PG+LLVVQGKSADFAKHAI
Sbjct: 421 PSWFGRPVGVLLLTECEITFGRVIGTDHSGNYRGAIKLSLTPGNLLVVQGKSADFAKHAI 480

Query: 481 PAMRKQRILVTLTKSQPKRAGPADGQRTSLNLGSYSSWGPPSARSPNARPCPGQKHYPMG 540
           PA+RKQRILVTLTKSQPKRA PADGQR+SLN+G++S WGPPSARSPN R  PGQK Y   
Sbjct: 481 PAIRKQRILVTLTKSQPKRASPADGQRSSLNVGTFSGWGPPSARSPNPRLSPGQKPYSNV 540

Query: 541 PSTGVLPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPR 600
           PSTGVLPVPPIRPQ+ P NGIPP++V  VA PPMPF P VPIPTGP  WP AH RHPPPR
Sbjct: 541 PSTGVLPVPPIRPQMAPPNGIPPLIVPSVA-PPMPFTP-VPIPTGPSTWPTAHTRHPPPR 600

Query: 601 LPVPGTGVFL-PPGASSAPSP---QQMPNSAVETSSLAEKENGPTESDHNGGASPGEKSE 660
           LPVPGTGVFL PPG+SSAPSP   QQ+PNS +E  SL+EKENG T+SDHN G  PGEK E
Sbjct: 601 LPVPGTGVFLPPPGSSSAPSPSPQQQLPNSNIEMGSLSEKENGLTKSDHNSGTFPGEKPE 660

Query: 661 AKPQRQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSEN 687
           AK QRQECNG++DGSG+ K  EEEQ +QQQEEE+ ++N
Sbjct: 661 AKTQRQECNGTIDGSGNDKVKEEEQ-QQQQEEEQSAQN 688

BLAST of CmaCh04G000360 vs. NCBI nr
Match: gi|645228327|ref|XP_008220943.1| (PREDICTED: uncharacterized protein LOC103320980 [Prunus mume])

HSP 1 Score: 721.1 bits (1860), Expect = 2.0e-204
Identity = 411/699 (58.80%), Positives = 486/699 (69.53%), Query Frame = 1

Query: 1   MAMPSGNVGVSDKVPFQSSGGV-AVSGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAI 60
           M MPSGNV +SDK+ F S GG  AV GGEI QH  R W+PDERDG ISW RGEFAA+NAI
Sbjct: 1   MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIPQHH-RQWFPDERDGFISWLRGEFAAANAI 60

Query: 61  IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQR 120
           ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNW PVLHMQQYFSVAEV++ALQ V  RRQQR
Sbjct: 61  IDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQR 120

Query: 121 FVDPMKVGSKLFRRPGPAFKQQHQQDQQHGHRLEATVKEEMVTCAESCNGGNSSSFVGSR 180
           + DP+K G+K F+R G  F +  Q       R EA  +    T     N GNSS  V   
Sbjct: 121 YYDPVKAGAKEFKRSGVGFNKGQQ-------RAEAFKEGHNSTLESHSNDGNSSGVVAPE 180

Query: 181 KVEQVSNTCEESNATGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLEDNASNKES 240
           K E+ S   EE    GE GKLNDK    A + KD   K Q +S  +   N +   S    
Sbjct: 181 KFERGSEVGEEVEPGGEVGKLNDKGLAPAGEKKDALTKPQEDSNLRSFGNSQGTISENSE 240

Query: 241 QVEPTDDGCSSSQRDKGLQSVQSRNARQYAATAPRTFAANEIFDGKTVNVMDGLKLYEEL 300
                 DGC+ S +     S+Q +N +Q  +  P+TF  NE  DGKTVN +DGLKLYE+ 
Sbjct: 241 PEVVEVDGCTPSSKVNESHSIQIQNQKQNLSIVPKTFIGNETSDGKTVNAVDGLKLYEDF 300

Query: 301 LDDIEVSKLLSLVNDLRASGKRGQLQGPTYIVSKRPMKGHGREMIQLGFPIADAAHDDAN 360
           L D EVSKLLSLVNDLRA+GKR QLQG TY+VSKRPMKGHGREMIQLG PIADA  +D  
Sbjct: 301 LGDTEVSKLLSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEI 360

Query: 361 SSGLSKDRRIESIPSLLQDLIDCLVWEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRP 420
           S+G SKDR+IE IPSLLQD+ID LV   V+TVKPDSCIID YNEGDHSQPH WP WFGRP
Sbjct: 361 SAGTSKDRKIEPIPSLLQDVIDRLVGMHVVTVKPDSCIIDVYNEGDHSQPHTWPSWFGRP 420

Query: 421 VGVLLLTECEMSFGRVLGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQR 480
           V  L LTEC+M+FGRVL  DH G+YRG+  LSL PGS+L++QGKSADFAKHAIP++RKQR
Sbjct: 421 VCALYLTECDMTFGRVLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQR 480

Query: 481 ILVTLTKSQPKRAGPADGQRTSLNLGSYSS-WGPPSARSPN-ARPCPGQKHYPMGPSTGV 540
           ILVT TKSQPK++  +DGQR      + SS WGPP +RSPN  R   G KHY   P+TGV
Sbjct: 481 ILVTFTKSQPKKSTTSDGQRFPAPAPAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGV 540

Query: 541 LPVPPIRPQLPPQNGIPPIMVAPVAPPPMPFPPSVPIPTGPPAWPAAHPRHPPPRLPVPG 600
           LP PPIR QLPPQNGI P+ V     P +PF  +VPIP G   WPAA PRHPPPR+P+PG
Sbjct: 541 LPAPPIRSQLPPQNGIQPLFVPAPVGPAIPFAAAVPIPPGSAGWPAA-PRHPPPRIPLPG 600

Query: 601 TGVFL-PPGASSAPSPQQMPNSA------VETSSLAEKENGPTESDHNGGASPGEKSEAK 660
           TGVFL PPG+ ++ +PQQ+P +A      VET S  +K+NG  +S+H+  ASP  KS+ K
Sbjct: 601 TGVFLPPPGSGNSSAPQQLPGTATEMSPTVETPSPRDKDNGSGKSNHSTSASPKGKSDGK 660

Query: 661 PQRQECNGSMDGSGSCKKTEEEQPKQQQEEEEKSENVEA 690
             RQ+CNGS +G+GS +   +E+ +Q  ++   S    A
Sbjct: 661 AHRQDCNGSAEGTGSGRTAVKEEEQQTSDKTAASNQAGA 690
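
The percentages in each HSP header above are the match counts divided by the alignment length (for example, 105/316 gives 33.23%). The following is a minimal Python sketch, not part of the database output, that parses one such statistics line and recomputes the percentages as a consistency check. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers introduced only for this illustration, and the regular expression assumes the line layout shown in the alignments above.

```python
import re

# Hypothetical pattern for an HSP statistics line as laid out above.
# The second label is matched loosely (any word starting with "Pos")
# so that spelling variants of the label are tolerated.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


if __name__ == "__main__":
    line = ("Identity = 105/316 (33.23%), "
            "Positives = 161/316 (50.95%), Query Frame = 1")
    stats = parse_hsp_stats(line)
    for label, (count, length, reported) in stats.items():
        # Recompute the percentage and compare it with the reported value.
        print(label, count, length, reported, percent(count, length))
```

Comparing the recomputed value with the reported one is a quick way to catch column-splitting mistakes when these records are scraped into tables.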

The following BLAST results are available for this feature:
Match Name          E-value     Identity (%)  Description
W9S2C1_9ROSA        2.1e-195    56.27         Uncharacterized protein OS=Morus notabilis GN=L484_019288 PE=4 SV=1
F6HFA9_VITVI        2.7e-195    56.06         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06270 PE=4 SV=...
A0A061EA95_THECC    8.6e-194    57.04         Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma...
A0A061E8L7_THECC    2.1e-192    56.96         Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma...
A0A067L112_JATCU    2.9e-189    56.59         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04813 PE=4 SV=1

Match Name          E-value     Identity (%)  Description
AT1G14710.1         7.1e-123    43.32         hydroxyproline-rich glycoprotein family protein
AT4G02940.1         5.2e-65     33.73         oxidoreductase, 2OG-Fe(II) oxygenase family protein
AT2G48080.1         9.9e-40     33.23         oxidoreductase, 2OG-Fe(II) oxygenase family protein
AT2G17970.1         3.8e-31     31.68         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT1G48980.1         1.6e-26     30.65         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...

Match Name                          E-value     Identity (%)  Description
gi|449449076|ref|XP_004142291.1|    5.7e-308    80.78         PREDICTED: uncharacterized protein LOC101210274 isoform X2 [Cucumis sativus]
gi|778698245|ref|XP_011654491.1|    3.2e-306    80.32         PREDICTED: uncharacterized protein LOC101210274 isoform X1 [Cucumis sativus]
gi|659109443|ref|XP_008454723.1|    3.5e-305    80.55         PREDICTED: uncharacterized protein LOC103495063 isoform X2 [Cucumis melo]
gi|659109441|ref|XP_008454722.1|    1.9e-303    80.09         PREDICTED: uncharacterized protein LOC103495063 isoform X1 [Cucumis melo]
gi|645228327|ref|XP_008220943.1|    2.0e-204    58.80         PREDICTED: uncharacterized protein LOC103320980 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR027450    AlkB-like
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh04G000360.1    CmaCh04G000360.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term     IPR Description                                        Source     Source Term           Source Description                               Alignment
IPR027450    Alpha-ketoglutarate-dependent dioxygenase AlkB-like    GENE3D     G3DSA:2.60.120.590    -                                                coord: 290..481; score: 6.
IPR027450    Alpha-ketoglutarate-dependent dioxygenase AlkB-like    PFAM       PF13532               2OG-FeII_Oxy_2                                   coord: 364..485; score: 3.
None         No IPR available                                       PANTHER    PTHR31447             FAMILY NOT NAMED                                 coord: 1..650; score: -
None         No IPR available                                       PANTHER    PTHR31447:SF0         HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE PROTEIN    coord: 1..650; score: -
None         No IPR available                                       unknown    SSF51197              Clavaminate synthase-like                        coord: 282..484; score: 4.94