Cp4.1LG03g10080 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG03g10080
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCp4.1LG03: 11844846 .. 11848623 (+)
RNA-Seq ExpressionCp4.1LG03g10080
SyntenyCp4.1LG03g10080
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GCGTTCCTTCGATTCAAGCCCAGAGATCGGCTATGGCGGTCGCAGCCTCAACAGTCAACGGATTCAGCCTCACTCCGGTATCCAAAGCCGACGAAAACTACCACAGACCCACTGACCTTAAAGCTTTCGACGACACGAAAGCCGGCGTCAAAGGCTTAGTCGACGCCGGAATCACCGAAATTCCACGTATCTTTTACAGCCCACCGGAGGATTTTAACTCCGATGATGTTTCCAGCGAAACCCAAATCCATATACCAGTGATTGACCTCGACCTTATCGACAAATCCACCGTCGACAGAGTCAGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAGTTGATTAACCATGGGATTCCGGTGGACGTTCTTGAAGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACAGAATCAAGGAAACAATATTACACTCGTGACCTCACGATGCCTTTGATTTACAACAGTAATTTCGATCTGTACTCTGCTACGACCACCAATTGGAGGGATACATTTGGGTATATAAGTGCCCCAAATCTCCACAATCCGGCAAGCCTGCCGGAAATTTGCAGGTAACTGTGGTGATAATTTTGGTTTTTTAAAACCTTTTACTAACCAATGAAGCTCTGTTTGAAATTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGGTTGGGAAATTACTGTTTGAATTGCTGTCGGAAGCTCTGGGCCTGAACCCAAATTATTTGAACGATATAGACTGCAACGAAGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACTGACAATGACTTCATCACTGTGCTATTGCAAGACCACATCGGCGGCCTACAGATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCGGTCGCCGGAGCTTTAGTCGTGAACATCGGAGATCTTATGCAGGTAAGTTTATGCAAAAAAAAGAATGCAGCTAATAATTAAGTGCCCTCTAATTCATTTGGGGTTTGAGTGCAGCTAATAACGAATGACAAATTCAAAAGCGTGAAGCACAGGGTTCTTGCAAACAAGGAGGGGCCGCGAGTGTCAGTGGCTGGAGTCTTTTCGACGCTTTCTTTTCCAAACTCGAAATTGTATGGACCCATCAAGGAGTTGTTGAGTGAAGATAATCCTGCAATATACAGAGAAACCACTGTCAGAGACTTTAGTATCCAGTTTCGTTCAGTTGGCCTTGGAACTTCTACTTTGCAGCATTTCAAGCTGAGTGAAGCAGATGCTTAATTCTGGGTGTTCTCTGGTTGATTGAACTTCTAATCGTAATCATAATCGTAATCGTAAATCTCTGTAGCTCAAAAACTAAATCGAAGCTTGTTCCGATGACACACCGTCGATGTAAGATTGTAAGCTGAATATGAATAAATTAAGACTTAGCTTGATATTATAAAAAGAAAATAGATGAGTTGATTTTTTTTTGAACACTTATTAACCACAACCACAAACTTAGTGATTGGCATATTTGAATGAATAATGGTGTTGCAACGTAAGCTCTTAGGAAGGATAGAACTTTCAACTAAAGAATATGTAGCACTTATTCTAACTTAGTGAACAAATCAGTTGTGTGAAAAACGAATTTTGAAGCCAATTTCAATCGTGATTGAGTATCGGGTCACATCTAATAGAAAATGAAAAACTAAGCGCAATCTGAAAAGTAATGTTATAGCCTAAAAATGAATAAATAACAACCATAACATTCATAGTATAATCCAGTAAGAAGAATTAAAAACTAGTAGCATCTCTTTTTGAGACATTTTAGGCCTGGTATGGTTGGTATATACATGATTCTGATGAAAATAACTAAAAATGGATTTGAATCAGGCATAAGTCCTAGAGAATACATATGATCAAAGATATCCAACCATAAATAACTTGACTTGTATAAGCATGATCCAAATCGGACACCTCTTAGTCAAGTGAAAAGGAGTTATTCATATCCACTCATTCAGTCAAACTTGGAAAATTCATTACCAGAAGAAAACTAGAACCTCGAACGAAGCTTCCCATCATTAACAGTATCCCTTCTGCATTTATATGCAACAGCTGCCACCACCGATCAAGGCACTTCACAATTTTGAAAAGATTCAGTTCTTTTGATTTGAATTCAATCCGTTCCGTCGAAGCCAAGAGAATGGACAATGACGGTCGCAGCCTTAGGATCAACGGAGTCGATGGAATCAACCTCACTCCGTTATCCAAGGCCGACGAAAACTACTACCGACCCACTGAACTCAAGGCTTTCGACGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCGGAATTACCGAAATTCCCCGTATCTTTTACCACCCACTGGAGGAAGATGACTCCGGCGAAACCCAAATCCGTATTCCAGTGATAGACCTCGAAGGCGTCGCCAAAGATTCACTCAAACGCAAAGACATTGTTGAACAAATCCGTGAAGCTTCAGAGGAATTGGGTTTTTTCCAGTTGATTAACCATGGAATTCCGGCAAGCGTTCTTGAAGAAATGAGAGAATCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATTCTATACACGGGACCTCATGAAGCCATTTGTTTACAATAGTAACTTCGATCTATACTCTGCAGCGACCACCAATTGGAGGGACACCTTTAGCCATGCTAGTGCTCCAAACCCTCCCAATCCACAGGACTTGCCGGAAATTTGCAGGTATATGTGCTGATTTTGAACCAAAACAAAATACTCTGTTACTAACCTCTGTTTTGTGCAGAGATATTCTGGTGGATTACTCAAAACGGGTGATGGAGATTGGAAAATTACTGTTTGAATTGCTGTCGGAGGCTCTCGGTCTGAACCCAAATTACTTGAACAACATAGGCTGCAGTGATGGGCTTGCATTTGTATACCACTATTACCCTGCATGCCCACAGCCGAAATCGACCATTGGCATATCCGAGCACTCTGACACTGACTTCATCACAGTGCTGTTACAAGACCACGTCGGCGGCCTACAGATTCGTCATCAGAACAAATGGATTGACGTGTGTCCTGTCGCCGGTGCGTTAGTCGTCAACATCGGAGATCTTATGCAGGTAAGTCTGCAGCATCCTCCCTGTTTCGACGCACATATGGATCGCCAAACTATGTTGGACACCCTTTAATTAAATTGGATTTTTTGATGCAGCTAATAACAAATGACAGGTTCAAAAGCGTTAATCATAGGGTTGTGTCGAAACATGAGGGTCCGAGAATATCAGTGGCGGGCATTTTTTCGACGCTTGTTTTGCCAAGCAACAAACTTTATGGACCCATCAAGGAATTGTTATCGGAAGAAAATCCTGCAATATACAGAGAAACCACCGTCAGAGACTTCAGTATCCAGTTCCGGTCGGACGGCCTTGGAACTTCTACTTTAAAGCATTACAAGCTGAATCAAGCAGAGGTTTAAATTGCAAGGAAATGGACGAAGATCATAACAGTTCTTTATATCATCCGTTGTAATAATTTTCTTTCAGTTCCACTTCGCTTTATAGTAATACTTTTGTATGACTTTACGTTAAATTTCAATAATAAAATTGTATCAGATTCTTATCCCAAATCTCATGAATATTTTACCGGCTGCCTGAGTTCGGCGAGTCCCCTGTTCCCGGTGAAGCCCAAATGAGCATTCCTGTGATAGACCTCGAAGGCATCGACAGAGGTTCATCCAAAATTGTA

mRNA sequence

GCGTTCCTTCGATTCAAGCCCAGAGATCGGCTATGGCGGTCGCAGCCTCAACAGTCAACGGATTCAGCCTCACTCCGGTATCCAAAGCCGACGAAAACTACCACAGACCCACTGACCTTAAAGCTTTCGACGACACGAAAGCCGGCGTCAAAGGCTTAGTCGACGCCGGAATCACCGAAATTCCACGTATCTTTTACAGCCCACCGGAGGATTTTAACTCCGATGATGTTTCCAGCGAAACCCAAATCCATATACCAGTGATTGACCTCGACCTTATCGACAAATCCACCGTCGACAGAGTCAGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAGTTGATTAACCATGGGATTCCGGTGGACGTTCTTGAAGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACAGAATCAAGGAAACAATATTACACTCGTGACCTCACGATGCCTTTGATTTACAACAGTAATTTCGATCTGTACTCTGCTACGACCACCAATTGGAGGGATACATTTGGGTATATAAGTGCCCCAAATCTCCACAATCCGGCAAGCCTGCCGGAAATTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGGTTGGGAAATTACTGTTTGAATTGCTGTCGGAAGCTCTGGGCCTGAACCCAAATTATTTGAACGATATAGACTGCAACGAAGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACTGACAATGACTTCATCACTGTGCTATTGCAAGACCACATCGGCGGCCTACAGATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCGGTCGCCGGAGCTTTAGTCGTGAACATCGGAGATCTTATGCAGCTAATAACGAATGACAAATTCAAAAGCGTGAAGCACAGGGTTCTTGCAAACAAGGAGGGGCCGCGAGTGTCAGTGGCTGGAGTCTTTTCGACGCTTTCTTTTCCAAACTCGAAATTGTATGGACCCATCAAGGAGTTGTTGAGTGAAGATAATCCTGCAATATACAGAGAAACCACTGTCAGAGACTTTAGTATCCAGATCAACGGAGTCGATGGAATCAACCTCACTCCGTTATCCAAGGCCGACGAAAACTACTACCGACCCACTGAACTCAAGGCTTTCGACGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCGGAATTACCGAAATTCCCCGTATCTTTTACCACCCACTGGAGGAAGATGACTCCGGCGAAACCCAAATCCGTATTCCAGTGATAGACCTCGAAGGCGTCGCCAAAGATTCACTCAAACGCAAAGACATTGTTGAACAAATCCGTGAAGCTTCAGAGGAATTGGGTTTTTTCCAGTTGATTAACCATGGAATTCCGGCAAGCGTTCTTGAAGAAATGAGAGAATCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATTCTATACACGGGACCTCATGAAGCCATTTGTTTACAATAGTAACTTCGATCTATACTCTGCAGCGACCACCAATTGGAGGGACACCTTTAGCCATGCTAGTGCTCCAAACCCTCCCAATCCACAGGACTTGCCGGAAATTTGCAGAGATATTCTGGTGGATTACTCAAAACGGGTGATGGAGATTGGAAAATTACTGTTTGAATTGCTGTCGGAGGCTCTCGGTCTGAACCCAAATTACTTGAACAACATAGGCTGCAGTGATGGGCTTGCATTTGTATACCACTATTACCCTGCATGCCCACAGCCGAAATCGACCATTGGCATATCCGAGCACTCTGACACTGACTTCATCACAGTGCTGTTACAAGACCACGTCGGCGGCCTACAGATTCGTCATCAGAACAAATGGATTGACGTGTGTCCTGTCGCCGGTGCGTTAGTCGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAGGTTCAAAAGCGTTAATCATAGGGTTGTGTCGAAACATGAGGGTCCGAGAATATCAGTGGCGGGCATTTTTTCGACGCTTGTTTTGCCAAGCAACAAACTTTATGGACCCATCAAGGAATTGTTATCGGAAGAAAATCCTGCAATATACAGAGAAACCACCGTCAGAGACTTCAGTATCCAGTTCCGGTCGGACGGCCTTGGAACTTCTACTTTAAAGCATTACAAGCTGAATCAAGCAGAGTTCGGCGAGTCCCCTGTTCCCGGTGAAGCCCAAATGAGCATTCCTGTGATAGACCTCGAAGGCATCGACAGAGGTTCATCCAAAATTGTA

Coding sequence (CDS)

ATGGCGGTCGCAGCCTCAACAGTCAACGGATTCAGCCTCACTCCGGTATCCAAAGCCGACGAAAACTACCACAGACCCACTGACCTTAAAGCTTTCGACGACACGAAAGCCGGCGTCAAAGGCTTAGTCGACGCCGGAATCACCGAAATTCCACGTATCTTTTACAGCCCACCGGAGGATTTTAACTCCGATGATGTTTCCAGCGAAACCCAAATCCATATACCAGTGATTGACCTCGACCTTATCGACAAATCCACCGTCGACAGAGTCAGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAGTTGATTAACCATGGGATTCCGGTGGACGTTCTTGAAGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACAGAATCAAGGAAACAATATTACACTCGTGACCTCACGATGCCTTTGATTTACAACAGTAATTTCGATCTGTACTCTGCTACGACCACCAATTGGAGGGATACATTTGGGTATATAAGTGCCCCAAATCTCCACAATCCGGCAAGCCTGCCGGAAATTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGGTTGGGAAATTACTGTTTGAATTGCTGTCGGAAGCTCTGGGCCTGAACCCAAATTATTTGAACGATATAGACTGCAACGAAGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACTGACAATGACTTCATCACTGTGCTATTGCAAGACCACATCGGCGGCCTACAGATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCGGTCGCCGGAGCTTTAGTCGTGAACATCGGAGATCTTATGCAGCTAATAACGAATGACAAATTCAAAAGCGTGAAGCACAGGGTTCTTGCAAACAAGGAGGGGCCGCGAGTGTCAGTGGCTGGAGTCTTTTCGACGCTTTCTTTTCCAAACTCGAAATTGTATGGACCCATCAAGGAGTTGTTGAGTGAAGATAATCCTGCAATATACAGAGAAACCACTGTCAGAGACTTTAGTATCCAGATCAACGGAGTCGATGGAATCAACCTCACTCCGTTATCCAAGGCCGACGAAAACTACTACCGACCCACTGAACTCAAGGCTTTCGACGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCGGAATTACCGAAATTCCCCGTATCTTTTACCACCCACTGGAGGAAGATGACTCCGGCGAAACCCAAATCCGTATTCCAGTGATAGACCTCGAAGGCGTCGCCAAAGATTCACTCAAACGCAAAGACATTGTTGAACAAATCCGTGAAGCTTCAGAGGAATTGGGTTTTTTCCAGTTGATTAACCATGGAATTCCGGCAAGCGTTCTTGAAGAAATGAGAGAATCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATTCTATACACGGGACCTCATGAAGCCATTTGTTTACAATAGTAACTTCGATCTATACTCTGCAGCGACCACCAATTGGAGGGACACCTTTAGCCATGCTAGTGCTCCAAACCCTCCCAATCCACAGGACTTGCCGGAAATTTGCAGAGATATTCTGGTGGATTACTCAAAACGGGTGATGGAGATTGGAAAATTACTGTTTGAATTGCTGTCGGAGGCTCTCGGTCTGAACCCAAATTACTTGAACAACATAGGCTGCAGTGATGGGCTTGCATTTGTATACCACTATTACCCTGCATGCCCACAGCCGAAATCGACCATTGGCATATCCGAGCACTCTGACACTGACTTCATCACAGTGCTGTTACAAGACCACGTCGGCGGCCTACAGATTCGTCATCAGAACAAATGGATTGACGTGTGTCCTGTCGCCGGTGCGTTAGTCGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAGGTTCAAAAGCGTTAATCATAGGGTTGTGTCGAAACATGAGGGTCCGAGAATATCAGTGGCGGGCATTTTTTCGACGCTTGTTTTGCCAAGCAACAAACTTTATGGACCCATCAAGGAATTGTTATCGGAAGAAAATCCTGCAATATACAGAGAAACCACCGTCAGAGACTTCAGTATCCAGTTCCGGTCGGACGGCCTTGGAACTTCTACTTTAAAGCATTACAAGCTGAATCAAGCAGAGTTCGGCGAGTCCCCTGTTCCCGGTGAAGCCCAAATGAGCATTCCTGTGATAGACCTCGAAGGCATCGACAGAGGTTCATCCAAAATTGTA

Protein sequence

MAVAASTVNGFSLTPVSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLIDKSTVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTRDLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDHIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQINGVDGINLTPLSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHPLEEDDSGETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGLGTSTLKHYKLNQAEFGESPVPGEAQMSIPVIDLEGIDRGSSKIV
Homology
BLAST of Cp4.1LG03g10080 vs. ExPASy Swiss-Prot
Match: Q84MB3 (1-aminocyclopropane-1-carboxylate oxidase homolog 1 OS=Arabidopsis thaliana OX=3702 GN=At1g06620 PE=2 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 8.2e-116
Identity = 198/353 (56.09%), Postives = 262/353 (74.22%), Query Frame = 0

Query: 375 RPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP----LEEDDSGETQIRIPVIDLEGVAK 434
           R T LKAFD+TK GVKGL+DAGITEIP IF  P            +   IP IDL+G   
Sbjct: 13  RSTLLKAFDETKTGVKGLIDAGITEIPSIFRAPPATLTSPKPPSSSDFSIPTIDLKGGGT 72

Query: 435 DSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDL 494
           DS+ R+ +VE+I +A+E+ GFFQ+INHGIP  VLE+M + +R FHEQDTEVKK FY+RD 
Sbjct: 73  DSITRRSLVEKIGDAAEKWGFFQVINHGIPMDVLEKMIDGIREFHEQDTEVKKGFYSRDP 132

Query: 495 MKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLL 554
               VY+SNFDL+S+   NWRDT    +AP+PP P+DLP  C +++++YSK VM++GKLL
Sbjct: 133 ASKMVYSSNFDLFSSPAANWRDTLGCYTAPDPPRPEDLPATCGEMMIEYSKEVMKLGKLL 192

Query: 555 FELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHV 614
           FELLSEALGLN N+L ++ C++ L  + HYYP CPQP  T+G+++HSD  F+T+LLQDH+
Sbjct: 193 FELLSEALGLNTNHLKDMDCTNSLLLLGHYYPPCPQPDLTLGLTKHSDNSFLTILLQDHI 252

Query: 615 GGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFS 674
           GGLQ+ H   W+DV PV GALVVN+GDL+QLITND+F SV HRV++   GPRISVA  FS
Sbjct: 253 GGLQVLHDQYWVDVPPVPGALVVNVGDLLQLITNDKFISVEHRVLANVAGPRISVACFFS 312

Query: 675 TLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYKL 723
           + ++ + ++YGPIKE+LSEENP  YR+TT+ +++  +RS G  GTS L + K+
Sbjct: 313 SYLMANPRVYGPIKEILSEENPPNYRDTTITEYAKFYRSKGFDGTSGLLYLKI 365

BLAST of Cp4.1LG03g10080 vs. ExPASy Swiss-Prot
Match: Q8H1S4 (1-aminocyclopropane-1-carboxylate oxidase homolog 3 OS=Arabidopsis thaliana OX=3702 GN=At1g06650 PE=2 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 5.9e-114
Identity = 197/367 (53.68%), Postives = 272/367 (74.11%), Query Frame = 0

Query: 366 LSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHPLEEDDSGET-------QI 425
           + K D  + R +ELKAFD+TK GVKGLVD+G++++PRIF+HP  +  + +          
Sbjct: 3   MMKIDPLFDRASELKAFDETKTGVKGLVDSGVSQVPRIFHHPTVKLSTPKPLPSDLLHLK 62

Query: 426 RIPVIDLEG-VAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQ 485
            IP IDL G   +D++KR + +E+I+EA+ + GFFQ+INHG+   +LE+M++ VR FHEQ
Sbjct: 63  TIPTIDLGGRDFQDAIKRNNAIEEIKEAAAKWGFFQVINHGVSLELLEKMKKGVRDFHEQ 122

Query: 486 DTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILV 545
             EV+K+FY+RD  + F+Y SNFDL+S+   NWRDTFS   AP+ P PQDLPEICRDI++
Sbjct: 123 SQEVRKEFYSRDFSRRFLYLSNFDLFSSPAANWRDTFSCTMAPDTPKPQDLPEICRDIMM 182

Query: 546 DYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHS 605
           +YSK+VM +GK LFELLSEALGL PN+LN++ CS GL  + HYYP CP+P  T+G S+HS
Sbjct: 183 EYSKQVMNLGKFLFELLSEALGLEPNHLNDMDCSKGLLMLSHYYPPCPEPDLTLGTSQHS 242

Query: 606 DTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVV-S 665
           D  F+TVLL D + GLQ+R +  W DV  V+GAL++NIGDL+QLITND+F S+ HRV+ +
Sbjct: 243 DNSFLTVLLPDQIEGLQVRREGHWFDVPHVSGALIINIGDLLQLITNDKFISLEHRVLAN 302

Query: 666 KHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTS 723
           +    R+SVA  F+T V P+ ++YGPI+EL+SEENP  YRETT++D++  F + GL GTS
Sbjct: 303 RATRARVSVACFFTTGVRPNPRMYGPIRELVSEENPPKYRETTIKDYATYFNAKGLDGTS 362

BLAST of Cp4.1LG03g10080 vs. ExPASy Swiss-Prot
Match: Q9LTH7 (1-aminocyclopropane-1-carboxylate oxidase homolog 12 OS=Arabidopsis thaliana OX=3702 GN=At5g59540 PE=2 SV=1)

HSP 1 Score: 402.9 bits (1034), Expect = 8.0e-111
Identity = 194/353 (54.96%), Postives = 261/353 (73.94%), Query Frame = 0

Query: 378 ELKAFDDTKAGVKGLVDAGITEIPRIFYHPLE-----EDDSGETQIRIPVIDLEGVAKDS 437
           E KAFD+TK GVKGLVDA ITE+PRIF+H  +     +  +  + + IP+ID   V  D+
Sbjct: 14  ERKAFDETKQGVKGLVDAKITEVPRIFHHRQDILTNKKPSASVSDLEIPIIDFASVHADT 73

Query: 438 LKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDL-M 497
             R+ IVE+++ A E  GFFQ+INH IP +VLEE+++ VRRFHE+D EVKK F++RD   
Sbjct: 74  ASREAIVEKVKYAVENWGFFQVINHSIPLNVLEEIKDGVRRFHEEDPEVKKSFFSRDAGN 133

Query: 498 KPFVYNSNFDLYSAA-TTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLL 557
           K FVYNSNFDLYS++ + NWRD+FS   AP+PP P+++PE CRD + +YSK V+  G LL
Sbjct: 134 KKFVYNSNFDLYSSSPSVNWRDSFSCYIAPDPPAPEEIPETCRDAMFEYSKHVLSFGGLL 193

Query: 558 FELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHV 617
           FELLSEALGL    L ++ C   L  + HYYP CPQP  T+GI++HSD  F+T+LLQD++
Sbjct: 194 FELLSEALGLKSQTLESMDCVKTLLMICHYYPPCPQPDLTLGITKHSDNSFLTLLLQDNI 253

Query: 618 GGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFS 677
           GGLQI HQ+ W+DV P+ GALVVNIGD +QLITND+F SV HRV++  +GPRISVA  FS
Sbjct: 254 GGLQILHQDSWVDVSPIHGALVVNIGDFLQLITNDKFVSVEHRVLANRQGPRISVASFFS 313

Query: 678 TLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYKL 723
           + + P++++YGP+KEL+SEENP  YR+ T++++S  F   GL GTS L + ++
Sbjct: 314 SSMRPNSRVYGPMKELVSEENPPKYRDITIKEYSKIFFEKGLDGTSHLSNIRI 366

BLAST of Cp4.1LG03g10080 vs. ExPASy Swiss-Prot
Match: Q9LTH8 (1-aminocyclopropane-1-carboxylate oxidase homolog 11 OS=Arabidopsis thaliana OX=3702 GN=At5g59530 PE=2 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 9.7e-109
Identity = 194/364 (53.30%), Postives = 262/364 (71.98%), Query Frame = 0

Query: 366 LSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP---LEEDDSGETQIRIPV 425
           ++K    + R  E KAFD+TK GVKGL+DA ITEIPRIF+ P   L +     + + IP 
Sbjct: 1   MAKNSVEFDRYIERKAFDNTKEGVKGLIDAKITEIPRIFHVPQDTLPDKKRSVSDLEIPT 60

Query: 426 IDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFH-EQDTEV 485
           ID   V  D+  R+ IVE+++ A E  GFFQ+INHG+P +VLEE+++ VRRFH E+D EV
Sbjct: 61  IDFASVNVDTPSREAIVEKVKYAVENWGFFQVINHGVPLNVLEEIKDGVRRFHEEEDPEV 120

Query: 486 KKQFYTRDLMK-PFVYNSNFDLYSAA-TTNWRDTFSHASAPNPPNPQDLPEICRDILVDY 545
           KK +Y+ D  K  F Y+SNFDLYS++ +  WRD+ S   AP+PP P++LPE CRD +++Y
Sbjct: 121 KKSYYSLDFTKNKFAYSSNFDLYSSSPSLTWRDSISCYMAPDPPTPEELPETCRDAMIEY 180

Query: 546 SKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDT 605
           SK V+ +G LLFELLSEALGL    L ++ C   L  + HYYP CPQP  T+GIS+HSD 
Sbjct: 181 SKHVLSLGDLLFELLSEALGLKSEILKSMDCLKSLLMICHYYPPCPQPDLTLGISKHSDN 240

Query: 606 DFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHE 665
            F+TVLLQD++GGLQI HQ+ W+DV P+ GALVVN+GD +QLITND+F SV HRV++   
Sbjct: 241 SFLTVLLQDNIGGLQILHQDSWVDVSPLPGALVVNVGDFLQLITNDKFISVEHRVLANTR 300

Query: 666 GPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLK 723
           GPRISVA  FS+ +  ++ +YGP+KEL+SEENP  YR+TT+R++S  +   GL GTS L 
Sbjct: 301 GPRISVASFFSSSIRENSTVYGPMKELVSEENPPKYRDTTLREYSEGYFKKGLDGTSHLS 360

BLAST of Cp4.1LG03g10080 vs. ExPASy Swiss-Prot
Match: Q9C5K7 (1-aminocyclopropane-1-carboxylate oxidase homolog 2 OS=Arabidopsis thaliana OX=3702 GN=At1g06640 PE=2 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 8.2e-108
Identity = 197/368 (53.53%), Postives = 263/368 (71.47%), Query Frame = 0

Query: 367 SKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYH---------PLEEDDSGETQ 426
           +K   ++ R +ELKAFD+TK GVKGLVD+GI++IPRIF+H         PL  D      
Sbjct: 4   TKIAPSFDRASELKAFDETKTGVKGLVDSGISKIPRIFHHSSVELANPKPLPSDLLHLK- 63

Query: 427 IRIPVIDLEG-VAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHE 486
             IP IDL G   +D++K K+ +E I+EA+ + GFFQ+INHG+   +LE+M++ VR FHE
Sbjct: 64  -TIPTIDLGGRDFQDAIKHKNAIEGIKEAAAKWGFFQVINHGVSLELLEKMKDGVRDFHE 123

Query: 487 QDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDIL 546
           Q  EV+K  Y+RD  + F+Y SNFDLY+AA  NWRDTF    AP+PP PQDLPEICRD++
Sbjct: 124 QPPEVRKDLYSRDFGRKFIYLSNFDLYTAAAANWRDTFYCYMAPDPPEPQDLPEICRDVM 183

Query: 547 VDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEH 606
           ++YSK+VM +G+ LFELLSEALGLNPN+L ++ C  GL  + HY+P CP+P  T G S+H
Sbjct: 184 MEYSKQVMILGEFLFELLSEALGLNPNHLKDMECLKGLRMLCHYFPPCPEPDLTFGTSKH 243

Query: 607 SDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVV- 666
           SD  F+TVLL D++ GLQ+  +  W DV  V GAL++NIGDL+QLITND+F S+ HRV+ 
Sbjct: 244 SDGSFLTVLLPDNIEGLQVCREGYWFDVPHVPGALIINIGDLLQLITNDKFISLKHRVLA 303

Query: 667 SKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GT 723
           ++    R+SVA  F T V P+ ++YGPIKEL+SEENP  YRETT+RD++  F   GL GT
Sbjct: 304 NRATRARVSVACFFHTHVKPNPRVYGPIKELVSEENPPKYRETTIRDYATYFNGKGLGGT 363

BLAST of Cp4.1LG03g10080 vs. NCBI nr
Match: KAG7028836.1 (1-aminocyclopropane-1-carboxylate oxidase-like 1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1023 bits (2645), Expect = 0.0
Identity = 509/738 (68.97%), Postives = 597/738 (80.89%), Query Frame = 0

Query: 8   VNGFSLTPVSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVS 67
           V+G  LTP+SK DENYHR T+LKAFD++KAGVKGLVDAG+TEIPRIFY PPED++SD+V+
Sbjct: 3   VDGCILTPLSKVDENYHRATELKAFDESKAGVKGLVDAGVTEIPRIFYQPPEDYHSDNVA 62

Query: 68  SETQIHIPVIDLDLIDKS------TVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRR 127
            ET   IPVIDL+ +D++      T+DR+REASEKLGFFQLINHGIP  VLEEMK+AV+R
Sbjct: 63  GETPYKIPVIDLEHVDRNSLKRKFTIDRIREASEKLGFFQLINHGIPAAVLEEMKEAVKR 122

Query: 128 FNEQETESRKQYYTRDLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICR 187
           FNEQ TE +KQYYTR+ T PLIYNSNFDLYSA  TNWRDT GYISAP   NP  LPEI R
Sbjct: 123 FNEQCTEVKKQYYTRNTTKPLIYNSNFDLYSAAATNWRDTIGYISAPVPPNPQDLPEIIR 182

Query: 188 DILVDYSKRVMEVGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGT 247
           D LVDYSKRVME+G LLFELLSEALGL PNYL DI C EGLA+ CHYYPPCPQP+L +GT
Sbjct: 183 DNLVDYSKRVMEIGNLLFELLSEALGLTPNYLKDIGCCEGLAIGCHYYPPCPQPHLTLGT 242

Query: 248 SEHTDNDFITVLLQDHIGGLQIRYENKWVDVPPVAGALVVN-----IGDLMQLITND--- 307
           SEH+DN FITVL QDHIGGLQIR++ KWVDVPPV GA +        G + +L++ +   
Sbjct: 243 SEHSDNVFITVLYQDHIGGLQIRHQKKWVDVPPVDGATLYLPNSKLYGPIKELLSEENPA 302

Query: 308 KFKSVKHR-----VLANKEGPRVSVAGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTV 367
           K++    R     VLANKEGPRVSVAG FS      SK+YGPIKEL+SE+ PAIYR+TT+
Sbjct: 303 KYRETTIRDFHLQVLANKEGPRVSVAGFFSPPVLSTSKVYGPIKELVSEETPAIYRDTTI 362

Query: 368 RDFSIQING----------------VDGINLTPLSKADENYYRPTELKAFDDTKAGVKGL 427
            +F+ Q +                 V+ +NLTP+SKADE+Y+RPTELKAFDDTK+GVKGL
Sbjct: 363 GEFNTQFHSKGIGTSTLQQFKLTTEVNTLNLTPVSKADESYHRPTELKAFDDTKSGVKGL 422

Query: 428 VDAGITEIPRIFYHPLEEDDSG----ETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEE 487
           VDAGITEIPRIFY P E +DSG    ETQI IPVI+L+ + K  LKRK  V++IREASE+
Sbjct: 423 VDAGITEIPRIFYQPPESNDSGHVSDETQIHIPVINLDHIGKTPLKRKYTVDRIREASEK 482

Query: 488 LGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATT 547
            GFFQLINHGIP SVLEEM++ VRRFHEQDTE+K Q+Y+R++M+PF+Y SNFDLYS+ATT
Sbjct: 483 FGFFQLINHGIPVSVLEEMKDGVRRFHEQDTELKAQYYSRNIMRPFIYISNFDLYSSATT 542

Query: 548 NWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNI 607
           NWRDTF + SAPN  +PQ LPEICRDIL DYSKRVMEIGKL+FELLSEALGL PNYLN+I
Sbjct: 543 NWRDTFRYLSAPNSHHPQKLPEICRDILEDYSKRVMEIGKLMFELLSEALGLPPNYLNDI 602

Query: 608 GCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVA 667
            CS+GLA V HYYP CPQP   IG SEH+D DFIT+LLQDH+GGLQ+RH NKW+DV PVA
Sbjct: 603 DCSEGLALVCHYYPPCPQPNMAIGTSEHTDNDFITILLQDHIGGLQVRHGNKWVDVPPVA 662

Query: 668 GALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLS 706
           GALVVNIGDL+QLITND+FKS  HRVV+  EGPR+SVAG FST  L ++K+YGPIKEL+S
Sbjct: 663 GALVVNIGDLLQLITNDKFKSAKHRVVANKEGPRVSVAGFFSTFGLQTSKVYGPIKELVS 722

BLAST of Cp4.1LG03g10080 vs. NCBI nr
Match: XP_021290758.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 [Herrania umbratica])

HSP 1 Score: 872 bits (2252), Expect = 2.89e-308
Identity = 431/758 (56.86%), Postives = 550/758 (72.56%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLI 82
           Y R ++LKAFD+TKAGVKGLVDAGI E+PRIFY P + F +D  S  TQ+ IPVIDL+ +
Sbjct: 20  YDRASELKAFDETKAGVKGLVDAGIKEVPRIFYQPRDQFETDSFSGGTQVSIPVIDLEGV 79

Query: 83  DKST------VDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTR 142
           +K+       V++V+ AS+  GFFQ++NHGIPV V++EM D VRRF EQ  E++KQ ++R
Sbjct: 80  EKNPITRKEIVEKVQIASKTWGFFQVLNHGIPVSVMDEMMDGVRRFFEQGVEAKKQLFSR 139

Query: 143 DLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGK 202
           D T  ++YNSNFDL+SA    WRDT     APN   P  LP + RDI ++YSK++M +G 
Sbjct: 140 DYTKRVVYNSNFDLFSAPAAKWRDTVFCSMAPNPPKPEELPTVFRDITLEYSKQIMNLGY 199

Query: 203 LLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQD 262
           LLFELLSEALGLN +YL DIDC +GL ++CHYYP CPQP L +G+S+H DN F+TVLLQD
Sbjct: 200 LLFELLSEALGLNLDYLRDIDCAKGLVMLCHYYPICPQPELTLGSSKHADNGFLTVLLQD 259

Query: 263 HIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGV 322
           H+GGLQ+ +EN W+DVPP  GALV+NIGDL+QLI+ND F SV HRVL N  G RVSVA  
Sbjct: 260 HVGGLQVLHENHWIDVPPTPGALVINIGDLLQLISNDSFTSVAHRVLTNSVGSRVSVASF 319

Query: 323 FSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN--GVDGINLTP--------- 382
           F+T   P+S+LYGPIKELLSE+NP  YRETTV+D+    N  G+ G +  P         
Sbjct: 320 FTTALLPDSRLYGPIKELLSEENPPKYRETTVKDYITYFNAKGLSGTSPLPHFSLICETL 379

Query: 383 ----------------------LSKADE-------NYYRPTELKAFDDTKAGVKGLVDAG 442
                                 ++K DE        Y R +ELKAFDDTKAGVKGLVDAG
Sbjct: 380 PYFSSDVPLFLLKIANRRKKMVIAKTDEVQFELKPEYDRTSELKAFDDTKAGVKGLVDAG 439

Query: 443 ITEIPRIFYHPLEEDD----SGETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFF 502
           I E+PRIF HP ++ +    SG TQ+RIPVIDLEGV KD   R++IVE++R+AS+ LGFF
Sbjct: 440 IKEVPRIFQHPPDQSEKISVSGVTQVRIPVIDLEGVKKDPGTRQEIVEKVRDASKTLGFF 499

Query: 503 QLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRD 562
           Q++NHGIP SVLEEM++  RRF EQD E+KKQF+TRD  K   YNSNFDLYS+   NWRD
Sbjct: 500 QVVNHGIPLSVLEEMKDGARRFFEQDLEIKKQFHTRDYTKRVAYNSNFDLYSSPAANWRD 559

Query: 563 TFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSD 622
           T S   AP+PP P++LP++CRDI+++YSK VM +G LLFEL SEA+GL+P++L ++ C+ 
Sbjct: 560 TVSSLMAPDPPMPEELPDVCRDIMMEYSKLVMHLGYLLFELFSEAVGLHPDHLKDMDCAK 619

Query: 623 GLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALV 682
           GL  + HYYPACP+P+ T+G ++H+D DF+TVLLQDH+GGLQ+ H+N+W+D+ P  GALV
Sbjct: 620 GLVMLSHYYPACPRPELTLGATKHADNDFLTVLLQDHIGGLQVFHENQWVDIPPTPGALV 679

Query: 683 VNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENP 729
           +NIGDL+QLI+ND F SV HRV+S   G R+SVA  FST +LP  + YGPIKELLSEENP
Sbjct: 680 INIGDLLQLISNDAFVSVEHRVLSNSVGARVSVACFFSTFLLPDLRPYGPIKELLSEENP 739

BLAST of Cp4.1LG03g10080 vs. NCBI nr
Match: PRQ36474.1 (putative deacetoxyvindoline 4-hydroxylase [Rosa chinensis])

HSP 1 Score: 853 bits (2204), Expect = 1.33e-301
Identity = 449/745 (60.27%), Postives = 536/745 (71.95%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDF---NSDDVSSETQIHIPVIDL 82
           Y R ++LKAFDDTK GVKGLVDAGITEIPRIFY PP+++   N+ D S E Q  +PVIDL
Sbjct: 13  YDRKSELKAFDDTKEGVKGLVDAGITEIPRIFYHPPDEYSIHNTSD-SEEEQFSVPVIDL 72

Query: 83  DLID-----KSTVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYY 142
             +      K  V  V EASE  GFFQ++NHGI VDVLEE+KD VR F +Q+T  +KQ+Y
Sbjct: 73  KGLSDPTKRKGIVAAVAEASESWGFFQIVNHGISVDVLEEIKDGVRGFFQQDTAVKKQFY 132

Query: 143 TRD-LTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVME 202
           TRD  + P +YNSNFDLYSA  T+WRD+F +           LPE+CR+ILV+YS +V +
Sbjct: 133 TRDNFSSPFVYNSNFDLYSAPATHWRDSFTWYMITPTPKTEDLPEVCREILVEYSNQVRK 192

Query: 203 VGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVL 262
           +GKLLFELLSEALGL  ++LNDIDCNEGL +  HYYP CPQP L +G S+H D  FITVL
Sbjct: 193 LGKLLFELLSEALGLKQSHLNDIDCNEGLLVTGHYYPACPQPELTVGISKHADRTFITVL 252

Query: 263 LQDHIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSV 322
           LQD++GGLQ+ + N W+DV PV GALV        LI+ND+FKSV+HRVLAN  GPR+SV
Sbjct: 253 LQDNVGGLQVLHHNMWIDVNPVPGALV--------LISNDRFKSVEHRVLANHRGPRISV 312

Query: 323 AGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN--GVDG----------- 382
           AG FST   P  KLYGPIKE+LSEDNP  YRETT+RD+       G+DG           
Sbjct: 313 AGFFSTGLLPLGKLYGPIKEILSEDNPPKYRETTLRDYHAYYREKGLDGRYSIKDTKSLF 372

Query: 383 ----------INLTPL-----SKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFY 442
                       LT +     ++   NY R  ELKAFDDTK GVKGLVDAGITEIPRIFY
Sbjct: 373 GILLCMDAEVYRLTKMVATNRNEVPTNYDRKHELKAFDDTKEGVKGLVDAGITEIPRIFY 432

Query: 443 HPLEE------DDSGETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGI 502
           HP +E       DS E +  IPVID+EG+  D  KRK+IV  + EASE  GFFQ+ NHGI
Sbjct: 433 HPPDEYSIDNTSDSEEVRCSIPVIDVEGLF-DQTKRKEIVAAVGEASETWGFFQISNHGI 492

Query: 503 PASVLEEMRESVRRFHEQDTEVKKQFYTRD-LMKPFVYNSNFDLYSAATTNWRDTFSHAS 562
           P  VLEEM++ VR F+EQDTEVKKQ+YTRD      VYNSNFDLYSA  TNWRD+     
Sbjct: 493 PFDVLEEMKDGVRGFYEQDTEVKKQYYTRDDSSSTVVYNSNFDLYSAPATNWRDSLLCYM 552

Query: 563 APNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVY 622
           AP PP  +D PE+CR+ILV+YSK+VM++GKLLFELLSEALGLNP++LN+I CS+GL  + 
Sbjct: 553 APTPPKTEDFPEVCREILVEYSKQVMKLGKLLFELLSEALGLNPSHLNDIDCSEGLIVLG 612

Query: 623 HYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDL 682
           HYYPACPQP+ TIG S+H+D  F+TVLLQDH+GGLQ+ HQNKWIDV  V GALVVNIGD 
Sbjct: 613 HYYPACPQPELTIGTSKHADNSFMTVLLQDHIGGLQVLHQNKWIDVPHVPGALVVNIGDF 672

Query: 683 MQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRET 722
             LI+NDRFKSV HRV++ H GPR+SVA  FST +LPS KLYGPIKELLSE+NP  YRET
Sbjct: 673 --LISNDRFKSVEHRVLANHRGPRVSVASFFSTGLLPSTKLYGPIKELLSEDNPPKYRET 732

BLAST of Cp4.1LG03g10080 vs. NCBI nr
Match: XP_002529305.3 (uncharacterized protein LOC8276157 [Ricinus communis])

HSP 1 Score: 830 bits (2144), Expect = 2.06e-292
Identity = 412/740 (55.68%), Postives = 534/740 (72.16%), Query Frame = 0

Query: 20  DENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSP---PEDFNSDDVSSETQIHIPV 79
           +  Y R ++LKAFD+TK GVKGL+DAG+T++PRIF+ P   P+D +S   + + Q  IP+
Sbjct: 14  NHEYDRNSELKAFDETKLGVKGLIDAGVTKVPRIFHKPHDYPDDISS--AAEDAQFRIPI 73

Query: 80  IDLDLID------KSTVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESR 139
           IDL+ ++      +  V+ +R A+E  GFFQ+INHGI + ++EEM + VRRF EQ++E +
Sbjct: 74  IDLEAVEMDSTTHEKAVEEIRNAAETWGFFQIINHGIDLSIMEEMINGVRRFFEQDSEVK 133

Query: 140 KQYYTRDLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKR 199
           K++Y+R+     +Y SNFDLY +   +WRDTF    APN   P  LPE+CRDI ++YSK+
Sbjct: 134 KKFYSREAGKSFMYVSNFDLYFSKFASWRDTFSCNIAPNTVKPEELPEVCRDITLEYSKK 193

Query: 200 VMEVGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFI 259
           V  VG  LFELLSEALGL PN L +++C EGL ++CHYYPPCP+P L  GT  HTD  F+
Sbjct: 194 VRRVGIFLFELLSEALGLKPNRLREMECTEGLVVLCHYYPPCPEPELTTGTVGHTDPGFL 253

Query: 260 TVLLQDHIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPR 319
           TVLLQD IGGLQ+ ++NKWVD+PPV GALVVN+ D++QLI+NDKFKS  HRVL+N+ GPR
Sbjct: 254 TVLLQDQIGGLQVLHQNKWVDIPPVPGALVVNLADMLQLISNDKFKSSMHRVLSNRIGPR 313

Query: 320 VSVAGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQINGVDG---------- 379
           +SVA  FST    +SKL+GPIKELLSEDNP IYRET V DF+  +    G          
Sbjct: 314 ISVASFFSTGFQKSSKLFGPIKELLSEDNPPIYRETAVYDFATSLKDGKGGLQNFKLQTY 373

Query: 380 ---INLTPLSK----------ADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP 439
              +NL   S            D  Y R +ELKAFDDTKAGVKGLVDAGI E+PRIF+  
Sbjct: 374 SSTVNLLMQSNPQRSKKMNSPVDSEYCRTSELKAFDDTKAGVKGLVDAGIIEVPRIFHLS 433

Query: 440 LEEDDS----GETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGIPASV 499
            +  D+     +     P IDLEGV KDS+ RK+IV+++R ASE  GFF+++NHGIP SV
Sbjct: 434 SDHLDNISHTVDPMFNFPRIDLEGVNKDSILRKEIVDKVRHASETWGFFEVVNHGIPVSV 493

Query: 500 LEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPP 559
           LEEM+E V+RFHEQD E+KK+FY+RD  K  +YNSNFDLYS+   NWRD+      P+PP
Sbjct: 494 LEEMKEGVKRFHEQDVELKKEFYSRDYTKKVLYNSNFDLYSSPFANWRDSIFFQMIPDPP 553

Query: 560 NPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPA 619
            P++LP  CRDIL++YSK V ++G LL EL SEALGL+PN+L ++ C++GL  V +YYPA
Sbjct: 554 KPEELPAACRDILMEYSKEVRKLGDLLLELFSEALGLSPNHLKDMECNEGLLIVGNYYPA 613

Query: 620 CPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLIT 679
           C QP+ T+G S H+D+DF TVLLQDH+GGLQ+ HQN+WI+V     ALVVNIGDL+QLIT
Sbjct: 614 CRQPEITLGASGHADSDFFTVLLQDHIGGLQVLHQNEWINVPSTPDALVVNIGDLIQLIT 673

Query: 680 NDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDF 722
           ND+F SV HRV++   GPRISVA  F+T ++ +++LYGPIKELLSEENP  YRETTVR++
Sbjct: 674 NDKFISVEHRVLANCVGPRISVASFFTTTLISTSRLYGPIKELLSEENPPKYRETTVREY 733

BLAST of Cp4.1LG03g10080 vs. NCBI nr
Match: XP_003626788.3 (uncharacterized protein LOC11434998 [Medicago truncatula])

HSP 1 Score: 825 bits (2132), Expect = 1.31e-290
Identity = 414/737 (56.17%), Postives = 532/737 (72.18%), Query Frame = 0

Query: 28  DLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLID---- 87
           + KAFD+TKAGVKGLVD G+ +IP +F+  P+ +   +++  T   IPVIDL  ID    
Sbjct: 17  ERKAFDETKAGVKGLVDGGVEKIPSLFHHQPDKY---EIAYNTSHVIPVIDLKDIDNKDP 76

Query: 88  ---KSTVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTRDLTM 147
              +  V +++EA E  GFFQ++NHGIP+ VLEEMKD V+RF+E ET+++K++YTRDL  
Sbjct: 77  SIHQGIVSKIKEACETWGFFQVVNHGIPLSVLEEMKDGVKRFHEMETDAKKEFYTRDLHG 136

Query: 148 PLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGKLLFE 207
             IY SNFDLYS+   NWRDT     AP+   P   P +CRDIL++Y K+VM +G LLFE
Sbjct: 137 SFIYKSNFDLYSSPALNWRDTCTCSLAPDTPKPEDFPVVCRDILLEYGKQVMNLGTLLFE 196

Query: 208 LLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDHIGG 267
           LLS+ALGLNPN+L D+ C EGL  +CHYYPPCP+P L +GT++H DNDF+TVLLQDHIGG
Sbjct: 197 LLSQALGLNPNHLKDMGCAEGLIALCHYYPPCPEPELTVGTTKHCDNDFLTVLLQDHIGG 256

Query: 268 LQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGVFSTL 327
           LQ+ YE+KW+D+ PV GALVVN+GDL+QLITND+FKSV+HRV+AN+ GPR+SVA  FST 
Sbjct: 257 LQVLYEDKWIDITPVPGALVVNVGDLLQLITNDRFKSVEHRVVANQVGPRISVACFFSTG 316

Query: 328 SFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN--GVDGIN---------LTPLSKA 387
             P+SKLYGP+KELLSE+NP  YRETTV DF+   N  G+DG +         L  L   
Sbjct: 317 LRPSSKLYGPMKELLSENNPPKYRETTVADFAAYFNAKGLDGTSALTHYKILCLLTLLVT 376

Query: 388 DENYYRPT--------------------ELKAFDDTKAGVKGLVDAGITEIPRIFYH-PL 447
             +Y   T                    E K FD+TKAGVKGLVDAG+ +IP +F+H P 
Sbjct: 377 ISSYRTTTNIRKMGTTGTTKPNLDSILSERKEFDETKAGVKGLVDAGLKKIPSLFHHQPD 436

Query: 448 EEDDSGETQIRIPVIDLEGVA-KDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEM 507
           + + +      IPVIDL+ +  KD    + IV+ I+EA E  GFFQ++NHGIP SVLEE+
Sbjct: 437 KYEKANNMSHAIPVIDLKDIDNKDPSIHQGIVDNIKEACETWGFFQVVNHGIPLSVLEEL 496

Query: 508 RESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFS-HASAPNPPNPQ 567
           ++ V+RF+EQDTEVKK+ YTR+  + FVYNSNFD+YS+   NWRD+F  + + P+   PQ
Sbjct: 497 KDGVKRFYEQDTEVKKELYTRNSNRSFVYNSNFDIYSSPALNWRDSFMCYLAPPDTLKPQ 556

Query: 568 DLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQ 627
           + P +CRDIL+ Y K +M +G LLFELLSEALGLNPN+L ++ C++GL  + HYYP CP+
Sbjct: 557 EFPVVCRDILLQYGKYMMNLGTLLFELLSEALGLNPNHLKDMDCAEGLIALCHYYPPCPE 616

Query: 628 PKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDR 687
           P+ T+G ++HSD DF+TVLLQDHVGGLQ+ + +KWID+ PV GAL+VN+GDL+QLITNDR
Sbjct: 617 PELTVGTTKHSDNDFLTVLLQDHVGGLQVLYDDKWIDITPVPGALIVNVGDLLQLITNDR 676

Query: 688 FKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQ 722
           FKSV HRVV+   GPRISVA  F T +  S+KLYGPIKELLSE+NP  YRETTV D+   
Sbjct: 677 FKSVEHRVVANEVGPRISVACFFCTGIRSSSKLYGPIKELLSEDNPPKYRETTVSDYVAY 736

BLAST of Cp4.1LG03g10080 vs. ExPASy TrEMBL
Match: A0A6J1AUS7 (LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 OS=Herrania umbratica OX=108875 GN=LOC110421487 PE=4 SV=1)

HSP 1 Score: 872 bits (2252), Expect = 1.40e-308
Identity = 431/758 (56.86%), Postives = 550/758 (72.56%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLI 82
           Y R ++LKAFD+TKAGVKGLVDAGI E+PRIFY P + F +D  S  TQ+ IPVIDL+ +
Sbjct: 20  YDRASELKAFDETKAGVKGLVDAGIKEVPRIFYQPRDQFETDSFSGGTQVSIPVIDLEGV 79

Query: 83  DKST------VDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTR 142
           +K+       V++V+ AS+  GFFQ++NHGIPV V++EM D VRRF EQ  E++KQ ++R
Sbjct: 80  EKNPITRKEIVEKVQIASKTWGFFQVLNHGIPVSVMDEMMDGVRRFFEQGVEAKKQLFSR 139

Query: 143 DLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGK 202
           D T  ++YNSNFDL+SA    WRDT     APN   P  LP + RDI ++YSK++M +G 
Sbjct: 140 DYTKRVVYNSNFDLFSAPAAKWRDTVFCSMAPNPPKPEELPTVFRDITLEYSKQIMNLGY 199

Query: 203 LLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQD 262
           LLFELLSEALGLN +YL DIDC +GL ++CHYYP CPQP L +G+S+H DN F+TVLLQD
Sbjct: 200 LLFELLSEALGLNLDYLRDIDCAKGLVMLCHYYPICPQPELTLGSSKHADNGFLTVLLQD 259

Query: 263 HIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGV 322
           H+GGLQ+ +EN W+DVPP  GALV+NIGDL+QLI+ND F SV HRVL N  G RVSVA  
Sbjct: 260 HVGGLQVLHENHWIDVPPTPGALVINIGDLLQLISNDSFTSVAHRVLTNSVGSRVSVASF 319

Query: 323 FSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN--GVDGINLTP--------- 382
           F+T   P+S+LYGPIKELLSE+NP  YRETTV+D+    N  G+ G +  P         
Sbjct: 320 FTTALLPDSRLYGPIKELLSEENPPKYRETTVKDYITYFNAKGLSGTSPLPHFSLICETL 379

Query: 383 ----------------------LSKADE-------NYYRPTELKAFDDTKAGVKGLVDAG 442
                                 ++K DE        Y R +ELKAFDDTKAGVKGLVDAG
Sbjct: 380 PYFSSDVPLFLLKIANRRKKMVIAKTDEVQFELKPEYDRTSELKAFDDTKAGVKGLVDAG 439

Query: 443 ITEIPRIFYHPLEEDD----SGETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFF 502
           I E+PRIF HP ++ +    SG TQ+RIPVIDLEGV KD   R++IVE++R+AS+ LGFF
Sbjct: 440 IKEVPRIFQHPPDQSEKISVSGVTQVRIPVIDLEGVKKDPGTRQEIVEKVRDASKTLGFF 499

Query: 503 QLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRD 562
           Q++NHGIP SVLEEM++  RRF EQD E+KKQF+TRD  K   YNSNFDLYS+   NWRD
Sbjct: 500 QVVNHGIPLSVLEEMKDGARRFFEQDLEIKKQFHTRDYTKRVAYNSNFDLYSSPAANWRD 559

Query: 563 TFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSD 622
           T S   AP+PP P++LP++CRDI+++YSK VM +G LLFEL SEA+GL+P++L ++ C+ 
Sbjct: 560 TVSSLMAPDPPMPEELPDVCRDIMMEYSKLVMHLGYLLFELFSEAVGLHPDHLKDMDCAK 619

Query: 623 GLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALV 682
           GL  + HYYPACP+P+ T+G ++H+D DF+TVLLQDH+GGLQ+ H+N+W+D+ P  GALV
Sbjct: 620 GLVMLSHYYPACPRPELTLGATKHADNDFLTVLLQDHIGGLQVFHENQWVDIPPTPGALV 679

Query: 683 VNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENP 729
           +NIGDL+QLI+ND F SV HRV+S   G R+SVA  FST +LP  + YGPIKELLSEENP
Sbjct: 680 INIGDLLQLISNDAFVSVEHRVLSNSVGARVSVACFFSTFLLPDLRPYGPIKELLSEENP 739

BLAST of Cp4.1LG03g10080 vs. ExPASy TrEMBL
Match: A0A2P6QQL3 (Putative deacetoxyvindoline 4-hydroxylase OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0391961 PE=4 SV=1)

HSP 1 Score: 853 bits (2204), Expect = 6.42e-302
Identity = 449/745 (60.27%), Postives = 536/745 (71.95%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDF---NSDDVSSETQIHIPVIDL 82
           Y R ++LKAFDDTK GVKGLVDAGITEIPRIFY PP+++   N+ D S E Q  +PVIDL
Sbjct: 13  YDRKSELKAFDDTKEGVKGLVDAGITEIPRIFYHPPDEYSIHNTSD-SEEEQFSVPVIDL 72

Query: 83  DLID-----KSTVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYY 142
             +      K  V  V EASE  GFFQ++NHGI VDVLEE+KD VR F +Q+T  +KQ+Y
Sbjct: 73  KGLSDPTKRKGIVAAVAEASESWGFFQIVNHGISVDVLEEIKDGVRGFFQQDTAVKKQFY 132

Query: 143 TRD-LTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVME 202
           TRD  + P +YNSNFDLYSA  T+WRD+F +           LPE+CR+ILV+YS +V +
Sbjct: 133 TRDNFSSPFVYNSNFDLYSAPATHWRDSFTWYMITPTPKTEDLPEVCREILVEYSNQVRK 192

Query: 203 VGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVL 262
           +GKLLFELLSEALGL  ++LNDIDCNEGL +  HYYP CPQP L +G S+H D  FITVL
Sbjct: 193 LGKLLFELLSEALGLKQSHLNDIDCNEGLLVTGHYYPACPQPELTVGISKHADRTFITVL 252

Query: 263 LQDHIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSV 322
           LQD++GGLQ+ + N W+DV PV GALV        LI+ND+FKSV+HRVLAN  GPR+SV
Sbjct: 253 LQDNVGGLQVLHHNMWIDVNPVPGALV--------LISNDRFKSVEHRVLANHRGPRISV 312

Query: 323 AGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN--GVDG----------- 382
           AG FST   P  KLYGPIKE+LSEDNP  YRETT+RD+       G+DG           
Sbjct: 313 AGFFSTGLLPLGKLYGPIKEILSEDNPPKYRETTLRDYHAYYREKGLDGRYSIKDTKSLF 372

Query: 383 ----------INLTPL-----SKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFY 442
                       LT +     ++   NY R  ELKAFDDTK GVKGLVDAGITEIPRIFY
Sbjct: 373 GILLCMDAEVYRLTKMVATNRNEVPTNYDRKHELKAFDDTKEGVKGLVDAGITEIPRIFY 432

Query: 443 HPLEE------DDSGETQIRIPVIDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGI 502
           HP +E       DS E +  IPVID+EG+  D  KRK+IV  + EASE  GFFQ+ NHGI
Sbjct: 433 HPPDEYSIDNTSDSEEVRCSIPVIDVEGLF-DQTKRKEIVAAVGEASETWGFFQISNHGI 492

Query: 503 PASVLEEMRESVRRFHEQDTEVKKQFYTRD-LMKPFVYNSNFDLYSAATTNWRDTFSHAS 562
           P  VLEEM++ VR F+EQDTEVKKQ+YTRD      VYNSNFDLYSA  TNWRD+     
Sbjct: 493 PFDVLEEMKDGVRGFYEQDTEVKKQYYTRDDSSSTVVYNSNFDLYSAPATNWRDSLLCYM 552

Query: 563 APNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVY 622
           AP PP  +D PE+CR+ILV+YSK+VM++GKLLFELLSEALGLNP++LN+I CS+GL  + 
Sbjct: 553 APTPPKTEDFPEVCREILVEYSKQVMKLGKLLFELLSEALGLNPSHLNDIDCSEGLIVLG 612

Query: 623 HYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDL 682
           HYYPACPQP+ TIG S+H+D  F+TVLLQDH+GGLQ+ HQNKWIDV  V GALVVNIGD 
Sbjct: 613 HYYPACPQPELTIGTSKHADNSFMTVLLQDHIGGLQVLHQNKWIDVPHVPGALVVNIGDF 672

Query: 683 MQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRET 722
             LI+NDRFKSV HRV++ H GPR+SVA  FST +LPS KLYGPIKELLSE+NP  YRET
Sbjct: 673 --LISNDRFKSVEHRVLANHRGPRVSVASFFSTGLLPSTKLYGPIKELLSEDNPPKYRET 732

BLAST of Cp4.1LG03g10080 vs. ExPASy TrEMBL
Match: A0A103Y5A5 (Non-heme dioxygenase N-terminal domain-containing protein OS=Cynara cardunculus var. scolymus OX=59895 GN=Ccrd_018938 PE=3 SV=1)

HSP 1 Score: 809 bits (2090), Expect = 2.33e-285
Identity = 401/705 (56.88%), Postives = 519/705 (73.62%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLI 82
           Y R  +LKAFD+TK GVK LVDAGI EIPRIF   PE        S T   IP++DL   
Sbjct: 10  YDRKAELKAFDETKGGVKALVDAGIQEIPRIFIHQPEPLPK----SSTPFEIPILDLGST 69

Query: 83  DK-STVDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTRDLTMP 142
           D+ STV+++REASE LGFFQ++NHGIPV V+ E+   VRRF+EQ+ E +K++YTRD +  
Sbjct: 70  DRASTVEKIREASETLGFFQVVNHGIPVTVMNEVIQGVRRFHEQDVEVKKRFYTRDPSNA 129

Query: 143 LIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGKLLFEL 202
           ++Y+SNFDLY++    WRDTF    AP+  +P  LPE+CRDI ++YS  VM++G +LF L
Sbjct: 130 VVYHSNFDLYTSPAAAWRDTFYTFMAPSPPSPEELPEVCRDIQIEYSNHVMKLGGVLFGL 189

Query: 203 LSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDHIGGL 262
           +SEAL LNP++L+D+DC++GLA   HYYP CPQP+L +G ++H+D  F+TVLLQD +GGL
Sbjct: 190 ISEALNLNPSHLSDLDCDKGLAFFGHYYPACPQPDLTMGATKHSDYGFLTVLLQDEVGGL 249

Query: 263 QIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGVFSTLS 322
           QI   N+W+DVPP  GALV+NIGD++Q+++NDK KSV+HRV+A +EGPRVSVA  FST  
Sbjct: 250 QILNNNQWIDVPPTPGALVINIGDILQMMSNDKLKSVEHRVVAKEEGPRVSVACFFSTSL 309

Query: 323 FPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQIN---GVDGINLTPLSKADENYYRPTE 382
            P + LYGPIKEL+S++NP  YRETTV D+ IQ +    VDG+++TP       Y RPTE
Sbjct: 310 APLTALYGPIKELVSDENPPRYRETTVHDY-IQYSFYREVDGVSVTPY------YDRPTE 369

Query: 383 LKAFDDTKAGVKGLVDAGITEIPRIFYHPLEEDDSGETQIRIPVIDLEGVAKDSLKRKDI 442
           LKAFD+TKAGVK L DAGI +IPRIF++  E      T   IPV+DL      S  R   
Sbjct: 370 LKAFDETKAGVKALADAGIQKIPRIFHNQPEPLPKSSTPFEIPVVDL-----GSTDRAST 429

Query: 443 VEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDLMKPFVYNS 502
           V +IREASE +GFFQ++NHGIP +V+ EM + VRRFH+QD EVKK+FYTRD  +  +YNS
Sbjct: 430 VAKIREASETVGFFQVVNHGIPVTVMNEMLQGVRRFHDQDVEVKKRFYTRDPSRAVIYNS 489

Query: 503 NFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLLFELLSEAL 562
           NFDL+S+   NWRDTF    AP+PP  ++LPE+CR+I V+YS  VM++G +LF L+SEAL
Sbjct: 490 NFDLFSSPAANWRDTFISLMAPSPPPLEELPEVCREIQVEYSNEVMKLGGVLFRLISEAL 549

Query: 563 GLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHVGGLQIRHQ 622
            LNPN+L ++ C   LAFV H YPACPQP  T+G ++H+D  FITVLLQD +GGLQI H 
Sbjct: 550 KLNPNHLGDLDCDKALAFVAHCYPACPQPDLTMGATKHTDDGFITVLLQDEIGGLQILHN 609

Query: 623 NKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFSTLVLPSNK 682
            +W+DV P  GALVVNIGDL+Q+I+ND+F+SV HRVV+  +GPR+SVA  FS+ + PS K
Sbjct: 610 QQWVDVPPTPGALVVNIGDLLQMISNDKFRSVEHRVVANEKGPRVSVACFFSSSLAPSTK 669

Query: 683 LYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYKL 722
           +YGPIKEL+S++NPA YRETTV D+     S GL G   L H KL
Sbjct: 670 VYGPIKELVSDDNPARYRETTVYDYIQYSLSKGLDGVPRLLHLKL 698

BLAST of Cp4.1LG03g10080 vs. ExPASy TrEMBL
Match: A0A5N6LA68 (Uncharacterized protein OS=Mikania micrantha OX=192012 GN=E3N88_45137 PE=3 SV=1)

HSP 1 Score: 786 bits (2031), Expect = 3.37e-276
Identity = 399/721 (55.34%), Postives = 510/721 (70.74%), Query Frame = 0

Query: 23  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPVIDLDLI 82
           + R T+LKAFD TK+GV+GLVDAGI ++PRIF + PE        + T   +P+IDL   
Sbjct: 9   FDRATELKAFDQTKSGVQGLVDAGIQQVPRIFINSPETIPK----ATTPFKLPIIDLSST 68

Query: 83  DKSTVDR-VREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESRKQYYTRDLTMP 142
           D+S+V + +R ASE LGFFQ+INHGIP+ ++ EM   VRRF+EQ  E +KQ+YTRD +  
Sbjct: 69  DRSSVVKNIRSASETLGFFQVINHGIPLSMMHEMLQGVRRFHEQNAEIKKQFYTRDASKT 128

Query: 143 LIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKRVMEVGKLLFEL 202
           ++YNSN DLY++ + NWRDTF    AP+      LPE+CRDI V+YS R++E+G LLF L
Sbjct: 129 VVYNSNADLYTSPSANWRDTFFTFMAPSAPRVEELPEVCRDIQVEYSSRMLELGGLLFRL 188

Query: 203 LSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDHIGGL 262
           +SE LGL+P+YL +IDC++GL  V H YP CPQP+L +G ++HTD+ F+TV+LQD IGGL
Sbjct: 189 ISEGLGLDPDYLGNIDCDKGLVFVGHCYPACPQPDLTMGATKHTDDGFLTVVLQDEIGGL 248

Query: 263 QIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPRVSVAGVFSTLS 322
           QI YEN+WVDVPP  GALVVNIGDL+Q+I+ND  KSV+HRV+AN++GPRVSVA  FST  
Sbjct: 249 QILYENQWVDVPPTPGALVVNIGDLLQMISNDILKSVEHRVVANEKGPRVSVACFFSTSL 308

Query: 323 FPNSKLYGPIKELLSEDNPAIYRETTVRDFS-IQINGVDGINL----------------- 382
            P++K+YGPIKEL+S++NP  YRETTV DFS   I+ VD I L                 
Sbjct: 309 APSTKVYGPIKELVSDENPPKYRETTVHDFSQFIIHYVDCILLFIGRTFAHNHQSINQFI 368

Query: 383 -TPLSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHPLEEDDSGETQIRIPV 442
              ++     Y R TELKAFD TK+GVKGLVDAGI ++PRIF +  E      T  ++P+
Sbjct: 369 TNMVTTKTTAYDRATELKAFDQTKSGVKGLVDAGIQQVPRIFINAPETIPKATTPFKLPI 428

Query: 443 IDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVK 502
           IDL      S     IV  +R ASE LGFFQ++NHGIP SV+ E  + VRRFHEQ+ E+K
Sbjct: 429 IDL-----GSTDISSIVNNVRSASETLGFFQVVNHGIPLSVMHETLQGVRRFHEQNVEIK 488

Query: 503 KQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKR 562
           KQFYTRD  K  VYNSNFDLYS+   NWRDTF    AP+ P  ++LPE+CRDI V+YS +
Sbjct: 489 KQFYTRDASKTVVYNSNFDLYSSPAANWRDTFFTFMAPSAPRVEELPEVCRDIQVEYSSQ 548

Query: 563 VMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFI 622
           V+E+G LLF L+SE LGL P+YL NI C  GL FV H YPACPQP  T+G ++H+D  F+
Sbjct: 549 VLELGGLLFRLISEGLGLEPDYLGNIDCDKGLTFVGHCYPACPQPDLTMGATKHTDDGFL 608

Query: 623 TVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPR 682
           TV+LQD +GGLQI H+N+W+DV P  GALV        +I+ND+ KSV HRVV+  +GPR
Sbjct: 609 TVVLQDEIGGLQILHENQWVDVPPTPGALV--------MISNDKLKSVEHRVVANEKGPR 668

Query: 683 ISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYK 722
           +SVA +FST + PS+K+YGPIKEL+S+ENP  YRETTV D+     S GL G   L H K
Sbjct: 669 VSVACLFSTSLAPSSKVYGPIKELVSDENPPKYRETTVHDYIQYSFSKGLDGVPRLLHLK 712

BLAST of Cp4.1LG03g10080 vs. ExPASy TrEMBL
Match: A0A0D3C845 (Uncharacterized protein OS=Brassica oleracea var. oleracea OX=109376 PE=4 SV=1)

HSP 1 Score: 769 bits (1986), Expect = 5.14e-269
Identity = 389/735 (52.93%), Postives = 515/735 (70.07%), Query Frame = 0

Query: 17  SKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYSPPEDFNSDDVSSETQIHIPV 76
           S A   + R   LKAFD+TK GVKGLVDAGI+EIP IF +PP    +    S +Q  IP 
Sbjct: 3   SSATVAFDRSVQLKAFDETKIGVKGLVDAGISEIPAIFRAPPATITTPKPPSSSQFTIPT 62

Query: 77  IDL----DLIDKST--VDRVREASEKLGFFQLINHGIPVDVLEEMKDAVRRFNEQETESR 136
           IDL    D I +    V+++ +A+E+ GFFQ+INHGI +DV E MK  VR F+EQ+ E R
Sbjct: 63  IDLQGGADSISRRDLLVEKIGDAAERWGFFQVINHGISLDVQERMKKGVREFHEQDPEVR 122

Query: 137 KQYYTRDLTMPLIYNSNFDLYSATTTNWRDTFGYISAPNLHNPASLPEICRDILVDYSKR 196
           K +Y+RD +  L+Y+SNFDLYS+   NWRDT G  +AP+   P  LP +C +++++YSK 
Sbjct: 123 KGFYSRDPSSKLVYSSNFDLYSSPAANWRDTLGCYTAPDPPRPEDLPAVCGEVMIEYSKE 182

Query: 197 VMEVGKLLFELLSEALGLNPNYLNDIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFI 256
           VM+VGK+LFELLSEALGLN N+L D+DC   L L+ +YYPPCPQP+L +G ++H+DN F+
Sbjct: 183 VMKVGKMLFELLSEALGLNTNHLKDMDCTNSLLLLGNYYPPCPQPDLTLGLTKHSDNSFL 242

Query: 257 TVLLQDHIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKEGPR 316
           TVLLQD +GGLQ+ ++  WVDVPPV GALVVN+GDL+QLITN KF SV+HRVLANK GPR
Sbjct: 243 TVLLQDQVGGLQVLHDQYWVDVPPVPGALVVNVGDLLQLITNGKFISVEHRVLANKAGPR 302

Query: 317 VSVAGVFSTLSFPNSKLYGPIKELLSEDNPAIYRETTVRDFSIQINGVDGINLTPLSKAD 376
           +SV   FS+    N ++YGPIKELLSE+NP IYR+TT+ ++S      +  N    +K  
Sbjct: 303 ISVGCFFSSYLMANPRVYGPIKELLSEENPPIYRDTTITEYSKLFYFFNTKNYMETTKIG 362

Query: 377 ENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFY-------HPLEEDDSGETQIRIPVI 436
            ++ R +ELKAFD+TK GVKGLVDAGIT++PRIF+       +P            IP I
Sbjct: 363 -SFDRISELKAFDETKTGVKGLVDAGITQLPRIFHDSPSNLSNPKPPSSDLLHLTTIPTI 422

Query: 437 DLEG-VAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVK 496
           DLEG V +D  KRK+ V+ IR+A+E+ GFFQ+INHG+   +LE M++ VRRF+EQ  EVK
Sbjct: 423 DLEGRVFEDETKRKNTVDGIRDAAEKWGFFQVINHGVSLDLLERMKDGVRRFNEQAPEVK 482

Query: 497 KQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKR 556
           KQFY+RD  + FVY SNFDLY+++  +WRDTFS   APNPP PQDLP ICRD++ +YSK+
Sbjct: 483 KQFYSRDFRREFVYTSNFDLYTSSAASWRDTFSCYMAPNPPKPQDLPAICRDVMFEYSKQ 542

Query: 557 VMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFI 616
            M +G+ LFEL+SEALGLN N+L  I CS GL  + HYYP CP+P  T+G ++HSD  F+
Sbjct: 543 AMSLGEFLFELISEALGLNRNHLKEIDCSKGLRMLCHYYPPCPEPDLTLGTTKHSDIAFL 602

Query: 617 TVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDL-------------MQLITNDRFKS 676
           TVLL D + GLQ+  +  W DV  V GAL++N+GDL             MQL+TND+F S
Sbjct: 603 TVLLPDQIEGLQVLREGYWFDVPHVPGALIINVGDLLQATRNKVVLVGLMQLVTNDKFIS 662

Query: 677 VNHRVVSKHEGP-RISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFR 722
             HRV++      R+SVA  F+T + P+ ++YGPI+EL+S++NP  YRE TV +F+    
Sbjct: 663 SEHRVLANRATKARVSVACFFTTGIRPNPRIYGPIRELVSKDNPPKYREITVMEFAAHRS 722

BLAST of Cp4.1LG03g10080 vs. TAIR 10
Match: AT1G06620.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 419.5 bits (1077), Expect = 5.8e-117
Identity = 198/353 (56.09%), Postives = 262/353 (74.22%), Query Frame = 0

Query: 375 RPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP----LEEDDSGETQIRIPVIDLEGVAK 434
           R T LKAFD+TK GVKGL+DAGITEIP IF  P            +   IP IDL+G   
Sbjct: 13  RSTLLKAFDETKTGVKGLIDAGITEIPSIFRAPPATLTSPKPPSSSDFSIPTIDLKGGGT 72

Query: 435 DSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDL 494
           DS+ R+ +VE+I +A+E+ GFFQ+INHGIP  VLE+M + +R FHEQDTEVKK FY+RD 
Sbjct: 73  DSITRRSLVEKIGDAAEKWGFFQVINHGIPMDVLEKMIDGIREFHEQDTEVKKGFYSRDP 132

Query: 495 MKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLL 554
               VY+SNFDL+S+   NWRDT    +AP+PP P+DLP  C +++++YSK VM++GKLL
Sbjct: 133 ASKMVYSSNFDLFSSPAANWRDTLGCYTAPDPPRPEDLPATCGEMMIEYSKEVMKLGKLL 192

Query: 555 FELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHV 614
           FELLSEALGLN N+L ++ C++ L  + HYYP CPQP  T+G+++HSD  F+T+LLQDH+
Sbjct: 193 FELLSEALGLNTNHLKDMDCTNSLLLLGHYYPPCPQPDLTLGLTKHSDNSFLTILLQDHI 252

Query: 615 GGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFS 674
           GGLQ+ H   W+DV PV GALVVN+GDL+QLITND+F SV HRV++   GPRISVA  FS
Sbjct: 253 GGLQVLHDQYWVDVPPVPGALVVNVGDLLQLITNDKFISVEHRVLANVAGPRISVACFFS 312

Query: 675 TLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYKL 723
           + ++ + ++YGPIKE+LSEENP  YR+TT+ +++  +RS G  GTS L + K+
Sbjct: 313 SYLMANPRVYGPIKEILSEENPPNYRDTTITEYAKFYRSKGFDGTSGLLYLKI 365

BLAST of Cp4.1LG03g10080 vs. TAIR 10
Match: AT1G06650.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 413.3 bits (1061), Expect = 4.2e-115
Identity = 197/367 (53.68%), Postives = 272/367 (74.11%), Query Frame = 0

Query: 366 LSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHPLEEDDSGET-------QI 425
           + K D  + R +ELKAFD+TK GVKGLVD+G++++PRIF+HP  +  + +          
Sbjct: 3   MMKIDPLFDRASELKAFDETKTGVKGLVDSGVSQVPRIFHHPTVKLSTPKPLPSDLLHLK 62

Query: 426 RIPVIDLEG-VAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQ 485
            IP IDL G   +D++KR + +E+I+EA+ + GFFQ+INHG+   +LE+M++ VR FHEQ
Sbjct: 63  TIPTIDLGGRDFQDAIKRNNAIEEIKEAAAKWGFFQVINHGVSLELLEKMKKGVRDFHEQ 122

Query: 486 DTEVKKQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILV 545
             EV+K+FY+RD  + F+Y SNFDL+S+   NWRDTFS   AP+ P PQDLPEICRDI++
Sbjct: 123 SQEVRKEFYSRDFSRRFLYLSNFDLFSSPAANWRDTFSCTMAPDTPKPQDLPEICRDIMM 182

Query: 546 DYSKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHS 605
           +YSK+VM +GK LFELLSEALGL PN+LN++ CS GL  + HYYP CP+P  T+G S+HS
Sbjct: 183 EYSKQVMNLGKFLFELLSEALGLEPNHLNDMDCSKGLLMLSHYYPPCPEPDLTLGTSQHS 242

Query: 606 DTDFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVV-S 665
           D  F+TVLL D + GLQ+R +  W DV  V+GAL++NIGDL+QLITND+F S+ HRV+ +
Sbjct: 243 DNSFLTVLLPDQIEGLQVRREGHWFDVPHVSGALIINIGDLLQLITNDKFISLEHRVLAN 302

Query: 666 KHEGPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTS 723
           +    R+SVA  F+T V P+ ++YGPI+EL+SEENP  YRETT++D++  F + GL GTS
Sbjct: 303 RATRARVSVACFFTTGVRPNPRMYGPIRELVSEENPPKYRETTIKDYATYFNAKGLDGTS 362

BLAST of Cp4.1LG03g10080 vs. TAIR 10
Match: AT5G59540.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 402.9 bits (1034), Expect = 5.7e-112
Identity = 194/353 (54.96%), Postives = 261/353 (73.94%), Query Frame = 0

Query: 378 ELKAFDDTKAGVKGLVDAGITEIPRIFYHPLE-----EDDSGETQIRIPVIDLEGVAKDS 437
           E KAFD+TK GVKGLVDA ITE+PRIF+H  +     +  +  + + IP+ID   V  D+
Sbjct: 14  ERKAFDETKQGVKGLVDAKITEVPRIFHHRQDILTNKKPSASVSDLEIPIIDFASVHADT 73

Query: 438 LKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVKKQFYTRDL-M 497
             R+ IVE+++ A E  GFFQ+INH IP +VLEE+++ VRRFHE+D EVKK F++RD   
Sbjct: 74  ASREAIVEKVKYAVENWGFFQVINHSIPLNVLEEIKDGVRRFHEEDPEVKKSFFSRDAGN 133

Query: 498 KPFVYNSNFDLYSAA-TTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKRVMEIGKLL 557
           K FVYNSNFDLYS++ + NWRD+FS   AP+PP P+++PE CRD + +YSK V+  G LL
Sbjct: 134 KKFVYNSNFDLYSSSPSVNWRDSFSCYIAPDPPAPEEIPETCRDAMFEYSKHVLSFGGLL 193

Query: 558 FELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFITVLLQDHV 617
           FELLSEALGL    L ++ C   L  + HYYP CPQP  T+GI++HSD  F+T+LLQD++
Sbjct: 194 FELLSEALGLKSQTLESMDCVKTLLMICHYYPPCPQPDLTLGITKHSDNSFLTLLLQDNI 253

Query: 618 GGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHEGPRISVAGIFS 677
           GGLQI HQ+ W+DV P+ GALVVNIGD +QLITND+F SV HRV++  +GPRISVA  FS
Sbjct: 254 GGLQILHQDSWVDVSPIHGALVVNIGDFLQLITNDKFVSVEHRVLANRQGPRISVASFFS 313

Query: 678 TLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHYKL 723
           + + P++++YGP+KEL+SEENP  YR+ T++++S  F   GL GTS L + ++
Sbjct: 314 SSMRPNSRVYGPMKELVSEENPPKYRDITIKEYSKIFFEKGLDGTSHLSNIRI 366

BLAST of Cp4.1LG03g10080 vs. TAIR 10
Match: AT2G30840.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 401.4 bits (1030), Expect = 1.6e-111
Identity = 197/362 (54.42%), Postives = 266/362 (73.48%), Query Frame = 0

Query: 370 DENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP-LEEDDS----GETQIRIPVID 429
           +  Y R +E+KAFD+ K GVKGL+DAG+T+IPRIF+HP L   DS      T + IP ID
Sbjct: 2   EATYDRASEVKAFDELKIGVKGLLDAGVTQIPRIFHHPHLNLTDSNLLLSSTTMVIPTID 61

Query: 430 LEGVAKD--SLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFHEQDTEVK 489
           L+G   D  ++ R+ ++  IR+A E  GFFQ+INHGI   V+E+M++ +R FHEQD++V+
Sbjct: 62  LKGGVFDEYTVTRESVIAMIRDAVERFGFFQVINHGISNDVMEKMKDGIRGFHEQDSDVR 121

Query: 490 KQFYTRDLMKPFVYNSNFDLYSAATTNWRDTFSHASAPNPPNPQDLPEICRDILVDYSKR 549
           K+FYTRD+ K   YNSNFDLYS+ + NWRDT S   AP+ P  +DLP+IC +I+++Y+KR
Sbjct: 122 KKFYTRDVTKTVKYNSNFDLYSSPSANWRDTLSCFMAPDVPETEDLPDICGEIMLEYAKR 181

Query: 550 VMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDTDFI 609
           VM++G+L+FELLSEALGLNPN+L  + C+ GL  + HYYP CP+P  T G S HSD  F+
Sbjct: 182 VMKLGELIFELLSEALGLNPNHLKEMDCTKGLLMLSHYYPPCPEPGLTFGTSPHSDRSFL 241

Query: 610 TVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVV-SKHEGP 669
           T+LLQDH+GGLQ+R    W+DV PV GAL+VN+GDL+QL+TND+F SV HRV+ +K E P
Sbjct: 242 TILLQDHIGGLQVRQNGYWVDVPPVPGALLVNLGDLLQLMTNDQFVSVEHRVLANKGEKP 301

Query: 670 RISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLKHY 723
           RISVA  F    LPS ++YGPIKELLSE+N   YR+TTV +++  + + GL G S L  +
Sbjct: 302 RISVASFF-VHPLPSLRVYGPIKELLSEQNLPKYRDTTVTEYTSHYMARGLYGNSVLLDF 361

BLAST of Cp4.1LG03g10080 vs. TAIR 10
Match: AT5G59530.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 396.0 bits (1016), Expect = 6.9e-110
Identity = 194/364 (53.30%), Postives = 262/364 (71.98%), Query Frame = 0

Query: 366 LSKADENYYRPTELKAFDDTKAGVKGLVDAGITEIPRIFYHP---LEEDDSGETQIRIPV 425
           ++K    + R  E KAFD+TK GVKGL+DA ITEIPRIF+ P   L +     + + IP 
Sbjct: 1   MAKNSVEFDRYIERKAFDNTKEGVKGLIDAKITEIPRIFHVPQDTLPDKKRSVSDLEIPT 60

Query: 426 IDLEGVAKDSLKRKDIVEQIREASEELGFFQLINHGIPASVLEEMRESVRRFH-EQDTEV 485
           ID   V  D+  R+ IVE+++ A E  GFFQ+INHG+P +VLEE+++ VRRFH E+D EV
Sbjct: 61  IDFASVNVDTPSREAIVEKVKYAVENWGFFQVINHGVPLNVLEEIKDGVRRFHEEEDPEV 120

Query: 486 KKQFYTRDLMK-PFVYNSNFDLYSAA-TTNWRDTFSHASAPNPPNPQDLPEICRDILVDY 545
           KK +Y+ D  K  F Y+SNFDLYS++ +  WRD+ S   AP+PP P++LPE CRD +++Y
Sbjct: 121 KKSYYSLDFTKNKFAYSSNFDLYSSSPSLTWRDSISCYMAPDPPTPEELPETCRDAMIEY 180

Query: 546 SKRVMEIGKLLFELLSEALGLNPNYLNNIGCSDGLAFVYHYYPACPQPKSTIGISEHSDT 605
           SK V+ +G LLFELLSEALGL    L ++ C   L  + HYYP CPQP  T+GIS+HSD 
Sbjct: 181 SKHVLSLGDLLFELLSEALGLKSEILKSMDCLKSLLMICHYYPPCPQPDLTLGISKHSDN 240

Query: 606 DFITVLLQDHVGGLQIRHQNKWIDVCPVAGALVVNIGDLMQLITNDRFKSVNHRVVSKHE 665
            F+TVLLQD++GGLQI HQ+ W+DV P+ GALVVN+GD +QLITND+F SV HRV++   
Sbjct: 241 SFLTVLLQDNIGGLQILHQDSWVDVSPLPGALVVNVGDFLQLITNDKFISVEHRVLANTR 300

Query: 666 GPRISVAGIFSTLVLPSNKLYGPIKELLSEENPAIYRETTVRDFSIQFRSDGL-GTSTLK 723
           GPRISVA  FS+ +  ++ +YGP+KEL+SEENP  YR+TT+R++S  +   GL GTS L 
Sbjct: 301 GPRISVASFFSSSIRENSTVYGPMKELVSEENPPKYRDTTLREYSEGYFKKGLDGTSHLS 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84MB38.2e-11656.091-aminocyclopropane-1-carboxylate oxidase homolog 1 OS=Arabidopsis thaliana OX=3... [more]
Q8H1S45.9e-11453.681-aminocyclopropane-1-carboxylate oxidase homolog 3 OS=Arabidopsis thaliana OX=3... [more]
Q9LTH78.0e-11154.961-aminocyclopropane-1-carboxylate oxidase homolog 12 OS=Arabidopsis thaliana OX=... [more]
Q9LTH89.7e-10953.301-aminocyclopropane-1-carboxylate oxidase homolog 11 OS=Arabidopsis thaliana OX=... [more]
Q9C5K78.2e-10853.531-aminocyclopropane-1-carboxylate oxidase homolog 2 OS=Arabidopsis thaliana OX=3... [more]
Match NameE-valueIdentityDescription
KAG7028836.10.068.971-aminocyclopropane-1-carboxylate oxidase-like 1, partial [Cucurbita argyrosperm... [more]
XP_021290758.12.89e-30856.86LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 [Herrania umbratica][more]
PRQ36474.11.33e-30160.27putative deacetoxyvindoline 4-hydroxylase [Rosa chinensis][more]
XP_002529305.32.06e-29255.68uncharacterized protein LOC8276157 [Ricinus communis][more]
XP_003626788.31.31e-29056.17uncharacterized protein LOC11434998 [Medicago truncatula][more]
Match NameE-valueIdentityDescription
A0A6J1AUS71.40e-30856.86LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 OS=Herrania umbratica ... [more]
A0A2P6QQL36.42e-30260.27Putative deacetoxyvindoline 4-hydroxylase OS=Rosa chinensis OX=74649 GN=RchiOBHm... [more]
A0A103Y5A52.33e-28556.88Non-heme dioxygenase N-terminal domain-containing protein OS=Cynara cardunculus ... [more]
A0A5N6LA683.37e-27655.34Uncharacterized protein OS=Mikania micrantha OX=192012 GN=E3N88_45137 PE=3 SV=1[more]
A0A0D3C8455.14e-26952.93Uncharacterized protein OS=Brassica oleracea var. oleracea OX=109376 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G06620.15.8e-11756.092-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G06650.24.2e-11553.682-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G59540.15.7e-11254.962-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G30840.11.6e-11154.422-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G59530.16.9e-11053.302-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 112..132
NoneNo IPR availablePANTHERPTHR10209:SF653CME8 PROTEINcoord: 365..721
coord: 15..358
NoneNo IPR availablePANTHERPTHR10209OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEINcoord: 365..721
coord: 15..358
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 36..353
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 386..710
IPR026992Non-haem dioxygenase N-terminal domainPFAMPF14226DIOX_Ncoord: 74..167
e-value: 2.3E-13
score: 50.9
coord: 420..524
e-value: 6.1E-19
score: 68.9
IPR027443Isopenicillin N synthase-like superfamilyGENE3D2.60.120.330coord: 29..352
e-value: 1.6E-99
score: 335.6
coord: 379..704
e-value: 2.6E-101
score: 341.5
IPR044861Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domainPFAMPF031712OG-FeII_Oxycoord: 224..316
e-value: 1.1E-25
score: 90.0
coord: 576..668
e-value: 5.6E-23
score: 81.3
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 570..672
score: 12.390281
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 218..321
score: 14.274066

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g10080.1Cp4.1LG03g10080.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1901576 organic substance biosynthetic process
molecular_function GO:0016706 2-oxoglutarate-dependent dioxygenase activity
molecular_function GO:0046872 metal ion binding