CSPI05G12520.1 (mRNA) Cucumber (PI 183967) v1

Overview
NameCSPI05G12520.1
TypemRNA
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProcollagen-proline 3-dioxygenase
LocationChr5: 11787099 .. 11789939 (-)
Sequence length949
RNA-Seq ExpressionCSPI05G12520.1
SyntenyCSPI05G12520.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGTATGTGTACCAATCTTTCCTAAGAAAGTTGACGTCATTCTTCCTACTCTGGATAGGCTATTATTTAATTTCAATTGTTGTCTCATATTACTAATATATTTTTTCTACAGTATTTCTCGAGTTTTTCTTTTCAATTTTTCTTTCTGGCAAGGGGAATGGTGGCTTTGTATTTTTTAACAATTCCTTTACCTTCTCTCTGAAATTCATTTGGTATGTCAGCTTTATTATTACAGTACTCTAGCTGATAGTTATCTTCGTGTGTTTGGCAAAAGGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGTAAGTTAGATGAAGGCTGTTTACCTCCTTGACTTATTGTTATTATTTAATGTTGATGGTACATCGTAACTTTATCTTATCGAAAATGGTACATATGCGATTGCTACCAATCTGAGTTGTAGCTATGCTGGTCAAGACATTATAATGTTGACCAAAAGGTTAAAGGTTTTAATCCTCTCCATATATTGTTGTGCTCGAAAATGGTACATTTTCTATTTTGAACTCTGAAGAATCTGCTTGTGCTCTTTTGAGCCAAGAAGTCTGCAGCATTTTATGGTAAATAGAAAAAATAAGTGGAGCTAGTCCTTTACATTTTGAATCGAGGCCCATGTTTCATTGCTCTTCTTGTTGTTACCTTTCTAATTATTTTAGTAATTATGCCTTATTTATAAATTCTAATCAGTTCTAGTCTTGGGGGATGATGAACTGTTGGGTTTCTTGTTTGATGTCAATATATATAAGTAATGTGGAACAGGGCACATTTTCGGAATTGTTTGATTTTGGTGGTGGATTTTTAATTCATTCGACGGTTTTCCTTTTTACAAGAAATGCTTGCGACTAAATTAGTCTGAATAATTGATGATGTTGGATCATAGTTCCTAGTAGTTTGAAATTTGAAACAATGACAGATCAACTATAGTTAATACTCATTTCACTTTATCAGGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGGTATGCCCAAGCACTGAGACCCTCTGAATCTGTGTGAAACCATTTTACAAAAAGGAAAACCGTTTGATAGACTAATCACAGCCTTTCTAAAATTAATGCTAAATATTGGCTAGGCAAAAAGATGTTGCATCCGTTCACTGAGTTTATATTTTAATATAATGGAGTAATTTCATGTACATCCATGACCCACAAGTCCAGTTACCACTATCATTTTAAGATATCAGTGGATAACAAAAAGTGTCTAATTGTTTCTAGAACTTCTATAGGTGCTTGTAAAGAAGATCGCAATTCTTAATTTCTTAACTTGGAGACAGCAACACATGCTTGTTGACCCTTTCTGGATGCGGCTCTAAATGAGGCTAAGCCTCGATGCCTAGTGTTTGATAGACTAATCACAGCCTTTCTAAAATTAATTGGCATCAGTAGGATGAGGATCATATACAAAAAATTGCTCTTCACAGTTAACACAAACATTTTCTTAATATAAGCAATTAATGGGTGGGGATCTGAACTTTGATCGATTCTTTAACCCAAACCAAATCTTAGTCAAGTTTTCTATTGAGTCTTTTTTTGGTTATTTTTTGGAAAGCAAGATAAATAGGAGCACATTAATTCTTTCTTGCTTTTTTGCTTTTTACAGGTAACTGAGGAATCATGCATTTGATTAATCCTCAAGAGAGTTGCTAGTAGACCTCTTAGCACAGTTGTTCTTCATCTTGCTTACTTCATGGTTTCCTTGGATAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTATTCCATCGAAATTACCCTTTAACATTTATTTCCCCTTCTGGTTATAGGGGAAACTGGGGGATTGTAGTGTGACGAAAAACATGCTATATCATCAATCGTGAAAGGCTACTTTGAATAACTTGCTTTGTTTTATAGGGTTTTCTCATTGAAAAATTCTACTGAAACTAAAAGTAAATTGCGTTTACTACATATGCTTAATGGTTAAACAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGAATAGCCCAAATACTTAAACTAGCAGTAGCTTCGAGG

mRNA sequence

CTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGAATAGCCCAAATACTTAAACTAGCAGTAGCTTCGAGG

Coding sequence (CDS)

TGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGA

Protein sequence

WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS*
Homology
BLAST of CSPI05G12520.1 vs. ExPASy Swiss-Prot
Match: Q5XGE0 (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenopus tropicalis OX=8364 GN=ogfod3 PE=2 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 3.3e-07
Identity = 26/85 (30.59%), Postives = 48/85 (56.47%), Query Frame = 0

Query: 9   WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDN 68
           WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ ++N
Sbjct: 225 WHPHIDKVTYGSFDYTSLLYLSDYSQDFGGGRFVFIDEGANRTVEPRTGRLSFFTSGSEN 284

Query: 69  VHSVDEITNGERLTLTLWFTRDSSH 93
           +H V++++ G R  +T+ FT +  H
Sbjct: 285 LHRVEKVSWGTRYAITISFTCNPEH 309

BLAST of CSPI05G12520.1 vs. ExPASy TrEMBL
Match: A0A0A0KMN7 (Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=3 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 1.6e-182
Identity = 303/303 (100.00%), Postives = 303/303 (100.00%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 109 WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 168

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP
Sbjct: 169 YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 228

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI
Sbjct: 229 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 288

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 289 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 348

Query: 241 LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS 300
           LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
Sbjct: 349 LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS 408

Query: 301 LDS 304
           LDS
Sbjct: 409 LDS 411

BLAST of CSPI05G12520.1 vs. ExPASy TrEMBL
Match: A0A5D3CRE9 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold186G001030 PE=3 SV=1)

HSP 1 Score: 617.8 bits (1592), Expect = 2.3e-173
Identity = 290/304 (95.39%), Postives = 295/304 (97.04%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           Y AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFP+SCLPQPP
Sbjct: 154 YRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 393

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 397

BLAST of CSPI05G12520.1 vs. ExPASy TrEMBL
Match: A0A5A7SSL8 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold452G001280 PE=3 SV=1)

HSP 1 Score: 615.5 bits (1586), Expect = 1.2e-172
Identity = 289/304 (95.07%), Postives = 295/304 (97.04%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPP
Sbjct: 154 YTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 393

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 397

BLAST of CSPI05G12520.1 vs. ExPASy TrEMBL
Match: A0A1S3C486 (Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 SV=1)

HSP 1 Score: 615.5 bits (1586), Expect = 1.2e-172
Identity = 289/304 (95.07%), Postives = 295/304 (97.04%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPP
Sbjct: 154 YTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 393

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 397

BLAST of CSPI05G12520.1 vs. ExPASy TrEMBL
Match: A0A1S3C4U3 (uncharacterized protein LOC103496668 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496668 PE=4 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 1.9e-146
Identity = 257/304 (84.54%), Postives = 263/304 (86.51%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFS                                DCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFS--------------------------------DCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPP
Sbjct: 154 YTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 365

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 365

BLAST of CSPI05G12520.1 vs. NCBI nr
Match: XP_004140463.1 (prolyl 3-hydroxylase 1 [Cucumis sativus] >KAE8648112.1 hypothetical protein Csa_004690 [Cucumis sativus])

HSP 1 Score: 648.3 bits (1671), Expect = 3.3e-182
Identity = 303/303 (100.00%), Postives = 303/303 (100.00%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP
Sbjct: 154 YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS 300
           LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
Sbjct: 334 LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS 393

Query: 301 LDS 304
           LDS
Sbjct: 394 LDS 396

BLAST of CSPI05G12520.1 vs. NCBI nr
Match: TYK14443.1 (prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 617.8 bits (1592), Expect = 4.8e-173
Identity = 290/304 (95.39%), Postives = 295/304 (97.04%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           Y AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFP+SCLPQPP
Sbjct: 154 YRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 393

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 397

BLAST of CSPI05G12520.1 vs. NCBI nr
Match: XP_008456831.1 (PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] >KAA0032195.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 615.5 bits (1586), Expect = 2.4e-172
Identity = 289/304 (95.07%), Postives = 295/304 (97.04%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPP
Sbjct: 154 YTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 393

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 397

BLAST of CSPI05G12520.1 vs. NCBI nr
Match: XP_038893062.1 (uncharacterized protein LOC120081945 [Benincasa hispida] >XP_038893063.1 uncharacterized protein LOC120081945 [Benincasa hispida])

HSP 1 Score: 571.6 bits (1472), Expect = 4.0e-159
Identity = 271/303 (89.44%), Postives = 282/303 (93.07%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVM
Sbjct: 96  WTRGASIGWHSDDNRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVM 155

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS LHDR PDS LPQPP
Sbjct: 156 YTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPP 215

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFS EDDPNFK GFDICWARL ALGYD+YF GDH FSEYPDLF +DVQLV G+K+
Sbjct: 216 SCNMYWFSLEDDPNFKSGFDICWARLHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKL 275

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQ+FENILHLLQVVQFLCWKGKELDSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D 
Sbjct: 276 FFQEFENILHLLQVVQFLCWKGKELDSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDV 335

Query: 241 LAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS 300
           LAESVFSSA SDGKENQ WLGWDKL AAAAAWE YASILRRELLGS S+WRN QSIYSVS
Sbjct: 336 LAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVS 395

Query: 301 LDS 304
           L S
Sbjct: 396 LSS 398

BLAST of CSPI05G12520.1 vs. NCBI nr
Match: XP_008456833.1 (PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo])

HSP 1 Score: 528.5 bits (1360), Expect = 3.9e-146
Identity = 257/304 (84.54%), Postives = 263/304 (86.51%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           WTRGASIGWHSDDNRPYLKQREFS                                DCVM
Sbjct: 94  WTRGASIGWHSDDNRPYLKQREFS--------------------------------DCVM 153

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPP
Sbjct: 154 YTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPP 213

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKI 180
           SCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKI
Sbjct: 214 SCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKI 273

Query: 181 FFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDG 240
           FFQKFENILHLLQVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDG
Sbjct: 274 FFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDG 333

Query: 241 LAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV 300
           LAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Sbjct: 334 LAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSV 365

Query: 301 SLDS 304
           SLDS
Sbjct: 394 SLDS 365

BLAST of CSPI05G12520.1 vs. TAIR 10
Match: AT1G68080.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 285.0 bits (728), Expect = 6.9e-77
Identity = 147/305 (48.20%), Postives = 196/305 (64.26%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           W +GASIGWHSDDNR YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +M
Sbjct: 90  WCKGASIGWHSDDNRSYLKQRDFAAVCYLNSYEKDFIGGLFRFQSGEPVTVAPSAGDVIM 149

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P 
Sbjct: 150 YTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQCTSH----EVCLPLPA 209

Query: 121 SCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGD 180
           S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G 
Sbjct: 210 STNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKGG 269

Query: 181 KIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSK 240
           K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  
Sbjct: 270 KLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFLL 329

Query: 241 NDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY 300
           ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+
Sbjct: 330 DENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTIH 389

Query: 301 SVSLD 303
            V  D
Sbjct: 390 KVESD 389

BLAST of CSPI05G12520.1 vs. TAIR 10
Match: AT1G68080.3 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 243.0 bits (619), Expect = 3.0e-64
Identity = 132/305 (43.28%), Postives = 181/305 (59.34%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +M
Sbjct: 90  WCKGASIGWHSDDNRSYLKQRDFAS--------------------GEPVTVAPSAGDVIM 149

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P 
Sbjct: 150 YTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQCTSH----EVCLPLPA 209

Query: 121 SCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGD 180
           S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G 
Sbjct: 210 STNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKGG 269

Query: 181 KIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSK 240
           K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  
Sbjct: 270 KLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFLL 329

Query: 241 NDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY 300
           ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+
Sbjct: 330 DENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTIH 369

Query: 301 SVSLD 303
            V  D
Sbjct: 390 KVESD 369

BLAST of CSPI05G12520.1 vs. TAIR 10
Match: AT1G68080.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 203.0 bits (515), Expect = 3.5e-52
Identity = 117/304 (38.49%), Postives = 164/304 (53.95%), Query Frame = 0

Query: 1   WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVM 60
           W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +M
Sbjct: 90  WCKGASIGWHSDDNRSYLKQRDFAS--------------------GEPVTVAPSAGDVIM 149

Query: 61  YTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPP 120
           YTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ                
Sbjct: 150 YTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQC--------------- 209

Query: 121 SCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDK 180
                             FD+C ARL  LG+D++   G+   ++  +     +QL  G K
Sbjct: 210 ------------------FDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKGGK 269

Query: 181 IFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKN 240
           +  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  +
Sbjct: 270 LLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFLLD 329

Query: 241 DGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYS 300
           + L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ 
Sbjct: 330 ENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTIHK 339

Query: 301 VSLD 303
           V  D
Sbjct: 390 VESD 339

BLAST of CSPI05G12520.1 vs. TAIR 10
Match: AT1G68080.4 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 192.2 bits (487), Expect = 6.1e-49
Identity = 106/246 (43.09%), Postives = 148/246 (60.16%), Query Frame = 0

Query: 60  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQP 119
           MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P
Sbjct: 1   MYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQCTSH----EVCLPLP 60

Query: 120 PSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWG 179
            S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G
Sbjct: 61  ASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKG 120

Query: 180 DKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFS 239
            K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F 
Sbjct: 121 GKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFL 180

Query: 240 KNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSI 299
            ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I
Sbjct: 181 LDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTI 240

Query: 300 YSVSLD 303
           + V  D
Sbjct: 241 HKVESD 241

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q5XGE03.3e-0730.592-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenop... [more]
Match NameE-valueIdentityDescription
A0A0A0KMN71.6e-182100.00Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=... [more]
A0A5D3CRE92.3e-17395.39Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5A7SSL81.2e-17295.07Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A1S3C4861.2e-17295.07Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 S... [more]
A0A1S3C4U31.9e-14684.54uncharacterized protein LOC103496668 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
XP_004140463.13.3e-182100.00prolyl 3-hydroxylase 1 [Cucumis sativus] >KAE8648112.1 hypothetical protein Csa_... [more]
TYK14443.14.8e-17395.39prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa][more]
XP_008456831.12.4e-17295.07PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] >KAA00... [more]
XP_038893062.14.0e-15989.44uncharacterized protein LOC120081945 [Benincasa hispida] >XP_038893063.1 unchara... [more]
XP_008456833.13.9e-14684.54PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo][more]
Match NameE-valueIdentityDescription
AT1G68080.16.9e-7748.202-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.33.0e-6443.282-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.23.5e-5238.492-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.46.1e-4943.092-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 3..86
e-value: 8.7E-11
score: 42.6
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 2..104
e-value: 3.1E-19
score: 71.4
NoneNo IPR availablePANTHERPTHR14049:SF92-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEINcoord: 3..299
IPR039575Prolyl 3-hydroxylasePANTHERPTHR14049LEPRECAN 1coord: 3..299
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 1..88
score: 8.550411

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI05G12520CSPI05G12520gene


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G12520.1.utr3p1CSPI05G12520.1.utr3p1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G12520.1.cds5CSPI05G12520.1.cds5CDS
CSPI05G12520.1.cds4CSPI05G12520.1.cds4CDS
CSPI05G12520.1.cds3CSPI05G12520.1.cds3CDS
CSPI05G12520.1.cds2CSPI05G12520.1.cds2CDS
CSPI05G12520.1.cds1CSPI05G12520.1.cds1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G12520.1.utr5p1CSPI05G12520.1.utr5p1five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI05G12520.1CSPI05G12520.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032963 collagen metabolic process
biological_process GO:0019511 peptidyl-proline hydroxylation
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0019797 procollagen-proline 3-dioxygenase activity