CSPI04G21570 (gene) Wild cucumber (PI 183967)

NameCSPI04G21570
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
Description2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2
LocationChr4 : 20054812 .. 20061356 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGCCTTAAAAATAGGTTGTTTTGTTGCAAGTAATTCCAGCCCCAGGGGGAATATGAATATCAATTGGAAAACAAAAATTAGGTCAATCTTTACAAAATTCCCTCGAGATATCAATATCTTTGGGAAAGTGGAAAGTCTATTTTGCAGATTTTGCCCAATGATTTTACCGAAAACGCCATTTCGGACAGTAACCCACCAAATGAAAGGGAAGAAAACGGCGCAAAGTTCTTGGTCAATTTGACTCTCTCAACTCCCAATTCTCCTCCAACTCCTTTTCTCATCCTCTCCCTTCATTTTCTTTTTCTTTTTCTTTTTCTTTTTCTTTTTCTGGCGCTAATCTTCCTCTAAACCCTCTTAAATTTCCCATCAGCTCGATTTCCGTTCTTCCGCTCAGTTGCTGTCAAGAACATCAGGCAAACACCTCGCTCCAGCGATTTAGGACGGAAATTTCGTTATTTTCAGGTCTGTTTTTAAGTTACTCAATATTTGGCCATGGATTTAGGGCTTTGAATTTTATCCAAACCCGACCTTTTGGATGTTGCTCTACTCGATGGGTTTTGTTCGGTTTTACATTTTACAATCGGTTATCTTCTGTTTGGTCGCGTGACTTGGGGATCATACTGTTGTTTGCCTCGTGTTAATTCTGCAAAAGTTTTTAGTTGTTAGTGATAGATGAAATGAAGCATGAGTAGAGATTTGTTTGAGTTGGGTATGATAAAATATATGTTTGTTACTAATATTGATTATTCTTTTCGTATGTTTAGTTGACTTTTAAATTCCAGGTTTTAAGATATATATTAAACCGCACTGTAGTACATAACTCCATGAATTGGTATTTGTTTTTGCCCTCACTGATTGGGCTTTTCGCGTGTTCTTGTATGTAGATTGAGCTAAGTTGCTTGAATTTGAGGTATTTGGCCGTTGAAAGCATTTCATTTCATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAACTTACTGATATAGCACCATCTGAAGGAGTTCCATCTGAGTCTTTCAAATTGTCAGTCTCAACTCTGTCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCTGGTTTAGATTCTGCGCGTCTCTACTTCCACCAGAGAGCTGCATGTTCATCTGCTGAGTTGATGCAAAGCAATGATTCACGAGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACGTATGACTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACCGCCTGCAGGTTTGCCAGACATATTTGCCCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTATCTAAACTTGCGGAGCTCTCCTTTCACGGAAATACTTGATAATGTTCCTCTAAGAAGTAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCACGGAGAGCATCACCATAAGTTAACTGCTCAAGAGGATAGCCAGTTGGCCATGTATACGCCAGACCATGACAATCAAATTGATAAAAGTCTTATTACTCTGTTCAAGGCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGGGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACCGCAGGATATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTGTCATTCAAACTCATGCCTAAATCTATGACTAGCCTCAGTTGTTCAGAAATGAGAGCTGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACCAACTCTTTAATCGCCCAAATTTTCAGAATTTCAGTTTCTCTACATCCCAAGATGGTAGGCCAAATCTTTTTATAATTTCTAATTTTGGAGCCGATGAATGTCATTTGTTGAGGTTTTTCCCCTTTTTTCCTGTAACCTCCCTTGGTATGAGCTCGGCTCTCCCTTTTTTTGTAGTTTAATTACATCAATTAAATTGTTTCTTATAAAAAGAAAAAGGCGTTTATTGAGGGTTGATAATGCAATATTGATTTTCTTTTACATGTTAGTACTTAGCATGATGCTTGTGTAGGTCGTTGAAATTTAAATTCCCCTTCCCTTGATGTTTTACCTAGACATAATGAAGAGCCATCTGTTTCAATTGCCATTTTGCTATATGAGAATCTCTCATAATGAAAAAGCGTTGTGAGAAAATGATTTATGTTTTAACGGATACTTTTGGGTGAAATATGGACTGGAAGCCATGTATTTGCATTTGTTATTGCTTGATAGTTTGAATGCTTGCTTTCCTGATTATTTAGTTATTCATTCCCTCTTTATCCTCTTATTGGTTAGTGTTGCTTCTATAGTAATTTCTTAGCGTTAGTAGCCACCTTAGTGAGAAGCTTTCAATTAACAGGGTGTTGGATCGATGAAATGCTGGTACAGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAAGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGGTGGCCTCCTGGAGTGCCATTTGTTCATCCTCACGACCTACCTAATAAGGCAAAAATTGGTTTTCTCGAAGCTTACGAGCCTGGTTGGACAGATAGTCATGATGTTGAATTGAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTGTAAGTGAATCTCTGTCATATTCATATATGCGTGCATTTCGACTGGTTCTTGTTTCTTACTCCAACAAGTTCCTGCTTTTCAATCAATGTGAATAAAAAGCTTGCTGATTTTGGTTCAAATCGAAAAGTCCTTGTGGTATAGATACTTGTTTATTATAGTATTACCTTTGGAAAGAGAAATCATAGGTGTGCATCACTATTTTTCCTATTGTGTGAAGGTGGGTAAGCACATAATTCAAAATTGTAAGCATGTATATTCTTTCCCATTACATTATTATTCCCCTCCTTTCCCCTTTACCCATTTTTATCATCGTTTGCAGATTCATGTTTACTATTATTTCTCTTCTATCCCACTGAAACTGGCCAGTCTTTTCTTAATTTGGCTGCCGAAAATCTCTGAGATAATTTTATGATCAAGAAATTAGGAAGATTAACAGAAGTGGTTGATGCAAGAGATAGCTTTATTGCAATGCTGCATTTGATGTGCCTTAGCTATGCATTTGTGAGTGTAGAAAATTTGATATATGATTTGAACTTGTGTCCATGTCCTTTGTTTCTCATAAAAAAGTTGTGATCATCTCTGGTGGCTGTGGGAACTATTGACGGAGATTAATTTAGTTCACAAGTGATTTGGAGCCTAAGGTATATATTTCCTAGAGGCTTGAAGATAATTGTGTTGGAACTTTAGAATTCCGTGTCACACATGTTTCACGGACGTTGGGAGCTCTTTTGAGGAACAATTTTACAGGCACAACAATCTCTATGCCATTTTTTAGAAACGAAGGTTATTATTAAAAATGGCCAAAAGCCTCGACAAAACAACCTAAAGGACTAGAGGATAAGGCAGGGTCACAAAAAGACCTCCAATCTAAGTTAATGGCATCTATGTTGTAGTTATAAAACCAACTATGTTGAGGCCGTAAAACTACCTATGTTCTGCATATCATGGCTTTAACAAGTTCTATATTTTTACATTGAAAGAGACATGGCGTTGCTGTCAGGAGCTTGAAAGTTTAGTGTTGGTTAGGCAATTGCGTATTCCATTTTTAGAACATCATACTACAACTTGTAGATTAGTCTTCGTTCAGTTTGTGTTGTTGCACAAGCATCACCGACCCTACCTTTCAGCCTTGCACCTTAAAGCCAAGCTCACCTTTCTCTCGCTCCAAATATTTCTTTTTCATTTTTCTTGATCTAAGGGCTTCCTGAATGAATCATTTAGTTTTTCTTCCTCCCATTTTATTTTTGCTCTACCAGTCTATCTGTGTCTTCTTTTACTTAAAATGTTTGTTTTAGCTCTATTCTTTTTGGATGATGTTAAAATTAGTGGGCTTAGCCCGAGGAAGGATTTACCCTAATATCCTTTCTTTTCTTTTTTATCACAAGCTTTAACCTATTTATCTTCCCTCTTGTATCTTTTGTTAATATGGGAAATTAAAAAGAAAGTCAATCGTGGTTTTTCTTTCGGTACTCGGGTTTCTACGTATCTCGGTGTCTATTTTACTTTACGCCTTTTAACATGGTATCAGAGCAACGTAAAGACGAAACCCTAGACACAAGTGTAAATGAAACTTGAAACCCAAACAGAGACAGCCCCCGTCGCCGCCGCCATGGAAAAGCTGCTCCATCAGCTTCAGAAGACACTGACAATCATTGTGGGCCAACCCTCGGAGTCAGCCGTGCTGCCCCATGGGAAAAACCAGCCACATGCACTTCAATTGACCGCCCACACGCCGCCGCCCAACGCCTCCACTACTCAGCCGATCTACTTCCATTTGTCCACCTCCGTGCGCTGCCACCGCCCTCGGTTTATGGACTGCCATCGTTTCACTCCTCGCTGTCTTTTGATTCTTTACAGCCACACATCCATGGTATCGGGATCGACCAAGTCCATAATAAATTAGGGTTTGAAGTGGTGAATCTTTGGCGCAATCCAAACCAACCGACTTGCCTATGTATTTCAAGAACCCAGTAACTTTGCTCCCTAACTTGTCCTTAAATTATATAACTAGTTCTCTAGCACCATCTACAGGTGCCTTTTCAGGAGAGGAGCTGAATGGCCAAAATTACTTTTCCTGGTCTCAATTGATCAAGATGTTTCTCGTGGGTCGTCACCAATTCGGTTTTCTGACAGGGGAGAATCTCTCTTTCATCCCATGATGCCCTGGAACGTTTTTGAAGATGTGAGGACTCCTTTATTCAGTCCATGTTGATTAAGAGTATAGAACCACAAATAGGCAAGCCTTTGCTTTATGCAGCAACTGTGAAGGATCTATGGGACACAACTCATAAACTCTGTTTGAAGCGTCATAACGCTTCTCATTTGTATACGCTGAGAAAACAAGTCCATGATTGTAAGCAAAGAACTCTGGATGTAACCTTCTACTTCAATAAACTCTCTACTTTGGCAAGAGATGGACTTGTGCAAAGAAACAGTATGGGATACCTCGAAAGATGGTTTATAGTATGCTTGACTTGAAGAAGTTGATCGGATTTATGACTTCCTTGCAGGTCTCAGTCCCAAGTTTGACATTGTTTGCGGTTGTATACTTGGACAGAGACCTCTTCCCTCTTTAACGGAAGTGTGCTATGAAGTTTATCTTGAAGAAGACTGTATGAATGCCATAAGTGTGCTGGCTACTCTTGCTATTGACTTCGCTGCCTTCAGTGCTAGATCTCAACCCTTGACAGAGAAAAAATAACGGAAAACCGATCCCTAGCTATGAGCATTGGAAGAAACAGTGGGACACTAAGGATCAATGTTAGAAACTTCATGGTCGTCCCTCAGGAGGTTAGAAACGTTCCTCCAATTATAAACATAATTCAGGGCATGCTTATATGAGTGAGTCTGCTGGCAATTCTCAACCATCTGGCCCTACTGTAAATCAGAACGTTCCATCCTGCATTCTAGAAGCCATTGCTCAGTTAGGTATGTCTCAGTCTCTTAGTCTTATCAGTGTTGATGGGAAGAATCCTTGGATCCTCGAGTTGGAGCTACAGATCACTTGACAGGTTTCTATGAGAATTTTGTTTCTTATATTCCGTGTTCTGGTAATGAGAAGATCATAATAGTTGATGGTTCTTTAGTCCCAATTGTTAGGAAGGGACAGAATTTTCCTTTTGAAGGGCTATTGCTCCAGAATGTTTTGCATGTGCCTAAGATTTCCTATAATTTGCTATCTATAAGCAAGATCACTCCTAAGCTGAACTGCAAAGCTACTTTCTTACCTAAATTTGTTTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGATACTGCATGGTATAGTAGGGGACCTATCTCCTTAATGATGATACCTCCGGTAGTAGTATCTTTTGGACCAGTTTATTGTCATCATATTTTACTATTTTTTAATAAGATTGTATGTTGTGTCATTTTCGGTTGGGCCACCCAAACTTTAAATATATGAAGTATATATTTCCCCATCTCTTCTCTAAAGTTGATGTCTCCTCTTTATCTTGTGATATTTGTATACGGGCCAAACAACATCAGGTCTCCTTCCCCTCACAACCATATAAATCAACCCAACTGTTCACCCTTATCTATAGCGACATTTGGGGTCCATCCAAGG

mRNA sequence

ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAACTTACTGATATAGCACCATCTGAAGGAGTTCCATCTGAGTCTTTCAAATTGTCAGTCTCAACTCTGTCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCTGGTTTAGATTCTGCGCGTCTCTACTTCCACCAGAGAGCTGCATGTTCATCTGCTGAGTTGATGCAAAGCAATGATTCACGAGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACGTATGACTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACCGCCTGCAGGTTTGCCAGACATATTTGCCCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTATCTAAACTTGCGGAGCTCTCCTTTCACGGAAATACTTGATAATGTTCCTCTAAGAAGTAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCACGGAGAGCATCACCATAAGTTAACTGCTCAAGAGGATAGCCAGTTGGCCATGTATACGCCAGACCATGACAATCAAATTGATAAAAGTCTTATTACTCTGTTCAAGGCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGGGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACCGCAGGATATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTGTCATTCAAACTCATGCCTAAATCTATGACTAGCCTCAGTTGTTCAGAAATGAGAGCTGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACCAACTCTTTAATCGCCCAAATTTTCAGAATTTCAGTTTCTCTACATCCCAAGATGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAAGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGGTGGCCTCCTGGAGTGCCATTTGTTCATCCTCACGACCTACCTAATAAGGCAAAAATTGGTTTTCTCGAAGCTTACGAGCCTGGTTGGACAGATAGTCATGATGTTGAATTGAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTGA

Coding sequence (CDS)

ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAACTTACTGATATAGCACCATCTGAAGGAGTTCCATCTGAGTCTTTCAAATTGTCAGTCTCAACTCTGTCACATTCACTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCTTGTGATGGGGCTCTTTTAAGGTCTGGTTTAGATTCTGCGCGTCTCTACTTCCACCAGAGAGCTGCATGTTCATCTGCTGAGTTGATGCAAAGCAATGATTCACGAGAGTGGTGCAGAACTTCTGGTTACTATGTGGATGCTCAGATGTGGCAAGAAACGTATGACTATAGGCCTGGATTGACTCCAGTTGAGCCCAGCAATGGAATGGAGTTACCGCCTGCAGGTTTGCCAGACATATTTGCCCTCTATGGAAAGGCATCTCGAATTATTTTGGATGCCATCAGCTTCTATCTAAACTTGCGGAGCTCTCCTTTCACGGAAATACTTGATAATGTTCCTCTAAGAAGTAGGGAGATATCATCTTCTGTGTTGTCTGTGTGCTGTTATGGGAGACCCTCATTTCACGGAGAGCATCACCATAAGTTAACTGCTCAAGAGGATAGCCAGTTGGCCATGTATACGCCAGACCATGACAATCAAATTGATAAAAGTCTTATTACTCTGTTCAAGGCGGATAAGGCAGGTTTACTAATAAAGGATTTCAATGGTAGATGGATTCTTGTGGATGGGGATCTTGGCCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTATCAAGCAACCGCAGGATATGTGAATCCTGCTTTGCTCAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTGTCATTCAAACTCATGCCTAAATCTATGACTAGCCTCAGTTGTTCAGAAATGAGAGCTGCTGGCCATGGGGTAGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTTATGCAGAGATCACACTCAACTGACCAACTCTTTAATCGCCCAAATTTTCAGAATTTCAGTTTCTCTACATCCCAAGATGGATCCATAAAAATGAGGAGGAGAAAGAATAATTCAAGTACCAAACCTCTACCCCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAAAGTACAAGATATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAGGATTGTGAGAGTCACATTCACACATTAGATAGCCCTTGTGCCAGCACAAGAATGGAGATTGGGTGGCCTCCTGGAGTGCCATTTGTTCATCCTCACGACCTACCTAATAAGGCAAAAATTGGTTTTCTCGAAGCTTACGAGCCTGGTTGGACAGATAGTCATGATGTTGAATTGAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTGA
BLAST of CSPI04G21570 vs. TrEMBL
Match: A0A0A0KZ26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608060 PE=4 SV=1)

HSP 1 Score: 810.4 bits (2092), Expect = 1.1e-231
Identity = 393/394 (99.75%), Postives = 393/394 (99.75%), Query Frame = 1

Query: 80  MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR 139
           MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR
Sbjct: 1   MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR 60

Query: 140 IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 199
           IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS
Sbjct: 61  IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 120

Query: 200 QLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 259
           QLAMYT DHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA
Sbjct: 121 QLAMYTSDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 180

Query: 260 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 319
           TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP
Sbjct: 181 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 240

Query: 320 VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 379
           VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV
Sbjct: 241 VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 300

Query: 380 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 439
           LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA
Sbjct: 301 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 360

Query: 440 KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN
Sbjct: 361 KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 394

BLAST of CSPI04G21570 vs. TrEMBL
Match: A0A061F8X2_THECC (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_032425 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 2.0e-228
Identity = 385/475 (81.05%), Postives = 429/475 (90.32%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++ +NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E PP GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY PDH++Q+DK LI++ K+DKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWT +HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

BLAST of CSPI04G21570 vs. TrEMBL
Match: A0A061FH49_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_032425 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 2.0e-228
Identity = 385/475 (81.05%), Postives = 429/475 (90.32%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++ +NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E PP GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY PDH++Q+DK LI++ K+DKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWT +HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

BLAST of CSPI04G21570 vs. TrEMBL
Match: D7SWX7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00600 PE=4 SV=1)

HSP 1 Score: 790.8 bits (2041), Expect = 9.2e-226
Identity = 381/475 (80.21%), Postives = 429/475 (90.32%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGN LPSLGRVKL D+   EG+PS+S+KLSVSTLS SLAQYSAAIIQFP+ DGALLRSG
Sbjct: 1   MAGNSLPSLGRVKLCDLIACEGLPSDSYKLSVSTLSQSLAQYSAAIIQFPSSDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSA LYFHQRA+  +A+++ +N+SREWC+TSGYY D Q WQETYD+RPGLTP E ++G+
Sbjct: 61  LDSAHLYFHQRASYPAADMIHNNESREWCKTSGYYADPQQWQETYDFRPGLTPPESNSGL 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E PPAGLPDIF+L GKA+R ILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 EFPPAGLPDIFSLLGKAARDILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
           GRPSF G  HH LT QED QL M++ DH++Q+DKSLITL K+DKAGL ++DF+GRW+LVD
Sbjct: 181 GRPSFQGPQHHNLTTQEDGQLVMFS-DHEHQVDKSLITLVKSDKAGLHVRDFHGRWVLVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+AIVYPGLALYQATAGYV PAL RT+++N+QG+MYGRCSL+FKLMPKSMTSL+C
Sbjct: 241 GDLGPQEAIVYPGLALYQATAGYVGPALHRTEISNMQGNMYGRCSLAFKLMPKSMTSLNC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH T+QLFNR NF +F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTEQLFNRNNFPSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NNS  KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNLK+CESHIHTLDSPCA+TR
Sbjct: 361 NNSRCKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNLKECESHIHTLDSPCANTR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT +HD+ELSL EPGQ  Q S N
Sbjct: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLIEPGQASQHSAN 474

BLAST of CSPI04G21570 vs. TrEMBL
Match: A0A067JMA5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22651 PE=4 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 7.8e-225
Identity = 383/480 (79.79%), Postives = 431/480 (89.79%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKL+D+ PSEG+PS+S+KLSVSTLS SLAQ+SAAIIQF A DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLSDLIPSEGLPSDSYKLSVSTLSQSLAQFSAAIIQFSASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNG- 120
           LDSARLYFHQR +  SA+++ +ND REWC+TSGYY D Q++QETYDYRPGLTP EP+NG 
Sbjct: 61  LDSARLYFHQRPSYPSADMIHTND-REWCKTSGYYADPQLYQETYDYRPGLTPAEPNNGI 120

Query: 121 ----MELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVL 180
               ME PP GLPDIFAL GKA+R ILDAISFYLNLRSSPFTEILDNVPLRSREISSSVL
Sbjct: 121 DYKPMEFPPGGLPDIFALLGKAARDILDAISFYLNLRSSPFTEILDNVPLRSREISSSVL 180

Query: 181 SVCCYGRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGR 240
           SVCC+ RPSF G  HH LTAQED QL MY PDH+NQ+DKSLI+L K+DKAGL ++D++GR
Sbjct: 181 SVCCHARPSFQGAQHHNLTAQEDGQLVMY-PDHENQVDKSLISLVKSDKAGLHVRDYHGR 240

Query: 241 WILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSM 300
           W+LVDGDLGPQ+AIVYPGLALYQATAGY+NPAL RT++NN+QG+MYGRCSL+FKLMPKSM
Sbjct: 241 WVLVDGDLGPQEAIVYPGLALYQATAGYINPALQRTEINNVQGNMYGRCSLAFKLMPKSM 300

Query: 301 TSLSCSEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIK-- 360
           TSLSCSEMRAAGHGV+ QFQLPVPVDDFMQRSH  DQLFNR +FQ+F+F TSQ+GS+K  
Sbjct: 301 TSLSCSEMRAAGHGVEAQFQLPVPVDDFMQRSHPPDQLFNRHSFQSFNFPTSQEGSMKPM 360

Query: 361 MRRRKNNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSP 420
           M+RRKNNS  KPLPPSKRLRLEAQRVLKE+VQDIADKKG+KLRFCNLK+CE+HIH LDSP
Sbjct: 361 MKRRKNNSRCKPLPPSKRLRLEAQRVLKERVQDIADKKGVKLRFCNLKECENHIHALDSP 420

Query: 421 CASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           CA+ R+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT +H +ELSLTEPGQ  Q S N
Sbjct: 421 CANIRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTATHGMELSLTEPGQGSQHSAN 478

BLAST of CSPI04G21570 vs. TAIR10
Match: AT3G12940.1 (AT3G12940.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 713.8 bits (1841), Expect = 7.2e-206
Identity = 347/476 (72.90%), Postives = 410/476 (86.13%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNG+P+LGRVK+ D+ PSEG+PS+S+KL+V+TLS SLAQYSAAIIQFPA DGALLRSG
Sbjct: 2   MAGNGMPTLGRVKVCDLVPSEGLPSDSYKLAVTTLSQSLAQYSAAIIQFPASDGALLRSG 61

Query: 61  LDSARLYFHQRAACSSAE-LMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNG 120
           LDSARLYFHQR +  +   ++ +NDS+EWC+TSGYY D Q WQE+Y+YRPGLTP EPSN 
Sbjct: 62  LDSARLYFHQRDSYPATNNMIHTNDSQEWCKTSGYYADPQSWQESYEYRPGLTPTEPSNS 121

Query: 121 MELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCC 180
           ME PPAGLPDIFAL GKA+R++LDAI FYLNLRS PFTEILDNVPLR+ E+SSSVLSVCC
Sbjct: 122 MEFPPAGLPDIFALLGKAARVVLDAIGFYLNLRSCPFTEILDNVPLRNCEVSSSVLSVCC 181

Query: 181 YGRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILV 240
           Y RPSFHG  HH LT  ED QL +Y+ DHD+Q+DKSLI+  K+DKAGL I+D +G+WILV
Sbjct: 182 YARPSFHGAQHHSLT--EDEQLILYS-DHDHQLDKSLISFVKSDKAGLHIRDMHGQWILV 241

Query: 241 DGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLS 300
           D DLGPQ+A+VYPGLALYQATAGYV+PA+ RTD+N++QGS+ GR SL+FKLMPKSMT+LS
Sbjct: 242 DVDLGPQEAVVYPGLALYQATAGYVSPAVHRTDLNSLQGSIEGRFSLAFKLMPKSMTNLS 301

Query: 301 CSEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKM--RRR 360
           CSEMRAAGHGV+ QFQLPV VDDFMQRSHS D+LFNR   Q+F    SQDGS+K   +RR
Sbjct: 302 CSEMRAAGHGVEAQFQLPVSVDDFMQRSHSNDELFNRQTLQSFIVPQSQDGSMKQLKKRR 361

Query: 361 KNNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCAST 420
           K++S  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CE++ + ++SPCA+ 
Sbjct: 362 KSDSRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECENNHNVMNSPCANI 421

Query: 421 RMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           R EIGWP GVPFVHPHDLPNKAKIGFLE YEPGW+++HD+E SL+E  Q  Q  TN
Sbjct: 422 RREIGWPHGVPFVHPHDLPNKAKIGFLETYEPGWSETHDMEFSLSETAQGNQHVTN 474

BLAST of CSPI04G21570 vs. TAIR10
Match: AT3G19895.1 (AT3G19895.1 RING/U-box superfamily protein)

HSP 1 Score: 159.5 bits (402), Expect = 5.2e-39
Identity = 116/336 (34.52%), Postives = 167/336 (49.70%), Query Frame = 1

Query: 4   NGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDS 63
           +G P L RV+L++I P EG PS  +  +V  LS SL +Y+A++I+  + D AL+R GL++
Sbjct: 56  SGTP-LARVRLSEILPYEGAPSPVYAKAVEALSVSLMRYNASVIEIGSEDTALMRCGLEA 115

Query: 64  ARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELP 123
           ARLYF  R+   S +  +                         YR G +  +    ++  
Sbjct: 116 ARLYFRTRSLTVSGKGNRGLSM---------------------YRAGRSVED----LDSS 175

Query: 124 PAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRP 183
           P  + +IF   GK +R  L AI+ +L LRS  F  +LD+ PL   E+SSSVL +  Y   
Sbjct: 176 PPCMAEIFRCLGKVARAALSAIARHLRLRSDVFNHMLDDFPLAPNEVSSSVL-LASYAHA 235

Query: 184 SFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDL 243
           S     H        +++         +++K L+TLF +D  G+ + D NGRW   D   
Sbjct: 236 SIQNGKHASGGGNLSAKI---------EVEKGLLTLFCSDGTGIQVCDPNGRWYTADNGC 295

Query: 244 GPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGS-MYGRCSLSFKLMPKSMTSLSCSE 303
           G  D ++  G AL  ATAG    A  RT  +++  +   GR SL+F+LMPKS   L CS 
Sbjct: 296 GVGDLLLITGKALSHATAGLRPAASYRTTTDHLSATDTRGRASLAFRLMPKSNAILDCSP 354

Query: 304 MRAAGHGVDVQFQLPVPVDDFMQR-SHSTDQLFNRP 338
           + AAGH V  Q  +PV V  FM       D L N P
Sbjct: 356 IEAAGH-VIPQSYVPVSVSQFMDNLLAENDTLVNPP 354

BLAST of CSPI04G21570 vs. NCBI nr
Match: gi|449453784|ref|XP_004144636.1| (PREDICTED: uncharacterized protein LOC101216737 [Cucumis sativus])

HSP 1 Score: 960.7 bits (2482), Expect = 9.6e-277
Identity = 472/473 (99.79%), Postives = 472/473 (99.79%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM
Sbjct: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQLAMYT DHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLAMYTSDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of CSPI04G21570 vs. NCBI nr
Match: gi|659130950|ref|XP_008465436.1| (PREDICTED: uncharacterized protein LOC103503048 isoform X1 [Cucumis melo])

HSP 1 Score: 947.6 bits (2448), Expect = 8.4e-273
Identity = 463/473 (97.89%), Postives = 470/473 (99.37%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQ+NDSREWCRTSGYYVD QMWQETYDYRPGLTPVEPS+GM
Sbjct: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDTQMWQETYDYRPGLTPVEPSSGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQL+MY  DHDNQIDKSLITLFK+DKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLSMYASDHDNQIDKSLITLFKSDKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTNLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKN+
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNS 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCE+HIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCENHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of CSPI04G21570 vs. NCBI nr
Match: gi|659130954|ref|XP_008465438.1| (PREDICTED: uncharacterized protein LOC103503048 isoform X2 [Cucumis melo])

HSP 1 Score: 947.6 bits (2448), Expect = 8.4e-273
Identity = 463/473 (97.89%), Postives = 470/473 (99.37%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYFHQRAACSSAELMQ+NDSREWCRTSGYYVD QMWQETYDYRPGLTPVEPS+GM
Sbjct: 61  LDSARLYFHQRAACSSAELMQNNDSREWCRTSGYYVDTQMWQETYDYRPGLTPVEPSSGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY
Sbjct: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
           GRPSFHGEHHHKLTAQEDSQL+MY  DHDNQIDKSLITLFK+DKAGLLIKDFNGRWILVD
Sbjct: 181 GRPSFHGEHHHKLTAQEDSQLSMYASDHDNQIDKSLITLFKSDKAGLLIKDFNGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Sbjct: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTNLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNN 360
           SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKN+
Sbjct: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNS 360

Query: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRME 420
           SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCE+HIHTLDSPCASTRME
Sbjct: 361 SSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCENHIHTLDSPCASTRME 420

Query: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN
Sbjct: 421 IGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 473

BLAST of CSPI04G21570 vs. NCBI nr
Match: gi|700199772|gb|KGN54930.1| (hypothetical protein Csa_4G608060 [Cucumis sativus])

HSP 1 Score: 810.4 bits (2092), Expect = 1.6e-231
Identity = 393/394 (99.75%), Postives = 393/394 (99.75%), Query Frame = 1

Query: 80  MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR 139
           MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR
Sbjct: 1   MQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASR 60

Query: 140 IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 199
           IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS
Sbjct: 61  IILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQEDS 120

Query: 200 QLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 259
           QLAMYT DHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA
Sbjct: 121 QLAMYTSDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQA 180

Query: 260 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 319
           TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP
Sbjct: 181 TAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVP 240

Query: 320 VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 379
           VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV
Sbjct: 241 VDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRV 300

Query: 380 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 439
           LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA
Sbjct: 301 LKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKA 360

Query: 440 KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN
Sbjct: 361 KIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 394

BLAST of CSPI04G21570 vs. NCBI nr
Match: gi|590611970|ref|XP_007022253.1| (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 799.7 bits (2064), Expect = 2.8e-228
Identity = 385/475 (81.05%), Postives = 429/475 (90.32%), Query Frame = 1

Query: 1   MAGNGLPSLGRVKLTDIAPSEGVPSESFKLSVSTLSHSLAQYSAAIIQFPACDGALLRSG 60
           MAGNGLPSLGRVKLTD+ PSEG+PS+S+KLSVSTLS S AQY AAIIQFPA DGALLRSG
Sbjct: 1   MAGNGLPSLGRVKLTDLIPSEGLPSDSYKLSVSTLSQSFAQYCAAIIQFPASDGALLRSG 60

Query: 61  LDSARLYFHQRAACSSAELMQSNDSREWCRTSGYYVDAQMWQETYDYRPGLTPVEPSNGM 120
           LDSARLYF QRAA  SA+++ +NDSREWC+TSGYY D Q+WQETYDYRPGLTP EPSNGM
Sbjct: 61  LDSARLYFQQRAAYPSADMIHANDSREWCKTSGYYADPQLWQETYDYRPGLTPTEPSNGM 120

Query: 121 ELPPAGLPDIFALYGKASRIILDAISFYLNLRSSPFTEILDNVPLRSREISSSVLSVCCY 180
           E PP GLPDIFAL GKA+R ILDAIS+YLNLRSSPFTEILDN+PLRSREISSSVLSVCC+
Sbjct: 121 EFPPGGLPDIFALLGKAARDILDAISYYLNLRSSPFTEILDNIPLRSREISSSVLSVCCH 180

Query: 181 GRPSFHGEHHHKLTAQEDSQLAMYTPDHDNQIDKSLITLFKADKAGLLIKDFNGRWILVD 240
            RPSF G  HH LTAQ+D QL MY PDH++Q+DK LI++ K+DKAGL ++DF+GRWILVD
Sbjct: 181 ARPSFQGAQHHNLTAQDDGQLIMY-PDHEHQVDKCLISVVKSDKAGLHVRDFHGRWILVD 240

Query: 241 GDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC 300
           GDLGPQ+A+VYPGLALYQATAGYVNPAL RT++NN+ G++YGRCSL+FKLMPKSMTSLSC
Sbjct: 241 GDLGPQEAVVYPGLALYQATAGYVNPALHRTEINNMPGNLYGRCSLAFKLMPKSMTSLSC 300

Query: 301 SEMRAAGHGVDVQFQLPVPVDDFMQRSHSTDQLFNRPNFQNFSFSTSQDGSIK--MRRRK 360
           SEMRAAGHGV+ QFQLPVPVDDFMQRSH TD LFNR  FQ+F+F T+QDGS+K  MRRRK
Sbjct: 301 SEMRAAGHGVEAQFQLPVPVDDFMQRSHPTDHLFNRNTFQSFNFPTAQDGSMKPLMRRRK 360

Query: 361 NNSSTKPLPPSKRLRLEAQRVLKEKVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTR 420
           NN+  KPLPPSKRLRLEAQRVLKE+VQ+IADKKGIKLRFCNLK+CESHIH LDSPCA+ R
Sbjct: 361 NNTRCKPLPPSKRLRLEAQRVLKERVQEIADKKGIKLRFCNLKECESHIHALDSPCANIR 420

Query: 421 MEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTDSHDVELSLTEPGQVGQQSTN 474
           MEIGWP GVPFVHPHDLPNKAKIGFLEAYEPGWT +HD+ELSLTEPGQ  QQS N
Sbjct: 421 MEIGWPHGVPFVHPHDLPNKAKIGFLEAYEPGWTATHDMELSLTEPGQASQQSAN 474

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KZ26_CUCSA1.1e-23199.75Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608060 PE=4 SV=1[more]
A0A061F8X2_THECC2.0e-22881.052-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform ... [more]
A0A061FH49_THECC2.0e-22881.052-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=T... [more]
D7SWX7_VITVI9.2e-22680.21Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00600 PE=4 SV=... [more]
A0A067JMA5_JATCU7.8e-22579.79Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22651 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G12940.17.2e-20672.90 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT3G19895.15.2e-3934.52 RING/U-box superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449453784|ref|XP_004144636.1|9.6e-27799.79PREDICTED: uncharacterized protein LOC101216737 [Cucumis sativus][more]
gi|659130950|ref|XP_008465436.1|8.4e-27397.89PREDICTED: uncharacterized protein LOC103503048 isoform X1 [Cucumis melo][more]
gi|659130954|ref|XP_008465438.1|8.4e-27397.89PREDICTED: uncharacterized protein LOC103503048 isoform X2 [Cucumis melo][more]
gi|700199772|gb|KGN54930.1|1.6e-23199.75hypothetical protein Csa_4G608060 [Cucumis sativus][more]
gi|590611970|ref|XP_007022253.1|2.8e-22881.052-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G21570.1CSPI04G21570.1mRNA
CSPI04G21570.2CSPI04G21570.2mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33644FAMILY NOT NAMEDcoord: 1..70
score: 2.7E-161coord: 90..350
score: 2.7E
NoneNo IPR availablePANTHERPTHR33644:SF2SUBFAMILY NOT NAMEDcoord: 1..70
score: 2.7E-161coord: 90..350
score: 2.7E