Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAATGTTCATGAAATGAAAAGATTGCATAGAGCTGCCAAACTAGAGAACCGGGAAGTGAATACCGACAAATTGATTATCCCACATCTTTATGGAAACTTCAAGAAGGATGGAGTCGAAATGGATAATCTAATAATGGAAATGTCAGACCTTTCAATCACTTCTATTATCAAGGAGCATGAAGAACATGAGATAGGATCCTTGGTATATCCCTACCCTCTAGGTTCTGAACTAAATAATTGGGAAACCATAGAGCTGCCTGTCCATTTCGAAGAAATTGAACAGTCATTTAAATTATCTTTATTTTTGCCTTGTTTCTTATTAGTTAGGAGTCCCCATCAGTTATGCCTAAGACTATGAGAAGTTTTCATTTACTAGTTCTTTGTTCTATCTTTGTTCTCTCCTTTCTCTGGCTATATGAAACGATTCAGGATTTACAGCAATGTAATTAAAAGCAAAAGTGATAGTAATTTTCAAACTGAGCTCAATGGTCCCATCATAGACAACGAACTTGACGAAGTTGAGAACAACGAGGCTATAGACGATGATATATCACCTGAACTTCTACGTTTAGTTCAAGAGGAGAAAAGAATTTATGTACCTAATCAAGAAGAAATTGAAACTATCAACTTAGGAACAGACAATGAAGTTAAAGAAGTTAGAATTGGAAACATTTATGGTGGTGAGGGACGAACCAAACTAATCTCCTTATTACAAGAATTTTGGGACGTATTCGCATAG
mRNA sequence
ATGAATGTTCATGAAATGAAAAGATTGCATAGAGCTGCCAAACTAGAGAACCGGGAAGTGAATACCGACAAATTGATTATCCCACATCTTTATGGAAACTTCAAGAAGGATGGAGTCGAAATGGATAATCTAATAATGGAAATGTCAGACCTTTCAATCACTTCTATTATCAAGGAGCATGAAGAACATGAGATAGGATCCTTGTTCTTTGTTCTATCTTTGTTCTCTCCTTTCTCTGGCTATATGAAACGATTCAGGATTTACAGCAATGTAATTAAAAGCAAAAGTGATAGTAATTTTCAAACTGAGCTCAATGGTCCCATCATAGACAACGAACTTGACGAAGTTGAGAACAACGAGGCTATAGACGATGATATATCACCTGAACTTCTACGTTTAGTTCAAGAGGAGAAAAGAATTTATGTACCTAATCAAGAAGAAATTGAAACTATCAACTTAGGAACAGACAATGAAGTTAAAGAAGTTAGAATTGGAAACATTTATGGTGGTGAGGGACGAACCAAACTAATCTCCTTATTACAAGAATTTTGGGACGTATTCGCATAG
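The gene sequence and the mRNA sequence above differ only by the intron. A minimal sketch of how that intron could be located, assuming the gene contains exactly one intron and the mRNA is otherwise identical; the function name `find_single_intron` and the toy sequences are hypothetical and are not taken from this page:

```python
def find_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return (start, end) such that gene[:start] + gene[end:] == mrna.

    Assumes a single contiguous insertion (one intron) in the gene.
    """
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")

    # Length of the shared prefix (first exon).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Length of the shared suffix (second exon), never overlapping the prefix.
    max_suffix = len(mrna) - prefix
    suffix = 0
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    start, end = prefix, len(gene) - suffix
    if gene[:start] + gene[end:] != mrna:
        raise ValueError("sequences do not differ by a single insertion")
    return start, end


# Toy example: exon "ATGGCC", intron "GTAAGTTTTCAG", exon "TTCTAA".
toy_gene = "ATGGCCGTAAGTTTTCAGTTCTAA"
toy_mrna = "ATGGCCTTCTAA"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

When the first bases of the intron happen to match the first bases of the downstream exon, the reported boundaries can shift by a few positions while still reproducing the mRNA, so the coordinates are best treated as approximate.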
Coding sequence (CDS)
ATGAATGTTCATGAAATGAAAAGATTGCATAGAGCTGCCAAACTAGAGAACCGGGAAGTGAATACCGACAAATTGATTATCCCACATCTTTATGGAAACTTCAAGAAGGATGGAGTCGAAATGGATAATCTAATAATGGAAATGTCAGACCTTTCAATCACTTCTATTATCAAGGAGCATGAAGAACATGAGATAGGATCCTTGTTCTTTGTTCTATCTTTGTTCTCTCCTTTCTCTGGCTATATGAAACGATTCAGGATTTACAGCAATGTAATTAAAAGCAAAAGTGATAGTAATTTTCAAACTGAGCTCAATGGTCCCATCATAGACAACGAACTTGACGAAGTTGAGAACAACGAGGCTATAGACGATGATATATCACCTGAACTTCTACGTTTAGTTCAAGAGGAGAAAAGAATTTATGTACCTAATCAAGAAGAAATTGAAACTATCAACTTAGGAACAGACAATGAAGTTAAAGAAGTTAGAATTGGAAACATTTATGGTGGTGAGGGACGAACCAAACTAATCTCCTTATTACAAGAATTTTGGGACGTATTCGCATAG
Protein sequence
MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEHEEHEIGSLFFVLSLFSPFSGYMKRFRIYSNVIKSKSDSNFQTELNGPIIDNELDEVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRTKLISLLQEFWDVFA
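The protein sequence follows from the CDS by codon-wise translation. A minimal sketch, assuming the standard genetic code (the page does not state which table was used); the names `CODON_TABLE` and `translate` are illustrative only:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed above.
print(translate("ATGAATGTTCATGAAATGAAAAGA"))  # MNVHEMKR
print(translate("TGGGACGTATTCGCATAG"))        # WDVFA
```

The two fragments reproduce the start (`MNVHEMKR`) and the end (`WDVFA`) of the protein sequence shown above.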
Homology
BLAST of ClCG07G004255 vs. NCBI nr
Match:
XP_020205364.1 (uncharacterized protein LOC109790590, partial [Cajanus cajan])
HSP 1 Score: 91.7 bits (226), Expect = 7.5e-15
Identity = 66/188 (35.11%), Positives = 100/188 (53.19%), Query Frame = 0
Query: 5 EMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEHEEHE 64
E + R A+LENRE K+ I HLY +F+ GV + + ++IKE ++
Sbjct: 806 EENKEKRLARLENREPKVQKIPICHLYQSFRSGGVVHADQV---------AMIKEDDDDN 865
Query: 65 IGSLFFVLSLFSPFSGY-MKRFRIYSNVIKSKSDSNFQTELNGPIIDNELDE-VENNEAI 124
+ ++ + + F + N I SKSD+N N I + D + E
Sbjct: 866 HANFIYLCDPCEELRNWKIVEFPVTLNSI-SKSDNNHAKNNNANIPCHNFDHPINEAEED 925
Query: 125 DDDIS--PELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRTKLISLL 184
DDD+S PE+ LV++E R+ P++EE+E INLG +N+ KE++IG GE R KL+ LL
Sbjct: 926 DDDLSTPPEIEMLVEQEDRVIQPHEEELEVINLGNENDKKEIKIGVAIKGEERNKLLKLL 983
Query: 185 QEFWDVFA 189
E+ DVFA
Sbjct: 986 FEYVDVFA 983
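The percentages in each HSP statistics line are the match counts divided by the alignment length. A small sketch that parses such a line and recomputes them, assuming the line layout shown in this report; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST tool:

```python
import re

# "Posi?tives" also tolerates the variant spelling "Postives".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    return {
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positive) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 66/188 (35.11%), Positives = 100/188 (53.19%), Query Frame = 0"
)
print(stats)
```

For the first hit, 66/188 gives 35.11% identity and 100/188 gives 53.19% positives, matching the reported values.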
BLAST of ClCG07G004255 vs. NCBI nr
Match:
XP_017978298.1 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC108662439 [Theobroma cacao])
HSP 1 Score: 86.7 bits (213), Expect = 2.4e-13
Identity = 71/209 (33.97%), Positives = 103/209 (49.28%), Query Frame = 0
Query: 7 KRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIM------------EMSDLSIT 66
+R R A+ + E+ + P+LY F+ G L+ SDLSI
Sbjct: 1105 RRKERLARFKGHELEIQGMTYPNLYETFRSGGCIFPELLTVGSRESVSALGEAFSDLSIC 1164
Query: 67 SIIKEHEEHE---------IGSLFFVLSLFSPFS--GYMKRF----RIYSNVIKSKSDSN 126
+ + E+ E +G LS ++ S Y+ +F RI +N + +DS
Sbjct: 1165 ATEEGEEQSENMDGIPTTYLGPPNLKLSSWTTMSLPLYIFQFPLPSRIPNNECEDDNDSG 1224
Query: 127 FQTELNGPIIDNELDEVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEV 186
F+ + +ELD EN E D D++P+LLRLV++E R VP+QE +ETINLG +
Sbjct: 1225 FEVDFEKGTSVSELDNTENVE--DYDLTPDLLRLVEQEGRQIVPHQETLETINLGNEENK 1284
Query: 187 KEVRIGNIYGGEGRTKLISLLQEFWDVFA 189
KEVRIG + KLI LL E+ DVFA
Sbjct: 1285 KEVRIGVTLVSMEKEKLIKLLHEYVDVFA 1311
BLAST of ClCG07G004255 vs. NCBI nr
Match:
EOY00620.1 (Uncharacterized protein TCM_010507 [Theobroma cacao])
HSP 1 Score: 86.3 bits (212), Expect = 3.1e-13
Identity = 70/207 (33.82%), Positives = 103/207 (49.76%), Query Frame = 0
Query: 7 KRLHRAAKLENREVNTDKLIIPHLYGNFKKDG-VEMDNLIME-----------MSDLSIT 66
+R R A+ + E+ + PHLY F+ G + ++L +E SDLSI
Sbjct: 1091 RRKERLARFKGHELEIRGMTYPHLYKTFRSGGCIFPESLTVENQESVSALGGTFSDLSIC 1150
Query: 67 SIIKEHEEHEIGSLFFVLSLFSPFSGYMKRF-------------RIYSNVIKSKSDSNFQ 126
+ +E EE + + F P + + + +I +N K +DS F+
Sbjct: 1151 A-TEEGEEQPRNADEIPTTYFGPPNLKLSNWTTMSLPVTCDSISKIPNNECKDDNDSGFE 1210
Query: 127 TELNGPIIDNELDEVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKE 186
+ +ELD EN E D D++P+LLRLV++E R VP+QE +ETINLG + KE
Sbjct: 1211 VDFEKGTSVSELDSTENVE--DYDLTPDLLRLVEQEGRQIVPHQEILETINLGDEENKKE 1270
Query: 187 VRIGNIYGGEGRTKLISLLQEFWDVFA 189
VRIG + KLI LL E+ DVFA
Sbjct: 1271 VRIGVTLVSMEKEKLIKLLHEYVDVFA 1294
BLAST of ClCG07G004255 vs. NCBI nr
Match:
XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])
HSP 1 Score: 85.1 bits (209), Expect = 7.0e-13
Identity = 59/193 (30.57%), Positives = 103/193 (53.37%), Query Frame = 0
Query: 1 MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEH 60
+ V +++ R ++ EN E + + I+P L +F+ G + E + S+ + + E
Sbjct: 1031 IRVRSLEKAKRLSRFENEERDYPRRIVPPLTHSFRSAG----TIHQEYDESSVVAAVTE- 1090
Query: 61 EEHEIGSLFFVLSLFSPFSGYM-----KRFRIYSNVIKSKSDSNFQTELNGPIIDNELDE 120
E ++G ++ S + SN + + D++ + EL+ PI
Sbjct: 1091 EREQVGPFVYLCPDGFELSNWSVIKLPSFVNNKSNNTEIECDNDSKYELDTPIY-----I 1150
Query: 121 VENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRTK 180
+E++E IDD+ S ELLR+++EE+++ P++E ET+NLG+ E KE++IG E R K
Sbjct: 1151 IESDEEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKK 1210
Query: 181 LISLLQEFWDVFA 189
LI LL E+ DVFA
Sbjct: 1211 LIELLHEYADVFA 1213
BLAST of ClCG07G004255 vs. NCBI nr
Match:
XP_022147181.1 (uncharacterized protein LOC111016192 [Momordica charantia])
HSP 1 Score: 85.1 bits (209), Expect = 7.0e-13
Identity = 63/194 (32.47%), Positives = 104/194 (53.61%), Query Frame = 0
Query: 1 MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEH 60
+ V +++ R ++ EN E + + +P L +F+ G+ + E + S+ + + E
Sbjct: 238 IRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI----IHQEYDESSVVAGVTE- 297
Query: 61 EEHEIGSLF------FVLSLFSPFSGYMKRFRIYSNVIKSKSDSNFQTELNGPIIDNELD 120
E ++G F LS +S SN + + D++ + EL+ PI +
Sbjct: 298 EREQVGPFVYPCPDGFELSNWSVLE-LPSFVNNKSNXXEIECDNDSKYELDTPIYN---- 357
Query: 121 EVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRT 180
+E++E IDD+ S ELLR++ EE+++ P++E ETINLG+ E KEV+IG E R
Sbjct: 358 -IESDEEIDDEFSXELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVKIGTNMSSESRK 417
Query: 181 KLISLLQEFWDVFA 189
KLI LL E+ DVFA
Sbjct: 418 KLIELLHEYADVFA 420
BLAST of ClCG07G004255 vs. ExPASy TrEMBL
Match:
A0A061E6J4 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_010507 PE=4 SV=1)
HSP 1 Score: 86.3 bits (212), Expect = 1.5e-13
Identity = 70/207 (33.82%), Positives = 103/207 (49.76%), Query Frame = 0
Query: 7 KRLHRAAKLENREVNTDKLIIPHLYGNFKKDG-VEMDNLIME-----------MSDLSIT 66
+R R A+ + E+ + PHLY F+ G + ++L +E SDLSI
Sbjct: 1091 RRKERLARFKGHELEIRGMTYPHLYKTFRSGGCIFPESLTVENQESVSALGGTFSDLSIC 1150
Query: 67 SIIKEHEEHEIGSLFFVLSLFSPFSGYMKRF-------------RIYSNVIKSKSDSNFQ 126
+ +E EE + + F P + + + +I +N K +DS F+
Sbjct: 1151 A-TEEGEEQPRNADEIPTTYFGPPNLKLSNWTTMSLPVTCDSISKIPNNECKDDNDSGFE 1210
Query: 127 TELNGPIIDNELDEVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKE 186
+ +ELD EN E D D++P+LLRLV++E R VP+QE +ETINLG + KE
Sbjct: 1211 VDFEKGTSVSELDSTENVE--DYDLTPDLLRLVEQEGRQIVPHQEILETINLGDEENKKE 1270
Query: 187 VRIGNIYGGEGRTKLISLLQEFWDVFA 189
VRIG + KLI LL E+ DVFA
Sbjct: 1271 VRIGVTLVSMEKEKLIKLLHEYVDVFA 1294
BLAST of ClCG07G004255 vs. ExPASy TrEMBL
Match:
A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)
HSP 1 Score: 85.1 bits (209), Expect = 3.4e-13
Identity = 59/193 (30.57%), Positives = 103/193 (53.37%), Query Frame = 0
Query: 1 MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEH 60
+ V +++ R ++ EN E + + I+P L +F+ G + E + S+ + + E
Sbjct: 1031 IRVRSLEKAKRLSRFENEERDYPRRIVPPLTHSFRSAG----TIHQEYDESSVVAAVTE- 1090
Query: 61 EEHEIGSLFFVLSLFSPFSGYM-----KRFRIYSNVIKSKSDSNFQTELNGPIIDNELDE 120
E ++G ++ S + SN + + D++ + EL+ PI
Sbjct: 1091 EREQVGPFVYLCPDGFELSNWSVIKLPSFVNNKSNNTEIECDNDSKYELDTPIY-----I 1150
Query: 121 VENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRTK 180
+E++E IDD+ S ELLR+++EE+++ P++E ET+NLG+ E KE++IG E R K
Sbjct: 1151 IESDEEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKK 1210
Query: 181 LISLLQEFWDVFA 189
LI LL E+ DVFA
Sbjct: 1211 LIELLHEYADVFA 1213
BLAST of ClCG07G004255 vs. ExPASy TrEMBL
Match:
A0A6J1D1K4 (uncharacterized protein LOC111016192 OS=Momordica charantia OX=3673 GN=LOC111016192 PE=4 SV=1)
HSP 1 Score: 85.1 bits (209), Expect = 3.4e-13
Identity = 63/194 (32.47%), Positives = 104/194 (53.61%), Query Frame = 0
Query: 1 MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEH 60
+ V +++ R ++ EN E + + +P L +F+ G+ + E + S+ + + E
Sbjct: 238 IRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI----IHQEYDESSVVAGVTE- 297
Query: 61 EEHEIGSLF------FVLSLFSPFSGYMKRFRIYSNVIKSKSDSNFQTELNGPIIDNELD 120
E ++G F LS +S SN + + D++ + EL+ PI +
Sbjct: 298 EREQVGPFVYPCPDGFELSNWSVLE-LPSFVNNKSNXXEIECDNDSKYELDTPIYN---- 357
Query: 121 EVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRT 180
+E++E IDD+ S ELLR++ EE+++ P++E ETINLG+ E KEV+IG E R
Sbjct: 358 -IESDEEIDDEFSXELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVKIGTNMSSESRK 417
Query: 181 KLISLLQEFWDVFA 189
KLI LL E+ DVFA
Sbjct: 418 KLIELLHEYADVFA 420
BLAST of ClCG07G004255 vs. ExPASy TrEMBL
Match:
A0A6A2WRD6 (Ribonuclease H OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00112928pilonHSYRG00014 PE=4 SV=1)
HSP 1 Score: 81.6 bits (200), Expect = 3.7e-12
Identity = 60/197 (30.46%), Positives = 100/197 (50.76%), Query Frame = 0
Query: 5 EMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDG----VEMDNLIMEMSDLSITSIIKEH 64
+ R R A+L E+ + P L+ FK G M L + DL++ ++ E
Sbjct: 1569 QKNRERRLARLTGNELQWKPMTFPPLFRTFKSGGYVFETTMSELQTALEDLNVNAVTTED 1628
Query: 65 EEHEIGSLFFVLSL-FSPFSGYMKRFR-IYSNVIKSKSDSNFQTELNGPIID-------N 124
++E S L F P S ++ +Y N+ + + N T P +D
Sbjct: 1629 VDNEDRSRICPLPHGFVPSSWSVEELPVVYKNLSEFPDNDNVGTNDPDPEVDFESTICLG 1688
Query: 125 ELDEVENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGE 184
E +E +N E + ++ ELLR++++E++ +P++E +E +NLGT+ + KEV+IG E
Sbjct: 1689 EFEECDNEE--EYNLPTELLRMIEKEEKQILPHEEAVEILNLGTEEDRKEVKIGTTLSTE 1748
Query: 185 GRTKLISLLQEFWDVFA 189
R LI+LLQE+ DVFA
Sbjct: 1749 ARKNLIALLQEYEDVFA 1763
BLAST of ClCG07G004255 vs. ExPASy TrEMBL
Match:
A0A6J1D7C7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1)
HSP 1 Score: 80.5 bits (197), Expect = 8.3e-12
Identity = 56/193 (29.02%), Positives = 96/193 (49.74%), Query Frame = 0
Query: 1 MNVHEMKRLHRAAKLENREVNTDKLIIPHLYGNFKKDGVEMDNLIMEMSDLSITSIIKEH 60
+ V M++ R ++ EN E + + +P L + + G + E + S+ + + E
Sbjct: 463 IRVRSMEKAKRLSRFENGERDYSRRTVPPLSHSLRSAG----TIHQEYDESSVAAAVTEE 522
Query: 61 EEHEIGSLFFVLSLFSPF-----SGYMKRFRIYSNVIKSKSDSNFQTELNGPIIDNELDE 120
E PF G+ + I+ +DS ++ +D +
Sbjct: 523 REQ-----------VEPFVYPCPDGFKLSNWSVNTEIECDNDSKYE-------LDTPIYN 582
Query: 121 VENNEAIDDDISPELLRLVQEEKRIYVPNQEEIETINLGTDNEVKEVRIGNIYGGEGRTK 180
+E++E IDD+ S ELLR+++EE+++ P++E ET+NLG+ E KE++IG E R K
Sbjct: 583 IESDEEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKK 633
Query: 181 LISLLQEFWDVFA 189
LI LL E+ DVFA
Sbjct: 643 LIELLHEYADVFA 633
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_020205364.1 | 7.5e-15 | 35.11 | uncharacterized protein LOC109790590, partial [Cajanus cajan]
XP_017978298.1 | 2.4e-13 | 33.97 | PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC108662439 [Theobroma ...
EOY00620.1 | 3.1e-13 | 33.82 | Uncharacterized protein TCM_010507 [Theobroma cacao]
XP_022147189.1 | 7.0e-13 | 30.57 | LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]
XP_022147181.1 | 7.0e-13 | 32.47 | uncharacterized protein LOC111016192 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A061E6J4 | 1.5e-13 | 33.82 | Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_010507 PE=4 SV=1
A0A6J1D099 | 3.4e-13 | 30.57 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A6J1D1K4 | 3.4e-13 | 32.47 | uncharacterized protein LOC111016192 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A6A2WRD6 | 3.7e-12 | 30.46 | Ribonuclease H OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00112928pilonHSYRG0001...
A0A6J1D7C7 | 8.3e-12 | 29.02 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1