Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCACTAATAAACAAAAGGAAGAAGCGAGGTCAAGGAAGCCCCGCCAACGGAGGAACCTCCAGATGGAAGAACTACCCACTTTCACCAAATGGTTAACCTCATTCGGCCATTCACCTTCTACCAACGATTCCAATCCCAAATCTAATCTTCTTACTCCCAATCGCCCCTTTTTGGAGACACCTCTCGCCGTTCCAGATTCGTCGCAGGATGCTGGCCCAGCTGTCGCCGATGTTCCTCCCAAGGACGACAGCGTGCAGGCTGTGGATGATTCTATATCTGCGCGAGCTGGTTGTTGTTGTTGGCAAACTTCCAAATCTACGCGCCGAGAGTGTGCTCTTAAATTTCATCTCTCTCTAAGGAAGAGAAAGGTGGTTACCAATGAAGCAGGGGTTGTTACCGCCGTTTCGAATCTGCAGGAGGAGGAGACAATGGTGGCTGTTGGCGATGTTCTTGATAAGGAGGGGTGTGGGTGCAGGTGTTGGCCAACGTTCCAGATCTGCGGGAGAACAAAAGGCACGGTCGTCGGCACTTCGGATCTTCAGAAAAAGGAGGGAGCGGTTACCGACGGCCTGAATCTGCAAAAGGAGGAAGTGGCTACTAATGGTACAGATTTGGTGAAGAAAGAGGAAAAGGTGGTTACCAATGCTTCTTCAAAGGCAAGGATTGTTACCGCCGCTTCGAATCTGCAAGAGGAGGAGACGGTGGTGGCTGTCGGCGATGTTCTTGATAAGAACGGGTGTACGTGTAGGTGTTGGCCAACATTCCAGATCTGCGGGAGAAGAAAAGGCATAGTCGTCGGCACTTCAGATCTTCAGAAAGAGGAGGGAGCGGTTACCGACAGCCTAAATCTGGGGAAGGAGGAAGTAGCTACTAATGGTCCAAATTTGGTGAAGGAAGAGGAAGTTGTCGTCGTCGTCGATCCAGACCTGCGTAAGGAAGAGGGGAGTGGATGCTTTTGTGGACGATGGGGATGTCTACCGGCGTTTCAAATTTGCAGGAGGAGGAAAGTGGTTGCCGTCAAGGAAGAGGTGATTGTCGATGTTCCAAAGGTTGAAGAACTGACTAATGATAATGTAAATATGAAAGAGGGCGATTCAGTGGGAAGCTTACAAGGGAGGAAAAGTGTAGACGACAATCCACAGACATTTGAGAAGGAGGCGGTGGCTTCCGACTATTCTTCAAACGTCGGTGTTCCAAATTTGCAGAAAGAAGGGAGTGGTTGTTGTAGTTGTTTTAAATGTTTGCCGACATTCCACATTTGCGGGCGAAGGAGGAATGTGGATTCCGACGTTCCAAATCCAAGCAGGGAAGAGAAAGTAGTTGTCAGTGGTGTTTCAGATCTGCTGGAGGAGAAGGTGGTTGCTACCGATAAGGAAAGCCATTCCAAACCTGCTCAAGGGGGAATTTGTTGGCCATTCCAGATCTGTGCAAGAGGATGGTTGCCTAGGTTTTTTCTGTGTGGGGAAAAGACAGTCGTTGAGGCCTCAAATCATCGAGAGGAAGGAGAAAAGATGCCTCTCGACGTTCCTAAGGAGGAGGTTGTTGCCGTTATTCCAGATCCTCAGAAAAAGAACATTGCGGTGGCTGACGGCATTCCAGATCACAGCAAGGAGAAACAGGTGGTTGCTGACGACATTCCAGTTCTTAGTACAGAGGAGAAGATGTTTGTCGGTAAGGAAAAGAATGTGTCTTCCGACAGCATTCAGGAAGTTTGGAAGGAGGAGGAAATTGTGGATTCTGGTGTAGATTTGGGCGAGAAGGAGGAAGGGGGTTGTTGTCGGTGTTTCAAATTTGGTGGGGGAAAAGAGAGCGATCGGCGTCAACACCGCCGAAGTTCTAGATCTAGGGAAGGAGGTTGGGGATTCCAAATCTGGGGGAGGGGGTGGCTACCAACACTAGGCATTTGCGGAGGGAGAAAAATGGTATCTGCTAGCATTCAAAAATTGCATGAAGAAGGGTTGGTCAACAGTGGCGTTTCAGAGGTTCATAATGAGGTGGTGGATGCTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATCAGTCTAAATCTACGCGAGGATGTGGTTGTTGGCAATCGAAACCACGACGAAGGAGGGCGGTGGCTGTAGATAAGGATGGGGGAAGCGGCCGGCGTTCCAAATCTAAGCGTAGAAGAGGGTGGCTACGACGATGGGGAAGGAAACAAAGAGAGGAAAAGGAAAGAAAGAGATAG
mRNA sequence
ATGGCCACTAATAAACAAAAGGAAGAAGCGAGGTCAAGGAAGCCCCGCCAACGGAGGAACCTCCAGATGGAAGAACTACCCACTTTCACCAAATGGTTAACCTCATTCGGCCATTCACCTTCTACCAACGATTCCAATCCCAAATCTAATCTTCTTACTCCCAATCGCCCCTTTTTGGAGACACCTCTCGCCGTTCCAGATTCGTCGCAGGATGCTGGCCCAGCTGTCGCCGATGTTCCTCCCAAGGACGACAGCGTGCAGGCTGTGGATGATTCTATATCTGCGCGAGCTGGTTGTTGTTGTTGGCAAACTTCCAAATCTACGCGCCGAGAGTGTGCTCTTAAATTTCATCTCTCTCTAAGGAAGAGAAAGGTGGTTACCAATGAAGCAGGGGTTGTTACCGCCGTTTCGAATCTGCAGGAGGAGGAGACAATGGTGGCTGTTGGCGATGTTCTTGATAAGGAGGGGTGTGGGTGCAGGTGTTGGCCAACGTTCCAGATCTGCGGGAGAACAAAAGGCACGGTCGTCGGCACTTCGGATCTTCAGAAAAAGGAGGGAGCGGTTACCGACGGCCTGAATCTGCAAAAGGAGGAAGTGGCTACTAATGGTACAGATTTGGTGAAGAAAGAGGAAAAGGTGGTTACCAATGCTTCTTCAAAGGCAAGGATTGTTACCGCCGCTTCGAATCTGCAAGAGGAGGAGACGGTGGTGGCTGTCGGCGATGTTCTTGATAAGAACGGGTGTACGTGTAGGTGTTGGCCAACATTCCAGATCTGCGGGAGAAGAAAAGGCATAGTCGTCGGCACTTCAGATCTTCAGAAAGAGGAGGGAGCGGTTACCGACAGCCTAAATCTGGGGAAGGAGGAAGTAGCTACTAATGGTCCAAATTTGGTGAAGGAAGAGGAAGTTGTCGTCGTCGTCGATCCAGACCTGCGTAAGGAAGAGGGGAGTGGATGCTTTTGTGGACGATGGGGATGTCTACCGGCGTTTCAAATTTGCAGGAGGAGGAAAGTGGTTGCCGTCAAGGAAGAGGTGATTGTCGATGTTCCAAAGGTTGAAGAACTGACTAATGATAATGTAAATATGAAAGAGGGCGATTCAGTGGGAAGCTTACAAGGGAGGAAAAGTGTAGACGACAATCCACAGACATTTGAGAAGGAGGCGGTGGCTTCCGACTATTCTTCAAACGTCGGTGTTCCAAATTTGCAGAAAGAAGGGAGTGGTTGTTGTAGTTGTTTTAAATGTTTGCCGACATTCCACATTTGCGGGCGAAGGAGGAATGTGGATTCCGACGTTCCAAATCCAAGCAGGGAAGAGAAAGTAGTTGTCAGTGGTGTTTCAGATCTGCTGGAGGAGAAGGTGGTTGCTACCGATAAGGAAAGCCATTCCAAACCTGCTCAAGGGGGAATTTGTTGGCCATTCCAGATCTGTGCAAGAGGATGGTTGCCTAGGTTTTTTCTGTGTGGGGAAAAGACAGTCGTTGAGGCCTCAAATCATCGAGAGGAAGGAGAAAAGATGCCTCTCGACGTTCCTAAGGAGGAGGTTGTTGCCGTTATTCCAGATCCTCAGAAAAAGAACATTGCGGTGGCTGACGGCATTCCAGATCACAGCAAGGAGAAACAGGTGGTTGCTGACGACATTCCAGTTCTTAGTACAGAGGAGAAGATGTTTGTCGGTAAGGAAAAGAATGTGTCTTCCGACAGCATTCAGGAAGTTTGGAAGGAGGAGGAAATTGTGGATTCTGGTGTAGATTTGGGCGAGAAGGAGGAAGGGGGTTGTTGTCGGTGTTTCAAATTTGGTGGGGGAAAAGAGAGCGATCGGCGTCAACACCGCCGAAGTTCTAGATCTAGGGAAGGAGGTTGGGGATTCCAAATCTGGGGGAGGGGGTGGCTACCAACACTAGGCATTTGCGGAGGGAGAAAAATGGTATCTGCTAGCATTCAAAAATTGCATGAAGAAGGGTTGGTCAACAGTGGCGTTTCAGAGGTTCATAATGAGGTGGTGGATGCTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATCAGTCTAAATCTACGCGAGGATGTGGTTGTTGGCAATCGAAACCACGACGAAGGAGGGCGGTGGCTGTAGATAAGGATGGGGGAAGCGGCCGGCGTTCCAAATCTAAGCGTAGAAGAGGGTGGCTACGACGATGGGGAAGGAAACAAAGAGAGGAAAAGGAAAGAAAGAGATAG
Coding sequence (CDS)
ATGGCCACTAATAAACAAAAGGAAGAAGCGAGGTCAAGGAAGCCCCGCCAACGGAGGAACCTCCAGATGGAAGAACTACCCACTTTCACCAAATGGTTAACCTCATTCGGCCATTCACCTTCTACCAACGATTCCAATCCCAAATCTAATCTTCTTACTCCCAATCGCCCCTTTTTGGAGACACCTCTCGCCGTTCCAGATTCGTCGCAGGATGCTGGCCCAGCTGTCGCCGATGTTCCTCCCAAGGACGACAGCGTGCAGGCTGTGGATGATTCTATATCTGCGCGAGCTGGTTGTTGTTGTTGGCAAACTTCCAAATCTACGCGCCGAGAGTGTGCTCTTAAATTTCATCTCTCTCTAAGGAAGAGAAAGGTGGTTACCAATGAAGCAGGGGTTGTTACCGCCGTTTCGAATCTGCAGGAGGAGGAGACAATGGTGGCTGTTGGCGATGTTCTTGATAAGGAGGGGTGTGGGTGCAGGTGTTGGCCAACGTTCCAGATCTGCGGGAGAACAAAAGGCACGGTCGTCGGCACTTCGGATCTTCAGAAAAAGGAGGGAGCGGTTACCGACGGCCTGAATCTGCAAAAGGAGGAAGTGGCTACTAATGGTACAGATTTGGTGAAGAAAGAGGAAAAGGTGGTTACCAATGCTTCTTCAAAGGCAAGGATTGTTACCGCCGCTTCGAATCTGCAAGAGGAGGAGACGGTGGTGGCTGTCGGCGATGTTCTTGATAAGAACGGGTGTACGTGTAGGTGTTGGCCAACATTCCAGATCTGCGGGAGAAGAAAAGGCATAGTCGTCGGCACTTCAGATCTTCAGAAAGAGGAGGGAGCGGTTACCGACAGCCTAAATCTGGGGAAGGAGGAAGTAGCTACTAATGGTCCAAATTTGGTGAAGGAAGAGGAAGTTGTCGTCGTCGTCGATCCAGACCTGCGTAAGGAAGAGGGGAGTGGATGCTTTTGTGGACGATGGGGATGTCTACCGGCGTTTCAAATTTGCAGGAGGAGGAAAGTGGTTGCCGTCAAGGAAGAGGTGATTGTCGATGTTCCAAAGGTTGAAGAACTGACTAATGATAATGTAAATATGAAAGAGGGCGATTCAGTGGGAAGCTTACAAGGGAGGAAAAGTGTAGACGACAATCCACAGACATTTGAGAAGGAGGCGGTGGCTTCCGACTATTCTTCAAACGTCGGTGTTCCAAATTTGCAGAAAGAAGGGAGTGGTTGTTGTAGTTGTTTTAAATGTTTGCCGACATTCCACATTTGCGGGCGAAGGAGGAATGTGGATTCCGACGTTCCAAATCCAAGCAGGGAAGAGAAAGTAGTTGTCAGTGGTGTTTCAGATCTGCTGGAGGAGAAGGTGGTTGCTACCGATAAGGAAAGCCATTCCAAACCTGCTCAAGGGGGAATTTGTTGGCCATTCCAGATCTGTGCAAGAGGATGGTTGCCTAGGTTTTTTCTGTGTGGGGAAAAGACAGTCGTTGAGGCCTCAAATCATCGAGAGGAAGGAGAAAAGATGCCTCTCGACGTTCCTAAGGAGGAGGTTGTTGCCGTTATTCCAGATCCTCAGAAAAAGAACATTGCGGTGGCTGACGGCATTCCAGATCACAGCAAGGAGAAACAGGTGGTTGCTGACGACATTCCAGTTCTTAGTACAGAGGAGAAGATGTTTGTCGGTAAGGAAAAGAATGTGTCTTCCGACAGCATTCAGGAAGTTTGGAAGGAGGAGGAAATTGTGGATTCTGGTGTAGATTTGGGCGAGAAGGAGGAAGGGGGTTGTTGTCGGTGTTTCAAATTTGGTGGGGGAAAAGAGAGCGATCGGCGTCAACACCGCCGAAGTTCTAGATCTAGGGAAGGAGGTTGGGGATTCCAAATCTGGGGGAGGGGGTGGCTACCAACACTAGGCATTTGCGGAGGGAGAAAAATGGTATCTGCTAGCATTCAAAAATTGCATGAAGAAGGGTTGGTCAACAGTGGCGTTTCAGAGGTTCATAATGAGGTGGTGGATGCTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATCAGTCTAAATCTACGCGAGGATGTGGTTGTTGGCAATCGAAACCACGACGAAGGAGGGCGGTGGCTGTAGATAAGGATGGGGGAAGCGGCCGGCGTTCCAAATCTAAGCGTAGAAGAGGGTGGCTACGACGATGGGGAAGGAAACAAAGAGAGGAAAAGGAAAGAAAGAGATAG
Protein sequence
MATNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKSNLLTPNRPFLETPLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLRKRKVVTNEAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTSDLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVVTNASSKARIVTAASNLQEEETVVAVGDVLDKNGCTCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNLGKEEVATNGPNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIVDVPKVEELTNDNVNMKEGDSVGSLQGRKSVDDNPQTFEKEAVASDYSSNVGVPNLQKEGSGCCSCFKCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVATDKESHSKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDPQKKNIAVADGIPDHSKEKQVVADDIPVLSTEEKMFVGKEKNVSSDSIQEVWKEEEIVDSGVDLGEKEEGGCCRCFKFGGGKESDRRQHRRSSRSREGGWGFQIWGRGWLPTLGICGGRKMVSASIQKLHEEGLVNSGVSEVHNEVVDAAGVTGVVAATDQSKSTRGCGCWQSKPRRRRAVAVDKDGGSGRRSKSKRRRGWLRRWGRKQREEKERKR
Homology
BLAST of ClCG05G010080 vs. NCBI nr
Match:
XP_038880648.1 (uncharacterized protein LOC120072275 [Benincasa hispida])
HSP 1 Score: 860.1 bits (2221), Expect = 1.4e-245
Identity = 487/751 (64.85%), Postives = 532/751 (70.84%), Query Frame = 0
Query: 1 MATNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKSNLLTPNRPFLE 60
M T+KQ EE +SRKPRQRRNLQMEELPTFTKWL S GHS S ND+N KS + NRP +
Sbjct: 6 MTTDKQMEEPKSRKPRQRRNLQMEELPTFTKWLNSVGHSGSCNDANSKSKPVNGNRPVVR 65
Query: 61 TPLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSL 120
TP+ V DSS+D G VADVP D VQAVDDSISARAGCCCWQ+SKSTRRECALKFHLSL
Sbjct: 66 TPVVVFDSSEDGGAVVADVP--KDRVQAVDDSISARAGCCCWQSSKSTRRECALKFHLSL 125
Query: 121 RKRKVVTNEAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTSD 180
RKRKVVTN
Sbjct: 126 RKRKVVTN---------------------------------------------------- 185
Query: 181 LQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVVTNASSKARIVTAASNLQEEETVVAVG 240
S+ +VT SNL+EEETVVAV
Sbjct: 186 -------------------------------------GSEVEVVTVVSNLREEETVVAVS 245
Query: 241 DVLDKNGCTCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNLGKEEVATNGPNLVKE 300
DVL+K+GC CRC+PTFQICGRRK IVV LQKE+GAVTD NL EEVA GP+LVKE
Sbjct: 246 DVLEKDGCGCRCFPTFQICGRRKSIVV----LQKEDGAVTDDPNLRTEEVANRGPDLVKE 305
Query: 301 EEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIVDVPKV-EELTNDN 360
EEVVVV DLRKEEGSGC CGR GC PAFQICRRR VVA KEE +VDVP+V EE+ ND
Sbjct: 306 EEVVVVGGSDLRKEEGSGCCCGRCGCFPAFQICRRRNVVASKEEAVVDVPEVEEEVANDK 365
Query: 361 VNMKEGDSVGSLQ---------GRKSVDDNPQTFEKEAVASDYSSNVGVPNLQKEGS-GC 420
VN +EGDSV LQ GRK+VDDNP+TFEKEA+AS+ SSNV VP+LQKEGS GC
Sbjct: 366 VNKQEGDSVECLQAFQSHICCAGRKTVDDNPKTFEKEALASEDSSNVDVPDLQKEGSGGC 425
Query: 421 CSCFKCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLE-EKVVATDKESHSKPAQG 480
CSCFKC+PT HICGRRRN S+VPNP REEKVVVS VSD E E+VVA D+E HSK QG
Sbjct: 426 CSCFKCMPTLHICGRRRNAVSEVPNPGREEKVVVS-VSDPPEGEEVVAADEEWHSKSTQG 485
Query: 481 GICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDPQKKNIA 540
GICWPFQIC RGWLPRFFLCGE+ V+ASNHREE EK P DV KEEVVAVIPDPQK++IA
Sbjct: 486 GICWPFQICTRGWLPRFFLCGERITVDASNHREEEEKAPPDVQKEEVVAVIPDPQKESIA 545
Query: 541 VADGIPDHSKEKQVVADDIPVLSTEEKMFVGKEKNVSSDSIQEVWKEEEIVDSGVDLGEK 600
VADGIPD KEKQV ADDIPV STEEKM GKE+N SS SIQE ++ DL ++
Sbjct: 546 VADGIPDDGKEKQVAADDIPVQSTEEKMSAGKEENASSGSIQETFEP--------DLNQE 605
Query: 601 EEGGCCRCFKFGGGKESDRRQHRRSSRSREGGWGFQIWGRGWLPTLGICGGRKMVSASIQ 660
+EGGCCRCFK GGKE RRQHRRS +SREGGWGFQI GWLPTL +C GRK S SI
Sbjct: 606 DEGGCCRCFKL-GGKEGSRRQHRRSPKSREGGWGFQIC--GWLPTLDMCRGRKKESVSIA 649
Query: 661 KLHE-EGLVNSGVSEVHNEVVDAAGVTGVVAATDQSKSTRGCGCWQSKPRRRRAVAVDKD 720
KLHE EGLV++GVSEVHNEVVDAAGVTGVVA TD S STR CGCW SKPRRRRAV V KD
Sbjct: 666 KLHEKEGLVDNGVSEVHNEVVDAAGVTGVVADTDHSNSTRCCGCWHSKPRRRRAVDVVKD 649
Query: 721 GGSGRRSKSKRRRGWLRRWGRKQREEKERKR 739
GGSGR SKSKRR+GWLRRWGRKQRE KERK+
Sbjct: 726 GGSGRCSKSKRRKGWLRRWGRKQREGKERKK 649
BLAST of ClCG05G010080 vs. NCBI nr
Match:
KGN57524.1 (hypothetical protein Csa_011487 [Cucumis sativus])
HSP 1 Score: 661.8 bits (1706), Expect = 7.1e-186
Identity = 418/762 (54.86%), Postives = 470/762 (61.68%), Query Frame = 0
Query: 1 MATNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKSNLLTP-NRPFL 60
MATNK KEE RSRKPR RRN QMEE PTFTKWLT+FGHS S ND+ KS L P NRP
Sbjct: 1 MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQ 60
Query: 61 ETPLAVPDSSQDAGPAVADVPPKDD-SVQAVDDSISAR--AGCCCWQTSKSTRRECALKF 120
P+ +P SS+DA P V +VP ++ AVD SISAR AGCCCWQ+SKSTRRECALKF
Sbjct: 61 RPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSKSTRRECALKF 120
Query: 121 HLSLRKRKVVTN--EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGT 180
H+SLRKRKVV N EA VV AV N EE T V DV K+GCGCRC TF+I R K
Sbjct: 121 HISLRKRKVVANSTEAEVVAAVLNPPEEAT---VEDV--KDGCGCRCLRTFKIFRRRKSR 180
Query: 181 VVGTSDLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVVTNASSKARIVTAASNLQEEE 240
VVG SDLQK+EGAVTDG+NL+ EEVA++G+D++K+EE
Sbjct: 181 VVGVSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEE----------------------- 240
Query: 241 TVVAVGDVLDKNGCTCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNLGKEEVATNG 300
Sbjct: 241 ------------------------------------------------------------ 300
Query: 301 PNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIVDVPKVEE 360
VV+ PD RKEE SGC CGRW C P FQICRRRKVVA KEEV+ PKVEE
Sbjct: 301 ----------VVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEE 360
Query: 361 LTNDNVNMKEGDSVGSLQ------GRKSVDDNPQTFEKEAVASDYSSNVGVPNLQKEGSG 420
+ NDNV +E DSVG LQ GRK VDDNP+T EKE + S+ SSN+ V NLQKE SG
Sbjct: 361 VGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEESG 420
Query: 421 CCSCFKCLPTFHIC-GRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVATDKESHSKPAQ 480
CCSCF+C+PTF IC GRR N DS VP P REEKV+V VSD E V +E HS+P Q
Sbjct: 421 CCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVD-VSDPPEMGSVVDGRERHSRPVQ 480
Query: 481 GGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDPQKKNI 540
GG CW GW PRF LCGE T V+A NHREE EK P D KEE V V
Sbjct: 481 GGTCW------SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVA-------T 540
Query: 541 AVADGIPDHSKEKQVVADDIPVLSTEEKMFVG--------KEKNVSSDSIQEVWKEEEIV 600
AV D I DH KEK V A DIPV++ EE++FVG KEKNVSS +IQ+V ++EEIV
Sbjct: 541 AVGDEISDHDKEKPVAAIDIPVVN-EEEVFVGAGDTLDLHKEKNVSSCNIQDV-RKEEIV 600
Query: 601 DSGVDLGEKEEGGCCRCFKFGGGKESDRRQHRRSSRSREGGWGFQIWGRGWLPTLGICGG 660
DS EK EGG C C+ GKES RQ RSSRS EG W FQI GRG LPTL IC G
Sbjct: 601 DS----DEKVEGGGCGCW----GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRG 640
Query: 661 RKMVSASIQKLHEEGLVNSGVSEVHNEVVDAAGVTGVVAATDQSKSTRGCGCWQSKPRRR 720
RK VS I KL EEGLV++ VS+VH EVVDA GVT VVA +D SK RGCGCW K RRR
Sbjct: 661 RKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRR 640
Query: 721 RAVAVDKDGGSGRRSKSKRRR---GWLRRWGRKQREEKERKR 739
RAVAVDK+GGSGRRSK K R+ GWLRR RK+RE KE+ R
Sbjct: 721 RAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR 640
BLAST of ClCG05G010080 vs. NCBI nr
Match:
XP_022922695.1 (uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata])
HSP 1 Score: 422.9 bits (1086), Expect = 5.5e-114
Identity = 358/909 (39.38%), Postives = 443/909 (48.73%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 T----------------------------------------------NASSKARIVTAAS 302
N +
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 NLQEEETVVAVGDVLDKNG-C--TCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNL 362
++E E VA G +++G C +C P+F CGR+ +V EE D N+
Sbjct: 304 LVKEGEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVV--------EEEVTVDDPNV 363
Query: 363 GKEEVATNGPNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEV 422
EVA N P+L KE E V DP ++EGSGC R + FQI R KV A K+E+
Sbjct: 364 --PEVA-NRPDLAKEGEATVAPDP---RKEGSGCCSPRLKDVSEFQI-RNSKVAAGKQEM 423
Query: 423 IVDVP---KVEELTNDNVNMK-----EGDSVGSLQG------------------------ 482
VDVP +VEE+ ND VN + E V L G
Sbjct: 424 TVDVPNVLEVEEVANDVVNKQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPE 483
Query: 483 ------------RKSVDDNPQTFEKEAVASDYSS--------NVGVPNLQKEGSGCCSCF 542
+K VD +P T E A S S+ N VP L K GSG C F
Sbjct: 484 VEEVANDVVKKQKKEVDGDPNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSG-CGWF 543
Query: 543 KCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA-----------TDKESH 602
K +P+F ICG + V SDVPNPSREEK VV+G SD+ E A D E+
Sbjct: 544 KWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAAAVAVAAAADSDADWEAQ 603
Query: 603 SKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDP 662
SK A+ G CWPFQ C W P LC V ASNH+EE ++ PL KE+VV V+
Sbjct: 604 SKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQEEPLSDNKEKVVDVLTVK 663
Query: 663 QKKNIAVADGIPDHSKE------KQVVADDIPVLSTEEKMFVG-------KEKNVSSDSI 722
+ A G+PD +E KQV DIPV EEK G KEK SS
Sbjct: 664 HNEK-GAAMGVPDLHEEDGTVADKQVAVVDIPVPRKEEKESAGDTLDLIQKEKKASSGK- 723
Query: 723 QEVWKEEEIVDSGVDLGEKEEGGCCRCFKFGGGKE-------SDRRQHRRSSRSREGGWG 739
+EEE V S V KEEGGCC FK GG ++ R R+SS SREG W
Sbjct: 724 ----EEEENVGSVVSDSGKEEGGCCCSFKHGGKEDGCSRRGSGSGRHRRQSSNSREGCWP 783
BLAST of ClCG05G010080 vs. NCBI nr
Match:
XP_022922694.1 (uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata])
HSP 1 Score: 404.8 bits (1039), Expect = 1.6e-108
Identity = 370/986 (37.53%), Postives = 451/986 (45.74%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 TNASSK------------------------------------------------------ 302
K
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 ------ARIVTA------------------------ASNLQEEETVVAVGDVLD------ 362
A I A + +EE TVV V ++LD
Sbjct: 304 SVKEGEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANG 363
Query: 363 -----------------KNGCTC---RCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLN 422
++G C +C P F +CGR+ V TS K E V D N
Sbjct: 364 PELVQEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRK----VATS---KVELTVVDIPN 423
Query: 423 LGKEEVATNGPNLVKE-------------------------------------EEVVVVV 482
L + E NGP LVKE EE V V
Sbjct: 424 LLEVEEVANGPELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVD 483
Query: 483 D---------PDLRKE-----------EGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIV 542
D PDL KE EGSGC R + FQI R KV A K+E+ V
Sbjct: 484 DPNVPEVANRPDLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQI-RNSKVAAGKQEMTV 543
Query: 543 DVP---KVEELTNDNVNMKEGDSVGSLQGRKSVDDNPQTFEKEAVASDYSS--------N 602
DVP +VEE+ ND VN + +K VD +P T E A S S+ N
Sbjct: 544 DVPNVLEVEEVANDVVNKQ----------KKEVDGDPNTGEAVASCSSSSNLHEEEEKIN 603
Query: 603 VGVPNLQKEGSGCCSCFKCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA 662
VP L K GSG C FK +P+F ICG + V SDVPNPSREEK VV+G SD+ E A
Sbjct: 604 ASVPELHKNGSG-CGWFKWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAA 663
Query: 663 -----------TDKESHSKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEK 722
D E+ SK A+ G CWPFQ C W P LC V ASNH+EE ++
Sbjct: 664 AVAVAAAADSDADWEAQSKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQE 723
Query: 723 MPLDVPKEEVVAVIPDPQKKNIAVADGIPDHSKE------KQVVADDIPVLSTEEKMFVG 739
PL KE+VV V+ + A G+PD +E KQV DIPV EEK G
Sbjct: 724 EPLSDNKEKVVDVLTVKHNEK-GAAMGVPDLHEEDGTVADKQVAVVDIPVPRKEEKESAG 783
BLAST of ClCG05G010080 vs. NCBI nr
Match:
XP_022922693.1 (uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata])
HSP 1 Score: 393.7 bits (1010), Expect = 3.6e-105
Identity = 369/1029 (35.86%), Postives = 452/1029 (43.93%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 TNASSK------------------------------------------------------ 302
K
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 ------ARIVTA------------------------ASNLQEEETVVAVGDVLD------ 362
A I A + +EE TVV V ++LD
Sbjct: 304 SVKEGEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANG 363
Query: 363 -----------------KNGCTC---RCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLN 422
++G C +C P F +CGR+ V TS KEE D N
Sbjct: 364 PELVQEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRK----VATS---KEELTDVDVPN 423
Query: 423 LGKEEVATNGPNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRR-------- 482
L + E NGP LVKE E V P KEE CF +W CLP+F C R+
Sbjct: 424 LLEVEEVANGPELVKEGEATVAPGP--CKEESGYCF-PQWKCLPSFPTCGRKLSVVEEEV 483
Query: 483 ---------------------------------------------------KVVAVKEEV 542
KV A K+E+
Sbjct: 484 TVDDPNVPEVANRPDLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEM 543
Query: 543 IVDVP---KVEELTNDNVNMK-----EGDSVGSLQG------------------------ 602
VDVP +VEE+ ND VN + E V L G
Sbjct: 544 TVDVPNVLEVEEVANDVVNKQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPE 603
Query: 603 ------------RKSVDDNPQTFEKEAVASDYSS--------NVGVPNLQKEGSGCCSCF 662
+K VD +P T E A S S+ N VP L K GSG C F
Sbjct: 604 VEEVANDVVKKQKKEVDGDPNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSG-CGWF 663
Query: 663 KCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA-----------TDKESH 722
K +P+F ICG + V SDVPNPSREEK VV+G SD+ E A D E+
Sbjct: 664 KWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAAAVAVAAAADSDADWEAQ 723
Query: 723 SKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDP 739
SK A+ G CWPFQ C W P LC V ASNH+EE ++ PL KE+VV V+
Sbjct: 724 SKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQEEPLSDNKEKVVDVLTVK 783
BLAST of ClCG05G010080 vs. ExPASy TrEMBL
Match:
A0A0A0L996 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G204780 PE=4 SV=1)
HSP 1 Score: 661.8 bits (1706), Expect = 3.4e-186
Identity = 418/762 (54.86%), Postives = 470/762 (61.68%), Query Frame = 0
Query: 1 MATNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKSNLLTP-NRPFL 60
MATNK KEE RSRKPR RRN QMEE PTFTKWLT+FGHS S ND+ KS L P NRP
Sbjct: 1 MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQ 60
Query: 61 ETPLAVPDSSQDAGPAVADVPPKDD-SVQAVDDSISAR--AGCCCWQTSKSTRRECALKF 120
P+ +P SS+DA P V +VP ++ AVD SISAR AGCCCWQ+SKSTRRECALKF
Sbjct: 61 RPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSKSTRRECALKF 120
Query: 121 HLSLRKRKVVTN--EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGT 180
H+SLRKRKVV N EA VV AV N EE T V DV K+GCGCRC TF+I R K
Sbjct: 121 HISLRKRKVVANSTEAEVVAAVLNPPEEAT---VEDV--KDGCGCRCLRTFKIFRRRKSR 180
Query: 181 VVGTSDLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVVTNASSKARIVTAASNLQEEE 240
VVG SDLQK+EGAVTDG+NL+ EEVA++G+D++K+EE
Sbjct: 181 VVGVSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEE----------------------- 240
Query: 241 TVVAVGDVLDKNGCTCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNLGKEEVATNG 300
Sbjct: 241 ------------------------------------------------------------ 300
Query: 301 PNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIVDVPKVEE 360
VV+ PD RKEE SGC CGRW C P FQICRRRKVVA KEEV+ PKVEE
Sbjct: 301 ----------VVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEE 360
Query: 361 LTNDNVNMKEGDSVGSLQ------GRKSVDDNPQTFEKEAVASDYSSNVGVPNLQKEGSG 420
+ NDNV +E DSVG LQ GRK VDDNP+T EKE + S+ SSN+ V NLQKE SG
Sbjct: 361 VGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEESG 420
Query: 421 CCSCFKCLPTFHIC-GRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVATDKESHSKPAQ 480
CCSCF+C+PTF IC GRR N DS VP P REEKV+V VSD E V +E HS+P Q
Sbjct: 421 CCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVD-VSDPPEMGSVVDGRERHSRPVQ 480
Query: 481 GGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDPQKKNI 540
GG CW GW PRF LCGE T V+A NHREE EK P D KEE V V
Sbjct: 481 GGTCW------SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVA-------T 540
Query: 541 AVADGIPDHSKEKQVVADDIPVLSTEEKMFVG--------KEKNVSSDSIQEVWKEEEIV 600
AV D I DH KEK V A DIPV++ EE++FVG KEKNVSS +IQ+V ++EEIV
Sbjct: 541 AVGDEISDHDKEKPVAAIDIPVVN-EEEVFVGAGDTLDLHKEKNVSSCNIQDV-RKEEIV 600
Query: 601 DSGVDLGEKEEGGCCRCFKFGGGKESDRRQHRRSSRSREGGWGFQIWGRGWLPTLGICGG 660
DS EK EGG C C+ GKES RQ RSSRS EG W FQI GRG LPTL IC G
Sbjct: 601 DS----DEKVEGGGCGCW----GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRG 640
Query: 661 RKMVSASIQKLHEEGLVNSGVSEVHNEVVDAAGVTGVVAATDQSKSTRGCGCWQSKPRRR 720
RK VS I KL EEGLV++ VS+VH EVVDA GVT VVA +D SK RGCGCW K RRR
Sbjct: 661 RKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRR 640
Query: 721 RAVAVDKDGGSGRRSKSKRRR---GWLRRWGRKQREEKERKR 739
RAVAVDK+GGSGRRSK K R+ GWLRR RK+RE KE+ R
Sbjct: 721 RAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR 640
BLAST of ClCG05G010080 vs. ExPASy TrEMBL
Match:
A0A6J1E9I0 (uncharacterized protein LOC111430613 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 422.9 bits (1086), Expect = 2.7e-114
Identity = 358/909 (39.38%), Postives = 443/909 (48.73%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 T----------------------------------------------NASSKARIVTAAS 302
N +
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 NLQEEETVVAVGDVLDKNG-C--TCRCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLNL 362
++E E VA G +++G C +C P+F CGR+ +V EE D N+
Sbjct: 304 LVKEGEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVV--------EEEVTVDDPNV 363
Query: 363 GKEEVATNGPNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRRKVVAVKEEV 422
EVA N P+L KE E V DP ++EGSGC R + FQI R KV A K+E+
Sbjct: 364 --PEVA-NRPDLAKEGEATVAPDP---RKEGSGCCSPRLKDVSEFQI-RNSKVAAGKQEM 423
Query: 423 IVDVP---KVEELTNDNVNMK-----EGDSVGSLQG------------------------ 482
VDVP +VEE+ ND VN + E V L G
Sbjct: 424 TVDVPNVLEVEEVANDVVNKQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPE 483
Query: 483 ------------RKSVDDNPQTFEKEAVASDYSS--------NVGVPNLQKEGSGCCSCF 542
+K VD +P T E A S S+ N VP L K GSG C F
Sbjct: 484 VEEVANDVVKKQKKEVDGDPNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSG-CGWF 543
Query: 543 KCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA-----------TDKESH 602
K +P+F ICG + V SDVPNPSREEK VV+G SD+ E A D E+
Sbjct: 544 KWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAAAVAVAAAADSDADWEAQ 603
Query: 603 SKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDP 662
SK A+ G CWPFQ C W P LC V ASNH+EE ++ PL KE+VV V+
Sbjct: 604 SKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQEEPLSDNKEKVVDVLTVK 663
Query: 663 QKKNIAVADGIPDHSKE------KQVVADDIPVLSTEEKMFVG-------KEKNVSSDSI 722
+ A G+PD +E KQV DIPV EEK G KEK SS
Sbjct: 664 HNEK-GAAMGVPDLHEEDGTVADKQVAVVDIPVPRKEEKESAGDTLDLIQKEKKASSGK- 723
Query: 723 QEVWKEEEIVDSGVDLGEKEEGGCCRCFKFGGGKE-------SDRRQHRRSSRSREGGWG 739
+EEE V S V KEEGGCC FK GG ++ R R+SS SREG W
Sbjct: 724 ----EEEENVGSVVSDSGKEEGGCCCSFKHGGKEDGCSRRGSGSGRHRRQSSNSREGCWP 783
BLAST of ClCG05G010080 vs. ExPASy TrEMBL
Match:
A0A6J1E452 (uncharacterized protein LOC111430613 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 404.8 bits (1039), Expect = 7.6e-109
Identity = 370/986 (37.53%), Postives = 451/986 (45.74%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 TNASSK------------------------------------------------------ 302
K
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 ------ARIVTA------------------------ASNLQEEETVVAVGDVLD------ 362
A I A + +EE TVV V ++LD
Sbjct: 304 SVKEGEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANG 363
Query: 363 -----------------KNGCTC---RCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLN 422
++G C +C P F +CGR+ V TS K E V D N
Sbjct: 364 PELVQEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRK----VATS---KVELTVVDIPN 423
Query: 423 LGKEEVATNGPNLVKE-------------------------------------EEVVVVV 482
L + E NGP LVKE EE V V
Sbjct: 424 LLEVEEVANGPELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVD 483
Query: 483 D---------PDLRKE-----------EGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIV 542
D PDL KE EGSGC R + FQI R KV A K+E+ V
Sbjct: 484 DPNVPEVANRPDLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQI-RNSKVAAGKQEMTV 543
Query: 543 DVP---KVEELTNDNVNMKEGDSVGSLQGRKSVDDNPQTFEKEAVASDYSS--------N 602
DVP +VEE+ ND VN + +K VD +P T E A S S+ N
Sbjct: 544 DVPNVLEVEEVANDVVNKQ----------KKEVDGDPNTGEAVASCSSSSNLHEEEEKIN 603
Query: 603 VGVPNLQKEGSGCCSCFKCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA 662
VP L K GSG C FK +P+F ICG + V SDVPNPSREEK VV+G SD+ E A
Sbjct: 604 ASVPELHKNGSG-CGWFKWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAA 663
Query: 663 -----------TDKESHSKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEK 722
D E+ SK A+ G CWPFQ C W P LC V ASNH+EE ++
Sbjct: 664 AVAVAAAADSDADWEAQSKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQE 723
Query: 723 MPLDVPKEEVVAVIPDPQKKNIAVADGIPDHSKE------KQVVADDIPVLSTEEKMFVG 739
PL KE+VV V+ + A G+PD +E KQV DIPV EEK G
Sbjct: 724 EPLSDNKEKVVDVLTVKHNEK-GAAMGVPDLHEEDGTVADKQVAVVDIPVPRKEEKESAG 783
BLAST of ClCG05G010080 vs. ExPASy TrEMBL
Match:
A0A6J1E7K5 (uncharacterized protein LOC111430613 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 393.7 bits (1010), Expect = 1.7e-105
Identity = 369/1029 (35.86%), Postives = 452/1029 (43.93%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 TNASSK------------------------------------------------------ 302
K
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 ------ARIVTA------------------------ASNLQEEETVVAVGDVLD------ 362
A I A + +EE TVV V ++LD
Sbjct: 304 SVKEGEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANG 363
Query: 363 -----------------KNGCTC---RCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLN 422
++G C +C P F +CGR+ V TS KEE D N
Sbjct: 364 PELVQEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRK----VATS---KEELTDVDVPN 423
Query: 423 LGKEEVATNGPNLVKEEEVVVVVDPDLRKEEGSGCFCGRWGCLPAFQICRRR-------- 482
L + E NGP LVKE E V P KEE CF +W CLP+F C R+
Sbjct: 424 LLEVEEVANGPELVKEGEATVAPGP--CKEESGYCF-PQWKCLPSFPTCGRKLSVVEEEV 483
Query: 483 ---------------------------------------------------KVVAVKEEV 542
KV A K+E+
Sbjct: 484 TVDDPNVPEVANRPDLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEM 543
Query: 543 IVDVP---KVEELTNDNVNMK-----EGDSVGSLQG------------------------ 602
VDVP +VEE+ ND VN + E V L G
Sbjct: 544 TVDVPNVLEVEEVANDVVNKQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPE 603
Query: 603 ------------RKSVDDNPQTFEKEAVASDYSS--------NVGVPNLQKEGSGCCSCF 662
+K VD +P T E A S S+ N VP L K GSG C F
Sbjct: 604 VEEVANDVVKKQKKEVDGDPNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSG-CGWF 663
Query: 663 KCLPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA-----------TDKESH 722
K +P+F ICG + V SDVPNPSREEK VV+G SD+ E A D E+
Sbjct: 664 KWMPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAAAVAVAAAADSDADWEAQ 723
Query: 723 SKPAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDP 739
SK A+ G CWPFQ C W P LC V ASNH+EE ++ PL KE+VV V+
Sbjct: 724 SKSARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQEEPLSDNKEKVVDVLTVK 783
BLAST of ClCG05G010080 vs. ExPASy TrEMBL
Match:
A0A6J1E412 (uncharacterized protein LOC111430613 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 392.9 bits (1008), Expect = 3.0e-105
Identity = 374/1027 (36.42%), Postives = 455/1027 (44.30%), Query Frame = 0
Query: 3 TNKQKEEARSRKPRQRRNLQMEELPTFTKWLTSFGHSPSTNDSNPKS-NLLTPNRPFLET 62
T+KQKE+ + +K RRNLQ+EE PTFTKWL G S S +D++ KS P P
Sbjct: 4 TDKQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRI 63
Query: 63 PLAVPDSSQDAGPAVADVPPKDDSVQAVDDSISARAGCCCWQTSKSTRRECALKFHLSLR 122
P DSS G VA K+D A+ +SISAR GCCCWQ SKST+REC L FH SLR
Sbjct: 64 P---RDSSSGGGGVVACDVSKEDGGVAIGESISARGGCCCWQRSKSTQRECGLNFHFSLR 123
Query: 123 KRKVVTN-EAGVVTAVSNLQEEETMVAVGDVLDKEGCGCRCWPTFQICGRTKGTVVGTS- 182
K KVVTN GV VS+++EE+ +V G V+++EGCGCRC PTF ICGR K G S
Sbjct: 124 KSKVVTNVSEGVAAGVSDVREEQ-VVEAGAVINREGCGCRCSPTFHICGRRKVPPTGVSY 183
Query: 183 -------------------------DLQKKEGAVTDGLNLQKEEVATNGTDLVKKEEKVV 242
+L K+EG TDG ++ KEEVAT+ DLVK+ E +
Sbjct: 184 LPEKRSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIA 243
Query: 243 TNASSK------------------------------------------------------ 302
K
Sbjct: 244 APGPCKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPE 303
Query: 303 ------ARIVTA------------------------ASNLQEEETVVAVGDVLD------ 362
A I A + +EE TVV V ++LD
Sbjct: 304 SVKEGEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANG 363
Query: 363 -----------------KNGCTC---RCWPTFQICGRRKGIVVGTSDLQKEEGAVTDSLN 422
++G C +C P F +CGR+ V TS K E V D N
Sbjct: 364 PELVQEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRK----VATS---KVELTVVDIPN 423
Query: 423 LGKEEVATNGPNLVKE-------------------------------------EEVVVVV 482
L + E NGP LVKE EE V V
Sbjct: 424 LLEVEEVANGPELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVD 483
Query: 483 D---------PDLRKE-----------EGSGCFCGRWGCLPAFQICRRRKVVAVKEEVIV 542
D PDL KE EGSGC R + FQI R KV A K+E+ V
Sbjct: 484 DPNVPEVANRPDLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQI-RNSKVAAGKQEMTV 543
Query: 543 DVP---KVEELTNDNVNMK-----EGDSVGSLQG-------------------------- 602
DVP +VEE+ ND VN + E V L G
Sbjct: 544 DVPNVLEVEEVANDVVNKQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPEVE 603
Query: 603 ----------RKSVDDNPQTFEKEAVASDYSS--------NVGVPNLQKEGSGCCSCFKC 662
+K VD +P T E A S S+ N VP L K GSG C FK
Sbjct: 604 EVANDVVKKQKKEVDGDPNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSG-CGWFKW 663
Query: 663 LPTFHICGRRRNVDSDVPNPSREEKVVVSGVSDLLEEKVVA-----------TDKESHSK 722
+P+F ICG + V SDVPNPSREEK VV+G SD+ E A D E+ SK
Sbjct: 664 MPSFLICGSK--VVSDVPNPSREEK-VVAGDSDIQEGVGAAAVAVAAAADSDADWEAQSK 723
Query: 723 PAQGGICWPFQICARGWLPRFFLCGEKTVVEASNHREEGEKMPLDVPKEEVVAVIPDPQK 739
A+ G CWPFQ C W P LC V ASNH+EE ++ PL KE+VV V+
Sbjct: 724 SARQGSCWPFQCCTI-WWPSKLLCTGGMDVNASNHQEEEQEEPLSDNKEKVVDVLTVKHN 783
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038880648.1 | 1.4e-245 | 64.85 | uncharacterized protein LOC120072275 [Benincasa hispida] | [more] |
KGN57524.1 | 7.1e-186 | 54.86 | hypothetical protein Csa_011487 [Cucumis sativus] | [more] |
XP_022922695.1 | 5.5e-114 | 39.38 | uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata] | [more] |
XP_022922694.1 | 1.6e-108 | 37.53 | uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata] | [more] |
XP_022922693.1 | 3.6e-105 | 35.86 | uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L996 | 3.4e-186 | 54.86 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G204780 PE=4 SV=1 | [more] |
A0A6J1E9I0 | 2.7e-114 | 39.38 | uncharacterized protein LOC111430613 isoform X4 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E452 | 7.6e-109 | 37.53 | uncharacterized protein LOC111430613 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E7K5 | 1.7e-105 | 35.86 | uncharacterized protein LOC111430613 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E412 | 3.0e-105 | 36.42 | uncharacterized protein LOC111430613 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |