Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA
mRNA sequence
ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA
Coding sequence (CDS)
ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA
Protein sequence
MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEKSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNALLLMRCRSTPAKRWLEEEGEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVASEKSRDLFTRSQSWKV
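The protein sequence above is the conceptual translation of the coding sequence. As a minimal sketch, assuming the standard nuclear genetic code (the page does not say which translation table was used) and using a hypothetical helper named `translate`, the first codons of the CDS can be checked against the start of the protein:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGAAATTGGGTAGAGAAACTAATGGA"))  # MKLGRETNG
```

Passing the full CDS string to `translate` should allow the same comparison against the complete protein sequence.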
Homology
BLAST of Cla97C11G221490 vs. NCBI nr
Match:
XP_038898663.1 (transcription initiation factor IIE subunit alpha [Benincasa hispida])
HSP 1 Score: 496.9 bits (1278), Expect = 1.3e-136
Identity = 270/330 (81.82%), Positives = 283/330 (85.76%), Query Frame = 0
Query: 1 MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
MKLGRE GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R SHR YHRRRKSAES
Sbjct: 1 MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAES 60
Query: 61 PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFN 120
PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF+
Sbjct: 61 PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFH 120
Query: 121 WVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEK 180
WVESLGFKKDIMQFLTCLR+IRFDFRCFRAFP TDFTT EEEEEEEEEEEEK
Sbjct: 121 WVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTT---------EEEEEEEEEEEEK 180
Query: 181 SQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNAL 240
SQGNQVG++ +ESSRTAFSKWFMVLQE+GSN ++R+SK CS DD SI MAPPKNAL
Sbjct: 181 SQGNQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSI-EAAMAPPKNAL 240
Query: 241 LLMRCRSTPAKRWLEEEGEE--------DEKEEVKVKKSLKWLMEEENRER--------Y 300
LLMRCRS PAKRWLEEE EE DEKEEVKVKKSLKWLMEEENRER +
Sbjct: 241 LLMRCRSAPAKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDF 300
Query: 301 CKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
C+MTS IAK+T V SEKSRDLFTRS SWKV
Sbjct: 301 CRMTSDIAKETWVVSEKSRDLFTRSHSWKV 320
BLAST of Cla97C11G221490 vs. NCBI nr
Match:
XP_008444111.1 (PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa] >TYK18616.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa])
HSP 1 Score: 455.7 bits (1171), Expect = 3.3e-124
Identity = 254/325 (78.15%), Positives = 271/325 (83.38%), Query Frame = 0
Query: 1 MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R HRR+HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247
Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
LLLMRCRS PA+RW+EEE EE DEKE+VKVKKSLKWLMEEENRER +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307
Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
AK+ FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309
BLAST of Cla97C11G221490 vs. NCBI nr
Match:
XP_004142611.2 (transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637.1 hypothetical protein Csa_012410 [Cucumis sativus])
HSP 1 Score: 450.3 bits (1157), Expect = 1.4e-122
Identity = 252/331 (76.13%), Positives = 273/331 (82.48%), Query Frame = 0
Query: 1 MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
MKL RE + GIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R +RRYHRRRKSAE
Sbjct: 11 MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 70
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 71 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 130
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
NW+ES GFKKDIMQFLTCLR++RFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE+
Sbjct: 131 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----- 190
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDS S C DD SI T MAPP+NA
Sbjct: 191 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEAT-MAPPRNA 250
Query: 241 LLLMRCRSTPAKRWLEEEGEED--------EKEEVKVKKSLKWLMEEENRER-------- 300
LLLMRC+S PA+RW+EEE EE+ EKE+VKVKKSLKWLMEEENRER
Sbjct: 251 LLLMRCKSAPARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTD 310
Query: 301 YCKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
+C+M S AK+ FTRSQSWKV
Sbjct: 311 FCRMISDNAKE-----------FTRSQSWKV 320
BLAST of Cla97C11G221490 vs. NCBI nr
Match:
XP_022147766.1 (uncharacterized protein LOC111016595 [Momordica charantia])
HSP 1 Score: 412.9 bits (1060), Expect = 2.4e-111
Identity = 242/343 (70.55%), Positives = 263/343 (76.68%), Query Frame = 0
Query: 1 MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRK--SA 60
MKLGR+ I SADLLVCFPSRS+L LMP PLCSPARG DSNKLR SHR +HRRRK SA
Sbjct: 1 MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60
Query: 61 ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61 ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120
Query: 121 RRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEE 180
RR NWVESLGFKKDIMQFLTCLR+IRFDFRCF+AFPE DFTT EEE+EEEEEE
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT--------EEEDEEEEEE 180
Query: 181 EEEKSQGNQVGIEGSESSRTAFSKWFMVLQESG-SNGIQRDSKSFCSGDDASIGRTPMAP 240
EE KSQ NQVG+EG+ESSRTAFSKWFMVLQESG SNGI R+S P+AP
Sbjct: 181 EEGKSQENQVGVEGNESSRTAFSKWFMVLQESGASNGICRESNG-----------PPLAP 240
Query: 241 PKNALLLMRCRSTPAKRWLEEEGEEDEKE-----------------EVKVKKSLKWLMEE 300
PKNALLLMRCRS PAK W EEE EE+E+E EVKVKKSLKWLMEE
Sbjct: 241 PKNALLLMRCRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEE 300
Query: 301 ENRER--------YCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
ENRER +C+M+S IAK+T V RDLF+RS+SWK
Sbjct: 301 ENRERLVMEMGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320
BLAST of Cla97C11G221490 vs. NCBI nr
Match:
XP_022935869.1 (uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 uncharacterized protein LOC111442647 [Cucurbita moschata])
HSP 1 Score: 392.5 bits (1007), Expect = 3.4e-105
Identity = 222/316 (70.25%), Positives = 240/316 (75.95%), Query Frame = 0
Query: 1 MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
MK R+T PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKL RRYHRRRKSAES
Sbjct: 1 MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKL----RRYHRRRKSAES 60
Query: 61 PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
PVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61 PVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRF 120
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
NWVESLGFKKDIMQFLTCLRS+RFDF CF AFPE +FT+E+EEEEE
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-------------- 180
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
VG+EGS+ SRTAFSKWFMVLQ S G++RD C+ DDASIG PMAPP+NA
Sbjct: 181 ------VGVEGSDGSRTAFSKWFMVLQGS---GVRRDGNGLCTVDDASIG-PPMAPPRNA 240
Query: 241 LLLMRCRSTPAKRWLEEE-GEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVA 300
LLLMRCRS PAK W+EE EE E+ EVKVKKSLKWLMEEENRE
Sbjct: 241 LLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRE---------------- 269
Query: 301 SEKSRDLFTRSQSWKV 315
SRDL TRSQSWKV
Sbjct: 301 ---SRDLVTRSQSWKV 269
BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match:
A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)
HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-124
Identity = 254/325 (78.15%), Positives = 271/325 (83.38%), Query Frame = 0
Query: 1 MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R HRR+HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247
Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
LLLMRCRS PA+RW+EEE EE DEKE+VKVKKSLKWLMEEENRER +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307
Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
AK+ FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309
BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match:
A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)
HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-124
Identity = 254/325 (78.15%), Positives = 271/325 (83.38%), Query Frame = 0
Query: 1 MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R HRR+HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247
Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
LLLMRCRS PA+RW+EEE EE DEKE+VKVKKSLKWLMEEENRER +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307
Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
AK+ FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309
BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match:
A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)
HSP 1 Score: 450.3 bits (1157), Expect = 6.7e-123
Identity = 252/331 (76.13%), Positives = 273/331 (82.48%), Query Frame = 0
Query: 1 MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
MKL RE + GIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R +RRYHRRRKSAE
Sbjct: 1 MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 60
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
NW+ES GFKKDIMQFLTCLR++RFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE+
Sbjct: 121 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----- 180
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDS S C DD SI T MAPP+NA
Sbjct: 181 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEAT-MAPPRNA 240
Query: 241 LLLMRCRSTPAKRWLEEEGEED--------EKEEVKVKKSLKWLMEEENRER-------- 300
LLLMRC+S PA+RW+EEE EE+ EKE+VKVKKSLKWLMEEENRER
Sbjct: 241 LLLMRCKSAPARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTD 300
Query: 301 YCKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
+C+M S AK+ FTRSQSWKV
Sbjct: 301 FCRMISDNAKE-----------FTRSQSWKV 310
BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match:
A0A6J1D3C2 (uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016595 PE=4 SV=1)
HSP 1 Score: 412.9 bits (1060), Expect = 1.2e-111
Identity = 242/343 (70.55%), Positives = 263/343 (76.68%), Query Frame = 0
Query: 1 MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRK--SA 60
MKLGR+ I SADLLVCFPSRS+L LMP PLCSPARG DSNKLR SHR +HRRRK SA
Sbjct: 1 MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60
Query: 61 ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61 ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120
Query: 121 RRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEE 180
RR NWVESLGFKKDIMQFLTCLR+IRFDFRCF+AFPE DFTT EEE+EEEEEE
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT--------EEEDEEEEEE 180
Query: 181 EEEKSQGNQVGIEGSESSRTAFSKWFMVLQESG-SNGIQRDSKSFCSGDDASIGRTPMAP 240
EE KSQ NQVG+EG+ESSRTAFSKWFMVLQESG SNGI R+S P+AP
Sbjct: 181 EEGKSQENQVGVEGNESSRTAFSKWFMVLQESGASNGICRESNG-----------PPLAP 240
Query: 241 PKNALLLMRCRSTPAKRWLEEEGEEDEKE-----------------EVKVKKSLKWLMEE 300
PKNALLLMRCRS PAK W EEE EE+E+E EVKVKKSLKWLMEE
Sbjct: 241 PKNALLLMRCRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEE 300
Query: 301 ENRER--------YCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
ENRER +C+M+S IAK+T V RDLF+RS+SWK
Sbjct: 301 ENRERLVMEMGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320
BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match:
A0A6J1FBW7 (uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC111442647 PE=4 SV=1)
HSP 1 Score: 392.5 bits (1007), Expect = 1.7e-105
Identity = 222/316 (70.25%), Positives = 240/316 (75.95%), Query Frame = 0
Query: 1 MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
MK R+T PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKL RRYHRRRKSAES
Sbjct: 1 MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKL----RRYHRRRKSAES 60
Query: 61 PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
PVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61 PVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRF 120
Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
NWVESLGFKKDIMQFLTCLRS+RFDF CF AFPE +FT+E+EEEEE
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-------------- 180
Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
VG+EGS+ SRTAFSKWFMVLQ S G++RD C+ DDASIG PMAPP+NA
Sbjct: 181 ------VGVEGSDGSRTAFSKWFMVLQGS---GVRRDGNGLCTVDDASIG-PPMAPPRNA 240
Query: 241 LLLMRCRSTPAKRWLEEE-GEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVA 300
LLLMRCRS PAK W+EE EE E+ EVKVKKSLKWLMEEENRE
Sbjct: 241 LLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRE---------------- 269
Query: 301 SEKSRDLFTRSQSWKV 315
SRDL TRSQSWKV
Sbjct: 301 ---SRDLVTRSQSWKV 269
BLAST of Cla97C11G221490 vs. TAIR 10
Match:
AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )
HSP 1 Score: 223.0 bits (567), Expect = 3.3e-58
Identity = 162/348 (46.55%), Positives = 209/348 (60.06%), Query Frame = 0
Query: 12 SADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE----------SP 71
SADLLVCFPSR+HLAL P P+CSP+R SDS+ ++RR H RR+ ++ SP
Sbjct: 17 SADLLVCFPSRTHLALTPKPICSPSRPSDSS----TNRRPHHRRQLSKLSGGGGGGHGSP 76
Query: 72 VVWAK---AKTM-GSEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRK 131
V+WAK +K M G EI+EP+SPKVTCAGQIK+RP K+WQSVMEEIERIH+ R
Sbjct: 77 VLWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRS 136
Query: 132 LRRRRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE 191
+ G KKD+M FLTCLR+I+FDFRCF F D T++++EEE+++++EEEE
Sbjct: 137 QSK-------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEE 196
Query: 192 EEEEEEKSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFC--SGDDASIGRT 251
E EE+ E+S+T FSKWFMVLQE +N + + C D
Sbjct: 197 VVEGEEE-----------ENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETE 256
Query: 252 PMAPPKNALLLMRCRSTPAKRWLEE--------EGEEDEKEEVKV----------KKSLK 311
P PP NALLLMRCRS PAK WLEE E E++KEE + KK L+
Sbjct: 257 PAVPPPNALLLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLR 316
Query: 312 WLMEEENRE--------RYCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
LMEEE E + +++S IAK+T V +D +RS+SWK
Sbjct: 317 SLMEEEKMELVLMRYDTEFYRLSSDIAKETWVVG-GIQDPLSRSRSWK 341
BLAST of Cla97C11G221490 vs. TAIR 10
Match:
AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )
HSP 1 Score: 175.6 bits (444), Expect = 6.1e-44
Identity = 143/332 (43.07%), Positives = 181/332 (54.52%), Query Frame = 0
Query: 12 SADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAESPVVWAKAKTMG 71
SADL+VCFPSR+HL+L + SP+ + + HRR + S+ V + + G
Sbjct: 13 SADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGGV--RQNRGGG 72
Query: 72 SE-ISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKLRRRRFNWVESLG 131
E + EP+SPKVTCAGQIK+R K+WQS+M EIE+IH R K + F G
Sbjct: 73 REVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH-RSKSESKFF------G 132
Query: 132 FKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEKSQGNQV 191
K+D+M FLTCLR FDFRCF AFP D +++EEE+EEEEEE+EEE+E+
Sbjct: 133 IKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEEEDED--------- 192
Query: 192 GIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNALLLMRCR 251
ESS T FSKW MVL E +N D K D + PP NALLLMRCR
Sbjct: 193 -----ESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVET-----AVPPPNALLLMRCR 252
Query: 252 STPAKRWLEE----------------EGEEDEKEEVKVKKSLKWLMEEENR--------- 311
S P K W EE E EE+EK+ V KK L+ LMEEE +
Sbjct: 253 SAPVKNWSEEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYD 312
Query: 312 ERYCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
Y K+++ IAK+T V LF RS+SWK
Sbjct: 313 TNYYKLSNDIAKETWVVGGIQDPLF-RSRSWK 313
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038898663.1 | 1.3e-136 | 81.82 | transcription initiation factor IIE subunit alpha [Benincasa hispida] | [more] |
XP_008444111.1 | 3.3e-124 | 78.15 | PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 tra... | [more] |
XP_004142611.2 | 1.4e-122 | 76.13 | transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637.... | [more] |
XP_022147766.1 | 2.4e-111 | 70.55 | uncharacterized protein LOC111016595 [Momordica charantia] | [more] |
XP_022935869.1 | 3.4e-105 | 70.25 | uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 unchar... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3D503 | 1.6e-124 | 78.15 | Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku... | [more] |
A0A1S3B949 | 1.6e-124 | 78.15 | uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=... | [more] |
A0A0A0L1Z4 | 6.7e-123 | 76.13 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1 | [more] |
A0A6J1D3C2 | 1.2e-111 | 70.55 | uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016... | [more] |
A0A6J1FBW7 | 1.7e-105 | 70.25 | uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC1114426... | [more] |
Match Name | E-value | Identity | Description | |
AT1G78110.1 | 3.3e-58 | 46.55 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT1G22230.1 | 6.1e-44 | 43.07 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |
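The identity value listed for each hit repeats the percentage from its HSP header: the number of identical aligned positions divided by the alignment length (270/330 for the Benincasa hispida hit, for example). A minimal sketch of that arithmetic, with `percent` as a hypothetical helper name:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / alignment_length, 2)


# Counts taken from the HSP header of the XP_038898663.1 hit.
print(percent(270, 330))  # identity: 81.82
print(percent(283, 330))  # positives: 85.76
```

The denominator is the alignment length including gap columns, which is why it can exceed the 315-residue query length.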