Cla97C11G221490 (gene) Watermelon (97103) v2.5

Overview
NameCla97C11G221490
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionARID domain-containing protein
LocationCla97Chr11: 27488325 .. 27489269 (+)
RNA-Seq ExpressionCla97C11G221490
SyntenyCla97C11G221490
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA

mRNA sequence

ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA

Coding sequence (CDS)

ATGAAATTGGGTAGAGAAACTAATGGAATCCCATCGGCGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGATCTGATTCTAATAAGCTTCGTTTTAGTCACCGTCGATACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCGAAAGCAAAGACGATGGGGTCGGAGATTTCGGAACCGTCGTCACCGAAAGTGACTTGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGAAGGAAATTACGGAGAAGGAGGTTTAATTGGGTTGAATCATTAGGGTTCAAGAAAGATATTATGCAATTCTTGACTTGTTTACGGAGTATACGATTTGATTTTAGGTGTTTCAGAGCTTTCCCAGAAACAGATTTCACCACTGAAGAAGAGGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAAAATCTCAGGGGAATCAAGTGGGTATTGAAGGAAGTGAGAGCTCCAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAAGAAAGTGGGAGTAATGGGATTCAGAGAGATAGCAAAAGTTTTTGTAGCGGTGATGATGCATCGATTGGAAGAACCCCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCAACTCCAGCAAAGAGATGGTTGGAAGAAGAAGGAGAAGAAGATGAAAAGGAAGAAGTGAAGGTGAAAAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATATTGCAAAATGACATCGCATATTGCAAAACAGACATTGGTTGCTAGTGAAAAGAGTAGGGATTTGTTTACAAGGAGTCAAAGTTGGAAAGTTTGA

Protein sequence

MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEKSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNALLLMRCRSTPAKRWLEEEGEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVASEKSRDLFTRSQSWKV
Homology
BLAST of Cla97C11G221490 vs. NCBI nr
Match: XP_038898663.1 (transcription initiation factor IIE subunit alpha [Benincasa hispida])

HSP 1 Score: 496.9 bits (1278), Expect = 1.3e-136
Identity = 270/330 (81.82%), Postives = 283/330 (85.76%), Query Frame = 0

Query: 1   MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
           MKLGRE  GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R SHR YHRRRKSAES
Sbjct: 1   MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAES 60

Query: 61  PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFN 120
           PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF+
Sbjct: 61  PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFH 120

Query: 121 WVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEK 180
           WVESLGFKKDIMQFLTCLR+IRFDFRCFRAFP TDFTT         EEEEEEEEEEEEK
Sbjct: 121 WVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTT---------EEEEEEEEEEEEK 180

Query: 181 SQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNAL 240
           SQGNQVG++ +ESSRTAFSKWFMVLQE+GSN ++R+SK  CS DD SI    MAPPKNAL
Sbjct: 181 SQGNQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSI-EAAMAPPKNAL 240

Query: 241 LLMRCRSTPAKRWLEEEGEE--------DEKEEVKVKKSLKWLMEEENRER--------Y 300
           LLMRCRS PAKRWLEEE EE        DEKEEVKVKKSLKWLMEEENRER        +
Sbjct: 241 LLMRCRSAPAKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDF 300

Query: 301 CKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
           C+MTS IAK+T V SEKSRDLFTRS SWKV
Sbjct: 301 CRMTSDIAKETWVVSEKSRDLFTRSHSWKV 320

BLAST of Cla97C11G221490 vs. NCBI nr
Match: XP_008444111.1 (PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa] >TYK18616.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa])

HSP 1 Score: 455.7 bits (1171), Expect = 3.3e-124
Identity = 254/325 (78.15%), Postives = 271/325 (83.38%), Query Frame = 0

Query: 1   MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
           MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R  HRR+HRRRKSAE
Sbjct: 8   MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
            WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+       
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
               NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI    MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247

Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
           LLLMRCRS PA+RW+EEE EE  DEKE+VKVKKSLKWLMEEENRER        +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307

Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
             AK+           FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309

BLAST of Cla97C11G221490 vs. NCBI nr
Match: XP_004142611.2 (transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637.1 hypothetical protein Csa_012410 [Cucumis sativus])

HSP 1 Score: 450.3 bits (1157), Expect = 1.4e-122
Identity = 252/331 (76.13%), Postives = 273/331 (82.48%), Query Frame = 0

Query: 1   MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
           MKL RE + GIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R  +RRYHRRRKSAE
Sbjct: 11  MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 70

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 71  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 130

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
           NW+ES GFKKDIMQFLTCLR++RFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE+     
Sbjct: 131 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----- 190

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
               NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDS S C  DD SI  T MAPP+NA
Sbjct: 191 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEAT-MAPPRNA 250

Query: 241 LLLMRCRSTPAKRWLEEEGEED--------EKEEVKVKKSLKWLMEEENRER-------- 300
           LLLMRC+S PA+RW+EEE EE+        EKE+VKVKKSLKWLMEEENRER        
Sbjct: 251 LLLMRCKSAPARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTD 310

Query: 301 YCKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
           +C+M S  AK+           FTRSQSWKV
Sbjct: 311 FCRMISDNAKE-----------FTRSQSWKV 320

BLAST of Cla97C11G221490 vs. NCBI nr
Match: XP_022147766.1 (uncharacterized protein LOC111016595 [Momordica charantia])

HSP 1 Score: 412.9 bits (1060), Expect = 2.4e-111
Identity = 242/343 (70.55%), Postives = 263/343 (76.68%), Query Frame = 0

Query: 1   MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRK--SA 60
           MKLGR+   I SADLLVCFPSRS+L LMP PLCSPARG DSNKLR SHR +HRRRK  SA
Sbjct: 1   MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60

Query: 61  ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
            SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK  + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61  ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120

Query: 121 RRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEE 180
           RR NWVESLGFKKDIMQFLTCLR+IRFDFRCF+AFPE DFTT        EEE+EEEEEE
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT--------EEEDEEEEEE 180

Query: 181 EEEKSQGNQVGIEGSESSRTAFSKWFMVLQESG-SNGIQRDSKSFCSGDDASIGRTPMAP 240
           EE KSQ NQVG+EG+ESSRTAFSKWFMVLQESG SNGI R+S              P+AP
Sbjct: 181 EEGKSQENQVGVEGNESSRTAFSKWFMVLQESGASNGICRESNG-----------PPLAP 240

Query: 241 PKNALLLMRCRSTPAKRWLEEEGEEDEKE-----------------EVKVKKSLKWLMEE 300
           PKNALLLMRCRS PAK W EEE EE+E+E                 EVKVKKSLKWLMEE
Sbjct: 241 PKNALLLMRCRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEE 300

Query: 301 ENRER--------YCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
           ENRER        +C+M+S IAK+T V     RDLF+RS+SWK
Sbjct: 301 ENRERLVMEMGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320

BLAST of Cla97C11G221490 vs. NCBI nr
Match: XP_022935869.1 (uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 uncharacterized protein LOC111442647 [Cucurbita moschata])

HSP 1 Score: 392.5 bits (1007), Expect = 3.4e-105
Identity = 222/316 (70.25%), Postives = 240/316 (75.95%), Query Frame = 0

Query: 1   MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
           MK  R+T   PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKL    RRYHRRRKSAES
Sbjct: 1   MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKL----RRYHRRRKSAES 60

Query: 61  PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           PVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61  PVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRF 120

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
           NWVESLGFKKDIMQFLTCLRS+RFDF CF AFPE +FT+E+EEEEE              
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-------------- 180

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
                 VG+EGS+ SRTAFSKWFMVLQ S   G++RD    C+ DDASIG  PMAPP+NA
Sbjct: 181 ------VGVEGSDGSRTAFSKWFMVLQGS---GVRRDGNGLCTVDDASIG-PPMAPPRNA 240

Query: 241 LLLMRCRSTPAKRWLEEE-GEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVA 300
           LLLMRCRS PAK W+EE   EE E+ EVKVKKSLKWLMEEENRE                
Sbjct: 241 LLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRE---------------- 269

Query: 301 SEKSRDLFTRSQSWKV 315
              SRDL TRSQSWKV
Sbjct: 301 ---SRDLVTRSQSWKV 269

BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match: A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-124
Identity = 254/325 (78.15%), Postives = 271/325 (83.38%), Query Frame = 0

Query: 1   MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
           MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R  HRR+HRRRKSAE
Sbjct: 8   MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
            WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+       
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
               NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI    MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247

Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
           LLLMRCRS PA+RW+EEE EE  DEKE+VKVKKSLKWLMEEENRER        +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307

Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
             AK+           FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309

BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match: A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-124
Identity = 254/325 (78.15%), Postives = 271/325 (83.38%), Query Frame = 0

Query: 1   MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
           MKL RE + GIPSADLLVCFPSRSHLALMPNPLCSPARGSDS+K R  HRR+HRRRKSAE
Sbjct: 8   MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
            WVES GFKKDIMQFLTCLR+IRFDFRCFRAFPETDFTTEEEEEEEEEEE+E+       
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------- 187

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
               NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDSKS C+ DD SI    MAPP NA
Sbjct: 188 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESI-EAIMAPPINA 247

Query: 241 LLLMRCRSTPAKRWLEEEGEE--DEKEEVKVKKSLKWLMEEENRER--------YCKMTS 300
           LLLMRCRS PA+RW+EEE EE  DEKE+VKVKKSLKWLMEEENRER        +C+MTS
Sbjct: 248 LLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTS 307

Query: 301 HIAKQTLVASEKSRDLFTRSQSWKV 315
             AK+           FTRSQSWKV
Sbjct: 308 DNAKE-----------FTRSQSWKV 309

BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match: A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 6.7e-123
Identity = 252/331 (76.13%), Postives = 273/331 (82.48%), Query Frame = 0

Query: 1   MKLGRE-TNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE 60
           MKL RE + GIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R  +RRYHRRRKSAE
Sbjct: 1   MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 60

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
           NW+ES GFKKDIMQFLTCLR++RFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE+     
Sbjct: 121 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----- 180

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
               NQVGIE +ESSRTAFSKWFMVLQE+GSN ++RDS S C  DD SI  T MAPP+NA
Sbjct: 181 ----NQVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEAT-MAPPRNA 240

Query: 241 LLLMRCRSTPAKRWLEEEGEED--------EKEEVKVKKSLKWLMEEENRER-------- 300
           LLLMRC+S PA+RW+EEE EE+        EKE+VKVKKSLKWLMEEENRER        
Sbjct: 241 LLLMRCKSAPARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTD 300

Query: 301 YCKMTSHIAKQTLVASEKSRDLFTRSQSWKV 315
           +C+M S  AK+           FTRSQSWKV
Sbjct: 301 FCRMISDNAKE-----------FTRSQSWKV 310

BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match: A0A6J1D3C2 (uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016595 PE=4 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 1.2e-111
Identity = 242/343 (70.55%), Postives = 263/343 (76.68%), Query Frame = 0

Query: 1   MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRK--SA 60
           MKLGR+   I SADLLVCFPSRS+L LMP PLCSPARG DSNKLR SHR +HRRRK  SA
Sbjct: 1   MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60

Query: 61  ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
            SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK  + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61  ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120

Query: 121 RRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEE 180
           RR NWVESLGFKKDIMQFLTCLR+IRFDFRCF+AFPE DFTT        EEE+EEEEEE
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT--------EEEDEEEEEE 180

Query: 181 EEEKSQGNQVGIEGSESSRTAFSKWFMVLQESG-SNGIQRDSKSFCSGDDASIGRTPMAP 240
           EE KSQ NQVG+EG+ESSRTAFSKWFMVLQESG SNGI R+S              P+AP
Sbjct: 181 EEGKSQENQVGVEGNESSRTAFSKWFMVLQESGASNGICRESNG-----------PPLAP 240

Query: 241 PKNALLLMRCRSTPAKRWLEEEGEEDEKE-----------------EVKVKKSLKWLMEE 300
           PKNALLLMRCRS PAK W EEE EE+E+E                 EVKVKKSLKWLMEE
Sbjct: 241 PKNALLLMRCRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEE 300

Query: 301 ENRER--------YCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
           ENRER        +C+M+S IAK+T V     RDLF+RS+SWK
Sbjct: 301 ENRERLVMEMGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320

BLAST of Cla97C11G221490 vs. ExPASy TrEMBL
Match: A0A6J1FBW7 (uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC111442647 PE=4 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 1.7e-105
Identity = 222/316 (70.25%), Postives = 240/316 (75.95%), Query Frame = 0

Query: 1   MKLGRETNGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAES 60
           MK  R+T   PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKL    RRYHRRRKSAES
Sbjct: 1   MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKL----RRYHRRRKSAES 60

Query: 61  PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           PVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61  PVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRF 120

Query: 121 NWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEE 180
           NWVESLGFKKDIMQFLTCLRS+RFDF CF AFPE +FT+E+EEEEE              
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-------------- 180

Query: 181 KSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNA 240
                 VG+EGS+ SRTAFSKWFMVLQ S   G++RD    C+ DDASIG  PMAPP+NA
Sbjct: 181 ------VGVEGSDGSRTAFSKWFMVLQGS---GVRRDGNGLCTVDDASIG-PPMAPPRNA 240

Query: 241 LLLMRCRSTPAKRWLEEE-GEEDEKEEVKVKKSLKWLMEEENRERYCKMTSHIAKQTLVA 300
           LLLMRCRS PAK W+EE   EE E+ EVKVKKSLKWLMEEENRE                
Sbjct: 241 LLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRE---------------- 269

Query: 301 SEKSRDLFTRSQSWKV 315
              SRDL TRSQSWKV
Sbjct: 301 ---SRDLVTRSQSWKV 269

BLAST of Cla97C11G221490 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 223.0 bits (567), Expect = 3.3e-58
Identity = 162/348 (46.55%), Postives = 209/348 (60.06%), Query Frame = 0

Query: 12  SADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAE----------SP 71
           SADLLVCFPSR+HLAL P P+CSP+R SDS+    ++RR H RR+ ++          SP
Sbjct: 17  SADLLVCFPSRTHLALTPKPICSPSRPSDSS----TNRRPHHRRQLSKLSGGGGGGHGSP 76

Query: 72  VVWAK---AKTM-GSEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRK 131
           V+WAK   +K M G EI+EP+SPKVTCAGQIK+RP       K+WQSVMEEIERIH+ R 
Sbjct: 77  VLWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRS 136

Query: 132 LRRRRFNWVESLGFKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEE 191
             +         G KKD+M FLTCLR+I+FDFRCF  F   D T++++EEE+++++EEEE
Sbjct: 137 QSK-------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEE 196

Query: 192 EEEEEEKSQGNQVGIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFC--SGDDASIGRT 251
             E EE+           E+S+T FSKWFMVLQE  +N     + + C    D       
Sbjct: 197 VVEGEEE-----------ENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETE 256

Query: 252 PMAPPKNALLLMRCRSTPAKRWLEE--------EGEEDEKEEVKV----------KKSLK 311
           P  PP NALLLMRCRS PAK WLEE        E  E++KEE +           KK L+
Sbjct: 257 PAVPPPNALLLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLR 316

Query: 312 WLMEEENRE--------RYCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
            LMEEE  E         + +++S IAK+T V     +D  +RS+SWK
Sbjct: 317 SLMEEEKMELVLMRYDTEFYRLSSDIAKETWVVG-GIQDPLSRSRSWK 341

BLAST of Cla97C11G221490 vs. TAIR 10
Match: AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )

HSP 1 Score: 175.6 bits (444), Expect = 6.1e-44
Identity = 143/332 (43.07%), Postives = 181/332 (54.52%), Query Frame = 0

Query: 12  SADLLVCFPSRSHLALMPNPLCSPARGSDSNKLRFSHRRYHRRRKSAESPVVWAKAKTMG 71
           SADL+VCFPSR+HL+L    + SP+   +  +    HRR   +  S+   V   + +  G
Sbjct: 13  SADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGGV--RQNRGGG 72

Query: 72  SE-ISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKLRRRRFNWVESLG 131
            E + EP+SPKVTCAGQIK+R        K+WQS+M EIE+IH R K   + F      G
Sbjct: 73  REVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH-RSKSESKFF------G 132

Query: 132 FKKDIMQFLTCLRSIRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEEEEEEEKSQGNQV 191
            K+D+M FLTCLR   FDFRCF AFP  D  +++EEE+EEEEEE+EEE+E+         
Sbjct: 133 IKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEEEDED--------- 192

Query: 192 GIEGSESSRTAFSKWFMVLQESGSNGIQRDSKSFCSGDDASIGRTPMAPPKNALLLMRCR 251
                ESS T FSKW MVL E  +N    D K     D  +       PP NALLLMRCR
Sbjct: 193 -----ESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVET-----AVPPPNALLLMRCR 252

Query: 252 STPAKRWLEE----------------EGEEDEKEEVKVKKSLKWLMEEENR--------- 311
           S P K W EE                E EE+EK+ V  KK L+ LMEEE +         
Sbjct: 253 SAPVKNWSEEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYD 312

Query: 312 ERYCKMTSHIAKQTLVASEKSRDLFTRSQSWK 314
             Y K+++ IAK+T V       LF RS+SWK
Sbjct: 313 TNYYKLSNDIAKETWVVGGIQDPLF-RSRSWK 313

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898663.11.3e-13681.82transcription initiation factor IIE subunit alpha [Benincasa hispida][more]
XP_008444111.13.3e-12478.15PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 tra... [more]
XP_004142611.21.4e-12276.13transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637.... [more]
XP_022147766.12.4e-11170.55uncharacterized protein LOC111016595 [Momordica charantia][more]
XP_022935869.13.4e-10570.25uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 unchar... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3D5031.6e-12478.15Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku... [more]
A0A1S3B9491.6e-12478.15uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=... [more]
A0A0A0L1Z46.7e-12376.13Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1[more]
A0A6J1D3C21.2e-11170.55uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016... [more]
A0A6J1FBW71.7e-10570.25uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC1114426... [more]
Match NameE-valueIdentityDescription
AT1G78110.13.3e-5846.55unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G22230.16.1e-4443.07unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 153..186
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 156..192
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 156..180
NoneNo IPR availablePANTHERPTHR33448CHLOROPLAST PROTEIN HCF243-RELATEDcoord: 4..313
NoneNo IPR availablePANTHERPTHR33448:SF3OS09G0370000 PROTEINcoord: 4..313

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C11G221490.1Cla97C11G221490.1mRNA