Cla97C06G118200 (gene) Watermelon (97103) v2

NameCla97C06G118200
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionLate embryogenesis abundant protein, LEA-14
LocationCla97Chr06 : 11741267 .. 11743140 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCTCCAGTTCCAGGGACGATTCGGTCCCTCTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTTCATGCCGACATCGGCGACTTCTCCGACTCTGTGCCCTCTACTCCGCCGCCTTCTTCCTCCTCTTCGCCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCCGATTAAAACTCAATCGCGTCAAAGTCCATTTGTTGCCTGTTGTCGCGCTTGACCTTTCTTTCTCTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTACGATTACATTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGATTTGAATGGGTTAGAAGTTGTTCACGATGTCTTTTACTTGCTTGCGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCGTGGGGATTTTCTTAATCAAATTCCCGATTAAGGTAATATTGATTTTGATTCTGTTTCCGACCATAAATCAAATCTTTACTTGGAATTGTAAATTGCTTCGATGTTGATTTGTTTTGTTTCTTTGATTAGTTGGCAATCATTGAATGGATGAACAGTAGTAAGACTCCATTTCTTTGCAAACAGCACGATTTAAATAATGCTTATTCTGCTGTGGCGACTATTACACGTTTCTTAGTGTAGAAGTTCAGATAATTGGAATAAGACATTCAACTTTGTAATTGACTCTTTGGATGTCCAGTATTGAACCCATTTCTTGGTCCTGCAAGCTGCAGCACTCCTTAAATCATTGCCAAATGCTTAGGCCCCATTTGATTACTTTTGGTCTTTGACCATTTTGTTTCTTTTCTCACAATTTCTTTAACATGGATTTTCAAAACATAACAAAGCAAAGAAACTAATAAACTAAAAATCAAATCATCTTTGGCTTCTTCACCTCTGTGCCCTCTACTGGCCCATTTTGTGCTATGATGTTGATAGGGAAAATGGTTGAAGACGATTAACCTTCTTATTCTGAACTTGAACCCTGCCGTTGATTTGATTATTGCCTGCCGATTCGTACATTAAATATAATTATAACCATGTCGTCTTAGGCTATTGGAATGTGTAGATCTCAAATGCTTTTCTTGTGGAATGTGTCATCCGAGTTAAAAGAACATCCCTATTCTATCCTTCACTGCCTTCTTTCCCCAACTTGACTATTCAGCTAAATAAATAGTCTAAGCTATTCTCGAGGGTTTGCTATTATTGTAATTATAATGTTGTGTTACGCTACTGTGGTTCTTGTATAAATGTCAAATAGTGGAATGTGTCATTTGAATTCAAAGAAGTCTGTCTTCCTCATCCTCACCCGACCCTCCTGTCATTCTGTGGCCTCTTTCACCAACTTAACTATTCAGCACAAATAATTCTAATTTAGTGTGTGTTCTGTTCTTTTAATGGTGTTTCTTAATGCTCGTCAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAAACAATTGAACATCAAGATTGCTACCCTGAGGTGAGAATTCATCACGTTGCTCACTTTTCTGTTGATATTTCTGCCAAAGTTTCAACCTCACAACCCCTTGGATCTTTCTTCTTGTATTTGCAGTGAAGGGAAGATGGAAATTGGGTTTTGATTATTACTTTCGTGACATGAAGCTGAAACTGGGAAGTGGGAACTCCCCTGATCTTGCTGAATATGACTGTAAATATCACTCGCAGAAAGTTAGTGTTCATTGTGGTATGCTAGGGATTTGA

mRNA sequence

ATGACCTCCAGTTCCAGGGACGATTCGGTCCCTCTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTTCATGCCGACATCGGCGACTTCTCCGACTCTGTGCCCTCTACTCCGCCGCCTTCTTCCTCCTCTTCGCCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCCGATTAAAACTCAATCGCGTCAAAGTCCATTTGTTGCCTGTTGTCGCGCTTGACCTTTCTTTCTCTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTACGATTACATTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGATTTGAATGGGTTAGAAGTTGTTCACGATGTCTTTTACTTGCTTGCGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCGTGGGGATTTTCTTAATCAAATTCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAAACAATTGAACATCAAGATTGCTACCCTGAGGGAAGATGGAAATTGGGTTTTGATTATTACTTTCGTGACATGAAGCTGAAACTGGGAAGTGGGAACTCCCCTGATCTTGCTGAATATGACTGTAAATATCACTCGCAGAAAGTTAGTGTTCATTGTGGTATGCTAGGGATTTGA

Coding sequence (CDS)

ATGACCTCCAGTTCCAGGGACGATTCGGTCCCTCTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTTCATGCCGACATCGGCGACTTCTCCGACTCTGTGCCCTCTACTCCGCCGCCTTCTTCCTCCTCTTCGCCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCCGATTAAAACTCAATCGCGTCAAAGTCCATTTGTTGCCTGTTGTCGCGCTTGACCTTTCTTTCTCTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTACGATTACATTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGATTTGAATGGGTTAGAAGTTGTTCACGATGTCTTTTACTTGCTTGCGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCGTGGGGATTTTCTTAATCAAATTCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAAACAATTGAACATCAAGATTGCTACCCTGAGGGAAGATGGAAATTGGGTTTTGATTATTACTTTCGTGACATGAAGCTGAAACTGGGAAGTGGGAACTCCCCTGATCTTGCTGAATATGACTGTAAATATCACTCGCAGAAAGTTAGTGTTCATTGTGGTATGCTAGGGATTTGA

Protein sequence

MTSSSRDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRGRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSVGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPEGRWKLGFDYYFRDMKLKLGSGNSPDLAEYDCKYHSQKVSVHCGMLGI
BLAST of Cla97C06G118200 vs. NCBI nr
Match: XP_022144909.1 (uncharacterized protein LOC111014473 [Momordica charantia])

HSP 1 Score: 356.3 bits (913), Expect = 8.9e-95
Identity = 179/215 (83.26%), Postives = 194/215 (90.23%), Query Frame = 0

Query: 1   MTSSSRDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVA 60
           MTSSSRDDSVP+PY+LLP NAA QNVVVLSLYRPP  R RRLLRLCA YSAAF LL AVA
Sbjct: 1   MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVA 60

Query: 61  FLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYR 120
           FLLFP+DPSLQLVRLKLNR+KV LLPV+ LDLSFSAS+RVRN NFFSLDY+Y+GVSVGYR
Sbjct: 61  FLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDYNYLGVSVGYR 120

Query: 121 GRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGS 180
           GRRLGFVSSEGGRVSARG SYVNATLDLNG EV+HD  YL+ DL  GI+PFDTETEVEG 
Sbjct: 121 GRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGY 180

Query: 181 VGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           +G+F IKFPIKARVSCEV VNTN++TIEHQDCYPE
Sbjct: 181 MGLFFIKFPIKARVSCEVFVNTNDKTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. NCBI nr
Match: XP_023515526.1 (uncharacterized protein LOC111779657 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 347.4 bits (890), Expect = 4.1e-92
Identity = 176/214 (82.24%), Postives = 193/214 (90.19%), Query Frame = 0

Query: 3   SSSRDDSVPLPYTLLPQNA-AQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAF 62
           S S+D S+P+PY+ +P NA A QNVVVLSLYRPP  RHRRLLRLCALYS AF LL AV F
Sbjct: 2   SCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVF 61

Query: 63  LLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRG 122
           LLFPSDPSLQLVRLKLN V V LLP V LDLSFSAS+RVRNKNFFSLDY+Y+GVSVGYRG
Sbjct: 62  LLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNYLGVSVGYRG 121

Query: 123 RRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSV 182
           RRLGFVSS+GGRVSARGSSYVNATLDLNGL+++HDVF+LL DL KGIIPFDTETEVEGS+
Sbjct: 122 RRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSM 181

Query: 183 GIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           G+F IKFPIKA VSCEVLV+TN+QTIEHQDCYPE
Sbjct: 182 GLFFIKFPIKATVSCEVLVDTNSQTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. NCBI nr
Match: XP_022987870.1 (uncharacterized protein LOC111485280 [Cucurbita maxima])

HSP 1 Score: 344.7 bits (883), Expect = 2.7e-91
Identity = 175/214 (81.78%), Postives = 193/214 (90.19%), Query Frame = 0

Query: 3   SSSRDDSVPLPYTLLPQN-AAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAF 62
           S S+D S+P+PY+ +P N AA QNVVVLSLYRPP  R RRLLRLCALYSAAF LL AV F
Sbjct: 2   SCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAVVF 61

Query: 63  LLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRG 122
           LLFPSDPSLQLVRLKLN VKV LLP V LDLSFSAS+RVRNKNFFSLDY+Y+GVSVG+RG
Sbjct: 62  LLFPSDPSLQLVRLKLNGVKVRLLPAVVLDLSFSASVRVRNKNFFSLDYNYLGVSVGFRG 121

Query: 123 RRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSV 182
           RRLGFVSS+GGRVSARGSSYVNATLDLNGL+++HDVF+LL DL KGIIPFDTETEVEGS+
Sbjct: 122 RRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSM 181

Query: 183 GIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           G+F IKFPIKA VSCEV V+TN+QTIEHQDCYPE
Sbjct: 182 GLFFIKFPIKATVSCEVFVDTNSQTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. NCBI nr
Match: XP_022960913.1 (uncharacterized protein LOC111461574 [Cucurbita moschata])

HSP 1 Score: 339.7 bits (870), Expect = 8.6e-90
Identity = 172/214 (80.37%), Postives = 190/214 (88.79%), Query Frame = 0

Query: 3   SSSRDDSVPLPYTLLPQN-AAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAF 62
           S S+D S+P+PY+ +P N AA QN+VVLSLYRPP  R RRLLRLC LYSAAF LL AV F
Sbjct: 2   SCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVF 61

Query: 63  LLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRG 122
           LLFPSDPSLQLVRLKLN V V LLP V LDLSFSAS+RVRN NFFSLDY+Y+GVSVGYRG
Sbjct: 62  LLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDYNYLGVSVGYRG 121

Query: 123 RRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSV 182
           RRLGFVSS+GGRVSARGSSYVNATLDLNGL+++HDVF+LL DL KGIIPFDTETEVEGS+
Sbjct: 122 RRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSM 181

Query: 183 GIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           G+F IKFPIKA VSCEV V+TN+QTIEHQDCYPE
Sbjct: 182 GLFFIKFPIKATVSCEVFVDTNSQTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. NCBI nr
Match: XP_022931563.1 (uncharacterized protein LOC111437732 [Cucurbita moschata])

HSP 1 Score: 329.7 bits (844), Expect = 8.9e-87
Identity = 170/217 (78.34%), Postives = 188/217 (86.64%), Query Frame = 0

Query: 1   MTSSSRDDSVPLPYTLLPQNA--AQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFA 60
           MTSSSRDDSV    +LLPQNA    QN+V+LSLYRPP   HRRLLRLCA YSAAF LL A
Sbjct: 1   MTSSSRDDSV----SLLPQNAGHGHQNLVLLSLYRPPPYPHRRLLRLCAQYSAAFLLLAA 60

Query: 61  VAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVG 120
           ++FLLFPSDPSLQLVRL+LN  KV LLPV+ LDLS SAS+RVRNKNFFSLDY+Y+GVSVG
Sbjct: 61  LSFLLFPSDPSLQLVRLRLNHAKVRLLPVLVLDLSISASIRVRNKNFFSLDYNYLGVSVG 120

Query: 121 YRGRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVE 180
           YRGR LGFVSS+GGRVSARG SYVNAT+DLNG+EV+HD FYLL DLGKGIIPFD++TEVE
Sbjct: 121 YRGRLLGFVSSDGGRVSARGFSYVNATVDLNGVEVIHDAFYLLQDLGKGIIPFDSKTEVE 180

Query: 181 GSVGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           G +G F IKFPIKARVSC+V VNT  QTIEHQDCYPE
Sbjct: 181 GFMGFFFIKFPIKARVSCDVFVNTKRQTIEHQDCYPE 213

BLAST of Cla97C06G118200 vs. TrEMBL
Match: tr|A0A0A0LTV4|A0A0A0LTV4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G369500 PE=4 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 7.2e-85
Identity = 168/215 (78.14%), Postives = 175/215 (81.40%), Query Frame = 0

Query: 1   MTSSSRDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVA 60
           MTSSS DDSVP+PYTL+P NAAQQNVVVLSLYRPP CRHRRLLRLCA YSAAF LLFAVA
Sbjct: 1   MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVA 60

Query: 61  FLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYR 120
           FLLFPSDPSLQLVRLKLNRVKVHL+PVV+LDLSFS SLRVRNKNFFSL            
Sbjct: 61  FLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLXXXXXXXXXXXX 120

Query: 121 GRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGS 180
                              SYVNATLDLNGLEVVHDV YLLADLGKGIIPFDTET+VEGS
Sbjct: 121 XXXXXXXXXXXXXXXXXXXSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGS 180

Query: 181 VGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           +G+F IK PIKARVSCEVLVNTNNQTIEHQDCYPE
Sbjct: 181 MGLFFIKIPIKARVSCEVLVNTNNQTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. TrEMBL
Match: tr|A0A1S3CJK6|A0A1S3CJK6_CUCME (uncharacterized protein LOC103501551 OS=Cucumis melo OX=3656 GN=LOC103501551 PE=4 SV=1)

HSP 1 Score: 313.2 bits (801), Expect = 5.7e-82
Identity = 166/215 (77.21%), Postives = 172/215 (80.00%), Query Frame = 0

Query: 1   MTSSSRDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVA 60
           MT+SS DDSVP+PYTLL  NAAQQNVVVLSLYRP  CRHRRLLRL A YSAAF LLFAVA
Sbjct: 1   MTTSSGDDSVPVPYTLLSSNAAQQNVVVLSLYRPTPCRHRRLLRLFAFYSAAFLLLFAVA 60

Query: 61  FLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYR 120
           FLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSFS SLRVRNKNFFSL            
Sbjct: 61  FLLFPSDPSLQLVRLKLNRVKVHLVPFVSLDLSFSVSLRVRNKNFFSLXXXXXXXXXXXX 120

Query: 121 GRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGS 180
                             SSYVNATLDLNGLEVVHDV YLLADLGKGIIPFDTETEVEGS
Sbjct: 121 XXXXXXXXXXXXXXXXXXSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGS 180

Query: 181 VGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
           +G+F IK PIKARVSCEVLVNTNNQTIEHQDCYPE
Sbjct: 181 MGLFFIKIPIKARVSCEVLVNTNNQTIEHQDCYPE 215

BLAST of Cla97C06G118200 vs. TrEMBL
Match: tr|A0A061GN48|A0A061GN48_THECC (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_037898 PE=4 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 1.8e-67
Identity = 136/210 (64.76%), Postives = 160/210 (76.19%), Query Frame = 0

Query: 6   RDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAFLLFP 65
           RD SV  PY  LP N  QQNV+VL +Y     ++ R LR C +++    LL A  F L+P
Sbjct: 7   RDSSV--PYAALPSNPNQQNVIVLPVYYSRPNQNYRCLRRCLIFTGIVVLLSAAVFFLYP 66

Query: 66  SDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRGRRLG 125
           SDP+LQLVRL+LN V+V+  P + LDLSFS ++RVRN++FFSLDYD + VSVGYRGR LG
Sbjct: 67  SDPTLQLVRLQLNHVRVNSSPALTLDLSFSLTIRVRNRDFFSLDYDKLVVSVGYRGRELG 126

Query: 126 FVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSVGIFL 185
            VSSEGGRV ARGSSYVNATLDLNG EVVHDV YL+AD  KG+IPFDT T+V+G +G+FL
Sbjct: 127 VVSSEGGRVRARGSSYVNATLDLNGFEVVHDVIYLIADWAKGVIPFDTNTKVDGDLGLFL 186

Query: 186 IKFPIKARVSCEVLVNTNNQTIEHQDCYPE 216
            K PIKA VSCEV VNTNNQTI  QDCY E
Sbjct: 187 FKAPIKAEVSCEVYVNTNNQTIVRQDCYAE 214

BLAST of Cla97C06G118200 vs. TrEMBL
Match: tr|A0A2P6RJI0|A0A2P6RJI0_ROSCH (Putative Late embryogenesis abundant protein, LEA-14 OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0090601 PE=4 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 2.9e-65
Identity = 133/205 (64.88%), Postives = 156/205 (76.10%), Query Frame = 0

Query: 14  YTLLPQNAAQ-QNVVVLSLYRPPS--CRHRRLLRLCALYSAAFFLLFAVAFLLFPSDPSL 73
           Y  +P N    Q+VVVL+ YR PS     RR LRLC   + AF LL A AF LFPSDP+L
Sbjct: 12  YAPIPANPDHPQHVVVLTHYRGPSPDYHERRRLRLCVSTTVAFVLLSAAAFFLFPSDPAL 71

Query: 74  QLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRGRRLGFVSSE 133
           +L R+ LN V VH  P + LDLSFS ++RVRN++FFSLDYD + V +GYRGR LGFVSS 
Sbjct: 72  ELARIHLNHVGVHSSPKLTLDLSFSLTIRVRNRDFFSLDYDSLVVKIGYRGRELGFVSSA 131

Query: 134 GGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSVGIFLIKFPI 193
           GGRV ARGSSYVNATL L+GLEV+HDVFYLL DL +G+IPFDT+TEV+G+VG+F  K PI
Sbjct: 132 GGRVRARGSSYVNATLVLDGLEVIHDVFYLLEDLARGVIPFDTDTEVDGTVGLFFFKIPI 191

Query: 194 KARVSCEVLVNTNNQTIEHQDCYPE 216
           K R SCEV VNTNNQT+  QDCYPE
Sbjct: 192 KGRASCEVYVNTNNQTVVRQDCYPE 216

BLAST of Cla97C06G118200 vs. TrEMBL
Match: tr|W9QGI4|W9QGI4_9ROSA (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_020812 PE=4 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 3.2e-64
Identity = 132/207 (63.77%), Postives = 159/207 (76.81%), Query Frame = 0

Query: 14  YTLLPQN----AAQQNVVVLSLYRP-PSCRHRRLLRLCALYSAAFFLLFAVAFLLFPSDP 73
           Y+ LP N    A  QNVVVL  YRP PS R  R L  C L SAA  LL A  F+L+PSDP
Sbjct: 10  YSPLPPNPTATAYHQNVVVLPYYRPSPSKRRSRRLCRCLLASAAVLLLIAAVFILYPSDP 69

Query: 74  SLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRGRRLGFVS 133
           SLQLVR+ LNRV+V+  P + LDLSF  +++V N++FFSLDYD + VSVGYRGR LGFV+
Sbjct: 70  SLQLVRVHLNRVRVNSSPDLTLDLSFFLTVKVFNRDFFSLDYDSLAVSVGYRGRELGFVN 129

Query: 134 SEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSVGIFLIKF 193
           S+GG++ ARGSSYV+ATLDLNG  ++ DVFYLL DL +G+IPFDT T+VEG++G+FL K 
Sbjct: 130 SDGGKIRARGSSYVDATLDLNGFAIIQDVFYLLEDLARGVIPFDTVTKVEGNLGLFLFKI 189

Query: 194 PIKARVSCEVLVNTNNQTIEHQDCYPE 216
           P+KA VSCEV VNTNNQTI  QDCYPE
Sbjct: 190 PLKASVSCEVYVNTNNQTIARQDCYPE 216

BLAST of Cla97C06G118200 vs. TAIR10
Match: AT4G13270.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 221.1 bits (562), Expect = 8.1e-58
Identity = 112/216 (51.85%), Postives = 152/216 (70.37%), Query Frame = 0

Query: 3   SSSRDDSVPLPYTLLPQNAAQQNVVVLSLYRPPSCRHR-----RLLRLCALYSAAFFLLF 62
           +SS+ +   +PYT LP +   Q+V++L+ YR    RHR     R LR   L++A   LL 
Sbjct: 2   ASSKHEDYGIPYTPLPSSQPSQSVILLTPYR----RHRRPSLLRNLRCSLLFTAVILLLS 61

Query: 63  AVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSV 122
           A  +LL+PSDP + + R+ LN + V     +ALDLSFS +++VRN++FFSLDYD + VS+
Sbjct: 62  AAVYLLYPSDPDITVSRINLNHISVVDSHKIALDLSFSLTIKVRNRDFFSLDYDSLVVSI 121

Query: 123 GYRGRRLGFVSSEGGRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEV 182
           GYRGR LG V S+GG + AR SSY++ATL+L+GLEVVHDV YL+ DL KG+IPFDT  +V
Sbjct: 122 GYRGRELGLVKSKGGHLKARDSSYIDATLELDGLEVVHDVIYLIGDLAKGVIPFDTIAQV 181

Query: 183 EGSVGIFLIKFPIKARVSCEVLVNTNNQTIEHQDCY 214
           +G +G+ L   PI+ +VSCEV VN NNQ I HQDC+
Sbjct: 182 QGDLGVLLFNIPIQGKVSCEVYVNVNNQKISHQDCH 213

BLAST of Cla97C06G118200 vs. TAIR10
Match: AT1G52330.2 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 131.0 bits (328), Expect = 1.1e-30
Identity = 65/182 (35.71%), Postives = 116/182 (63.74%), Query Frame = 0

Query: 14  YTLLPQNAAQQ--NVVVLSLYRPPSCRHRRLLRLCALYSAAFFLLFAVAFLLFPSDPSLQ 73
           Y  LP +++ +  + V++S +  P  R R ++ +  +  A+  +     ++ +PSDP ++
Sbjct: 16  YKPLPSSSSHELNDAVLISSHPSPPSRRRFIISIFLISFASILI-----YIFWPSDPRIK 75

Query: 74  LVRLKLNRVKVHLLPVVALDLSFSASLRVRNKNFFSLDYDYIGVSVGYRGRRLGFVSSEG 133
           ++R+K++ V VH  PV ++D++   +L+V N + +S D+  + V++ YRG+ LG VSS+G
Sbjct: 76  IIRVKISHVHVHRRPVPSIDMTLLVTLKVSNADVYSFDFTDLDVTIDYRGKTLGHVSSDG 135

Query: 134 GRVSARGSSYVNATLDLNGLEVVHDVFYLLADLGKGIIPFDTETEVEGSVGIFLIKFPIK 193
           G V+A GSSY++A  +L+G+ V  DV +L+ DL KG + FDT TE  G +G+   +FP+K
Sbjct: 136 GHVTAFGSSYLDAEAELDGVMVFPDVIHLIHDLAKGSVEFDTVTETNGKLGVLFFRFPLK 192

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022144909.18.9e-9583.26uncharacterized protein LOC111014473 [Momordica charantia][more]
XP_023515526.14.1e-9282.24uncharacterized protein LOC111779657 [Cucurbita pepo subsp. pepo][more]
XP_022987870.12.7e-9181.78uncharacterized protein LOC111485280 [Cucurbita maxima][more]
XP_022960913.18.6e-9080.37uncharacterized protein LOC111461574 [Cucurbita moschata][more]
XP_022931563.18.9e-8778.34uncharacterized protein LOC111437732 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A0A0LTV4|A0A0A0LTV4_CUCSA7.2e-8578.14Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G369500 PE=4 SV=1[more]
tr|A0A1S3CJK6|A0A1S3CJK6_CUCME5.7e-8277.21uncharacterized protein LOC103501551 OS=Cucumis melo OX=3656 GN=LOC103501551 PE=... [more]
tr|A0A061GN48|A0A061GN48_THECC1.8e-6764.76Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isofor... [more]
tr|A0A2P6RJI0|A0A2P6RJI0_ROSCH2.9e-6564.88Putative Late embryogenesis abundant protein, LEA-14 OS=Rosa chinensis OX=74649 ... [more]
tr|W9QGI4|W9QGI4_9ROSA3.2e-6463.77Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_020812 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT4G13270.18.1e-5851.85Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family[more]
AT1G52330.21.1e-3035.71Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009269 response to desiccation
biological_process GO:0044763 single-organism cellular process
biological_process GO:0044699 single-organism process
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C06G118200.1Cla97C06G118200.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA_2 subgroupPFAMPF03168LEA_2coord: 98..195
e-value: 9.3E-12
score: 45.3
NoneNo IPR availablePANTHERPTHR31852:SF15SUBFAMILY NOT NAMEDcoord: 4..214
NoneNo IPR availablePANTHERPTHR31852FAMILY NOT NAMEDcoord: 4..214
NoneNo IPR availableSUPERFAMILYSSF117070LEA14-likecoord: 53..190