Lsi02G000120.1 (mRNA) Bottle gourd (USVL1VR-Ls)

NameLsi02G000120.1
TypemRNA
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionCoffea canephora DH200=94 genomic scaffold, scaffold_2
Locationchr02 : 150192 .. 152790 (+)
Sequence length2200
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCTTGCGTTGGCGCCTCTTTGTCCATTCATACACATACGTACGCGTGCCAAAACGTGTTTCACAATTTCATTCTATAAGCATCTCCTCTTCTCAATCGGTACTCCAGTCTCCAGTCTCCAGTCTCCAGTTTCGAGTTTCTTCTTCTTCCCAGGTTTGTCTCATTTTCCTTCGCTTGCTGAAATTTATCATTCAATTTGGGTGGGCTTCCTATCAATCTTCATATGCCTTTGGCCAGATCCATACAACACAACGCAGAGACACGCTCATGGAAGCTGCCACTGGTTCAACTTCAACCTTTGCCATTGGAATTGGATCCCCATTTCGGGACACCGCCCTCAGGCCTCTTGCCTCCCATTCCCTTTCTTTTCCTTCCAAATCCCTCCTCTCTGTTCCTGTAAGAAGTTGGAATCCTTTTTTCTTTTCTTCCTTTTATCAAACAGTTCATGGTCAATTACGGTGTATCTCGTTTTGATCTCTTTGGTTCTTTCTCCAACTTTCTGTTGTTTCATTGCAGCATTATCGGTCCTTCGTTTCTCCATCAAGACTTGGAAAGAAGACGATTACCCTTCCCTTTAGCGGTCGGGGTCGGGGATTGGGATTGTCAATGGTTAAAGCGTCCCTGTCTGGGGATCCGGCTGGTTCTGCTGCCCAAATTGCTCCACTTCAGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAAACCCAACGCCATTCTGAAGAACAAACTCAAGAGCCCCCTGCTTCCGCTACTCATGACATTGTCTTGTACAGGTTAGTTCCGCATCCCTTAGAGGCTAAGTATATACAGGTAGAGGATGGGTCTGAAAACTTCAAGTCGTAACAATTATTTGGCATCAGTTTCTCTTTATAAAAGCAAATTGTTTGATTAGAAGAGCACCAAAGCTGATGAAGCTTGTTTTTTTCGGGACAACCCTGCTCACGTAGCTTAGCGTTTGAAATTTTTAGGAGAATTGCAGAGGTTAAAGCAAATGAAAGGAAAAGGGCCTTAGAAGAGATATTGTATGCATTGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTTATACCGGGTGTTGCCCCATCGTGTACGGACCCATATGGCCGAGTTGACACATGGGCACAAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCCGAAGCAAGGGAAATGATTCAGAACCACCTAGCGCTCGTTTTGGGCAATCGGATTGGTGACTTTGCTTCAGTAGTGCAGATAAGCAAACTGAGAGTGGGGCAGGTGTATGCAGCGTCTGTGATGTACGGATACTTCCTCAAGCGAGTGGATGAGAGGTTTCAGCTTGAGAAGACTGTGAAAATGCTACCAGCTGATGCAACAGACGAGGGAGAAGGAGAAGAATGGGATTCCTCCTTCTCCAATGCACCAGTGTATCCTGAAATCTCTTCCATGGCAGTTGAACAAGGAGATGTTAGTCCTGGGGAGTCGGGTCTGGGGATCAAGCCCTCCCGCTTGAGAACATACGTATTGTCGTTTGATGGGGACACACTACAGAGATTAGCCAATATAAGGTCAAAGGAGGCTGTTAGCATCATCGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAACTTTGGTGGGTTGAAGAGACTAGTTATGGAAGCTGTGACTTTCGGTTCTTTTCTGTGGGATGTTGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGAGATATGAACTTCCTGGAGCATTCTGGACAATGGATGCAGTTGCGCGCCTGACCGCCTCTTCTCTAGCGAGTGTAATGTGGACTGATATCTAAGTCGACTGTGTAGTTTCCAGTGGACTCCACAATCTGGCTTGTCTTATGGTGCGTTTGAATGATAAACTGATACACTCGTTGTAGAAGATATAGAAGCTAATTACACAGAGAGACTACAACCAAGAGTCTAAGATACAATACAGAGCTTCCCATTACAAAATTGTTTTATTTACTGCCCTTGTTCAATAGTATAATTAGTTATGAAAAGATTTGAGGTGTGGCCTATTAGAAAATGAGATCGAACCCAAGATGTTCAGAATATGGAAACGAAGCAGATTGGGTGGACTCGGATATCAAACAAAGGAGGATCATTAAATTTTAAGAGTTCAAATTTGACGGTCGAGGTCTATTTGGTGTAGTGCCCATTCACTAATCTTGCAGGTTAAAACAGTAAAATTATACCTCCACAACAGAAGTCTTTGTTTTTTCTTTTCTTTTCTTTTTTCTCTAAAAAAGAAAGCTCGAAAATGGAATATTTAACTTCTAGCTCTTGGTTGATAACATATGTCTTGACCAGTTGAGTTATGTTTAAGTTCATTTATCTAGTATCGTTTATATAGATGTGTCTTTGCATCCATGAGGACTAAATTTAACAAACTTGTACATGATGAAATATGGAAACATATAATAATAGAGACAATTTTCAAATTTAGTAATGTCAA

mRNA sequence

ATGTCCTTGCGTTGGCGCCTCTTTGTCCATTCATACACATACGTACGCGTGCCAAAACGTGTTTCACAATTTCATTCTATAAGCATCTCCTCTTCTCAATCGGTACTCCAGTCTCCAGTCTCCAGTCTCCAGTTTCGAGTTTCTTCTTCTTCCCAGATCCATACAACACAACGCAGAGACACGCTCATGGAAGCTGCCACTGGTTCAACTTCAACCTTTGCCATTGGAATTGGATCCCCATTTCGGGACACCGCCCTCAGGCCTCTTGCCTCCCATTCCCTTTCTTTTCCTTCCAAATCCCTCCTCTCTGTTCCTCATTATCGGTCCTTCGTTTCTCCATCAAGACTTGGAAAGAAGACGATTACCCTTCCCTTTAGCGGTCGGGGTCGGGGATTGGGATTGTCAATGGTTAAAGCGTCCCTGTCTGGGGATCCGGCTGGTTCTGCTGCCCAAATTGCTCCACTTCAGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAAACCCAACGCCATTCTGAAGAACAAACTCAAGAGCCCCCTGCTTCCGCTACTCATGACATTGTCTTGTACAGGAGAATTGCAGAGGTTAAAGCAAATGAAAGGAAAAGGGCCTTAGAAGAGATATTGTATGCATTGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTTATACCGGGTGTTGCCCCATCGTGTACGGACCCATATGGCCGAGTTGACACATGGGCACAAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCCGAAGCAAGGGAAATGATTCAGAACCACCTAGCGCTCGTTTTGGGCAATCGGATTGGTGACTTTGCTTCAGTAGTGCAGATAAGCAAACTGAGAGTGGGGCAGGTGTATGCAGCGTCTGTGATGTACGGATACTTCCTCAAGCGAGTGGATGAGAGGTTTCAGCTTGAGAAGACTGTGAAAATGCTACCAGCTGATGCAACAGACGAGGGAGAAGGAGAAGAATGGGATTCCTCCTTCTCCAATGCACCAGTGTATCCTGAAATCTCTTCCATGGCAGTTGAACAAGGAGATGTTAGTCCTGGGGAGTCGGGTCTGGGGATCAAGCCCTCCCGCTTGAGAACATACGTATTGTCGTTTGATGGGGACACACTACAGAGATTAGCCAATATAAGGTCAAAGGAGGCTGTTAGCATCATCGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAACTTTGGTGGGTTGAAGAGACTAGTTATGGAAGCTGTGACTTTCGGTTCTTTTCTGTGGGATGTTGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGAGATATGAACTTCCTGGAGCATTCTGGACAATGGATGCAGTTGCGCGCCTGACCGCCTCTTCTCTAGCGAGTGTAATGTGGACTGATATCTAAGTCGACTGTGTAGTTTCCAGTGGACTCCACAATCTGGCTTGTCTTATGGTGCGTTTGAATGATAAACTGATACACTCGTTGTAGAAGATATAGAAGCTAATTACACAGAGAGACTACAACCAAGAGTCTAAGATACAATACAGAGCTTCCCATTACAAAATTGTTTTATTTACTGCCCTTGTTCAATAGTATAATTAGTTATGAAAAGATTTGAGGTGTGGCCTATTAGAAAATGAGATCGAACCCAAGATGTTCAGAATATGGAAACGAAGCAGATTGGGTGGACTCGGATATCAAACAAAGGAGGATCATTAAATTTTAAGAGTTCAAATTTGACGGTCGAGGTCTATTTGGTGTAGTGCCCATTCACTAATCTTGCAGGTTAAAACAGTAAAATTATACCTCCACAACAGAAGTCTTTGTTTTTTCTTTTCTTTTCTTTTTTCTCTAAAAAAGAAAGCTCGAAAATGGAATATTTAACTTCTAGCTCTTGGTTGATAACATATGTCTTGACCAGTTGAGTTATGTTTAAGTTCATTTATCTAGTATCGTTTATATAGATGTGTCTTTGCATCCATGAGGACTAAATTTAACAAACTTGTACATGATGAAATATGGAAACATATAATAATAGAGACAATTTTCAAATTTAGTAATGTCAA

Coding sequence (CDS)

ATGTCCTTGCGTTGGCGCCTCTTTGTCCATTCATACACATACGTACGCGTGCCAAAACGTGTTTCACAATTTCATTCTATAAGCATCTCCTCTTCTCAATCGGTACTCCAGTCTCCAGTCTCCAGTCTCCAGTTTCGAGTTTCTTCTTCTTCCCAGATCCATACAACACAACGCAGAGACACGCTCATGGAAGCTGCCACTGGTTCAACTTCAACCTTTGCCATTGGAATTGGATCCCCATTTCGGGACACCGCCCTCAGGCCTCTTGCCTCCCATTCCCTTTCTTTTCCTTCCAAATCCCTCCTCTCTGTTCCTCATTATCGGTCCTTCGTTTCTCCATCAAGACTTGGAAAGAAGACGATTACCCTTCCCTTTAGCGGTCGGGGTCGGGGATTGGGATTGTCAATGGTTAAAGCGTCCCTGTCTGGGGATCCGGCTGGTTCTGCTGCCCAAATTGCTCCACTTCAGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAAACCCAACGCCATTCTGAAGAACAAACTCAAGAGCCCCCTGCTTCCGCTACTCATGACATTGTCTTGTACAGGAGAATTGCAGAGGTTAAAGCAAATGAAAGGAAAAGGGCCTTAGAAGAGATATTGTATGCATTGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTTATACCGGGTGTTGCCCCATCGTGTACGGACCCATATGGCCGAGTTGACACATGGGCACAAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCCGAAGCAAGGGAAATGATTCAGAACCACCTAGCGCTCGTTTTGGGCAATCGGATTGGTGACTTTGCTTCAGTAGTGCAGATAAGCAAACTGAGAGTGGGGCAGGTGTATGCAGCGTCTGTGATGTACGGATACTTCCTCAAGCGAGTGGATGAGAGGTTTCAGCTTGAGAAGACTGTGAAAATGCTACCAGCTGATGCAACAGACGAGGGAGAAGGAGAAGAATGGGATTCCTCCTTCTCCAATGCACCAGTGTATCCTGAAATCTCTTCCATGGCAGTTGAACAAGGAGATGTTAGTCCTGGGGAGTCGGGTCTGGGGATCAAGCCCTCCCGCTTGAGAACATACGTATTGTCGTTTGATGGGGACACACTACAGAGATTAGCCAATATAAGGTCAAAGGAGGCTGTTAGCATCATCGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAACTTTGGTGGGTTGAAGAGACTAGTTATGGAAGCTGTGACTTTCGGTTCTTTTCTGTGGGATGTTGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGA

Protein sequence

MSLRWRLFVHSYTYVRVPKRVSQFHSISISSSQSVLQSPVSSLQFRVSSSSQIHTTQRRDTLMEAATGSTSTFAIGIGSPFRDTALRPLASHSLSFPSKSLLSVPHYRSFVSPSRLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWDSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN
BLAST of Lsi02G000120.1 vs. Swiss-Prot
Match: UVB31_ARATH (UV-B-induced protein At3g17800, chloroplastic OS=Arabidopsis thaliana GN=At3g17800 PE=2 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 7.1e-118
Identity = 229/353 (64.87%), Postives = 280/353 (79.32%), Query Frame = 1

Query: 136 MVKASLSGDPAGSAAQ---IAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRH 195
           +V+AS + + A S +    IAPLQLQSP GQFLSQIL +HPHL+PAAV+QQL+QLQT R 
Sbjct: 73  VVRASSASNDASSGSSPKPIAPLQLQSPAGQFLSQILVSHPHLVPAAVEQQLEQLQTDRD 132

Query: 196 SEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPS 255
           S+ Q ++  +    DIVLYRRIAE+K NER+R LEEILYALVVQ+FM+A+V L+P V+PS
Sbjct: 133 SQGQNKDSASVPGTDIVLYRRIAELKENERRRTLEEILYALVVQKFMEANVSLVPSVSPS 192

Query: 256 CTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVVQISKLRVGQVY 315
            +DP GRVDTW    EKLERLHS E  EMI NHLAL+LG+R+GD  SV QISKLRVGQVY
Sbjct: 193 -SDPSGRVDTWPTKVEKLERLHSPEMYEMIHNHLALILGSRMGDLNSVAQISKLRVGQVY 252

Query: 316 AASVMYGYFLKRVDERFQLEKTVKMLPADATDEG---EGEEWDSSFSNA-PVYPEISSMA 375
           AASVMYGYFLKRVD+RFQLEKT+K+LP  + +     E  E  +++  A   +PE+ + A
Sbjct: 253 AASVMYGYFLKRVDQRFQLEKTMKILPGGSDESKTSVEQAEGTATYQAAVSSHPEVGAFA 312

Query: 376 VEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALFGRPQI 435
              G VS    G  IKPSRLR+YV+SFD +TLQR A IRS+EAV IIE+HTEALFG+P+I
Sbjct: 313 ---GGVSAKGFGSEIKPSRLRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEI 372

Query: 436 AITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN 482
            ITP+GTVD+SKDE IKI+FGG+KRLV+EAVTFGSFLWDVE++VD+RYHFV+N
Sbjct: 373 VITPEGTVDSSKDEQIKISFGGMKRLVLEAVTFGSFLWDVESHVDARYHFVLN 421

BLAST of Lsi02G000120.1 vs. TrEMBL
Match: A0A061FWL1_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_013026 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 4.9e-134
Identity = 275/438 (62.79%), Postives = 324/438 (73.97%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGI------GSPFRDTALRPLASHSLSFPSKSLL--SVPHYRSFVSPS 122
           M+AAT S S     +       S  R   L     H L F +K  L  S+ HY S +S S
Sbjct: 1   MDAATASASVVGSSMTTRRPPSSVTRSAILTANEPHFLRFAAKPRLPFSIKHY-SPLSYS 60

Query: 123 RLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPH 182
           +   + + L   G  RG+   +V+AS S D AG  A IAPLQ++SPIGQFLSQIL +HPH
Sbjct: 61  KPQNRRMAL---GSRRGM---VVRASSSPDSAGPTAPIAPLQMESPIGQFLSQILISHPH 120

Query: 183 LLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALV 242
           L+PAAV+QQL+QLQT R +EE+ +EP ASA  D+VLYRRIAEVKANERK+ALEEILYALV
Sbjct: 121 LVPAAVEQQLEQLQTDRDAEEKKEEPSASAGTDLVLYRRIAEVKANERKKALEEILYALV 180

Query: 243 VQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRI 302
           VQ+FMDA+V L+P + PS TDP GRVD W  +++KLE LHS EA EMIQNHLAL+LGNR+
Sbjct: 181 VQKFMDANVSLVPAMTPSSTDPSGRVDMWPSEEDKLELLHSPEAYEMIQNHLALILGNRL 240

Query: 303 GDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWD-- 362
           GD  SV QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP  +  E  G E    
Sbjct: 241 GDSTSVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPNASNGEESGVEQSVG 300

Query: 363 ---------SSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRL 422
                     S+     +PE+SS +   G +SPG  G GIKP RLRTYV+SFDG+TLQ+ 
Sbjct: 301 EDMGTAGLGDSYKAVSSHPEVSSWS---GGISPGGFGHGIKPCRLRTYVMSFDGETLQKF 360

Query: 423 ANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGS 482
           A IRSKEAVSIIE+HTEALFGRP+I ITPQGTVD+SKDELIKI+F GLKRLV+EAVTFGS
Sbjct: 361 AAIRSKEAVSIIEKHTEALFGRPEIVITPQGTVDSSKDELIKISFNGLKRLVLEAVTFGS 420

BLAST of Lsi02G000120.1 vs. TrEMBL
Match: A0A061FX61_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_013026 PE=4 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 5.6e-130
Identity = 269/432 (62.27%), Postives = 318/432 (73.61%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGI------GSPFRDTALRPLASHSLSFPSKSLL--SVPHYRSFVSPS 122
           M+AAT S S     +       S  R   L     H L F +K  L  S+ HY S +S S
Sbjct: 1   MDAATASASVVGSSMTTRRPPSSVTRSAILTANEPHFLRFAAKPRLPFSIKHY-SPLSYS 60

Query: 123 RLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPH 182
           +   + + L   G  RG+   +V+AS S D AG  A IAPLQ++SPIGQFLSQIL +HPH
Sbjct: 61  KPQNRRMAL---GSRRGM---VVRASSSPDSAGPTAPIAPLQMESPIGQFLSQILISHPH 120

Query: 183 LLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALV 242
           L+PAAV+QQL+QLQT R +EE+ +EP ASA  D+VLYRRIAEVKANERK+ALEEILYALV
Sbjct: 121 LVPAAVEQQLEQLQTDRDAEEKKEEPSASAGTDLVLYRRIAEVKANERKKALEEILYALV 180

Query: 243 VQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRI 302
           VQ+FMDA+V L+P + PS TDP GRVD W  +++KLE LHS EA EMIQNHLAL+LGNR+
Sbjct: 181 VQKFMDANVSLVPAMTPSSTDPSGRVDMWPSEEDKLELLHSPEAYEMIQNHLALILGNRL 240

Query: 303 GDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWD-- 362
           GD  SV QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP  +  E  G E    
Sbjct: 241 GDSTSVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPNASNGEESGVEQSVG 300

Query: 363 ---------SSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRL 422
                     S+     +PE+SS +   G +SPG  G GIKP RLRTYV+SFDG+TLQ+ 
Sbjct: 301 EDMGTAGLGDSYKAVSSHPEVSSWS---GGISPGGFGHGIKPCRLRTYVMSFDGETLQKF 360

Query: 423 ANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGS 476
           A IRSKEAVSIIE+HTEALFGRP+I ITPQGTVD+SKDELIKI+F GLKRLV+EAVTFGS
Sbjct: 361 AAIRSKEAVSIIEKHTEALFGRPEIVITPQGTVDSSKDELIKISFNGLKRLVLEAVTFGS 420

BLAST of Lsi02G000120.1 vs. TrEMBL
Match: W9QPD8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_019353 PE=4 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 1.8e-128
Identity = 265/415 (63.86%), Postives = 312/415 (75.18%), Query Frame = 1

Query: 89  LASHSLSFPSKSL---LSVPHYRSFVSPSRLGKKTITLPFSGRGRGLGLSMVKASLSGDP 148
           L S+  SFP       LS+ H  S  S S+LG K I+    G  R L   +V+AS S D 
Sbjct: 28  LTSNRTSFPCFGTNFGLSMKHKTS--SRSKLGHKRISF---GSRRFL---LVRASTSSDS 87

Query: 149 AGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHSEEQTQ------- 208
             S + IAPLQL+SP+GQFLSQIL +HPHL+PAAV+QQL+QLQT R + +Q Q       
Sbjct: 88  GSSDSPIAPLQLESPVGQFLSQILMSHPHLVPAAVEQQLEQLQTDRDAAQQLQTDCDAEK 147

Query: 209 -EPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPSCTDPY 268
            E P++   D+ LYRRIAEVKANER++ALEEILYALVVQ+FMDA+V L+P +  S +DP 
Sbjct: 148 SEEPSATGTDLALYRRIAEVKANERRKALEEILYALVVQKFMDANVSLVPSIETSASDPS 207

Query: 269 GRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVVQISKLRVGQVYAASVM 328
           G VD+W   DEKLE+LHS EA EMIQNHLAL+LGNR+GD  SV QISKLRVGQVYAASVM
Sbjct: 208 GCVDSWPSQDEKLEQLHSPEAYEMIQNHLALILGNRLGDSTSVAQISKLRVGQVYAASVM 267

Query: 329 YGYFLKRVDERFQLEKTVKMLP----ADATDEGEGEEWDS-------SFSNAPVYPEISS 388
           YGYFLKRVD+RFQLEKT+K+LP     D T+  +    DS       SF  AP +PE+SS
Sbjct: 268 YGYFLKRVDQRFQLEKTMKILPNTLDGDDTNVQQAVGDDSRPLGGGESFQAAPSHPEVSS 327

Query: 389 MAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALFGRP 448
            A   G  SPG  G G+KPSRLRTYV+SFDG+TLQR A IRSKEAVSIIE+HTEALFGRP
Sbjct: 328 WA---GGTSPGGFGHGMKPSRLRTYVMSFDGETLQRYATIRSKEAVSIIEKHTEALFGRP 387

Query: 449 QIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN 482
           +I ITPQGTVD+SKDELIKI+F GLKRLV+EAVTFGSFLWDVE+YVD+RYHFV+N
Sbjct: 388 EIVITPQGTVDSSKDELIKISFAGLKRLVLEAVTFGSFLWDVESYVDARYHFVLN 431

BLAST of Lsi02G000120.1 vs. TrEMBL
Match: A0A0D2SGC6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G134400 PE=4 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 9.0e-128
Identity = 265/446 (59.42%), Postives = 325/446 (72.87%), Query Frame = 1

Query: 47  VSSSSQIHTTQRRDTLMEAATGSTSTFAIGIGSPFRDTALRPLASHSLSFPSKSLLSVPH 106
           V SS  +H T        + + + S   +  G  F   A  P +S    FP K   SV +
Sbjct: 7   VGSSMSLHRT--------SCSAARSALLVANGPLFPRFAANPRSS----FPIKLYSSVSY 66

Query: 107 YRSFVSPSRLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLS 166
            +S      LG +          RG+   +VKAS S D A   AQIAPL+++SPIGQFLS
Sbjct: 67  SKSRNRRMGLGGR----------RGM---VVKASSSPDSAEPNAQIAPLRMESPIGQFLS 126

Query: 167 QILTTHPHLLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRAL 226
           QIL +HPHL+PAAV+QQL+QLQT R ++E+ +EP AS T D+VLYRRIAEVKANERKRAL
Sbjct: 127 QILISHPHLVPAAVEQQLEQLQTDRDTDEKKEEPSASGT-DLVLYRRIAEVKANERKRAL 186

Query: 227 EEILYALVVQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHL 286
           EEILYALVVQ+FMDA++ L+P +  S  DP GRVDTW   ++KLE++HS+EA EMIQNH+
Sbjct: 187 EEILYALVVQKFMDANISLVPAITSSA-DPSGRVDTWPSQEDKLEQIHSAEAHEMIQNHV 246

Query: 287 ALVLGNRIGDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEG 346
           AL+LGNR+G+  SV QISKLRVGQVYAASVMYGYFL+RVD+RFQLE+T+K+LP+ + D+ 
Sbjct: 247 ALILGNRLGESTSVAQISKLRVGQVYAASVMYGYFLRRVDQRFQLERTMKVLPSASDDDK 306

Query: 347 EGEEWD-----------SSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSF 406
              E              S+  A  +PE+SS +   G +S G  G GIKPSRLRTYV+SF
Sbjct: 307 SSIEQTVGDDTRPSGLGDSYQAASSHPEVSSWS---GGISSGGFGSGIKPSRLRTYVMSF 366

Query: 407 DGDTLQRLANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLV 466
           DG+TLQR A+IRSKEAV IIE+HTEALFGRP+IAITPQGTVD+S DELIKI+FGGLKRLV
Sbjct: 367 DGETLQRYASIRSKEAVGIIEKHTEALFGRPEIAITPQGTVDSSNDELIKISFGGLKRLV 422

Query: 467 MEAVTFGSFLWDVETYVDSRYHFVMN 482
           +EAVTFGSFLWDVE++VDSRYHFVMN
Sbjct: 427 LEAVTFGSFLWDVESFVDSRYHFVMN 422

BLAST of Lsi02G000120.1 vs. TrEMBL
Match: F6HW66_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0119g00070 PE=4 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 1.5e-127
Identity = 266/434 (61.29%), Postives = 316/434 (72.81%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGIGSPF----RDTALRPLASHSLSFPSKSLLSVPHYRSFVSPSRLGK 122
           MEA   +    + G   P     R T      S+S+ FP++       +R F     L  
Sbjct: 1   MEAVIATVVPSSFGFSRPTDSISRSTVFNVNRSNSVRFPTQ-------FRFFSGWLNLKS 60

Query: 123 KTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPA 182
               + F  +      ++V+AS S D +GSAA IAPLQL+SPIGQFLSQIL +HPHL+PA
Sbjct: 61  GHRNMAFGCKK----CTIVRASASADSSGSAAPIAPLQLESPIGQFLSQILISHPHLVPA 120

Query: 183 AVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRF 242
           AV+QQL+QLQT R +EE  +E  AS T ++VLYRRIAEVKANERK+ALEEILYALVVQ+F
Sbjct: 121 AVEQQLEQLQTDRDAEEHKEESSASGT-ELVLYRRIAEVKANERKKALEEILYALVVQKF 180

Query: 243 MDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFA 302
           MDA+V LIP ++ S +D   RVDTW   D KLE+LHS EA EMIQNHLAL+LGNR+GD  
Sbjct: 181 MDANVSLIPTISSSSSDSSDRVDTWPSQDGKLEQLHSPEAYEMIQNHLALILGNRLGDST 240

Query: 303 SVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLP-ADATDEGEGEE--W---- 362
           SV QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP A   D+G  +E  W    
Sbjct: 241 SVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPHALDGDKGSVQEALWDKMT 300

Query: 363 ----DSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIR 422
               D S      +PE+SS A   G  +PG  G GIKPSRLR YV+SFD +TLQR A IR
Sbjct: 301 PSGSDDSVQTVKSHPEVSSWA---GGFTPGGFGHGIKPSRLRNYVMSFDAETLQRYATIR 360

Query: 423 SKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWD 482
           SKEAVSIIE+HTEALFGRP+I ITPQGT+D+SKDELIKI+FGGLKRLV+EAVTFGSFLWD
Sbjct: 361 SKEAVSIIEKHTEALFGRPEIIITPQGTIDSSKDELIKISFGGLKRLVLEAVTFGSFLWD 419

BLAST of Lsi02G000120.1 vs. TAIR10
Match: AT3G17800.2 (AT3G17800.2 Protein of unknown function (DUF760))

HSP 1 Score: 425.6 bits (1093), Expect = 4.0e-119
Identity = 229/353 (64.87%), Postives = 280/353 (79.32%), Query Frame = 1

Query: 136 MVKASLSGDPAGSAAQ---IAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRH 195
           +V+AS + + A S +    IAPLQLQSP GQFLSQIL +HPHL+PAAV+QQL+QLQT R 
Sbjct: 79  VVRASSASNDASSGSSPKPIAPLQLQSPAGQFLSQILVSHPHLVPAAVEQQLEQLQTDRD 138

Query: 196 SEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPS 255
           S+ Q ++  +    DIVLYRRIAE+K NER+R LEEILYALVVQ+FM+A+V L+P V+PS
Sbjct: 139 SQGQNKDSASVPGTDIVLYRRIAELKENERRRTLEEILYALVVQKFMEANVSLVPSVSPS 198

Query: 256 CTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVVQISKLRVGQVY 315
            +DP GRVDTW    EKLERLHS E  EMI NHLAL+LG+R+GD  SV QISKLRVGQVY
Sbjct: 199 -SDPSGRVDTWPTKVEKLERLHSPEMYEMIHNHLALILGSRMGDLNSVAQISKLRVGQVY 258

Query: 316 AASVMYGYFLKRVDERFQLEKTVKMLPADATDEG---EGEEWDSSFSNA-PVYPEISSMA 375
           AASVMYGYFLKRVD+RFQLEKT+K+LP  + +     E  E  +++  A   +PE+ + A
Sbjct: 259 AASVMYGYFLKRVDQRFQLEKTMKILPGGSDESKTSVEQAEGTATYQAAVSSHPEVGAFA 318

Query: 376 VEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALFGRPQI 435
              G VS    G  IKPSRLR+YV+SFD +TLQR A IRS+EAV IIE+HTEALFG+P+I
Sbjct: 319 ---GGVSAKGFGSEIKPSRLRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEI 378

Query: 436 AITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN 482
            ITP+GTVD+SKDE IKI+FGG+KRLV+EAVTFGSFLWDVE++VD+RYHFV+N
Sbjct: 379 VITPEGTVDSSKDEQIKISFGGMKRLVLEAVTFGSFLWDVESHVDARYHFVLN 427

BLAST of Lsi02G000120.1 vs. TAIR10
Match: AT1G48450.1 (AT1G48450.1 Protein of unknown function (DUF760))

HSP 1 Score: 416.0 bits (1068), Expect = 3.2e-116
Identity = 225/358 (62.85%), Postives = 274/358 (76.54%), Query Frame = 1

Query: 136 MVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHSEE 195
           +VKAS SGD   S   IAPLQL+SP+GQFLSQIL +HPHL+PAAV+QQL+QLQ  R +EE
Sbjct: 69  VVKASASGD--ASTESIAPLQLKSPVGQFLSQILVSHPHLVPAAVEQQLEQLQIDRDAEE 128

Query: 196 QTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPSCTD 255
           Q+++  +    DIVLYRRIAEVK  ER+RALEEILYALVVQ+FMDA+V L+P +  S  D
Sbjct: 129 QSKDASSVLGTDIVLYRRIAEVKEKERRRALEEILYALVVQKFMDANVTLVPSITSSSAD 188

Query: 256 PYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVVQISKLRVGQVYAAS 315
           P GRVDTW   D +LERLHS E  EMIQNHL+++L NR  D  +V QISKL VGQVYAAS
Sbjct: 189 PSGRVDTWPTLDGELERLHSPEVYEMIQNHLSIILKNRTDDLTAVAQISKLGVGQVYAAS 248

Query: 316 VMYGYFLKRVDERFQLEKTVKMLPADATDEGE------GEEWDSSF--SNAPVYPEISSM 375
           VMYGYFLKR+D+RFQLEKT+++LP   +DEGE      G + + +F       Y  +SS 
Sbjct: 249 VMYGYFLKRIDQRFQLEKTMRILPG-GSDEGETSIEQAGRDVERNFYEEAEETYQAVSSN 308

Query: 376 ----AVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALF 435
               +   G  + G     +K SRL+TYV+SFDG+TLQR A IRS+E+V IIE+HTEALF
Sbjct: 309 QDVGSFVGGINASGGFSSDMKQSRLKTYVMSFDGETLQRYATIRSRESVGIIEKHTEALF 368

Query: 436 GRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN 482
           GRP+I ITPQGT+D+SKDE IKI+F GLKRLV+EAVTFGSFLWDVE++VDSRYHFV+N
Sbjct: 369 GRPEIVITPQGTIDSSKDEHIKISFKGLKRLVLEAVTFGSFLWDVESHVDSRYHFVLN 423

BLAST of Lsi02G000120.1 vs. TAIR10
Match: AT1G32160.1 (AT1G32160.1 Protein of unknown function (DUF760))

HSP 1 Score: 304.3 bits (778), Expect = 1.3e-82
Identity = 165/361 (45.71%), Postives = 243/361 (67.31%), Query Frame = 1

Query: 126 SGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQ 185
           +GRGR +    V+AS   D   + A +AP++L+SP+GQ L QIL THPHLLP  VD+QL+
Sbjct: 55  NGRGRSV---TVRASGDEDSNENFAPLAPVELESPVGQLLEQILRTHPHLLPVTVDEQLE 114

Query: 186 QLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDADVPL 245
           +      +E ++++  +S+T DI L +RI+EV+  ER++ L EI+Y LVV RF++  + +
Sbjct: 115 KFA----AESESRKADSSSTQDI-LQKRISEVRDKERRKTLAEIIYCLVVHRFVEKGISM 174

Query: 246 IPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGN--RIGDFASVVQI 305
           IP + P+ +DP GR+D W   +EKLE +HS++A EMIQ+HL+ VLG+   +G  +S+VQI
Sbjct: 175 IPRIKPT-SDPAGRIDLWPNQEEKLEVIHSADAFEMIQSHLSSVLGDGPAVGPLSSIVQI 234

Query: 306 SKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWDSSFSNAPVYPE 365
            K+++G++YAAS MYGYFL+RVD+R+QLE+T+  LP     E   E ++      P++  
Sbjct: 235 GKIKLGKLYAASAMYGYFLRRVDQRYQLERTMNTLPK--RPEKTRERFEEPSPPYPLWDP 294

Query: 366 ISSMAVEQGDVSPGESGLGIKPSR-----LRTYVLSFDGDTLQRLANIRSKEAVSIIERH 425
            S + ++  +  P E  +           LR+YV   D DTLQR A IRSKEA+++IE+ 
Sbjct: 295 DSLIRIQPEEYDPDEYAIQRNEDESSSYGLRSYVTYLDSDTLQRYATIRSKEAMTLIEKQ 354

Query: 426 TEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETYVDSRYHF 480
           T+ALFGRP I I   G +DTS DE++ ++  GL  LV+EAV FGSFLWD E+YV+S+YHF
Sbjct: 355 TQALFGRPDIRILEDGKLDTSNDEVLSLSVSGLAMLVLEAVAFGSFLWDSESYVESKYHF 404

BLAST of Lsi02G000120.1 vs. TAIR10
Match: AT3G07310.1 (AT3G07310.1 Protein of unknown function (DUF760))

HSP 1 Score: 165.2 bits (417), Expect = 9.7e-41
Identity = 111/333 (33.33%), Postives = 174/333 (52.25%), Query Frame = 1

Query: 153 APLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYR 212
           APL+ +S  G+FL  +L     L   A   +L+QL   R +    +   +S + +  L+R
Sbjct: 61  APLEPRSAQGRFLRSVLLNKRQLFHYAAADELKQLADDREAA-LARMSLSSGSDEASLHR 120

Query: 213 RIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLER 272
           RIAE+K    K A+++I+Y L+  ++ +  VPL+P ++    +  GR++ W   D +LE 
Sbjct: 121 RIAELKERYCKTAVQDIMYMLIFYKYSEIRVPLVPKLSRCIYN--GRLEIWPSKDWELES 180

Query: 273 LHSSEAREMIQNHLALVLGNRIG----DFASVVQISKLRVGQVYAASVMYGYFLKRVDER 332
           ++S +  E+I+ H++ V+G R+     D  +  QI KL + +VYAAS++YGYFLK    R
Sbjct: 181 IYSCDTLEIIKEHVSAVIGLRVNSCVTDNWATTQIQKLHLRKVYAASILYGYFLKSASLR 240

Query: 333 FQLEKTVKMLPADATDEGEGEEWDSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRL 392
            QLE ++  +              S +  +P++    S       +S           +L
Sbjct: 241 HQLECSLSDIHG------------SGYLKSPIFG--CSFTTGTAQIS--------NKQQL 300

Query: 393 RTYVLSFDGDTLQRLANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINF 452
           R Y+  FD +TLQR A  R++EA ++IE+ + ALFG  +             DE I  +F
Sbjct: 301 RHYISDFDPETLQRCAKPRTEEARNLIEKQSLALFGTEE------------SDETIVTSF 356

Query: 453 GGLKRLVMEAVTFGSFLWDVETYVDSRYHFVMN 482
             LKRLV+EAV FG+FLWD E YVD  Y    N
Sbjct: 361 SSLKRLVLEAVAFGTFLWDTELYVDGAYKLKEN 356

BLAST of Lsi02G000120.1 vs. TAIR10
Match: AT5G48590.1 (AT5G48590.1 Protein of unknown function (DUF760))

HSP 1 Score: 103.2 bits (256), Expect = 4.5e-22
Identity = 84/244 (34.43%), Postives = 130/244 (53.28%), Query Frame = 1

Query: 93  SLSFPSKSLLSVPHYRSFVSPSRLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQI 152
           SL FP       P  R+FV  +R G   + LP   + R   L +V A+ SG         
Sbjct: 10  SLPFP-------PSRRNFVKQNR-GGDCVFLPSRRKFRYDSLVVVSAASSGQSID----- 69

Query: 153 APLQLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYR 212
           APL  +SP G+FLS +L     L   AV   L+QL   + +   ++   +  + +  L+R
Sbjct: 70  APLVPRSPQGRFLSSVLVKKRQLFHFAVADLLKQLADDKEAS-LSRMFLSYGSDEASLHR 129

Query: 213 RIAEVKANERKRALEEILYALVVQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLER 272
           RIA++K ++ + A+E+I+Y L++ +F +  VPL+P + PSC    GR++     D +LE 
Sbjct: 130 RIAQLKESDCQIAIEDIMYMLILYKFSEIRVPLVPKL-PSCIYN-GRLEISPSKDWELES 189

Query: 273 LHSSEAREMIQNHLALVLGNRIG----DFASVVQISKLRVGQVYAASVMYGYFLKRVDER 332
           +HS +  E+I+ H   V+  R+     D  +  +I K R+ +VY ASV+YGYFLK    R
Sbjct: 190 IHSFDVLELIKEHSNAVISLRVNSSLTDDCATTEIDKNRLSKVYTASVLYGYFLKSASLR 237

BLAST of Lsi02G000120.1 vs. NCBI nr
Match: gi|590666388|ref|XP_007036962.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 486.1 bits (1250), Expect = 7.0e-134
Identity = 275/438 (62.79%), Postives = 324/438 (73.97%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGI------GSPFRDTALRPLASHSLSFPSKSLL--SVPHYRSFVSPS 122
           M+AAT S S     +       S  R   L     H L F +K  L  S+ HY S +S S
Sbjct: 1   MDAATASASVVGSSMTTRRPPSSVTRSAILTANEPHFLRFAAKPRLPFSIKHY-SPLSYS 60

Query: 123 RLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPH 182
           +   + + L   G  RG+   +V+AS S D AG  A IAPLQ++SPIGQFLSQIL +HPH
Sbjct: 61  KPQNRRMAL---GSRRGM---VVRASSSPDSAGPTAPIAPLQMESPIGQFLSQILISHPH 120

Query: 183 LLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALV 242
           L+PAAV+QQL+QLQT R +EE+ +EP ASA  D+VLYRRIAEVKANERK+ALEEILYALV
Sbjct: 121 LVPAAVEQQLEQLQTDRDAEEKKEEPSASAGTDLVLYRRIAEVKANERKKALEEILYALV 180

Query: 243 VQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRI 302
           VQ+FMDA+V L+P + PS TDP GRVD W  +++KLE LHS EA EMIQNHLAL+LGNR+
Sbjct: 181 VQKFMDANVSLVPAMTPSSTDPSGRVDMWPSEEDKLELLHSPEAYEMIQNHLALILGNRL 240

Query: 303 GDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWD-- 362
           GD  SV QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP  +  E  G E    
Sbjct: 241 GDSTSVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPNASNGEESGVEQSVG 300

Query: 363 ---------SSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRL 422
                     S+     +PE+SS +   G +SPG  G GIKP RLRTYV+SFDG+TLQ+ 
Sbjct: 301 EDMGTAGLGDSYKAVSSHPEVSSWS---GGISPGGFGHGIKPCRLRTYVMSFDGETLQKF 360

Query: 423 ANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGS 482
           A IRSKEAVSIIE+HTEALFGRP+I ITPQGTVD+SKDELIKI+F GLKRLV+EAVTFGS
Sbjct: 361 AAIRSKEAVSIIEKHTEALFGRPEIVITPQGTVDSSKDELIKISFNGLKRLVLEAVTFGS 420

BLAST of Lsi02G000120.1 vs. NCBI nr
Match: gi|697121438|ref|XP_009614692.1| (PREDICTED: uncharacterized protein LOC104107561 [Nicotiana tomentosiformis])

HSP 1 Score: 478.8 bits (1231), Expect = 1.1e-131
Identity = 266/432 (61.57%), Postives = 320/432 (74.07%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGIGSPFRDTALRPLASHSLSFPSKSLLSVPHYRSFVSPSRLGKKTIT 122
           ME AT   S F+I     +R T      S  + F S+  +S    + + S S +      
Sbjct: 1   METATAFGSAFSIC----YRPTKASLGGSDFVRFGSQFRISPSGIKLYPSVSHVKLSNRK 60

Query: 123 LPFSGRGRGLGLSMVKASLS-GDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVD 182
             F  R      + ++ASLS  +  GSAA IAPLQL+SPIGQFLSQILT+HPHL+PAAVD
Sbjct: 61  AAFGSRK----CTSIRASLSPSESGGSAAPIAPLQLESPIGQFLSQILTSHPHLVPAAVD 120

Query: 183 QQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDA 242
           QQL+QLQT+R SE+Q +EP A+ T DIVLYRRIAEVKAN+RK+ALEEILYALVVQ+FMDA
Sbjct: 121 QQLEQLQTERDSEQQKEEPSATGT-DIVLYRRIAEVKANDRKKALEEILYALVVQKFMDA 180

Query: 243 DVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVV 302
           +V L+P ++P  ++P GR+DTW   D+K ERLHS+EA EMIQNHLAL+LGNR+GD ++V 
Sbjct: 181 NVSLVPAISPPSSEPSGRIDTWPSQDDKFERLHSAEANEMIQNHLALILGNRLGDNSAVA 240

Query: 303 QISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDE--------GE----GE 362
           QISK RVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP    DE        GE    G+
Sbjct: 241 QISKFRVGQVYAASVMYGYFLKRVDQRFQLEKTMKVLPQGVDDEDSSIRQVGGEEIRSGD 300

Query: 363 EWDSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSK 422
             D+SF     +PE+SS +   G    G  G GIKPSRLR YV+SFDG+TLQR A IRSK
Sbjct: 301 RSDTSFGVTQSHPELSSWSA--GSAGTGGFGHGIKPSRLRNYVMSFDGETLQRYATIRSK 360

Query: 423 EAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVE 482
           EA+ IIE+HTEALFGRP+I ITPQGTVD+SKDEL+KI+FGGL RLV+EAVTFGSFLWDVE
Sbjct: 361 EAIGIIEKHTEALFGRPEIVITPQGTVDSSKDELLKISFGGLSRLVLEAVTFGSFLWDVE 420

BLAST of Lsi02G000120.1 vs. NCBI nr
Match: gi|590666384|ref|XP_007036961.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 472.6 bits (1215), Expect = 8.0e-130
Identity = 269/432 (62.27%), Postives = 318/432 (73.61%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGI------GSPFRDTALRPLASHSLSFPSKSLL--SVPHYRSFVSPS 122
           M+AAT S S     +       S  R   L     H L F +K  L  S+ HY S +S S
Sbjct: 1   MDAATASASVVGSSMTTRRPPSSVTRSAILTANEPHFLRFAAKPRLPFSIKHY-SPLSYS 60

Query: 123 RLGKKTITLPFSGRGRGLGLSMVKASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPH 182
           +   + + L   G  RG+   +V+AS S D AG  A IAPLQ++SPIGQFLSQIL +HPH
Sbjct: 61  KPQNRRMAL---GSRRGM---VVRASSSPDSAGPTAPIAPLQMESPIGQFLSQILISHPH 120

Query: 183 LLPAAVDQQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALV 242
           L+PAAV+QQL+QLQT R +EE+ +EP ASA  D+VLYRRIAEVKANERK+ALEEILYALV
Sbjct: 121 LVPAAVEQQLEQLQTDRDAEEKKEEPSASAGTDLVLYRRIAEVKANERKKALEEILYALV 180

Query: 243 VQRFMDADVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRI 302
           VQ+FMDA+V L+P + PS TDP GRVD W  +++KLE LHS EA EMIQNHLAL+LGNR+
Sbjct: 181 VQKFMDANVSLVPAMTPSSTDPSGRVDMWPSEEDKLELLHSPEAYEMIQNHLALILGNRL 240

Query: 303 GDFASVVQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEWD-- 362
           GD  SV QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP  +  E  G E    
Sbjct: 241 GDSTSVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPNASNGEESGVEQSVG 300

Query: 363 ---------SSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRL 422
                     S+     +PE+SS +   G +SPG  G GIKP RLRTYV+SFDG+TLQ+ 
Sbjct: 301 EDMGTAGLGDSYKAVSSHPEVSSWS---GGISPGGFGHGIKPCRLRTYVMSFDGETLQKF 360

Query: 423 ANIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGS 476
           A IRSKEAVSIIE+HTEALFGRP+I ITPQGTVD+SKDELIKI+F GLKRLV+EAVTFGS
Sbjct: 361 AAIRSKEAVSIIEKHTEALFGRPEIVITPQGTVDSSKDELIKISFNGLKRLVLEAVTFGS 420

BLAST of Lsi02G000120.1 vs. NCBI nr
Match: gi|698456831|ref|XP_009780705.1| (PREDICTED: uncharacterized protein LOC104229716 [Nicotiana sylvestris])

HSP 1 Score: 471.1 bits (1211), Expect = 2.3e-129
Identity = 265/432 (61.34%), Postives = 320/432 (74.07%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGIGSPFRDTALRPLASHSLSFPSKSLLSVPHYRSFVSPSRLGKKTIT 122
           ME AT   S+F+I     +R T      S  + F S+   S    + + S S +      
Sbjct: 1   METATAFGSSFSIC----YRPTKASLGGSDFVRFGSQFRNSPSGIKLYPSISHVKLSNKK 60

Query: 123 LPFSGRGRGLGLSMVKASLSGDPAG-SAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVD 182
             F  R      + ++ASLS   +G SAA IAPLQL+SPIGQFLSQILT+HPHL+PAAVD
Sbjct: 61  AAFGSRK----CTSIRASLSPSESGRSAAPIAPLQLESPIGQFLSQILTSHPHLVPAAVD 120

Query: 183 QQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDA 242
           QQL+QLQT+R SE+Q +EP A+ T DIVLYRRIAEVKAN+RK+ALEEILYALVVQ+FMDA
Sbjct: 121 QQLEQLQTERDSEQQKEEPSATGT-DIVLYRRIAEVKANDRKKALEEILYALVVQKFMDA 180

Query: 243 DVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVV 302
           +V L+P ++P  ++P GRVDTW   D+K E LHS+EA EMIQNHLAL+LG+R+GD ++V 
Sbjct: 181 NVSLVPAISPPSSEPSGRVDTWPSQDDKFEHLHSAEANEMIQNHLALILGSRLGDNSAVA 240

Query: 303 QISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDE--------GE----GE 362
           QISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP    DE        GE    G+
Sbjct: 241 QISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKVLPQGVDDEDGSFRQVGGEEIRSGD 300

Query: 363 EWDSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSK 422
             D+S      +PE+SS +   G    G  G GIKPSRLR YV+SFDG+TLQR A IRSK
Sbjct: 301 RSDTSSRVTQSHPELSSWSA--GSAGTGGFGHGIKPSRLRNYVMSFDGETLQRYATIRSK 360

Query: 423 EAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVE 482
           EA+ IIE+HTEALFGRP+I ITPQGTVD+SKDEL+KI+FGGL+RLV+EAVTFGSFLWDVE
Sbjct: 361 EAIGIIEKHTEALFGRPEIVITPQGTVDSSKDELLKISFGGLRRLVLEAVTFGSFLWDVE 420

BLAST of Lsi02G000120.1 vs. NCBI nr
Match: gi|568880748|ref|XP_006493269.1| (PREDICTED: UV-B-induced protein At3g17800, chloroplastic-like [Citrus sinensis])

HSP 1 Score: 469.9 bits (1208), Expect = 5.2e-129
Identity = 264/430 (61.40%), Postives = 316/430 (73.49%), Query Frame = 1

Query: 63  MEAATGSTSTFAIGIGSPFRDTALRPLASHSLSFPSKSLLSVPHYRSFVSPSRLGKKTIT 122
           MEAA  S +  +IG+ S              + F +KSLL + HY S  +P    ++   
Sbjct: 1   MEAAAASVARSSIGLHSHRPVLFSVYSGPDFIRFGTKSLLPIKHYSSVSNPKPRHRR--- 60

Query: 123 LPFSGRGRGLGLSMV-KASLSGDPAGSAAQIAPLQLQSPIGQFLSQILTTHPHLLPAAVD 182
                +G G    MV +AS S + +GS   IAPLQL+SP+GQFLSQIL +HPHL+PAAV+
Sbjct: 61  -----KGFGSRRCMVVRASSSSESSGSMDPIAPLQLESPVGQFLSQILISHPHLVPAAVE 120

Query: 183 QQLQQLQTQRHSEEQTQEPPASATHDIVLYRRIAEVKANERKRALEEILYALVVQRFMDA 242
           QQL+QLQT R +E+  +E  AS T ++VLYRRIAEVKANER++ALEEILYALVVQ+FMDA
Sbjct: 121 QQLEQLQTDRDAEKHKEEASASGT-ELVLYRRIAEVKANERRKALEEILYALVVQKFMDA 180

Query: 243 DVPLIPGVAPSCTDPYGRVDTWAQDDEKLERLHSSEAREMIQNHLALVLGNRIGDFASVV 302
           +V LIP + PS +D  GRVDTW   DE LE+LHSSEA EMIQNHLAL+LGNR+GD  SV 
Sbjct: 181 NVSLIPSITPSSSDSSGRVDTWLSQDENLEQLHSSEAYEMIQNHLALILGNRLGDSTSVA 240

Query: 303 QISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKMLPADATDEGEGEEW---------- 362
           QISKLRVGQVYAASVMYGYFLKRVD+RFQLEK++K+LP  +  E  G +           
Sbjct: 241 QISKLRVGQVYAASVMYGYFLKRVDQRFQLEKSMKILPDASDVEASGIQQVVGDVTPTGA 300

Query: 363 DSSFSNAPVYPEISSMAVEQGDVSPGESGLGIKPSRLRTYVLSFDGDTLQRLANIRSKEA 422
           + S      +PE+SS +   G VSPG  G GIK SRLRTYV+SFDG+TLQR A IRSKEA
Sbjct: 301 EGSHEALSSHPEVSSFS---GGVSPGGFGHGIKASRLRTYVMSFDGETLQRYATIRSKEA 360

Query: 423 VSIIERHTEALFGRPQIAITPQGTVDTSKDELIKINFGGLKRLVMEAVTFGSFLWDVETY 482
           VSIIE+HTEALFGRP+I +TPQGTVD+S DE IKI+F GLKRLV+EAVTFGSFLWDVE+Y
Sbjct: 361 VSIIEKHTEALFGRPEIVVTPQGTVDSSNDEQIKISFAGLKRLVLEAVTFGSFLWDVESY 418

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
UVB31_ARATH7.1e-11864.87UV-B-induced protein At3g17800, chloroplastic OS=Arabidopsis thaliana GN=At3g178... [more]
Match NameE-valueIdentityDescription
A0A061FWL1_THECC4.9e-13462.79Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_013026 PE=4 SV=1[more]
A0A061FX61_THECC5.6e-13062.27Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_013026 PE=4 SV=1[more]
W9QPD8_9ROSA1.8e-12863.86Uncharacterized protein OS=Morus notabilis GN=L484_019353 PE=4 SV=1[more]
A0A0D2SGC6_GOSRA9.0e-12859.42Uncharacterized protein OS=Gossypium raimondii GN=B456_005G134400 PE=4 SV=1[more]
F6HW66_VITVI1.5e-12761.29Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0119g00070 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT3G17800.24.0e-11964.87 Protein of unknown function (DUF760)[more]
AT1G48450.13.2e-11662.85 Protein of unknown function (DUF760)[more]
AT1G32160.11.3e-8245.71 Protein of unknown function (DUF760)[more]
AT3G07310.19.7e-4133.33 Protein of unknown function (DUF760)[more]
AT5G48590.14.5e-2234.43 Protein of unknown function (DUF760)[more]
Match NameE-valueIdentityDescription
gi|590666388|ref|XP_007036962.1|7.0e-13462.79Uncharacterized protein isoform 2 [Theobroma cacao][more]
gi|697121438|ref|XP_009614692.1|1.1e-13161.57PREDICTED: uncharacterized protein LOC104107561 [Nicotiana tomentosiformis][more]
gi|590666384|ref|XP_007036961.1|8.0e-13062.27Uncharacterized protein isoform 1 [Theobroma cacao][more]
gi|698456831|ref|XP_009780705.1|2.3e-12961.34PREDICTED: uncharacterized protein LOC104229716 [Nicotiana sylvestris][more]
gi|568880748|ref|XP_006493269.1|5.2e-12961.40PREDICTED: UV-B-induced protein At3g17800, chloroplastic-like [Citrus sinensis][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR008479DUF760
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016874 ligase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Lsi02G000120Lsi02G000120gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G000120.1.CDS.1Lsi02G000120.1.CDS.1CDS
Lsi02G000120.1.CDS.2Lsi02G000120.1.CDS.2CDS
Lsi02G000120.1.CDS.3Lsi02G000120.1.CDS.3CDS
Lsi02G000120.1.CDS.4Lsi02G000120.1.CDS.4CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G000120.1.three_prime_UTR.1Lsi02G000120.1.three_prime_UTR.1three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Lsi02G000120.1Lsi02G000120.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008479Protein of unknown function DUF760PFAMPF05542DUF760coord: 209..334
score: 1.8
NoneNo IPR availablePANTHERPTHR31808FAMILY NOT NAMEDcoord: 136..481
score: 6.8E
NoneNo IPR availablePANTHERPTHR31808:SF4SUBFAMILY NOT NAMEDcoord: 136..481
score: 6.8E