Lsi08G006740 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi08G006740
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: DNA binding protein
Location: chr08 : 15069814 .. 15074745 (+)
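
The location line can be read programmatically to get the chromosome, strand, and genomic span. The following is a minimal Python sketch, assuming both coordinates are inclusive (the page does not state the convention); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr08 : 15069814 .. 15074745 (+)")
span = end - start + 1  # assumption: both endpoints are included
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the span works out to 4932 bp; with a half-open convention it would be one less.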
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGCTATAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTAACCACGCAATGGTTTTAAATTCAATTCATATTTTCTATAGAATGCATGGATTGGTTAGATTGTTGAAATTATTAATTGTCTATGCATGATGACATTTGACCTGCAGCTATATTGAATTCCCAAGTGGATGATATCTTTCTCGAGAAAACCTTTTCCTGCTATATGATGTTCTCACTCCATGATTTATCTGGCTGTTCATGTTTAAACATGGATCATTGTTGTTTGGCACTAGTTCTTGGATTTGTTTACGCCGCTGTTTTCTTGTAAGCTTTTAGATAGCAAGCATCTAGAAACTGTTAGGTTGATAAATTCCCTTGCAGCTTAATAGTCATGGTTATTTTTATCCCAAAAACAGTCGTGGTTATTTCCCCTTAGTATGAAGTGGGATAAAAACATAATTCATAGATGTTAACAGGGGAATATTTATGGAAGTTTTAGCTCCCAGAGGAGGTAAGGGGAAAAAAGCTTCCTTCCTCAACATTTTGTTGATTGCACGTATATTGGATGAACATTTGTATTTTTGAACAAGAAAGAATGGATGAGCATTTTTTTGGGCCATATAAATAAACATAAAAGTAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTTCTCAAGATAGAGCTATTGAAAATCAGATCCTTGCGAAACACTATTGTATATAAGCAAAATAATAATGATGGTAGTCTTATTTGAAGGAGTGTAACCTTCTAATGTTGAGAGTTCAAGGGAACTCAACTCAACTCTTGCCAAAATAATAATAATAATAATAATAATAATAAAAGAAAGAAAAATAAAAAGGAAATGGGTTATTCATCTTCTCCATTTATTTCATGAAAAATCTCTATAA
CCGAAGAAATCAGTAACCCGTGAAACCTTGCAATAAGTGACTTTGTCGACTGCTATCTACATACTAATAAAGATAAAATAAAGGGGTTAAAAGCCTTTCAGGATAACTGGTATCTAAAAGCATAATTTAATTAATAACATAGAATAGGATTGTAGCCTTTCTTCTCCCTTACATGACCAACCTTTTCCTAAAGGGCGAGATCTTTGAGGATGATTCTGGTATTCTGTGACTTATTGTTGGGTCTTTCGATTTCAGAATACACCAGCTCCTTTGATATGTTTTCTTCAAATCTCAGTATTGAAGCCTTTAAAAATGTTAGAGCTCTCTTGGGCAAGGTGAAAAGGCTCTAGTTCTTCTTATTGCTCCAGATCTGGTCTTCTTTGTTAAAAATGAGGTTGCATGATATTTAAATAGAAATAATAAAATTATTCAAGAGAGTTTAGACCTCTATGGTGGAACTTTGGAGAATCTGAAACTTCTCAAAAAATTTATCTGGACATGGAAATTCTCATTCCTTATTAACCTCTCGATTTAAATACTATGATATGAAAACAATTTACTGTTAGAAATCCTAAAATCAAAGAAATTACTAGACCACAAAAACTTATTTAACTGTTAATGACTAAATCTAGATATGGATGTAATTTATTTAATTATTAGAAGTATGAAATCGCATCATAACATCTTGCTTGCAGGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTACGAGAGATTGTACGTGATATAATCCAAGAAAACAGAGTCCTTGGTCCGGGAAAGTTGTTATTAGAAGAGCACAGCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACCGTATCGTCTAAGGAATTCTACTCTCCAGTTGACTACTACAACCAATATATAAATGAAGAATCAATCATTGTTTCAGATGAGCAATGCACTGCAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGCAGCCTGGTGGATGTAAGGGACAAGGATTCTGTTGAATTTATCAAGCCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGATGAAAGAGGAATCAGGAATGCCAATTAATCATGTAAGTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAAATGTTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGAAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATATATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATAATCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGTGCGTGTTCTACTTTTTTCTGGCTAAGCATTTAATGACTTTTTTTTTTTTTTTTTTTTCTTCTATCGTCTTTTAAATTTCTGGGGCATAGGTGTAACTTCTTAAAGTTGACTATTAGGTTATGACAAAATAGTGTGGGGCAATTACTTGAGTAGTTTCTGGGCCAGTTGAAACTTTAGATTGAAGAAAAAGAATAGAGAATTCCTTTTATCTAAAAACTTTATGAATGACATTTGGACGTTCATTTGCACGGTGGTCTTGTTTCCATGTTTCTGAAATACCTACTTTTTTATTTGAAAAATGCGAGAAACACCTACTTTTTTATTTGAAAAATGCACCTAGGACTGCATCCATCTTTTGGCAGCCCACCCAGCTGTCCAACCAAAGTTTGCCATGTGACAACTCTTCTCTCTTCGTTTTTTTTTTCTTTTTCA
TTTTGCTGTGTGATTAAGGTGGACTGCTAGAAGATGAGTGCAACTCAGTATGCATCCATGTATCATTATCGCTCATAAATAAATTGTCAAACAATATGTTCTCATCCAACCAGAGAAAACCTTGTCTTTCTTTTTTTTCTTTTGGCGGGGAGATGAGAAAGAAAAAGGAAACGAAGAAAATGATATGTTGGACTTTGGAGTTGTTTATTTTGAGGATTTGCCACTCTTTTTGTGTTTGTAACTCAACTTTTGAAAGCCTAATTTGCATTTGGACCAATATTGATGCTCTTTTACAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAGTCTCAACAAGACATTCAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGAACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTGTAAGAATATCCCCCCCCCCAAAATTAGTCAATCATTTTACATGCTTATTGGTGAGAAGGTACATCTGTTCGCTTATTGCGAATTGATTTTGAGATGAAATCTCATGATTATCTAATACTTATCAAACAGTTTAACCAACATATTCATCCTCCACCTTTTTTTTTTCTTTTTTTCCAGTGAATCATGGGAAGGGATGTCCAAAAACTCGTCAAAACCCGAAAACAACCTGCTTTTGGAAATCTTCAAGGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGAGTAAGTAATATGATTGTCAAGTAGACGAATAGAGAGTAGTAGTTAAATTTTCTGCCACAGAACCTGTCTGTCTTTGTACCAAGTTGCAGTCGGTTACCCCGTTCAGTCCAGTCGGTCCCATCATCGATATTTATGAGGACAAAACTGGATTCTGAATGTGGGTTGGCATTTCTGTACTCTGCA

mRNA sequence

ATGCATGCTATAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTACGAGAGATTGTACGTGATATAATCCAAGAAAACAGAGTCCTTGGTCCGGGAAAGTTGTTATTAGAAGAGCACAGCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACCGTATCGTCTAAGGAATTCTACTCTCCAGTTGACTACTACAACCAATATATAAATGAAGAATCAATCATTGTTTCAGATGAGCAATGCACTGCAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGCAGCCTGGTGGATGTAAGGGACAAGGATTCTGTTGAATTTATCAAGCCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGATGAAAGAGGAATCAGGAATGCCAATTAATCATGTAAGTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAAATGTTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGAAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATATATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATAATCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAGTCTCAACAAGACATTCAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGAACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCGTCAAAACCCGAAAACAACCTGCTTTTGGAAATCTTCAAGGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGATCGGTTACCCCGTTCAGTCCAGTCGGTCCCATCATCGATATTTATGAGGACAAAACTGGATTCTGAATGTGGGTTGGCATTTCTGTACTCTGCA

Coding sequence (CDS)

ATGCATGCTATAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTACGAGAGATTGTACGTGATATAATCCAAGAAAACAGAGTCCTTGGTCCGGGAAAGTTGTTATTAGAAGAGCACAGCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACCGTATCGTCTAAGGAATTCTACTCTCCAGTTGACTACTACAACCAATATATAAATGAAGAATCAATCATTGTTTCAGATGAGCAATGCACTGCAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGCAGCCTGGTGGATGTAAGGGACAAGGATTCTGTTGAATTTATCAAGCCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGATGAAAGAGGAATCAGGAATGCCAATTAATCATGTAAGTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAAATGTTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGAAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATATATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATAATCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAGTCTCAACAAGACATTCAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGAACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCGTCAAAACCCGAAAACAACCTGCTTTTGGAAATCTTCAAGGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGATCGGTTACCCCGTTCAGTCCAGTCGGTCCCATCATCGATATTTATGAGGACAAAACTGGATTCTGAATGTGGGTTGGCATTTCTGTACTCTGCA

Protein sequence

MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDRLPRSVQSVPSSIFMRTKLDSECGLAFLYSA
BLAST of Lsi08G006740 vs. TrEMBL
Match: A0A0A0LML9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G384970 PE=4 SV=1)

HSP 1 Score: 644.0 bits (1660), Expect = 1.3e-181
Identity = 348/458 (75.98%), Positives = 381/458 (83.19%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH LE+NPLHSIAIEP SPLT+S
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE 180
           S E + PV+Y N+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ E
Sbjct: 121 SNEVHFPVNY-NKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVN-------------------------------EHKKVEEVMKEESGMPINHVSPLAT 240
           LLVN                               EH KVEEV+KEESGMPIN+V+PLAT
Sbjct: 181 LLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLAT 240

Query: 241 DVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Sbjct: 241 DVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK 300

Query: 301 AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQ 360
           AE+N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGPELEVCSD P+SV+FEQGQ
Sbjct: 301 AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQ 360

Query: 361 KSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNR 420
           KSS+MK+P AS    E+LNKTFSN FDQASKI    E++NKVD  QTGGSQKES+PTLNR
Sbjct: 361 KSSKMKSPIAS----ENLNKTFSNDFDQASKI----EIKNKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           INL+SWEGMSKNSSKP NN LLEI K+FI AFVKFWS+
Sbjct: 421 INLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 449
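
The percentages in each HSP summary follow from the raw counts: the number of identical (or positive-scoring) positions divided by the alignment length. A small Python sketch using the figures reported for the hit above; `hsp_percent` is a hypothetical helper name used only for illustration.

```python
def hsp_percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP summary line of the A0A0A0LML9_CUCSA hit.
identity = hsp_percent(348, 458)
positives = hsp_percent(381, 458)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to every other HSP on this page, since each reports its counts in the same `matches/length` form.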

BLAST of Lsi08G006740 vs. TrEMBL
Match: Q5DW97_CUCSA (Plastid envelope DNA binding protein (Fragment) OS=Cucumis sativus GN=PEND PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 6.0e-97
Identity = 198/279 (70.97%), Positives = 220/279 (78.85%), Query Frame = 1

Query: 64  VGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVSSKE 123
           VGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH LE+NPLHSIAIEP SPLT+SS E
Sbjct: 1   VGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLSSNE 60

Query: 124 FYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE--- 183
            + PV+Y N+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ E   
Sbjct: 61  VHFPVNY-NKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLV 120

Query: 184 ----------------------------LLVNEHKKVEEVMKEESGMPINHVSPLATDVV 243
                                       +LVNEH KVEEV+KEESGMPIN+V+PLATDVV
Sbjct: 121 NGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVV 180

Query: 244 VETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEK 303
           VETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEKAE+
Sbjct: 181 VETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEE 240

Query: 304 NFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGP 312
           N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGP
Sbjct: 241 NLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGP 278

BLAST of Lsi08G006740 vs. TrEMBL
Match: F6H824_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0250g00040 PE=4 SV=1)

HSP 1 Score: 279.3 bits (713), Expect = 8.7e-72
Identity = 202/449 (44.99%), Positives = 270/449 (60.13%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGW G+  ALAK N+   RK+RIRRSKEERK MVE FIKKYQ+SN+G+FPSLNLT
Sbjct: 1   MHAIKGGWVGQTFALAKGNDSGRRKSRIRRSKEERKEMVESFIKKYQKSNDGNFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYTVREIVR+IIQENRVLGP KL  EE     L E+ PL SI++EPQ  L+ S
Sbjct: 61  HKEVGGSFYTVREIVREIIQENRVLGPAKLTPEEQHMVELSEQYPLGSISLEPQVHLS-S 120

Query: 121 SKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE 180
           S E  S  D++ Q  +EE ++ S  + T +      NG IINGS ++ ++++S   I  E
Sbjct: 121 SVETDSVPDHH-QIRSEELVLDSSRKYTGSEHHIFDNGWIINGSHMEKKNEESDMPIYAE 180

Query: 181 LLVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSV---SWGVNGSDVRSEMLIS 240
           L V E    +  + EE  +    V+ +A DVVVETFPL S    S+ ++G     E  I 
Sbjct: 181 LEVAETSGAKNALLEEVEVTAAKVTDIAADVVVETFPLRSFTKPSYSLDGE--LGEASIM 240

Query: 241 TSASEKQVSQTIELES-DVGLFNIKAS-----GCVVEKAEKNFAGPLSETKSDLVE---- 300
           T   E++ ++ +E E+    + + K S     G V EKA  +  G L E  S L++    
Sbjct: 241 TGILEEKETEKVETETGKSSVLDGKNSVEDPFGLVDEKAVTSPGGSLLEMNSGLIDEEAV 300

Query: 301 ---VAQIVETSNGSTVKEGIIY-EVGGPELEV---CSDNPISVTFEQGQKSSEMKAPNAS 360
                 ++E+SN +++ + +++ +  G  LEV     D   S TFEQ Q+ +E K  + S
Sbjct: 301 KNVADPLLESSNITSINKDVVHDDQDGTVLEVKISHGDCLSSDTFEQSQEIAENKNLD-S 360

Query: 361 PSTIESLNKTFSNGFDQASKI--KEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGM 420
           P+ I S N T S+     S+   +E   +E K + E     QK S PTL+RINLESWEG 
Sbjct: 361 PNGIHSENMTGSSTSSACSETISEEAIVIEKKPNIEDGSIPQKGSSPTLDRINLESWEGA 420

Query: 421 SKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           SK S++PE N  L   KAF+A FVKFWS+
Sbjct: 421 SKKSTEPETNPFLAFIKAFVAGFVKFWSE 444

BLAST of Lsi08G006740 vs. TrEMBL
Match: A0A151T0Z8_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_023068 PE=4 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 5.3e-69
Identity = 190/455 (41.76%), Positives = 263/455 (57.80%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MH++K GW G+  ALAK+NE +G+KTRIRRSKEERKAMVE FIKKYQESNNG+FPSLN+T
Sbjct: 2   MHSVKSGW-GKTFALAKHNESQGKKTRIRRSKEERKAMVESFIKKYQESNNGNFPSLNIT 61

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYTVREIVRDIIQENRVLGP K  LEE + DH  E+NPL SIA +PQ     S
Sbjct: 62  HKEVGGSFYTVREIVRDIIQENRVLGPAKFTLEELNADHFFEQNPLGSIARDPQPLSAAS 121

Query: 121 SKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPII----NGSLVDVRDKDSVEF 180
           S E  S  D  +   N + I VSD   T    Q   NG II    NGS V++ +K+S E 
Sbjct: 122 SIENCSENDKLSD-TNSKVISVSDGSYTEAAHQVVDNGHIIRHVLNGSQVNMINKESDEA 181

Query: 181 IKPELLVNEHKKVEEVMKEE---SGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSE 240
             PE+ V +   +++ +++E   +  P+  V+ +A D++VETFPL SVS   +G      
Sbjct: 182 AIPEVQVGDPTTLKQNVEQELKAATTPMAKVTAVADDLIVETFPLRSVSRTTDGIKNLGG 241

Query: 241 MLISTSASEKQVSQTIELESDVGLFNI------KASGCVVEKAEKNFAGPL--------- 300
           +  S+++ E  + +T+EL+  VG   +      K    + EK E +    +         
Sbjct: 242 LRDSSNSPENDI-KTVELKQGVGKSELNGIEPSKNLNLLDEKFEDSHGNQILKNIPDTGL 301

Query: 301 --SETKSDLVEVAQIVETSNGSTVKE----GIIYEVGGPELEVCSDNPISV-TFEQGQKS 360
              E   D+ E     E+SN ST++     G    +  P+  V   N I+  T +QG   
Sbjct: 302 DKDENVGDMFE-----ESSNHSTIEHFDHHGFEDHIN-PQARVSHQNTITYGTVKQGHMM 361

Query: 361 SEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRIN 420
              K    + + I +++KT+    +    +K +    ++VD +  G SQ+ S  T++RIN
Sbjct: 362 DGAK----TSTQINNISKTYKPSEEDGGLLKTDI---HRVDGQHGGNSQRSSNTTVDRIN 421

Query: 421 LESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWS 427
           LESW+G +KNS++ E+N  L + K    AFVKFWS
Sbjct: 422 LESWDGATKNSARRESNPFLAVLKVLADAFVKFWS 440

BLAST of Lsi08G006740 vs. TrEMBL
Match: A0A061FNJ5_THECC (DNA binding, putative isoform 1 OS=Theobroma cacao GN=TCM_043403 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.0e-68
Identity = 185/446 (41.48%), Positives = 258/446 (57.85%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGW G+  ALAK NE  G+K+RIRRSKEERKAMVE FI+KYQ+SNNG+FPSLNLT
Sbjct: 1   MHAIKGGWVGQTFALAKCNEQGGKKSRIRRSKEERKAMVESFIRKYQKSNNGNFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFY +REIVR+IIQEN+VLGP K    E + D  LE+NPL SI+  P++ L + 
Sbjct: 61  HKEVGGSFYIIREIVREIIQENKVLGPAKFTEGEQNIDLFLEQNPLGSISAAPKNSLPIQ 120

Query: 121 SKEFYSP-VDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRD-KDSVEFIK 180
           S    SP +  +++  N+ S+ VSD     +  +   +G IING+ VDV +  D V  + 
Sbjct: 121 SNG--SPFIPSHHEDANDGSVSVSDGHSMGSVYKTFDSGQIINGNFVDVTNGTDKVAIV- 180

Query: 181 PELLVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLIST 240
            +L V E  + ++  KE +    + V+ +  DVVVETFPL  V+  ++  D RS      
Sbjct: 181 -DLQVTEPLESDKSGKELAA-ATSKVTQITPDVVVETFPLRPVAKPIDSIDGRSS---EV 240

Query: 241 SASEKQVSQTIELESDVGLFNI----------KASGCVVEKAEKNFAGPLSETKSDLVEV 300
               + + QT  ++ +  L N+          + S    EK  +N    L E  SDL + 
Sbjct: 241 GELNENLDQTETVKVNESLENVSPKLDDINSSEVSNLTDEKEVENLVDLLLEKNSDLADK 300

Query: 301 A-------QIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASP 360
                    ++E+S+ ST K  I  +  G  LEV   N ++    +  ++   +A NAS 
Sbjct: 301 KVVENISDPLLESSDCSTRKSAIDEDYNGAALEVSCSNVLTSEINEPSQAIVEEAVNASN 360

Query: 361 STIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKN 420
                ++ T +      S  +E   +E +VD +    SQK S  TL+RINLESWEG SK+
Sbjct: 361 GMHPKIDGTDTGSCIGESTTQEAVVVEGQVDLQHV-NSQKGSNKTLDRINLESWEGTSKS 420

Query: 421 SSKPENNLLLEIFKAFIAAFVKFWSD 428
           ++K E N L  IFK+FI+AF+KFWS+
Sbjct: 421 AAKSETNPLWAIFKSFISAFLKFWSE 437

BLAST of Lsi08G006740 vs. TAIR10
Match: AT3G52170.1 (AT3G52170.1 DNA binding)

HSP 1 Score: 136.3 bits (342), Expect = 4.6e-32
Identity = 72/126 (57.14%), Positives = 96/126 (76.19%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LT
Sbjct: 1   MHSLKTTCVGQIFALAKPHDSVGKRTRNRIPKEERKTLVESFIKKHQKLNNGSFPSLSLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYT+REIVR+IIQENRVLGPG LLLE + +  + +++   SI ++P  PL++S
Sbjct: 61  HKEVGGSFYTIREIVREIIQENRVLGPGDLLLEGNGS--VQDQSLSSSILMDPVPPLSLS 120

Query: 121 SKEFYS 127
              F+S
Sbjct: 121 PNGFHS 124

BLAST of Lsi08G006740 vs. TAIR10
Match: AT5G58210.3 (AT5G58210.3 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 61.6 bits (148), Expect = 1.4e-09
Identity = 30/53 (56.60%), Positives = 41/53 (77.36%), Query Frame = 1

Query: 29 RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE 82
          R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Sbjct: 48 RLSKDDRRALVESFVNEYRATNAGRFPSLDATHKQVGGSYY----IVRDIFQE 96

BLAST of Lsi08G006740 vs. NCBI nr
Match: gi|659113398|ref|XP_008456554.1| (PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo])

HSP 1 Score: 662.9 bits (1709), Expect = 4.0e-187
Identity = 362/458 (79.04%), Positives = 389/458 (84.93%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEE-HSTDHLLEENPLHSIAIEPQSPLTV 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLLEE H+TDH L++NPLHSIAIEPQSPLT+
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKP 180
           SSKE + P++Y N+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ 
Sbjct: 121 SSKEVHFPLNY-NKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQS 180

Query: 181 ELLVNEHK------------------------------KVEEVMKEESGMPINHVSPLAT 240
           ELLVNEHK                              KVEEV+KEESGMPINHV+PLAT
Sbjct: 181 ELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLAT 240

Query: 241 DVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Sbjct: 241 DVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK 300

Query: 301 AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQ 360
           A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQ
Sbjct: 301 AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQ 360

Query: 361 KSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNR 420
           KSS+MK+P AS    E+LNKTFSN FDQASKI    E+ENKVD  QTGGSQKES+PTLNR
Sbjct: 361 KSSKMKSPIAS----ENLNKTFSNDFDQASKI----EIENKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           INLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWS+
Sbjct: 421 INLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 449

BLAST of Lsi08G006740 vs. NCBI nr
Match: gi|449442130|ref|XP_004138835.1| (PREDICTED: uncharacterized protein LOC101202832 isoform X1 [Cucumis sativus])

HSP 1 Score: 644.0 bits (1660), Expect = 1.9e-181
Identity = 348/458 (75.98%), Positives = 381/458 (83.19%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH LE+NPLHSIAIEP SPLT+S
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE 180
           S E + PV+Y N+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ E
Sbjct: 121 SNEVHFPVNY-NKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVN-------------------------------EHKKVEEVMKEESGMPINHVSPLAT 240
           LLVN                               EH KVEEV+KEESGMPIN+V+PLAT
Sbjct: 181 LLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLAT 240

Query: 241 DVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Sbjct: 241 DVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK 300

Query: 301 AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQ 360
           AE+N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGPELEVCSD P+SV+FEQGQ
Sbjct: 301 AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQ 360

Query: 361 KSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNR 420
           KSS+MK+P AS    E+LNKTFSN FDQASKI    E++NKVD  QTGGSQKES+PTLNR
Sbjct: 361 KSSKMKSPIAS----ENLNKTFSNDFDQASKI----EIKNKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           INL+SWEGMSKNSSKP NN LLEI K+FI AFVKFWS+
Sbjct: 421 INLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 449

BLAST of Lsi08G006740 vs. NCBI nr
Match: gi|659113404|ref|XP_008456557.1| (PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo])

HSP 1 Score: 630.6 bits (1625), Expect = 2.2e-177
Identity = 348/458 (75.98%), Positives = 373/458 (81.44%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEE-HSTDHLLEENPLHSIAIEPQSPLTV 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLLEE H+TDH L++NPLHSIAIEPQSPLT+
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKP 180
           SSKE + P++Y N+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ 
Sbjct: 121 SSKEVHFPLNY-NKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQS 180

Query: 181 ELLVNEHK------------------------------KVEEVMKEESGMPINHVSPLAT 240
           ELLVNEHK                              KVEEV+KEESGMPINHV+PLAT
Sbjct: 181 ELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLAT 240

Query: 241 DVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Sbjct: 241 DVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK 300

Query: 301 AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQ 360
           A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQ
Sbjct: 301 AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQ 360

Query: 361 KSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNR 420
           KSS+MK                      ASKI    E+ENKVD  QTGGSQKES+PTLNR
Sbjct: 361 KSSKMK----------------------ASKI----EIENKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           INLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWS+
Sbjct: 421 INLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 431

BLAST of Lsi08G006740 vs. NCBI nr
Match: gi|778672729|ref|XP_011649859.1| (PREDICTED: uncharacterized protein LOC101202832 isoform X2 [Cucumis sativus])

HSP 1 Score: 611.7 bits (1576), Expect = 1.1e-171
Identity = 334/458 (72.93%), Positives = 365/458 (79.69%), Query Frame = 1

Query: 1   MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH LE+NPLHSIAIEP SPLT+S
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE 180
           S E + PV+Y N+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ E
Sbjct: 121 SNEVHFPVNY-NKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVN-------------------------------EHKKVEEVMKEESGMPINHVSPLAT 240
           LLVN                               EH KVEEV+KEESGMPIN+V+PLAT
Sbjct: 181 LLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLAT 240

Query: 241 DVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Sbjct: 241 DVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK 300

Query: 301 AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQ 360
           AE+N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGPELEVCSD P+SV+FEQGQ
Sbjct: 301 AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQ 360

Query: 361 KSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNR 420
           KSS+MKA                      SKI    E++NKVD  QTGGSQKES+PTLNR
Sbjct: 361 KSSKMKA----------------------SKI----EIKNKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD 428
           INL+SWEGMSKNSSKP NN LLEI K+FI AFVKFWS+
Sbjct: 421 INLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 431

BLAST of Lsi08G006740 vs. NCBI nr
Match: gi|60498649|dbj|BAD90708.1| (plastid envelope DNA binding protein [Cucumis sativus])

HSP 1 Score: 362.8 bits (930), Expect = 8.6e-97
Identity = 198/279 (70.97%), Positives = 220/279 (78.85%), Query Frame = 1

Query: 64  VGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVSSKE 123
           VGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH LE+NPLHSIAIEP SPLT+SS E
Sbjct: 1   VGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLSSNE 60

Query: 124 FYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPE--- 183
            + PV+Y N+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ E   
Sbjct: 61  VHFPVNY-NKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLV 120

Query: 184 ----------------------------LLVNEHKKVEEVMKEESGMPINHVSPLATDVV 243
                                       +LVNEH KVEEV+KEESGMPIN+V+PLATDVV
Sbjct: 121 NGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVV 180

Query: 244 VETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEK 303
           VETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEKAE+
Sbjct: 181 VETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEE 240

Query: 304 NFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGP 312
           N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGP
Sbjct: 241 NLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGP 278

The following BLAST results are available for this feature:
TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A0A0LML9_CUCSA | 1.3e-181 | 75.98 | Uncharacterized protein OS=Cucumis sativus GN=Csa_2G384970 PE=4 SV=1
Q5DW97_CUCSA | 6.0e-97 | 70.97 | Plastid envelope DNA binding protein (Fragment) OS=Cucumis sativus GN=PEND PE=4 ...
F6H824_VITVI | 8.7e-72 | 44.99 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0250g00040 PE=4 SV=...
A0A151T0Z8_CAJCA | 5.3e-69 | 41.76 | Uncharacterized protein OS=Cajanus cajan GN=KK1_023068 PE=4 SV=1
A0A061FNJ5_THECC | 2.0e-68 | 41.48 | DNA binding, putative isoform 1 OS=Theobroma cacao GN=TCM_043403 PE=4 SV=1

TAIR10:
Match Name | E-value | Identity (%) | Description
AT3G52170.1 | 4.6e-32 | 57.14 | DNA binding
AT5G58210.3 | 1.4e-09 | 56.60 | hydroxyproline-rich glycoprotein family protein

NCBI nr:
Match Name | E-value | Identity (%) | Description
gi|659113398|ref|XP_008456554.1| | 4.0e-187 | 79.04 | PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]
gi|449442130|ref|XP_004138835.1| | 1.9e-181 | 75.98 | PREDICTED: uncharacterized protein LOC101202832 isoform X1 [Cucumis sativus]
gi|659113404|ref|XP_008456557.1| | 2.2e-177 | 75.98 | PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]
gi|778672729|ref|XP_011649859.1| | 1.1e-171 | 72.93 | PREDICTED: uncharacterized protein LOC101202832 isoform X2 [Cucumis sativus]
gi|60498649|dbj|BAD90708.1| | 8.6e-97 | 70.97 | plastid envelope DNA binding protein [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0016020 | membrane
cellular_component | GO:0005575 | cellular_component
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi08G006740.1 | Lsi08G006740.1 | mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PANTHER | PTHR34568 | FAMILY NOT NAMED | coord: 1..428, score: 3.4
None | No IPR available | PANTHER | PTHR34568:SF2 | DNA BINDING PROTEIN | coord: 1..428, score: 3.4

The following gene(s) are paralogous to this gene:

None