Lsi04G000470 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G000470
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: HIT zinc finger protein
Location: chr04 : 614792 .. 617618 (-)
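The location above can be read as a sequence name, a start, an end and a strand. The following is a minimal sketch of parsing it and computing the span length, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state this). The function name `parse_location` and the returned keys are illustrative only and are not part of the database.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr04 : 614792 .. 617618 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("chr04 : 614792 .. 617618 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # chr04 - 2827
```

Under that assumption the gene spans 2827 bases on the minus strand of chr04.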
The following sequences are available for this feature:

Gene sequence (with intron)

GAGCAAGTGATTCTAGAAATTGTAGAAAGTAGAAAGAGAAGCCCAAGCGTTTATGGTGCCACTGAAGATTCCAGTGTTAGGGTAATCGACATTCATTCATTTTCTTGTTTCCGTAGCTCAGTTTCTGATGACACGCTGTATCCTTTGAAGTTTTTTCTTTGTTCTTCTGGCTATGGAGGAGGAGATACGAACTTCTGTTGATTCGTCTTCTTCAGTCAATCTTCCTTTGCGGACCATCTGTCATGTGTATGTCATTCTCTCACTCATTCATTTTCATCGTTACTTCTTCAAATTTCTTTCCTCCAATTTCCTAATTTCATCCTTTTCGATTTCGAATGCTAATGCAGGTGCCACAAGCAATTTTCACACTATACGTGTCCTCGTTGTAATTCTCGTTACTGTTCTCTGCAATGTTACAAGGTTAGTGAAAATCCATTCATATTTTGGTCATCAGAATGTTATAGCTTCAATGGGTTTAATATTTTTTAGGGTTAGTGACGGGGTTGAGTGGTTTATTTACTTGAATTGTTTTTTGGGTTTGCAGTCTCATAGTAACCGCTGTACTGAGTCCTTCATGCGAGAAAATGTTGTTGAGGAGCTGCGTCAATTGCGGACGGACGATAGTACTAAACGTAAAACGCTTGACATTTTGAAAAGGTTTCATTCGGAAGAGGAAATGCAAGACTTGGATGAGGAGGATGGTACGTGGCTGCAAACCTTGGTTTGTTTATTAATTATTTCTGAAAGTAAATTTCTGGGTTTGTATTTTATGCTTGAGCATTTCTAAATTCTTTTCCTGGCTAGGAGTAGTTGTCTATTGCCAGATTCTAGCCATGTTTAAAAGGCTATTGAGGTTTATGCCTCCATGAAAACGCACCGTGGCTTAAAGTCGTCCTTAAAAAGAATAAAAATGTTTGAGTTGTGAGAATTTGAACTATTAATACTTTTCCTAATATGATGGAAAACATTGTAGACAACAGATATCTTTTTAAATTTTTTGTTTAAGTGGATTCACAGGGTGATTTTTTTTTTATATACTTCTATATGATTATTGTGTTCAGTAATATTTGGTCATACCAATTTAGAATTTATAAAAATGTCATTAATTACATTTGAGGCATATGCCTCGCCTCTTCTAGAGGAAAACCCTCAAGACTGCCTTTAGCCTTTTAAAATATTTGATTCTAGTACTTAGTTGACTGATGCTTTGCTTCCTTCTGTAATTGAGCCAAATAGTGGAACTTTATAGTGGCAATCAGAAGTTTTCGGCACTTCATTAGTTCTTCATTGATAAAATTGAAATGAAAATTTCTTTTTTGTGAAGTTATTTTAACAAAGTCTTGCCCTTAAATCTTCCAACAGCTTATTGTCTCCTATAACATGACTAAAGACTTTTTGTGCCTTCTTGCAGATTCGATTTTATCCGAGGAAACTATTGAAAAAGTTCTTGCTGGTATGATCCTCGGGAATCACTCAGAAATATCATTTTTGCCTTGATTCTACCTGAATTGCTCTCGCTTGACCTTCTTCCAATGATTTTGAACGGATTTTCAACTCTTCTGTTTTTATGATGTGTACTCATCCGTAGGATCTATAAGTTCTTTTTTTTTTTTTTAATTTTAAATTTCGACTCTCCCGATGGTAAATCATCTGATTAACATTGATGAAAACATGTTTTTCTCCCTAATTTCTGCAGATGACCAACTAAGTTTTGATGATTTGTCTGTGGAGGAGAAGAAACAGTTCTTAAGGGCTATGGCATCAGGAGAGTTGAGCAAGATGATTGAACCTTGGGAAGCTTGGTGGACGAAACCTTCTGCTAGAACTATATCTCTGAGCAAAGAAGGAACACCACTTGTTCAACTCCACGCTGAGCAGAAGCTGACAACTTCACTAACAAGTGGAACGGAAGTGATGCAATCAAGTGGAATTCCTCAAGCACCTGATACCCCGTTACCTCCAATTAGCAAGCTTAGTTCTGCAGAGCCATCACCCCTGTT
GGCTGTTCACCTGGTTGATATCATTTATAGTTACTGTTTCACACTTCGCTTGTACAATGGAGACTGGCAATCAGATGCTATAGGATCAGCTCTGGTTGTTTTGAGTGTCTCCTCTGTCTTGGGTCAAAGTGGTCAACCAGAAACAGTTTTAGAGGCCTTGTCATCTTGCTTGGAACAAACCTGTTCTCCAGCTTATCGACATGTGGGGGGCTTGCAGCTTGGATTGAGTCTTATTGATGATGTCGCAATCCTGCTTTCACTGGGTGGTCCTGCTTTAGTGTGCTTGCTCTGCGACTTGCAGAGGCTTATTCAAGCTGGGGAGAGAGATCTGAAATCAGGAAAAAGGAGAAGAAAATCTAGATGGTCAGACATTGGTACTAAGCTTAAACACGCTGACAGGAAGATCTTTTTCATCATGTGCTGGGTTCATGAGCAGCCGAGCGAAGTTTGGTCGAGTTTGGAGAATATTATAAAGATGGAGAAAAGTTCCATTAAGGAGTTTGAGAATCATAAAATGTCAACAAAAATGGATAGTAAGGTCAAAAACAAGGACAAAGTTTTGATACAGGAGATAAAATGAGAATCATGTGCATACTTATTTATATTTATATTTGTTGTGTGTTATCTTTGACCTGGTATATATTAAGGATCCATTATGATCCTTTTTCAGGGGTTTCGAATAGCTTGTGTGCTCAATCCTTGGAACAAATTAATACAGGGAGAGAATTGCACGTCTTTTAATTACTTGTTGCCTCTCTTCCTGCTAATCCTAAGTAACTAACAAACAGTAGGTGGATCCAATCTACAAAGTACTTCTATCACTTCTA

mRNA sequence

GAGCAAGTGATTCTAGAAATTGTAGAAAGTAGAAAGAGAAGCCCAAGCGTTTATGGTGCCACTGAAGATTCCAGTGTTAGGGTAATCGACATTCATTCATTTTCTTGTTTCCGTAGCTCAGTTTCTGATGACACGCTGTATCCTTTGAAGTTTTTTCTTTGTTCTTCTGGCTATGGAGGAGGAGATACGAACTTCTGTTGATTCGTCTTCTTCAGTCAATCTTCCTTTGCGGACCATCTGTCATGTGTGCCACAAGCAATTTTCACACTATACGTGTCCTCGTTGTAATTCTCGTTACTGTTCTCTGCAATGTTACAAGTCTCATAGTAACCGCTGTACTGAGTCCTTCATGCGAGAAAATGTTGTTGAGGAGCTGCGTCAATTGCGGACGGACGATAGTACTAAACGTAAAACGCTTGACATTTTGAAAAGGTTTCATTCGGAAGAGGAAATGCAAGACTTGGATGAGGAGGATGGTACGTGGCTGCAAACCTTGGTTTGTTTATTAATTATTTCTGAAAATTCGATTTTATCCGAGGAAACTATTGAAAAAGTTCTTGCTGATGACCAACTAAGTTTTGATGATTTGTCTGTGGAGGAGAAGAAACAGTTCTTAAGGGCTATGGCATCAGGAGAGTTGAGCAAGATGATTGAACCTTGGGAAGCTTGGTGGACGAAACCTTCTGCTAGAACTATATCTCTGAGCAAAGAAGGAACACCACTTGTTCAACTCCACGCTGAGCAGAAGCTGACAACTTCACTAACAAGTGGAACGGAAGTGATGCAATCAAGTGGAATTCCTCAAGCACCTGATACCCCGTTACCTCCAATTAGCAAGCTTAGTTCTGCAGAGCCATCACCCCTGTTGGCTGTTCACCTGGTTGATATCATTTATAGTTACTGTTTCACACTTCGCTTGTACAATGGAGACTGGCAATCAGATGCTATAGGATCAGCTCTGGTTGTTTTGAGTGTCTCCTCTGTCTTGGGTCAAAGTGGTCAACCAGAAACAGTTTTAGAGGCCTTGTCATCTTGCTTGGAACAAACCTGTTCTCCAGCTTATCGACATGTGGGGGGCTTGCAGCTTGGATTGAGTCTTATTGATGATGTCGCAATCCTGCTTTCACTGGGTGGTCCTGCTTTAGTGTGCTTGCTCTGCGACTTGCAGAGGCTTATTCAAGCTGGGGAGAGAGATCTGAAATCAGGAAAAAGGAGAAGAAAATCTAGATGGTCAGACATTGGTACTAAGCTTAAACACGCTGACAGGAAGATCTTTTTCATCATGTGCTGGGTTCATGAGCAGCCGAGCGAAGTTTGGTCGAGTTTGGAGAATATTATAAAGATGGAGAAAAGTTCCATTAAGGAGTTTGAGAATCATAAAATGTCAACAAAAATGGATAGTAAGGTCAAAAACAAGGACAAAGTTTTGATACAGGAGATAAAATGAGAATCATGTGCATACTTATTTATATTTATATTTGTTGTGTGTTATCTTTGACCTGGTATATATTAAGGATCCATTATGATCCTTTTTCAGGGGTTTCGAATAGCTTGTGTGCTCAATCCTTGGAACAAATTAATACAGGGAGAGAATTGCACGTCTTTTAATTACTTGTTGCCTCTCTTCCTGCTAATCCTAAGTAACTAACAAACAGTAGGTGGATCCAATCTACAAAGTACTTCTATCACTTCTA

Coding sequence (CDS)

ATGGAGGAGGAGATACGAACTTCTGTTGATTCGTCTTCTTCAGTCAATCTTCCTTTGCGGACCATCTGTCATGTGTGCCACAAGCAATTTTCACACTATACGTGTCCTCGTTGTAATTCTCGTTACTGTTCTCTGCAATGTTACAAGTCTCATAGTAACCGCTGTACTGAGTCCTTCATGCGAGAAAATGTTGTTGAGGAGCTGCGTCAATTGCGGACGGACGATAGTACTAAACGTAAAACGCTTGACATTTTGAAAAGGTTTCATTCGGAAGAGGAAATGCAAGACTTGGATGAGGAGGATGGTACGTGGCTGCAAACCTTGGTTTGTTTATTAATTATTTCTGAAAATTCGATTTTATCCGAGGAAACTATTGAAAAAGTTCTTGCTGATGACCAACTAAGTTTTGATGATTTGTCTGTGGAGGAGAAGAAACAGTTCTTAAGGGCTATGGCATCAGGAGAGTTGAGCAAGATGATTGAACCTTGGGAAGCTTGGTGGACGAAACCTTCTGCTAGAACTATATCTCTGAGCAAAGAAGGAACACCACTTGTTCAACTCCACGCTGAGCAGAAGCTGACAACTTCACTAACAAGTGGAACGGAAGTGATGCAATCAAGTGGAATTCCTCAAGCACCTGATACCCCGTTACCTCCAATTAGCAAGCTTAGTTCTGCAGAGCCATCACCCCTGTTGGCTGTTCACCTGGTTGATATCATTTATAGTTACTGTTTCACACTTCGCTTGTACAATGGAGACTGGCAATCAGATGCTATAGGATCAGCTCTGGTTGTTTTGAGTGTCTCCTCTGTCTTGGGTCAAAGTGGTCAACCAGAAACAGTTTTAGAGGCCTTGTCATCTTGCTTGGAACAAACCTGTTCTCCAGCTTATCGACATGTGGGGGGCTTGCAGCTTGGATTGAGTCTTATTGATGATGTCGCAATCCTGCTTTCACTGGGTGGTCCTGCTTTAGTGTGCTTGCTCTGCGACTTGCAGAGGCTTATTCAAGCTGGGGAGAGAGATCTGAAATCAGGAAAAAGGAGAAGAAAATCTAGATGGTCAGACATTGGTACTAAGCTTAAACACGCTGACAGGAAGATCTTTTTCATCATGTGCTGGGTTCATGAGCAGCCGAGCGAAGTTTGGTCGAGTTTGGAGAATATTATAAAGATGGAGAAAAGTTCCATTAAGGAGTTTGAGAATCATAAAATGTCAACAAAAATGGATAGTAAGGTCAAAAACAAGGACAAAGTTTTGATACAGGAGATAAAATGA

Protein sequence

MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHVGGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLIQEIK
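The protein sequence corresponds to the coding sequence read codon by codon. The sketch below shows this with the standard genetic code, applied to the first and last few codons of the CDS listed in this record. The helper name `translate` is illustrative; using the standard nuclear code table is an assumption, although it is the usual choice for a plant nuclear gene.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons of the CDS and its final five codons (including TGA).
print(translate("ATGGAGGAGGAGATACGAACTTCT"))  # MEEEIRTS
print(translate("CAGGAGATAAAATGA"))           # QEIK
```

The two outputs match the start (MEEEIRTS) and end (QEIK) of the protein sequence above.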
BLAST of Lsi04G000470 vs. Swiss-Prot
Match: ZNHI2_MOUSE (Zinc finger HIT domain-containing protein 2 OS=Mus musculus GN=Znhit2 PE=2 SV=2)

HSP 1 Score: 87.4 bits (215), Expect = 4.0e-16
Identity = 97/370 (26.22%), Positives = 154/370 (41.62%), Query Frame = 1

Query: 31  SHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTLDILKRFHS 90
           + YTCPRCN+ YCSL+CY++H   C E F R+ V   LR+LR   ++  +    L+R   
Sbjct: 18  ARYTCPRCNAPYCSLRCYRAH-GACAEDFYRDQV---LRELRGRSASPSRLAGALRRLRE 77

Query: 91  EEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLADDQLSFDDLSVEEKKQFLRA 150
           + E +D  EE G                 L        L+     ++ L+  EK  F R 
Sbjct: 78  QREAEDEPEEAG-----------------LGPGARPGGLSG---LWERLTPAEKAAFERL 137

Query: 151 MASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQK---LTTSLTSGTEVMQS- 210
           ++ GE  +++ PW  WW         L +      +  AE +     T+L SG +   + 
Sbjct: 138 LSRGEAGRLLPPWRPWWWGRGTGPRLLEELDHAANRDLAEPEPAPARTALQSGDDAAAAE 197

Query: 211 -------SGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDAI 270
                  +  P A    +P ++ LS +  SPL+   L +++++Y  TL LY+G    DA+
Sbjct: 198 PFAEDSCAARPLALPARIPALASLSRSPASPLVRFQLPNVLFAYAHTLALYHGG-DDDAL 257

Query: 271 GS--ALVVLSVSSVLGQS---GQPETVLEALSSCLEQTCSP--------AYRHVGGLQLG 330
            S     +L VS  LG     G  E  L+A +  LE    P        A + V  + LG
Sbjct: 258 LSDFCATLLDVSGALGAQQVFGSTEEALQAAAHVLEAGEHPPGPLGTRGAMQEVARILLG 317

Query: 331 LSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADRK 377
              ++     L+  G     L    ++ +  GERD                 +L  A +K
Sbjct: 318 EGPVNQKGYTLTALGHLAQTLGRARKQAVIGGERD-----------------RLYRARKK 345
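
The identity and positive-match percentages in an HSP header are the match counts divided by the alignment length. A short sketch of that arithmetic, using the counts reported for the ZNHI2_MOUSE hit above (the function name `percent` is illustrative only):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the ZNHI2_MOUSE HSP header: 97 identical and
# 154 positive positions over an alignment of 370 columns.
print(percent(97, 370))   # 26.22
print(percent(154, 370))  # 41.62
```

The alignment length counts gap columns as well, so it can differ from the number of query residues covered.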

BLAST of Lsi04G000470 vs. Swiss-Prot
Match: ZNHI2_HUMAN (Zinc finger HIT domain-containing protein 2 OS=Homo sapiens GN=ZNHIT2 PE=1 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 3.4e-15
Identity = 98/378 (25.93%), Positives = 150/378 (39.68%), Query Frame = 1

Query: 23  CHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTL 82
           C     Q + YTCPRCN+ YCSL+CY++H   C E+F R+ V+ ELR      S   +  
Sbjct: 10  CPAGEVQPARYTCPRCNAPYCSLRCYRTHGT-CAENFYRDQVLGELRGCSAPPS---RLA 69

Query: 83  DILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLADDQLSFDDLSVE 142
             L+R   + E +D   E G                 LS       L+     ++ L+  
Sbjct: 70  SALRRLRQQRETEDEPGEAG-----------------LSSGPAPGGLSG---LWERLAPG 129

Query: 143 EKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQKLTTSLTSGTE 202
           EK  F R ++ GE  +++ PW  WW    A    L +         AE +L  + T    
Sbjct: 130 EKAAFERLLSRGEAGRLLPPWRPWWWNRGAGPQLLEELDNAPGSDAAELELAPARTPPDS 189

Query: 203 VMQSSGIPQAP----------------DTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFT 262
           V  +S    A                  T +P I  LS    SPL+   L +++++Y  T
Sbjct: 190 VKDASAAEPAAAERVLGDVPGACTPVVPTRIPAIVSLSRGPVSPLVRFQLPNVLFAYAHT 249

Query: 263 LRLYNGDWQSDAIGSALVVLSVSSVLGQS---GQPETVLEALSSCLEQTCSPAYRHVGGL 322
           L LY+G   +        +L VS  LG        E  L+A +  LE     A  H  G 
Sbjct: 250 LALYHGGDDALLSDFCATLLGVSGALGAQQVFASAEEALQAAAHVLE-----AGEHPPGP 309

Query: 323 QLGLSLIDDVAILLSLGGPA-----LVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGT 377
                 + +VA +L   GP       +  L DL + +         G+ R+++   +   
Sbjct: 310 LGTRGAMHEVARILLGEGPTNQKGYTLAALGDLAQTL---------GRARKQAVAREERD 349

BLAST of Lsi04G000470 vs. Swiss-Prot
Match: ZNHI2_BOVIN (Zinc finger HIT domain-containing protein 2 OS=Bos taurus GN=ZNHIT2 PE=2 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 1.3e-14
Identity = 93/371 (25.07%), Positives = 149/371 (40.16%), Query Frame = 1

Query: 23  CHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTL 82
           C     Q + YTCPRCN  YCSL+CY++H + C E F R+ V+ ELR      S   +  
Sbjct: 10  CPTGEAQPARYTCPRCNVPYCSLRCYRAHGS-CAEEFYRDQVLGELRGRSASPS---RLA 69

Query: 83  DILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLADDQLS--FDDLS 142
             L+R   + E +D   + G                      +    A   LS  ++ L+
Sbjct: 70  TALRRLRQQRETEDEPGDAG----------------------LRPGPAPGGLSGLWERLA 129

Query: 143 VEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEG----------TPLVQLHAE 202
             EK  F R ++ GE  +++ PW  WW    A    L + G           P       
Sbjct: 130 PAEKVAFERLLSRGEAGRLLPPWRPWWWGRGAGPRLLEELGDAPSGDAEELEPSPARMPP 189

Query: 203 QKLTTSLTSGTEVMQS--SGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLR 262
           + +     +  +V+       P A  T +P ++ LS    SPL+   L +++++Y  TL 
Sbjct: 190 EPVRDEPAAVEQVLGDLPGACPPAVPTRIPALASLSRGRTSPLVRFQLPNVLFAYAHTLA 249

Query: 263 LYNGDWQSDAIGSALVVLSVSSVLGQS---GQPETVLEALSSCLEQTCSPAYRHVGGLQL 322
           LY+G  ++        +L VS  LG        E  L+A +  LE     A  H  G   
Sbjct: 250 LYHGGDEALLSDFCATLLGVSGALGAQQVFASAEEALQAAAHVLE-----AGEHPPGPLG 309

Query: 323 GLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADR 377
               + + A +L   GPA          L   G+     G+ R+++   +   +L  A +
Sbjct: 310 TRGAMREAARILLGEGPANQ----KSYTLAALGDLAQTLGRARKQAVAPEERDRLYRARK 345

BLAST of Lsi04G000470 vs. TrEMBL
Match: A0A0A0KV24_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G623610 PE=4 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 2.5e-206
Identity = 374/423 (88.42%), Positives = 393/423 (92.91%), Query Frame = 1

Query: 1   MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM 60
           MEEEIRTSVDSSSSVNLPLRTICHVCHKQFS Y+CPRCNSRYCSLQCYKSHSNRCTESFM
Sbjct: 1   MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSQYSCPRCNSRYCSLQCYKSHSNRCTESFM 60

Query: 61  RENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSIL 120
           RENVVEELRQLRTDDS KRKTLDILKRFH+EEEM+DLDEED               +S L
Sbjct: 61  RENVVEELRQLRTDDSAKRKTLDILKRFHAEEEMEDLDEED---------------DSTL 120

Query: 121 SEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKE 180
           SEET EKVLA DQLSFDDLS EEKK+FLRAMASGELSKMIEPWEAWW KPSARTISLSKE
Sbjct: 121 SEETTEKVLAGDQLSFDDLSEEEKKRFLRAMASGELSKMIEPWEAWWMKPSARTISLSKE 180

Query: 181 GTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDII 240
           GTPLVQLHA +++TTSLTS TEVMQSSGIPQAPDTPLPP+SKLS+AEPSPLLAVHL+DII
Sbjct: 181 GTPLVQLHAAERMTTSLTSETEVMQSSGIPQAPDTPLPPLSKLSTAEPSPLLAVHLIDII 240

Query: 241 YSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHV 300
           YSYCFTLRLYNGDWQSDA GSALVVLS+SSVLGQ+G+PETVLEALSSCLEQTCSPAYRHV
Sbjct: 241 YSYCFTLRLYNGDWQSDATGSALVVLSISSVLGQNGKPETVLEALSSCLEQTCSPAYRHV 300

Query: 301 GGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKL 360
           GGLQLGLSLIDDV+ LLSLG PALVCLLCDLQRLIQAGERDLKS KRRRKS+WSDIGTKL
Sbjct: 301 GGLQLGLSLIDDVSTLLSLGCPALVCLLCDLQRLIQAGERDLKSEKRRRKSKWSDIGTKL 360

Query: 361 KHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLI 420
           KHADRKIFFIMCWVHEQPSEVWS+LENI+KMEKSSI EFENHKMSTKMDSKVK+ DKVLI
Sbjct: 361 KHADRKIFFIMCWVHEQPSEVWSTLENIVKMEKSSIMEFENHKMSTKMDSKVKSGDKVLI 408

Query: 421 QEI 424
           QEI
Sbjct: 421 QEI 408

BLAST of Lsi04G000470 vs. TrEMBL
Match: E5GCK6_CUCME (Aquarius OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 1.3e-154
Identity = 287/339 (84.66%), Positives = 306/339 (90.27%), Query Frame = 1

Query: 43   CSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDG 102
            C++   +SHSNRCTESFMRENVVEELRQLRTDD+ KRKTLDILKRFH+EEEM+DL EED 
Sbjct: 1878 CNVTRLQSHSNRCTESFMRENVVEELRQLRTDDNAKRKTLDILKRFHTEEEMEDLVEED- 1937

Query: 103  TWLQTLVCLLIISENSILSEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEP 162
                          +S LSEETIEKVLA DQ+SFDDLS EEKK+FLRAMASGELSKMIEP
Sbjct: 1938 --------------DSTLSEETIEKVLAGDQISFDDLSEEEKKRFLRAMASGELSKMIEP 1997

Query: 163  WEAWWTKPSARTISLSKEGTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISK 222
            WEAWW KPSARTISLSKEGTPLVQLH  +++TTSLTSGTE MQSSGIP+APD PLPP+SK
Sbjct: 1998 WEAWWMKPSARTISLSKEGTPLVQLHGAERMTTSLTSGTEAMQSSGIPRAPDAPLPPLSK 2057

Query: 223  LSSAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVL 282
            LS+AEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDA GSALVVLSVSSVLGQ+G+PETVL
Sbjct: 2058 LSTAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDATGSALVVLSVSSVLGQNGKPETVL 2117

Query: 283  EALSSCLEQTCSPAYRHVGGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDL 342
            EALSSCLEQTCSPAYRHVGGLQLGLSLIDDV  LLSLG PAL+CLLCDLQRLIQAGERDL
Sbjct: 2118 EALSSCLEQTCSPAYRHVGGLQLGLSLIDDVTTLLSLGCPALLCLLCDLQRLIQAGERDL 2177

Query: 343  KSGKRRRKSRWSDIGTKLKHADRKIFFIMCWVHEQPSEV 382
            KS KRRRKS+WSDIGTKLKHADRKIFFIMCWVHEQPSEV
Sbjct: 2178 KSEKRRRKSKWSDIGTKLKHADRKIFFIMCWVHEQPSEV 2201

BLAST of Lsi04G000470 vs. TrEMBL
Match: W9S1B8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003644 PE=4 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 1.9e-145
Identity = 278/413 (67.31%), Positives = 324/413 (78.45%), Query Frame = 1

Query: 12  SSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQL 71
           SS +N P R  CHVC KQFS YTCPRCNSRYCSL CYKSHS RCTESFMRENVVEELRQL
Sbjct: 14  SSPLNPPSRITCHVCQKQFSQYTCPRCNSRYCSLHCYKSHSLRCTESFMRENVVEELRQL 73

Query: 72  RTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLAD 131
           + +D TK K LDILKRFHSEEE +D+DEED T                LSE+TI+K L+ 
Sbjct: 74  QPNDETKEKMLDILKRFHSEEETEDMDEEDST----------------LSEDTIQKFLSG 133

Query: 132 DQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQ 191
            Q+SF DLS EEKK+F RA+ASGELSKMIEPW+ WW +PSARTISLS+EGT LVQ  ++ 
Sbjct: 134 GQISFVDLSAEEKKRFQRAVASGELSKMIEPWDPWWLRPSARTISLSREGTQLVQPLSKD 193

Query: 192 KLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLRLYN 251
           +L+ S     E  Q+S IP  P++PLPP+SKLSS  PSPLLAVHLVDIIYSYCFTLRLYN
Sbjct: 194 ELSVSPQLNNESDQASDIPPGPESPLPPVSKLSSTAPSPLLAVHLVDIIYSYCFTLRLYN 253

Query: 252 GDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHVGGLQLGLSLID 311
           GDWQSDAIGSA VVLSVSSVLGQ GQPETVLEALS CLEQTCSPAYRH+GGLQ  L L+D
Sbjct: 254 GDWQSDAIGSATVVLSVSSVLGQGGQPETVLEALSYCLEQTCSPAYRHIGGLQFRLGLVD 313

Query: 312 DVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADRKIFFIM 371
           DV  +LSLGG AL+CLL DL RL+Q+GER+LKS K + KSR  +I  KLK A+RKI+FIM
Sbjct: 314 DVVSILSLGGNALLCLLSDLYRLVQSGERELKSEKLQ-KSRKLEIKNKLKLAERKIYFIM 373

Query: 372 CWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLIQEIK 425
           CWVH+QP E WSSL  I++ EK+S  ++ N  +S K++ K +N+ KVLIQE++
Sbjct: 374 CWVHDQPGEAWSSLAAIVRAEKASAMDYIN--ISRKVEKKAENRGKVLIQEME 407

BLAST of Lsi04G000470 vs. TrEMBL
Match: D7TVP0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g02330 PE=4 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 1.0e-143
Identity = 270/413 (65.38%), Positives = 320/413 (77.48%), Query Frame = 1

Query: 12  SSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQL 71
           SS +N   R IC VC KQFS YTCPRCNSRYCSLQCYKSHS RCTESFMRENVVEEL Q+
Sbjct: 14  SSPLNPSNRIICRVCQKQFSQYTCPRCNSRYCSLQCYKSHSLRCTESFMRENVVEELGQM 73

Query: 72  RTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLAD 131
           + DD TK K LDILKRFHSEE+M  +DE+D                S LSEETI+K+L+ 
Sbjct: 74  QPDDETKHKMLDILKRFHSEEDMDSMDEDD----------------SGLSEETIQKILSG 133

Query: 132 DQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQ 191
            Q+SFDDLS  EK+ F RA+ASGELSKMIEPW  WW KP+ART+SLS+EGT LVQ   +Q
Sbjct: 134 CQVSFDDLSPNEKRLFQRAVASGELSKMIEPWVPWWLKPAARTLSLSQEGTQLVQPLIKQ 193

Query: 192 KLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLRLYN 251
           + + S     E  Q++ IP  P+TP PP+SKL + EPSPLL VHLVDI+YSYCFTLR+YN
Sbjct: 194 ETSVSSEDDPESNQANEIPPGPETPFPPVSKLIATEPSPLLTVHLVDILYSYCFTLRIYN 253

Query: 252 GDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHVGGLQLGLSLID 311
           GDW+SDA+GSA+V+LS+SSVLGQ+G PET+LEA+S CLEQTCSPAYRH+GGLQ GL L+D
Sbjct: 254 GDWRSDALGSAMVLLSISSVLGQAGLPETILEAVSHCLEQTCSPAYRHLGGLQFGLGLLD 313

Query: 312 DVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADRKIFFIM 371
           DV  LLSLG PALVC LCDLQRLIQAGER+LK GK  RKS+  DI +KLK A+RKI+FIM
Sbjct: 314 DVITLLSLGSPALVCSLCDLQRLIQAGERELKLGK-PRKSKRPDIRSKLKLAERKIYFIM 373

Query: 372 CWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLIQEIK 425
           CWVHEQP E WSSL  I+K EK S  +F   + S KM+ K K++ K+LI+E++
Sbjct: 374 CWVHEQPDEAWSSLAAIVKTEKGSTMDFAASQRSVKMEDKSKSRGKILIEEVQ 409

BLAST of Lsi04G000470 vs. TrEMBL
Match: A0A061G415_THECC (HIT-type Zinc finger family protein OS=Theobroma cacao GN=TCM_015676 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 1.3e-143
Identity = 272/425 (64.00%), Positives = 330/425 (77.65%), Query Frame = 1

Query: 1   MEEEIRTSVDSS--SSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTES 60
           M E I TS + S  S +N P R ICHVC KQFS YTCPRCNSRYCSL CYKSHS RCTES
Sbjct: 1   MAETIHTSENLSNPSPLNPPSRVICHVCQKQFSQYTCPRCNSRYCSLHCYKSHSLRCTES 60

Query: 61  FMRENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENS 120
           FMRENVVEELRQL+ DD  KRK L+ILKRFHSEEE   LDE+D              ++S
Sbjct: 61  FMRENVVEELRQLQPDDEIKRKMLEILKRFHSEEETHTLDEDD--------------DDS 120

Query: 121 ILSEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLS 180
            LS+ETI+K+L+  ++SFDDLS+EEKK+F +A+ASGELSKMIEPW+ WW  P+A TI LS
Sbjct: 121 TLSDETIQKILSGREVSFDDLSLEEKKRFQKAVASGELSKMIEPWDPWWLNPAAGTICLS 180

Query: 181 KEGTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVD 240
           ++GT LVQ  A  + +       E  +SSGIP  P+TPLP + KL S EPSPLLAVHLVD
Sbjct: 181 RDGTRLVQPIANLEASVPPEDDLESNESSGIPVGPETPLPSLRKLISTEPSPLLAVHLVD 240

Query: 241 IIYSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYR 300
           I+YSYCFTLR+YNG+WQSDAIGSA+VVLS+S VLGQ+GQPETV EA+S CLEQTCSPAYR
Sbjct: 241 IVYSYCFTLRVYNGEWQSDAIGSAMVVLSISCVLGQAGQPETVREAVSYCLEQTCSPAYR 300

Query: 301 HVGGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGT 360
           H+GGLQ GL+L+DDVA LLSLGGPAL+C+L DLQR+IQAGE++LKS ++ R  R ++I +
Sbjct: 301 HIGGLQFGLALVDDVATLLSLGGPALICMLFDLQRMIQAGEKELKS-EKPRMLRKAEIKS 360

Query: 361 KLKHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKV 420
           KLK A+RK+ FIMCWVHEQP E WSSL  ++K EKSS  ++   K  +K ++K +NK KV
Sbjct: 361 KLKLAERKVHFIMCWVHEQPGEAWSSLGAMVKAEKSSFMDYGGSKSFSKRENKAENKGKV 410

Query: 421 LIQEI 424
           LI+E+
Sbjct: 421 LIEEM 410

BLAST of Lsi04G000470 vs. TAIR10
Match: AT5G63830.1 (AT5G63830.1 HIT-type Zinc finger family protein)

HSP 1 Score: 416.8 bits (1070), Expect = 1.6e-116
Identity = 230/427 (53.86%), Positives = 302/427 (70.73%), Query Frame = 1

Query: 6   RTSVDSSSSVNLPL-----RTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM 65
           +T + S SS + PL     R ICHVC+KQFS YTCPRCN RYCSL CYKSHS +CTESFM
Sbjct: 4   KTIISSESSNSSPLNPSSTRIICHVCNKQFSQYTCPRCNFRYCSLPCYKSHSVQCTESFM 63

Query: 66  RENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSIL 125
           R+NV +EL+Q+R+DD TKRK L+ILKRFH EEE     E+DG      +  +   E   L
Sbjct: 64  RDNVNDELKQVRSDDQTKRKMLEILKRFHEEEE-----EDDGG-----IDSITDDEGLSL 123

Query: 126 SEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKE 185
            EE I+K++  D++S DDLS+EE+K F RA+ASGELSKMI+PW+ WW   SARTI+L   
Sbjct: 124 PEEIIQKIMNGDEVSLDDLSLEERKGFQRALASGELSKMIQPWDPWWLLASARTINLGLG 183

Query: 186 GTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDII 245
           GT LVQ   +++ TT +         S IP+ PDTPL  +SKL+S  PSPLL +HL+DI 
Sbjct: 184 GTQLVQCVEDEEETTVV---------SEIPRGPDTPLISLSKLTSTNPSPLLPIHLIDIA 243

Query: 246 YSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHV 305
           YSYCFTLR+YNG+WQSD++G+A +VL+VSSVLG +GQPET+ E LS CLEQTCS AY+++
Sbjct: 244 YSYCFTLRIYNGEWQSDSLGAATMVLTVSSVLGHNGQPETIKEVLSFCLEQTCSSAYKNL 303

Query: 306 -GGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTK 365
            GGL+ GL+L+DDV   LSLG  A+VCLL DLQRLI    +++KS   R      D+  K
Sbjct: 304 GGGLKFGLNLVDDVICFLSLGSGAMVCLLGDLQRLILGAIKEVKSSSGR------DLKKK 363

Query: 366 LKHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSK--VKNKDK 425
           LK A+RK+F++MCWV+EQ SEVW +LE  ++ EK+S+ E  +++   +M  K   +    
Sbjct: 364 LKLAERKVFYMMCWVNEQNSEVWEALEPSVRAEKNSVVELNDYRGFPEMKKKDQFRKGGG 405

BLAST of Lsi04G000470 vs. NCBI nr
Match: gi|778706714|ref|XP_011655899.1| (PREDICTED: zinc finger HIT domain-containing protein 2 [Cucumis sativus])

HSP 1 Score: 726.1 bits (1873), Expect = 3.6e-206
Identity = 374/423 (88.42%), Postives = 393/423 (92.91%), Query Frame = 1

Query: 1   MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM 60
           MEEEIRTSVDSSSSVNLPLRTICHVCHKQFS Y+CPRCNSRYCSLQCYKSHSNRCTESFM
Sbjct: 1   MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSQYSCPRCNSRYCSLQCYKSHSNRCTESFM 60

Query: 61  RENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSIL 120
           RENVVEELRQLRTDDS KRKTLDILKRFH+EEEM+DLDEED               +S L
Sbjct: 61  RENVVEELRQLRTDDSAKRKTLDILKRFHAEEEMEDLDEED---------------DSTL 120

Query: 121 SEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKE 180
           SEET EKVLA DQLSFDDLS EEKK+FLRAMASGELSKMIEPWEAWW KPSARTISLSKE
Sbjct: 121 SEETTEKVLAGDQLSFDDLSEEEKKRFLRAMASGELSKMIEPWEAWWMKPSARTISLSKE 180

Query: 181 GTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDII 240
           GTPLVQLHA +++TTSLTS TEVMQSSGIPQAPDTPLPP+SKLS+AEPSPLLAVHL+DII
Sbjct: 181 GTPLVQLHAAERMTTSLTSETEVMQSSGIPQAPDTPLPPLSKLSTAEPSPLLAVHLIDII 240

Query: 241 YSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHV 300
           YSYCFTLRLYNGDWQSDA GSALVVLS+SSVLGQ+G+PETVLEALSSCLEQTCSPAYRHV
Sbjct: 241 YSYCFTLRLYNGDWQSDATGSALVVLSISSVLGQNGKPETVLEALSSCLEQTCSPAYRHV 300

Query: 301 GGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKL 360
           GGLQLGLSLIDDV+ LLSLG PALVCLLCDLQRLIQAGERDLKS KRRRKS+WSDIGTKL
Sbjct: 301 GGLQLGLSLIDDVSTLLSLGCPALVCLLCDLQRLIQAGERDLKSEKRRRKSKWSDIGTKL 360

Query: 361 KHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLI 420
           KHADRKIFFIMCWVHEQPSEVWS+LENI+KMEKSSI EFENHKMSTKMDSKVK+ DKVLI
Sbjct: 361 KHADRKIFFIMCWVHEQPSEVWSTLENIVKMEKSSIMEFENHKMSTKMDSKVKSGDKVLI 408

Query: 421 QEI 424
           QEI
Sbjct: 421 QEI 408

BLAST of Lsi04G000470 vs. NCBI nr
Match: gi|659092151|ref|XP_008446925.1| (PREDICTED: zinc finger HIT domain-containing protein 2 [Cucumis melo])

HSP 1 Score: 716.8 bits (1849), Expect = 2.2e-203
Identity = 369/424 (87.03%), Positives = 390/424 (91.98%), Query Frame = 1

Query: 1   MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM 60
           MEEEIRTSVDSSSSVNLPLR ICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM
Sbjct: 1   MEEEIRTSVDSSSSVNLPLRIICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM 60

Query: 61  RENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSIL 120
           RENVVEELRQLRTDD+ KRKTLDILKRFH+EEEM+DL EED               +S L
Sbjct: 61  RENVVEELRQLRTDDNAKRKTLDILKRFHTEEEMEDLVEED---------------DSTL 120

Query: 121 SEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKE 180
           SEETIEKVLA DQ+SFDDLS EEKK+FLRAMASGELSKMIEPWEAWW KPSARTISLSKE
Sbjct: 121 SEETIEKVLAGDQISFDDLSEEEKKRFLRAMASGELSKMIEPWEAWWMKPSARTISLSKE 180

Query: 181 GTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDII 240
           GTPLVQLH  +++TTSLTSGTE MQSSGIP+APD PLPP+SKLS+AEPSPLLAVHLVDII
Sbjct: 181 GTPLVQLHGAERMTTSLTSGTEAMQSSGIPRAPDAPLPPLSKLSTAEPSPLLAVHLVDII 240

Query: 241 YSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHV 300
           YSYCFTLRLYNGDWQSDA GSALVVLSVSSVLGQ+G+PETVLEALSSCLEQTCSPAYRHV
Sbjct: 241 YSYCFTLRLYNGDWQSDATGSALVVLSVSSVLGQNGKPETVLEALSSCLEQTCSPAYRHV 300

Query: 301 GGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKL 360
           GGLQLGLSLIDDV  LLSLG PAL+CLLCDLQRLIQAGERDLKS KRRRKS+WSDIGTKL
Sbjct: 301 GGLQLGLSLIDDVTTLLSLGCPALLCLLCDLQRLIQAGERDLKSEKRRRKSKWSDIGTKL 360

Query: 361 KHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLI 420
           KHADRKIFFIMCWVHEQPSEVWS+LENI+KMEKSSI EF+NHKMSTKMDSKV + DKVLI
Sbjct: 361 KHADRKIFFIMCWVHEQPSEVWSTLENIVKMEKSSIMEFQNHKMSTKMDSKVISGDKVLI 409

Query: 421 QEIK 425
           QE+K
Sbjct: 421 QEMK 409

BLAST of Lsi04G000470 vs. NCBI nr
Match: gi|307136393|gb|ADN34203.1| (aquarius [Cucumis melo subsp. melo])

HSP 1 Score: 554.3 bits (1427), Expect = 1.9e-154
Identity = 287/339 (84.66%), Positives = 306/339 (90.27%), Query Frame = 1

Query: 43   CSLQCYKSHSNRCTESFMRENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDG 102
            C++   +SHSNRCTESFMRENVVEELRQLRTDD+ KRKTLDILKRFH+EEEM+DL EED 
Sbjct: 1878 CNVTRLQSHSNRCTESFMRENVVEELRQLRTDDNAKRKTLDILKRFHTEEEMEDLVEED- 1937

Query: 103  TWLQTLVCLLIISENSILSEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEP 162
                          +S LSEETIEKVLA DQ+SFDDLS EEKK+FLRAMASGELSKMIEP
Sbjct: 1938 --------------DSTLSEETIEKVLAGDQISFDDLSEEEKKRFLRAMASGELSKMIEP 1997

Query: 163  WEAWWTKPSARTISLSKEGTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISK 222
            WEAWW KPSARTISLSKEGTPLVQLH  +++TTSLTSGTE MQSSGIP+APD PLPP+SK
Sbjct: 1998 WEAWWMKPSARTISLSKEGTPLVQLHGAERMTTSLTSGTEAMQSSGIPRAPDAPLPPLSK 2057

Query: 223  LSSAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVL 282
            LS+AEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDA GSALVVLSVSSVLGQ+G+PETVL
Sbjct: 2058 LSTAEPSPLLAVHLVDIIYSYCFTLRLYNGDWQSDATGSALVVLSVSSVLGQNGKPETVL 2117

Query: 283  EALSSCLEQTCSPAYRHVGGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDL 342
            EALSSCLEQTCSPAYRHVGGLQLGLSLIDDV  LLSLG PAL+CLLCDLQRLIQAGERDL
Sbjct: 2118 EALSSCLEQTCSPAYRHVGGLQLGLSLIDDVTTLLSLGCPALLCLLCDLQRLIQAGERDL 2177

Query: 343  KSGKRRRKSRWSDIGTKLKHADRKIFFIMCWVHEQPSEV 382
            KS KRRRKS+WSDIGTKLKHADRKIFFIMCWVHEQPSEV
Sbjct: 2178 KSEKRRRKSKWSDIGTKLKHADRKIFFIMCWVHEQPSEV 2201

BLAST of Lsi04G000470 vs. NCBI nr
Match: gi|1009156119|ref|XP_015896076.1| (PREDICTED: zinc finger HIT domain-containing protein 2-like [Ziziphus jujuba])

HSP 1 Score: 533.9 bits (1374), Expect = 2.6e-148
Identity = 285/425 (67.06%), Positives = 332/425 (78.12%), Query Frame = 1

Query: 1   MEEEIRTSVD--SSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTES 60
           M ++I TS    SSS  N P R ICHVC KQFS YTCPRCNSRYCSL CYKSHS RCTES
Sbjct: 1   MADDIVTSQQPSSSSQFNPPSRIICHVCQKQFSQYTCPRCNSRYCSLHCYKSHSIRCTES 60

Query: 61  FMRENVVEELRQLRTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENS 120
           FMR+NVVEEL+QL   D TK+K LDIL+RFHSEEE  D+DEE               E+S
Sbjct: 61  FMRDNVVEELQQLEPKDETKQKMLDILRRFHSEEETDDMDEE---------------EDS 120

Query: 121 ILSEETIEKVLADDQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLS 180
            LSEETI+KVL+  Q+SFDDLSVEE+K+F RA+ASGELSKMIEPW+AWW +P+A+TISLS
Sbjct: 121 TLSEETIQKVLSGVQVSFDDLSVEERKRFKRAVASGELSKMIEPWDAWWLRPAAKTISLS 180

Query: 181 KEGTPLVQLHAEQKLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVD 240
           KEGT LVQ  AE++ +  L +  E  Q+S IP  P++PLPP+SKL S EPSP LAVHLVD
Sbjct: 181 KEGTQLVQPLAEREASILLQNDRESNQASEIPPGPESPLPPVSKLISTEPSPFLAVHLVD 240

Query: 241 IIYSYCFTLRLYNGDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYR 300
           I+YSYCFTLRLYNGDWQSDA+GSA+VVL VSSVLGQ GQPETVLEALS CLEQTCSPAYR
Sbjct: 241 IVYSYCFTLRLYNGDWQSDALGSAMVVLGVSSVLGQGGQPETVLEALSYCLEQTCSPAYR 300

Query: 301 HVGGLQLGLSLIDDVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGT 360
           H+GGLQ GL L+DDV  +L+LG PAL+CLL DLQRL+QAGERDLKS K  +K R  +I +
Sbjct: 301 HMGGLQFGLGLVDDVVSILTLGSPALLCLLSDLQRLVQAGERDLKSDK-SQKPRRKEIKS 360

Query: 361 KLKHADRKIFFIMCWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKV 420
           KLK A+RKI+FIMCWVHEQP E WSSL  II+ EKSS  +FE+   S K+  K + K  V
Sbjct: 361 KLKLAERKIYFIMCWVHEQPGEAWSSLAAIIRAEKSSGMDFESISRSGKVKRKAEMKGNV 409

Query: 421 LIQEI 424
           LI+E+
Sbjct: 421 LIEEM 409

BLAST of Lsi04G000470 vs. NCBI nr
Match: gi|703136374|ref|XP_010106140.1| (hypothetical protein L484_003644 [Morus notabilis])

HSP 1 Score: 523.9 bits (1348), Expect = 2.7e-145
Identity = 278/413 (67.31%), Positives = 324/413 (78.45%), Query Frame = 1

Query: 12  SSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFMRENVVEELRQL 71
           SS +N P R  CHVC KQFS YTCPRCNSRYCSL CYKSHS RCTESFMRENVVEELRQL
Sbjct: 14  SSPLNPPSRITCHVCQKQFSQYTCPRCNSRYCSLHCYKSHSLRCTESFMRENVVEELRQL 73

Query: 72  RTDDSTKRKTLDILKRFHSEEEMQDLDEEDGTWLQTLVCLLIISENSILSEETIEKVLAD 131
           + +D TK K LDILKRFHSEEE +D+DEED T                LSE+TI+K L+ 
Sbjct: 74  QPNDETKEKMLDILKRFHSEEETEDMDEEDST----------------LSEDTIQKFLSG 133

Query: 132 DQLSFDDLSVEEKKQFLRAMASGELSKMIEPWEAWWTKPSARTISLSKEGTPLVQLHAEQ 191
            Q+SF DLS EEKK+F RA+ASGELSKMIEPW+ WW +PSARTISLS+EGT LVQ  ++ 
Sbjct: 134 GQISFVDLSAEEKKRFQRAVASGELSKMIEPWDPWWLRPSARTISLSREGTQLVQPLSKD 193

Query: 192 KLTTSLTSGTEVMQSSGIPQAPDTPLPPISKLSSAEPSPLLAVHLVDIIYSYCFTLRLYN 251
           +L+ S     E  Q+S IP  P++PLPP+SKLSS  PSPLLAVHLVDIIYSYCFTLRLYN
Sbjct: 194 ELSVSPQLNNESDQASDIPPGPESPLPPVSKLSSTAPSPLLAVHLVDIIYSYCFTLRLYN 253

Query: 252 GDWQSDAIGSALVVLSVSSVLGQSGQPETVLEALSSCLEQTCSPAYRHVGGLQLGLSLID 311
           GDWQSDAIGSA VVLSVSSVLGQ GQPETVLEALS CLEQTCSPAYRH+GGLQ  L L+D
Sbjct: 254 GDWQSDAIGSATVVLSVSSVLGQGGQPETVLEALSYCLEQTCSPAYRHIGGLQFRLGLVD 313

Query: 312 DVAILLSLGGPALVCLLCDLQRLIQAGERDLKSGKRRRKSRWSDIGTKLKHADRKIFFIM 371
           DV  +LSLGG AL+CLL DL RL+Q+GER+LKS K + KSR  +I  KLK A+RKI+FIM
Sbjct: 314 DVVSILSLGGNALLCLLSDLYRLVQSGERELKSEKLQ-KSRKLEIKNKLKLAERKIYFIM 373

Query: 372 CWVHEQPSEVWSSLENIIKMEKSSIKEFENHKMSTKMDSKVKNKDKVLIQEIK 425
           CWVH+QP E WSSL  I++ EK+S  ++ N  +S K++ K +N+ KVLIQE++
Sbjct: 374 CWVHDQPGEAWSSLAAIVRAEKASAMDYIN--ISRKVEKKAENRGKVLIQEME 407

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ZNHI2_MOUSE  4.0e-16  26.22     Zinc finger HIT domain-containing protein 2 OS=Mus musculus GN=Znhit2 PE=2 SV=2
ZNHI2_HUMAN  3.4e-15  25.93     Zinc finger HIT domain-containing protein 2 OS=Homo sapiens GN=ZNHIT2 PE=1 SV=1
ZNHI2_BOVIN  1.3e-14  25.07     Zinc finger HIT domain-containing protein 2 OS=Bos taurus GN=ZNHIT2 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0KV24_CUCSA  2.5e-206  88.42     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G623610 PE=4 SV=1
E5GCK6_CUCME      1.3e-154  84.66     Aquarius OS=Cucumis melo subsp. melo PE=4 SV=1
W9S1B8_9ROSA      1.9e-145  67.31     Uncharacterized protein OS=Morus notabilis GN=L484_003644 PE=4 SV=1
D7TVP0_VITVI      1.0e-143  65.38     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g02330 PE=4 SV=...
A0A061G415_THECC  1.3e-143  64.00     HIT-type Zinc finger family protein OS=Theobroma cacao GN=TCM_015676 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G63830.1  1.6e-116  53.86     HIT-type Zinc finger family protein

Match Name                         E-value   Identity  Description
gi|778706714|ref|XP_011655899.1|   3.6e-206  88.42     PREDICTED: zinc finger HIT domain-containing protein 2 [Cucumis sativus]
gi|659092151|ref|XP_008446925.1|   2.2e-203  87.03     PREDICTED: zinc finger HIT domain-containing protein 2 [Cucumis melo]
gi|307136393|gb|ADN34203.1|        1.9e-154  84.66     aquarius [Cucumis melo subsp. melo]
gi|1009156119|ref|XP_015896076.1|  2.6e-148  67.06     PREDICTED: zinc finger HIT domain-containing protein 2-like [Ziziphus jujuba]
gi|703136374|ref|XP_010106140.1|   2.7e-145  67.31     hypothetical protein L484_003644 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR007529  Znf_HIT
IPR007009  SHQ1
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0000398      mRNA splicing, via spliceosome
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005681      spliceosomal complex
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi04G000470.1  Lsi04G000470.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description        Source   Source Term    Source Description                                                Alignment
IPR007009  SHQ1 protein           PFAM     PF04925        SHQ1                                                              coord: 221..338; score: 1.
IPR007529  Zinc finger, HIT-type  PFAM     PF04438        zf-HIT                                                            coord: 20..49; score: 4.5
IPR007529  Zinc finger, HIT-type  PROFILE  PS51083        ZF_HIT                                                            coord: 23..55; score: 10
None       No IPR available       PANTHER  PTHR15555      ZINC FINGER HIT DOMAIN CONTAINING PROTEIN 2 PROTEIN FON -RELATED  coord: 1..424; score: 8.3E
None       No IPR available       PANTHER  PTHR15555:SF0  ZINC FINGER HIT DOMAIN-CONTAINING PROTEIN 2                       coord: 1..424; score: 8.3E
None       No IPR available       unknown  SSF144232      HIT/MYND zinc finger-like                                         coord: 19..57; score: 1.46
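
The coord values in the alignment column can be used to pull the matched region out of the protein sequence. The sketch below does this for the two HIT-type zinc finger hits, assuming the coordinates are 1-based, inclusive residue positions (the record does not say so explicitly). Only the first 60 residues of the protein are included, and the function name `domain_slice` is illustrative.

```python
# First 60 residues of the Lsi04G000470 protein sequence from this record.
PROTEIN_PREFIX = "MEEEIRTSVDSSSSVNLPLRTICHVCHKQFSHYTCPRCNSRYCSLQCYKSHSNRCTESFM"

def domain_slice(protein, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    # Assumption: 1-based inclusive coordinates, hence start - 1 for Python slicing.
    return protein[start - 1:end]

print(domain_slice(PROTEIN_PREFIX, 20, 49))  # PF04438 (zf-HIT), coord: 20..49
print(domain_slice(PROTEIN_PREFIX, 23, 55))  # PS51083 (ZF_HIT), coord: 23..55
```

Both slices cover the cysteine- and histidine-rich stretch near the N-terminus, which is where a zinc finger would be expected.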