Lsi05G002220 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G002220
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: PIF / Ping-Pong family of plant transposases
Location: chr05 : 2991641 .. 2992600 (+)
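As a quick consistency check, the location field can be parsed to recover the feature length. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state this explicitly); `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr05 : 2991641 .. 2992600 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr05 : 2991641 .. 2992600 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
codons = length // 3

print(chrom, strand, length, codons)
```

Under that assumption the feature spans 960 nt, i.e. 320 codons: 319 residues plus a stop codon, which agrees with the 319-residue query length reported in the closest BLAST hits.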
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTACTTGAGCCCCTTCTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGGCTCGGTGTTGGTCTGTATCGCCTCGCCACCGGCTGCGATTTCTCCACAATCTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAACAATTGTGTCGAGTTCTTTGTACTAATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGACTTCCGAATTGCTGTGGCGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGATTCCACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAAATCTCAGTATGTTAAAGCTGGGTTGAATGAGGATTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA

mRNA sequence

ATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTACTTGAGCCCCTTCTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGGCTCGGTGTTGGTCTGTATCGCCTCGCCACCGGCTGCGATTTCTCCACAATCTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAACAATTGTGTCGAGTTCTTTGTACTAATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGACTTCCGAATTGCTGTGGCGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGATTCCACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAAATCTCAGTATGTTAAAGCTGGGTTGAATGAGGATTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA

Coding sequence (CDS)

ATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTACTTGAGCCCCTTCTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGGCTCGGTGTTGGTCTGTATCGCCTCGCCACCGGCTGCGATTTCTCCACAATCTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAACAATTGTGTCGAGTTCTTTGTACTAATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGACTTCCGAATTGCTGTGGCGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGATTCCACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAAATCTCAGTATGTTAAAGCTGGGTTGAATGAGGATTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA

Protein sequence

MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
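The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch using the standard genetic code; `translate`, `cds_start`, and `cds_end` are illustrative names, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGACCTCCTCAACGTTTGAATGGCTCTCT"  # first 30 nt of the CDS
cds_end = "CTTCACAGTTAA"                      # last 12 nt of the CDS (in frame)

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MTSSTFEWLS) and the last three residues (LHS) of the protein sequence, with the terminal TAA read as the stop codon.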
BLAST of Lsi05G002220 vs. TrEMBL
Match: A0A0A0LBX6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G202740 PE=4 SV=1)

HSP 1 Score: 636.0 bits (1639), Expect = 2.5e-179
Identity = 309/319 (96.87%), Positives = 317/319 (99.37%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES
Sbjct: 106 MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 165

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           VARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS
Sbjct: 166 VARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 225

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
           HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHG
Sbjct: 226 HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHG 285

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VAVN+YLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS
Sbjct: 286 VAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 345

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTN 300
           QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHKSQYV+AGLN DSTN
Sbjct: 346 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTN 405

Query: 301 EKASVIQRALALRARELHS 320
           EKASVIQRALALRARELHS
Sbjct: 406 EKASVIQRALALRARELHS 424
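The identity figures in an HSP header follow directly from the alignment. The sketch below counts identical columns in one aligned block and reproduces the reported percentages. The function names are hypothetical. The "Positives" count is taken from the header rather than recomputed, since that would need the substitution matrix used by the search, which the page does not name.

```python
def identity_stats(query, subject):
    """Count identical, non-gap columns in a pair of aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    return identical, len(query)

def percent(count, total):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * count / total, 2)

# Second block (query residues 61-120) of the Cucumis sativus alignment.
query = "VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS"
sbjct = "VARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS"

identical, aligned = identity_stats(query, sbjct)
print(identical, aligned)

# Whole-HSP figures taken from the header: 309/319 identical, 317/319 positive.
print(percent(309, 319), percent(317, 319))
```

In this block the only difference is the A-to-S substitution at query position 66, shown as `+` on the midline because it is a conservative replacement.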

BLAST of Lsi05G002220 vs. TrEMBL
Match: A0A067EX85_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013572mg PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.3e-118
Identity = 216/323 (66.87%), Positives = 254/323 (78.64%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           M+SSTF WLSGLLEPLL+CRDPVG PL+LS +IRLG+GL+RL  G  +S I+ +F V+ES
Sbjct: 105 MSSSTFRWLSGLLEPLLDCRDPVGLPLNLSADIRLGIGLFRLVNGSTYSEIATRFEVTES 164

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR-- 120
           V RFC KQLCRVLCTNFRFWV FP P EL L S +FE+L GLPNCCGV+ CTRFKII+  
Sbjct: 165 VTRFCVKQLCRVLCTNFRFWVAFPGPEELGLISKSFEELTGLPNCCGVIDCTRFKIIKID 224

Query: 121 --NSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPV 180
             NS   EDSIA Q+VVDSSSR+LSIVAG RGDK DS VL SSTL+KDIE+ +LL++ P+
Sbjct: 225 GSNSSKDEDSIAVQIVVDSSSRMLSIVAGIRGDKGDSRVLKSSTLYKDIEEKKLLNSSPI 284

Query: 181 YLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNW 240
            ++GVAV+QYL G G YPLLPWL+VPF  A  GS+EE+FN AH LM +PALKAI SL+NW
Sbjct: 285 CVNGVAVDQYLIGDGGYPLLPWLMVPFVDANPGSSEENFNAAHNLMRVPALKAIASLKNW 344

Query: 241 GVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVK-AGLN 300
           GVLS+PI E+FKTAVA IGACSILHNALLMREDFS + +E    +  D  SQY   A L 
Sbjct: 345 GVLSRPIDEDFKTAVALIGACSILHNALLMREDFSGLFEELGDYSLHDESSQYYSDASLE 404

Query: 301 EDSTNEKASVIQRALALRARELH 319
           E+ST +KAS I+ ALA RAR  H
Sbjct: 405 ENSTEKKASAIRSALATRARVQH 427

BLAST of Lsi05G002220 vs. TrEMBL
Match: F6HQ92_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00150 PE=4 SV=1)

HSP 1 Score: 431.8 bits (1109), Expect = 7.3e-118
Identity = 212/328 (64.63%), Positives = 253/328 (77.13%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MTSSTFEWLSGLLEPLL+CRDP+GSPL+L+ EIRLG+GL+RLATG D+  I+ +FGVSES
Sbjct: 114 MTSSTFEWLSGLLEPLLDCRDPIGSPLNLAPEIRLGIGLFRLATGSDYPEIARRFGVSES 173

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           + RFC KQLCRVLCTNFRFW+ FP P +L+  S++FE L GLPNCCGV+ CTRFKI+RN+
Sbjct: 174 ITRFCVKQLCRVLCTNFRFWIAFPSPIDLDSLSTSFEALTGLPNCCGVIDCTRFKIVRNN 233

Query: 121 HFY--------EDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD 180
            F         E+SIA Q+VVDSSSRILSIVAGFRGDK +S VL SSTL+KDIE G LL+
Sbjct: 234 GFKLSPKEEVREESIAAQIVVDSSSRILSIVAGFRGDKGESRVLKSSTLYKDIEGGSLLN 293

Query: 181 APPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVS 240
           APPVY++GV +NQYL G G YPLLPWL+VPF     GS EE+FN AH LM I AL+AI S
Sbjct: 294 APPVYMNGVGINQYLIGDGGYPLLPWLMVPFVDPAPGSYEENFNSAHHLMHISALRAIAS 353

Query: 241 LRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVK- 300
           L++WGVL Q I  EFK AVAYIG+C+ILHN LLMR+D+SA++D    L  +    QY + 
Sbjct: 354 LKDWGVLRQTIEGEFKMAVAYIGSCAILHNVLLMRDDYSALSD---GLGDYSQSPQYCRN 413

Query: 301 AGLNEDSTNEKASVIQRALALRARELHS 320
           A L E      ASVI+ ALA RAR+ HS
Sbjct: 414 ASLEESPIERNASVIRNALATRARKFHS 438

BLAST of Lsi05G002220 vs. TrEMBL
Match: A0A061E009_THECC (PIF / Ping-Pong family of plant transposases OS=Theobroma cacao GN=TCM_007086 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 2.2e-114
Identity = 204/319 (63.95%), Positives = 244/319 (76.49%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           M SSTFEWL+GLLEPLLECRDPVGSPL+LS E+RLG+GL+RLATG  +  I+ +FGVSES
Sbjct: 112 MKSSTFEWLAGLLEPLLECRDPVGSPLNLSAELRLGIGLFRLATGSSYPEIAQRFGVSES 171

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           V RFC K LCRVLCTNFRFWV FP P EL+  S +FE   GLPNCCGV+ CTRF I+  +
Sbjct: 172 VTRFCTKHLCRVLCTNFRFWVAFPSPEELKSVSLSFEQFTGLPNCCGVIDCTRFNIVNEN 231

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
           +   DS+A Q+VVDSSS+ILSIVAGF+GDK DS VL SSTL+KD+E+GRLL++ PV ++G
Sbjct: 232 NGSIDSVAAQIVVDSSSKILSIVAGFKGDKGDSRVLKSSTLYKDVEEGRLLNSSPVLVNG 291

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VA+NQYL G G YPLLPWL+VPF   V GS+E  FN AHR M + ALK I SL+NWG+L 
Sbjct: 292 VAINQYLVGDGAYPLLPWLMVPFVDVVPGSSEGKFNVAHRAMHVSALKTIASLKNWGILK 351

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQ-YVKAGLNEDST 300
           +P+ EE K AVA IGACSILHN LLMRED SA+ +        D  SQ Y +A L E+S 
Sbjct: 352 KPMEEELKAAVAIIGACSILHNILLMREDDSALCELVGDYLVHDQSSQCYGEASLEENSI 411

Query: 301 NEKASVIQRALALRARELH 319
            ++ASVI+ ALA  ARE H
Sbjct: 412 GKEASVIRDALATEAREAH 430

BLAST of Lsi05G002220 vs. TrEMBL
Match: B9GHA8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s31230g PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 8.6e-111
Identity = 200/325 (61.54%), Positives = 248/325 (76.31%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           M SSTFEWLSGLLEPLLECRDP+G+P++LS E+RLG+GL+RLATG  +  I+ +FGV+ES
Sbjct: 108 MRSSTFEWLSGLLEPLLECRDPIGTPINLSSELRLGIGLFRLATGSSYIEIAGRFGVTES 167

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           V RFCAKQLCRVLCTNFRFW+ FP   EL+L S   E L GLPNCCGV+ CTRF +++ +
Sbjct: 168 VTRFCAKQLCRVLCTNFRFWIAFPTSTELQLVSKDIEGLTGLPNCCGVIDCTRFNVVKRN 227

Query: 121 --------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD 180
                      +DSIA Q+VVDSSSRILSI+AGFRGDK+DS +L S+TL  DIE  RLL+
Sbjct: 228 DCKLASDDEVQDDSIAVQIVVDSSSRILSIIAGFRGDKNDSRILKSTTLCHDIEGRRLLN 287

Query: 181 APPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVS 240
           A PV ++GVA++QYL G G YPLLPWL+VPF   V GS+EE FN A+ LM + AL+ I S
Sbjct: 288 ATPVIVNGVAIDQYLIGDGGYPLLPWLMVPFVDVVPGSSEEKFNAANNLMHVFALRTIAS 347

Query: 241 LRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA 300
           L+NWGVL++P+ EEFKTAVA+IGACSILHN LLMRED SA+ D  E  + +D  SQ+ K 
Sbjct: 348 LKNWGVLNKPVEEEFKTAVAFIGACSILHNVLLMREDDSALIDV-EDYSLYDQDSQFYKD 407

Query: 301 GLNEDS-TNEKASVIQRALALRARE 317
            + E++ T +KAS  +RALA R  E
Sbjct: 408 AMTEENLTEKKASDTRRALATRVTE 431

BLAST of Lsi05G002220 vs. TAIR10
Match: AT3G55350.1 (AT3G55350.1 PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 119.8 bits (299), Expect = 3.1e-27
Identity = 82/294 (27.89%), Positives = 140/294 (47.62%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQF 60
           ++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + F
Sbjct: 78  ISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND--RVAVALRRLGSGESLSVIGETF 137

Query: 61  GVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVS 120
           G+++S       RF      R +  +   W     P++L+   S FE ++GLPNCCG + 
Sbjct: 138 GMNQSTVSQITWRFVESMEERAI--HHLSW-----PSKLDEIKSKFEKISGLPNCCGAID 197

Query: 121 CTRFKIIRNSHFYEDS------------IATQLVVDSSSRILSIVAGFRGDKDDSTVLMS 180
            T   I+ N    E S            +  Q VVD   R L ++AG+ G  +D  VL +
Sbjct: 198 ITH--IVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDVVLKN 257

Query: 181 STLFKDIEQGRLLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNE 240
           S  +K +E+G+ L+   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+
Sbjct: 258 SGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSLPQTEFNK 317

Query: 241 AHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRED 270
            H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  ED
Sbjct: 318 RHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIIDMED 360

BLAST of Lsi05G002220 vs. TAIR10
Match: AT3G63270.1 (AT3G63270.1 Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 119.0 bits (297), Expect = 5.3e-27
Identity = 83/285 (29.12%), Positives = 132/285 (46.32%), Query Frame = 1

Query: 2   TSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV 61
           + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV
Sbjct: 72  SKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGV 131

Query: 62  SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTR---- 121
            +S       +    L    +  + +P  + +E   S FE++ GLPNCCG +  T     
Sbjct: 132 GQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMT 191

Query: 122 FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQG 181
              ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  
Sbjct: 192 LPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENA 251

Query: 182 RLLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPAL 241
           ++LD  P  L  G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A 
Sbjct: 252 QILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAA 311

Query: 242 KAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF 271
            A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Sbjct: 312 TAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDY 356

BLAST of Lsi05G002220 vs. TAIR10
Match: AT3G19120.1 (AT3G19120.1 PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 92.8 bits (229), Expect = 4.1e-19
Identity = 64/252 (25.40%), Positives = 111/252 (44.05%), Query Frame = 1

Query: 25  SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEF 84
           S L L  +  + + L RLA GC   T++ ++ +   +       + R+L T  +  +++ 
Sbjct: 142 SNLSLPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKITNMVTRLLATKLYPEFIKI 201

Query: 85  PC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSH----------FYEDSIATQLV 144
           P     L  T+  FE+L  LPN CG +  T  K+ R +           +  D++  Q+V
Sbjct: 202 PVGKRRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNIYGCKYGYDAVLLQVV 261

Query: 145 VDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE 204
            D       +     G +DDS+    S L+K +  G ++    + + G  V  Y+ G   
Sbjct: 262 ADHKKIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIVWEKVINIRGHHVRPYIVGDWC 321

Query: 205 YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTA 263
           YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A
Sbjct: 322 YPLLSFLMTPFSPNGSGTPPENLFDGMLMKGRSVVVEAIGLLKARWKIL-QSLNVGVNHA 381

BLAST of Lsi05G002220 vs. TAIR10
Match: AT4G29780.1 (AT4G29780.1 unknown protein)

HSP 1 Score: 86.3 bits (212), Expect = 3.8e-17
Identity = 75/295 (25.42%), Positives = 124/295 (42.03%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQF 60
           M+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++F
Sbjct: 219 MSKSTFNLICEELDTTVTKKNTMLRDAIPAPK------RVGVCVWRLATGAPLRHVSERF 278

Query: 61  GVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSC 120
           G+  S       ++CR    VL   +  W   P  +E+  T + FE +  +PN  G +  
Sbjct: 279 GLGISTCHKLVIEVCRAIYDVLMPKYLLW---PSDSEINSTKAKFESVHKIPNVVGSIYT 338

Query: 121 TRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVL 180
           T   II          N    E       SI  Q VV++      +  G  G   D  +L
Sbjct: 339 THIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQIL 398

Query: 181 MSSTLFKD-IEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESF 240
             S+L +    +G L D+            ++ G+  +PL  +L+VP+       T+ +F
Sbjct: 399 EKSSLSRQRAARGMLRDS------------WIVGNSGFPLTDYLLVPYTRQNLTWTQHAF 458

Query: 241 NEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED 270
           NE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Sbjct: 459 NESIGEIQGIATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNICEMRKE 492

BLAST of Lsi05G002220 vs. NCBI nr
Match: gi|778688571|ref|XP_011652780.1| (PREDICTED: uncharacterized protein LOC101203312 [Cucumis sativus])

HSP 1 Score: 636.0 bits (1639), Expect = 3.6e-179
Identity = 309/319 (96.87%), Positives = 317/319 (99.37%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES
Sbjct: 106 MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 165

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           VARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS
Sbjct: 166 VARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 225

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
           HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHG
Sbjct: 226 HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHG 285

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VAVN+YLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS
Sbjct: 286 VAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 345

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTN 300
           QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHKSQYV+AGLN DSTN
Sbjct: 346 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTN 405

Query: 301 EKASVIQRALALRARELHS 320
           EKASVIQRALALRARELHS
Sbjct: 406 EKASVIQRALALRARELHS 424

BLAST of Lsi05G002220 vs. NCBI nr
Match: gi|659112261|ref|XP_008456140.1| (PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo])

HSP 1 Score: 629.4 bits (1622), Expect = 3.4e-177
Identity = 306/319 (95.92%), Positives = 315/319 (98.75%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES
Sbjct: 63  MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 122

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           VARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS
Sbjct: 123 VARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 182

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
           HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHG
Sbjct: 183 HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHG 242

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VAVN+YLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS
Sbjct: 243 VAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 302

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTN 300
           QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYV+AGLN DSTN
Sbjct: 303 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTN 362

Query: 301 EKASVIQRALALRARELHS 320
           EKASVIQRALA RARELHS
Sbjct: 363 EKASVIQRALAQRARELHS 381

BLAST of Lsi05G002220 vs. NCBI nr
Match: gi|657963140|ref|XP_008373170.1| (PREDICTED: putative nuclease HARBI1 [Malus domestica])

HSP 1 Score: 443.7 bits (1140), Expect = 2.7e-121
Identity = 215/316 (68.04%), Positives = 252/316 (79.75%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MTSSTFEWL GLLEPLLECRDPVGSPL+LS ++RLG+GL+RL+TG  +  IS QFGVSE 
Sbjct: 105 MTSSTFEWLCGLLEPLLECRDPVGSPLNLSADLRLGMGLFRLSTGSSYPEISKQFGVSEM 164

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           VARFCAKQLCRVLCTN+RFW+EFP P EL+  S+AFE   GLPNCCGV+ CTRFKI+RN 
Sbjct: 165 VARFCAKQLCRVLCTNYRFWIEFPNPXELDSVSAAFETQTGLPNCCGVIDCTRFKIVRNG 224

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
              E+SIA Q+ VDSSSRILSIVAGFRG+K DS VL SSTL+KDIE G+LL++PP  ++G
Sbjct: 225 GVQEESIAAQITVDSSSRILSIVAGFRGNKGDSRVLRSSTLYKDIEAGKLLNSPPASVNG 284

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VAVNQYL G G YPLLPWL+VPF  AV GS EE FN AH +M + AL+ IVSL+NWGVLS
Sbjct: 285 VAVNQYLIGDGGYPLLPWLMVPFVDAVKGSPEEHFNAAHNVMRLSALRTIVSLKNWGVLS 344

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVK-AGLNEDST 300
           +PI EE K AVAYIGACSILHN LL REDFSA+ D  +  + +D  SQY +   L E+S 
Sbjct: 345 RPIQEEMKMAVAYIGACSILHNGLLRREDFSALCDGLDDYSLYDQSSQYYRDTSLEENSI 404

Query: 301 NEKASVIQRALALRAR 316
             KASVI+ ALA +A+
Sbjct: 405 ERKASVIRSALATKAK 420

BLAST of Lsi05G002220 vs. NCBI nr
Match: gi|645261815|ref|XP_008236474.1| (PREDICTED: putative nuclease HARBI1 [Prunus mume])

HSP 1 Score: 441.8 bits (1135), Expect = 1.0e-120
Identity = 216/320 (67.50%), Positives = 253/320 (79.06%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           MT STFEWL GLLEPLLECRDPVG PL+LS E+RLG+GL+RL+TG  +  IS QFGVSE 
Sbjct: 107 MTYSTFEWLCGLLEPLLECRDPVGLPLNLSAELRLGIGLFRLSTGSSYPEISKQFGVSEP 166

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS 120
           VARFCAKQLCRVLCTN+RFW+EFP PNEL   S+AF    GLPNCCGV+ CTRFK ++N 
Sbjct: 167 VARFCAKQLCRVLCTNYRFWIEFPNPNELASVSAAFGSQTGLPNCCGVIDCTRFKTVKNG 226

Query: 121 HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG 180
            F+E+SIA Q++VDSSSRILSIVAGFRG+K DS VL SSTL+KDIE GRLL++PPV + G
Sbjct: 227 GFHEESIAAQIMVDSSSRILSIVAGFRGNKGDSRVLKSSTLYKDIEAGRLLNSPPVNVDG 286

Query: 181 VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS 240
           VAVNQYL G   YPLLPWL+VPF  A  GS+EE FN AH LM + AL+ IVSL++WG+LS
Sbjct: 287 VAVNQYLIGDEGYPLLPWLMVPFVDAAKGSSEEHFNAAHNLMRLSALRTIVSLKSWGILS 346

Query: 241 QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVK-AGLNEDST 300
           QPI EEFK AVAYIGACSILHN LL REDFSAM D  +  + +D  SQY +   L E+S 
Sbjct: 347 QPIQEEFKMAVAYIGACSILHNGLLRREDFSAMCDV-DDYSLYDQSSQYYRDTSLEENSI 406

Query: 301 NEKASVIQRALALRARELHS 320
             KASVI+ ALA +A+E  +
Sbjct: 407 ERKASVIRTALAAKAKEFQN 425

BLAST of Lsi05G002220 vs. NCBI nr
Match: gi|568867441|ref|XP_006487046.1| (PREDICTED: putative nuclease HARBI1 [Citrus sinensis])

HSP 1 Score: 432.6 bits (1111), Expect = 6.1e-118
Identity = 216/323 (66.87%), Positives = 254/323 (78.64%), Query Frame = 1

Query: 1   MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES 60
           M+SSTF WLSGLLEPLL+CRDPVG PL+LS +IRLG+GL+RL  G  +S I+ +F V+ES
Sbjct: 105 MSSSTFRWLSGLLEPLLDCRDPVGLPLNLSADIRLGIGLFRLVNGSTYSEIATRFEVTES 164

Query: 61  VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR-- 120
           V RFC KQLCRVLCTNFRFWV FP P EL L S +FE+L GLPNCCGV+ CTRFKII+  
Sbjct: 165 VTRFCVKQLCRVLCTNFRFWVAFPGPEELGLISKSFEELTGLPNCCGVIDCTRFKIIKID 224

Query: 121 --NSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPV 180
             NS   EDSIA Q+VVDSSSR+LSIVAG RGDK DS VL SSTL+KDIE+ +LL++ P+
Sbjct: 225 GSNSSKDEDSIAVQIVVDSSSRMLSIVAGIRGDKGDSRVLKSSTLYKDIEEKKLLNSSPI 284

Query: 181 YLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNW 240
            ++GVAV+QYL G G YPLLPWL+VPF  A  GS+EE+FN AH LM +PALKAI SL+NW
Sbjct: 285 CVNGVAVDQYLIGDGGYPLLPWLMVPFVDANPGSSEENFNAAHNLMRVPALKAIASLKNW 344

Query: 241 GVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVK-AGLN 300
           GVLS+PI E+FKTAVA IGACSILHNALLMREDFS + +E    +  D  SQY   A L 
Sbjct: 345 GVLSRPIDEDFKTAVALIGACSILHNALLMREDFSGLFEELGDYSLHDESSQYYSDASLE 404

Query: 301 EDSTNEKASVIQRALALRARELH 319
           E+ST +KAS I+ ALA RAR  H
Sbjct: 405 ENSTEKKASAIRSALATRARVQH 427

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A0A0LBX6_CUCSA | 2.5e-179 | 96.87 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G202740 PE=4 SV=1
A0A067EX85_CITSI | 4.3e-118 | 66.87 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013572mg PE=4 SV=1
F6HQ92_VITVI | 7.3e-118 | 64.63 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00150 PE=4 SV=...
A0A061E009_THECC | 2.2e-114 | 63.95 | PIF / Ping-Pong family of plant transposases OS=Theobroma cacao GN=TCM_007086 PE...
B9GHA8_POPTR | 8.6e-111 | 61.54 | Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s31230g PE=4 SV=1

Match Name | E-value | Identity | Description
AT3G55350.1 | 3.1e-27 | 27.89 | PIF / Ping-Pong family of plant transposases
AT3G63270.1 | 5.3e-27 | 29.12 | Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT3G19120.1 | 4.1e-19 | 25.40 | PIF / Ping-Pong family of plant transposases
AT4G29780.1 | 3.8e-17 | 25.42 | unknown protein

Match Name | E-value | Identity | Description
gi|778688571|ref|XP_011652780.1| | 3.6e-179 | 96.87 | PREDICTED: uncharacterized protein LOC101203312 [Cucumis sativus]
gi|659112261|ref|XP_008456140.1| | 3.4e-177 | 95.92 | PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]
gi|657963140|ref|XP_008373170.1| | 2.7e-121 | 68.04 | PREDICTED: putative nuclease HARBI1 [Malus domestica]
gi|645261815|ref|XP_008236474.1| | 1.0e-120 | 67.50 | PREDICTED: putative nuclease HARBI1 [Prunus mume]
gi|568867441|ref|XP_006487046.1| | 6.1e-118 | 66.87 | PREDICTED: putative nuclease HARBI1 [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR027806 | HARBI1_dom
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi05G002220.1 | Lsi05G002220.1 | mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR027806 | Harbinger transposase-derived nuclease domain | PFAM | PF13359 | DDE_Tnp_4 | coord: 122..262; score: 3.6
None | No IPR available | PANTHER | PTHR22930 | UNCHARACTERIZED | coord: 1..314; score: 3.6E
None | No IPR available | PANTHER | PTHR22930:SF49 | SUBFAMILY NOT NAMED | coord: 1..314; score: 3.6E
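The `coord` values can be used to pull the annotated region out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive positions on the protein (the page does not say so explicitly); `subsequence` is a hypothetical helper, and the protein is written in 60-residue chunks only for readability.

```python
# Protein sequence of Lsi05G002220, split into 60-residue chunks.
PROTEIN = (
    "MTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES"
    "VARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS"
    "HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHG"
    "VAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLS"
    "QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTN"
    "EKASVIQRALALRARELHS"
)

def subsequence(seq, start, end):
    """Return seq[start..end] for 1-based, inclusive coordinates (assumed convention)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]

# PFAM PF13359 (DDE_Tnp_4) hit, coord: 122..262
domain = subsequence(PROTEIN, 122, 262)
print(len(domain), domain[:5], domain[-4:])
```

Under that assumption the PF13359 match covers 141 of the 319 residues, and the PANTHER matches (1..314) cover nearly the whole protein.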

The following gene(s) are paralogous to this gene:

None