Lag0031908 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0031908
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: H15 domain-containing protein
Location: chr11: 18429087 .. 18431674 (-)
RNA-Seq Expression: Lag0031908
Synteny: Lag0031908
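
As a quick sanity check on the Location field, the span length can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common display convention but is not stated on this page. The helper name `parse_location` and the accepted pattern are hypothetical, modelled on the single location string shown here.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr11: 18429087 .. 18431674 (-)'.

    Returns (chromosome, start, end, strand). The pattern is an assumption
    based on the one example shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr11: 18429087 .. 18431674 (-)")
# Assuming 1-based inclusive coordinates, the span covers end - start + 1 bases.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 2588 bases on the minus strand of chr11.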
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTAAGTCAGAAAACCGAGGAGAAGCTACAGATAGTAAAGGGAAGAGTCAAGAATCGAGTTTCGGAACCCTTCTTCCATGGTGAAGAGGGGGTTTTATACCTGTTTGTCCTCCTAGGGTTTTTAGGAATTCGGTGGTGTTTCGGGGCGAACCAGGCGAAACCGGGGCGATTCGGGGCATCAGGGACCGAAAAGAGGTCACCGGGCTCGGCCCGCGCAAACGGGCCGAATGGTCGGCCTCGGCCTTTTGCCGAGGCCGACCATACGGGTCGGGTCATTTTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGGTCGGTCTTTTGGTCCTACCTCTGCCCGATTGTCCTCGTCAGCTTCTTGTCCATCTGTGTGGTCCAAAATCACCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGGTGCGACTTTAATGCTATAGGCCGAGCTCCTTAAGCCAAATTTAAGAGTACAACTCTCGTCTCGTGCTGACTTTTGTATGACTTCTTTGTTATCCTGCAAGAGAGGGCGAGCTATGCAAAGTAAAAACTCCCTGAGTAGGGGTTAGGTGAAACGGAAATGAGAAGAGTGATAAGTTACAGAAGTGGCCGGAGGGCCTCTTTGGTCGGCCTCGGCCTCGGGAAGAGGCCGAGCACAACATTTTTCATTTTAGCTTGGTCGTTCGGAGTGCTTCATTCCATAATTAACAATAGGAAAAACAGAAAACTTCACAACCCTGCATTGAAAAGTTAATTCCTGCAAAAGAAAAACAAGGGAGGAGGCCCGGATCGGCCTGGGCCTCTAGCTCCTGTTAGCTCAGTGCTTAGCCCCCTGCATTCATATGTCAGTCCTTACTGGAAAATAATGGGAAAGTTGCATGATGATTACAACAAAGGAACGGAATAGCCTTAACCTACCCCTTATTCCCAGCCTCCACTCGACATGGGCATTTATTACCTCAAGCGTTGCATTTTTCGAGCAAGAAGGACTAGTTCCATACCTGGCCTTTTCGAAGGTGGATGGTCGGCCTCGGCCATGGCATGAGGCTGACCAGCAGCTCTGTACTATTTGCTCGCAGACGTCCAGGCTTATGTGCTAAAAAGTAAAGTGACTGTGCTAAAGTCACGCTGCCTTGAATGTGTGCCTGATGCTTGGCCTTGGAGTTGGGATGCTTGGCCTTGGCATGACGATGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCTCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATAGTCATCCTAGGGGTGTGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCTCGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGAGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTAGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCAGAGC
CTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTATCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGCTGTATTCATCCCGAGGGTGCGTACACTCAGTCGGGGGAAGTGCAGAAGTCTGGCCTCTACAAGGATCATTCTCGTCTGGCCACACGCGTTACATCCACGAAGAAACGCCACGGACATGTGTTATACAACACGAAAGGAAAATCAAAACTGTCGCCACAAGACCGTGGACTCGACCTGTCCTTACCCCTACCCCCCACTCAAAATGTGCATTCAATATCTTGGACATGCATTTTTTGA

mRNA sequence

ATGCCTAAGTCAGAAAACCGAGGAGAAGCTACAGATAGTAAAGGGAAGAGTCAAGAATCGAGTTTCGGAACCCTTCTTCCATGGAATTCGGTGGTGTTTCGGGGCGAACCAGGCGAAACCGGGGCGATTCGGGGCATCAGGGACCGAAAAGAGGTCACCGGGCTCGGCCCGCGCAAACGGGCCGAATGGTCGGCCTCGGCCTTTTGCCGAGGCCGACCATACGGGTCGGGTCATTTTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGGTCGGTCTTTTGGTCCTACCTCTGCCCGATTGTCCTCGTCAGCTTCTTGTCCATCTGTGTGGTCCAAAATCACCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGACGTCCAGGCTTATGTGCTAAAAAGTAAAGTGACTGTGCTAAAGTCACGCTGCCTTGAATGTGTGCCTGATGCTTGGCCTTGGAGTTGGGATGCTTGGCCTTGGCATGACGATGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCTCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATAGTCATCCTAGGGGTGTGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCTCGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGAGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTAGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTATCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGCTGTATTCATCCCGAGGGTGCGTACACTCAGTCGGGGGAAGTGCAGAAGTCTGGCCTCTACAAGGATCATTCTCGTCTGGCCACACGCGTTACATCCACGAAGAAACGCCACGGACATGTGTTATACAACACGAAAGGAAAATCAAAACTGTCGCCACAAGACCGTGGACTCGACCTGTCCTTACCCCTACCCCCCACTCAAAATGTGCATTCAATATCTTGGACATGCATTTTTTGA

Coding sequence (CDS)

ATGCCTAAGTCAGAAAACCGAGGAGAAGCTACAGATAGTAAAGGGAAGAGTCAAGAATCGAGTTTCGGAACCCTTCTTCCATGGAATTCGGTGGTGTTTCGGGGCGAACCAGGCGAAACCGGGGCGATTCGGGGCATCAGGGACCGAAAAGAGGTCACCGGGCTCGGCCCGCGCAAACGGGCCGAATGGTCGGCCTCGGCCTTTTGCCGAGGCCGACCATACGGGTCGGGTCATTTTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGGTCGGTCTTTTGGTCCTACCTCTGCCCGATTGTCCTCGTCAGCTTCTTGTCCATCTGTGTGGTCCAAAATCACCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGACGTCCAGGCTTATGTGCTAAAAAGTAAAGTGACTGTGCTAAAGTCACGCTGCCTTGAATGTGTGCCTGATGCTTGGCCTTGGAGTTGGGATGCTTGGCCTTGGCATGACGATGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCTCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATAGTCATCCTAGGGGTGTGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCTCGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGAGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTAGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTATCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGCTGTATTCATCCCGAGGGTGCGTACACTCAGTCGGGGGAAGTGCAGAAGTCTGGCCTCTACAAGGATCATTCTCGTCTGGCCACACGCGTTACATCCACGAAGAAACGCCACGGACATGTGTTATACAACACGAAAGGAAAATCAAAACTGTCGCCACAAGACCGTGGACTCGACCTGTCCTTACCCCTACCCCCCACTCAAAATGTGCATTCAATATCTTGGACATGCATTTTTTGA

Protein sequence

MPKSENRGEATDSKGKSQESSFGTLLPWNSVVFRGEPGETGAIRGIRDRKEVTGLGPRKRAEWSASAFCRGRPYGSGHFGPTLWSGLPLGRSFGPTSARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCCIHPEGAYTQSGEVQKSGLYKDHSRLATRVTSTKKRHGHVLYNTKGKSKLSPQDRGLDLSLPLPPTQNVHSISWTCIF
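
The relationship between the CDS and the protein sequence above can be spot-checked by translating codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is an illustrative helper, not part of the database's tooling, and only the first 30 and last 18 bases of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGCCTAAGTCAGAAAACCGAGGAGAAGCT"  # first 30 bases of the CDS
cds_end = "TGGACATGCATTTTTTGA"                # last 18 bases of the CDS
print(translate(cds_start))  # MPKSENRGEA
print(translate(cds_end))    # WTCIF*
```

Both fragments agree with the ends of the protein sequence listed above (MPKSENRGEA... and ...WTCIF), with the final TGA codon acting as the stop.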
Homology
BLAST of Lag0031908 vs. NCBI nr
Match: XP_033736132.1 (uncharacterized protein LOC117324396 [Pecten maximus])

HSP 1 Score: 97.1 bits (240), Expect = 5.9e-16
Identity = 89/355 (25.07%), Positives = 130/355 (36.62%), Query Frame = 0

Query: 177 RKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFP 236
           R   + + +CH+ +RG YT     L   R   + + +CH+  RG YT             
Sbjct: 9   RGMYTSTCTCHTEVRGIYTSTCTCLTEVRGIYTSTCTCHTEVRGIYT------------- 68

Query: 237 SLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSS 296
             + +CH+  RG+YT     L   R   + + +CH+  RG YT               + 
Sbjct: 69  -STCTCHTEVRGMYTYTCTCLTEVRGIYTSTCTCHTEVRGIYT--------------STC 128

Query: 297 SCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHG 356
           +CH   RG YT     L   R   + + +CH   RG YT     L   R     + +C  
Sbjct: 129 TCHTEVRGIYTSTCTCLTEVRGMYTSTCTCHTEVRGIYTSTCTCLTEVRGIYISTCTCLT 188

Query: 357 HPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRG 416
             RG YT         R     + +C    RG YT     L   R   + + +C     G
Sbjct: 189 EVRGIYTSTCTCHTEVRGIYISTCTCLTEVRGIYTPTCTCLTEVRGIYTSTCTCLTEVHG 248

Query: 417 AYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTL 476
            YT     L   R   + + +C     G YT     L   R   + + +C    RG YT 
Sbjct: 249 IYTSTCTCLTEVRGIYTSTCTCLTEVHGIYTSTCTCLTEVRGIYTSTCTCLTEVRGIYTS 308

Query: 477 LGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYT 532
               L   R   + + + H   RG YT     L   R   + + +C    RG YT
Sbjct: 309 TCTCLTEVRGIYTSTCTCHTEVRGIYTSTCTCLTEVRGIYTSTCTCLTEVRGIYT 335
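
The percentages on each HSP summary line are simply the raw counts divided by the alignment length. The sketch below recomputes them from one such line. The regular expression and the function name `hsp_percentages` are assumptions made for illustration, and the pattern also tolerates a "Postives" misspelling in case the input carries one.

```python
import re

# Pattern for the HSP summary lines in this report; "Posi?tives" accepts
# both the "Positives" and "Postives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 89/355 (25.07%), Positives = 130/355 (36.62%), Query Frame = 0"
print(hsp_percentages(line))  # (25.07, 36.62)
```

The recomputed values match the percentages printed in the report for the Pecten maximus hit.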

BLAST of Lag0031908 vs. NCBI nr
Match: PWV06054.1 (hypothetical protein C3747_120g5 [Trypanosoma cruzi])

HSP 1 Score: 73.6 bits (179), Expect = 7.0e-09
Identity = 81/317 (25.55%), Positives = 128/317 (40.38%), Query Frame = 0

Query: 97  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSW 156
           SA  SS + C + +S    ++      + +   +  L S      +  L     A+  + 
Sbjct: 242 SATASSLSLCSTAFSATASSLSFCSTAFSATASSLSLCSTAFSATASSLSLCSTAFSATA 301

Query: 157 DAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHS 216
            +  +   A++    SL       S ++S  S    A      SL       S ++S  S
Sbjct: 302 PSLSFCSTAFSATASSLSLGSTAFSATASSLSLCTWAIAATASSLSLCSTAFSATASSLS 361

Query: 217 HPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSC 276
               A++    SL        PR+ PS S++ HS PR   +L      + R+  S S++ 
Sbjct: 362 LCSTAFSATASSLSSAAQHSPPRRHPSPSAAQHSPPRRHPSLSAPGQSQSRRHPSPSAAQ 421

Query: 277 HSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP 336
           HS               PR+ PS S++ H  PR   +L   G  + R+  SLS++ H  P
Sbjct: 422 HS--------------LPRRHPSPSAAQHSLPRRHPSLSAPGQSQSRRHPSLSAAKHSPP 481

Query: 337 RGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY 396
           R   +    G   PR+  S S++ H  PR   +L   G   PR+ PS S++ H  PR   
Sbjct: 482 RRHPSPSAPGQSPPRRHPSPSAAQHSLPRRHPSLSAPG-QPPRRHPSPSAAQHSLPRRHP 541

Query: 397 TLLGKGLGRPRKFPSLS 408
           +        PR+ PSLS
Sbjct: 542 SPSAAQHSLPRRHPSLS 543

BLAST of Lag0031908 vs. NCBI nr
Match: BAK05521.1 (predicted protein, partial [Hordeum vulgare subsp. vulgare])

HSP 1 Score: 70.1 bits (170), Expect = 7.8e-08
Identity = 97/312 (31.09%), Positives = 124/312 (39.74%), Query Frame = 0

Query: 243 HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHG 302
           +SH   +     +  GR R    L S+      GA +L  KG GRPRK   L SS     
Sbjct: 291 YSHSYMLRAKRKRKRGRGRSLKLLVSA----NAGASSLTKKGRGRPRKKQKLGSSQAAFQ 350

Query: 303 HPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYTLLRKGLGRPRKFQSLSSSCH 362
           + +G ++   +G GRPRK  S       G PR    G  T +++G GRPRK +S      
Sbjct: 351 NGQGGFSTPKRGRGRPRKDLSTGVKRGRGRPRKDAQGLSTSVKRGRGRPRKEES------ 410

Query: 363 GHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPR 422
              +G  T +  G GRPRK  S          G  T L  G GRPRK P   S+      
Sbjct: 411 ---QGMSTGVKSGRGRPRKEES---------EGMSTGLKSGRGRPRKEPEEMSTGVTETD 470

Query: 423 GAYTLLGKGLGRPRKFQSLSSSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGA 482
            + T             SL+ S    G P      L +G G P    +  ++     RG 
Sbjct: 471 SSTT-----ASDSESDSSLTGSDTESGSP-NVSRWLKEGFGNPTTLATPPAAVRPVSRG- 530

Query: 483 YTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY 542
              L  G  RP   R  P   S+       A T + K LGRPRK  S  +       G  
Sbjct: 531 ---LNIGSLRPTIERPPPPPRSAGTALDTVASTGMKKPLGRPRKKRSAKAVSAETGDGGS 570

BLAST of Lag0031908 vs. NCBI nr
Match: XP_020199045.1 (collagen alpha-1(I) chain [Aegilops tauschii subsp. strangulata])

HSP 1 Score: 68.6 bits (166), Expect = 2.3e-07
Identity = 93/304 (30.59%), Positives = 120/304 (39.47%), Query Frame = 0

Query: 274 PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC 333
           P  AY L+  G   GRPRK         G PR        ++  KG GRPRK Q L SS 
Sbjct: 286 PARAYALIVHGAKRGRPRKRKRKRGP--GRPRKLLLSANSSVTKKGRGRPRKKQKLGSSL 345

Query: 334 ----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSC 393
               +G P G ++  ++G GRPRK            +G  T + +G GRPRK        
Sbjct: 346 AAFQNGTPGGGFSTPKRGRGRPRK----------DAQGLSTRVKRGRGRPRK-------- 405

Query: 394 HGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSS 453
               +G  T + KG GRPRK             G  T +  G GRPRK       ++ SS
Sbjct: 406 --DAQGLSTGVKKGRGRPRK----------ESEGMSTGVKSGRGRPRKEPEDGVPEADSS 465

Query: 454 SCHGHPRGAYTLLG---------------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGL 513
                     +L G               +G G P    +  ++     RG    L  G 
Sbjct: 466 ETESDAESDSSLTGSDTESGSPDVSRWLKEGFGNPTTIATPPAAVRPASRG----LNIGS 525

Query: 514 GRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG 543
            RP  + P  ++S  G      A T + K LGRPRK  S  +       GA T + K  G
Sbjct: 526 LRPTIERPPAAASSPGTLVYSAASTGMKKPLGRPRKKGSSKAPAEA-GGGASTGIKKPRG 552

BLAST of Lag0031908 vs. NCBI nr
Match: KAF6998140.1 (hypothetical protein CFC21_014287 [Triticum aestivum])

HSP 1 Score: 68.6 bits (166), Expect = 2.3e-07
Identity = 93/304 (30.59%), Positives = 120/304 (39.47%), Query Frame = 0

Query: 274 PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC 333
           P  AY L+  G   GRPRK         G PR        ++  KG GRPRK Q L SS 
Sbjct: 186 PARAYALIVHGAKRGRPRKRKRKRGP--GRPRKLLLSANSSVTKKGRGRPRKKQKLGSSL 245

Query: 334 ----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSC 393
               +G P G ++  ++G GRPRK            +G  T + +G GRPRK        
Sbjct: 246 AAFQNGTPGGGFSTPKRGRGRPRK----------DAQGLSTRVKRGRGRPRK-------- 305

Query: 394 HGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSS 453
               +G  T + KG GRPRK             G  T +  G GRPRK       ++ SS
Sbjct: 306 --DAQGLSTGVKKGRGRPRK----------ESEGMSTGVKSGRGRPRKEPEDGVPEADSS 365

Query: 454 SCHGHPRGAYTLLG---------------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGL 513
                     +L G               +G G P    +  ++     RG    L  G 
Sbjct: 366 ETESDAESDSSLTGSDTESGSPDVSRWLKEGFGNPTTIATPPAAVRPASRG----LNIGS 425

Query: 514 GRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG 543
            RP  + P  ++S  G      A T + K LGRPRK  S  +       GA T + K  G
Sbjct: 426 LRPTIERPPAAASSPGTLVYSAASTGMKKPLGRPRKKGSSKAPAEA-GGGASTGIKKPRG 452

BLAST of Lag0031908 vs. ExPASy TrEMBL
Match: A0A6Q2ZHZ9 (Uncharacterized protein OS=Esox lucius OX=8010 PE=3 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 4.6e-14
Identity = 123/389 (31.62%), Positives = 163/389 (41.90%), Query Frame = 0

Query: 171 KSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLG 230
           K +   +K +SLS S    L  + +   +  GRP K  SLS S       + T   +  G
Sbjct: 1   KQVDNDQKLISLSKS-DEILSNSSSTQKRGRGRPPKERSLSKSDEILSNSSST-QKRGRG 60

Query: 231 RPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRK 290
           RP K  SLS S         T   +  GRP K  SLS S       + T   +G GRP K
Sbjct: 61  RPPKERSLSKSDEILSNSSST-QKRGRGRPPKERSLSKSDEILSNSSST-QKRGRGRPPK 120

Query: 291 FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSL 350
             SLS S       + T   +G GRP K +SLS S      G+ + L++G GR  K +S+
Sbjct: 121 ERSLSKSDEILSNSSST-QKRGRGRPPKERSLSKSDEISSNGSLSTLKRGRGRLPKERSV 180

Query: 351 SSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSC 410
           S S       + T   KG GRP K  S+S S       + T L +G GRP K  S+S S 
Sbjct: 181 SKSDEILSNSS-TTPKKGRGRPPKESSVSKSDEILSNSSST-LKRGRGRPPKERSISKSD 240

Query: 411 HGHPRGAYTLLGKGLGRPRKFQSLSSSC-------HGHPRGAYTLLGKGLGRPRKF---- 470
              P G  +   KG GRP+   S   +         G P+ +     K  GRP+K     
Sbjct: 241 EVLPHGNPS-TPKGRGRPQGTVSKMGTIPITLNKGRGRPKRSVPKPTK-RGRPKKMVFSK 300

Query: 471 -QSLSSSCHGHPRG---AYTLLGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKF 530
            + L  +C  +P G      L  +  GRP K   LS+   G P      L    GRP+K 
Sbjct: 301 ARMLKVTCKPYPSGPRQVVDLAPEKRGRPGKPVDLSNRRRGRP----GKLPPKRGRPKKI 360

Query: 531 PSLSSSCHGHPRGAYTLLGKGLGRPRKFP 545
            +         R +   + K LGRPR  P
Sbjct: 361 LTPEEEEKRRKRESEPRVFKPLGRPRIHP 376

BLAST of Lag0031908 vs. ExPASy TrEMBL
Match: A0A3P8P8U8 (Uncharacterized protein OS=Astatotilapia calliptera OX=8154 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 6.6e-13
Identity = 73/246 (29.67%), Positives = 93/246 (37.80%), Query Frame = 0

Query: 228 SLGRPRKFP--SLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGL 287
           S  RP   P  S  + C      +  +L +    P   +         PR   +L+ + L
Sbjct: 23  STNRPVMDPADSALTECLEETARIQAILDRIFSAPDLKTGFKFRLLEGPRSPSSLMPRQL 82

Query: 288 GRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPR 347
           G      SL     G PR    L+ + LG PR   SL     G PR    L+ + LG PR
Sbjct: 83  GVQENRTSLMPRLLGGPRSLSGLMPRLLGGPRSPSSLMPRLLGGPRSPSGLMPRLLGGPR 142

Query: 348 KFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPS 407
               L     G PR    L+ + LG PR    L     G PR    L+ + LG PR    
Sbjct: 143 SLSGLMPRLLGGPRSLSGLMPRLLGGPRSPSGLMPRLLGGPRSLSGLMPRLLGGPRSLSG 202

Query: 408 LSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSS 467
           L     G PR    L+ + LG PR    L     G PR    L+ + LG PR    L   
Sbjct: 203 LMPRLLGGPRSLSGLMPRLLGGPRSLSGLMPRLLGGPRSLSGLMPRLLGGPRSPSGLMPR 262

Query: 468 CHGHPR 472
             G PR
Sbjct: 263 LLGGPR 268

BLAST of Lag0031908 vs. ExPASy TrEMBL
Match: A0A2V2WH94 (Uncharacterized protein OS=Trypanosoma cruzi OX=5693 GN=C3747_120g5 PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 3.4e-09
Identity = 81/317 (25.55%), Positives = 128/317 (40.38%), Query Frame = 0

Query: 97  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSW 156
           SA  SS + C + +S    ++      + +   +  L S      +  L     A+  + 
Sbjct: 242 SATASSLSLCSTAFSATASSLSFCSTAFSATASSLSLCSTAFSATASSLSLCSTAFSATA 301

Query: 157 DAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHS 216
            +  +   A++    SL       S ++S  S    A      SL       S ++S  S
Sbjct: 302 PSLSFCSTAFSATASSLSLGSTAFSATASSLSLCTWAIAATASSLSLCSTAFSATASSLS 361

Query: 217 HPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSC 276
               A++    SL        PR+ PS S++ HS PR   +L      + R+  S S++ 
Sbjct: 362 LCSTAFSATASSLSSAAQHSPPRRHPSPSAAQHSPPRRHPSLSAPGQSQSRRHPSPSAAQ 421

Query: 277 HSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP 336
           HS               PR+ PS S++ H  PR   +L   G  + R+  SLS++ H  P
Sbjct: 422 HS--------------LPRRHPSPSAAQHSLPRRHPSLSAPGQSQSRRHPSLSAAKHSPP 481

Query: 337 RGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY 396
           R   +    G   PR+  S S++ H  PR   +L   G   PR+ PS S++ H  PR   
Sbjct: 482 RRHPSPSAPGQSPPRRHPSPSAAQHSLPRRHPSLSAPG-QPPRRHPSPSAAQHSLPRRHP 541

Query: 397 TLLGKGLGRPRKFPSLS 408
           +        PR+ PSLS
Sbjct: 542 SPSAAQHSLPRRHPSLS 543

BLAST of Lag0031908 vs. ExPASy TrEMBL
Match: A0A673XVP5 (Uncharacterized protein OS=Salmo trutta OX=8032 PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 3.4e-09
Identity = 130/402 (32.34%), Positives = 166/402 (41.29%), Query Frame = 0

Query: 180 MSLSSSCHSHLR-GAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRK---- 239
           M LS      LR G    L K LGRP +            +G    + K LGRP +    
Sbjct: 1   MPLSKGLDRQLRKGLDRPLRKGLGRPLR------------KGLGRPMRKGLGRPMRKGLG 60

Query: 240 FPSLSSSCHSHPRGVYTLLGKSLGRPRK---FSSLSSSCHSHPR-GAYTLLGKGLGRPRK 299
            P          +G+   L K LGRP +    S L     S  R G  +LL KGLGRP++
Sbjct: 61  SPLRKGLGRPLSKGLDRPLRKGLGRPMRKGLGSPLRKGLGSPLRKGLGSLLRKGLGRPQR 120

Query: 300 ----FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLRKGLGRP 359
               +P           G      KGLGRP + +       G P  +G   LLRKGLGR 
Sbjct: 121 KGLGWPLRKGLGRPQRNGLGRPQRKGLGRPLRKRL------GRPLRKGLGRLLRKGLGRL 180

Query: 360 RKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHP--RGAYTLLGKGLGRPRK 419
            + + L  +     +G   LL KGLG P +         G P  +G    L KGLGRP++
Sbjct: 181 LR-KGLGRTLR---KGLGRLLRKGLGWPLR------KGLGRPQRKGLGRPLRKGLGRPQR 240

Query: 420 FPSLSSSCH---GHP----------RGAYTLLGKGLGRPRKFQSLSSSCH---GHP--RG 479
              L        G P          +G   LL KGLGR  + + L S      G P  +G
Sbjct: 241 -KGLGRPLRKRLGRPLRKGLGRLLRKGLGRLLRKGLGRTLR-KGLGSPLRKGLGSPLRKG 300

Query: 480 AYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLGKGLGRPRKFPSLSSSYHGHP--RG 539
             +LL KGLG P +         G P  +G   LL KGLGR  +         G P  +G
Sbjct: 301 LGSLLRKGLGWPLR------KGLGRPLRKGLGRLLRKGLGRTLR------KGLGRPLRKG 356

Query: 540 AYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRK 543
              LL K LGRP +   L S      +G  +LL  GLGRP++
Sbjct: 361 LGRLLRKVLGRPLR-KGLGSPLR---KGLGSLLRNGLGRPQR 356

BLAST of Lag0031908 vs. ExPASy TrEMBL
Match: F2EDU9 (Predicted protein (Fragment) OS=Hordeum vulgare subsp. vulgare OX=112509 PE=2 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 3.8e-08
Identity = 97/312 (31.09%), Positives = 124/312 (39.74%), Query Frame = 0

Query: 243 HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHG 302
           +SH   +     +  GR R    L S+      GA +L  KG GRPRK   L SS     
Sbjct: 291 YSHSYMLRAKRKRKRGRGRSLKLLVSA----NAGASSLTKKGRGRPRKKQKLGSSQAAFQ 350

Query: 303 HPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYTLLRKGLGRPRKFQSLSSSCH 362
           + +G ++   +G GRPRK  S       G PR    G  T +++G GRPRK +S      
Sbjct: 351 NGQGGFSTPKRGRGRPRKDLSTGVKRGRGRPRKDAQGLSTSVKRGRGRPRKEES------ 410

Query: 363 GHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPR 422
              +G  T +  G GRPRK  S          G  T L  G GRPRK P   S+      
Sbjct: 411 ---QGMSTGVKSGRGRPRKEES---------EGMSTGLKSGRGRPRKEPEEMSTGVTETD 470

Query: 423 GAYTLLGKGLGRPRKFQSLSSSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGA 482
            + T             SL+ S    G P      L +G G P    +  ++     RG 
Sbjct: 471 SSTT-----ASDSESDSSLTGSDTESGSP-NVSRWLKEGFGNPTTLATPPAAVRPVSRG- 530

Query: 483 YTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY 542
              L  G  RP   R  P   S+       A T + K LGRPRK  S  +       G  
Sbjct: 531 ---LNIGSLRPTIERPPPPPRSAGTALDTVASTGMKKPLGRPRKKRSAKAVSAETGDGGS 570

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_033736132.1  5.9e-16  25.07         uncharacterized protein LOC117324396 [Pecten maximus]
PWV06054.1      7.0e-09  25.55         hypothetical protein C3747_120g5 [Trypanosoma cruzi]
BAK05521.1      7.8e-08  31.09         predicted protein, partial [Hordeum vulgare subsp. vulgare]
XP_020199045.1  2.3e-07  30.59         collagen alpha-1(I) chain [Aegilops tauschii subsp. strangulata]
KAF6998140.1    2.3e-07  30.59         hypothetical protein CFC21_014287 [Triticum aestivum]

Match Name  E-value  Identity (%)  Description
A0A6Q2ZHZ9  4.6e-14  31.62         Uncharacterized protein OS=Esox lucius OX=8010 PE=3 SV=1
A0A3P8P8U8  6.6e-13  29.67         Uncharacterized protein OS=Astatotilapia calliptera OX=8154 PE=4 SV=1
A0A2V2WH94  3.4e-09  25.55         Uncharacterized protein OS=Trypanosoma cruzi OX=5693 GN=C3747_120g5 PE=4 SV=1
A0A673XVP5  3.4e-09  32.34         Uncharacterized protein OS=Salmo trutta OX=8032 PE=4 SV=1
F2EDU9      3.8e-08  31.09         Predicted protein (Fragment) OS=Hordeum vulgare subsp. vulgare OX=112509 PE=2 SV...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term: IPR017956
IPR Description: AT hook, DNA-binding motif
Source: SMART
Source Term: SM00384
Source Description: AT_hook_2
Alignment:
coord     e-value  score
534..546  3.1      12.5
338..350  7.8      10.2
170..182  18.0     8.2
226..238  11.0     9.4
310..322  4.1      11.8
422..434  4.1      11.8
366..378  3.1      12.5
198..210  18.0     8.2
478..490  3.1      12.5
254..266  14.0     8.7
450..462  4.1      11.8
394..406  3.1      12.5
282..294  3.1      12.5
506..518  3.1      12.5

IPR Term: None
IPR Description: No IPR available
Source: MOBIDB_LITE
Source Term: mobidb-lite
Source Description: disorder_prediction
Alignment: coord: 1..22
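
The AT_hook_2 coordinates are listed in no particular order, which hides their regular layout. The sketch below sorts them and reports the hit length and the spacing between consecutive hits. It assumes the coordinates are inclusive residue positions on the protein, and it is only an arithmetic check on the values in this record, not a statement about the motif's biology.

```python
# (start, end) residue coordinates of the SMART AT_hook_2 hits in this record
hits = [
    (534, 546), (338, 350), (170, 182), (226, 238), (310, 322),
    (422, 434), (366, 378), (198, 210), (478, 490), (254, 266),
    (450, 462), (394, 406), (282, 294), (506, 518),
]

hits_sorted = sorted(hits)
# Length of each hit, assuming inclusive coordinates
lengths = {end - start + 1 for start, end in hits_sorted}
# Distance between the start positions of consecutive hits
spacings = {b[0] - a[0] for a, b in zip(hits_sorted, hits_sorted[1:])}

print(len(hits_sorted), lengths, spacings)  # 14 {13} {28}
```

All 14 hits are 13 residues long and start every 28 residues from position 170 to 534, consistent with the tandem repeats visible in the protein sequence.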

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0031908.1  Lag0031908.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003677      DNA binding