Lag0028022 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0028022
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: DNA-directed DNA polymerase
Location: chr8: 10731994 .. 10735593 (+)
RNA-Seq Expression: Lag0028022
Synteny: Lag0028022
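
As a quick check on the Location field, the length of the gene span can be derived from its coordinates. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome-browser records, but not stated on this page); the function name span_length is illustrative and not part of any database API.

```python
import re


def span_length(location: str) -> int:
    """Return the number of bases covered by a 'chr: start .. end (strand)' string.

    Assumption: coordinates are 1-based and inclusive (not stated in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinate pair found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Location string copied from the Overview of this record.
print(span_length("chr8: 10731994 .. 10735593 (+)"))
```

Under that assumption the span covers 3600 positions on the plus strand of chr8.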
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTGGTACAGCGACAGCTGATAACTTTTATAATCCAGGTTGGCGCAACCACCCCAAGTTTGCGTGGGGAGGGCAAAGAAGCAATTCGCAAGTCCCCCAAGCACAGCAAAAGGCGGTGAACCAGTCCGGATTTGCTAAATCACAAGAATTGCCCCAGCAAAATAAGCAGGCTTTGCTCCAGCAAAATTCGGGTAATTCTCTGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAGGCTTCATTGAGAACCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTTTAAGGAGTGGTAAGCCACTAGAAGAGAGGAAAGAGCCTAGTAAACCCCAGGAAGTAGAGAAAAGTAGTGATAAAAATAATGTTGTTGAAAAAGAGTTGGAGTCGGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACCCTATAACCCACCCTTACCTTTTCCACAAAGGCAAAGACCTAAGAATCAAGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATTCCTTTAGTAGAAGTTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATGTTTTAACCAAAAAGAAGAGATTCGGAGAGTTTGAGACTGTGGCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCTTCCAAGGTTGGGAATCCAGGATCATTCACTATTCCTGTCTCCATGGGTGGAAAGGAGTTGGGGAGAGCACATTGTGATTTAGGTGCAAGCATAAACCTTATGCCTCTTTCGGTTTATCGAAAGCTAGGAATAGGTGATGCTACACCTACCACAATTACACTCCAATTGGCAGATAAATCTATCACATATCCTGAAGGTAAAATTGAGGATGTTCTAGTCCAGGTAGATAAATTTCTTTTTCCTGTCGATTTCATTGTTTTAGATTATGAGGCAGATAAGAATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCAACTGGTAGTTCATTGATAGATGTCCAAAAGGGGAGCTTACAATGAAGGTGCATGACCAAAAGGTGAAGTTTAATATGTGTGATGCAATGAAATATCCTAATGATCTTGAGGATTGCTCGTGCATTCAGGTGTTGGATGAGTTTGTTGAGGACCACGTTGAGAAGGATTTGATGGAGTACCATACCCAAAAATTTGGAGAGATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGTATGACGATGTAGGTGAGATTTCTAGTTTTAAGAGGAATTTTGAATCCTTAGAGCCAATAGATGAGAAAAGCCTATTGAACCTTGTAATTCATTGACATTGCTCCAGCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGTTATTTATTGTAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGGTTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCATGGTTTATCAAAAGAACATATGAAAGAATTTTTCCATGAGGTGGAGTTCTATTTATCAAATGATGCATCTATGGTTAGACAATGTATTGATGATGGATGTGAGTTTAAAGTAAAAGGGCAGTGAATGAGAAAACAAGAGTTTTGCGGGAGCATTTCATATTGCAATATTTTTATCTCCTAGTGTTGTTGCAATTTTATTTTTCTCGTGGATTTTATTCTGGTATGAATTAATTTGAAGTAGATTTTATTGATTTCAATTTTATCTGATTTATTTTTCGTTGGTATTTTCGTAGATTTTATGTTCGTTTTATTTATTTATTTTTATTTTGGTCAATTTAATTCTTAATTTAATCCGATTAGTTCTGTTTAGATTTAATTTCAGTTTG
AATTTTTAATTTCTGTTTAGATTAGATTTTTAATAGTATATCTTCGAGATTTAATTACGAAATATTCTTCTTTGAAGTTGCTGGCAGATTTAGGGAGCGCATATTTTGAGGGAGCAAAATTTCAACATTCTAATTAGAAGCTCTCATTATTTGATCTCATTTTTTTGTGTTAATTTTGATTTATTGTCTGTTTTGATATTATTTTGATTTGATATGATTTTTATTCGGTTAGATTTTTATTTTATTTTTATTTTTATTTTTTGTAGGTTAGTTTAGATTTGATTTTGAAGGATTTTATTTTCAAAAGATTTTTCCAGATTTAAAATAATATATCTTGTGGATGTTATGTAAAAAAAAAAAAGATTTGGTTTCATAATTTTGTTGAGAAGATCAGAGATTTTATTCGGTTAAGATTTTTTGTGAGATAAGATTCTAATTTTATCATTGAGATTTTATTGTTGTTGGCAGGATAATTATGTGGGCATGCAGTTAATTACCGAAGATGTCATGTGTTGAGCAGAGGGAAGCTAGCATTATGTTTGTTAGACATAGAGGGAGCGCAGTAAATGGGAAGATATTACCGTTTGAAAGATCAAAATTTTCCACGAATCCATGTTGTTAAAACGGTAACTTAAATTAATTAGGAGATAATTAGGGGCTAATTATTTGTTGAAGATCTTTGAGTTAGTGTCGTATTTATACGATCTCCCATTAGGTATGTTTTTCACTCTTATCTTTTATTACTACTGATGTGATTTTTTCTTTGTGTGTATTATTATTATGAGGAAAAAAACACGAGCAAGGAAAAATGTTGAGACTGTTGAGATTGAAAGGGAACTGAGTAAAGAAGTTGAGCTTGATGAAAGATTTCAGTACGGGAGATTTATCCATGAAGAGGCAAGACAAAAATATCAAGAAGTCCTGAAGCGAGATTTTCTGATGAAGAGCGGCTTCAATGGTGGCAAAGAGCTCCCACATTTTCCTTGAGCAAACATTACTAATCACGGTTGGAAACGATTGTGCGCAAAGCCGGATCCCACTGTTGATCAACTAGAACGGGAGTTCTATGCAAACATCGATGAAAATGAACGATTCTTGGTTATTGTTCCTAGAGTTGTTGTCGACTGGAGCCATGTAGTGATTAATGCTCTATTTATTTTGCAAGACTTCCCCCACGTTGTTTTCAATGAAATATTAGTTGCTTCCTCAAACGAGCAACTGAATGTCCAGTGGAGGTTGTCTAAGATGAGGGCAAGAACATTCCAGTCAACATACCTGAAGTGTGAGGCTAACACTTGGCTCGACTTTGTCATGTAGAGATTACTTACCTCAACGTACGATTCCATTGTCCCCATGATAGAGTGCACTTGAAGGTTCAGAGTCCCAAAAGAGTTCCAATGATATAAACTTGTTTGATAAGGGGATTATTGACACTACCAATCTAGCTAGGCTTCAAAAAGGTGAGGTGCGCCAAGGTGGTTTGTTGTGCAGTGTCCATAAAGTGCTAGAGCAACTTGAGAGGATGTTCATTAGGCCAGTTTCACTTAAAGACAACTTCAAACCTACTAGAGCTATGATCGTTGAAGGGACACCACACTAA

mRNA sequence

ATGCTTGGTACAGCGACAGCTGATAACTTTTATAATCCAGGTTGGCGCAACCACCCCAAGTTTGCGTGGGGAGGGCAAAGAAGCAATTCGCAAGTCCCCCAAGCACAGCAAAAGGCGGTGAACCAGTCCGGATTTGCTAAATCACAAGAATTGCCCCAGCAAAATAAGCAGGCTTTGCTCCAGCAAAATTCGGGTAATTCTCTGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAGGCTTCATTGAGAACCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTTTAAGGAGTGGTAAGCCACTAGAAGAGAGGAAAGAGCCTAGTAAACCCCAGGAAGTAGAGAAAAGTAGTGATAAAAATAATGTTGTTGAAAAAGAGTTGGAGTCGGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACCCTATAACCCACCCTTACCTTTTCCACAAAGGCAAAGACCTAAGAATCAAGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATTCCTTTAGTAGAAGTTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATGTTTTAACCAAAAAGAAGAGATTCGGAGAGTTTGAGACTGTGGCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCTTCCAAGGTTGGGAATCCAGGATCATTCACTATTCCTGTCTCCATGGGTGGAAAGGAGTTGGGGAGAGCACATTGTGATTTAGGTGCAAGCATAAACCTTATGCCTCTTTCGGTTTATCGAAAGCTAGGAATAGGTGATGCTACACCTACCACAATTACACTCCAATTGGCAGATAAATCTATCACATATCCTGAAGGTAAAATTGAGGATGTTCTAGTCCAGGTGTTGGATGAGTTTGTTGAGGACCACGTTGAGAAGGATTTGATGGAGTACCATACCCAAAAATTTGGAGAGATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGTATGACGATCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGTTATTTATTGTAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGGTTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCATGGTTTATCAAAAGAACATATGAAAGAATTTTTCCATGAGGTGGAGTTCTATTTATCAAATGATGCATCTATGCCGGATCCCACTGTTGATCAACTAGAACGGGAGTTCTATGCAAACATCGATGAAAATGAACGATTCTTGGTTATTGTTCCTAGAGTTGTTGTCGACTGGAGCCATGTAGTGATTAATGCTCTATTTATTTTGCAAGACTTCCCCCACGTTGTTTTCAATGAAATATTAGTTGCTTCCTCAAACGAGCAACTGAATGTCCAGTGGAGGTTGTCTAAGATGAGGGCAAGAACATTCCAGTCAACATACCTGAAGTGTGAGGCTAACACTTGGCTCGACTTTAGTGCACTTGAAGGTTCAGAGTCCCAAAAGAGTTCCAATGATATAAACTTGTTTGATAAGGGGATTATTGACACTACCAATCTAGCTAGGCTTCAAAAAGGTGAGGTGCGCCAAGGTGGTTTGTTGTGCAGTGTCCATAAAGTGCTAGAGCAACTTGAGAGGATGTTCATTAGGCCAGTTTCACTTAAAGACAACTTCAAACCTACTAGAGCTATGATCGTTGAAGGGACACCACACTAA

Coding sequence (CDS)

ATGCTTGGTACAGCGACAGCTGATAACTTTTATAATCCAGGTTGGCGCAACCACCCCAAGTTTGCGTGGGGAGGGCAAAGAAGCAATTCGCAAGTCCCCCAAGCACAGCAAAAGGCGGTGAACCAGTCCGGATTTGCTAAATCACAAGAATTGCCCCAGCAAAATAAGCAGGCTTTGCTCCAGCAAAATTCGGGTAATTCTCTGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAGGCTTCATTGAGAACCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTTTAAGGAGTGGTAAGCCACTAGAAGAGAGGAAAGAGCCTAGTAAACCCCAGGAAGTAGAGAAAAGTAGTGATAAAAATAATGTTGTTGAAAAAGAGTTGGAGTCGGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACCCTATAACCCACCCTTACCTTTTCCACAAAGGCAAAGACCTAAGAATCAAGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATTCCTTTAGTAGAAGTTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATGTTTTAACCAAAAAGAAGAGATTCGGAGAGTTTGAGACTGTGGCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCTTCCAAGGTTGGGAATCCAGGATCATTCACTATTCCTGTCTCCATGGGTGGAAAGGAGTTGGGGAGAGCACATTGTGATTTAGGTGCAAGCATAAACCTTATGCCTCTTTCGGTTTATCGAAAGCTAGGAATAGGTGATGCTACACCTACCACAATTACACTCCAATTGGCAGATAAATCTATCACATATCCTGAAGGTAAAATTGAGGATGTTCTAGTCCAGGTGTTGGATGAGTTTGTTGAGGACCACGTTGAGAAGGATTTGATGGAGTACCATACCCAAAAATTTGGAGAGATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGTATGACGATCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGTTATTTATTGTAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGGTTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCATGGTTTATCAAAAGAACATATGAAAGAATTTTTCCATGAGGTGGAGTTCTATTTATCAAATGATGCATCTATGCCGGATCCCACTGTTGATCAACTAGAACGGGAGTTCTATGCAAACATCGATGAAAATGAACGATTCTTGGTTATTGTTCCTAGAGTTGTTGTCGACTGGAGCCATGTAGTGATTAATGCTCTATTTATTTTGCAAGACTTCCCCCACGTTGTTTTCAATGAAATATTAGTTGCTTCCTCAAACGAGCAACTGAATGTCCAGTGGAGGTTGTCTAAGATGAGGGCAAGAACATTCCAGTCAACATACCTGAAGTGTGAGGCTAACACTTGGCTCGACTTTAGTGCACTTGAAGGTTCAGAGTCCCAAAAGAGTTCCAATGATATAAACTTGTTTGATAAGGGGATTATTGACACTACCAATCTAGCTAGGCTTCAAAAAGGTGAGGTGCGCCAAGGTGGTTTGTTGTGCAGTGTCCATAAAGTGCTAGAGCAACTTGAGAGGATGTTCATTAGGCCAGTTTCACTTAAAGACAACTTCAAACCTACTAGAGCTATGATCGTTGAAGGGACACCACACTAA

Protein sequence

MLGTATADNFYNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQALLQQNSGNSLESMMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVKAVTLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGASGSVPDVEPPYVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEFVEDHVEKDLMEYHTQKFGEIQIEDLEIGGLEHEYDDQPEIRKSFIDERLFIVAHIKEVKTPWYDGFSNYLDFGNLPHGLSKEHMKEFFHEVEFYLSNDASMPDPTVDQLEREFYANIDENERFLVIVPRVVVDWSHVVINALFILQDFPHVVFNEILVASSNEQLNVQWRLSKMRARTFQSTYLKCEANTWLDFSALEGSESQKSSNDINLFDKGIIDTTNLARLQKGEVRQGGLLCSVHKVLEQLERMFIRPVSLKDNFKPTRAMIVEGTPH
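
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 21 and last 18 bases of the CDS only; it assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the names CODON_TABLE and translate are illustrative.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the CDS above: expected to give the protein's N-terminus.
print(translate("ATGCTTGGTACAGCGACAGCT"))
# Last 18 bases of the CDS above, ending in the TAA stop codon.
print(translate("GAAGGGACACCACACTAA"))
```

The two fragments translate to MLGTATA and EGTPH, matching the start and end of the protein sequence listed for this gene.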
Homology
BLAST of Lag0028022 vs. NCBI nr
Match: XP_030509265.1 (uncharacterized protein LOC115723943 [Cannabis sativa])

HSP 1 Score: 346.3 bits (887), Expect = 5.6e-91
Identity = 199/347 (57.35%), Positives = 250/347 (72.05%), Query Frame = 0

Query: 17  NHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQALLQQNSGNSLESMMKDYM 76
           NHP  +WGGQ ++S    AQ +     GF++    P+ ++ A  Q    +SLES+M+DYM
Sbjct: 165 NHPNLSWGGQGASSSTAPAQGRQAYPPGFSQQ---PRHSQHA--QNAQPSSLESLMRDYM 224

Query: 77  ARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVKAVTLRSGK 136
           A+ND +I SQ ASLR LELQ+G LANELKARPQG++PSD E+P R+GKEQ K++ LRSGK
Sbjct: 225 AKNDAVIQSQAASLRNLELQLGHLANELKARPQGSLPSDTENPRRDGKEQCKSIHLRSGK 284

Query: 137 PL----EERK---EPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGASGSVPDVEPP 196
            L    EE K   EP+  Q  EK S K      +      A G  +D+  S         
Sbjct: 285 HLKNSEEEIKGSGEPTSIQNDEKLSKKTAQEIADTRPVDTASGQQSDSQQSA-------- 344

Query: 197 YVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDVL 256
              P    PPLPFPQR R + QDGQFKKFL++LKQLHINIPLVE +EQMPNY KFLKD+L
Sbjct: 345 ---PVCTKPPLPFPQRFRKQQQDGQFKKFLDVLKQLHINIPLVEALEQMPNYVKFLKDIL 404

Query: 257 TKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASINL 316
           TKK+R GEFETVALTE CSA+LK+ +P K+ +PGSFTIP S+GG+++GRA CDLGASINL
Sbjct: 405 TKKRRLGEFETVALTEGCSAMLKSKIPPKLKDPGSFTIPCSIGGRDVGRALCDLGASINL 464

Query: 317 MPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEFV 357
           MP+S+++KLGIG+A PTT+TLQLAD+S+ +PEGKIEDVLVQV D+F+
Sbjct: 465 MPMSIFKKLGIGEARPTTVTLQLADRSMAHPEGKIEDVLVQV-DKFI 494
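
The percentages reported for each HSP are the matching-position counts divided by the alignment length. A minimal sketch of that arithmetic, applied to the statistics line of the HSP above, is given here; the helper name hsp_percentages and the regular expression are illustrative and are not part of BLAST itself.

```python
import re


def hsp_percentages(stats_line: str) -> dict:
    """Recompute 'label = matches/length' ratios from an HSP statistics line.

    Returns a mapping from each label to its percentage, rounded to two decimals.
    """
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


# Counts taken from the first HSP against XP_030509265.1.
line = "Identity = 199/347 (57.35%), Positives = 250/347 (72.05%), Query Frame = 0"
print(hsp_percentages(line))
```

For this HSP, 199 identical positions over an alignment length of 347 gives 57.35%, and 250 positive-scoring positions gives 72.05%, in agreement with the reported values.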

BLAST of Lag0028022 vs. NCBI nr
Match: XP_030505532.1 (uncharacterized protein LOC115720524 [Cannabis sativa])

HSP 1 Score: 337.8 bits (865), Expect = 2.0e-88
Identity = 187/348 (53.74%), Positives = 246/348 (70.69%), Query Frame = 0

Query: 9   NFYNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQALLQQNSGNSL 68
           N YN  W+NHP  +WGGQ ++S    AQ +     GF++    P+ ++ A  Q +  +SL
Sbjct: 216 NSYNQAWKNHPNLSWGGQGASSSTAPAQGRQAYPPGFSQQ---PRHSQHA--QNSQPSSL 275

Query: 69  ESMMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVK 128
           ES+M+DYMA+ D +I SQ ASLR LELQ+G LANELKARPQG++PSD ++P R+GKEQ K
Sbjct: 276 ESLMRDYMAKIDAVIQSQVASLRNLELQLGHLANELKARPQGSLPSDTKNPRRDGKEQCK 335

Query: 129 AVTLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGASGSVPDVEP 188
           ++ LRSGK                   KN   E ++++ +G   ++  +    S P    
Sbjct: 336 SIQLRSGK-----------------HPKNYEEEIKVDTAKGQQPNSQQSALVCSSP---- 395

Query: 189 PYVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDV 248
                    PPLPFPQR R + QDGQFKKFL++LKQLHINIPLVE +EQMPNY KFLKD+
Sbjct: 396 --------KPPLPFPQRFRKQQQDGQFKKFLDVLKQLHINIPLVEALEQMPNYVKFLKDI 455

Query: 249 LTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASIN 308
           LTKK+R GEFETVALTE C+A+LK+ +P K+ +PGSFTIP S+ G+++GRA  DLGASIN
Sbjct: 456 LTKKRRLGEFETVALTEGCNAMLKSKIPPKLKDPGSFTIPCSIRGRDVGRALSDLGASIN 515

Query: 309 LMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEFV 357
           LMP+S+++ LGIG+A PTT+TLQLAD+S+ +PEGKIEDVLVQV D+F+
Sbjct: 516 LMPMSIFKMLGIGEARPTTVTLQLADRSMAHPEGKIEDVLVQV-DKFI 528

BLAST of Lag0028022 vs. NCBI nr
Match: XP_024028757.1 (uncharacterized protein LOC112093792 [Morus notabilis])

HSP 1 Score: 310.8 bits (795), Expect = 2.6e-80
Identity = 183/363 (50.41%), Positives = 239/363 (65.84%), Query Frame = 0

Query: 9   NFYNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQ----QNKQALLQQNS 68
           N YN GW+ HP F+W  Q +N        K     GF + Q   Q    Q+ Q    Q S
Sbjct: 133 NSYNQGWKQHPNFSWSNQEANPM--PGPSKPAYPPGFHQHQHQRQPPQEQSNQRQPHQAS 192

Query: 69  GNSLESMMKDYMARND-------VIILSQQASLRTLELQVGQLANELKARPQGNIPSDIE 128
              +E+++K+YMARND        ++ SQ ASLRTLE QVGQLAN L  RPQG++PSD +
Sbjct: 193 STPMEALLKEYMARNDSLIPGQAALLQSQAASLRTLENQVGQLANVLSNRPQGSLPSDTK 252

Query: 129 HPIREG----KEQVKAVTLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGS 188
           +P R+G    KE  KA+TL++G+ +E+    +   E   S     V +   ES Q     
Sbjct: 253 NPRRDGKEHCKEHCKAITLQNGREIEQLTRQTAATE-HSSIQTQEVQQPPAESEQDVVDQ 312

Query: 189 NNDAGASGSVPDVEPPYVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVE 248
           +  A    + P+            PP PFPQR + + QD QF++FL++LKQLHINIPLVE
Sbjct: 313 DATAKLKQNKPE-----------RPPPPFPQRFQNQKQDKQFRRFLDVLKQLHINIPLVE 372

Query: 249 VIEQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGG 308
            +EQMP+Y KF+KD+LTKK+R GEFETVALTEECSAILKN LP K+ +PGSFTIP S+G 
Sbjct: 373 ALEQMPSYVKFMKDILTKKRRLGEFETVALTEECSAILKNRLPPKLKDPGSFTIPCSIGD 432

Query: 309 KELGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLD 357
           + +G+A CDLGASINLMP+S++RKLGIG+ +PTT+TLQLAD+S  +PEGKIEDVLV+V D
Sbjct: 433 QYIGKALCDLGASINLMPMSIFRKLGIGEVSPTTVTLQLADRSYAHPEGKIEDVLVRV-D 480

BLAST of Lag0028022 vs. NCBI nr
Match: XP_022951570.1 (uncharacterized protein LOC111454344 [Cucurbita moschata])

HSP 1 Score: 304.7 bits (779), Expect = 1.9e-78
Identity = 175/324 (54.01%), Positives = 223/324 (68.83%), Query Frame = 0

Query: 41  NQSGFAKSQELPQQNKQALLQQNSGNSLESMMKDYMARNDVIILSQQASLRTLELQVGQL 100
           NQ  ++  Q   Q       Q  SG S+ES++K+YMA+NDV+I SQQASL+ LE+QVGQL
Sbjct: 15  NQLAYSSQQVNTQGKGITQAQYTSGTSIESLIKEYMAKNDVVIQSQQASLQNLEVQVGQL 74

Query: 101 ANELKARPQGNIPSDIEHPIREGKEQVKAVTLRSGKPLEERKE------PSKPQEVEKSS 160
           A EL+ RP G +P+D E P REGKEQ +A+ LRSGK +    E       S  QE   + 
Sbjct: 75  ATELRNRPLGKLPADTETPKREGKEQCQAIELRSGKKIPSGGEKNAEQGDSHSQETADTQ 134

Query: 161 DKNN--VVEKELESGQGAGGSNNDAGASGSVPDVEPPYVPPPPYNPPLPFPQRQRPKNQD 220
            +N+   V+KE                + S       Y P P      PFPQR + K ++
Sbjct: 135 QRNDEAAVQKEHSKDYAKIKEQPKIQTTASSGQESITYTPSP------PFPQRIKRKKEE 194

Query: 221 GQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILK 280
             F+KF++ILK++HINIP VE ++QMPNY KFLKDVLT +++F EF+ V+L EECSAILK
Sbjct: 195 AHFEKFMDILKEIHINIPRVEALKQMPNYVKFLKDVLTNRRKFEEFKVVSLNEECSAILK 254

Query: 281 NGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQL 340
           N +P K  +PGSFTIPVS+GGKELGRA CDLGASINLMPLS+Y+KLGIG+A PTT+TLQL
Sbjct: 255 NKIPLKEKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSIYKKLGIGEARPTTVTLQL 314

Query: 341 ADKSITYPEGKIEDVLVQVLDEFV 357
           AD+SITYPEGKIED+L+QV D+F+
Sbjct: 315 ADRSITYPEGKIEDILIQV-DKFI 331

BLAST of Lag0028022 vs. NCBI nr
Match: XP_022960431.1 (uncharacterized protein LOC111461167 [Cucurbita moschata])

HSP 1 Score: 300.4 bits (768), Expect = 3.5e-77
Identity = 176/361 (48.75%), Positives = 241/361 (66.76%), Query Frame = 0

Query: 11  YNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQALLQQNSGNSLES 70
           Y PG+    +  +G Q++ +Q            G +++Q +P+             SLES
Sbjct: 7   YPPGFGLQNQLTYGSQQATTQ----------GEGTSQAQHIPE------------TSLES 66

Query: 71  MMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVKAV 130
           ++K+YMA+NDV+I SQQASLR LE+QVGQLANEL+ RP G +PSD E P REG EQ +A+
Sbjct: 67  LIKEYMAKNDVVIQSQQASLRNLEVQVGQLANELRNRPLGKLPSDTEMPKREGMEQCQAI 126

Query: 131 TLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGA------SGSVP 190
            LRSGK +  R+E  K     +S +  +  +++ E+      + NDA A      S +  
Sbjct: 127 ELRSGKEISSREEKIKEHSDSRSQETADTQQRKEEAVVQEEHNKNDAEALVQKEHSRNYA 186

Query: 191 DV-EPPYVPPPP--------YNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVI 250
           ++ +PP              Y P  PFPQR + K ++  F+KF++I K++HINIPLVE +
Sbjct: 187 EIMKPPKTQTSVNSGQESRIYTPTPPFPQRIKRKKEEAHFEKFMDIFKEIHINIPLVEAL 246

Query: 251 EQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKE 310
           +QM NY KFLKDVLT +++F EF+ V L EECSAILKN +P K  +PGSFTIP+S+GGK+
Sbjct: 247 KQMQNYVKFLKDVLTNRRKFEEFKVVPLNEECSAILKNKIPLKEKDPGSFTIPISIGGKK 306

Query: 311 LGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEF 357
           LGRA CDLG+SINLMPLS+Y+KLGIG+A PTT+TLQLAD+S T+PEGKIED+L+QV D+F
Sbjct: 307 LGRALCDLGSSINLMPLSIYKKLGIGEARPTTVTLQLADRSFTHPEGKIEDILIQV-DKF 344

BLAST of Lag0028022 vs. ExPASy TrEMBL
Match: A0A6J1GJ68 (uncharacterized protein LOC111454344 OS=Cucurbita moschata OX=3662 GN=LOC111454344 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 9.1e-79
Identity = 175/324 (54.01%), Positives = 223/324 (68.83%), Query Frame = 0

Query: 41  NQSGFAKSQELPQQNKQALLQQNSGNSLESMMKDYMARNDVIILSQQASLRTLELQVGQL 100
           NQ  ++  Q   Q       Q  SG S+ES++K+YMA+NDV+I SQQASL+ LE+QVGQL
Sbjct: 15  NQLAYSSQQVNTQGKGITQAQYTSGTSIESLIKEYMAKNDVVIQSQQASLQNLEVQVGQL 74

Query: 101 ANELKARPQGNIPSDIEHPIREGKEQVKAVTLRSGKPLEERKE------PSKPQEVEKSS 160
           A EL+ RP G +P+D E P REGKEQ +A+ LRSGK +    E       S  QE   + 
Sbjct: 75  ATELRNRPLGKLPADTETPKREGKEQCQAIELRSGKKIPSGGEKNAEQGDSHSQETADTQ 134

Query: 161 DKNN--VVEKELESGQGAGGSNNDAGASGSVPDVEPPYVPPPPYNPPLPFPQRQRPKNQD 220
            +N+   V+KE                + S       Y P P      PFPQR + K ++
Sbjct: 135 QRNDEAAVQKEHSKDYAKIKEQPKIQTTASSGQESITYTPSP------PFPQRIKRKKEE 194

Query: 221 GQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILK 280
             F+KF++ILK++HINIP VE ++QMPNY KFLKDVLT +++F EF+ V+L EECSAILK
Sbjct: 195 AHFEKFMDILKEIHINIPRVEALKQMPNYVKFLKDVLTNRRKFEEFKVVSLNEECSAILK 254

Query: 281 NGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQL 340
           N +P K  +PGSFTIPVS+GGKELGRA CDLGASINLMPLS+Y+KLGIG+A PTT+TLQL
Sbjct: 255 NKIPLKEKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSIYKKLGIGEARPTTVTLQL 314

Query: 341 ADKSITYPEGKIEDVLVQVLDEFV 357
           AD+SITYPEGKIED+L+QV D+F+
Sbjct: 315 ADRSITYPEGKIEDILIQV-DKFI 331

BLAST of Lag0028022 vs. ExPASy TrEMBL
Match: A0A6J1H7K8 (uncharacterized protein LOC111461167 OS=Cucurbita moschata OX=3662 GN=LOC111461167 PE=4 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 1.7e-77
Identity = 176/361 (48.75%), Positives = 241/361 (66.76%), Query Frame = 0

Query: 11  YNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQALLQQNSGNSLES 70
           Y PG+    +  +G Q++ +Q            G +++Q +P+             SLES
Sbjct: 7   YPPGFGLQNQLTYGSQQATTQ----------GEGTSQAQHIPE------------TSLES 66

Query: 71  MMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVKAV 130
           ++K+YMA+NDV+I SQQASLR LE+QVGQLANEL+ RP G +PSD E P REG EQ +A+
Sbjct: 67  LIKEYMAKNDVVIQSQQASLRNLEVQVGQLANELRNRPLGKLPSDTEMPKREGMEQCQAI 126

Query: 131 TLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGA------SGSVP 190
            LRSGK +  R+E  K     +S +  +  +++ E+      + NDA A      S +  
Sbjct: 127 ELRSGKEISSREEKIKEHSDSRSQETADTQQRKEEAVVQEEHNKNDAEALVQKEHSRNYA 186

Query: 191 DV-EPPYVPPPP--------YNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVI 250
           ++ +PP              Y P  PFPQR + K ++  F+KF++I K++HINIPLVE +
Sbjct: 187 EIMKPPKTQTSVNSGQESRIYTPTPPFPQRIKRKKEEAHFEKFMDIFKEIHINIPLVEAL 246

Query: 251 EQMPNYAKFLKDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKE 310
           +QM NY KFLKDVLT +++F EF+ V L EECSAILKN +P K  +PGSFTIP+S+GGK+
Sbjct: 247 KQMQNYVKFLKDVLTNRRKFEEFKVVPLNEECSAILKNKIPLKEKDPGSFTIPISIGGKK 306

Query: 311 LGRAHCDLGASINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEF 357
           LGRA CDLG+SINLMPLS+Y+KLGIG+A PTT+TLQLAD+S T+PEGKIED+L+QV D+F
Sbjct: 307 LGRALCDLGSSINLMPLSIYKKLGIGEARPTTVTLQLADRSFTHPEGKIEDILIQV-DKF 344

BLAST of Lag0028022 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 8.2e-72
Identity = 168/351 (47.86%), Positives = 227/351 (64.67%), Query Frame = 0

Query: 9   NFYNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKS--QELPQQNKQALLQQNSGN 68
           N YNPGWRNHP F+W    SN+  P +  K +   GF +    ++P++  Q         
Sbjct: 348 NTYNPGWRNHPNFSW----SNNAGP-SNPKPIMPPGFQQQARPQIPEKKSQ--------- 407

Query: 69  SLESMMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIE-HPIREGKE 128
            LE ++  Y+++ D II SQ ASLR LE QVGQLAN +  RPQG++PSD + +P  +GKE
Sbjct: 408 -LEELLLQYISKTDAIIQSQGASLRNLETQVGQLANSINNRPQGSLPSDTQINP--KGKE 467

Query: 129 QVKAVTLRSGKPLEERKEPSKPQEVEKSSDKNNVVEKELESGQGAGGSNNDAGASGSVPD 188
           Q +A+TLRSGK +E   + +   E+E   DK  + E E+E  Q       + G S  +  
Sbjct: 468 QCQAITLRSGKEIEGVNQKAVESEIE-HVDKEGMCENEIEIQQKDDDKAENQGTSQVI-- 527

Query: 189 VEPPYVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMPNYAKFL 248
                      +PP PFPQR + +  + QF+KFL + K+LHINIP  E +EQMP+Y KFL
Sbjct: 528 -----------HPPPPFPQRLQKQKLEKQFQKFLNVFKKLHINIPFAEALEQMPSYVKFL 587

Query: 249 KDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGA 308
           KD+L+KK++ GEFETV LTEECSAIL+N LP K+ +PGSFTIP ++G     +A  DLGA
Sbjct: 588 KDILSKKRKLGEFETVFLTEECSAILQNKLPPKLKDPGSFTIPCTIGNLFFTKALSDLGA 647

Query: 309 SINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEFV 357
           SINLMP S++ KLG+G+  PT++TLQLAD+S  YP G IEDVLV+V D+F+
Sbjct: 648 SINLMPWSIFEKLGLGECKPTSVTLQLADRSYVYPRGIIEDVLVKV-DKFI 666

BLAST of Lag0028022 vs. ExPASy TrEMBL
Match: A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 1.4e-71
Identity = 194/439 (44.19%), Positives = 250/439 (56.95%), Query Frame = 0

Query: 9   NFYNPGWRNHPKFAWGGQRSNSQVPQAQQ--KAVNQSGFAKSQELP----QQNKQALLQQ 68
           N YNPGW+ HP F+W GQ S++     QQ  +A    GF  S   P    Q N+Q    Q
Sbjct: 328 NTYNPGWKQHPNFSWSGQGSSNTTGHNQQYKEAYTPPGFPNSPAFPPTPHQYNQQKNYVQ 387

Query: 69  NSGNSLESM-------------------------------MKDYMARNDVIILSQQASLR 128
            +  +L +M                               +KDYM RNDV       ++R
Sbjct: 388 PAQQNLSNMEILMKELITKNDATMKELMTRTDVTMKDMKDVKDYMGRNDV-------TVR 447

Query: 129 TLELQVGQLANELKARPQGNIPSDIEHPIREGKEQVKAVTLRSGKPLEERKEPSKPQEVE 188
            LE+Q+GQL NE++ RPQG++PS  E P R GKE   ++  RSG   E    P  P E  
Sbjct: 448 KLEMQLGQLVNEVRTRPQGSLPSSTEEPRRIGKEHCNSIATRSGLKYE---GPRMPDESS 507

Query: 189 KSSDKNNVVEKELESGQGAGGSNNDAGASGSVPD--VEPPY-VPPPPY----NPPLPFPQ 248
            S  +    EK+ +                +VPD  VEP   VP  P      PP PFPQ
Sbjct: 508 HSPSR----EKDTQ----------------AVPDKIVEPAVSVPVAPQVSNSRPPPPFPQ 567

Query: 249 RQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMPNYAKFLKDVLTKKKRFGEFETVALT 308
           R   KNQD  F+KFL+ILKQLHINIP VE +EQMP YAKF+KD++T+KK+ GE+ETVALT
Sbjct: 568 RLVRKNQDNNFRKFLDILKQLHINIPFVEALEQMPTYAKFIKDIITRKKKLGEYETVALT 627

Query: 309 EECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRAHCDLGASINLMPLSVYRKLGIGDAT 368
           E  S + K+ +P K+ +PGSFTIP  +GGK++GRA CDLGASINLMPLS+++K  IG A+
Sbjct: 628 ECSSNVFKSKMPPKLKDPGSFTIPCLIGGKDVGRALCDLGASINLMPLSIFKKFEIGKAS 687

Query: 369 PTTITLQLADKSITYPEGKIEDVLVQVLDEFV--------EDHVEKDLMEYHTQKF---G 393
           PTT+TLQLAD+SIT PEGKIEDVLV+V D+F+        +   +KD+     + F   G
Sbjct: 688 PTTVTLQLADRSITKPEGKIEDVLVKV-DKFIFPTDFIILDCEADKDVPIILGRPFLATG 734

BLAST of Lag0028022 vs. ExPASy TrEMBL
Match: A0A6J1EQ90 (uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC111436411 PE=4 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 1.4e-71
Identity = 173/357 (48.46%), Positives = 219/357 (61.34%), Query Frame = 0

Query: 9   NFYNPGWRNHPKFAWGGQRSNSQVPQAQQKAVNQSGFAKSQELPQQNKQ--------ALL 68
           N YNPGWRNHP F+W GQ   +Q  Q   KA   SGF    +L   ++Q           
Sbjct: 388 NTYNPGWRNHPNFSWKGQSLYNQ--QMPPKANYPSGFRLQNQLAYSSQQVNTQGKGTTQA 447

Query: 69  QQNSGNSLESMMKDYMARNDVIILSQQASLRTLELQVGQLANELKARPQGNIPSDIEHPI 128
           Q  S  S+ES++K+YMA+ND +I SQQASLR LE+Q+G   N      QG          
Sbjct: 448 QYTSETSIESLIKEYMAKNDAVIQSQQASLRNLEVQIGGEKN----AEQG---------- 507

Query: 129 REGKEQVKAVTLRSGKPLEERKEPSKP-QEVEKSSDKNNVVEKELESGQGAGGSNNDAGA 188
            +   Q  A T +  +    +KE SK   EVE+          E ES             
Sbjct: 508 -DSHSQETADTQQRNEEAAVQKEHSKDYAEVEEQPKMQTTASSEQES------------- 567

Query: 189 SGSVPDVEPPYVPPPPYNPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEVIEQMP 248
                           Y P  PFPQR + K ++  F+KF++ILK++HINIPLVE ++QMP
Sbjct: 568 --------------RTYTPSPPFPQRIKRKKEEAHFEKFMDILKEIHINIPLVEALKQMP 627

Query: 249 NYAKFLKDVLTKKKRFGEFETVALTEECSAILKNGLPSKVGNPGSFTIPVSMGGKELGRA 308
           NY KFLKDVL  +++F EF+ V+L EECSAILKN +P K  +PGSFTIPVS+GGKELGRA
Sbjct: 628 NYVKFLKDVLINRRKFEEFKVVSLNEECSAILKNKIPLKEKDPGSFTIPVSIGGKELGRA 687

Query: 309 HCDLGASINLMPLSVYRKLGIGDATPTTITLQLADKSITYPEGKIEDVLVQVLDEFV 357
            CDLGA+INLMPLS+Y+KLGIG+A PTT+TLQLAD+SITYPEGKIED+L+QV D+F+
Sbjct: 688 LCDLGANINLMPLSIYKKLGIGEARPTTVTLQLADRSITYPEGKIEDILIQV-DKFI 699

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_030509265.1 | 5.6e-91 | 57.35 | uncharacterized protein LOC115723943 [Cannabis sativa]
XP_030505532.1 | 2.0e-88 | 53.74 | uncharacterized protein LOC115720524 [Cannabis sativa]
XP_024028757.1 | 2.6e-80 | 50.41 | uncharacterized protein LOC112093792 [Morus notabilis]
XP_022951570.1 | 1.9e-78 | 54.01 | uncharacterized protein LOC111454344 [Cucurbita moschata]
XP_022960431.1 | 3.5e-77 | 48.75 | uncharacterized protein LOC111461167 [Cucurbita moschata]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GJ68 | 9.1e-79 | 54.01 | uncharacterized protein LOC111454344 OS=Cucurbita moschata OX=3662 GN=LOC1114543...
A0A6J1H7K8 | 1.7e-77 | 48.75 | uncharacterized protein LOC111461167 OS=Cucurbita moschata OX=3662 GN=LOC1114611...
A0A6J0ZX64 | 8.2e-72 | 47.86 | LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6J1DY39 | 1.4e-71 | 44.19 | uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1EQ90 | 1.4e-71 | 48.46 | uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC1114364...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 263..371; e-value: 1.1E-12; score: 49.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 106..210
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 186..205
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 117..162
None | No IPR available | PANTHER | PTHR33067:SF23 | SUBFAMILY NOT NAMED | coord: 117..351
None | No IPR available | PANTHER | PTHR33067 | FAMILY NOT NAMED | coord: 117..351
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 287..360; e-value: 5.03584E-6; score: 43.094

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0028022.1 | Lag0028022.1 | mRNA