Lag0000600 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0000600
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: chr4: 11097563 .. 11098936 (-)
RNA-Seq Expression: Lag0000600
Synteny: Lag0000600
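
The Location field packs the chromosome, a start and end coordinate, and the strand into one string. The following is a minimal Python sketch for unpacking it and computing the span. The function name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption rather than something this record states.

```python
import re

def parse_location(text):
    """Split a location such as 'chr4: 11097563 .. 11098936 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("chr4: 11097563 .. 11098936 (-)")
print(loc["strand"], loc["length"])  # - 1374
```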
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTCTGATGCTTCTTCTTCTTCTTCTTCCAGTGGTCCGATAATCACACCCGTAGCTTCACCATCTACCCCAAATACTACACCGATTGTCACACCGATTACAAACACCCAGAATGTTCAGCCTCAAATCCCTCGACCAAATCCTCAAATTACTCGACCAAATCCCCAAAATCCTCAACAACCATTTCAACCTCAACCTTCGATTTCTACTCATCAACATTATCAAGCCTATGCCCAACCATTTTCCCCAAATTTCTATAATCCTCGTCCTCAGTTTTTCCCACCTCCACAACAGTTTTCACAAAATCAAATCCAAAATCCTATTCCATACCCAAACCCTTTTACCCCTAACCCTTACCCGACCTTACCCCAACCCTTATCGGTGAAGCTGAATGACTCGAACTTTCTCCTCTGGAAAAATCAGTTGCTGAATGCGGTGATTGCAAATGGGCTTCAAGGGTACCTCCATGGCTCTATTGCGGCTCCTCCCAGGTATCTCGATGATCAACAAACTCAACCGAATCCAGATTTTCTCCATTGGGAAAGGTACAATCGGTTTATCATGTGTTGGATATACTCTTCTCTGTCTGAGGAAAAAATGGGTGAGATAGTAAGTTTTGACTCTGCTGCTGCTATTTGGAACTCTTTGAAACGATCCTATGATTCTAAAACTACGGCTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATAAAGAAGGATAACCTCTCTGTTAGTCAATATCTGTCTCAAATAAAGGAAGTAGCTGATAAATTTTCTGCAATAGGTGAGCCCATCTCTTATAGGGACCATTTAGCTCATATCTTAGATGGTCTTGGTAGTGAATACAATGCCTTTGTTACTACCATACAAAATCGGTCTGATAATCCGTCTTTAGAAGATGTTAGAAGTTTATTATTGGCATATGAGGCCCGGTTGGAAAAACAGAATGCTGTGGATCAATTGAATCTTGCCCAGGCAAATTTAAGTTCTCTTAGCCTCCAAAACAGCCGTCGGTCCAACCCCAAACCAAATCACTCCATCCCCTTTAGACCTCCCTTCAATCCACAAGCCTTTTCTCCTTTTTCCTCTCAACAACACTCTGCCTCTCCAAGCCTCTTAGGCAAACCACAAACTCAACAACTTCAAAAATGGCCTTCTCGTTTATCTTCTAACAAACCTCAATGCCAAATATGTGGCAAATTTGGGCACACTGCTTTAATTTGTCACCATAGAACTAATTTGGCCTACCAAACCCCACCTCCTCAAGCCTATTGTCCACGGTTTCCGCAACCACTCCCTCCTCTGTCCCTGATGCTTTATCCACTATGTCCACTGATTCCTATCATCCTGACGAGAATTGGTTTTTAG

mRNA sequence

ATGGCCTCTGATGCTTCTTCTTCTTCTTCTTCCAGTGGTCCGATAATCACACCCGTAGCTTCACCATCTACCCCAAATACTACACCGATTGTCACACCGATTACAAACACCCAGAATGTTCAGCCTCAAATCCCTCGACCAAATCCTCAAATTACTCGACCAAATCCCCAAAATCCTCAACAACCATTTCAACCTCAACCTTCGATTTCTACTCATCAACATTATCAAGCCTATGCCCAACCATTTTCCCCAAATTTCTATAATCCTCGTCCTCAGTTTTTCCCACCTCCACAACAGTTTTCACAAAATCAAATCCAAAATCCTATTCCATACCCAAACCCTTTTACCCCTAACCCTTACCCGACCTTACCCCAACCCTTATCGGTGAAGCTGAATGACTCGAACTTTCTCCTCTGGAAAAATCAGTTGCTGAATGCGGTGATTGCAAATGGGCTTCAAGGGTACCTCCATGGCTCTATTGCGGCTCCTCCCAGGTATCTCGATGATCAACAAACTCAACCGAATCCAGATTTTCTCCATTGGGAAAGGTACAATCGGTTTATCATGTGTTGGATATACTCTTCTCTGTCTGAGGAAAAAATGGGTGAGATAGTAAGTTTTGACTCTGCTGCTGCTATTTGGAACTCTTTGAAACGATCCTATGATTCTAAAACTACGGCTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATAAAGAAGGATAACCTCTCTGTTAGTCAATATCTGTCTCAAATAAAGGAAGTAGCTGATAAATTTTCTGCAATAGGTGAGCCCATCTCTTATAGGGACCATTTAGCTCATATCTTAGATGGTCTTGGTAGTGAATACAATGCCTTTGTTACTACCATACAAAATCGGTCTGATAATCCGTCTTTAGAAGATGTTAGAAGTTTATTATTGGCATATGAGGCCCGGTTGGAAAAACAGAATGCTGTGGATCAATTGAATCTTGCCCAGGCAAATTTAAGTTCTCTTAGCCTCCAAAACAGCCGTCGGTCCAACCCCAAACCAAATCACTCCATCCCCTTTAGACCTCCCTTCAATCCACAAGCCTTTTCTCCTTTTTCCTCTCAACAACACTCTGCCTCTCCAAGCCTCTTAGGCAAACCACAAACTCAACAACTTCAAAAATGGCCTTCTCGTTTATCTTCTAACAAACCTCAATGCCAAATATGTGGCAAATTTGGGCACACTGCTTTAATTTGTCACCATAGAACTAATTTGGCCTACCAAACCCCACCTCCTCAAGCCTATTGTCCACGGTTTCCGCAACCACTCCCTCCTCTGTCCCTGATGCTTTATCCACTATGTCCACTGATTCCTATCATCCTGACGAGAATTGGTTTTTAG

Coding sequence (CDS)

ATGGCCTCTGATGCTTCTTCTTCTTCTTCTTCCAGTGGTCCGATAATCACACCCGTAGCTTCACCATCTACCCCAAATACTACACCGATTGTCACACCGATTACAAACACCCAGAATGTTCAGCCTCAAATCCCTCGACCAAATCCTCAAATTACTCGACCAAATCCCCAAAATCCTCAACAACCATTTCAACCTCAACCTTCGATTTCTACTCATCAACATTATCAAGCCTATGCCCAACCATTTTCCCCAAATTTCTATAATCCTCGTCCTCAGTTTTTCCCACCTCCACAACAGTTTTCACAAAATCAAATCCAAAATCCTATTCCATACCCAAACCCTTTTACCCCTAACCCTTACCCGACCTTACCCCAACCCTTATCGGTGAAGCTGAATGACTCGAACTTTCTCCTCTGGAAAAATCAGTTGCTGAATGCGGTGATTGCAAATGGGCTTCAAGGGTACCTCCATGGCTCTATTGCGGCTCCTCCCAGGTATCTCGATGATCAACAAACTCAACCGAATCCAGATTTTCTCCATTGGGAAAGGTACAATCGGTTTATCATGTGTTGGATATACTCTTCTCTGTCTGAGGAAAAAATGGGTGAGATAGTAAGTTTTGACTCTGCTGCTGCTATTTGGAACTCTTTGAAACGATCCTATGATTCTAAAACTACGGCTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATAAAGAAGGATAACCTCTCTGTTAGTCAATATCTGTCTCAAATAAAGGAAGTAGCTGATAAATTTTCTGCAATAGGTGAGCCCATCTCTTATAGGGACCATTTAGCTCATATCTTAGATGGTCTTGGTAGTGAATACAATGCCTTTGTTACTACCATACAAAATCGGTCTGATAATCCGTCTTTAGAAGATGTTAGAAGTTTATTATTGGCATATGAGGCCCGGTTGGAAAAACAGAATGCTGTGGATCAATTGAATCTTGCCCAGGCAAATTTAAGTTCTCTTAGCCTCCAAAACAGCCGTCGGTCCAACCCCAAACCAAATCACTCCATCCCCTTTAGACCTCCCTTCAATCCACAAGCCTTTTCTCCTTTTTCCTCTCAACAACACTCTGCCTCTCCAAGCCTCTTAGGCAAACCACAAACTCAACAACTTCAAAAATGGCCTTCTCGTTTATCTTCTAACAAACCTCAATGCCAAATATGTGGCAAATTTGGGCACACTGCTTTAATTTGTCACCATAGAACTAATTTGGCCTACCAAACCCCACCTCCTCAAGCCTATTGTCCACGGTTTCCGCAACCACTCCCTCCTCTGTCCCTGATGCTTTATCCACTATGTCCACTGATTCCTATCATCCTGACGAGAATTGGTTTTTAG

Protein sequence

MASDASSSSSSSGPIITPVASPSTPNTTPIVTPITNTQNVQPQIPRPNPQITRPNPQNPQQPFQPQPSISTHQHYQAYAQPFSPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSRLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQAYCPRFPQPLPPLSLMLYPLCPLIPIILTRIGF
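
The protein sequence corresponds to the coding sequence read codon by codon. A minimal Python sketch of that translation follows, assuming the standard genetic code; the helper name `translate` is hypothetical, and the short inputs are the first and last codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCTCTGATGCTTCT"))  # MASDAS (start of the CDS)
print(translate("ATTGGTTTTTAG"))        # IGF (end of the CDS, stop codon dropped)
```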
Homology
BLAST of Lag0000600 vs. NCBI nr
Match: XP_022155181.1 (uncharacterized protein LOC111022315 [Momordica charantia])

HSP 1 Score: 427.6 bits (1098), Expect = 1.4e-115
Identity = 229/350 (65.43%), Positives = 274/350 (78.29%), Query Frame = 0

Query: 94  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQ 153
           FPPP   + N +  P   PNPF+ NP+PTLPQPL+VKLND+NFLLWKNQLLNAVIANGL+
Sbjct: 3   FPPP---TPNFLAQP---PNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLR 62

Query: 154 GYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAI 213
           GYL G+I  PP++LD  Q QPNP +  WERYNR +MCWIYSSLSEEKMGE+VS ++   I
Sbjct: 63  GYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDI 122

Query: 214 WNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHL 273
           W+SL R YDSKTTARIMGLKT+LQ ++KD  SVSQYL++IKE+ADKF+A+GEP+SYRDHL
Sbjct: 123 WSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHL 182

Query: 274 AHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSL 333
           AH+LDGLGSEYNAFVT+I NR+D+PSLEDVRSLLLAYEARL+KQN VDQLN+AQANL +L
Sbjct: 183 AHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNL 242

Query: 334 SLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR 393
           SLQ NS+R  PK   PNH   ++  F     SP S+ Q   S S+LGKPQ+  + KWP +
Sbjct: 243 SLQHNSKRPPPKFSFPNH---YKHSF---PNSPISAAQ---SQSILGKPQS--VHKWPPK 302

Query: 394 LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS 439
            SS+K QCQICGK GH+A +C+HRTN+AY    PQA Y    P P  P S
Sbjct: 303 PSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSS 335
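
The percentages in each HSP summary are the match counts divided by the alignment length. A minimal Python sketch that recomputes them from a summary line is given below; the function name `hsp_percentages` is hypothetical and is not part of any BLAST tool.

```python
import re

# 'Posi?tives' also matches the 'Postives' spelling that appears in some report exports.
HSP_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 229/350 (65.43%), Positives = 274/350 (78.29%), Query Frame = 0"
print(hsp_percentages(line))  # (65.43, 78.29)
```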

BLAST of Lag0000600 vs. NCBI nr
Match: RVW69807.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 233.8 bits (595), Expect = 3.0e-57
Identity = 145/356 (40.73%), Positives = 198/356 (55.62%), Query Frame = 0

Query: 94  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIA 153
           FPP        + N  QNP P     T  P P+L Q LS+KL+++N LL K+QLLN +IA
Sbjct: 4   FPPTPASNSNTTTNNNQNPAPQITQMT-LPSPSLSQSLSIKLDETNLLLRKSQLLNVIIA 63

Query: 154 NGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDS 213
           NGL+ ++    ++PP+YLD    Q NP+F+ W+R N+ +M WIYSSL+   +G+IV + +
Sbjct: 64  NGLEDFIDPDQSSPPKYLDAACRQVNPEFVQWDRLNQLVMSWIYSSLTPGMVGQIVEYST 123

Query: 214 AAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISY 273
           A  IW SL   Y+S + A +M L +QLQ+IKK ++ +S+YLS++K V D+F+ IGEP+SY
Sbjct: 124 ARDIWASLNDEYESPSIATVMSLNSQLQRIKKIDIPLSEYLSRLKFVFDEFATIGEPLSY 183

Query: 274 RDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQAN 333
           RD L  IL+GL  EY+ FVT+I NRSD PSL++V SLL  YE RL +++    LN  QAN
Sbjct: 184 RDKLTRILEGLPEEYDNFVTSIHNRSDRPSLQEVHSLLHTYEYRLSQRSMDQNLNFPQAN 243

Query: 334 LSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR 393
                               P +P +N                                 
Sbjct: 244 --------------------PRQPGYN--------------------------------- 303

Query: 394 LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML 442
             ++ PQCQICGK GH AL  +HRTNL Y     P   A+ P  P Q   P+S ML
Sbjct: 304 --NSIPQCQICGKSGHIALNGYHRTNLTYHPPVFPNAAAFNPNGPGQTSSPISAML 303

BLAST of Lag0000600 vs. NCBI nr
Match: GFY85402.1 (hypothetical protein Acr_04g0001400 [Actinidia rufa])

HSP 1 Score: 228.8 bits (582), Expect = 9.5e-56
Identity = 111/231 (48.05%), Positives = 161/231 (69.70%), Query Frame = 0

Query: 96  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIAN 155
           PP   S     + IP PNP     +P P +P   QPL+VKL+D N+++WK QLLN VIAN
Sbjct: 15  PPPPTSNPLPSSSIPNPNPQILNTSPLPNMPSINQPLAVKLDDHNYIIWKEQLLNIVIAN 74

Query: 156 GLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSA 215
           GL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M WIY+S++E  +G+IV + SA
Sbjct: 75  GLEEFLDGSRVCPPRFLDPQQQQSNPEFHSWQRYNRLVMSWIYASINESMLGQIVGYTSA 134

Query: 216 AAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYR 275
           + IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y 
Sbjct: 135 SQIWEALERLYAAASFAHLTELRTALQTIKKEGLTALAYIQKFRHLCNSLASIGEPVTYT 194

Query: 276 DHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVD 322
           DHL + L GLG +YN FVT+IQ+++  PS+E+V SLLL+Y+ARLE+Q+A D
Sbjct: 195 DHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQSATD 245

BLAST of Lag0000600 vs. NCBI nr
Match: GFZ12741.1 (UBX domain-containing protein [Actinidia rufa])

HSP 1 Score: 223.4 bits (568), Expect = 4.0e-54
Identity = 132/335 (39.40%), Positives = 190/335 (56.72%), Query Frame = 0

Query: 96  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIAN 155
           PP   S     + IP PNP   N       P++ QPL+VKL+D N+++WK QLLN VIAN
Sbjct: 15  PPPPTSNPLPSSSIPKPNPQIINTSPLLNMPSINQPLAVKLDDHNYIIWKEQLLNIVIAN 74

Query: 156 GLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSA 215
           GL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M WIY+S++E  +G+IV + SA
Sbjct: 75  GLEEFLDGSRVCPPRFLDPQQQQSNPEFHSWQRYNRLVMSWIYASINESMLGQIVGYTSA 134

Query: 216 AAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYR 275
           + IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y 
Sbjct: 135 SQIWEALERLYAAASFAHLTELRTALQTIKKEGLTALAYIQKFRHLCNSLASIGEPVTYT 194

Query: 276 DHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQAN 335
           DHL + L GLG +YN FVT+IQ+++  PS+E+  S   L  + + +             N
Sbjct: 195 DHLIYFLGGLGRDYNPFVTSIQSQAIRPSVEEPTSPTSLTRKPKFK-------------N 254

Query: 336 LSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS 395
            S+ S  NS   S+P+  +     P ++P   SP              KP          
Sbjct: 255 PSTNSFPNSNSYSHPRGQNR---NPSYSPNPSSP--------------KP---------- 304

Query: 396 RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP 424
                +P+CQIC K GHTA  C+H TNL YQ PPP
Sbjct: 315 -----RPRCQICLKPGHTANKCYHHTNLNYQPPPP 304

BLAST of Lag0000600 vs. NCBI nr
Match: PON47862.1 (hypothetical protein TorRG33x02_321990 [Trema orientale])

HSP 1 Score: 219.5 bits (558), Expect = 5.8e-53
Identity = 128/309 (41.42%), Positives = 187/309 (60.52%), Query Frame = 0

Query: 83  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKN 142
           +P+  NP  Q  P      Q+QI    P   P  P P  P++ QP ++KL+  N+L+WKN
Sbjct: 10  NPSTGNPTIQMPPTNIPNVQDQIIGAQP---PLPPLPILPSMNQPFTIKLDADNYLIWKN 69

Query: 143 QLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKM 202
           QLLN +IANGL+ ++ GS   PPR+ D  +   N +++ W+R+NR IM WIY+SL++  M
Sbjct: 70  QLLNVIIANGLEDFIDGSRPCPPRFTDPARQIVNAEYIAWQRFNRLIMSWIYASLTQGVM 129

Query: 203 GEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFS 262
           G+IV + SA  IW +L + Y S + A+I  L+ +LQ ++KD L+  +Y+ + K + +  +
Sbjct: 130 GQIVGYASAFEIWEALNQIYTSSSLAKITELRAKLQNLRKDGLTAIEYIQKHKNICNTLA 189

Query: 263 AIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVD 322
           A+GEP+S +DHL ++  GL  EYNAFVT+I  R DN  LE++ SLLL+YE RLE QNA  
Sbjct: 190 AVGEPVSCKDHLLYLFGGLDREYNAFVTSITKRPDNLPLEEIYSLLLSYEFRLESQNASA 249

Query: 323 QLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG 382
           QL+  QANL+ L   N  +   +PN S P   F   F    Q F       +   PS+LG
Sbjct: 250 QLSSLQANLAHL---NINKKPYRPNFSNPVGHFTQNFQNRTQQFQSHPPNSNQFQPSILG 309

Query: 383 KPQTQQLQK 386
           KPQ + + +
Sbjct: 310 KPQGKHMNQ 312

BLAST of Lag0000600 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.8e-20
Identity = 71/310 (22.90%), Positives = 141/310 (45.48%), Query Frame = 0

Query: 130 KLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYL-DDQQTQPNPDFLHWERYNRFI 189
           KL  +N+L+W  Q+        L G+L GS   PP  +  D   + NPD+  W+R ++ I
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLI 84

Query: 190 MCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQ 249
              +  ++S      +    +AA IW +L++ Y + +   +  L+TQL++  K   ++  
Sbjct: 85  YSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDD 144

Query: 250 YLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLL 309
           Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P+L ++   LL
Sbjct: 145 YMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLL 204

Query: 310 AYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQH 369
            +E+++    AV    +     +++S +N+  +N   N +               +++  
Sbjct: 205 NHESKI---LAVSSATVIPITANAVSHRNTTTTNNNNNGN--------------RNNRYD 264

Query: 370 SASPSLLGKPQTQQLQKWPSRLSSNKP---QCQICGKFGHTALIC---HHRTNLAYQTPP 429
           + + +   KP  Q    +    + +KP   +CQICG  GH+A  C    H  +      P
Sbjct: 265 NRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQP 317

Query: 430 PQAYCPRFPQ 433
           P  + P  P+
Sbjct: 325 PSPFTPWQPR 317

BLAST of Lag0000600 vs. ExPASy TrEMBL
Match: A0A6J1DQX7 (uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022315 PE=4 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 6.7e-116
Identity = 229/350 (65.43%), Positives = 274/350 (78.29%), Query Frame = 0

Query: 94  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQ 153
           FPPP   + N +  P   PNPF+ NP+PTLPQPL+VKLND+NFLLWKNQLLNAVIANGL+
Sbjct: 3   FPPP---TPNFLAQP---PNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLR 62

Query: 154 GYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAI 213
           GYL G+I  PP++LD  Q QPNP +  WERYNR +MCWIYSSLSEEKMGE+VS ++   I
Sbjct: 63  GYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDI 122

Query: 214 WNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHL 273
           W+SL R YDSKTTARIMGLKT+LQ ++KD  SVSQYL++IKE+ADKF+A+GEP+SYRDHL
Sbjct: 123 WSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHL 182

Query: 274 AHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSL 333
           AH+LDGLGSEYNAFVT+I NR+D+PSLEDVRSLLLAYEARL+KQN VDQLN+AQANL +L
Sbjct: 183 AHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNL 242

Query: 334 SLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR 393
           SLQ NS+R  PK   PNH   ++  F     SP S+ Q   S S+LGKPQ+  + KWP +
Sbjct: 243 SLQHNSKRPPPKFSFPNH---YKHSF---PNSPISAAQ---SQSILGKPQS--VHKWPPK 302

Query: 394 LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS 439
            SS+K QCQICGK GH+A +C+HRTN+AY    PQA Y    P P  P S
Sbjct: 303 PSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSS 335

BLAST of Lag0000600 vs. ExPASy TrEMBL
Match: A0A438GC62 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_210 PE=4 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 1.4e-57
Identity = 145/356 (40.73%), Positives = 198/356 (55.62%), Query Frame = 0

Query: 94  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIA 153
           FPP        + N  QNP P     T  P P+L Q LS+KL+++N LL K+QLLN +IA
Sbjct: 4   FPPTPASNSNTTTNNNQNPAPQITQMT-LPSPSLSQSLSIKLDETNLLLRKSQLLNVIIA 63

Query: 154 NGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDS 213
           NGL+ ++    ++PP+YLD    Q NP+F+ W+R N+ +M WIYSSL+   +G+IV + +
Sbjct: 64  NGLEDFIDPDQSSPPKYLDAACRQVNPEFVQWDRLNQLVMSWIYSSLTPGMVGQIVEYST 123

Query: 214 AAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISY 273
           A  IW SL   Y+S + A +M L +QLQ+IKK ++ +S+YLS++K V D+F+ IGEP+SY
Sbjct: 124 ARDIWASLNDEYESPSIATVMSLNSQLQRIKKIDIPLSEYLSRLKFVFDEFATIGEPLSY 183

Query: 274 RDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQAN 333
           RD L  IL+GL  EY+ FVT+I NRSD PSL++V SLL  YE RL +++    LN  QAN
Sbjct: 184 RDKLTRILEGLPEEYDNFVTSIHNRSDRPSLQEVHSLLHTYEYRLSQRSMDQNLNFPQAN 243

Query: 334 LSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR 393
                               P +P +N                                 
Sbjct: 244 --------------------PRQPGYN--------------------------------- 303

Query: 394 LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML 442
             ++ PQCQICGK GH AL  +HRTNL Y     P   A+ P  P Q   P+S ML
Sbjct: 304 --NSIPQCQICGKSGHIALNGYHRTNLTYHPPVFPNAAAFNPNGPGQTSSPISAML 303

BLAST of Lag0000600 vs. ExPASy TrEMBL
Match: A0A7J0EGI5 (Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_04g0001400 PE=4 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 4.6e-56
Identity = 111/231 (48.05%), Positives = 161/231 (69.70%), Query Frame = 0

Query: 96  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIAN 155
           PP   S     + IP PNP     +P P +P   QPL+VKL+D N+++WK QLLN VIAN
Sbjct: 15  PPPPTSNPLPSSSIPNPNPQILNTSPLPNMPSINQPLAVKLDDHNYIIWKEQLLNIVIAN 74

Query: 156 GLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSA 215
           GL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M WIY+S++E  +G+IV + SA
Sbjct: 75  GLEEFLDGSRVCPPRFLDPQQQQSNPEFHSWQRYNRLVMSWIYASINESMLGQIVGYTSA 134

Query: 216 AAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYR 275
           + IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y 
Sbjct: 135 SQIWEALERLYAAASFAHLTELRTALQTIKKEGLTALAYIQKFRHLCNSLASIGEPVTYT 194

Query: 276 DHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVD 322
           DHL + L GLG +YN FVT+IQ+++  PS+E+V SLLL+Y+ARLE+Q+A D
Sbjct: 195 DHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQSATD 245

BLAST of Lag0000600 vs. ExPASy TrEMBL
Match: A0A7J0GPN0 (UBX domain-containing protein OS=Actinidia rufa OX=165716 GN=Acr_23g0011260 PE=4 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 1.9e-54
Identity = 132/335 (39.40%), Positives = 190/335 (56.72%), Query Frame = 0

Query: 96  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIAN 155
           PP   S     + IP PNP   N       P++ QPL+VKL+D N+++WK QLLN VIAN
Sbjct: 15  PPPPTSNPLPSSSIPKPNPQIINTSPLLNMPSINQPLAVKLDDHNYIIWKEQLLNIVIAN 74

Query: 156 GLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSA 215
           GL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M WIY+S++E  +G+IV + SA
Sbjct: 75  GLEEFLDGSRVCPPRFLDPQQQQSNPEFHSWQRYNRLVMSWIYASINESMLGQIVGYTSA 134

Query: 216 AAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYR 275
           + IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y 
Sbjct: 135 SQIWEALERLYAAASFAHLTELRTALQTIKKEGLTALAYIQKFRHLCNSLASIGEPVTYT 194

Query: 276 DHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQAN 335
           DHL + L GLG +YN FVT+IQ+++  PS+E+  S   L  + + +             N
Sbjct: 195 DHLIYFLGGLGRDYNPFVTSIQSQAIRPSVEEPTSPTSLTRKPKFK-------------N 254

Query: 336 LSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS 395
            S+ S  NS   S+P+  +     P ++P   SP              KP          
Sbjct: 255 PSTNSFPNSNSYSHPRGQNR---NPSYSPNPSSP--------------KP---------- 304

Query: 396 RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP 424
                +P+CQIC K GHTA  C+H TNL YQ PPP
Sbjct: 315 -----RPRCQICLKPGHTANKCYHHTNLNYQPPPP 304

BLAST of Lag0000600 vs. ExPASy TrEMBL
Match: A0A2P5BGF8 (Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_321990 PE=4 SV=1)

HSP 1 Score: 219.5 bits (558), Expect = 2.8e-53
Identity = 128/309 (41.42%), Positives = 187/309 (60.52%), Query Frame = 0

Query: 83  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKN 142
           +P+  NP  Q  P      Q+QI    P   P  P P  P++ QP ++KL+  N+L+WKN
Sbjct: 10  NPSTGNPTIQMPPTNIPNVQDQIIGAQP---PLPPLPILPSMNQPFTIKLDADNYLIWKN 69

Query: 143 QLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKM 202
           QLLN +IANGL+ ++ GS   PPR+ D  +   N +++ W+R+NR IM WIY+SL++  M
Sbjct: 70  QLLNVIIANGLEDFIDGSRPCPPRFTDPARQIVNAEYIAWQRFNRLIMSWIYASLTQGVM 129

Query: 203 GEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFS 262
           G+IV + SA  IW +L + Y S + A+I  L+ +LQ ++KD L+  +Y+ + K + +  +
Sbjct: 130 GQIVGYASAFEIWEALNQIYTSSSLAKITELRAKLQNLRKDGLTAIEYIQKHKNICNTLA 189

Query: 263 AIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVD 322
           A+GEP+S +DHL ++  GL  EYNAFVT+I  R DN  LE++ SLLL+YE RLE QNA  
Sbjct: 190 AVGEPVSCKDHLLYLFGGLDREYNAFVTSITKRPDNLPLEEIYSLLLSYEFRLESQNASA 249

Query: 323 QLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG 382
           QL+  QANL+ L   N  +   +PN S P   F   F    Q F       +   PS+LG
Sbjct: 250 QLSSLQANLAHL---NINKKPYRPNFSNPVGHFTQNFQNRTQQFQSHPPNSNQFQPSILG 309

Query: 383 KPQTQQLQK 386
           KPQ + + +
Sbjct: 310 KPQGKHMNQ 312

BLAST of Lag0000600 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 4.2e-17
Identity = 54/209 (25.84%), Positives = 108/209 (51.67%), Query Frame = 0

Query: 127 LSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNR 186
           +++ LN  N+ +W+       ++ G+ G++ GS    P       T+       W+  + 
Sbjct: 24  VTLDLNKLNYDVWRELFETLCLSFGVLGHIDGSSTPTP------MTEK-----RWKERDG 83

Query: 187 FIMCWIYSSLSEEKMGEIVSFD-SAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLS 246
            +  WIY ++++  +  I+    +A  +W SL+  +     AR +  + +L+    D+LS
Sbjct: 84  LVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTTTIDDLS 143

Query: 247 VSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRS 306
           V +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  PS  + RS
Sbjct: 144 VHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSFTEARS 203

Query: 307 LLLAYEARLEKQNAVDQLNLAQANLSSLS 335
           +LL  E+RL  ++   + +L+  N  SLS
Sbjct: 204 MLLMEESRLSNKS---KSSLSHTNHPSLS 218

BLAST of Lag0000600 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 75.5 bits (184), Expect = 1.3e-13
Identity = 39/192 (20.31%), Positives = 100/192 (52.08%), Query Frame = 0

Query: 126 PLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYN 185
           P+ + + +SN+  W+   L   ++  + G++ G++              N + ++W++ +
Sbjct: 21  PVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTL-----------LPTNANDVNWQKRD 80

Query: 186 RFIMCWIYSSLSEEK-MGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNL 245
             +   +Y +L+ ++  G  V+  ++  IW  +K  + +   AR + L ++L+     ++
Sbjct: 81  GIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDM 140

Query: 246 SVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVR 305
            V+ Y  ++K++AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  
Sbjct: 141 RVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAA 200

Query: 306 SLLLAYEARLEK 317
           ++L   E RL++
Sbjct: 201 TMLQEEEDRLKR 201

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_022155181.1  1.4e-115  65.43     uncharacterized protein LOC111022315 [Momordica charantia]
RVW69807.1      3.0e-57   40.73     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]
GFY85402.1      9.5e-56   48.05     hypothetical protein Acr_04g0001400 [Actinidia rufa]
GFZ12741.1      4.0e-54   39.40     UBX domain-containing protein [Actinidia rufa]
PON47862.1      5.8e-53   41.42     hypothetical protein TorRG33x02_321990 [Trema orientale]

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q94HW2          1.8e-20   22.90     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1DQX7      6.7e-116  65.43     uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A438GC62      1.4e-57   40.73     Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976...
A0A7J0EGI5      4.6e-56   48.05     Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_04g0001400 PE=4 SV=1
A0A7J0GPN0      1.9e-54   39.40     UBX domain-containing protein OS=Actinidia rufa OX=165716 GN=Acr_23g0011260 PE=4...
A0A2P5BGF8      2.8e-53   41.42     Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_321990 PE=4 SV...

TAIR 10
Match Name      E-value   Identity  Description
AT5G48050.1     4.2e-17   25.84     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT1G34070.1     1.3e-13   20.31     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description   Alignment
None      No IPR available  COILS        Coil            Coil                 coord: 316..336
None      No IPR available  PFAM         PF14223         Retrotran_gag_2      coord: 183..316; e-value: 1.6E-25; score: 89.5
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 1..46
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 334..353
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 47..63
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 360..382
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 1..69
None      No IPR available  PANTHER      PTHR34222       FAMILY NOT NAMED     coord: 136..421
None      No IPR available  PANTHER      PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 136..421
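
Several of the MobiDB-lite disorder predictions overlap, so the rows cannot simply be summed to get the disordered length. A minimal Python sketch that merges the listed coordinate ranges follows, assuming the coordinates are 1-based inclusive residue positions; `merge_intervals` is a hypothetical helper name.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges into a sorted, non-overlapping list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it if this one reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# MobiDB-lite coordinates copied from the InterPro annotations of this record.
disorder = [(1, 46), (334, 353), (47, 63), (360, 382), (1, 69)]
merged = merge_intervals(disorder)
print(merged)                                # [(1, 69), (334, 353), (360, 382)]
print(sum(e - s + 1 for s, e in merged))     # 112 residues covered
```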

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0000600.1  Lag0000600.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding