Tan0004862 (gene) Snake gourd v1

Overview
Name: Tan0004862
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotrans_gag domain-containing protein
Location: LG02: 58541932 .. 58542315 (+)
RNA-Seq Expression: Tan0004862
Synteny: Tan0004862
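
The Location field can be checked against the sequences listed for this feature. The sketch below assumes the coordinates are 1-based and inclusive (a common annotation convention, not stated on this page) and that the whole span is coding, which fits the gene, mRNA, and CDS sequences being identical here.

```python
# Coordinates copied from the Location field (assumed 1-based, inclusive).
start, end = 58541932, 58542315

# Inclusive span length in base pairs.
length_bp = end - start + 1

# Number of complete codons, assuming the entire span is coding sequence.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)  # 384 128 0
```

Under these assumptions, 128 codons correspond to 127 amino acids plus one stop codon, which agrees with the 127-residue protein sequence given for this feature.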
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG

mRNA sequence

ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG

Coding sequence (CDS)

ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG

Protein sequence

MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFSL
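
The relationship between the coding sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the names `translate`, `CODON_TABLE`, and `first_60_nt` are illustrative and not part of the database. Only the first 60 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides (20 codons) of the CDS listed above.
first_60_nt = "ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGT"
print(translate(first_60_nt))  # MPKGRTADLHPFDLEIERTC
```

Passing the full CDS string to `translate` should give the complete polypeptide, ending at the terminal TAG stop codon.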
Homology
BLAST of Tan0004862 vs. NCBI nr
Match: WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 142.5 bits (358), Expect = 2.5e-30
Identity = 74/127 (58.27%), Positives = 90/127 (70.87%), Query Frame = 0

Query: 1   MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAG 60
           MP+  T +L P D EI+RT RR LR    +  EMA++     PK IR+YF P   + Q G
Sbjct: 17  MPRDNT-NLLPLDPEIDRTYRRNLRALLNQTTEMAEE----IPKAIRDYFQPTLPASQPG 76

Query: 61  IVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRL 120
           I++ PIN NNFELK GLIQM RE AFRG   EDP+ HL++FLEICGT KMNG+S D+I+L
Sbjct: 77  IMNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKL 136

Query: 121 RLFPFSL 128
           RLFPFSL
Sbjct: 137 RLFPFSL 138

BLAST of Tan0004862 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 122.5 bits (306), Expect = 2.7e-24
Identity = 62/127 (48.82%), Positives = 82/127 (64.57%), Query Frame = 0

Query: 1   MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAG 60
           M + R+ D+ P D EIERT R L R    +   MA+++    P+ +++Y  PV N   + 
Sbjct: 1   MRRARSRDIIPVDPEIERTLRSLRRN---KILAMAEEDREVLPRTLKDYVRPVVNGNYSS 60

Query: 61  IVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRL 120
           I+  PINANNFELK  LI MV++  F G   +DPN HL  FLEIC T K+NG++ D+IRL
Sbjct: 61  IMRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRL 120

Query: 121 RLFPFSL 128
           RLFPFSL
Sbjct: 121 RLFPFSL 124

BLAST of Tan0004862 vs. NCBI nr
Match: XP_017428686.1 (PREDICTED: uncharacterized protein LOC108336731 [Vigna angularis])

HSP 1 Score: 121.7 bits (304), Expect = 4.5e-24
Identity = 66/133 (49.62%), Positives = 83/133 (62.41%), Query Frame = 0

Query: 12  FDLEIERTCR-----------------RLLREGRAEPQEMADQELINNPKPIREYFLPVF 71
           FD EIERT R                 RL+ +G+ E   MADQE +   K +R+Y +P  
Sbjct: 18  FDSEIERTARRNRSSARRRKRERRQEQRLIEQGK-ETSTMADQEPVR--KTLRDYSMPNP 77

Query: 72  NSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGIS 128
           NS Q  IV  PI ANNFE+K  L+Q++++  F G  +EDPNSHL+NFL IC T K NG+S
Sbjct: 78  NSYQGSIVRPPIQANNFEIKPALLQVIQQNQFGGTNSEDPNSHLENFLAICDTLKYNGVS 137

BLAST of Tan0004862 vs. NCBI nr
Match: XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])

HSP 1 Score: 120.6 bits (301), Expect = 1.0e-23
Identity = 66/132 (50.00%), Positives = 81/132 (61.36%), Query Frame = 0

Query: 1   MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQEL-----INNPKPIREYFLPVFN 60
           M + R  DL   D E ERT R L    R E + MA+Q++      N  + IR+Y  PV N
Sbjct: 94  MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153

Query: 61  SEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISA 120
              +GI    I A NFELK GLI MV++  F G A EDPN+HL +FLEIC T KMNG++ 
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213

Query: 121 DSIRLRLFPFSL 128
           D+IRLRLF FSL
Sbjct: 214 DAIRLRLFSFSL 225

BLAST of Tan0004862 vs. NCBI nr
Match: PKA48167.1 (hypothetical protein AXF42_Ash020512 [Apostasia shenzhenica])

HSP 1 Score: 119.8 bits (299), Expect = 1.7e-23
Identity = 63/142 (44.37%), Positives = 84/142 (59.15%), Query Frame = 0

Query: 1   MPKGRTADLHPFDLEIERTCRRLLREGRAEP-------------QEMADQELINNP-KPI 60
           M +    DL PFD EIERT  ++ R+ + +               +M DQ  I    + +
Sbjct: 1   MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60

Query: 61  REYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEIC 120
           REY LP  N     +V   + ANNFE+K  LIQM+++   F GL ++DPN+H+ NFLEIC
Sbjct: 61  REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120

Query: 121 GTFKMNGISADSIRLRLFPFSL 128
            TFK NG+S D+IRLRLFPFSL
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSL 142

BLAST of Tan0004862 vs. ExPASy TrEMBL
Match: A0A2H9ZY12 (Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 PE=4 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 8.4e-24
Identity = 63/142 (44.37%), Positives = 84/142 (59.15%), Query Frame = 0

Query: 1   MPKGRTADLHPFDLEIERTCRRLLREGRAEP-------------QEMADQELINNP-KPI 60
           M +    DL PFD EIERT  ++ R+ + +               +M DQ  I    + +
Sbjct: 1   MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60

Query: 61  REYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEIC 120
           REY LP  N     +V   + ANNFE+K  LIQM+++   F GL ++DPN+H+ NFLEIC
Sbjct: 61  REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120

Query: 121 GTFKMNGISADSIRLRLFPFSL 128
            TFK NG+S D+IRLRLFPFSL
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSL 142

BLAST of Tan0004862 vs. ExPASy TrEMBL
Match: A0A1U8Q202 (uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208 PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.6e-22
Identity = 63/121 (52.07%), Positives = 79/121 (65.29%), Query Frame = 0

Query: 11  PFDLEIERT-CRRL--LREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPIN 70
           P+D EIERT C RL   R+ RAE +EMA+      P+ + +Y  P        IV   I+
Sbjct: 5   PYDPEIERTLCIRLRAARQVRAETEEMAE------PRTMMDYAKPTLTGAALSIVRPAIS 64

Query: 71  ANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFS 128
           ANNFE+K  +IQM++    F G+A EDPNSH+ NFLEIC TFK NG+S D +RLRLFPFS
Sbjct: 65  ANNFEIKPAIIQMIQNTVQFCGMANEDPNSHIANFLEICDTFKHNGVSDDVVRLRLFPFS 119

BLAST of Tan0004862 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 2.7e-22
Identity = 59/98 (60.20%), Positives = 68/98 (69.39%), Query Frame = 0

Query: 30  EPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGL 89
           +P E    E  NN   IR+Y  P F     GI++ PINANN ELK GLIQMVRE  FRG 
Sbjct: 12  QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71

Query: 90  ATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFSL 128
           ATEDPN+HL  FL++CGT KMNG+  D+IRLRLFP SL
Sbjct: 72  ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSL 107

BLAST of Tan0004862 vs. ExPASy TrEMBL
Match: A0A1S3UKD4 (uncharacterized protein LOC106766267 OS=Vigna radiata var. radiata OX=3916 GN=LOC106766267 PE=4 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 4.6e-22
Identity = 63/130 (48.46%), Positives = 79/130 (60.77%), Query Frame = 0

Query: 12  FDLEIERT-------CRRLLREGRAEPQEMADQELIN------NP-KPIREYFLPVFNSE 71
           FD  IERT        RR  RE R E + +  +E  +      NP K IR+Y +P  N  
Sbjct: 12  FDSRIERTARSNRSSARRRKRERRKEQRRIEQEEETSTMTEEQNPRKTIRDYSMPDPNGY 71

Query: 72  QAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADS 128
           Q  IV  PI ANNFE+K  L+Q++++  F G  +EDPNSHL+NFL IC T K NG+S D+
Sbjct: 72  QGSIVRPPIQANNFEIKPALLQVIQQNQFGGAVSEDPNSHLENFLAICDTLKYNGVSDDA 131

BLAST of Tan0004862 vs. ExPASy TrEMBL
Match: A0A6P6W382 (uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 PE=4 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 1.3e-21
Identity = 62/136 (45.59%), Positives = 80/136 (58.82%), Query Frame = 0

Query: 8   DLHPFDLEIERTCRRLLRE-GRAEPQEM------------ADQELINNP---KPIREYFL 67
           ++ PFD EIER  RR  R     E QE+             ++E+  N    + +R++ L
Sbjct: 7   EVAPFDPEIERALRRQRRNTPHQEEQEIWQPIEEILIELPFEEEIAENEPNRRILRDFAL 66

Query: 68  PVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMN 127
           P     Q  I    +NANNFE+K  LIQMV++  + G ATEDPNSHL  FLEIC T K N
Sbjct: 67  PETQGSQTSIARPMVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLEICDTIKFN 126
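
The Identity and Positives percentages reported for each HSP are ratios over the alignment length, which includes gap columns. The sketch below assumes the usual BLASTP midline convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap); the helper names `count_midline` and `percent` are hypothetical.

```python
def count_midline(midline):
    """Return (identities, positives) counted from a BLASTP midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identical residues plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives


def percent(count, alignment_length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100 * count / alignment_length, 2)


# Figures from the first HSP (WP_217833153.1): 74 identities, 90 positives, 127 columns.
print(percent(74, 127), percent(90, 127))  # 58.27 70.87

# The final midline segment of that HSP is fully identical.
print(count_midline("RLFPFSL"))  # (7, 7)
```

Summing `count_midline` over every midline segment of an HSP should reproduce the reported counts, provided the midline whitespace has survived extraction intact.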

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value   Identity (%)   Description
WP_217833153.1    2.5e-30   58.27          retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
KAG7990634.1      2.7e-24   48.82          hypothetical protein I3843_02G035100 [Carya illinoinensis]
XP_017428686.1    4.5e-24   49.62          PREDICTED: uncharacterized protein LOC108336731 [Vigna angularis]
XP_022843226.1    1.0e-23   50.00          uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
PKA48167.1        1.7e-23   44.37          hypothetical protein AXF42_Ash020512 [Apostasia shenzhenica]

ExPASy TrEMBL
Match Name        E-value   Identity (%)   Description
A0A2H9ZY12        8.4e-24   44.37          Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 P...
A0A1U8Q202        1.6e-22   52.07          uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208...
A0A6J1DU19        2.7e-22   60.20          uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A1S3UKD4        4.6e-22   48.46          uncharacterized protein LOC106766267 OS=Vigna radiata var. radiata OX=3916 GN=LO...
A0A6P6W382        1.3e-21   45.59          uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 ...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
Tan0004862.1      Tan0004862.1      mRNA