Tan0006737 (gene) Snake gourd v1

Overview
Name: Tan0006737
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein ALP1-like
Location: LG07: 62291406 .. 62293475 (-)
RNA-Seq Expression: Tan0006737
Synteny: Tan0006737
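
The Location field packs the linkage group, start, end, and strand into a single string. As a minimal sketch, it can be split into its parts as follows. The helper name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str    # linkage group / chromosome name, e.g. "LG07"
    start: int
    end: int
    strand: str   # "+" or "-"

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


# Matches strings of the form "LG07: 62291406 .. 62293475 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'seqid: start .. end (strand)' string into typed fields."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG07: 62291406 .. 62293475 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

The minus strand means the sequences listed on this page are the reverse complement of the reference strand over that interval.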
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGAGACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTCGATGATGAAGAAACTGAGATCCATGGAGATTGGGGTGAGACACCTGCTCAATCTTCTAAACGTCGTCTAGTTGGCCCATCATTGGATAGTCGTGCTTATAAGACTGCACGAACTGCTCGACAAGCCATTTTAGATGATGCTTTAAAAGTATGGACTAAGTCGATGACTGAGAAGTGCAAGCTAGGTCTTCTTCAAAAAGCGTGAATTAGAAACCACCGTATCTAATGCTGAAAAGTATAGCATTGAGGAGTGTATGACCAAGTTAGGCTCTATTTCTGGACTTCCTAGCACTGCTTATGTGAAAGCATGTGAAAAATTTGTTTTATATGAGTGGAGAGCAATGTTCATGATCATGTCAGAGGAAAATATTCGAACAATGTTAGGTGTAGAATCCTCTTCTTAACTTTTAAGATGCTTAGCTACATCTTTTGGTAAGAAATTATGAATATCAAGTGTACTTTTTTGTATCTTGTTTTGTACAAATGCTACCAAAATACTAGTTTTTGTTTCGTAGCCTATAAAGATTTTTTGTACGTTCAAGTATGTAAATGAACTTATGTCGTAAACTATATATGAATGGAATATATGATAGTCTTTTGTTATTTACTTTTAAACTATTTCAATTATTTTTTGGTTATTTATTTTATTAGGTACTATGGCACAAAATGGCGATCATGATAGTGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAAATTCCTAAACACCCATGTCATACATCTTCATTAAATGGACATGAATATATGTTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTGATTCTTTTCGAATGGATAAAAATACATTTAGAGCTTTATGTGAGAGATTAAGACAATCAAATTATTTAGTGAATGATAAGATTATTAGTATTGAGGAAGCAGTTGGAATGATTTTACTCACAGTATGTCATAGCACTCGTAATAGAATTGTAATGAACGATTTCAACACTCTAAAGAGAGACCGTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGCTATGTGTATGTTGGGATGTGATGTTATCCAAGGTCCAAATATGATGGAAACTCCACCTGAAATTTTGAACAATCCCAAGTTTAATCCATGGTTCCAGGTATGTGTATTAGTATTAGCCATATTGTTTTAGTATATTTTTTTCATATTAATATCAACATCTTTTTCTTATGTTTAGAATTGTGTTGGTGCAATTGATGGAACTCACGTAAGTGCGTGGGCCTCAGATCAAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTATTCACCTTTGTCTATACTGGTTGGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAGGGAAGGGAATAATTTTCCATTACCACCTGAAGGTTAGAACTTTTTTTTGAAAAAAATATACAACATACTTATATCTTTTGTAATATGATCTTATTATAAATTTAATTCCAAATGAACAGGAAAATATTATCTTGTGGATTCTGGATATACAAATATGCCTGGTTTCTTAGCTCCATATCGTGGTGAGAGATATCATTTAAGAGATTATAAAGGAAGAGGAAGACATCCACGAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGCTCGATTCCCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAGCAACGTAGAATTCCTATTGCTTGTTGCGCAATACATAATTTTATTAGAATGAATTCAACCAGAGATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACC
TGATGAAGAAAGCTTGGGTGGAACACAAGAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAAGTAG

mRNA sequence

ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGAGACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTCGATGATGAAGAAACTGAGATCCATGGAGATTGGGGTGAGACACCTGCTCAATCTTCTAAACGTCGTCTAGTTGGCCCATCATTGGATAGTCGTGCTTATAAGACTGCACGAACTGCTCGACAAGCCATTTTAGATGATGCTTTAAAAGTATGGACTAAGTCGATGACTGAGAAGTGCAAGCTAGGTACTATGGCACAAAATGGCGATCATGATAGTGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAAATTCCTAAACACCCATGTCATACATCTTCATTAAATGGACATGAATATATGTTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTGATTCTTTTCGAATGGATAAAAATACATTTAGAGCTTTATGTGAGAGATTAAGACAATCAAATTATTTAGTGAATGATAAGATTATTAGTATTGAGGAAGCAGTTGGAATGATTTTACTCACAGTATGTCATAGCACTCGTAATAGAATTGTAATGAACGATTTCAACACTCTAAAGAGAGACCGTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGCTATGTGTATGTTGGGATGTGATGTTATCCAAGGTCCAAATATGATGGAAACTCCACCTGAAATTTTGAACAATCCCAAGTTTAATCCATGGTTCCAGTGCGTGGGCCTCAGATCAAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTATTCACCTTTGTCTATACTGGTTGGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAGGGAAGGGAATAATTTTCCATTACCACCTGAAGGAAAATATTATCTTGTGGATTCTGGATATACAAATATGCCTGGTTTCTTAGCTCCATATCGTGGTGAGAGATATCATTTAAGAGATTATAAAGGAAGAGGAAGACATCCACGAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGCTCGATTCCCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAGCAACGTAGAATTCCTATTGCTTGTTGCGCAATACATAATTTTATTAGAATGAATTCAACCAGAGATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACCTGATGAAGAAAGCTTGGGTGGAACACAAGAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAAGTAG

Coding sequence (CDS)

ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGAGACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTCGATGATGAAGAAACTGAGATCCATGGAGATTGGGGTGAGACACCTGCTCAATCTTCTAAACGTCGTCTAGTTGGCCCATCATTGGATAGTCGTGCTTATAAGACTGCACGAACTGCTCGACAAGCCATTTTAGATGATGCTTTAAAAGTATGGACTAAGTCGATGACTGAGAAGTGCAAGCTAGGTACTATGGCACAAAATGGCGATCATGATAGTGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAAATTCCTAAACACCCATGTCATACATCTTCATTAAATGGACATGAATATATGTTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTGATTCTTTTCGAATGGATAAAAATACATTTAGAGCTTTATGTGAGAGATTAAGACAATCAAATTATTTAGTGAATGATAAGATTATTAGTATTGAGGAAGCAGTTGGAATGATTTTACTCACAGTATGTCATAGCACTCGTAATAGAATTGTAATGAACGATTTCAACACTCTAAAGAGAGACCGTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGCTATGTGTATGTTGGGATGTGATGTTATCCAAGGTCCAAATATGATGGAAACTCCACCTGAAATTTTGAACAATCCCAAGTTTAATCCATGGTTCCAGTGCGTGGGCCTCAGATCAAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTATTCACCTTTGTCTATACTGGTTGGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAGGGAAGGGAATAATTTTCCATTACCACCTGAAGGAAAATATTATCTTGTGGATTCTGGATATACAAATATGCCTGGTTTCTTAGCTCCATATCGTGGTGAGAGATATCATTTAAGAGATTATAAAGGAAGAGGAAGACATCCACGAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGCTCGATTCCCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAGCAACGTAGAATTCCTATTGCTTGTTGCGCAATACATAATTTTATTAGAATGAATTCAACCAGAGATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACCTGATGAAGAAAGCTTGGGTGGAACACAAGAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAAGTAG

Protein sequence

MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRAYKTARTARQAILDDALKVWTKSMTEKCKLGTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQCVGLRSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYIK
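
The protein sequence is the conceptual translation of the CDS. The sketch below checks that relationship on the first ten codons and the last four codons of the CDS shown above. It assumes the standard nuclear genetic code, and `translate` is a hypothetical helper, not part of the database's tooling:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCACTATTCATCGACTGAGGTACCTCCT"  # first 30 nt of the CDS above
cds_end = "TATATAAAGTAG"                      # last 12 nt of the CDS above
print(translate(cds_start))  # MHYSSTEVPP
print(translate(cds_end))    # YIK
```

Both results agree with the start (MHYSSTEVPP) and end (YIK) of the protein sequence listed above.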
Homology
BLAST of Tan0006737 vs. NCBI nr
Match: XP_008237544.2 (PREDICTED: uncharacterized protein LOC103336277 [Prunus mume])

HSP 1 Score: 453.8 bits (1166), Expect = 1.9e-123
Identity = 222/392 (56.63%), Positives = 279/392 (71.17%), Query Frame = 0

Query: 90  EKCKLGTMAQNGDH---DSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGHE 149
           + C   + + +  H   +SDSS++ D+ +G +L+   + NEY     K PC  S L+G +
Sbjct: 7   DNCHSSSKSDSSSHSSSESDSSSNLDDSLGELLLFIKL-NEYYSRFHKEPCMISQLSGRQ 66

Query: 150 YMLELLNGHPDRIFDSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHST 209
           +++ELL GHP+R+FD  RMDKNTF  LC  LR  ++L +D+ I +EE++ M L  + H+T
Sbjct: 67  FVIELLTGHPNRLFDFARMDKNTFMNLCSTLRGLDFLQDDRSICVEESMCMFLRIIGHTT 126

Query: 210 RNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQ- 269
           RNR+    F   K + VSRQF+RVL+A+C  G  +IQ PNM  TPP+I+ NP ++ WF+ 
Sbjct: 127 RNRMDAEKFQHSK-ETVSRQFNRVLKAICRFGTQIIQPPNMDMTPPKIMGNPNYHSWFKK 186

Query: 270 --CVGL-----------RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSR 329
             C+G             SKQ PYRGRK+ VTQNIMCACSF+MLFT+VYTGWE TANDSR
Sbjct: 187 NDCIGAIDATHINAWAPASKQIPYRGRKIEVTQNIMCACSFDMLFTYVYTGWERTANDSR 246

Query: 330 VLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQE 389
           VL+DAI RE NNFPLP EGKYY+VDSGY NM G LAPY GERYHL DY+GRGRHPRG  E
Sbjct: 247 VLMDAISREENNFPLPKEGKYYVVDSGYANMRGSLAPYHGERYHLCDYRGRGRHPRGAME 306

Query: 390 FFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDT 449
            FNY+HSSLRNVI+RCFGVLKARFPILKLMPN PIRKQ+RIP+ACC +HNFIRM S  D 
Sbjct: 307 LFNYRHSSLRNVIKRCFGVLKARFPILKLMPNNPIRKQKRIPVACCTVHNFIRMQSRNDI 366

Query: 450 LFEEYQVADLEVPDEESLGGTQEFLDMNLSQA 465
           +F +YQ  DL+V DEES    QE + ++   A
Sbjct: 367 IFHQYQANDLQVVDEESSEVNQEHIHLHEDNA 396
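
The percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns. A small sketch using the counts from the HSP above (the helper name `percent` is hypothetical):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the Prunus mume HSP header: 222/392 identical, 279/392 positive.
identity = percent(222, 392)
positives = percent(279, 392)
print(identity, positives)  # 56.63 71.17
```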

BLAST of Tan0006737 vs. NCBI nr
Match: XP_018505745.1 (PREDICTED: uncharacterized protein LOC103958625 [Pyrus x bretschneideri])

HSP 1 Score: 450.3 bits (1157), Expect = 2.1e-122
Identity = 216/336 (64.29%), Positives = 259/336 (77.08%), Query Frame = 0

Query: 139 TSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMI 198
           TS  +G +Y++ELLNGHP R+FD  RMDKNTFR LC  LR+ N+L +D+ I +EEAV M 
Sbjct: 2   TSHFSGRQYVIELLNGHPQRLFDIVRMDKNTFRNLCSTLRELNFLQDDRSICVEEAVCMF 61

Query: 199 LLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNP 258
           L T+ H+ RNR++   F   K + VSRQF+RVL+A+C LG  +IQ PNM  TP EIL NP
Sbjct: 62  LFTISHTIRNRVIAETFQHSK-ETVSRQFNRVLKAICRLGTRIIQPPNMDATPSEILGNP 121

Query: 259 KFNPWFQ-CVGL-----------RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEG 318
           K++PWFQ C+G             SKQ  YRGRKV VTQNIM ACSFNM+FT+VYTGWEG
Sbjct: 122 KYDPWFQNCIGAIDGTHISAWVPSSKQISYRGRKVSVTQNIMLACSFNMMFTYVYTGWEG 181

Query: 319 TANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRH 378
           TANDSRVL+DAI RE N FP+P EGKYY+VDSGY NM GFLAPYR  RYHLRD++GRG+ 
Sbjct: 182 TANDSRVLMDAITREDNRFPMPKEGKYYVVDSGYANMSGFLAPYRKVRYHLRDFRGRGKR 241

Query: 379 PRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRM 438
           PRG  E FN++HSSLRNV+ERC GVLK RFPILKLMPNYPIRKQRRIPIACCA+HNFIRM
Sbjct: 242 PRGAMELFNFRHSSLRNVVERCIGVLKNRFPILKLMPNYPIRKQRRIPIACCAVHNFIRM 301

Query: 439 NSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLS 463
            S  DTLF++++  D++V DEES G  QE  +M+L+
Sbjct: 302 QSRNDTLFQQFEDNDVDVVDEESSGTNQEGENMHLN 336

BLAST of Tan0006737 vs. NCBI nr
Match: XP_024164615.1 (uncharacterized protein LOC112171704 [Rosa chinensis])

HSP 1 Score: 449.9 bits (1156), Expect = 2.7e-122
Identity = 222/374 (59.36%), Positives = 264/374 (70.59%), Query Frame = 0

Query: 102 DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFD 161
           D DSDSS+  DEM   M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI++
Sbjct: 3   DGDSDSSSELDEMAQHMIICMNIYDYWSSYIDKVPCHTSILSGAEYVQELLNGHPDRIYN 62

Query: 162 SFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRD 221
           SFRMDK+ F+ LC  L   N L +D+ + I+EAV + L  V HS R R+    F   K D
Sbjct: 63  SFRMDKHVFQRLCCTLESLNLLKDDRHVGIQEAVAIFLYIVSHSERMRMAAERFQRSK-D 122

Query: 222 RVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQ-CVGL----------- 281
            + RQF RVL A+C L   +I+  +  ETPPEILNNPKF P+F+ C+G            
Sbjct: 123 TIHRQFKRVLAALCKLSPQIIRAQSQGETPPEILNNPKFYPYFEKCIGAIDGTHVAAWAP 182

Query: 282 RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPP 341
             KQT YRGRKV+VTQN+MCACSF+M+FTFVYTGWEGTANDSRV  DA+ R  N FP P 
Sbjct: 183 AQKQTSYRGRKVLVTQNVMCACSFDMMFTFVYTGWEGTANDSRVFADAVTRPENKFPFPN 242

Query: 342 EGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCF 401
           EG YY+VD+GYTNMPGFLAPYRGERYHLRDY+G  R PRGP+E FNY+HSSLRNVIERCF
Sbjct: 243 EGYYYVVDAGYTNMPGFLAPYRGERYHLRDYRGPRRTPRGPRELFNYRHSSLRNVIERCF 302

Query: 402 GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEES 461
           GVLKARFPILK MPNYP R+QRRIPIACC +HNFIR  + RD LFE + V D+   +E S
Sbjct: 303 GVLKARFPILKYMPNYPPRRQRRIPIACCVLHNFIRKEARRDRLFEAFDVEDMIFEEENS 362

Query: 462 LGGTQEFLDMNLSQ 464
                +    NL+Q
Sbjct: 363 TPANLDMSQENLAQ 375

BLAST of Tan0006737 vs. NCBI nr
Match: XP_024163380.1 (uncharacterized protein LOC112170350 [Rosa chinensis])

HSP 1 Score: 448.4 bits (1152), Expect = 7.8e-122
Identity = 221/372 (59.41%), Positives = 263/372 (70.70%), Query Frame = 0

Query: 104 DSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSF 163
           DSDSS+  DEM   M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI++SF
Sbjct: 5   DSDSSSELDEMAQHMIICMNIYDYWSSYIDKVPCHTSILSGAEYVQELLNGHPDRIYNSF 64

Query: 164 RMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRV 223
           RMDK+ F+ LC  L   N L +D+ + I+EAV + L  V HS R R+    F   K D +
Sbjct: 65  RMDKHVFQRLCCTLESLNLLKDDRHVGIQEAVAIFLYIVSHSERMRMAAERFQRSK-DTI 124

Query: 224 SRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQ-CVGL-----------RS 283
            RQF RVL A+C L   +I+  +  ETPPEILNNPKF P+F+ C+G              
Sbjct: 125 HRQFKRVLAALCKLSPQIIRAQSQGETPPEILNNPKFYPYFEKCIGAIDGTHVAAWAPAQ 184

Query: 284 KQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEG 343
           KQT YRGRKV+VTQN+MCACSF+M+FTFVYTGWEGTANDSRV  DA+ R  N FP P EG
Sbjct: 185 KQTSYRGRKVLVTQNVMCACSFDMMFTFVYTGWEGTANDSRVFADAVTRPENKFPFPNEG 244

Query: 344 KYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGV 403
            YY+VD+GYTNMPGFLAPYRGERYHLRDY+G  R PRGP+E FNY+HSSLRNVIERCFGV
Sbjct: 245 YYYVVDAGYTNMPGFLAPYRGERYHLRDYRGPRRTPRGPRELFNYRHSSLRNVIERCFGV 304

Query: 404 LKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEESLG 463
           LKARFPILK MPNYP R+QRRIPIACC +HNFIR  + RD LFE + V D+   +E S  
Sbjct: 305 LKARFPILKYMPNYPPRRQRRIPIACCVLHNFIRKEARRDRLFEAFDVEDMIFEEENSTP 364

BLAST of Tan0006737 vs. NCBI nr
Match: RZC55946.1 (hypothetical protein C5167_014815 [Papaver somniferum])

HSP 1 Score: 442.2 bits (1136), Expect = 5.6e-120
Identity = 236/482 (48.96%), Positives = 301/482 (62.45%), Query Frame = 0

Query: 1   MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPS 60
           M  +ST  PP+SD ERDLE EF+  GA                                 
Sbjct: 141 MSNASTRSPPDSDTERDLETEFLGKGA--------------------------------- 200

Query: 61  LDSRAYKTARTARQAILDDALKVWTKSMTEKCKLGTM-AQNGDHDSDSSTSSDEMVGRML 120
           L  R Y               ++W  +   K +   M +++    SDSS+ S++    +L
Sbjct: 201 LYQREYLE-------------EIWLLTGISKVQSIKMESRDTSSSSDSSSDSEDSFMELL 260

Query: 121 IVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLRQ 180
           +V  +   Y     K P  TS L+G E++ ELLNGHP R+++  RMD +TF  LC  LR 
Sbjct: 261 MVKELHKRY----IKIPMMTSVLSGREFIFELLNGHPRRMYNLMRMDPSTFMLLCSTLRT 320

Query: 181 SNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGC 240
           +++L +D+ +S+EEAVG+ L TV  S RNR+V   F     + V R F +VL+A+C LGC
Sbjct: 321 NDFLQDDRSVSVEEAVGIFLATVSQSMRNRVVAEMFQH-SNETVYRHFKKVLKALCRLGC 380

Query: 241 DVIQGPNMMETPPEILNNPKFNPWF-QCVGL-----------RSKQTPYRGRKVIVTQNI 300
            +I+ PNM E PPEI+ NPKF PWF  CVG             SKQ P+RGRK  +TQNI
Sbjct: 381 LIIKPPNMDEVPPEIMTNPKFYPWFVDCVGAIDGTHISACVPASKQIPFRGRKAQITQNI 440

Query: 301 MCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFL 360
           MCACSF+MLFTFVYTGWEGTAND+RVL+DAI  E N FP+P EG+YY+VDS YTNMPGFL
Sbjct: 441 MCACSFDMLFTFVYTGWEGTANDARVLMDAISNEENKFPMPREGRYYVVDSAYTNMPGFL 500

Query: 361 APYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPI 420
            PYRGERYHLRD++GR R  +GP E FN++HSSLRNVIERCFGV K+RFPILK MPNYP+
Sbjct: 501 TPYRGERYHLRDFRGRSRQAKGPMELFNHRHSSLRNVIERCFGVWKSRFPILKCMPNYPL 560

Query: 421 RKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAY 468
           R+QR IP+ACC +HNFIR+NS  D LF ++   DL V DEES    QE   +D+++S A 
Sbjct: 561 RRQRLIPVACCTLHNFIRLNSRNDELFSQFMAEDLLVADEESSSTGQESTSIDIDVSAAN 571

BLAST of Tan0006737 vs. ExPASy TrEMBL
Match: A0A4Y7J673 (Uncharacterized protein OS=Papaver somniferum OX=3469 GN=C5167_014815 PE=3 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 2.7e-120
Identity = 236/482 (48.96%), Positives = 301/482 (62.45%), Query Frame = 0

Query: 1   MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPS 60
           M  +ST  PP+SD ERDLE EF+  GA                                 
Sbjct: 141 MSNASTRSPPDSDTERDLETEFLGKGA--------------------------------- 200

Query: 61  LDSRAYKTARTARQAILDDALKVWTKSMTEKCKLGTM-AQNGDHDSDSSTSSDEMVGRML 120
           L  R Y               ++W  +   K +   M +++    SDSS+ S++    +L
Sbjct: 201 LYQREYLE-------------EIWLLTGISKVQSIKMESRDTSSSSDSSSDSEDSFMELL 260

Query: 121 IVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLRQ 180
           +V  +   Y     K P  TS L+G E++ ELLNGHP R+++  RMD +TF  LC  LR 
Sbjct: 261 MVKELHKRY----IKIPMMTSVLSGREFIFELLNGHPRRMYNLMRMDPSTFMLLCSTLRT 320

Query: 181 SNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGC 240
           +++L +D+ +S+EEAVG+ L TV  S RNR+V   F     + V R F +VL+A+C LGC
Sbjct: 321 NDFLQDDRSVSVEEAVGIFLATVSQSMRNRVVAEMFQH-SNETVYRHFKKVLKALCRLGC 380

Query: 241 DVIQGPNMMETPPEILNNPKFNPWF-QCVGL-----------RSKQTPYRGRKVIVTQNI 300
            +I+ PNM E PPEI+ NPKF PWF  CVG             SKQ P+RGRK  +TQNI
Sbjct: 381 LIIKPPNMDEVPPEIMTNPKFYPWFVDCVGAIDGTHISACVPASKQIPFRGRKAQITQNI 440

Query: 301 MCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFL 360
           MCACSF+MLFTFVYTGWEGTAND+RVL+DAI  E N FP+P EG+YY+VDS YTNMPGFL
Sbjct: 441 MCACSFDMLFTFVYTGWEGTANDARVLMDAISNEENKFPMPREGRYYVVDSAYTNMPGFL 500

Query: 361 APYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPI 420
            PYRGERYHLRD++GR R  +GP E FN++HSSLRNVIERCFGV K+RFPILK MPNYP+
Sbjct: 501 TPYRGERYHLRDFRGRSRQAKGPMELFNHRHSSLRNVIERCFGVWKSRFPILKCMPNYPL 560

Query: 421 RKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAY 468
           R+QR IP+ACC +HNFIR+NS  D LF ++   DL V DEES    QE   +D+++S A 
Sbjct: 561 RRQRLIPVACCTLHNFIRLNSRNDELFSQFMAEDLLVADEESSSTGQESTSIDIDVSAAN 571

BLAST of Tan0006737 vs. ExPASy TrEMBL
Match: A0A2P6SH66 (Putative harbinger transposase-derived nuclease domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr1g0354781 PE=3 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 1.2e-115
Identity = 208/358 (58.10%), Positives = 253/358 (70.67%), Query Frame = 0

Query: 118 MLIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERL 177
           M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI++SFRMDK+ F+ LC  L
Sbjct: 1   MIICMNIYDYWSSYIDKVPCHTSILSGAEYVQELLNGHPDRIYNSFRMDKHVFQRLCCTL 60

Query: 178 RQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCML 237
           +    L +D+ + ++EAV + L  V HS R R+    F   K D + RQF RVL A+CML
Sbjct: 61  KSLKLLEDDRHVGVQEAVAIFLYIVSHSERMRMAAERFQRSK-DTIHRQFKRVLAALCML 120

Query: 238 GCDVIQGPNMMETPPEILNNPKFNPWFQ-CVGL-----------RSKQTPYRGRKVIVTQ 297
              +I+  +  ETP EILNNPKF P+F+ C+G              KQT YRGRKV+VTQ
Sbjct: 121 SPQIIRPQSEGETPAEILNNPKFYPYFEKCIGAIDGTHVAAWAPAQKQTSYRGRKVLVTQ 180

Query: 298 NIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPG 357
           N+MCACSF+M+FTFVYTGWEGTANDSR+  DA+ R  N FP P EG YY+VD+GYTNMPG
Sbjct: 181 NVMCACSFDMMFTFVYTGWEGTANDSRIFTDAVTRPENKFPFPNEGYYYVVDAGYTNMPG 240

Query: 358 FLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNY 417
           FLAPYRGERYHLRDY+G  R PRGP+E FNY+HSSLRNVIERCFGVLK RFPILK MPNY
Sbjct: 241 FLAPYRGERYHLRDYRGPHRTPRGPRELFNYRHSSLRNVIERCFGVLKVRFPILKYMPNY 300

Query: 418 PIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQ 464
           P R+QRRIPIACC +HNFIR  + RD LFE + V D+   +E +     +    NL+Q
Sbjct: 301 PPRRQRRIPIACCVLHNFIRKEARRDRLFEAFDVEDMIFEEENNTPTNLDMSQENLAQ 357

BLAST of Tan0006737 vs. ExPASy TrEMBL
Match: A0A6P4ALU8 (uncharacterized protein LOC107422048 OS=Ziziphus jujuba OX=326968 GN=LOC107422048 PE=3 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 9.4e-97
Identity = 184/372 (49.46%), Positives = 244/372 (65.59%), Query Frame = 0

Query: 102 DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFD 161
           D +S SS+SSDE     ++   +    +  + K P  TS L+G  ++ ELLNG     ++
Sbjct: 3   DINSSSSSSSDEEFQDFMMFLDVCEYTKRFLEKIPQRTSMLSGQNFITELLNGSEKTCYE 62

Query: 162 SFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRD 221
            FRMDKN F +LC  L+Q  YL + K + +EEA+ M L+ + H+   R++ + F     +
Sbjct: 63  LFRMDKNIFLSLCTCLKQHEYLKDTKEVRVEEALAMFLIIIGHNVGMRLIADRFQH-SLE 122

Query: 222 RVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQ-CVGL----------- 281
            V R F+  LRA+C LG ++I   N    P  I+NNPK+ PWF+ C+G            
Sbjct: 123 TVDRHFTLTLRAICRLGKELICHTN-SPLPSHIVNNPKYFPWFEKCIGAIDGTHISAHVP 182

Query: 282 RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPP 341
             KQ  YRGRK IVTQN++CAC+FNM+FTFVY GWEGTANDSRV LDAI R  N FPLP 
Sbjct: 183 AEKQVSYRGRKAIVTQNVLCACNFNMMFTFVYAGWEGTANDSRVFLDAITRSENKFPLPK 242

Query: 342 EGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCF 401
           EG+YY+VDSG+    GFL P+RGERYHL++Y GRGR PRGP+E FNY+HSSLRNVIERCF
Sbjct: 243 EGEYYVVDSGFPCTMGFLPPFRGERYHLQEYHGRGRQPRGPKELFNYRHSSLRNVIERCF 302

Query: 402 GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVPDEE- 461
           GVLKARF ILK+MP Y   +Q  I IACC +HNFIR ++  D +F +++  + E+ DEE 
Sbjct: 303 GVLKARFRILKMMPPYKQSRQPLIVIACCTLHNFIRKSAQNDVMFTQWEEEEQEIEDEEA 362

BLAST of Tan0006737 vs. ExPASy TrEMBL
Match: A0A5B6ZZY6 (DDE Tnp4 domain-containing protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_019348 PE=3 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 6.7e-95
Identity = 179/330 (54.24%), Positives = 231/330 (70.00%), Query Frame = 0

Query: 147 YMLELLNGHPDRIFDSFRMDKNTFRALCERLRQSNYLVND-KIISIEEAVGMILLTVCHS 206
           Y+ ELL+GHP RI++  RMD  TF +LC  LR      N  + + IEE++ + LLTV HS
Sbjct: 1   YIRELLDGHPTRIYEMLRMDAPTFMSLCNTLRNGYLEENQHRHVPIEESLAIFLLTVGHS 60

Query: 207 TRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWF- 266
           TR+R+V   F     + ++R    V+RA+  LG  +I+     ETPPEILNNPKFNP+F 
Sbjct: 61  TRHRVVAERFQ-YSTETINRHIKHVMRALATLGTVIIRPKIPYETPPEILNNPKFNPYFL 120

Query: 267 QCVGL-----------RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRV 326
            C+G              KQT +RGRKV VTQN++ ACSF++LFTFVY GWEG+ANDSRV
Sbjct: 121 GCIGAIDGTHIAAWAPAVKQTAFRGRKVSVTQNVLAACSFDLLFTFVYPGWEGSANDSRV 180

Query: 327 LLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEF 386
            L+AI R    FP+PP GKYY+VD+G+TNMPGFL+PY GERYH+RD++GRGR PRGP+E 
Sbjct: 181 FLNAITRGDVQFPMPPPGKYYVVDAGFTNMPGFLSPYWGERYHMRDFQGRGRGPRGPREL 240

Query: 387 FNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTL 446
           FN++HSSLRN IERCFG+LKARFP+LK M NY + +Q  + IACCAIHNF+RM+S  D L
Sbjct: 241 FNHRHSSLRNAIERCFGILKARFPLLKHMTNYKVVRQGPLVIACCAIHNFVRMHSNADML 300

Query: 447 FEEYQVADLEVPDEESLGGTQEFLDMNLSQ 464
           F+E  V D+   +E+  G      D+N+SQ
Sbjct: 301 FQEEMVQDM---NEDGDGNHAMQPDVNMSQ 326

BLAST of Tan0006737 vs. ExPASy TrEMBL
Match: A0A438CFP0 (Protein ALP1-like OS=Vitis vinifera OX=29760 GN=VvCHDh000000_4 PE=3 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 8.2e-93
Identity = 176/379 (46.44%), Positives = 247/379 (65.17%), Query Frame = 0

Query: 103 HDSDSSTSSDEMVGRMLIVTTIVNEY-ECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFD 162
           +DS SS+   + +   + +  I+NEY E  + K P  TS L+G +++ +++ GHP   ++
Sbjct: 12  YDSTSSSEEGDDLDE-IFIAHIMNEYEEIFLCKTPQRTSMLSGAQFVRDMIEGHPQTCYE 71

Query: 163 SFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRD 222
            FRMDK TF  LC+ L++   L + + +++EEAV M LL V H+ R R+V + F     +
Sbjct: 72  LFRMDKETFMNLCDHLKRHENLQDTRFVTVEEAVAMFLLIVGHNVRMRVVADRFQH-STE 131

Query: 223 RVSRQFSRVLRAMCMLGCDVIQGPNMM-ETPPEILNNPKFNPWFQ-CVGL---------- 282
            V+R F  V RA+C LG  +I   NM  E    + +NPK+ PWF+ C+G           
Sbjct: 132 TVARHFKEVRRALCRLGKILICPNNMTNEVSSYVASNPKYFPWFKDCIGAIDGTHISAWV 191

Query: 283 -RSKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLP 342
              +QT +RGRK ++TQN+MCAC+F+M+FTFVY GWEGTAND+RV LDA+ R   NFP P
Sbjct: 192 PADRQTSFRGRKTVITQNVMCACNFDMMFTFVYAGWEGTANDARVFLDALTRPEVNFPWP 251

Query: 343 PEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERC 402
            EGKYY+VDSGY  + GFL PYRGERYHL++Y+GR   P   +E FNY+HSSLRN+IERC
Sbjct: 252 SEGKYYVVDSGYPCISGFLPPYRGERYHLQEYRGRHNQPIRYKELFNYRHSSLRNIIERC 311

Query: 403 FGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMNSTRDTLFEEYQVADLEVP-DE 462
           FGVLK RFPIL++MP Y   +Q  I +ACC +HN+IR+++  D LF EY+V DL +  +E
Sbjct: 312 FGVLKTRFPILRMMPCYKPSRQPSIVVACCTLHNWIRLSTRNDQLFREYEVEDLSIEGEE 371

Query: 463 ESLGGTQEFLDMNLSQAYI 467
           ES       +D++   A +
Sbjct: 372 ESTSSRNHSIDLSDESAAV 388

BLAST of Tan0006737 vs. TAIR 10
Match: AT5G41980.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 196.1 bits (497), Expect = 6.5e-50
Identity = 114/329 (34.65%), Positives = 178/329 (54.10%), Query Frame = 0

Query: 132 IPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLRQSNYLVNDKIISI 191
           +PK     S  +G++++ ++LNG  ++ F++FRMDK  F  LC+ L+    L +   I I
Sbjct: 15  LPKEVSKISISDGNKFVYQILNGPNEQCFENFRMDKPVFYKLCDLLQTRGLLRHTNRIKI 74

Query: 192 EEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCMLGCDVIQGPNMMETP 251
           E  + + L  + H+ R R V   F     + +SR F+ VL A+  +  D  Q PN   + 
Sbjct: 75  EAQLAIFLFIIGHNLRTRAVQELF-CYSGETISRHFNNVLNAVIAISKDFFQ-PN---SN 134

Query: 252 PEILNNPKFNPWFQ-CVGL-----------RSKQTPYRGRKVIVTQNIMCACSFNMLFTF 311
            + L N   +P+F+ CVG+             +Q P+R    ++TQN++ A SF++ F +
Sbjct: 135 SDTLEND--DPYFKDCVGVVDSFHIPVMVGVDEQGPFRNGNGLLTQNVLAASSFDLRFNY 194

Query: 312 VYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRD 371
           V  GWEG+A+D +VL  A+ R   N    P+GKYY+VD+ Y N+PGF+APY G   + R+
Sbjct: 195 VLAGWEGSASDQQVLNAALTR--RNKLQVPQGKYYIVDNKYPNLPGFIAPYHGVSTNSRE 254

Query: 372 YKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCA 431
                      +E FN +H  L   I R FG LK RFPIL   P YP++ Q ++ IA CA
Sbjct: 255 ---------EAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVIAACA 314

Query: 432 IHNFIRMNSTRDTLFEEYQVADLEVPDEE 449
           +HN++R+    D +F  ++   L    E+
Sbjct: 315 LHNYVRLEKPDDLVFRMFEEETLAEAGED 325

BLAST of Tan0006737 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 136.3 bits (342), Expect = 6.1e-32
Identity = 101/296 (34.12%), Positives = 137/296 (46.28%), Query Frame = 0

Query: 119 LIVTTIVNEYECEIPKHPCHTSSLNGHEYMLELLNGHPDRIFDSFRMDKNTFRALCERLR 178
           L++   +N Y+    + P       G   +   L           RM    F  LC  L 
Sbjct: 26  LVIQPALNYYDRYFQRAPVQIDRGLGWRNIWRRLQQDAAACLQLLRMSLPCFTTLCNML- 85

Query: 179 QSNYLVNDKI-ISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRVSRQFSRVLRAMCML 238
           Q+NY +   + ISIEE+V M L    H+   R V   F    ++ V R+F  VL A  +L
Sbjct: 86  QTNYDLQPTLNISIEESVAMFLRICGHNEVYRDVGLRFGR-NQETVQRKFREVLTATELL 145

Query: 239 GCDVIQGPNMME---TPPEILNNPKFNPWFQ-----------CVGLR-SKQTPYRGRKVI 298
            CD I+ P   E    P  +  + ++ P+F            CV ++   Q  Y  R   
Sbjct: 146 ACDYIRTPTRQELYRIPERLQVDQRYWPYFSGFVGAMDGTHVCVKVKPDLQGMYWNRHDN 205

Query: 299 VTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTN 358
            + NIM  C   MLFT+++ G  G+  D+ VL  A  +  + FPLPP  KYYLVDSGY N
Sbjct: 206 ASLNIMAICDLKMLFTYIWNGAPGSCYDTAVLQIA-QQSDSEFPLPPSEKYYLVDSGYPN 265

Query: 359 MPGFLAPYRGE-----RYHLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLK 394
             G LAPYR       RYH+  +   G  PR   E FN  H+SLR+VIER F + K
Sbjct: 266 KQGLLAPYRSSRNRVVRYHMSQFY-YGPRPRNKHELFNQCHTSLRSVIERTFRIWK 317

BLAST of Tan0006737 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 136.0 bits (341), Expect = 8.0e-32
Identity = 64/136 (47.06%), Positives = 86/136 (63.24%), Query Frame = 0

Query: 296 LFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERY 355
           +F +V +GWEG+A+DSRVL DA+             K+YLVD G+ N   FLAP+RG RY
Sbjct: 24  IFIYVLSGWEGSAHDSRVLSDAL------------RKFYLVDCGFANRLNFLAPFRGVRY 83

Query: 356 HLRDYKGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPI 415
           HL+++ G+ R P  P E FN +H SLRNVIER FG+ K+RF I K  P +  +KQ  + +
Sbjct: 84  HLQEFAGQRRDPETPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVL 143

Query: 416 ACCAIHNFIRMNSTRD 432
            C A+HNF+R     D
Sbjct: 144 TCAALHNFLRKECRSD 147

BLAST of Tan0006737 vs. TAIR 10
Match: AT5G28730.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 496 Blast hits to 496 proteins in 68 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 23; Plants - 470; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 94.4 bits (233), Expect = 2.7e-19
Identity = 64/197 (32.49%), Positives = 100/197 (50.76%), Query Frame = 0

Query: 164 RMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMILLTVCHSTRNRIVMNDFNTLKRDRV 223
           RM    F  LCE L     L +   IS++E+V + L+    +   R +   F    ++ +
Sbjct: 29  RMSSEAFTQLCEILHGKYGLQSSTNISLDESVAIFLIICASNDTQRDIALRFGH-AQETI 88

Query: 224 SRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFQCVGLRSKQTPYRGRKV-I 283
            R+F  VL+AM  L  + I+ P  +E    I N  + +         ++  P+    + I
Sbjct: 89  WRKFHDVLKAMERLAVEYIR-PRKVEELRAISNRLQDD---------TRYWPFLMDLLGI 148

Query: 284 VTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTN 343
            + N++  C  +MLFT+ + G  G+ +D+RVL  AI  +   F +PP+ KYYLVDSGY N
Sbjct: 149 ASFNVLAICDLDMLFTYCFVGMAGSTHDARVLSAAIS-DDPLFHVPPDSKYYLVDSGYAN 208

Query: 344 MPGFLAPYRGERYHLRD 360
             G+LAPYR E    +D
Sbjct: 209 KRGYLAPYRREHREAQD 213

BLAST of Tan0006737 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 1.8e-12
Identity = 42/92 (45.65%), Positives = 55/92 (59.78%), Query Frame = 0

Query: 308 ANDSRVLLDAIGREGNNFPLPPEGKYYLVDSGYTNMPGFLAPYRGERYHLRDYKGRGRHP 367
           ++D++V L    R  +  P P   KYYLV+S Y    G+L P+R   YHL  + GRG  P
Sbjct: 71  SHDTKV-LKYCARNESFSPHPSNRKYYLVNSVYPTTTGYLGPHRRILYHLGQF-GRGGPP 130

Query: 368 RGPQEFFNYKHSSLRNVIERCFGVLKARFPIL 400
              QE FN KH  LR+VI+R FGV KA++ IL
Sbjct: 131 VTVQELFNRKHLDLRSVIDRTFGVWKAKWRIL 160

The following BLAST results are available for this feature:
Match Name        E-value     Identity (%)   Description
XP_008237544.2    1.9e-123    56.63          PREDICTED: uncharacterized protein LOC103336277 [Prunus mume]
XP_018505745.1    2.1e-122    64.29          PREDICTED: uncharacterized protein LOC103958625 [Pyrus x bretschneideri]
XP_024164615.1    2.7e-122    59.36          uncharacterized protein LOC112171704 [Rosa chinensis]
XP_024163380.1    7.8e-122    59.41          uncharacterized protein LOC112170350 [Rosa chinensis]
RZC55946.1        5.6e-120    48.96          hypothetical protein C5167_014815 [Papaver somniferum]

Match Name        E-value     Identity (%)   Description
A0A4Y7J673        2.7e-120    48.96          Uncharacterized protein OS=Papaver somniferum OX=3469 GN=C5167_014815 PE=3 SV=1
A0A2P6SH66        1.2e-115    58.10          Putative harbinger transposase-derived nuclease domain-containing protein OS=Ros...
A0A6P4ALU8        9.4e-97     49.46          uncharacterized protein LOC107422048 OS=Ziziphus jujuba OX=326968 GN=LOC10742204...
A0A5B6ZZY6        6.7e-95     54.24          DDE Tnp4 domain-containing protein (Fragment) OS=Davidia involucrata OX=16924 GN...
A0A438CFP0        8.2e-93     46.44          Protein ALP1-like OS=Vitis vinifera OX=29760 GN=VvCHDh000000_4 PE=3 SV=1

Match Name        E-value     Identity (%)   Description
AT5G41980.1       6.5e-50     34.65          CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT1G43722.1       6.1e-32     34.12          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G35695.1       8.0e-32     47.06          CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G28730.1       2.7e-19     32.49          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10890.1       1.8e-12     45.65          unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source       Source Term      Source Description       Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM         PF13359          DDE_Tnp_4                coord: 273..422; e-value: 1.4E-16; score: 60.5
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction      coord: 1..56
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction      coord: 9..35
None       No IPR available                               PANTHER      PTHR22930:SF211  NUCLEASE HARBI1-RELATED  coord: 143..438
None       No IPR available                               PANTHER      PTHR22930        UNCHARACTERIZED          coord: 143..438

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0006737.1    Tan0006737.1    mRNA