Tan0009576 (gene) Snake gourd v1

Overview
NameTan0009576
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionATP-dependent zinc metalloprotease
LocationLG02: 651242 .. 653890 (+)
RNA-Seq ExpressionTan0009576
SyntenyTan0009576
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGGCTCTCCTTACTTCACCCAAACTCCAAATTTCCTCTTCGCTTCTCAAATTCCAACCTTTCCATTACCCATTTTCCTTTCATTTTCAGCGGAAAACCCCTAATGGAACCAATAAACATTTCCATTTAGAACGCCATCAGCGTCGAGCTCTTCGCGAATGGCAGGACTACGAAGAGGCTGTGAAGCGCAAGGACCTCGCTGAAGCTCTCAGGTTTCTCGAATCTTTTGACAGAGAGAGCGCAATCGAACCCATTAATGGTTCGGCGGCTGCTGATTCAGCTCCGTCCGCTCTTGGGAATCCGCGGTTGTCTGGCTGGGAGAGGGACTGGGAGGTGCTAGACACTTGTTTGAATGCTGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTCTTCTCAATTTTGGAAAATGCAGGAACATTGGTACGCCCCTTTTCTTCTACTGTCTCCACCATGCGAATTTTTTTCTTTAGTGTTAGAAGGTTTTTCCCCTTTACTCATGCATTCTGCTGATATCAATTTCGGAAAGTTTAGTTTACCTTTGGAAGTTAGGTCCCATCTGTTAATGCTTTTAGTTGGTGCAAATGTACAAAATCCTTCAAGCCACTAATCTTATAGTTATGTGGAAGTTTTTTTTTGTAATTCACTATGGATTTTGGATTTTTTATTCCCTTTTTGTTTTATAAATTTCATCCACCAATGAAATTCTTGTTTCGGGGCTTTATTGTTGATTTTCAGTGCCCATTTAACTTGATGATCTGTGTATGTCGAGTTGGCTCGAGGCTTTACATTTTTCTGAAGTTAGGACTTGAAAGATAGCTCCCAAATTTATTATCTTCCAACTTAATCCACCAATTTTATACTTATTGGAGCATAGTTCATAGTCATACTGATGAACTGACCATACCTGCAGTTTTGGAGGGTCGAAGAGATGTCCCGCCATTTATGTTGGAGTCTACAACTGGATTAGAAGGTGATTTCATAAATCCTTGCTTTCAGATGGACTTATTATTCATATTCAACTTATCACTTACTTTAATTTTCCTGACCTCCCGTGTAATTCCAGTGTCCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTCGGTACGCTTTGATTGCTTTTCTTGGTGGAACATCATTTCTTCTCTCACAGGACATAGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTAGCATTTTTGGACTCTATCCTCCTTGGTGGTACTTGTCTAGCTCAAATCTCAAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGGTATGTTCTACTAATGTTGAATCATGTAGAACTTTGAGGTCATTTCTACCAATTGTTCATATGATATTTACTTGGAATAGTAGTTCATGCAATTATGCAAGCACAGCCGCTCAATGTTGATGAAAGTTCCTTGTATCTTATTGCAGCTTACCTCATGGGTTGCCCAATTCGTGGAGTAATTTTGGATCCAATTGTTGCTATGCAAATGGGGATACAAGGGCAGGTAAACTATCTTTTCATATGCATTTCAAACCATGGTCTTTAAATGATGGTAATTGAAAATGTTTTGAATATGAGATATTCATGTGGGTTATGCATCAAGCCGGTGGCTTTGTAACTCGACAAAAAGAATATGATAAAATTTATTCAATAAGATAAAACTCAACAGTACATTGTAGCTGGGTGTCAATGTGATAAAAAGGACAAAGAATCGATTAGATAATATAATTAGACTTCCGACAGTTTGTGGCTAATGAACAAGTTGTCTCATAAGTTGGATTGCTATATGTGAAATCTCTGTTTCTGTATTAATGAGTCTTTTTATTCTGAATTTAATAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCCTTTGATAGGTGATCATGTTAAATAGTTCCCCCTTTTGAAAATGGCTCCACGCCTTTTTCCATTCCTTTTCCTACTAAACAAATCAATCTGAAAAAGAGCTCAAGTAGCAATATTTATATTGGATTCGGCTTCCTGAGCCTTTTAAGGCTTCAAAAGGTTGATCTACAAAATAATGAAAGATATGAGACATTTCTTAGCAATGTTTTCTGATTTGAGTTATCATATGAGTGGACTAATTTTATGCCTGAAACTCTGTATATAATAACTAACTTGGCAATGCCCTGGGTCATGACTGTGGTCAGGTACTGCATGGTTCTGTTTGCGGGCATTGCTGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCGTTCTTTTGCAACCCCCATTGTCTGTTGCGCAGGTTCTCAATCTAACGTGCTTGAGTAGATATATTCTTCAAATGTGCACAGTTATAGACCTCTTTTCTGACTTCTTTTGGTGTTTTCAATTTATAGATGTCAAATCAAGCAAGATGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACAGGCACACCAAGTTGCTGTCAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAGAATTGAGGAAGCATTGTCGAAATATAGATGA

mRNA sequence

ATGATGGCTCTCCTTACTTCACCCAAACTCCAAATTTCCTCTTCGCTTCTCAAATTCCAACCTTTCCATTACCCATTTTCCTTTCATTTTCAGCGGAAAACCCCTAATGGAACCAATAAACATTTCCATTTAGAACGCCATCAGCGTCGAGCTCTTCGCGAATGGCAGGACTACGAAGAGGCTGTGAAGCGCAAGGACCTCGCTGAAGCTCTCAGGTTTCTCGAATCTTTTGACAGAGAGAGCGCAATCGAACCCATTAATGGTTCGGCGGCTGCTGATTCAGCTCCGTCCGCTCTTGGGAATCCGCGGTTGTCTGGCTGGGAGAGGGACTGGGAGGTGCTAGACACTTGTTTGAATGCTGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTCTTCTCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGATGTCCCGCCATTTATGTTGGAGTCTACAACTGGATTAGAAGTGTCCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTCGGTACGCTTTGATTGCTTTTCTTGGTGGAACATCATTTCTTCTCTCACAGGACATAGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTAGCATTTTTGGACTCTATCCTCCTTGGTGGTACTTGTCTAGCTCAAATCTCAAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTACCTCATGGGTTGCCCAATTCGTGGAGTAATTTTGGATCCAATTGTTGCTATGCAAATGGGGATACAAGGGCAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCCTTTGATAGGTACTGCATGGTTCTGTTTGCGGGCATTGCTGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCGTTCTTTTGCAACCCCCATTGTCTGTTGCGCAGATGTCAAATCAAGCAAGATGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACAGGCACACCAAGTTGCTGTCAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAGAATTGAGGAAGCATTGTCGAAATATAGATGA

Coding sequence (CDS)

ATGATGGCTCTCCTTACTTCACCCAAACTCCAAATTTCCTCTTCGCTTCTCAAATTCCAACCTTTCCATTACCCATTTTCCTTTCATTTTCAGCGGAAAACCCCTAATGGAACCAATAAACATTTCCATTTAGAACGCCATCAGCGTCGAGCTCTTCGCGAATGGCAGGACTACGAAGAGGCTGTGAAGCGCAAGGACCTCGCTGAAGCTCTCAGGTTTCTCGAATCTTTTGACAGAGAGAGCGCAATCGAACCCATTAATGGTTCGGCGGCTGCTGATTCAGCTCCGTCCGCTCTTGGGAATCCGCGGTTGTCTGGCTGGGAGAGGGACTGGGAGGTGCTAGACACTTGTTTGAATGCTGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTCTTCTCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGATGTCCCGCCATTTATGTTGGAGTCTACAACTGGATTAGAAGTGTCCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTCGGTACGCTTTGATTGCTTTTCTTGGTGGAACATCATTTCTTCTCTCACAGGACATAGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTAGCATTTTTGGACTCTATCCTCCTTGGTGGTACTTGTCTAGCTCAAATCTCAAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTACCTCATGGGTTGCCCAATTCGTGGAGTAATTTTGGATCCAATTGTTGCTATGCAAATGGGGATACAAGGGCAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCCTTTGATAGGTACTGCATGGTTCTGTTTGCGGGCATTGCTGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCGTTCTTTTGCAACCCCCATTGTCTGTTGCGCAGATGTCAAATCAAGCAAGATGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACAGGCACACCAAGTTGCTGTCAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAGAATTGAGGAAGCATTGTCGAAATATAGATGA

Protein sequence

MMALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQRRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALSKYR
Homology
BLAST of Tan0009576 vs. NCBI nr
Match: XP_008447096.1 (PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >XP_008447097.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >KAA0051124.1 uncharacterized protein E6C27_scaffold511G00710 [Cucumis melo var. makuwa])

HSP 1 Score: 717.2 bits (1850), Expect = 7.7e-203
Identity = 362/401 (90.27%), Postives = 375/401 (93.52%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQ-------RRALRE 61
           MA+L+ PKL ISSSLL+ Q FHYP  FHFQ+K PNG NKHFHL+RH         RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 62  WQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVL 121
           WQDYEEAVKRKDLAEALRFLESFDR+SAIEPIN SA A SAPSA+GN RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 122 DTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSP 181
           DTCLNADDMKLVANAY FL+DRGFL NFGKCRNIVLEG+RDV P +LESTTGLEVSKLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 182 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 241
           KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 242 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 301
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 302 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 361
           RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 362 AVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           AVLQSYNLLKWHK AHQVAVKA+ESGSSLSVVIRRIE+ALS
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALS 401

BLAST of Tan0009576 vs. NCBI nr
Match: XP_038888049.1 (uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida])

HSP 1 Score: 716.1 bits (1847), Expect = 1.7e-202
Identity = 363/402 (90.30%), Postives = 376/402 (93.53%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-----RALREWQ 61
           MA+L+ PKL ISSSLL+FQ  HYP  F+FQ+K PNG NKHF+LERHQR     RAL EWQ
Sbjct: 1   MAVLSPPKLLISSSLLQFQQLHYPIPFNFQQKNPNGINKHFYLERHQRLLPLSRALSEWQ 60

Query: 62  DYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDT 121
           DYEEAVKRKDLAEALRFLESFDR+SAIEPIN SA A SAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSALANPRLSGWERDWEVLDT 120

Query: 122 CLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKK 181
           CLNADDMKLVA+AYGFLRDRGFL NFGK RNIVLEGRRDV P +LESTTGLEVSKLSPKK
Sbjct: 121 CLNADDMKLVADAYGFLRDRGFLPNFGKFRNIVLEGRRDVTPSVLESTTGLEVSKLSPKK 180

Query: 182 WGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 241
           WG+SGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGVSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 242 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 301
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRL 300

Query: 302 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 361
           DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 362 LQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALSKYR 399
           LQSYNLLKWHK AHQ AVKALESGSSLSVVIRRIE+ALS  R
Sbjct: 361 LQSYNLLKWHKHAHQTAVKALESGSSLSVVIRRIEDALSTNR 402

BLAST of Tan0009576 vs. NCBI nr
Match: XP_022147989.1 (uncharacterized protein LOC111016783 [Momordica charantia] >XP_022147990.1 uncharacterized protein LOC111016783 [Momordica charantia])

HSP 1 Score: 713.0 bits (1839), Expect = 1.4e-201
Identity = 359/399 (89.97%), Postives = 375/399 (93.98%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-----RALREWQ 61
           MA+ + PKLQISSS L FQPF +  SFHF +KTP G  +HFHLER QR     RALREWQ
Sbjct: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60

Query: 62  DYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDT 121
           DYEEAVKRKDLAEALRFLESFDR+SAIEP+N SAAADSAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120

Query: 122 CLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKK 181
           CLNADDMKLVANAYGFLRDRGFL NFGKCRNIVLEGRRDV P +LES+TGL+V+KLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180

Query: 182 WGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 241
           WGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 242 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 301
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 302 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 361
           DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 362 LQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           LQSYNLLKWHK AHQVAVKALESGSSLSVVIR+IE+ALS
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALS 399

BLAST of Tan0009576 vs. NCBI nr
Match: XP_004139896.1 (uncharacterized protein LOC101213430 [Cucumis sativus] >XP_011659042.1 uncharacterized protein LOC101213430 [Cucumis sativus])

HSP 1 Score: 706.8 bits (1823), Expect = 1.0e-199
Identity = 359/401 (89.53%), Postives = 371/401 (92.52%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-------RALRE 61
           MA+L+ PKL ISSSL + Q FHYP  FHFQ+K PNG NK+FHLERH         RALRE
Sbjct: 1   MAILSPPKLLISSSLPQSQLFHYPIPFHFQQKNPNGINKYFHLERHHHQRLLPLSRALRE 60

Query: 62  WQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVL 121
           WQDYEEAVKRKDLAEALRFLESFDR+SAIEPI  SA A SAPSA+ N RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPIKDSAPAGSAPSAIRNLRLSGWERDWEVL 120

Query: 122 DTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSP 181
           DTCLNADDMKLVANAY FL+DRGFL NFGKCRNIVLEGRRDV P +LE TTGLEVSKLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGRRDVTPSVLELTTGLEVSKLSP 180

Query: 182 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 241
           KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 242 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 301
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 302 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 361
           RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 360

Query: 362 AVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           AVLQSYNLLKWHK AHQVAVKA+ESGSSLSVVIR+IE+ALS
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRKIEDALS 401

BLAST of Tan0009576 vs. NCBI nr
Match: XP_022969425.1 (uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima] >XP_022969426.1 uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima])

HSP 1 Score: 703.0 bits (1813), Expect = 1.5e-198
Identity = 360/402 (89.55%), Postives = 371/402 (92.29%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-----RALREWQ 61
           M++ + PKL IS SLL+FQ FH P  FHFQ+K  NG N+HFHL+RHQR     RA+REWQ
Sbjct: 1   MSIHSPPKLLISPSLLQFQSFHCPLPFHFQQK--NGINEHFHLQRHQRLLLLPRAIREWQ 60

Query: 62  DYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDT 121
           +YEEAVKRKDLAEALRFLESF RESAIEP N SA ADSAPSALGNPRLSGWERDWEVLDT
Sbjct: 61  EYEEAVKRKDLAEALRFLESFGRESAIEPPNDSALADSAPSALGNPRLSGWERDWEVLDT 120

Query: 122 CLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKK 181
           CLNADDMKLVANAYGFLRDRGFL NFGKCRNIVLEG RDV P +LESTTGLEVSKLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGPRDVTPSVLESTTGLEVSKLSPKK 180

Query: 182 WGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 241
           WGLSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPP
Sbjct: 181 WGLSGSSRYALIACLGGTSFLLSQDIDIRPNLFALLGLAFLDSILLGGTCLAQISSCWPP 240

Query: 242 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 301
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 302 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 361
           DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 360

Query: 362 LQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALSKYR 399
           LQSYNLLKWHK AHQVAVKALESGSSLSVVIRR+E ALS  R
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRMENALSTNR 400

BLAST of Tan0009576 vs. ExPASy TrEMBL
Match: A0A1S3BH83 (uncharacterized protein LOC103489633 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489633 PE=4 SV=1)

HSP 1 Score: 717.2 bits (1850), Expect = 3.7e-203
Identity = 362/401 (90.27%), Postives = 375/401 (93.52%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQ-------RRALRE 61
           MA+L+ PKL ISSSLL+ Q FHYP  FHFQ+K PNG NKHFHL+RH         RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 62  WQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVL 121
           WQDYEEAVKRKDLAEALRFLESFDR+SAIEPIN SA A SAPSA+GN RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 122 DTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSP 181
           DTCLNADDMKLVANAY FL+DRGFL NFGKCRNIVLEG+RDV P +LESTTGLEVSKLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 182 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 241
           KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 242 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 301
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 302 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 361
           RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 362 AVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           AVLQSYNLLKWHK AHQVAVKA+ESGSSLSVVIRRIE+ALS
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALS 401

BLAST of Tan0009576 vs. ExPASy TrEMBL
Match: A0A5A7U732 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold511G00710 PE=4 SV=1)

HSP 1 Score: 717.2 bits (1850), Expect = 3.7e-203
Identity = 362/401 (90.27%), Postives = 375/401 (93.52%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQ-------RRALRE 61
           MA+L+ PKL ISSSLL+ Q FHYP  FHFQ+K PNG NKHFHL+RH         RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 62  WQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVL 121
           WQDYEEAVKRKDLAEALRFLESFDR+SAIEPIN SA A SAPSA+GN RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 122 DTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSP 181
           DTCLNADDMKLVANAY FL+DRGFL NFGKCRNIVLEG+RDV P +LESTTGLEVSKLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 182 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 241
           KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 242 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 301
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 302 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 361
           RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 362 AVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           AVLQSYNLLKWHK AHQVAVKA+ESGSSLSVVIRRIE+ALS
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALS 401

BLAST of Tan0009576 vs. ExPASy TrEMBL
Match: A0A6J1D1P2 (uncharacterized protein LOC111016783 OS=Momordica charantia OX=3673 GN=LOC111016783 PE=4 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 7.0e-202
Identity = 359/399 (89.97%), Postives = 375/399 (93.98%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-----RALREWQ 61
           MA+ + PKLQISSS L FQPF +  SFHF +KTP G  +HFHLER QR     RALREWQ
Sbjct: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60

Query: 62  DYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDT 121
           DYEEAVKRKDLAEALRFLESFDR+SAIEP+N SAAADSAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120

Query: 122 CLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKK 181
           CLNADDMKLVANAYGFLRDRGFL NFGKCRNIVLEGRRDV P +LES+TGL+V+KLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180

Query: 182 WGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 241
           WGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 242 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 301
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 302 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 361
           DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 362 LQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           LQSYNLLKWHK AHQVAVKALESGSSLSVVIR+IE+ALS
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALS 399

BLAST of Tan0009576 vs. ExPASy TrEMBL
Match: A0A0A0K7I5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239000 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 5.0e-200
Identity = 359/401 (89.53%), Postives = 371/401 (92.52%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-------RALRE 61
           MA+L+ PKL ISSSL + Q FHYP  FHFQ+K PNG NK+FHLERH         RALRE
Sbjct: 1   MAILSPPKLLISSSLPQSQLFHYPIPFHFQQKNPNGINKYFHLERHHHQRLLPLSRALRE 60

Query: 62  WQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVL 121
           WQDYEEAVKRKDLAEALRFLESFDR+SAIEPI  SA A SAPSA+ N RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPIKDSAPAGSAPSAIRNLRLSGWERDWEVL 120

Query: 122 DTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSP 181
           DTCLNADDMKLVANAY FL+DRGFL NFGKCRNIVLEGRRDV P +LE TTGLEVSKLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGRRDVTPSVLELTTGLEVSKLSP 180

Query: 182 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 241
           KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 242 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 301
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 302 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 361
           RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 360

Query: 362 AVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           AVLQSYNLLKWHK AHQVAVKA+ESGSSLSVVIR+IE+ALS
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRKIEDALS 401

BLAST of Tan0009576 vs. ExPASy TrEMBL
Match: A0A6J1HZW5 (uncharacterized protein LOC111468437 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111468437 PE=4 SV=1)

HSP 1 Score: 703.0 bits (1813), Expect = 7.2e-199
Identity = 360/402 (89.55%), Postives = 371/402 (92.29%), Query Frame = 0

Query: 2   MALLTSPKLQISSSLLKFQPFHYPFSFHFQRKTPNGTNKHFHLERHQR-----RALREWQ 61
           M++ + PKL IS SLL+FQ FH P  FHFQ+K  NG N+HFHL+RHQR     RA+REWQ
Sbjct: 1   MSIHSPPKLLISPSLLQFQSFHCPLPFHFQQK--NGINEHFHLQRHQRLLLLPRAIREWQ 60

Query: 62  DYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALGNPRLSGWERDWEVLDT 121
           +YEEAVKRKDLAEALRFLESF RESAIEP N SA ADSAPSALGNPRLSGWERDWEVLDT
Sbjct: 61  EYEEAVKRKDLAEALRFLESFGRESAIEPPNDSALADSAPSALGNPRLSGWERDWEVLDT 120

Query: 122 CLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFMLESTTGLEVSKLSPKK 181
           CLNADDMKLVANAYGFLRDRGFL NFGKCRNIVLEG RDV P +LESTTGLEVSKLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGPRDVTPSVLESTTGLEVSKLSPKK 180

Query: 182 WGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 241
           WGLSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPP
Sbjct: 181 WGLSGSSRYALIACLGGTSFLLSQDIDIRPNLFALLGLAFLDSILLGGTCLAQISSCWPP 240

Query: 242 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 301
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 302 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 361
           DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 360

Query: 362 LQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALSKYR 399
           LQSYNLLKWHK AHQVAVKALESGSSLSVVIRR+E ALS  R
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRMENALSTNR 400

BLAST of Tan0009576 vs. TAIR 10
Match: AT1G56180.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27290.1); Has 436 Blast hits to 436 proteins in 83 species: Archae - 0; Bacteria - 153; Metazoa - 0; Fungi - 0; Plants - 160; Viruses - 0; Other Eukaryotes - 123 (source: NCBI BLink). )

HSP 1 Score: 505.4 bits (1300), Expect = 4.3e-143
Identity = 252/355 (70.99%), Postives = 296/355 (83.38%), Query Frame = 0

Query: 44  LERHQRR---ALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINGSAAADSAPSALG 103
           + +H+ R   ALREW++YE+AVKRKDLA ALRFL+S + +   + +     A    S LG
Sbjct: 37  VRKHELRRPSALREWREYEDAVKRKDLAGALRFLKSIENDEQRDSVESIVTAKL--SGLG 96

Query: 104 NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLLNFGKCRNIVLEGRRDVPPFM 163
              L   ERDW+VLD CLNADDM+LV +A+ FL++RG L NFGK  +IVLEG R+V P +
Sbjct: 97  ALEL---ERDWQVLDACLNADDMRLVGSAFRFLKERGLLANFGKFTSIVLEGTREVTPTV 156

Query: 164 LESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSI 223
           L+S TGLEV+KLSPKKWGLSG S  AL A LGG S+LLSQ+ID+RPNL  +LGLA+LDS+
Sbjct: 157 LKSATGLEVTKLSPKKWGLSGGSSIALAALLGGVSYLLSQEIDVRPNLAVILGLAYLDSV 216

Query: 224 LLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGT 283
            LGGTCLAQ+S YWPP++RRI+VHEAGHLL AYLMGCPIRGVILDP+VAMQMG+QGQAGT
Sbjct: 217 FLGGTCLAQVSCYWPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQMGVQGQAGT 276

Query: 284 QFWDEKMASNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQ 343
           QFWD+KM S +AEGRL G+SFDRY MVLFAGIAAEALVYGEAEGGENDENLFRSI VLL+
Sbjct: 277 QFWDQKMESEIAEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLE 336

Query: 344 PPLSVAQMSNQARWAVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEALS 396
           PPLSVAQMSNQARW+VLQSYNLLKWHK AH+ AV+AL+ GS LS+VIRRIEEA+S
Sbjct: 337 PPLSVAQMSNQARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAMS 386

BLAST of Tan0009576 vs. TAIR 10
Match: AT2G21960.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G56180.1); Has 224 Blast hits to 222 proteins in 59 species: Archae - 0; Bacteria - 65; Metazoa - 0; Fungi - 0; Plants - 134; Viruses - 0; Other Eukaryotes - 25 (source: NCBI BLink). )

HSP 1 Score: 102.1 bits (253), Expect = 1.1e-21
Identity = 60/167 (35.93%), Postives = 84/167 (50.30%), Query Frame = 0

Query: 227 LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEK 286
           ++  S+++P Y+ RI  HEA H L AYL+G PI G  LD          G+      DE+
Sbjct: 174 ISGFSTFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLD---------IGKEHVNLIDER 233

Query: 287 MASNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVA 346
           +A  +  G+LD    DR   V  AG+AAE L Y +  G   D    +      QP +S  
Sbjct: 234 LAKLIYSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNE 293

Query: 347 QMSNQARWAVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEA 394
           Q  N  RWAVL S +LLK +K  H+  + A+   +S+   I+ IE A
Sbjct: 294 QQQNLTRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIETA 331

BLAST of Tan0009576 vs. TAIR 10
Match: AT5G27290.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 93.2 bits (230), Expect = 5.1e-19
Identity = 70/237 (29.54%), Postives = 112/237 (47.26%), Query Frame = 0

Query: 170 SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGG 229
           S LSP    L    R   IA + G   +  +  D+    L  L L F     LD +   G
Sbjct: 103 SLLSPTDTTLGSIERNLQIAAVSG-GIVAWKAFDLSSQQLFFLTLGFMFLWTLDLVSFNG 162

Query: 230 TCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQ 289
              + +      ++   Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  Q
Sbjct: 163 GIGSLVLDTTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQ 222

Query: 290 AGTQFWDEKMASNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICV 349
           AG+ F D +    +  G++  T  +R+  +  AG+A E L+YG AEGG +D +    +  
Sbjct: 223 AGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLLYGYAEGGLDDISKLDGLVK 282

Query: 350 LLQPPLSVAQMSNQARWAVLQSYNLLKWHKQAHQVAVKALESGSSLSVVIRRIEEAL 395
            L    +  +  +Q RW+VL +  LL+ H+ A     +A+  G S+   I+ IE+++
Sbjct: 283 SL--GFTQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQIIEDSI 336

BLAST of Tan0009576 vs. TAIR 10
Match: AT5G27290.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 199 Blast hits to 194 proteins in 57 species: Archae - 0; Bacteria - 61; Metazoa - 0; Fungi - 0; Plants - 129; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )

HSP 1 Score: 60.1 bits (144), Expect = 4.7e-09
Identity = 47/161 (29.19%), Postives = 72/161 (44.72%), Query Frame = 0

Query: 170 SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGG 229
           S LSP    L    R   IA + G   +  +  D+    L  L L F     LD +   G
Sbjct: 103 SLLSPTDTTLGSIERNLQIAAVSG-GIVAWKAFDLSSQQLFFLTLGFMFLWTLDLVSFNG 162

Query: 230 TCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQ 289
              + +      ++   Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  Q
Sbjct: 163 GIGSLVLDTTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQ 222

Query: 290 AGTQFWDEKMASNLAEGRLDGTSFDRYCMVLFAGIAAEALV 319
           AG+ F D +    +  G++  T  +R+  +  AG+A E L+
Sbjct: 223 AGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLL 262

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_008447096.17.7e-20390.27PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >XP_00... [more]
XP_038888049.11.7e-20290.30uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida][more]
XP_022147989.11.4e-20189.97uncharacterized protein LOC111016783 [Momordica charantia] >XP_022147990.1 uncha... [more]
XP_004139896.11.0e-19989.53uncharacterized protein LOC101213430 [Cucumis sativus] >XP_011659042.1 uncharact... [more]
XP_022969425.11.5e-19889.55uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima] >XP_022969426... [more]
Match NameE-valueIdentityDescription
A0A1S3BH833.7e-20390.27uncharacterized protein LOC103489633 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5A7U7323.7e-20390.27Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A6J1D1P27.0e-20289.97uncharacterized protein LOC111016783 OS=Momordica charantia OX=3673 GN=LOC111016... [more]
A0A0A0K7I55.0e-20089.53Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239000 PE=4 SV=1[more]
A0A6J1HZW57.2e-19989.55uncharacterized protein LOC111468437 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT1G56180.14.3e-14370.99unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas... [more]
AT2G21960.11.1e-2135.93unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... [more]
AT5G27290.15.1e-1929.54unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... [more]
AT5G27290.24.7e-0929.19unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR037219Peptidase M41-likeGENE3D1.20.58.760Peptidase M41coord: 232..337
e-value: 4.8E-10
score: 41.5
IPR037219Peptidase M41-likeSUPERFAMILY140990FtsH protease domain-likecoord: 234..383
NoneNo IPR availablePANTHERPTHR33471FAMILY NOT NAMEDcoord: 40..396
NoneNo IPR availablePANTHERPTHR33471:SF7ATP-DEPENDENT ZINC METALLOPROTEASEcoord: 40..396

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009576.1Tan0009576.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016020 membrane
molecular_function GO:0005524 ATP binding
molecular_function GO:0004176 ATP-dependent peptidase activity
molecular_function GO:0004222 metalloendopeptidase activity