Tan0021779 (gene) Snake gourd v1

Overview
NameTan0021779
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCopper-binding periplasmic protein
LocationLG11: 22881278 .. 22882777 (-)
RNA-Seq ExpressionTan0021779
SyntenyTan0021779
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTACTCACTTTGTAACTCCACCGTTAATAAAAAAAAATAAAAAATTTAAAAATCTCAGTTCCATTCTCTCGCTAGGATTCTTCTTTTTGTAGGCTTTCGGTGCTTAATTTCAACACCTTTCATTCTCTTTCTCCACTTTCTCCTCCTCGTAATTCTCAGATTTTTGTTGTTCTTAACTTCGATTTCACAATCCATTCTCATTTCCTTGTTCTCATCCTCAATATGCCGATTTTCTCCATCGGAAAAGTCTGAAAAGCAGAGGAAAAAAAAAGCTGCTGTTTACTGACTCTCTTGTTCCTCAAATTCAAATCATAAGTTGATTTGACGAATTTCTTTGTTCGTTAAGCAGCTAATGACGATCGAAGTTTGCTCTGAGATTTCCAGTTTAGGAATCAGTCCGAGAATCTCGTTTTCGCATGATCTGAACCAGACTGATTTGTTGCCTTCTTCTTCCGATTGCAATGGCGATCGTTTGGATTTGACTCTTTTAGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTATGCAAGATCTCTCCTCGGCGGATGAACTTTTCTCCAATGGTAAAATTCTCCCCCTTGAAATCAAGAAATCGATCGAGCCAAACAGAGAAGTTTCTAGAGCTAACGATTCGGCTCATCTAATTCCTGAAGATCCTTCGAGAAATTCAGTCTCCTCCGAGAAGAAGAGCCTCAAAGAGCTTCTATCCGCAAGTTTCGATGCCGATGAGAAACCGCAATCCAAATCCTTCTGGCAGTTCAAGCGAAGCAGTAGTCTCAATTGCGAGAGCTCCAAGAGTAGAAGTCTGATTCGATCGTTGCATTTTCTCTCCAGAAGCAACTCGACTGGTTCGGCTTTGAATCCGAAACAGCAATCGCATTCGAAGGATTCTCAGAGGCCGAATTTGCAGAAACAGCCTTCGATTCCGAGTAGAAAATCATCGGCGTCTTCTTACTCGAACTCGTATTTTGCAAATTCGTCCTCTCAGAAGCCTTCGACGAGGAAGAATTTCGGACCGAACAATGGCAATGGAGTTCGAATTATCAGTCCTCTTCTCAATCTCCCTCCGCCATACATTTCCAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAGAATAAAAAGAAGAACAAATGAAGAAAAAAAAAAAAAAAAAAAACCTCTCGATGCTGTGTTCATAAAATTCTTCCTGCTCGAAACTGAATCTTCAAGTACAGATTAGATTGATTTCATCTGTCTATCATTTCTGCTTTCAAGCATGTGATCCCTCCCTGCAAATTCGAATAATCAGAGAAATTGCATTATTTTAATTTTGAAAATAACTTGGTGGGAATTGAAAATAGTTAATTCTTGGGGAGTGAAAAAAAAAAGGTAAAAGAAACAGTGATCTTATTATTGGTATAGCAAAAAAAATAAGGACAGTAAAAAGAGTTCATTCACATAAAAAAATGTATAAAGTTTAGGAGACACATAAACAAGT

mRNA sequence

TTTACTCACTTTGTAACTCCACCGTTAATAAAAAAAAATAAAAAATTTAAAAATCTCAGTTCCATTCTCTCGCTAGGATTCTTCTTTTTGTAGGCTTTCGGTGCTTAATTTCAACACCTTTCATTCTCTTTCTCCACTTTCTCCTCCTCGTAATTCTCAGATTTTTGTTGTTCTTAACTTCGATTTCACAATCCATTCTCATTTCCTTGTTCTCATCCTCAATATGCCGATTTTCTCCATCGGAAAAGTCTGAAAAGCAGAGGAAAAAAAAAGCTGCTGTTTACTGACTCTCTTGTTCCTCAAATTCAAATCATAAGTTGATTTGACGAATTTCTTTGTTCGTTAAGCAGCTAATGACGATCGAAGTTTGCTCTGAGATTTCCAGTTTAGGAATCAGTCCGAGAATCTCGTTTTCGCATGATCTGAACCAGACTGATTTGTTGCCTTCTTCTTCCGATTGCAATGGCGATCGTTTGGATTTGACTCTTTTAGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTATGCAAGATCTCTCCTCGGCGGATGAACTTTTCTCCAATGGTAAAATTCTCCCCCTTGAAATCAAGAAATCGATCGAGCCAAACAGAGAAGTTTCTAGAGCTAACGATTCGGCTCATCTAATTCCTGAAGATCCTTCGAGAAATTCAGTCTCCTCCGAGAAGAAGAGCCTCAAAGAGCTTCTATCCGCAAGTTTCGATGCCGATGAGAAACCGCAATCCAAATCCTTCTGGCAGTTCAAGCGAAGCAGTAGTCTCAATTGCGAGAGCTCCAAGAGTAGAAGTCTGATTCGATCGTTGCATTTTCTCTCCAGAAGCAACTCGACTGGTTCGGCTTTGAATCCGAAACAGCAATCGCATTCGAAGGATTCTCAGAGGCCGAATTTGCAGAAACAGCCTTCGATTCCGAGTAGAAAATCATCGGCGTCTTCTTACTCGAACTCGTATTTTGCAAATTCGTCCTCTCAGAAGCCTTCGACGAGGAAGAATTTCGGACCGAACAATGGCAATGGAGTTCGAATTATCAGTCCTCTTCTCAATCTCCCTCCGCCATACATTTCCAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAGAATAAAAAGAAGAACAAATGAAGAAAAAAAAAAAAAAAAAAAACCTCTCGATGCTGTGTTCATAAAATTCTTCCTGCTCGAAACTGAATCTTCAAGTACAGATTAGATTGATTTCATCTGTCTATCATTTCTGCTTTCAAGCATGTGATCCCTCCCTGCAAATTCGAATAATCAGAGAAATTGCATTATTTTAATTTTGAAAATAACTTGGTGGGAATTGAAAATAGTTAATTCTTGGGGAGTGAAAAAAAAAAGGTAAAAGAAACAGTGATCTTATTATTGGTATAGCAAAAAAAATAAGGACAGTAAAAAGAGTTCATTCACATAAAAAAATGTATAAAGTTTAGGAGACACATAAACAAGT

Coding sequence (CDS)

ATGACGATCGAAGTTTGCTCTGAGATTTCCAGTTTAGGAATCAGTCCGAGAATCTCGTTTTCGCATGATCTGAACCAGACTGATTTGTTGCCTTCTTCTTCCGATTGCAATGGCGATCGTTTGGATTTGACTCTTTTAGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTATGCAAGATCTCTCCTCGGCGGATGAACTTTTCTCCAATGGTAAAATTCTCCCCCTTGAAATCAAGAAATCGATCGAGCCAAACAGAGAAGTTTCTAGAGCTAACGATTCGGCTCATCTAATTCCTGAAGATCCTTCGAGAAATTCAGTCTCCTCCGAGAAGAAGAGCCTCAAAGAGCTTCTATCCGCAAGTTTCGATGCCGATGAGAAACCGCAATCCAAATCCTTCTGGCAGTTCAAGCGAAGCAGTAGTCTCAATTGCGAGAGCTCCAAGAGTAGAAGTCTGATTCGATCGTTGCATTTTCTCTCCAGAAGCAACTCGACTGGTTCGGCTTTGAATCCGAAACAGCAATCGCATTCGAAGGATTCTCAGAGGCCGAATTTGCAGAAACAGCCTTCGATTCCGAGTAGAAAATCATCGGCGTCTTCTTACTCGAACTCGTATTTTGCAAATTCGTCCTCTCAGAAGCCTTCGACGAGGAAGAATTTCGGACCGAACAATGGCAATGGAGTTCGAATTATCAGTCCTCTTCTCAATCTCCCTCCGCCATACATTTCCAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAGAATAAAAAGAAGAACAAATGA

Protein sequence

MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQDLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK
Homology
BLAST of Tan0021779 vs. NCBI nr
Match: XP_022971826.1 (uncharacterized protein LOC111470497 isoform X1 [Cucurbita maxima])

HSP 1 Score: 428.3 bits (1100), Expect = 4.8e-116
Identity = 234/275 (85.09%), Postives = 245/275 (89.09%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFCIGNLLLQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLS 120
           DLSSADELF NGKILP+EIKKSIEPNREVS+ N S   IP DP R+SVSSEKKSLKELLS
Sbjct: 61  DLSSADELFCNGKILPVEIKKSIEPNREVSKPNQSTPPIPPDPPRSSVSSEKKSLKELLS 120

Query: 121 ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQ------ 180
           ASFDA+EKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSALN KQ      
Sbjct: 121 ASFDAEEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSALNSKQQSQSQS 180

Query: 181 --QSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRII 240
             QS SKDS+ P+LQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVRI+
Sbjct: 181 QSQSQSKDSRTPSLQKQPSISSRKSSVSSCSNSYFANSCSQKPSTRRNFGANNGNGVRIM 240

Query: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 275

BLAST of Tan0021779 vs. NCBI nr
Match: KAG6601812.1 (hypothetical protein SDJN03_07045, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 422.5 bits (1085), Expect = 2.6e-114
Identity = 230/269 (85.50%), Postives = 242/269 (89.96%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEV SEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVSSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFCIGNLLLQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLS 120
           DLSSADELF NGKILP+EIKKS+EPNREVS+ N+S   IP DP  +SVSSEKKSLKELLS
Sbjct: 61  DLSSADELFCNGKILPVEIKKSVEPNREVSKPNESTPPIPPDPPTSSVSSEKKSLKELLS 120

Query: 121 ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK--QQSHS 180
           ASFDA+EKP SKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSALN K   QS S
Sbjct: 121 ASFDAEEKPHSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSALNSKHQSQSQS 180

Query: 181 KDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLNL 240
           KDS+ PNLQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVRI+SPLLNL
Sbjct: 181 KDSRTPNLQKQPSISSRKSSVSSCSNSYFANSFSQKPSTRRNFGANNGNGVRIMSPLLNL 240

Query: 241 PPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           PPPYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 PPPYISKVTVSFFGFGSLFCNGKNKKKNK 269

BLAST of Tan0021779 vs. NCBI nr
Match: XP_022959333.1 (uncharacterized protein LOC111460336 [Cucurbita moschata] >KAG7032519.1 hypothetical protein SDJN02_06568, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 419.1 bits (1076), Expect = 2.9e-113
Identity = 231/277 (83.39%), Postives = 243/277 (87.73%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLES--DFDFCIGNLL 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLES  DFDFCIGNLL
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFDFCIGNLL 60

Query: 61  MQDLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKEL 120
           +QDLSSADELF NGKILP+EIKKS+EPNREVS+ N+S   IP DP  +SVSSEKKSLKEL
Sbjct: 61  LQDLSSADELFCNGKILPVEIKKSVEPNREVSKPNESTPPIPPDPPTSSVSSEKKSLKEL 120

Query: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK----- 180
           LSASFDA+EKP SKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSALN K     
Sbjct: 121 LSASFDAEEKPHSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSALNSKHQSQS 180

Query: 181 ---QQSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVR 240
               QS SKDS+ PNLQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVR
Sbjct: 181 QSQSQSQSKDSRTPNLQKQPSISSRKSSVSSCSNSYFANSFSQKPSTRRNFGANNGNGVR 240

Query: 241 IISPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           I+SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 IMSPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 277

BLAST of Tan0021779 vs. NCBI nr
Match: XP_022939173.1 (uncharacterized protein LOC111445167 [Cucurbita moschata])

HSP 1 Score: 413.3 bits (1061), Expect = 1.6e-111
Identity = 231/272 (84.93%), Postives = 241/272 (88.60%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEIS++GISPRISFSHDLNQ DLLPSSSDCN +RLDLTLLESDFDFCIGNLL Q
Sbjct: 1   MAIEVCSEISTVGISPRISFSHDLNQADLLPSSSDCNRERLDLTLLESDFDFCIGNLLRQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDS--AHLIPEDPSRNSVSSEKKSLKEL 120
           DLSSADELFSNGKILP+EIKKSIEPNREV +   S     +P DP+RNSVSSEKKSLKEL
Sbjct: 61  DLSSADELFSNGKILPVEIKKSIEPNREVLKPIQSPPPPPVPPDPARNSVSSEKKSLKEL 120

Query: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHS 180
           LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK QS S
Sbjct: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKSQSTS 180

Query: 181 KDSQRPNLQKQPSIPSRKSSASSYS---NSYFANSSSQKPSTRKNFGPNNGNGVRIISPL 240
           KD QRPNLQKQPSI S + S++S S   NSYFAN SSQKPS RKN GPNNGNGVR ISP 
Sbjct: 181 KDPQRPNLQKQPSISSSRRSSTSISTSANSYFANLSSQKPSMRKNSGPNNGNGVR-ISPF 240

Query: 241 LNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           LNLPPPYISKVTVSFFGFGSLFCNGK KKK K
Sbjct: 241 LNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 271

BLAST of Tan0021779 vs. NCBI nr
Match: XP_022971829.1 (uncharacterized protein LOC111470497 isoform X2 [Cucurbita maxima])

HSP 1 Score: 412.1 bits (1058), Expect = 3.5e-111
Identity = 225/267 (84.27%), Postives = 234/267 (87.64%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFCIGNLLLQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLS 120
           DLSSADELF NGKILP+EIKKSIEPNREVS+ N S   IP DP R+SVSSEKKSLKELLS
Sbjct: 61  DLSSADELFCNGKILPVEIKKSIEPNREVSKPNQSTPPIPPDPPRSSVSSEKKSLKELLS 120

Query: 121 ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHSKD 180
           ASFDA+EKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSAL          
Sbjct: 121 ASFDAEEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSAL---------- 180

Query: 181 SQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLNLPP 240
               NLQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVRI+SPLLNLPP
Sbjct: 181 ----NLQKQPSISSRKSSVSSCSNSYFANSCSQKPSTRRNFGANNGNGVRIMSPLLNLPP 240

Query: 241 PYISKVTVSFFGFGSLFCNGKNKKKNK 268
           PYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 PYISKVTVSFFGFGSLFCNGKNKKKNK 253

BLAST of Tan0021779 vs. ExPASy TrEMBL
Match: A0A6J1I4A0 (uncharacterized protein LOC111470497 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470497 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 2.3e-116
Identity = 234/275 (85.09%), Postives = 245/275 (89.09%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFCIGNLLLQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLS 120
           DLSSADELF NGKILP+EIKKSIEPNREVS+ N S   IP DP R+SVSSEKKSLKELLS
Sbjct: 61  DLSSADELFCNGKILPVEIKKSIEPNREVSKPNQSTPPIPPDPPRSSVSSEKKSLKELLS 120

Query: 121 ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQ------ 180
           ASFDA+EKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSALN KQ      
Sbjct: 121 ASFDAEEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSALNSKQQSQSQS 180

Query: 181 --QSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRII 240
             QS SKDS+ P+LQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVRI+
Sbjct: 181 QSQSQSKDSRTPSLQKQPSISSRKSSVSSCSNSYFANSCSQKPSTRRNFGANNGNGVRIM 240

Query: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 275

BLAST of Tan0021779 vs. ExPASy TrEMBL
Match: A0A6J1H5M5 (uncharacterized protein LOC111460336 OS=Cucurbita moschata OX=3662 GN=LOC111460336 PE=4 SV=1)

HSP 1 Score: 419.1 bits (1076), Expect = 1.4e-113
Identity = 231/277 (83.39%), Postives = 243/277 (87.73%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLES--DFDFCIGNLL 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLES  DFDFCIGNLL
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFDFCIGNLL 60

Query: 61  MQDLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKEL 120
           +QDLSSADELF NGKILP+EIKKS+EPNREVS+ N+S   IP DP  +SVSSEKKSLKEL
Sbjct: 61  LQDLSSADELFCNGKILPVEIKKSVEPNREVSKPNESTPPIPPDPPTSSVSSEKKSLKEL 120

Query: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK----- 180
           LSASFDA+EKP SKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSALN K     
Sbjct: 121 LSASFDAEEKPHSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSALNSKHQSQS 180

Query: 181 ---QQSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVR 240
               QS SKDS+ PNLQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVR
Sbjct: 181 QSQSQSQSKDSRTPNLQKQPSISSRKSSVSSCSNSYFANSFSQKPSTRRNFGANNGNGVR 240

Query: 241 IISPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           I+SPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 IMSPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 277

BLAST of Tan0021779 vs. ExPASy TrEMBL
Match: A0A6J1FKX8 (uncharacterized protein LOC111445167 OS=Cucurbita moschata OX=3662 GN=LOC111445167 PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 7.7e-112
Identity = 231/272 (84.93%), Postives = 241/272 (88.60%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEIS++GISPRISFSHDLNQ DLLPSSSDCN +RLDLTLLESDFDFCIGNLL Q
Sbjct: 1   MAIEVCSEISTVGISPRISFSHDLNQADLLPSSSDCNRERLDLTLLESDFDFCIGNLLRQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDS--AHLIPEDPSRNSVSSEKKSLKEL 120
           DLSSADELFSNGKILP+EIKKSIEPNREV +   S     +P DP+RNSVSSEKKSLKEL
Sbjct: 61  DLSSADELFSNGKILPVEIKKSIEPNREVLKPIQSPPPPPVPPDPARNSVSSEKKSLKEL 120

Query: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHS 180
           LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK QS S
Sbjct: 121 LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKSQSTS 180

Query: 181 KDSQRPNLQKQPSIPSRKSSASSYS---NSYFANSSSQKPSTRKNFGPNNGNGVRIISPL 240
           KD QRPNLQKQPSI S + S++S S   NSYFAN SSQKPS RKN GPNNGNGVR ISP 
Sbjct: 181 KDPQRPNLQKQPSISSSRRSSTSISTSANSYFANLSSQKPSMRKNSGPNNGNGVR-ISPF 240

Query: 241 LNLPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           LNLPPPYISKVTVSFFGFGSLFCNGK KKK K
Sbjct: 241 LNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 271

BLAST of Tan0021779 vs. ExPASy TrEMBL
Match: A0A6J1I9N3 (uncharacterized protein LOC111470497 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470497 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.7e-111
Identity = 225/267 (84.27%), Postives = 234/267 (87.64%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEISS+GISPRISFSHDLNQ D LPSSSDCN  RLDL+LLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQADSLPSSSDCNRGRLDLSLLESDFDFCIGNLLLQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSVSSEKKSLKELLS 120
           DLSSADELF NGKILP+EIKKSIEPNREVS+ N S   IP DP R+SVSSEKKSLKELLS
Sbjct: 61  DLSSADELFCNGKILPVEIKKSIEPNREVSKPNQSTPPIPPDPPRSSVSSEKKSLKELLS 120

Query: 121 ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHSKD 180
           ASFDA+EKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGSAL          
Sbjct: 121 ASFDAEEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSAL---------- 180

Query: 181 SQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLNLPP 240
               NLQKQPSI SRKSS SS SNSYFANS SQKPSTR+NFG NNGNGVRI+SPLLNLPP
Sbjct: 181 ----NLQKQPSISSRKSSVSSCSNSYFANSCSQKPSTRRNFGANNGNGVRIMSPLLNLPP 240

Query: 241 PYISKVTVSFFGFGSLFCNGKNKKKNK 268
           PYISKVTVSFFGFGSLFCNGKNKKKNK
Sbjct: 241 PYISKVTVSFFGFGSLFCNGKNKKKNK 253

BLAST of Tan0021779 vs. ExPASy TrEMBL
Match: A0A6J1DFL1 (uncharacterized protein LOC111020041 OS=Momordica charantia OX=3673 GN=LOC111020041 PE=4 SV=1)

HSP 1 Score: 411.8 bits (1057), Expect = 2.2e-111
Identity = 229/270 (84.81%), Postives = 246/270 (91.11%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLPSSSDCNGDRLDLTLLESDFDFCIGNLLMQ 60
           M IEVCSEISS+GISPRISFSHDLNQTDLLP SSD + DRLDLTLLESDFDFCIGNLL+Q
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQTDLLP-SSDGSRDRLDLTLLESDFDFCIGNLLIQ 60

Query: 61  DLSSADELFSNGKILPLEIKKSIEPNRE-VSRANDSAHLIPEDPSRNSVSSEKKSLKELL 120
           DLSSADELFSNG+I P++IKKS++ N + V + N+SA + P +PSRNS+SSEKKSLKELL
Sbjct: 61  DLSSADELFSNGRIRPVQIKKSLDANTQLVHKPNESAPIAPPEPSRNSISSEKKSLKELL 120

Query: 121 SASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQQSHSK 180
           SASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS+LNPKQQ +SK
Sbjct: 121 SASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSSLNPKQQPNSK 180

Query: 181 D--SQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLN 240
           D  SQRPNLQKQPSI SRKSSASS+ NSYF NSSSQK STRKNFGPNNGN VR ISPLLN
Sbjct: 181 DSSSQRPNLQKQPSISSRKSSASSFPNSYFTNSSSQKSSTRKNFGPNNGNAVR-ISPLLN 240

Query: 241 LPPPYISKVTVSFFGFGSLFCNGKNKKKNK 268
           LPPPYISKVTVSFFGFGSLFCNGK KKK K
Sbjct: 241 LPPPYISKVTVSFFGFGSLFCNGKIKKKKK 268

BLAST of Tan0021779 vs. TAIR 10
Match: AT1G68330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 155 Blast hits to 147 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 19; Fungi - 3; Plants - 126; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 152.5 bits (384), Expect = 4.7e-37
Identity = 135/285 (47.37%), Postives = 167/285 (58.60%), Query Frame = 0

Query: 1   MTIEV-CSEISSLGISPRISFSHDLNQTDLLPSSSDCNGD-RLDLTLLE--SDFDFCIG- 60
           M I+V CSE S  GISPRISFS+DL+ TD        +G+ RLD TLL+  S+FDFC G 
Sbjct: 1   MAIDVCCSEASGSGISPRISFSYDLDSTD--------DGEVRLDSTLLDSGSEFDFCFGS 60

Query: 61  NLLMQDLSSADELFSNGKILPLEIKKSIE-PNREVSRANDSAHLIPEDPSRNSVSS---- 120
           +  +Q++S ADELFS GKILP++IKK    P     R   SA L     S +S SS    
Sbjct: 61  SCSVQEVSPADELFSEGKILPVQIKKEESLPQTVTFRVPRSASLSSSSSSSSSSSSSSRA 120

Query: 121 --EKKSLKE-LLSASFDADEKPQSKSFWQFKRSSSLNCESSK-SRSLIRSLHFLSRSNST 180
             +K  LKE LL+   D ++KP+   F QFKRS SLN + S+ S+ LIRS HFLSRSNST
Sbjct: 121 PEKKMRLKELLLNPESDFEDKPRG-LFLQFKRSISLNYDKSRNSKGLIRSFHFLSRSNST 180

Query: 181 GSA---LNPKQQSHSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGP 240
            +    L PK+  H   +   NL K      R SS SS S  ++    S+KP  R +FG 
Sbjct: 181 PNPNLDLLPKETHHPHKTH--NLPKHKPPLRRSSSLSSSSVPFY----SKKPLGRNSFG- 240

Query: 241 NNGNGVRIISPLLNLPPP-YISKVTVSFFGFGSLFCNGKNKKKNK 268
            NGNG   +SP+LN PPP +IS V   FF  GSL CNGK   K K
Sbjct: 241 -NGNGGVRVSPVLNFPPPAFISNVADGFFSIGSL-CNGKTNTKTK 267

BLAST of Tan0021779 vs. TAIR 10
Match: AT1G67050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141; Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes - 268 (source: NCBI BLink). )

HSP 1 Score: 128.3 bits (321), Expect = 9.5e-30
Identity = 108/281 (38.43%), Postives = 159/281 (56.58%), Query Frame = 0

Query: 1   MTIEVCSEISSLGISPRISFSHDLNQTDLLP------SSSDCNGDRLDLTLLESDFDFCI 60
           M +++ SE S++  SPRISFS D  Q+D +P       SS+     L+ ++   DFDFCI
Sbjct: 1   MAVDLLSENSNM--SPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSI---DFDFCI 60

Query: 61  ------GNLLMQDLSSADELFSNGKILPLEIKKSIEPNREVSRANDSAHLIPEDPSRNSV 120
                 G    Q   SADELFSNGKILP EIKK  EP ++           P+   +   
Sbjct: 61  PGGVNSGESFDQGSWSADELFSNGKILPTEIKKKPEPGKKEPEPK-PVKSKPDSRKQRKQ 120

Query: 121 SSEKKSLKELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS 180
            +E++   +++      +EK  +KSFW FKRSSSLNC S+  RSL   L  L+RSNSTGS
Sbjct: 121 PNEEQQEDDVI---ITTEEKTNTKSFWGFKRSSSLNCGSTYGRSLC-PLPLLNRSNSTGS 180

Query: 181 ALNPKQQSHS-KDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGN 240
             + ++QS S K ++   LQ+  S+ S  S++SS SN+ F+    +K     ++G + G 
Sbjct: 181 TSSKQKQSSSRKHNEHVKLQQSSSLSSSSSASSSLSNNGFSKPPLKKSYGGYSYGSHGGG 240

Query: 241 GVRIISPLLNLPPPYISKVTVSFFGFGSLFC-NGKNKKKNK 268
           G+R +SP++N+ P      + + FGFGS+F  NG++K K +
Sbjct: 241 GIR-VSPVINVVP------SGNLFGFGSMFSGNGRDKNKKR 264

BLAST of Tan0021779 vs. TAIR 10
Match: AT1G48780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in 11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 79.0 bits (193), Expect = 6.6e-15
Identity = 97/264 (36.74%), Postives = 125/264 (47.35%), Query Frame = 0

Query: 17  RISFSHDLNQTDLLPSS--SDCNGDRLDLTLLE---SDFDFCIGNLL-MQDLSSADELFS 76
           RISFS DL Q+D  P          R D TLL+   SDF+F I N     D S ADE+F+
Sbjct: 9   RISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSSPADEIFA 68

Query: 77  NGKILPLEI-KKSIEPNR----EVSRANDSAHLIPEDPS-RNSVSSEKKSLKELLSASFD 136
           +G ILP  +   S  P R    E+     S    P  P    +  SEK++      A+ D
Sbjct: 69  DGMILPFHVTAASTVPKRLYKYELPPITSSLSPSPLSPQPLPTKHSEKETNGRASGANSD 128

Query: 137 ADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKQ-QSHSKDSQR 196
           ++ +  SKSFW FKRSSSLNC+  K  SLI S   L+RSNSTGS  N K+      ++ R
Sbjct: 129 SEAEKSSKSFWSFKRSSSLNCDIKK--SLICSFPRLTRSNSTGSVTNSKRAMLRDVNNHR 188

Query: 197 PNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNGNGVRIISPLLNLPPPYI 256
           P            SS SS  N+Y      QK + +K      G G   + P+LN P    
Sbjct: 189 P------------SSRSSCCNAY--QFRPQKHTGKK----GEGGGSFSVIPVLNGP---- 243

Query: 257 SKVTVSFFGFGSLFCNGKNKKKNK 268
                S FG GS+  +  +K K K
Sbjct: 249 -----STFGLGSILRHSNSKDKTK 243

BLAST of Tan0021779 vs. TAIR 10
Match: AT3G18300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 70.9 bits (172), Expect = 1.8e-12
Identity = 100/280 (35.71%), Postives = 134/280 (47.86%), Query Frame = 0

Query: 17  RISFSHDLNQTDL-LPSSSDCNGD-RLDLTLLE---SDFDFCI-GNLLMQDLSSADELFS 76
           R SF+ DL Q+D   P     +G  R D TLL+   SDF+F I  N    D S ADE+F+
Sbjct: 10  RFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDSSPADEIFA 69

Query: 77  NGKILPL----EIKKSIEPNR--------EVSRANDSAHL------IPEDPSRNSVSSEK 136
           +G ILP+        S  P R         VS    S++L      +PE   + SV   +
Sbjct: 70  DGMILPVLPFQVTATSTMPKRLYKYELPPIVSAPTLSSYLPPLPLPLPEHSRKYSVKETR 129

Query: 137 KSLKELLS-ASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALN 196
            SL    S A+ D++ +  SKSFW FKRSSSLNC+  K  SLI S   L+RSNSTGS   
Sbjct: 130 GSLNGRGSGANSDSEAEKSSKSFWSFKRSSSLNCDIKK--SLICSFPRLTRSNSTGSVAI 189

Query: 197 PKQQS----HSKDSQRPNLQKQPSIPSRKSSASSYSNSYFANSSSQKPSTRKNFGPNNG- 256
            K++     +   SQR  + +    PS      S   S+  +S   +P  +K+ G N G 
Sbjct: 190 SKREMLRDINKHSSQRHGVPRPGVNPSSHMRPPS---SFCCSSYQFRP--QKHAGKNGGG 249

Query: 257 -NGVRIISPLLNLPPPYISKVTVSFFGFGSLFCNGKNKKK 266
             G   I+P++  P P         FG GS+    K KKK
Sbjct: 250 RGGSFWIAPVIGGPSP---------FGLGSILRLTKEKKK 273

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022971826.14.8e-11685.09uncharacterized protein LOC111470497 isoform X1 [Cucurbita maxima][more]
KAG6601812.12.6e-11485.50hypothetical protein SDJN03_07045, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022959333.12.9e-11383.39uncharacterized protein LOC111460336 [Cucurbita moschata] >KAG7032519.1 hypothet... [more]
XP_022939173.11.6e-11184.93uncharacterized protein LOC111445167 [Cucurbita moschata][more]
XP_022971829.13.5e-11184.27uncharacterized protein LOC111470497 isoform X2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1I4A02.3e-11685.09uncharacterized protein LOC111470497 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1H5M51.4e-11383.39uncharacterized protein LOC111460336 OS=Cucurbita moschata OX=3662 GN=LOC1114603... [more]
A0A6J1FKX87.7e-11284.93uncharacterized protein LOC111445167 OS=Cucurbita moschata OX=3662 GN=LOC1114451... [more]
A0A6J1I9N31.7e-11184.27uncharacterized protein LOC111470497 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1DFL12.2e-11184.81uncharacterized protein LOC111020041 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
Match NameE-valueIdentityDescription
AT1G68330.14.7e-3747.37unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G67050.19.5e-3038.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G48780.16.6e-1536.74unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G18300.11.8e-1235.71unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 88..111
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 164..225
NoneNo IPR availablePANTHERPTHR31722OS06G0675200 PROTEINcoord: 1..266
NoneNo IPR availablePANTHERPTHR31722:SF33COPPER-BINDING PERIPLASMIC PROTEINcoord: 1..266

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021779.1Tan0021779.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071704 organic substance metabolic process
cellular_component GO:0005576 extracellular region
molecular_function GO:0016985 mannan endo-1,4-beta-mannosidase activity