Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAGATCGGACTTCATCCTTATGTCGCTCTCCTCCTGATTTCCCTCTTCTTCACCAAAAACTCCTTCGTCTCCTCCATTCCTTCTCAAGAGCTTGACGCAATGCTCTCCATTGTCAGAGCTCGAGGCTACAATCTCTTCTCCAACGCCATAACCACTTCCGATCTCCAACTCGATCTCCTCACCGCCGCCGCCAACGCCTCCTTTACCTTCTTCGCCCCTACCGACTCCTCTCTCTTCGCCCTCTCCATGACTCAGTCCGCCTCCGCCTACACCGCTACTCTCCGCTACCACGGCCTCCCTCGCCGTCTCTCTGTTTCCGATCTCAACCGCTTGCCTTCTCAGGTTCCGATCCAAACTCTGCTTCTCTCACAGTATGTTTCTCTCACTCGCCGCCTCCGCGGATCTCGCTCCGACGCCATTTCCGTCAATGGCGTCGATGTTGTTCTCCCTGGATTGTACTACGACCGTCACATTGCGGTCCATGGCTTGGGAGGAATTCTCAGTCTCCATTCTCAGATCGGATTTCCATACGATTCTCCGCCTCCTCTTCCGCTGTTTCGCCCTAGCGCTCCGAGTAGTTTTCCGCCTGAACTCGATCGCGGTTCCAAGGAGAACAGAGATTTCACCGCCCTAGCCGCTGGTCCCTCAATCGGCATCTCCTCCGTATTTCCACCGATTAGTCCGAGTAATTCTTGGAACAAAACTGTTGATCTTCCATTTAATCGTGTATTTCTTGGTCCTGCACCGGAACGATCGCATTTCCAAGTTGAACCGCCGGTAATTTCCTCTGTTTCGCCATCGACGAGTCCTGTACTTCCACCGGGCGTTTCGAATGATTTAACTCCGTCGGCGCAGAATGAGAAGGCGATTACGCCGACGCCATTGATGAGCGACTCGGCAGCGTGGATGATGAAATCGGAAGACGGTTTGAGCGGAAGAACAGTGGAGGAGTATGAGCCGTCGGATTGGATGACGTTTGGATTCATGAAAGGAAATTCAGAAGATGTTGATGTTGAAGATAGTCATCCTCGTCCAAATGTATTGCCGCCTTTTTGA
mRNA sequence
ATGAAGATCGGACTTCATCCTTATGTCGCTCTCCTCCTGATTTCCCTCTTCTTCACCAAAAACTCCTTCGTCTCCTCCATTCCTTCTCAAGAGCTTGACGCAATGCTCTCCATTGTCAGAGCTCGAGGCTACAATCTCTTCTCCAACGCCATAACCACTTCCGATCTCCAACTCGATCTCCTCACCGCCGCCGCCAACGCCTCCTTTACCTTCTTCGCCCCTACCGACTCCTCTCTCTTCGCCCTCTCCATGACTCAGTCCGCCTCCGCCTACACCGCTACTCTCCGCTACCACGGCCTCCCTCGCCGTCTCTCTGTTTCCGATCTCAACCGCTTGCCTTCTCAGGTTCCGATCCAAACTCTGCTTCTCTCACAGTATGTTTCTCTCACTCGCCGCCTCCGCGGATCTCGCTCCGACGCCATTTCCGTCAATGGCGTCGATGTTGTTCTCCCTGGATTGTACTACGACCGTCACATTGCGGTCCATGGCTTGGGAGGAATTCTCAGTCTCCATTCTCAGATCGGATTTCCATACGATTCTCCGCCTCCTCTTCCGCTGTTTCGCCCTAGCGCTCCGAGTAGTTTTCCGCCTGAACTCGATCGCGGTTCCAAGGAGAACAGAGATTTCACCGCCCTAGCCGCTGGTCCCTCAATCGGCATCTCCTCCGTATTTCCACCGATTAGTCCGAGTAATTCTTGGAACAAAACTGTTGATCTTCCATTTAATCGTGTATTTCTTGGTCCTGCACCGGAACGATCGCATTTCCAAGTTGAACCGCCGGTAATTTCCTCTGTTTCGCCATCGACGAGTCCTGTACTTCCACCGGGCGTTTCGAATGATTTAACTCCGTCGGCGCAGAATGAGAAGGCGATTACGCCGACGCCATTGATGAGCGACTCGGCAGCGTGGATGATGAAATCGGAAGACGGTTTGAGCGGAAGAACAGTGGAGGAGTATGAGCCGTCGGATTGGATGACGTTTGGATTCATGAAAGGAAATTCAGAAGATGTTGATGTTGAAGATAGTCATCCTCGTCCAAATGTATTGCCGCCTTTTTGA
Coding sequence (CDS)
ATGAAGATCGGACTTCATCCTTATGTCGCTCTCCTCCTGATTTCCCTCTTCTTCACCAAAAACTCCTTCGTCTCCTCCATTCCTTCTCAAGAGCTTGACGCAATGCTCTCCATTGTCAGAGCTCGAGGCTACAATCTCTTCTCCAACGCCATAACCACTTCCGATCTCCAACTCGATCTCCTCACCGCCGCCGCCAACGCCTCCTTTACCTTCTTCGCCCCTACCGACTCCTCTCTCTTCGCCCTCTCCATGACTCAGTCCGCCTCCGCCTACACCGCTACTCTCCGCTACCACGGCCTCCCTCGCCGTCTCTCTGTTTCCGATCTCAACCGCTTGCCTTCTCAGGTTCCGATCCAAACTCTGCTTCTCTCACAGTATGTTTCTCTCACTCGCCGCCTCCGCGGATCTCGCTCCGACGCCATTTCCGTCAATGGCGTCGATGTTGTTCTCCCTGGATTGTACTACGACCGTCACATTGCGGTCCATGGCTTGGGAGGAATTCTCAGTCTCCATTCTCAGATCGGATTTCCATACGATTCTCCGCCTCCTCTTCCGCTGTTTCGCCCTAGCGCTCCGAGTAGTTTTCCGCCTGAACTCGATCGCGGTTCCAAGGAGAACAGAGATTTCACCGCCCTAGCCGCTGGTCCCTCAATCGGCATCTCCTCCGTATTTCCACCGATTAGTCCGAGTAATTCTTGGAACAAAACTGTTGATCTTCCATTTAATCGTGTATTTCTTGGTCCTGCACCGGAACGATCGCATTTCCAAGTTGAACCGCCGGTAATTTCCTCTGTTTCGCCATCGACGAGTCCTGTACTTCCACCGGGCGTTTCGAATGATTTAACTCCGTCGGCGCAGAATGAGAAGGCGATTACGCCGACGCCATTGATGAGCGACTCGGCAGCGTGGATGATGAAATCGGAAGACGGTTTGAGCGGAAGAACAGTGGAGGAGTATGAGCCGTCGGATTGGATGACGTTTGGATTCATGAAAGGAAATTCAGAAGATGTTGATGTTGAAGATAGTCATCCTCGTCCAAATGTATTGCCGCCTTTTTGA
Protein sequence
MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDLLTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQTLLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDSPPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLPFNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPLMSDSAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNVLPPF
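The protein sequence above corresponds to a translation of the coding sequence. A minimal sketch of that translation, assuming the standard nuclear genetic code and reading frame 0; the helper name `translate` and the use of `X` for unrecognised codons are illustrative choices, not part of the database:

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (an assumption: nuclear code, frame 0).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGAAGATCGGACTTCATCCTTAT"))  # MKIGLHPY
print(translate("CCGCCTTTTTGA"))              # PPF
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, which starts with MKIGLHPY and ends with PPF.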
Homology
BLAST of HG10019034 vs. NCBI nr
Match:
XP_038888494.1 (uncharacterized protein LOC120078327 [Benincasa hispida])
HSP 1 Score: 565.8 bits (1457), Expect = 2.5e-157
Identity = 296/350 (84.57%), Positives = 316/350 (90.29%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
M+IGLH +VA L+ISLFFTK SFVSSIP++EL+AMLSIVRARGYNLFSNAITTSDLQLDL
Sbjct: 7 MRIGLHLFVAFLMISLFFTKTSFVSSIPTEELEAMLSIVRARGYNLFSNAITTSDLQLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAAANASFTFFAPTDSSLFAL+MTQSA+AYTATLRYHGLPRRLSVSD NRLPSQ PIQT
Sbjct: 67 LTAAANASFTFFAPTDSSLFALAMTQSAAAYTATLRYHGLPRRLSVSDFNRLPSQSPIQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYV+LTRRLRGSRSDAISVNGV+VVLPGLYY RH+AVHGLGGILSLHSQIGFPYDS
Sbjct: 127 LLRSQYVTLTRRLRGSRSDAISVNGVNVVLPGLYYSRHVAVHGLGGILSLHSQIGFPYDS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPPLPLFRPSAPSSFPP ENRDFTA AA VFPPISPSNS NKTVDLP
Sbjct: 187 PPPLPLFRPSAPSSFPP-------ENRDFTAPAA------DRVFPPISPSNSSNKTVDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPLMSDS 300
FNRV+LGP PERS+FQVEPPVISSVSPSTSPV+PP +SNDLTPS +NEK ITPTPLMS S
Sbjct: 247 FNRVYLGPPPERSNFQVEPPVISSVSPSTSPVVPPVISNDLTPSTRNEKEITPTPLMSGS 306
Query: 301 AAWMMKSEDGLSGRTV-EEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNVL 350
AAWM+KSEDGL GRTV EEYEP DWM FGF +GNS++VDVEDSHPRPNVL
Sbjct: 307 AAWMIKSEDGLIGRTVEEEYEPLDWMAFGFTEGNSKNVDVEDSHPRPNVL 343
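The percentages in each HSP summary line are the identical (or positively scoring) positions divided by the alignment length. A minimal sketch that extracts the counts from such a line and recomputes the percentages; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the exact line layout used on this page:

```python
import re

# Matches the HSP summary line as printed on this page. The pattern accepts
# both "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return recomputed identity/positive percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identical, length, positive, _ = (int(g) for g in match.groups())
    return {
        "alignment_length": length,
        "identity": round(100 * identical / length, 2),
        "positives": round(100 * positive / length, 2),
    }


line = "Identity = 296/350 (84.57%), Positives = 316/350 (90.29%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For the Benincasa hispida hit this gives 296/350 = 84.57% identity and 316/350 = 90.29% positives, in agreement with the values printed in the summary line.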
BLAST of HG10019034 vs. NCBI nr
Match:
XP_011657526.1 (uncharacterized protein LOC105435867 [Cucumis sativus] >KGN47917.1 hypothetical protein Csa_004125 [Cucumis sativus])
HSP 1 Score: 503.1 bits (1294), Expect = 2.0e-138
Identity = 268/350 (76.57%), Positives = 294/350 (84.00%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
M IGL +VA+L+ISL TK SFVSSIP+QELDAMLS+VRA+GYNLFSNAITTSDL LDL
Sbjct: 7 MMIGLRLHVAILMISLLLTKTSFVSSIPTQELDAMLSVVRAQGYNLFSNAITTSDLYLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAA NASFT FAPTDSSLFA++MTQSASAYTATLRYH LPRR S+ DLNRLPSQVPIQT
Sbjct: 67 LTAAPNASFTLFAPTDSSLFAIAMTQSASAYTATLRYHCLPRRFSLFDLNRLPSQVPIQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVSLTRRLRGS SDAI VNGV++VLPGLYY RH+AVHGL GILSLHSQI FPY S
Sbjct: 127 LLPSQYVSLTRRLRGSSSDAIFVNGVNIVLPGLYYSRHVAVHGLEGILSLHSQIQFPYYS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPL F PSAPSSFPP KENRDFT AA SIGISSVFPP+SPSNS NKT+DLP
Sbjct: 187 LPPLSPFLPSAPSSFPP------KENRDFTGPAADRSIGISSVFPPLSPSNSLNKTIDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPST-SPVLPPGVSNDLTPSAQNEKAITPTPLMSD 300
FN FLGP+PERS+FQ EPPV SSVSPST SP+LPP VSN++TPS TPL S
Sbjct: 247 FNHTFLGPSPERSNFQFEPPVTSSVSPSTISPILPPAVSNEVTPS---------TPLTST 306
Query: 301 SAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNVL 350
SAAWMMK EDGL+G TV+EY+PS+WM FGF + NS++VDVEDSHPRPNVL
Sbjct: 307 SAAWMMKPEDGLNGETVDEYDPSNWMAFGFTEENSKNVDVEDSHPRPNVL 341
BLAST of HG10019034 vs. NCBI nr
Match:
XP_008449424.1 (PREDICTED: fasciclin-like arabinogalactan protein 19 [Cucumis melo] >KAA0057322.1 fasciclin-like arabinogalactan protein 19 [Cucumis melo var. makuwa] >TYK13412.1 fasciclin-like arabinogalactan protein 19 [Cucumis melo var. makuwa])
HSP 1 Score: 498.4 bits (1282), Expect = 4.9e-137
Identity = 266/348 (76.44%), Positives = 291/348 (83.62%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
MKIGL +VA+L+ISL TK SFVSSIP+QELDAMLS+VRA+GYNLFSNAITTSDL LDL
Sbjct: 7 MKIGLRLHVAILVISLLLTKTSFVSSIPTQELDAMLSVVRAQGYNLFSNAITTSDLHLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAA NASFTFFAPTDSSLFAL+MTQSA YTATLRYH LPRRLS+SDLNR PSQV IQT
Sbjct: 67 LTAAPNASFTFFAPTDSSLFALAMTQSAFTYTATLRYHCLPRRLSLSDLNRFPSQV-IQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVSLTRRLRGS SDAI VNG+++VLPGLYY RH+AVHGL GILSLHSQI FPY S
Sbjct: 127 LLPSQYVSLTRRLRGSSSDAIFVNGINIVLPGLYYSRHVAVHGLEGILSLHSQIQFPYYS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPLP FRPSAPSSFPPE ENRDFT AA PSIGISSV PPISPSNS NKT+DLP
Sbjct: 187 LPPLPPFRPSAPSSFPPE------ENRDFTGPAADPSIGISSVSPPISPSNSLNKTIDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPLMSDS 300
FN +FLGP+P RS+FQ EPPV+SSVSPSTSP+ PP VSN++TPS P MS S
Sbjct: 247 FNHIFLGPSPGRSNFQFEPPVMSSVSPSTSPIHPPAVSNEVTPS---------MPSMSTS 306
Query: 301 AAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNV 349
AAWM K EDGLSG TVEEYEP +W FGF + NS+++DVEDSHPRPNV
Sbjct: 307 AAWMTKPEDGLSGETVEEYEPLNWTPFGFTEENSKNIDVEDSHPRPNV 338
BLAST of HG10019034 vs. NCBI nr
Match:
KAG6588822.1 (Fasciclin-like arabinogalactan protein 19, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 491.1 bits (1263), Expect = 7.9e-135
Identity = 261/354 (73.73%), Positives = 289/354 (81.64%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
+KIG+H YVALL++S FF KNS V+SIP+QELDAMLSIVRARGYNLFSNAITTSDLQLDL
Sbjct: 7 IKIGIHGYVALLMLSFFFNKNSIVTSIPTQELDAMLSIVRARGYNLFSNAITTSDLQLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
L+AA NASFT FAPTDSSLFAL+MTQSAS YTATLRYHGLPRRLSVSDLN LPSQV I T
Sbjct: 67 LSAADNASFTLFAPTDSSLFALAMTQSASVYTATLRYHGLPRRLSVSDLNSLPSQVAIPT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVS+TR RGSRSDAISVNG++VVLPGLYY RH+ VHGLGGILSLHSQ G+ Y S
Sbjct: 127 LLRSQYVSVTRCHRGSRSDAISVNGINVVLPGLYYGRHVVVHGLGGILSLHSQNGYTYGS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPPL FRPS P SFPP++D GS ENRDFT A SI S V PPISPS+S NKTVDLP
Sbjct: 187 PPPLSRFRPSGPRSFPPQIDLGSNENRDFTVSPADRSIDESPVSPPISPSSSSNKTVDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPL---- 300
FNRVFLGPA E+ +F VE P IS SPS SP PP +S+DLTP+ QNE +TPTP
Sbjct: 247 FNRVFLGPASEQPNFPVESPAIS--SPSVSPGYPPTISSDLTPTMQNENTVTPTPTPLIS 306
Query: 301 ---MSDSAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPN 348
SDS MMKSED ++GRTVEEYEP DWM+ GF + NS+D+DVEDSHPRPN
Sbjct: 307 GEEASDSGR-MMKSEDDMTGRTVEEYEPLDWMSLGFGERNSDDIDVEDSHPRPN 357
BLAST of HG10019034 vs. NCBI nr
Match:
XP_022927853.1 (uncharacterized protein LOC111434622 [Cucurbita moschata])
HSP 1 Score: 491.1 bits (1263), Expect = 7.9e-135
Identity = 262/353 (74.22%), Positives = 288/353 (81.59%), Query Frame = 0
Query: 2 KIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDLL 61
KIGLH YVALL++S FF KNS V+SIP+QELDAMLSIVRARGYNLFSNAITTSDLQLDLL
Sbjct: 8 KIGLHGYVALLMLSFFFNKNSIVTSIPTQELDAMLSIVRARGYNLFSNAITTSDLQLDLL 67
Query: 62 TAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQTL 121
+AA NASFT FAPTDSSLFAL+MTQSAS YTATLRYHGLPRRLSVSDLN LPSQV I TL
Sbjct: 68 SAADNASFTLFAPTDSSLFALAMTQSASVYTATLRYHGLPRRLSVSDLNSLPSQVAIPTL 127
Query: 122 LLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDSP 181
L SQYVS+TR RGSRSDAISVNG++VVLPGLYY RH+ VHGLGGILSLHSQ G+ Y SP
Sbjct: 128 LRSQYVSVTRCHRGSRSDAISVNGINVVLPGLYYGRHVVVHGLGGILSLHSQNGYTYGSP 187
Query: 182 PPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLPF 241
PPL FRPS P SFPP++D GS ENRDFT A SI S V PPISPS+S NKTVDLPF
Sbjct: 188 PPLSRFRPSGPRSFPPQIDLGSNENRDFTVSPADRSIEESPVSPPISPSSSSNKTVDLPF 247
Query: 242 NRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPL----- 301
NRVFLGPA E+ +F VE P IS SPS SP PP +S+DLTP+ QNE +TPTP
Sbjct: 248 NRVFLGPASEQPNFPVESPAIS--SPSVSPGYPPTISSDLTPTMQNENTVTPTPTPLISG 307
Query: 302 --MSDSAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPN 348
SDS MMKSED ++GRTVEEYEP DWM+ GF + NS+D+DVEDSHPRPN
Sbjct: 308 EEASDSGR-MMKSEDDMTGRTVEEYEPLDWMSLGFGERNSDDIDVEDSHPRPN 357
BLAST of HG10019034 vs. ExPASy Swiss-Prot
Match:
Q5Q0H2 (Fasciclin-like arabinogalactan protein 19 OS=Arabidopsis thaliana OX=3702 GN=FLA19 PE=2 SV=2)
HSP 1 Score: 120.6 bits (301), Expect = 3.7e-26
Identity = 79/192 (41.15%), Positives = 108/192 (56.25%), Query Frame = 0
Query: 25 SSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDLLTAAANASFTFFAPTDSSLFALSM 84
+ +P +EL+ ++I+R RG LF+NAI TSDL DLL ++ S T FAPTDS LF L M
Sbjct: 28 TGVPLEELERAIAILRVRGRALFANAIITSDLLFDLL---SDESLTLFAPTDSMLFDLDM 87
Query: 85 TQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQTLLLSQYVSLTRRLRGSRSDAISVN 144
T S Y +TLR H +P RLS+S L LP+ + TLL S + LT+ S +D+I ++
Sbjct: 88 THSLPFYVSTLRLHSVPLRLSLSGLRSLPNSSSLPTLLPSHRLLLTK--HSSSNDSIFLD 147
Query: 145 GVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDSPPPLPLFRPSAPSSFPPELDRGSK 204
GV +++PGL+ +HIAVHGL + LPL PS+P+
Sbjct: 148 GVQLLIPGLFDGQHIAVHGLADL----------------LPLTAPSSPNRLV-------- 188
Query: 205 ENRDFTALAAGP 217
D TALA P
Sbjct: 208 --EDSTALAKSP 188
BLAST of HG10019034 vs. ExPASy TrEMBL
Match:
A0A0A0KHQ8 (FAS1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G411210 PE=3 SV=1)
HSP 1 Score: 503.1 bits (1294), Expect = 9.7e-139
Identity = 268/350 (76.57%), Positives = 294/350 (84.00%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
M IGL +VA+L+ISL TK SFVSSIP+QELDAMLS+VRA+GYNLFSNAITTSDL LDL
Sbjct: 7 MMIGLRLHVAILMISLLLTKTSFVSSIPTQELDAMLSVVRAQGYNLFSNAITTSDLYLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAA NASFT FAPTDSSLFA++MTQSASAYTATLRYH LPRR S+ DLNRLPSQVPIQT
Sbjct: 67 LTAAPNASFTLFAPTDSSLFAIAMTQSASAYTATLRYHCLPRRFSLFDLNRLPSQVPIQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVSLTRRLRGS SDAI VNGV++VLPGLYY RH+AVHGL GILSLHSQI FPY S
Sbjct: 127 LLPSQYVSLTRRLRGSSSDAIFVNGVNIVLPGLYYSRHVAVHGLEGILSLHSQIQFPYYS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPL F PSAPSSFPP KENRDFT AA SIGISSVFPP+SPSNS NKT+DLP
Sbjct: 187 LPPLSPFLPSAPSSFPP------KENRDFTGPAADRSIGISSVFPPLSPSNSLNKTIDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPST-SPVLPPGVSNDLTPSAQNEKAITPTPLMSD 300
FN FLGP+PERS+FQ EPPV SSVSPST SP+LPP VSN++TPS TPL S
Sbjct: 247 FNHTFLGPSPERSNFQFEPPVTSSVSPSTISPILPPAVSNEVTPS---------TPLTST 306
Query: 301 SAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNVL 350
SAAWMMK EDGL+G TV+EY+PS+WM FGF + NS++VDVEDSHPRPNVL
Sbjct: 307 SAAWMMKPEDGLNGETVDEYDPSNWMAFGFTEENSKNVDVEDSHPRPNVL 341
BLAST of HG10019034 vs. ExPASy TrEMBL
Match:
A0A5A7UN88 (Fasciclin-like arabinogalactan protein 19 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold496G00280 PE=3 SV=1)
HSP 1 Score: 498.4 bits (1282), Expect = 2.4e-137
Identity = 266/348 (76.44%), Positives = 291/348 (83.62%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
MKIGL +VA+L+ISL TK SFVSSIP+QELDAMLS+VRA+GYNLFSNAITTSDL LDL
Sbjct: 7 MKIGLRLHVAILVISLLLTKTSFVSSIPTQELDAMLSVVRAQGYNLFSNAITTSDLHLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAA NASFTFFAPTDSSLFAL+MTQSA YTATLRYH LPRRLS+SDLNR PSQV IQT
Sbjct: 67 LTAAPNASFTFFAPTDSSLFALAMTQSAFTYTATLRYHCLPRRLSLSDLNRFPSQV-IQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVSLTRRLRGS SDAI VNG+++VLPGLYY RH+AVHGL GILSLHSQI FPY S
Sbjct: 127 LLPSQYVSLTRRLRGSSSDAIFVNGINIVLPGLYYSRHVAVHGLEGILSLHSQIQFPYYS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPLP FRPSAPSSFPPE ENRDFT AA PSIGISSV PPISPSNS NKT+DLP
Sbjct: 187 LPPLPPFRPSAPSSFPPE------ENRDFTGPAADPSIGISSVSPPISPSNSLNKTIDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPLMSDS 300
FN +FLGP+P RS+FQ EPPV+SSVSPSTSP+ PP VSN++TPS P MS S
Sbjct: 247 FNHIFLGPSPGRSNFQFEPPVMSSVSPSTSPIHPPAVSNEVTPS---------MPSMSTS 306
Query: 301 AAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNV 349
AAWM K EDGLSG TVEEYEP +W FGF + NS+++DVEDSHPRPNV
Sbjct: 307 AAWMTKPEDGLSGETVEEYEPLNWTPFGFTEENSKNIDVEDSHPRPNV 338
BLAST of HG10019034 vs. ExPASy TrEMBL
Match:
A0A1S3BM06 (fasciclin-like arabinogalactan protein 19 OS=Cucumis melo OX=3656 GN=LOC103491315 PE=3 SV=1)
HSP 1 Score: 498.4 bits (1282), Expect = 2.4e-137
Identity = 266/348 (76.44%), Positives = 291/348 (83.62%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
MKIGL +VA+L+ISL TK SFVSSIP+QELDAMLS+VRA+GYNLFSNAITTSDL LDL
Sbjct: 7 MKIGLRLHVAILVISLLLTKTSFVSSIPTQELDAMLSVVRAQGYNLFSNAITTSDLHLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
LTAA NASFTFFAPTDSSLFAL+MTQSA YTATLRYH LPRRLS+SDLNR PSQV IQT
Sbjct: 67 LTAAPNASFTFFAPTDSSLFALAMTQSAFTYTATLRYHCLPRRLSLSDLNRFPSQV-IQT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVSLTRRLRGS SDAI VNG+++VLPGLYY RH+AVHGL GILSLHSQI FPY S
Sbjct: 127 LLPSQYVSLTRRLRGSSSDAIFVNGINIVLPGLYYSRHVAVHGLEGILSLHSQIQFPYYS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPLP FRPSAPSSFPPE ENRDFT AA PSIGISSV PPISPSNS NKT+DLP
Sbjct: 187 LPPLPPFRPSAPSSFPPE------ENRDFTGPAADPSIGISSVSPPISPSNSLNKTIDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPLMSDS 300
FN +FLGP+P RS+FQ EPPV+SSVSPSTSP+ PP VSN++TPS P MS S
Sbjct: 247 FNHIFLGPSPGRSNFQFEPPVMSSVSPSTSPIHPPAVSNEVTPS---------MPSMSTS 306
Query: 301 AAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPNV 349
AAWM K EDGLSG TVEEYEP +W FGF + NS+++DVEDSHPRPNV
Sbjct: 307 AAWMTKPEDGLSGETVEEYEPLNWTPFGFTEENSKNIDVEDSHPRPNV 338
BLAST of HG10019034 vs. ExPASy TrEMBL
Match:
A0A6J1EM74 (uncharacterized protein LOC111434622 OS=Cucurbita moschata OX=3662 GN=LOC111434622 PE=3 SV=1)
HSP 1 Score: 491.1 bits (1263), Expect = 3.8e-135
Identity = 262/353 (74.22%), Positives = 288/353 (81.59%), Query Frame = 0
Query: 2 KIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDLL 61
KIGLH YVALL++S FF KNS V+SIP+QELDAMLSIVRARGYNLFSNAITTSDLQLDLL
Sbjct: 8 KIGLHGYVALLMLSFFFNKNSIVTSIPTQELDAMLSIVRARGYNLFSNAITTSDLQLDLL 67
Query: 62 TAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQTL 121
+AA NASFT FAPTDSSLFAL+MTQSAS YTATLRYHGLPRRLSVSDLN LPSQV I TL
Sbjct: 68 SAADNASFTLFAPTDSSLFALAMTQSASVYTATLRYHGLPRRLSVSDLNSLPSQVAIPTL 127
Query: 122 LLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDSP 181
L SQYVS+TR RGSRSDAISVNG++VVLPGLYY RH+ VHGLGGILSLHSQ G+ Y SP
Sbjct: 128 LRSQYVSVTRCHRGSRSDAISVNGINVVLPGLYYGRHVVVHGLGGILSLHSQNGYTYGSP 187
Query: 182 PPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLPF 241
PPL FRPS P SFPP++D GS ENRDFT A SI S V PPISPS+S NKTVDLPF
Sbjct: 188 PPLSRFRPSGPRSFPPQIDLGSNENRDFTVSPADRSIEESPVSPPISPSSSSNKTVDLPF 247
Query: 242 NRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPL----- 301
NRVFLGPA E+ +F VE P IS SPS SP PP +S+DLTP+ QNE +TPTP
Sbjct: 248 NRVFLGPASEQPNFPVESPAIS--SPSVSPGYPPTISSDLTPTMQNENTVTPTPTPLISG 307
Query: 302 --MSDSAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPN 348
SDS MMKSED ++GRTVEEYEP DWM+ GF + NS+D+DVEDSHPRPN
Sbjct: 308 EEASDSGR-MMKSEDDMTGRTVEEYEPLDWMSLGFGERNSDDIDVEDSHPRPN 357
BLAST of HG10019034 vs. ExPASy TrEMBL
Match:
A0A6J1JR15 (uncharacterized protein LOC111486689 OS=Cucurbita maxima OX=3661 GN=LOC111486689 PE=3 SV=1)
HSP 1 Score: 489.2 bits (1258), Expect = 1.5e-134
Identity = 261/354 (73.73%), Positives = 287/354 (81.07%), Query Frame = 0
Query: 1 MKIGLHPYVALLLISLFFTKNSFVSSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDL 60
+KIGLH YVALL++S FF KNS V+SIP+QELDAMLSIVRARGYNLFSNAITTSDLQLDL
Sbjct: 7 IKIGLHGYVALLMLSFFFNKNSIVTSIPTQELDAMLSIVRARGYNLFSNAITTSDLQLDL 66
Query: 61 LTAAANASFTFFAPTDSSLFALSMTQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQT 120
L AA NASFT FAPTDSSLFAL+MTQSAS YTATLRYHGLPRRLSVSDLN LPSQV I T
Sbjct: 67 LAAADNASFTLFAPTDSSLFALAMTQSASVYTATLRYHGLPRRLSVSDLNSLPSQVAIPT 126
Query: 121 LLLSQYVSLTRRLRGSRSDAISVNGVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDS 180
LL SQYVS+TR RGSRSDAISVNG++VVLPGLYY RH+ VHGLGGILSLHSQ G+ Y S
Sbjct: 127 LLRSQYVSVTRCHRGSRSDAISVNGINVVLPGLYYGRHVVVHGLGGILSLHSQNGYTYGS 186
Query: 181 PPPLPLFRPSAPSSFPPELDRGSKENRDFTALAAGPSIGISSVFPPISPSNSWNKTVDLP 240
PPPL FRPS P SFPP++D GS ENRDFT A SI S V PPISPS+ NKTVDLP
Sbjct: 187 PPPLSRFRPSGPRSFPPQIDLGSNENRDFTVSPADRSIDQSPVSPPISPSSFSNKTVDLP 246
Query: 241 FNRVFLGPAPERSHFQVEPPVISSVSPSTSPVLPPGVSNDLTPSAQNEKAITPTPL---- 300
FNRVFLGPA E+ +F VE P IS SPS SP PP +S+DLTP+ QNE +TPTP
Sbjct: 247 FNRVFLGPASEQPNFPVESPAIS--SPSVSPGYPPTISSDLTPTMQNENTVTPTPTPLIS 306
Query: 301 ---MSDSAAWMMKSEDGLSGRTVEEYEPSDWMTFGFMKGNSEDVDVEDSHPRPN 348
SDS MMKSED ++GRTVEEYEP DWM+ GF + NS+D+DVEDSHPRPN
Sbjct: 307 GEEASDSGR-MMKSEDDMTGRTVEEYEPLDWMSLGFGERNSDDIDVEDSHPRPN 357
BLAST of HG10019034 vs. TAIR 10
Match:
AT1G15190.1 (Fasciclin-like arabinogalactan family protein )
HSP 1 Score: 120.6 bits (301), Expect = 2.6e-27
Identity = 79/192 (41.15%), Positives = 108/192 (56.25%), Query Frame = 0
Query: 25 SSIPSQELDAMLSIVRARGYNLFSNAITTSDLQLDLLTAAANASFTFFAPTDSSLFALSM 84
+ +P +EL+ ++I+R RG LF+NAI TSDL DLL ++ S T FAPTDS LF L M
Sbjct: 28 TGVPLEELERAIAILRVRGRALFANAIITSDLLFDLL---SDESLTLFAPTDSMLFDLDM 87
Query: 85 TQSASAYTATLRYHGLPRRLSVSDLNRLPSQVPIQTLLLSQYVSLTRRLRGSRSDAISVN 144
T S Y +TLR H +P RLS+S L LP+ + TLL S + LT+ S +D+I ++
Sbjct: 88 THSLPFYVSTLRLHSVPLRLSLSGLRSLPNSSSLPTLLPSHRLLLTK--HSSSNDSIFLD 147
Query: 145 GVDVVLPGLYYDRHIAVHGLGGILSLHSQIGFPYDSPPPLPLFRPSAPSSFPPELDRGSK 204
GV +++PGL+ +HIAVHGL + LPL PS+P+
Sbjct: 148 GVQLLIPGLFDGQHIAVHGLADL----------------LPLTAPSSPNRLV-------- 188
Query: 205 ENRDFTALAAGP 217
D TALA P
Sbjct: 208 --EDSTALAKSP 188
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038888494.1 | 2.5e-157 | 84.57 | uncharacterized protein LOC120078327 [Benincasa hispida]
XP_011657526.1 | 2.0e-138 | 76.57 | uncharacterized protein LOC105435867 [Cucumis sativus] >KGN47917.1 hypothetical ...
XP_008449424.1 | 4.9e-137 | 76.44 | PREDICTED: fasciclin-like arabinogalactan protein 19 [Cucumis melo] >KAA0057322....
KAG6588822.1 | 7.9e-135 | 73.73 | Fasciclin-like arabinogalactan protein 19, partial [Cucurbita argyrosperma subsp...
XP_022927853.1 | 7.9e-135 | 74.22 | uncharacterized protein LOC111434622 [Cucurbita moschata]
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q5Q0H2 | 3.7e-26 | 41.15 | Fasciclin-like arabinogalactan protein 19 OS=Arabidopsis thaliana OX=3702 GN=FLA...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KHQ8 | 9.7e-139 | 76.57 | FAS1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G411210 PE=3 S...
A0A5A7UN88 | 2.4e-137 | 76.44 | Fasciclin-like arabinogalactan protein 19 OS=Cucumis melo var. makuwa OX=1194695...
A0A1S3BM06 | 2.4e-137 | 76.44 | fasciclin-like arabinogalactan protein 19 OS=Cucumis melo OX=3656 GN=LOC10349131...
A0A6J1EM74 | 3.8e-135 | 74.22 | uncharacterized protein LOC111434622 OS=Cucurbita moschata OX=3662 GN=LOC1114346...
A0A6J1JR15 | 1.5e-134 | 73.73 | uncharacterized protein LOC111486689 OS=Cucurbita maxima OX=3661 GN=LOC111486689...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G15190.1 | 2.6e-27 | 41.15 | Fasciclin-like arabinogalactan family protein