HG10016445 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10016445
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr03: 5096808 .. 5099917 (-)
RNA-Seq Expression: HG10016445
Synteny: HG10016445
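
The Location field of the overview can be unpacked programmatically. The following is a minimal sketch, not part of the database record: the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr03: 5096808 .. 5099917 (-)'."""
    match = re.fullmatch(
        r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Chr03: 5096808 .. 5099917 (-)")
print(loc.chrom, loc.strand, span_length(loc))
```

Under that assumption the gene spans 3110 bp on the minus strand of Chr03.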
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCAACATTCACTACGATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCGAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTCTGACCTTTAACCGTGCTCCAAGCACGAAGTTGGAGAGGAGAAATAGCGCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATAAAGCCAGCACTATATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCGTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCATAAAAAGATGAATGATAAGGATGTAGAAAATGGGACTGGGAAGGGTTCAGATATCAATGATGTAAAATTGACTGAGGGTGCTTCTGTTACTGGTGACATGCCTATTCAGGACAAAGATGGAGACAGATGTCTAGATTGTGCTAGTAGTAGTAATGTTGGTCAAAATGGGAGTGTTGATGGTGATCATGGTGCTACGGCTGTTCAACTTGTGAGCAATCACAGTAATCATGAAAGCAGTATAATGACGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGTCAAATTCAGAAAGTATTGGAGATACTGAAGACTTCTTTGACCCAAACGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTACGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGGTAAACATGACTCTATGAATTGAAAAAGTTATTTCTGATATATTTTTTGATCGCAATACTTTATTCTACTGATTGTGCTCGTCTCTAATTATGCATTTACTGTTCGGTAGTTGGTACGTCATTTTTCATATTTTAAAAATTGTTTTGGACTTTGTAGTTCAAGTCATGCAGGGTTTTCTTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTATTTATTTATTTTTTAAAATTTAAATTCTAGAATCTAGTGTAACTGGCCAGACATTGCATAGCACATGTTTCTTTTGTAGTTACGTTGGATTAACGGATATGTTATAATTAAATTTAGAGATCAGGCTAGTTGAAAGCTTGGACATGAGATGACATGATGAATTTTTTTAATCTAGGAGTTGCCTTTGAACTACTTACATAAGATATGATTGCTAGCTTCAAGTCACAACTTCAATACTAATGACCTAGATTCGTTCTGCAGAGCTTTCTTCTGAGGGCTTGCCACAACCACCTACTACTGAAATTGAAGCTGAATTACGTGAAATGAAACTAACACTACTGATGGAACTAGAGAAACGTAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAGCTATTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCACAGAAGGGAGGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTTATCTTGCCAGGTTCGTGTCAGATTCTATAGGCAGGGGTATAGCAAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATTGCTCGATTGCTGGACCGGCTCCATTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGGTAAGAACTCTGTAAACCTCGACTGGTAGTTATTTCAATCACTCAATGCTGGTGTCTTGACCTCCCCCTTCTCTTTCATTTTTGTTTTATTTTTCTTAAGGAGGTGGCGTTGAGGGCGTTTGTTATATTATGTGTTTAATTCTGTATGTTTTACCACTCTTCCTCTTTGTTTCCTGGACTTCTTACCCACTTGCTAGTTGCCACTGCATCAGATTTTTATCGAACTTCTTTCTAATATATAGAGAGATATAGTTATTGGTTAGGTTACTGTAATGACTAGATTAGAATTTTTGTGCATATAAATAAAGGCTTCCAATTAAGAGGGG
CATGATAGAAGTAATTATTAATTAGGAGGCGTGGTTTTCTAAAGGCTCATTCTTACATGGAAAGTAAGGGATGAAGGCCTGTAATGACTATATTAGAATTTTTGTGCATATAAATAAAGGCTTCCGGTTAAGAGGGGCATGATGGAAATGATTATTAGCTAGGAGGCGTGGTTTTTCATAGGATTATTCTTACATGCAAAGTAAGGGATGAAGGCCTCGGCCAGTTATTGGATTCCATTTAGTTGACATGATGACGTGCTCAAATTTCAATTCTTTCTCATCTACGTAGAAGATGCAACTATTTCAAACAGTGCCCTTGACTAGTGGAGCTTATTTTACCAAAGTTGTAGTCCGTGGTTGACCTGAACATTCAGAAACAATGCAAATGCCGTGAACAGATCAATGAGTACTGTTGTCTACTGTTGACAAATTTCTTATGAGTTATGACCATTATGAATACTCCAATATTGGTTTGAGTGATGTGATACTGTGTGCTTTACAGTTTGTCCCCTCTTATTGCCTTGCAAGTTCATTTGAAATAATGGATCTTGAAGGAATTTAACATCAAAGTATTTGTATGTAAACAGTAAATCTTCAATCGCGTTTAATTTTCCATGGATCCTAGTGCCAGTATGGGGGGTTCTTGTTCCAATTAATAGTGGGTTGTCAAAAATGTGCAAGTTCCCTCAAGCTATATAGCTCATTTTTACGGTCCTTCTCCATTTCGCCTTGGTGGTTTTTTCTTTCATAAATTGTTAGTTCAAGTATTAAAAAAATATATATTCAGACAGCGAGGAATAAAATCTTCTTTTCATATATTTATACCTGGACTCTATGATCTTCCATTTTAAGCTTCAGTCCTCTGTTCTCCTTGGTTATGTGTTGAATGTCTGTACTGCTGGCTAAGCTTTGTCTGTGTTTATTTGGTTGTCAGATTTGGCACGGCGCGAGAGGTTGAGAAGAAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTATCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGATGATGTAACAGATTGA

mRNA sequence

ATGCCAACATTCACTACGATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCGAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTCTGACCTTTAACCGTGCTCCAAGCACGAAGTTGGAGAGGAGAAATAGCGCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATAAAGCCAGCACTATATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCGTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCATAAAAAGATGAATGATAAGGATGTAGAAAATGGGACTGGGAAGGGTTCAGATATCAATGATGTAAAATTGACTGAGGGTGCTTCTGTTACTGGTGACATGCCTATTCAGGACAAAGATGGAGACAGATGTCTAGATTGTGCTAGTAGTAGTAATGTTGGTCAAAATGGGAGTGTTGATGGTGATCATGGTGCTACGGCTGTTCAACTTGTGAGCAATCACAGTAATCATGAAAGCAGTATAATGACGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGTCAAATTCAGAAAGTATTGGAGATACTGAAGACTTCTTTGACCCAAACGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTACGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGAGCTTTCTTCTGAGGGCTTGCCACAACCACCTACTACTGAAATTGAAGCTGAATTACGTGAAATGAAACTAACACTACTGATGGAACTAGAGAAACGTAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAGCTATTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCACAGAAGGGAGGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTTATCTTGCCAGGTTCGTGTCAGATTCTATAGGCAGGGGTATAGCAAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATTGCTCGATTGCTGGACCGGCTCCATTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGATTTGGCACGGCGCGAGAGGTTGAGAAGAAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTATCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGATGATGTAACAGATTGA

Coding sequence (CDS)

ATGCCAACATTCACTACGATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCGAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTCTGACCTTTAACCGTGCTCCAAGCACGAAGTTGGAGAGGAGAAATAGCGCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATAAAGCCAGCACTATATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCGTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCATAAAAAGATGAATGATAAGGATGTAGAAAATGGGACTGGGAAGGGTTCAGATATCAATGATGTAAAATTGACTGAGGGTGCTTCTGTTACTGGTGACATGCCTATTCAGGACAAAGATGGAGACAGATGTCTAGATTGTGCTAGTAGTAGTAATGTTGGTCAAAATGGGAGTGTTGATGGTGATCATGGTGCTACGGCTGTTCAACTTGTGAGCAATCACAGTAATCATGAAAGCAGTATAATGACGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGTCAAATTCAGAAAGTATTGGAGATACTGAAGACTTCTTTGACCCAAACGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTACGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGAGCTTTCTTCTGAGGGCTTGCCACAACCACCTACTACTGAAATTGAAGCTGAATTACGTGAAATGAAACTAACACTACTGATGGAACTAGAGAAACGTAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAGCTATTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCACAGAAGGGAGGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTTATCTTGCCAGGTTCGTGTCAGATTCTATAGGCAGGGGTATAGCAAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATTGCTCGATTGCTGGACCGGCTCCATTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGATTTGGCACGGCGCGAGAGGTTGAGAAGAAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTATCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGATGATGTAACAGATTGA

Protein sequence

MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTGKGSDINDVKLTEGASVTGDMPIQDKDGDRCLDCASSSNVGQNGSVDGDHGATAVQLVSNHSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERSATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQGQWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEVETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGSVATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
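
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is an illustrative sketch only: it assumes the standard genetic code and reading frame 0, and the helper name `translate` is hypothetical. For brevity it is run on the first 30 and last 18 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCAACATTCACTACGATTGCGTTGGAC"  # first 10 codons of the CDS
cds_end = "GATGATGTAACAGATTGA"                # last 6 codons, including TGA
print(translate(cds_start))  # MPTFTTIALD
print(translate(cds_end))    # DDVTD
```

Both fragments agree with the start (MPTFTTIALD) and end (DDVTD) of the protein sequence listed above.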
Homology
BLAST of HG10016445 vs. NCBI nr
Match: XP_038882592.1 (uncharacterized protein LOC120073808 [Benincasa hispida])

HSP 1 Score: 805.1 bits (2078), Expect = 3.2e-229
Identity = 426/458 (93.01%), Positives = 437/458 (95.41%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALT NRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTLNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKMNDNDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           KGSD NDVK TEG+SVT DMPI +KDGDR   DCASSSNV QNGSVDGDHGATAVQLV+N
Sbjct: 121 KGSDSNDVKSTEGSSVTVDMPIPEKDGDRNGPDCASSSNVRQNGSVDGDHGATAVQLVNN 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HSNHES I+ SNGVAREK+SLKVVVSNSESIGDTEDFFDP+DSLSVTSNTDGEDNG+ERS
Sbjct: 181 HSNHESRIVVSNGVAREKNSLKVVVSNSESIGDTEDFFDPHDSLSVTSNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEGLPQP  ++IEAELREMKLTLLMELEKRKQAEEALNKLQG
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLTLLMELEKRKQAEEALNKLQG 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QW RLREQLLLVGLTLPSDP VATEG QLDSDPAEELCQQVYLARFVSDSIGRGIARAEV
Sbjct: 301 QWWRLREQLLLVGLTLPSDPPVATEGNQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNN+KAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNTKAEHDDVTD 458
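
The percentages reported in each HSP header are the match counts divided by the alignment length. The sketch below reproduces the figures for the Benincasa hispida hit above; the function name `percent` is a hypothetical helper, and rounding to two decimals is assumed to match the way the page displays the values.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the XP_038882592.1 hit.
print(percent(426, 458))  # 93.01 (identity)
print(percent(437, 458))  # 95.41 (positives)
```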

BLAST of HG10016445 vs. NCBI nr
Match: TYK12610.1 (uncharacterized protein E5676_scaffold255G001960 [Cucumis melo var. makuwa])

HSP 1 Score: 771.5 bits (1991), Expect = 3.9e-219
Identity = 409/458 (89.30%), Positives = 432/458 (94.32%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           + SD NDVKLTEGASVT   PI DK GDR  LDCASSSN+G+NG VDGDHGATAVQLVS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSI+TS+G+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+PQP  ++IE + REM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of HG10016445 vs. NCBI nr
Match: XP_008440744.1 (PREDICTED: uncharacterized protein LOC103485065 [Cucumis melo] >KAA0036213.1 uncharacterized protein E6C27_scaffold18G00100 [Cucumis melo var. makuwa])

HSP 1 Score: 770.0 bits (1987), Expect = 1.1e-218
Identity = 408/458 (89.08%), Positives = 432/458 (94.32%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           + SD NDVKLTEGASVT   PI DK GDR  LDCASSSN+G+NG VDGDHGATAVQLVS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSI+TS+G+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+PQP  ++IE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of HG10016445 vs. NCBI nr
Match: XP_004143521.1 (uncharacterized protein LOC101222171 [Cucumis sativus] >KGN48848.1 hypothetical protein Csa_002999 [Cucumis sativus])

HSP 1 Score: 756.9 bits (1953), Expect = 1.0e-214
Identity = 403/458 (87.99%), Positives = 428/458 (93.45%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPS+KLERRNS SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSSKLERRNSTSVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KK NDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKKNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           KGSD +DVKLTEGASVT + PI DKDGDR  LDCASSS+VG+NG V GDHGATAVQLVS+
Sbjct: 121 KGSDGSDVKLTEGASVTVNTPIPDKDGDRNGLDCASSSSVGENGCVGGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSIMTSNG+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSIMTSNGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+ QP  ++ E +LREM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVLQPSISDTEPDLREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLR +LLLVGLTLPSDPTVATE +QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLRARLLLVGLTLPSDPTVATEEKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVL WSYLPSGKDLPSSNNSK+EHDDVTD
Sbjct: 421 VATAITLGTAVLTWSYLPSGKDLPSSNNSKSEHDDVTD 455

BLAST of HG10016445 vs. NCBI nr
Match: KAG7034132.1 (hypothetical protein SDJN02_03859 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 730.3 bits (1884), Expect = 1.0e-206
Identity = 397/459 (86.49%), Positives = 412/459 (89.76%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIAL+RLLEPGTS+SVDKSLPKPKP+L  +RAPSTKLERRNS SVADRK+QRPQI
Sbjct: 1   MPTFTTIALERLLEPGTSRSVDKSLPKPKPSLNSDRAPSTKLERRNSPSVADRKIQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SHKKMNDKDVENGT 120
           KPALY TPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV S KKMNDKD+ NG 
Sbjct: 61  KPALYATPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSSRKKMNDKDIGNGN 120

Query: 121 GKGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVS 180
            KG+D NDVKLTEGASV  DMPI   DG R  LDCASSS+VGQNGSVD DHGA  VQL S
Sbjct: 121 VKGTDSNDVKLTEGASVVVDMPI--PDGHRNGLDCASSSHVGQNGSVDDDHGAAGVQLAS 180

Query: 181 NHSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYER 240
           NHSNH      SNGV REKDSLKVVVSNS  +GDTEDFFDP DSLSVTSNTDGEDNG ER
Sbjct: 181 NHSNHG----MSNGVTREKDSLKVVVSNSGGVGDTEDFFDPQDSLSVTSNTDGEDNGIER 240

Query: 241 SATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQ 300
           SA  GTP+GEFYDA E LSSEGLPQP  ++IEAEL EMKLTL MELEKRKQAEE L+K +
Sbjct: 241 SAKIGTPVGEFYDALEALSSEGLPQPCISDIEAELCEMKLTLSMELEKRKQAEEILDKFR 300

Query: 301 GQWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAE 360
           GQWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQVYLARFVSDSIGRGIARAE
Sbjct: 301 GQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVYLARFVSDSIGRGIARAE 360

Query: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
           VETEMEAQLEVKNFEIARLLDRLHYYEA NHEMSQRNQEAVDLARRERLRRKRRQRWIWG
Sbjct: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAANHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420

Query: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKA-EHDDVT 457
           SVATAITLGT VLAWSYLPSGKDLPSSNNSKA EHDD T
Sbjct: 421 SVATAITLGTVVLAWSYLPSGKDLPSSNNSKAVEHDDAT 453

BLAST of HG10016445 vs. ExPASy TrEMBL
Match: A0A5D3CMF0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001960 PE=4 SV=1)

HSP 1 Score: 771.5 bits (1991), Expect = 1.9e-219
Identity = 409/458 (89.30%), Positives = 432/458 (94.32%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           + SD NDVKLTEGASVT   PI DK GDR  LDCASSSN+G+NG VDGDHGATAVQLVS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSI+TS+G+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+PQP  ++IE + REM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of HG10016445 vs. ExPASy TrEMBL
Match: A0A5A7T005 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold18G00100 PE=4 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 5.5e-219
Identity = 408/458 (89.08%), Positives = 432/458 (94.32%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           + SD NDVKLTEGASVT   PI DK GDR  LDCASSSN+G+NG VDGDHGATAVQLVS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSI+TS+G+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+PQP  ++IE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of HG10016445 vs. ExPASy TrEMBL
Match: A0A1S3B1E0 (uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=4 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 5.5e-219
Identity = 408/458 (89.08%), Positives = 432/458 (94.32%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           + SD NDVKLTEGASVT   PI DK GDR  LDCASSSN+G+NG VDGDHGATAVQLVS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSI+TS+G+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+PQP  ++IE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLREQLLLVGLTLPSDPTVATEG+QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of HG10016445 vs. ExPASy TrEMBL
Match: A0A0A0KH17 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G502840 PE=4 SV=1)

HSP 1 Score: 756.9 bits (1953), Expect = 4.9e-215
Identity = 403/458 (87.99%), Positives = 428/458 (93.45%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPS+KLERRNS SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSSKLERRNSTSVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVENGTG 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KK NDKDV NG+ 
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKKNDKDVGNGSV 120

Query: 121 KGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVSN 180
           KGSD +DVKLTEGASVT + PI DKDGDR  LDCASSS+VG+NG V GDHGATAVQLVS+
Sbjct: 121 KGSDGSDVKLTEGASVTVNTPIPDKDGDRNGLDCASSSSVGENGCVGGDHGATAVQLVSS 180

Query: 181 HSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHESSIMTSNG+A+EKDSLK VVSNSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSIMTSNGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 ATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQG 300
           A FGTPMGEFYDAWEELSSEG+ QP  ++ E +LREM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVLQPSISDTEPDLREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360
           QWQRLR +LLLVGLTLPSDPTVATE +QLDSDPAEELCQQV LARFVS+SIG+GIARAEV
Sbjct: 301 QWQRLRARLLLVGLTLPSDPTVATEEKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 458
           VATAITLGTAVL WSYLPSGKDLPSSNNSK+EHDDVTD
Sbjct: 421 VATAITLGTAVLTWSYLPSGKDLPSSNNSKSEHDDVTD 455

BLAST of HG10016445 vs. ExPASy TrEMBL
Match: A0A6J1GDK0 (uncharacterized protein LOC111453212 OS=Cucurbita moschata OX=3662 GN=LOC111453212 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 4.1e-206
Identity = 395/459 (86.06%), Positives = 411/459 (89.54%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTFTTIAL+RLLEPGTS+SVDKSLPKPKP+L  +RAPSTKLERRNS SVADRK+QRPQI
Sbjct: 1   MPTFTTIALERLLEPGTSRSVDKSLPKPKPSLNSDRAPSTKLERRNSPSVADRKIQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SHKKMNDKDVENGT 120
           KPALY TPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV S KKMNDKD+ NG 
Sbjct: 61  KPALYATPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSSRKKMNDKDIGNGN 120

Query: 121 GKGSDINDVKLTEGASVTGDMPIQDKDGDR-CLDCASSSNVGQNGSVDGDHGATAVQLVS 180
            KG+D NDVKLTEGASV  DMPI   DG R  LDCASSS+VGQNGSVD DHGA  VQL S
Sbjct: 121 VKGTDSNDVKLTEGASVVVDMPI--PDGHRNGLDCASSSHVGQNGSVDDDHGAAGVQLAS 180

Query: 181 NHSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDGEDNGYER 240
           NHSNH      SNGV REKDSLKVVVSNS  +GDTEDFFDP DSLSVTSNTDGEDNG ER
Sbjct: 181 NHSNHG----MSNGVTREKDSLKVVVSNSGGVGDTEDFFDPQDSLSVTSNTDGEDNGIER 240

Query: 241 SATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQAEEALNKLQ 300
           SA  GTP+GEFYDA E LSSEGLPQP  ++IEAEL EMKLTL MELEKRKQAEE L+K +
Sbjct: 241 SAKIGTPVGEFYDALEALSSEGLPQPCISDIEAELCEMKLTLSMELEKRKQAEEILDKFR 300

Query: 301 GQWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSIGRGIARAE 360
           GQWQRLRE LLLVGLTLPSDPTVATEG+QLDSDPAEELCQQVYLARFVSDSIGRG+ARAE
Sbjct: 301 GQWQRLRELLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVYLARFVSDSIGRGVARAE 360

Query: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
           VETEMEAQLEVKNFEIARLLDRLHYYEA NHEMSQRNQEAVDLARRERLRRKRRQRWIWG
Sbjct: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAANHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420

Query: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKA-EHDDVT 457
           SVATAITLGT VLAWSYLPSGKDLPSSNNSKA EHDD T
Sbjct: 421 SVATAITLGTVVLAWSYLPSGKDLPSSNNSKAVEHDDAT 453

BLAST of HG10016445 vs. TAIR 10
Match: AT3G50910.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G66480.1); Has 76 Blast hits to 75 proteins in 28 species: Archae - 0; Bacteria - 10; Metazoa - 7; Fungi - 2; Plants - 49; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 367.9 bits (943), Expect = 1.2e-101
Identity = 234/458 (51.09%), Positives = 302/458 (65.94%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
           MPTF+ IALDR+LEPG S SV+ S+P     L +++ P +KLE+       +R V RP +
Sbjct: 1   MPTFSAIALDRMLEPGASTSVE-SVPS-TTNLFYSKPPISKLEKGKGKLPNERTVTRPLM 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRG-PRLLKSFSEDDV---SHKKMNDKDVE 120
            PALY TP+A PLP+SPSSFPPSPYI+NHK RG PRLLKS SE +V   SH+K  +++  
Sbjct: 61  SPALYATPDAIPLPNSPSSFPPSPYIINHKSRGPPRLLKSSSEANVVSSSHQKTLEEETI 120

Query: 121 NGTGKGSDINDVKLT-EGASVTGDMPIQDKDGDRCLDCASSSNVGQ---NGSVDGDHGAT 180
                     DVK++    S +   PI +   D   +   +  VG    +G VDG  G  
Sbjct: 121 TAE------TDVKVSPRRRSTSFSFPITEVTEDDYSNGVHARTVGNYNFDGIVDGPVGNW 180

Query: 181 AVQLVSNHSNHESSI-MTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVTSNTDG 240
           +  L     N +S +   +NG+ R     + V   ++   ++EDF+DP +S S TSNTD 
Sbjct: 181 S-PLDGKSGNGKSELDNAANGLERVNGLSEPVPIKTDKESESEDFYDPGESASFTSNTDV 240

Query: 241 E-DNGYERSATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELEKRKQA 300
           E D G E S    TP+GEFYDAW+ELS++   Q     IE+EL E++L+LLME+EKRKQ 
Sbjct: 241 EGDAGDESSQRLATPVGEFYDAWDELSTDSGMQSSVNNIESELSEIRLSLLMEIEKRKQT 300

Query: 301 EEALNKLQGQWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARFVSDSI 360
           EEAL ++Q  WQRLREQ+  VGL +P DPT +T    L    +EEL  Q+ +ARFVSDS+
Sbjct: 301 EEALEQMQIHWQRLREQMAQVGLFVPIDPTASTNNMNL----SEELRCQLEIARFVSDSL 360

Query: 361 GRGIARAEVETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRK 420
           GRG+A+AEVE EME+ LE KNFEI RL DRLHYYEAVN EMSQRNQEA+++ARRER +RK
Sbjct: 361 GRGMAKAEVEMEMESMLETKNFEITRLSDRLHYYEAVNREMSQRNQEAIEVARRERQKRK 420

Query: 421 RRQRWIWGSVATAITLGTAVLAWSYLPSGKDLPSSNNS 449
           +RQRWIWGS+A  ITLG+A LAWSY+P+ K  PSS  S
Sbjct: 421 KRQRWIWGSIAATITLGSAALAWSYIPAAK--PSSEVS 443

BLAST of HG10016445 vs. TAIR 10
Match: AT5G66480.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50910.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 276.6 bits (706), Expect = 3.7e-74
Identity = 190/454 (41.85%), Positives = 263/454 (57.93%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTS-KSVDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQ 60
           MPTF+  AL R L  GTS  S   S  + KP++  + +   K          ++   RPQ
Sbjct: 1   MPTFSAAALGRSLNSGTSLSSKFPSTLQSKPSILNDESKQPK----------EKTFTRPQ 60

Query: 61  IKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSE-DDVSH-------KKMN 120
           + P+LY T +  P P+SPSS+PPSPYI+NHK RGP L    SE D  SH       K   
Sbjct: 61  MSPSLYATTKEIPHPNSPSSYPPSPYIINHKARGPVLFNRDSEVDGPSHPITSGEEKISG 120

Query: 121 DKDVENGTGKGSDIN-DVKLTEGASV--TGDMPIQDKDGDRCLDCAS--SSNVGQNGSVD 180
           + DVE         +    +TE  +V  T  +  Q        DC+    + + +    D
Sbjct: 121 NVDVEATASLSKSTSLSFPITEAIAVDHTNGVHTQGIHERPVWDCSPPLGTFLNEKSGRD 180

Query: 181 GDHGATAVQLVSNHSNHESSIMTSNGVAREKDSLKVVVSNSESIGDTEDFFDPNDSLSVT 240
             +G       +++   +S ++    +  +K+             + E+F++P + +S T
Sbjct: 181 ISNGGIGSNNATSNLEWQSYLLEPVRIKADKEL------------EPENFYNPGELVSFT 240

Query: 241 SNTDGED-NGYERSATFGTPMGEFYDAWEELSSEGLPQPPTTEIEAELREMKLTLLMELE 300
           SNT+ ED    E S +  T +GEFYDA +ELS++   Q     IE+E+REM+L LLME+E
Sbjct: 241 SNTEVEDFERAESSHSLATHVGEFYDACDELSTDSGMQSSANNIESEVREMRLGLLMEIE 300

Query: 301 KRKQAEEALNKLQGQWQRLREQLLLVGLTLPSDPTVATEGRQLDSDPAEELCQQVYLARF 360
           +R+QAE  L ++Q  W+RLR+QL  VG+ LP DPT +    Q   + A+EL  Q+ + RF
Sbjct: 301 RRRQAEATLEQMQVHWRRLRDQLADVGMFLPLDPTRS----QYSMNLADELRCQLEVTRF 360

Query: 361 VSDSIGRGIARAEVETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRE 420
           VSD++G  +A+ EVE EMEA+LE KNFEI RL DRLHYYE VN EMSQRNQEA+++ARR+
Sbjct: 361 VSDTLGSDLAKTEVEMEMEAELEAKNFEITRLSDRLHYYETVNQEMSQRNQEAIEVARRD 420

Query: 421 RLRRKRRQRWIWGSVATAITLGTAVLAWSYLPSG 440
             +RKRRQRWIWGS+A  ITLG+ VLAWSYLP G
Sbjct: 421 GQKRKRRQRWIWGSIAATITLGSGVLAWSYLPPG 428

The following BLAST results are available for this feature:
BLAST of HG10016445 vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038882592.1    3.2e-229    93.01          uncharacterized protein LOC120073808 [Benincasa hispida]
TYK12610.1        3.9e-219    89.30          uncharacterized protein E5676_scaffold255G001960 [Cucumis melo var. makuwa]
XP_008440744.1    1.1e-218    89.08          PREDICTED: uncharacterized protein LOC103485065 [Cucumis melo] >KAA0036213.1 unc...
XP_004143521.1    1.0e-214    87.99          uncharacterized protein LOC101222171 [Cucumis sativus] >KGN48848.1 hypothetical ...
KAG7034132.1      1.0e-206    86.49          hypothetical protein SDJN02_03859 [Cucurbita argyrosperma subsp. argyrosperma]

BLAST of HG10016445 vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A5D3CMF0        1.9e-219    89.30          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7T005        5.5e-219    89.08          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3B1E0        5.5e-219    89.08          uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=...
A0A0A0KH17        4.9e-215    87.99          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G502840 PE=4 SV=1
A0A6J1GDK0        4.1e-206    86.06          uncharacterized protein LOC111453212 OS=Cucurbita moschata OX=3662 GN=LOC1114532...

BLAST of HG10016445 vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G50910.1       1.2e-101    51.09          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G66480.1       3.7e-74     41.85          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source        Source Term     Source Description                      Alignment
None       No IPR available   COILS         Coil            Coil                                    coord: 280..307
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 212..235
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 212..236
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 95..120
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 15..127
None       No IPR available   PANTHER       PTHR35490       BACTERIOPHAGE N4 ADSORPTION B PROTEIN   coord: 1..454
None       No IPR available   PANTHER       PTHR35490:SF2   BACTERIOPHAGE N4 ADSORPTION B PROTEIN   coord: 1..454

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
HG10016445.1   HG10016445.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016021       integral component of membrane