HG10001104 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001104
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: PAP_fibrillin domain-containing protein
Location: Chr09: 14031550 .. 14041930 (+)
RNA-Seq Expression: HG10001104
Synteny: HG10001104
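
The Location field above can be turned into a gene span with a few lines of Python. This is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr09: 14031550 .. 14041930 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the span covers exons and introns together, so it should be compared with the gene sequence rather than with the mRNA or CDS.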
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAAATCAATTGATTACTAACCTCGGACGAGCAAGCAACGGTGGCCGGAAAAGCGCCGGGAGCGGAAAGTCCTTCGAAGTTGGTGAGGAAGAACGATTTAGGGCTGAGAAGAATTGTGTTATTAAATCCAGGGGCAATTTTGGTATTGTAACAGGTGAATGGAGCTTATAGATCAACAAAAGGCCACGCGTATGAACTAAATTTAGGGGCGAATTGGGGGGCAATTTTGGTATTTCTTGCTCGATACTCCGTTTGTACTTTGTAAACGTTGTGAGACCGCGTATAGAACAACAAAAGGCAGCTGATGGGGTGTGGTGTAGGAGAATACCGCCACTTGTTGCCCGTGGCTACATGCTCATGTCAACCTTTTTTTATTTTTTTTTAATATACTTTCATTTTATATATTTTATAAATTAATTTGCGTTTAATTATGTTAGTGTTATTCCTCGTGACTTTGCATGTACGAGGTATCACACTAAATATTTCATTTTCTGTATTCCATTTCTATGAGATAATCTCAAGTGTCGCATAAACTCTTTGCGAGTTTCAAACAAGTGTGATTCCCAACGTCAAAACTACTTGCAACCACACTAAAAAAATGTTAGGAATCTTTTTTATTAGGGTTAAAGATCAATATTATCAAATGGAGTTCTACTCTTGATTTTTCAACATGTCTCCTCAAGATGGTGCTTTTTTTAGGTTCATCATTCTTGATCGGATCACAATTTCTTTTTATTGGACCAAATACCTCTAGAGTTTTATGGGCTATGATACCATTTTAAATCGTCGATTGACCTAAAAACTTAAGTTGATGGGTAAAGACAAATTTAATATTATATCATCTAATAGTAAAAGTAGTGTTGATTTGTTGTCATGTTGCGAGAATTTTGGAAGTCGATGGTCTTCTGGAGTTGAATGAATGATCACTTTTTGTAGTACATCAATTGTGAGGATCTCTAAGAGAAAAATCATGTCAATTACGGTTGAATTAAACTTATTTTAACAAGAATAATGTAGAACTTAAGGAGTTAGGGAGTGTCAATACTTTTCTTACCAAATTTCATATCAGAATTTATTCTAAGAGGAAATGCATAAATTTTGATGTTTACCTCATGAGACATTTTTATTTTGACTATTAAATTTTATACTCATGAGTACAATTAGCATGTTCAGATCTAATCTTTTAAAAAAAATTATGGATTAACTGTTCTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAGATTAATTAGCCGAAGTTATTTTTTGGATTTTTTTTGGGGGAAAAGATTTCAACTCAAATTCTCATCCAAAAGAAGAAAAAAATAATAAAAAGAAATGTCAATTCTCACCATAGCCTATTGCTCCACGTGTCGGAACGTAATTGAACCAGCTCAGATTTCTCCTTCACAAATGCAAATCAATAAACAGTTGCTTACACTGCCACCACTACCACACAGCTTCTACTTGTCCAAAGATAACCAAAAATCTGGGGCGTAGCTCAGCTTGCTCTGTTTTTGCCTTTCTCCCATAATGGCTCTTAAAGCTAATGCCATAACGCCGACATATCTTCACCGCTCATTCTTCACTCCACAAACTCCATTTCTCCCTCCGATCAAACGGTACCATCGCCACACACACATTCGTCTCCGTTGTCGATTTTCTCTTGTCGATGAGCAGCAGAAAGAAGTCGTCTCGTTCTCTGAACCCGAAAATTCGCTCATTGAGGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCGTAATTTTTTTTTTTTGGCCTTCTCTTCGTTCTTTTGTGCTATGTTTATCTCAGTCCAGATTCAGAGTTTGAACTTCTTTTAGTCTACTGATTTCTTCAGAATGTCGAACGGGCTGTGAGTGTTCTGGAAAGTTTGGAAGGCGTGCGAGATCCGGTAATCCGGTTGGCTTTCCCTATCAATTCTTAATTGTCTAGTTTTAACTGCACAATTA
ACTGTACTGTCAAGTGATTTCTTGAAACGGTGATGAATAAGTCCACCAACTGTTTGAAATAAAGCCCGCGACGATTTTTTTTTTTGTCACTTTGTTGGGAAAAAAATACTAGAAACAGGATTTAAGTAACCTCTGGAAAATTATCTTTGTGCGAGCCAGTGGCTCAATTTTACAACCTTGTTCTGTTCTGTTCTGTTAGTTACTTTTTGTTGTATTATGGTTAAATGCACTTTGTTGGAATAATACTTGCTTACACACTAATAGACTATCTAATATGATTGATTTCTATTGCATTTAAACATGTCTTACTTATATTTTCAGCCATTACAAAATAAACTGCTTAGAAAATTCTGTGGGAATTGATCATTGTATTGGTAAGTATTGAATTTATACTGGCTGTTATTCAGAAAACCAGTTAATTTGCAGGCTTTATACGTTCACCAATTTTGTATTTCTCTCTTGTGCTCTGGAATAGAAAGGAACTGGCTTGTGGATGTACGAATACCTATACATGGTATTCAGAACTTTGGGTCCTTGGTATCCTTGATTACTTTCAGTAGCTACTTTGACCGATTTAATCATTCAAGTTTACTAGTTCTCAAATCATTAAAAATTGGTATGAGAGCTTTTCTGAGTTGATTTTGCCAGTGCTTGATAAGTGATATGCTCACTGGGGATAGGAACCACTGGATTTTAGATGGAGAAGTTTGATTGAAAGTGAGAATTGTCACTCTTTAGAAGAAAATCAAGGCACTCTAGATACATCTGACAGTTATAGTTGTGATTAAAGGTGAAATAAAGTTCTCCATCAGGTTGAAATGAAGAGCAAAAGATTGAGAAACTAGACACCAGTAAAGCGTTGGGAGAAGATCAGTTAAGAACAATTATAAGATGGCAGGAAAGGAGGGTTCAAACCCAGCTCCGGCTGGTGGTAAGGTTTCGAGTGCTTCAATGCTTGAGAGGAAACAACAAAGAAGAAAGAAATTCCAGGTGGAGATGCAATCATTGTATACTATAGGATGACTATGTCTGCACTTGTACAGATTTGCGTGTAACAATTAGCAAGGATATCGATAACCAATGGATTTTAGACTAGGGATGCATTTTCCCTATATCTCACAATAGAAGTTGACCTGAAAACTTTGAGAAATGAGATGGAGGACCAGAAGAGGGTATACCTGGAACAAAGAAACATATCATGAGAGGACCAAACGCTTCACCTGGATCTTTTCTCGTAAGGGAAATGGTATCTCTGTGAAAGGTTGTGTTAATTTCTGAGATTCATGCAATGGATAATCCCTGATTACTCAACCAATCTCTGGTACTAAGTCTAAAAAATGACCAGACTAGATACAATTTGCTAATTTCCAGCCTTTCTAAGCAACTTTGTTGGTAAGTCCAGCAGTCCCTTTCTTGAAAGCTGATGAGTGAAGATCAAATGGTCTGCAGATGAAGATTTGTGAGAGCTAGTGATAGCCTTTTGTTAGTTTTTTCTGTTATCTACTGGTTAAGAGCATCTTAATGTTGTTTGTTTTTTAGTTAGTTGGTGAGTAAGAAAGGGAACTATAGGTAGAATGCCAGGGTGACTGATTATTACCTCCCTCTCTGTTTAATATCCTCTCGTTTGAGCTTTGGAATAGAAAGGAATTGGCTTTCATGGTGGATGTAGGAGTATGTATGATACTTGAACCCCCATAATTCCAGGCGTTCTTAATTACTCTCTGCTTACTTCTTTAATTGCTTTAGTTTTACAAGTTTATTAGTTTGTCAGTGCAGTCTTCTTAAGAAATTGGTATGCTTCAAAGTCCTCTTTTGATTAGTCAATATTAAAGCAGCAAGTTGTTGTTACGTTATGTGGTGATCCGGAGACATTCTTGATTTATTTATCCATTTTATAGTGATTCAGAAATTCATCCTTAGTATTATGTCTTCAACATGAATAAGAATTAATCAAGATAACTGGCGTCTATTTTACACGTTTAAAGTTTTGTCTTTGGTCA
TCCTCACTTCTCTCTTTAACTGCCTGCAGACGAGCTCTAGCTTAATTGAAGGACGGTGGCAGTTGGTGTTCACAACTAGACCTGGAACAGCGTCCATAATACAGGTTCAATTTCAATTCACGACAACGTGCTCCAACAATAGCAAATCCCTTTTCTCATATTATGATCCTGTACCTAAACCATTATTAAATCATAGTTCAGAAGTCAGACTCCAAATTTCATAGTATTTTGATTAAAATTAGGTGAAAAGTCAGCTGAAAACAAATCAATTAAAATAACAAATATGACTATTTCATGTAATTACTGTGATTATCTCCTACGCCATAGACAACAAAAAGGATCGTGATTATCTTGTTGCTGAAGAAATTAGCGCATGATAAATGGATACATTTAAAAGAAGACTCTCTCTTATCTCCACTCACTCACTTTTTCTAGCTTTGCTCACTTTCCTCGTATGTGGGACACTCCCTATCTATTAGCAAAAGATACTTAGCCCAAGGAAAGAAATTTGTTAGACCCTTCATGAATATAAAACATCTGAGATGCTTCTTTTGAGGTCTATCCTTATCTTGAGGAATTTTAGAATTCTTTGAAGTCCAAGAAACAAAGTCTTTTAAAATATGTATTTACAAGATAAATCTAGAAAGGACTAATATATATTAAAGTCATTTAGTTCGTGTTGTAGGTTTTTTTTAACAAAGGATTTTGAAAAGGTTTTGTTGCTTTATCTCAGTAACTAATTCTTAGAAATTAAGTGCAAATTCTTATAAGATTGCACAGAACGAAAATATAATGAGTTTGATATATAATACTCTGAAGCAATGATATGGACAAGAAATGGTAAAATCTTGTAACTCTTCTCCTTCAGAGAACATTTGTTGGAGTTGATTTCTTCAGCGTGTTTCAAGAGATATTTCTACGGACAAACGACCCACGGGTCTCCAACATTGTTAAATTCTCTGATGCAATCGGAGAGCTGAAAGTAGAGGTATGTTTGTATATCTTTTAATAATTGGTTAACAATCACTCTTGCTTTCTAATGTTATTCTCCCCTTATAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATTCTTTTCCAATTTGACACAGCAGCATTTTCTTTCAAGTTCTTTCCTTTTAAGGTTCCATATCCTGTACCATTCAGACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCGGGGAACCTCCGCATCTCAAGAGGAAACAAGGTAAATTAGTTCACTTTCCTCGGTTACTCAAACATTTCTTACATACTTGAAACTTCTCGAGTTGATTTGGCAATTGCTAAGGACATACAGAACCACTGTAATAGTGTAGGGCACCACGTTTGTGTTGCAAAAGCAAACGGAAGTGAGGCAAAACTTGCTGCTAGCAATTTCTACAGGTAAAGGAATTGAAGGGGTAAGTCTACTTTCCACATACACTCATCATGGAGAAAATTTCAACTTCATTATATTCTCCTACGGCTGATTTCTTGAGTACTTGAACTACTACAGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGGAAGTAAATTTGAAGAGGAGCTTCTCGAGGGGGACTGGAATATGCTATGGAGTTCACAGGTAGTCAATGCATTCATATTTTCGTATGCTGAACAAAATAAAATAGGTTACCAGATTATCAAAAGAAAAGGGTGTTAAAATGAGGACAGGCTGAATAAGTAATTCAATAAAGTATAAGACATTATCCATTAAGGCCATGTGACATGAGATGTCGAGTAGTAGTAATTCAAGTTTAACATACTTAAGATGAGTAGCAGCAGATTATAGTCTCGACAGAAAATTGTGATAATTTTGAAACCCATTTAGCTTCATATCCCTCTCAAAAAGCAATCTCTTGTACAATTTCATCTGGCTACCCTTGACATGCATTCTGCCGTAAATGAAGGAACTTTTAACAACTCCTAACCCACATGAAAGTTTGTCTGAAA
TCACACCAACCCTCTAACCAAGTTTAGTCCTGGGCTTCATTCTTCGTGGGAAAGAAAGAGCATGTGGGGTTCTTCTTCTTCTTCTTCTTCTTCTTAGAGCAAGGGGAAGATTGTTCCTATTTCATAAGAAGCTACTTATCGATTACTGTAAATTGCAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGGTAGAGTAAATATGATAGCCCGCTGTATTAAACTTTGATGATGAAATAACCTTCATTTATAAAGTAAACTAATCACAATCATTTACTAGTCACATGCATAAAGTACACTCTTATACTACAGATTATCAAGAATGGGCAAATGAAGTTTCGAGTGGATATGTTGCTTGGACTGAGATTTTCCATGATTGGTACTCTTGTGTAAGTTCCCCAATCATATAGTTCATTTAATTGCGTAGGAAAGCTCTCAGAAAGAGTTTGGAAGGACAAAATCCAACAGGCAAATACATAACATGATAACGTTCCCATTTGTTATTGGGTCTTTGATTTATCTCTGTGGCAGAAAATCTGGAGACGACACATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCGTTTGGATATCCTTTGGAAATGGAGAGCCGGTTTAAGCTTCAACTTCTGTAAGATATCTCCATATCTGGTTGTTCCTTCATTTGGCTACTACACGGTTGTTTAACCTGTCAGATTATTTTCAGATACAATGATGGGAAGATAAGAATCACACGAGGATACAATAACATTTTATTTGTTCATCTACGGGTCGGCGAAACAAAGCAGGCGTAGGAAGACTTGTCAAAAAGAGGAGGTCCTTGAGGTGAGAAACCCATGTATAGTAGTAAGAGATGCCCTCGGCGTCTCTACAATATATACTCCATCAAATTGAAGCTATATAATAAAAATCTTAGAACGTCAAAATTTCACCCAAATCATTTCATACACTCTTTCTCAAGTTCTATTAATTAGTTAGGAAGAAAATATAATCTTCAACCTGTGTATCTCATACCAAGATTTGAGTTTTATTTCTAATTTATACTAAATCATTCCTTCCGTTGATGGTAATTCGTATTATTTTTTACCCATTCTCAATATCGGACCACTCTCTTCACAATTACCTGGCTGTCAAGTAGAAAATTTGAGAAATAATGAGCTTGTAAGGAAAAGAAGAGAGAATGCAACTTCTTAATAAATAACTACAAATCAATGAATCAGAAAAGAATTCCGATATCAAACCTTCGATCTCCATAAAGAGAGGGTATATGTACTGACAAAAAGTTGCTATTTGTTACAACTTACAGTAATGTGTACCTGATGGAAGGCTTACATCTATGGAGACATCCATATGTACTAAAAAAAAGCACTTGAATACCGAAGCACGGACCAAAGTTTATGCCCATCCACTTTTGAAAAGGCACAAACAGTTGAGTTCACATCATTCTTCTTGGGATGGACCCTAAAAAAAGCTCATACAAATGATACAAAACTTGACTCTTAAAGGATAGGCTGCCACCTCGAAAATGAAAACAATTGATGCTTGTCTGCTAAGGCTTTCCATTAACTGCTCATGGGAATAAATATACGTCGAACATATAGCAACGGCAAGAACCGAAGTCGGCAAATTGCCTTTTCAAATTTACTCTTCTTGGAAAGACATCTGGAGCAATTAAATTCCTGAAAATATAGAGATTTTAGCATGGAAAGTGGCAAATAATATTCTACCATACAAAATAAATCTTTATCGAAAAATGCCCAATATCACCAACGAACCTGAGATAACAACACATGCTATCCGAAAATGCAAATATTACAAACAAGGTTTGGGGAGCTTTGTTGAAAAGGGTATTTCATATGTAAAGTTCATGTAGATTTATAAATGATGTAGAAGACACATTAAATGCTAGAATATGAAACATATACGATAAACCTGAAACTATGCTAAATTATAACTTCCATATACTTTCAG
GAATGCATAAATCATAATGAAGTAACACTACTAAAATTCCGACTATAGATGAAATTGAGTCTTTTTGTTTCTTATTATATATAAAAATAAAATTAAAAAAAAAAAAAAAAAAAAAAGAAGAAGAAGAAAGAAAGAAAAATGAAATTGAGTCTTGAAGACTCAAATTTTCAGCCAAAAATAAACAAATTGAACGGAACATAATTTTATTTAAGAGAAGTAGAAAAGTTCATAAGAATAAATTCGACTTCAAGTAACCATACCTATCTTGATCTCCCACTTTGATGACATTATATTTGTGTCATTGAAGTTCACTACCTTATCCTCTTGCAAAGACTCTAATGGGTACCTTTAGCATTTTTTGCCCTTTCCTTCCAACACAGAAGAACATGTAAACTTCTGCTGCCACTTCATATACATATGGTAAGAAAGGGAGAGAGAAATAGCTATCAATATCAAGAATCACTTCTCTTTATCCTTCAGAGCAAGGCATAATTTCGAACCAAATCAAATATGAGATTTTTATGTTACAAAAGAAACCATAAAAAACTAAAGCTTTCAAAATTGCAAGATAGTAATAGAGACAAAAAAGAAGAGAGTTATGATCATCAACCTTCTTTTTCCAAGAAAAGGTATCCAGATACTTTATAGGAAGGGAAAGATAATAACAATGGAACATAGAATTCAATTGACTTTAGGTCTAATGTCAAATACCTTGCCTCACAAATGTCAGAATTCTCTCTGTCAAAGAGAAGTAGGACCAACGTTGCCTACTGACTTCCGACGCCAACAATTCAAGCACAGTATTAAATCAACATAACAACGAACTATAACAAGGTCAATGAACTAGAAGAACTGACAAAGCCAAGCCATCAAAAAAGGAAAAGAAATAGAATTGACACAGACTATGTGCGTGCAAGTTATGAGAGGTATATTTCTTAATAAATAAAGAAAAATCATAGGGAATTATCAACAGGTCAAATCAAAGGCTTAAATTGATGGTTTATCAGAGCTTCCTATAAATCAGGATAAGTTCAGTTGCAACATAATCAGCTTATTTCTGGTTCATAGTGGAAAGGTTCATAATTTTTTTTAATCTTCAATCATCTTTAGTAAATTAAGAGTAACCAAATTACAAGAATTTTAGTGATGGATTAGACAACTAGTCTTCACCATTGAACCATAGCTCTCACAACATCAACATCGTCATAATGTGCTAACACCATCACAACACTCATTGTGTTGCCTTGGCTCAACTCATCATGACCAAGGGGGAGAAAAAAAAACCCAGAGAAATCTGTTAATTCATCATTCCCTCTCTCTTAGCCACGACCGCATCACACACCTTCACATTCTCTCACCACACGGACAATTACACCCCCTGCACCAAAGAAATTTTACAGAAATAAATGAAGTATATTGTTAGTAGACACTGCGTTAAAGTGGAAATTGATTTGTCAAAGTCAACTAAAACAAAGAACATGGCAACTTGACTTCTCTGCTGTCATCTTACACTAATGGCTTGTTACCGGAATGAGATCAAAAATGCAAAATTGGGAAAGAAACACTAGAGAGAAAAGGGAGTTCATCTTCAATCTTAGTACCAACGAAGGGAAAATCAAGACGATTCAAGCAAAAGAATGCATGGGTTTGTAAATTAGTATTTCAGCATAAGGATTGCCAACATCTACTGTGCAATCTTGGAGGCCGTTCTATTAACCAAAGACATGTTACAAATGTTACAACCAAAACACCAAAATTAAAAACGGTAAGTATACAATTCATTAAGACTGAACCGAATTTGTTCAGCTTCACCATTAAAGGAAAATATTTGCTCGAGGAGCATTTGATTCAAATTTTAAACACTTCGTGATTATATGGATAGCAACTAGGCAGCTATACAAGGCAATTACAATGATAATAAAGATATAAATGGACAATGTTATAAAACGAACACAATACAATAATATAATACAACTTTGAGAAAGCAGCTATGTTGATGT
GAGAAAACGCCTCATTGCATCTAAGAGACCCAACTACGAAGATGGACATCCTTAAAGGCCCACATTGTTTTGATTAATCTTTTTCTTCATAAGGAAAGTATATTTTCTACTCAATCGAAAAATCTTTCAATTTTTTTTTTTTTTTTTTTTTTGAATTTTGGAAACTTAAAGTATTTAACAAGTAGAAGACTTTCTTCTGCAGTAATTGAAACTCGCACGGTATATGCTACTTTTGTCAGACAAATTTCAATAAGTACAGAGACAGGAGCTTCGGAATTGAACCTTAACACAACGGGAAGTTGTTTCAATGGCCGCAACTTCAACCTCAATGTCAAGAAGGTGGGGCTCTTCAGTGGCCGCGACTTTGAAGGTCAGGATTGA

mRNA sequence

ATGAAAAATCAATTGATTACTAACCTCGGACGAGCAAGCAACGGTGGCCGGAAAAGCGCCGGGAGCGGAAAGTCCTTCGAAGTTGGTGAGGAAGAACGATTTAGGGCTGAGAAGAATTGTGTTATTAAATCCAGGGGCAATTTTGGTATTGTAACAGCTAATGCCATAACGCCGACATATCTTCACCGCTCATTCTTCACTCCACAAACTCCATTTCTCCCTCCGATCAAACGGTACCATCGCCACACACACATTCGTCTCCGTTGTCGATTTTCTCTTGTCGATGAGCAGCAGAAAGAAGTCGTCTCGTTCTCTGAACCCGAAAATTCGCTCATTGAGGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCAATGTCGAACGGGCTGTGAGTGTTCTGGAAAGTTTGGAAGGCGTGCGAGATCCGACGAGCTCTAGCTTAATTGAAGGACGGTGGCAGTTGGTGTTCACAACTAGACCTGGAACAGCGTCCATAATACAGAGAACATTTGTTGGAGTTGATTTCTTCAGCGTGTTTCAAGAGATATTTCTACGGACAAACGACCCACGGGTCTCCAACATTGTTAAATTCTCTGATGCAATCGGAGAGCTGAAAGTAGAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATTCTTTTCCAATTTGACACAGCAGCATTTTCTTTCAAGTTCTTTCCTTTTAAGGTTCCATATCCTGTACCATTCAGACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCGGGGAACCTCCGCATCTCAAGAGGAAACAAGGGCACCACGTTTGTGTTGCAAAAGCAAACGGAAGTGAGGCAAAACTTGCTGCTAGCAATTTCTACAGGTAAAGGAATTGAAGGGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGGAAGTAAATTTGAAGAGGAGCTTCTCGAGGGGGACTGGAATATGCTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGATTATCAAGAATGGGCAAATGAAGTTTCGAGTGGATATGTTGCTTGGACTGAGATTTTCCATGATTGGTACTCTTGTAAAATCTGGAGACGACACATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCGTTTGGATATCCTTTGGAAATGGAGAGCCGGTTTAAGCTTCAACTTCTACAAATTTCAATAAGTACAGAGACAGGAGCTTCGGAATTGAACCTTAACACAACGGGAAGTTGTTTCAATGGCCGCAACTTCAACCTCAATGTCAAGAAGGTGGGGCTCTTCAGTGGCCGCGACTTTGAAGGTCAGGATTGA

Coding sequence (CDS)

ATGAAAAATCAATTGATTACTAACCTCGGACGAGCAAGCAACGGTGGCCGGAAAAGCGCCGGGAGCGGAAAGTCCTTCGAAGTTGGTGAGGAAGAACGATTTAGGGCTGAGAAGAATTGTGTTATTAAATCCAGGGGCAATTTTGGTATTGTAACAGCTAATGCCATAACGCCGACATATCTTCACCGCTCATTCTTCACTCCACAAACTCCATTTCTCCCTCCGATCAAACGGTACCATCGCCACACACACATTCGTCTCCGTTGTCGATTTTCTCTTGTCGATGAGCAGCAGAAAGAAGTCGTCTCGTTCTCTGAACCCGAAAATTCGCTCATTGAGGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCAATGTCGAACGGGCTGTGAGTGTTCTGGAAAGTTTGGAAGGCGTGCGAGATCCGACGAGCTCTAGCTTAATTGAAGGACGGTGGCAGTTGGTGTTCACAACTAGACCTGGAACAGCGTCCATAATACAGAGAACATTTGTTGGAGTTGATTTCTTCAGCGTGTTTCAAGAGATATTTCTACGGACAAACGACCCACGGGTCTCCAACATTGTTAAATTCTCTGATGCAATCGGAGAGCTGAAAGTAGAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATTCTTTTCCAATTTGACACAGCAGCATTTTCTTTCAAGTTCTTTCCTTTTAAGGTTCCATATCCTGTACCATTCAGACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCGGGGAACCTCCGCATCTCAAGAGGAAACAAGGGCACCACGTTTGTGTTGCAAAAGCAAACGGAAGTGAGGCAAAACTTGCTGCTAGCAATTTCTACAGGTAAAGGAATTGAAGGGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGGAAGTAAATTTGAAGAGGAGCTTCTCGAGGGGGACTGGAATATGCTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGATTATCAAGAATGGGCAAATGAAGTTTCGAGTGGATATGTTGCTTGGACTGAGATTTTCCATGATTGGTACTCTTGTAAAATCTGGAGACGACACATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCGTTTGGATATCCTTTGGAAATGGAGAGCCGGTTTAAGCTTCAACTTCTACAAATTTCAATAAGTACAGAGACAGGAGCTTCGGAATTGAACCTTAACACAACGGGAAGTTGTTTCAATGGCCGCAACTTCAACCTCAATGTCAAGAAGGTGGGGCTCTTCAGTGGCCGCGACTTTGAAGGTCAGGATTGA

Protein sequence

MKNQLITNLGRASNGGRKSAGSGKSFEVGEEERFRAEKNCVIKSRGNFGIVTANAITPTYLHRSFFTPQTPFLPPIKRYHRHTHIRLRCRFSLVDEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDTAAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVRQNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESRFKLQLLQISISTETGASELNLNTTGSCFNGRNFNLNVKKVGLFSGRDFEGQD
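
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state; the helper name `translate` is hypothetical. It translates only the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS shown above.
print(translate("ATGAAAAATCAATTGATTACTAACCTCGGA"))
```

The printed peptide can be compared with the start of the protein sequence (MKNQLITNLG...). The same function should apply to the full CDS if the standard code is indeed the right one for this annotation.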
Homology
BLAST of HG10001104 vs. NCBI nr
Match: XP_008449765.1 (PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Cucumis melo] >KAA0041357.1 putative plastid-lipid-associated protein 12 [Cucumis melo var. makuwa] >TYK21659.1 putative plastid-lipid-associated protein 12 [Cucumis melo var. makuwa])

HSP 1 Score: 674.5 bits (1739), Expect = 6.6e-190
Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANA+TPTYLHRSFFTP TP LPPIKRYHR HTHIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           I+ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILFQFDR 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIST KG+E AIDKLISENQN +KFEEELLEG WNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQ+IKNGQMKF VDMLLGLRFSMIG+LVKSGD+TYDVTMDDAAIIGGPFGYPL MESR
Sbjct: 305 MGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLGMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370
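
The percentages on each HSP summary line are the match count divided by the alignment length. The sketch below recomputes them from the text of a summary line. It is illustrative only: the regular expression and the function name `check_hsp_line` are assumptions made for this example, not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 342/366 (93.44%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Recompute each percentage and flag whether it agrees with the report."""
    results = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(100.0 * int(num) / int(den), 2)
        agrees = abs(recomputed - float(reported)) < 0.005
        results[label] = (recomputed, agrees)
    return results

line = "Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0"
for label, (value, agrees) in check_hsp_line(line).items():
    print(label, value, agrees)
```

The "Query Frame" field has no fraction, so the pattern skips it.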

BLAST of HG10001104 vs. NCBI nr
Match: XP_004142141.1 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucumis sativus] >KGN54108.1 hypothetical protein Csa_018101 [Cucumis sativus])

HSP 1 Score: 674.1 bits (1738), Expect = 8.7e-190
Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANA+TPTYLHRSFFTPQTP LPPIKRYHR HTHIRLRCR SLVDEQQKEVVSFS+PENSL
Sbjct: 43  ANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSL 102

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           I+ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 103 IDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 162

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 163 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDR 222

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 223 AAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 282

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           Q LLLAIST KG+E AIDKLISENQN +KFEEELLEG WNMLWSSQMETDSWIENAANGL
Sbjct: 283 QKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGL 342

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQ+IKNGQMKF VDMLLGLRFSMIGTLVKSGD+ YDVTMDDAAIIGGPFGYPL MESR
Sbjct: 343 MGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESR 402

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 403 FKLQLL 408

BLAST of HG10001104 vs. NCBI nr
Match: XP_038902721.1 (probable plastid-lipid-associated protein 12, chloroplastic [Benincasa hispida])

HSP 1 Score: 673.7 bits (1737), Expect = 1.1e-189
Identity = 340/365 (93.15%), Positives = 351/365 (96.16%), Query Frame = 0

Query: 54  NAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSLI 113
           NAITPTYLH SFFTP TP LPPIKR+HR HTHIRLRCR SLVDEQQKEVVSFSEPENSLI
Sbjct: 6   NAITPTYLHLSFFTPLTPSLPPIKRFHRHHTHIRLRCRSSLVDEQQKEVVSFSEPENSLI 65

Query: 114 EALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTASI 173
           +ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT SSLIEGRWQLVFTTRPGTASI
Sbjct: 66  QALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTDSSLIEGRWQLVFTTRPGTASI 125

Query: 174 IQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDTA 233
           IQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAAS+KDGKRILFQFD A
Sbjct: 126 IQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASIKDGKRILFQFDRA 185

Query: 234 AFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVRQ 293
           AFSFKF PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE RQ
Sbjct: 186 AFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQ 245

Query: 294 NLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGLM 353
           NLLLAISTGKG+E AIDKLISEN+N +KFEEEL+EG+WNMLWSSQMETDSWIENAANGLM
Sbjct: 246 NLLLAISTGKGVEEAIDKLISENRNENKFEEELVEGEWNMLWSSQMETDSWIENAANGLM 305

Query: 354 GMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESRF 413
           GMQ+IK+GQMK+RVDMLLGLRFSM GTLVKSGDD YDVTMDDAAIIGGPFGYPLEMESRF
Sbjct: 306 GMQVIKSGQMKYRVDMLLGLRFSMTGTLVKSGDDIYDVTMDDAAIIGGPFGYPLEMESRF 365

Query: 414 KLQLL 418
           KLQLL
Sbjct: 366 KLQLL 370

BLAST of HG10001104 vs. NCBI nr
Match: XP_022987045.1 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 667.2 bits (1720), Expect = 1.1e-187
Identity = 337/366 (92.08%), Positives = 349/366 (95.36%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHRH-THIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANAI PTYLHRSFFTPQ P LPPIKRYHR+ THIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           IEALIGVQGRGR+VSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IEALIGVQGRGRAVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDK 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIS GK +E AIDKL+SE +N +KF++ELLEGDWNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISAGKRVEEAIDKLVSEYRNENKFQQELLEGDWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQIIKNGQMKFRVDMLLG+RFSM GT VKS DDTYDV+MDDAAIIGGPFGYP+EMESR
Sbjct: 305 MGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVEMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. NCBI nr
Match: KAG6570366.1 (putative plastid-lipid-associated protein 12, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 667.2 bits (1720), Expect = 1.1e-187
Identity = 338/366 (92.35%), Positives = 349/366 (95.36%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHRH-THIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANAI PTYLHRSFFTPQ P LPPIKRYHR+ THIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           IEALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IEALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDK 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKRTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIS GK +E AIDKLISE +N +KF++ELLEGDWNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISAGKRVEEAIDKLISEYRNENKFQQELLEGDWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQIIKNGQMKFRVDMLLG+RFSM GT VKS DDTYDV+MDDAAIIGGPFGYP+EMESR
Sbjct: 305 MGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVEMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. ExPASy Swiss-Prot
Match: Q8LAP6 (Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PAP12 PE=1 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 1.9e-123
Identity = 221/335 (65.97%), Positives = 268/335 (80.00%), Query Frame = 0

Query: 85  IRLRCRFSLVDEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLE 144
           +R+ C  S     Q +  SF++ E  LI+ALIG+QGRG+S S +QL++VE AV VLE LE
Sbjct: 52  LRISCSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKVLEGLE 111

Query: 145 GVRDPTSSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLR-TNDPRVSNIVK 204
           G+++PT S LIEGRW+L+FTTRPGTAS IQRTF GVD F+VFQ+++L+ TNDPRVSNIVK
Sbjct: 112 GIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVK 171

Query: 205 FSDAIGELKVEAAASVKDGKRILFQFDTAAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTT 264
           FSD IGELKVEA AS+KDGKR+LF+FD AAF  KF PFKVPYPVPFRLLGDEAKGWLDTT
Sbjct: 172 FSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTT 231

Query: 265 YLSPSGNLRISRGNKGTTFVLQKQTEVRQNLLLAISTGKGIEGAIDKLISENQNGSKFEE 324
           YLSPSGNLRISRGNKGTTFVLQK+T  RQ LL  IS  KG+  AID+ ++ N N ++   
Sbjct: 232 YLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNY 291

Query: 325 ELLEGDWNMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFRVDMLLGLRFSMIGTLVK 384
           ELLEG W M+WSSQM TDSWIENAANGLMG QII K+G++KF V+++   RFSM G  +K
Sbjct: 292 ELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIK 351

Query: 385 SGDDTYDVTMDDAAIIGGPFGYPLEMESRFKLQLL 418
           S   TYD+ MDDAAIIGG FGYP+++ +  +L++L
Sbjct: 352 SESSTYDLKMDDAAIIGGAFGYPVDITNNIELKIL 386

BLAST of HG10001104 vs. ExPASy Swiss-Prot
Match: Q94KU6 (Plastid lipid-associated protein 2, chloroplastic OS=Brassica campestris OX=3711 GN=PAP2 PE=1 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 5.0e-07
Identity = 60/237 (25.32%), Positives = 101/237 (42.62%), Query Frame = 0

Query: 93  LVDEQQKEVVSFSEP-ENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTS 152
           L+ E+  E V  +E  + SL+++L G   RG S SS+  + +   ++ LES      PT 
Sbjct: 80  LMAEEAIESVEETEVLKRSLVDSLYGTD-RGLSASSETRAEIGDLITQLESKNPTPAPTD 139

Query: 153 S-SLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSD--AI 212
           +  L+ G+W L +T+  G   ++ R  V +       +  + +++  V N V F+   A 
Sbjct: 140 ALFLLNGKWILAYTSFVGLFPLLSRGIVPLVKVDEISQT-IDSDNFTVENSVLFAGPLAT 199

Query: 213 GELKVEAAASVKDGKRILFQFDTAAFS-------------FKFFPFKVPY---------- 272
             +   A   ++  KR+  +F+                   +F   K+            
Sbjct: 200 TSISTNAKFEIRSPKRVQIKFEEGVIGTPQLTDSIEIPEYVEFLGQKIDLTPIRGLLTSV 259

Query: 273 ---------------PVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 288
                          P+ F L GD A+ WL TTYL    ++RISRG+ G+ FVL K+
Sbjct: 260 QDTATSVARTISSQPPLKFSLPGDSAQSWLLTTYLDK--DIRISRGDGGSVFVLIKE 312

BLAST of HG10001104 vs. ExPASy Swiss-Prot
Match: O49629 (Probable plastid-lipid-associated protein 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PAP2 PE=1 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 3.3e-06
Identity = 54/234 (23.08%), Positives = 97/234 (41.45%), Query Frame = 0

Query: 95  DEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSS-S 154
           +E  ++V      + SL+++L G   RG S SS+  + +   ++ LES      PT +  
Sbjct: 74  EEAIEDVEETERLKRSLVDSLYGTD-RGLSASSETRAEIGDLITQLESKNPTPAPTEALF 133

Query: 155 LIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIG--EL 214
           L+ G+W L +T+      ++ R  V +       +  + +++  V N V+F+  +G   +
Sbjct: 134 LLNGKWILAYTSFVNLFPLLSRGIVPLIKVDEISQT-IDSDNFTVQNSVRFAGPLGTNSI 193

Query: 215 KVEAAASVKDGKRILFQFDTAAFSFKFF--PFKVPY------------------------ 274
              A   ++  KR+  +F+             ++P                         
Sbjct: 194 STNAKFEIRSPKRVQIKFEQGVIGTPQLTDSIEIPEYVEVLGQKIDLNPIRGLLTSVQDT 253

Query: 275 ------------PVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 288
                       P+ F L  D A+ WL TTYL    ++RISRG+ G+ FVL K+
Sbjct: 254 ASSVARTISSQPPLKFSLPADNAQSWLLTTYLDK--DIRISRGDGGSVFVLIKE 303

BLAST of HG10001104 vs. ExPASy Swiss-Prot
Match: Q94FZ9 (Plastid lipid-associated protein 1, chloroplastic OS=Brassica campestris OX=3711 GN=PAP1 PE=1 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 9.5e-06
Identity = 62/243 (25.51%), Positives = 96/243 (39.51%), Query Frame = 0

Query: 92  SLVDEQQKEVVSFSEPENSLIEALIGV---QGRGRSVSSQQLSNVERAVSVLESLEGVRD 151
           S+ ++  +E +  +E    L   L G      RG S SS+  + +   ++ LES      
Sbjct: 84  SVAEKVAEEAIESAEETERLKRVLAGSLYGTDRGLSASSETRAEISELITQLESKNPNPA 143

Query: 152 PTSS-SLIEGRWQLVFTTRPGTASIIQR---TFVGVDFFSVFQEIFLRTNDPRVSNIVKF 211
           P  +  L+ G+W LV+T+  G   ++ R     V VD  S      + ++   V N V+F
Sbjct: 144 PNEALFLLNGKWILVYTSFVGLFPLLSRRISPLVKVDEISQ----TIDSDSFTVHNSVRF 203

Query: 212 SD--AIGELKVEAAASVKDGKRILFQFDTAAF--------------------SFKFFPFK 271
           +   A   L   A   V+  KR+  +F+                             P K
Sbjct: 204 ASPLATTSLSTNAKFEVRSPKRVQVKFEQGVIGTPQLTDSIEIPEFVEVLGQKIDLNPIK 263

Query: 272 ------------------VPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVL 288
                                P+ F L GD A+ WL TTYL    +LRISRG+ G+ FVL
Sbjct: 264 GLLTSVQDTASSVARTISSQPPLKFSLPGDSAQSWLLTTYLDK--DLRISRGDGGSVFVL 320

BLAST of HG10001104 vs. ExPASy Swiss-Prot
Match: P80471 (Light-induced protein, chloroplastic OS=Solanum tuberosum OX=4113 PE=1 SV=2)

HSP 1 Score: 52.4 bits (124), Expect = 1.6e-05
Identity = 73/294 (24.83%), Positives = 111/294 (37.76%), Query Frame = 0

Query: 50  IVTANAITPTYLHRS-----FFTPQTPFLPPIKRYHRH-------THIRLRCRFSLVDEQ 109
           I + N  + T LHRS     F  P+  F      Y +          IR      + +E 
Sbjct: 33  ISSTNFPSKTELHRSISVKEFTNPKPKFTAQATNYDKEDEWGPEVEQIRPGGVAVVEEEP 92

Query: 110 QKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSS-SLIE 169
            KE       +  L ++L G   RG S SS+  + +   ++ LES      PT + +L+ 
Sbjct: 93  PKEPSEIELLKKQLADSLYGT-NRGLSASSETRAEIVELITQLESKNPNPAPTEALTLLN 152

Query: 170 GRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPR---VSNIVKFSD--AIGEL 229
           G+W L +T+  G   ++ R  + +    V  E   +T D     V N V F+   A   +
Sbjct: 153 GKWILAYTSFSGLFPLLSRGNLPL----VRVEEISQTIDSESFTVQNSVVFAGPLATTSI 212

Query: 230 KVEAAASVKDGKRILFQFDTAAF--------------------SFKFFPFK--------- 288
              A   V+  KR+  +F+                             PFK         
Sbjct: 213 STNAKFEVRSPKRVQIKFEEGIIGTPQLTDSIVLPENVEFLGQKIDLSPFKGLITSVQDT 272

BLAST of HG10001104 vs. ExPASy TrEMBL
Match: A0A5D3DEA0 (Putative plastid-lipid-associated protein 12 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold859G00220 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 3.2e-190
Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANA+TPTYLHRSFFTP TP LPPIKRYHR HTHIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           I+ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILFQFDR 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIST KG+E AIDKLISENQN +KFEEELLEG WNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQ+IKNGQMKF VDMLLGLRFSMIG+LVKSGD+TYDVTMDDAAIIGGPFGYPL MESR
Sbjct: 305 MGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLGMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. ExPASy TrEMBL
Match: A0A1S3BNF7 (probable plastid-lipid-associated protein 12, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103491552 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 3.2e-190
Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANA+TPTYLHRSFFTP TP LPPIKRYHR HTHIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           I+ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILFQFDR 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIST KG+E AIDKLISENQN +KFEEELLEG WNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQ+IKNGQMKF VDMLLGLRFSMIG+LVKSGD+TYDVTMDDAAIIGGPFGYPL MESR
Sbjct: 305 MGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLGMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. ExPASy TrEMBL
Match: A0A0A0KX56 (PAP_fibrillin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G286330 PE=3 SV=1)

HSP 1 Score: 674.1 bits (1738), Expect = 4.2e-190
Identity = 342/366 (93.44%), Positives = 351/366 (95.90%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHR-HTHIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANA+TPTYLHRSFFTPQTP LPPIKRYHR HTHIRLRCR SLVDEQQKEVVSFS+PENSL
Sbjct: 43  ANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSL 102

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           I+ALIGVQGRGRSVSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 103 IDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 162

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 163 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDR 222

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 223 AAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 282

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           Q LLLAIST KG+E AIDKLISENQN +KFEEELLEG WNMLWSSQMETDSWIENAANGL
Sbjct: 283 QKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGL 342

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQ+IKNGQMKF VDMLLGLRFSMIGTLVKSGD+ YDVTMDDAAIIGGPFGYPL MESR
Sbjct: 343 MGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESR 402

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 403 FKLQLL 408

BLAST of HG10001104 vs. ExPASy TrEMBL
Match: A0A6J1JFQ6 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484606 PE=3 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 5.1e-188
Identity = 337/366 (92.08%), Positives = 349/366 (95.36%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHRH-THIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANAI PTYLHRSFFTPQ P LPPIKRYHR+ THIRLRCR SLVDEQQKEVVSFSEPENSL
Sbjct: 5   ANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEPENSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           IEALIGVQGRGR+VSSQQLSNVERAVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IEALIGVQGRGRAVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDK 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIS GK +E AIDKL+SE +N +KF++ELLEGDWNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISAGKRVEEAIDKLVSEYRNENKFQQELLEGDWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQIIKNGQMKFRVDMLLG+RFSM GT VKS DDTYDV+MDDAAIIGGPFGYP+EMESR
Sbjct: 305 MGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVEMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. ExPASy TrEMBL
Match: A0A6J1FUZ3 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448350 PE=3 SV=1)

HSP 1 Score: 662.1 bits (1707), Expect = 1.6e-186
Identity = 336/366 (91.80%), Positives = 348/366 (95.08%), Query Frame = 0

Query: 53  ANAITPTYLHRSFFTPQTPFLPPIKRYHRH-THIRLRCRFSLVDEQQKEVVSFSEPENSL 112
           ANAI PTYLHRSFFTPQ P LPPIKRYHR+ THIRLRCR SLVDEQQKEVVSFSEPE SL
Sbjct: 5   ANAIAPTYLHRSFFTPQAPPLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEPEKSL 64

Query: 113 IEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRPGTAS 172
           IEALIGVQGRGRSVSSQQLSNVE+AVSVLE LEGVRDPT+SSLIEGRWQLVFTTRPGTAS
Sbjct: 65  IEALIGVQGRGRSVSSQQLSNVEQAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTAS 124

Query: 173 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDT 232
           IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFD 
Sbjct: 125 IIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDK 184

Query: 233 AAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEVR 292
           AAFSFKF PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTE R
Sbjct: 185 AAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEAR 244

Query: 293 QNLLLAISTGKGIEGAIDKLISENQNGSKFEEELLEGDWNMLWSSQMETDSWIENAANGL 352
           QNLLLAIS GK ++ AIDKLISE +N +KF++ELLEGDWNMLWSSQMETDSWIENAANGL
Sbjct: 245 QNLLLAISAGKRVDEAIDKLISEYRNENKFQQELLEGDWNMLWSSQMETDSWIENAANGL 304

Query: 353 MGMQIIKNGQMKFRVDMLLGLRFSMIGTLVKSGDDTYDVTMDDAAIIGGPFGYPLEMESR 412
           MGMQIIKNGQMKFRVDMLLG+RFSM GT VKS DDTYDV+MDDAAIIGGPFGYP+EMESR
Sbjct: 305 MGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSEDDTYDVSMDDAAIIGGPFGYPVEMESR 364

Query: 413 FKLQLL 418
           FKLQLL
Sbjct: 365 FKLQLL 370

BLAST of HG10001104 vs. TAIR 10
Match: AT1G51110.1 (Plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 444.1 bits (1141), Expect = 1.4e-124
Identity = 221/335 (65.97%), Positives = 268/335 (80.00%), Query Frame = 0

Query: 85  IRLRCRFSLVDEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLE 144
           +R+ C  S     Q +  SF++ E  LI+ALIG+QGRG+S S +QL++VE AV VLE LE
Sbjct: 52  LRISCSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKVLEGLE 111

Query: 145 GVRDPTSSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLR-TNDPRVSNIVK 204
           G+++PT S LIEGRW+L+FTTRPGTAS IQRTF GVD F+VFQ+++L+ TNDPRVSNIVK
Sbjct: 112 GIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVK 171

Query: 205 FSDAIGELKVEAAASVKDGKRILFQFDTAAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTT 264
           FSD IGELKVEA AS+KDGKR+LF+FD AAF  KF PFKVPYPVPFRLLGDEAKGWLDTT
Sbjct: 172 FSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTT 231

Query: 265 YLSPSGNLRISRGNKGTTFVLQKQTEVRQNLLLAISTGKGIEGAIDKLISENQNGSKFEE 324
           YLSPSGNLRISRGNKGTTFVLQK+T  RQ LL  IS  KG+  AID+ ++ N N ++   
Sbjct: 232 YLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNY 291

Query: 325 ELLEGDWNMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFRVDMLLGLRFSMIGTLVK 384
           ELLEG W M+WSSQM TDSWIENAANGLMG QII K+G++KF V+++   RFSM G  +K
Sbjct: 292 ELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIK 351

Query: 385 SGDDTYDVTMDDAAIIGGPFGYPLEMESRFKLQLL 418
           S   TYD+ MDDAAIIGG FGYP+++ +  +L++L
Sbjct: 352 SESSTYDLKMDDAAIIGGAFGYPVDITNNIELKIL 386

BLAST of HG10001104 vs. TAIR 10
Match: AT4G22240.1 (Plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 54.7 bits (130), Expect = 2.3e-07
Identity = 54/234 (23.08%), Positives = 97/234 (41.45%), Query Frame = 0

Query: 95  DEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSS-S 154
           +E  ++V      + SL+++L G   RG S SS+  + +   ++ LES      PT +  
Sbjct: 74  EEAIEDVEETERLKRSLVDSLYGTD-RGLSASSETRAEIGDLITQLESKNPTPAPTEALF 133

Query: 155 LIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIG--EL 214
           L+ G+W L +T+      ++ R  V +       +  + +++  V N V+F+  +G   +
Sbjct: 134 LLNGKWILAYTSFVNLFPLLSRGIVPLIKVDEISQT-IDSDNFTVQNSVRFAGPLGTNSI 193

Query: 215 KVEAAASVKDGKRILFQFDTAAFSFKFF--PFKVPY------------------------ 274
              A   ++  KR+  +F+             ++P                         
Sbjct: 194 STNAKFEIRSPKRVQIKFEQGVIGTPQLTDSIEIPEYVEVLGQKIDLNPIRGLLTSVQDT 253

Query: 275 ------------PVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 288
                       P+ F L  D A+ WL TTYL    ++RISRG+ G+ FVL K+
Sbjct: 254 ASSVARTISSQPPLKFSLPADNAQSWLLTTYLDK--DIRISRGDGGSVFVLIKE 303

BLAST of HG10001104 vs. TAIR 10
Match: AT3G26070.1 (Plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 52.8 bits (125), Expect = 8.8e-07
Identity = 47/193 (24.35%), Positives = 91/193 (47.15%), Query Frame = 0

Query: 96  EQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLI 155
           E+QK++      +  L+EA+  ++ RG + S      +++    +E++   ++P  S L+
Sbjct: 69  EKQKQL------KQELLEAIEPLE-RGATASPDDQLRIDQLARKVEAVNPTKEPLKSDLV 128

Query: 156 EGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVK---FSDAIGELK 215
            G+W+L++TT   +ASI+Q         S+     +  +  +V N+     ++   G++K
Sbjct: 129 NGKWELIYTT---SASILQAKKPRF-LRSITNYQSINVDTLKVQNMETWPFYNSVTGDIK 188

Query: 216 VEAAASVKDGKRILFQFDTAAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLR 275
                   + K++  +         F P K P         D A+G L+ TY+     LR
Sbjct: 189 ------PLNSKKVAVKLQVFKI-LGFIPIKAP---------DSARGELEITYVDE--ELR 232

Query: 276 ISRGNKGTTFVLQ 286
           +SRG+KG  F+L+
Sbjct: 249 LSRGDKGNLFILK 232

BLAST of HG10001104 vs. TAIR 10
Match: AT4G04020.1 (fibrillin )

HSP 1 Score: 50.8 bits (120), Expect = 3.3e-06
Identity = 58/240 (24.17%), Positives = 94/240 (39.17%), Query Frame = 0

Query: 92  SLVDEQQKEVVSFSEPENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTS 151
           S+ D+  + V      + SL ++L G   RG SVSS   + +   ++ LES      P  
Sbjct: 79  SVADKAIESVEETERLKRSLADSLYGTD-RGLSVSSDTRAEISELITQLESKNPTPAPNE 138

Query: 152 S-SLIEGRWQLVFTTRPGTASIIQR---TFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDA 211
           +  L+ G+W L +T+  G   ++ R     V VD  S      + ++   V N V+F+  
Sbjct: 139 ALFLLNGKWILAYTSFVGLFPLLSRRIEPLVKVDEISQ----TIDSDSFTVQNSVRFAGP 198

Query: 212 IG--ELKVEAAASVKDGKRILFQFDTAAF-------------SFKFFPFKVPY------- 271
                    A   ++  KR+  +F+                 S +    K+         
Sbjct: 199 FSTTSFSTNAKFEIRSPKRVQIKFEQGVIGTPQLTDSIEIPESVEVLGQKIDLNPIKGLL 258

Query: 272 ------------------PVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 288
                             P+ F L  D  + WL TTYL    +LRISRG+ G+ +VL K+
Sbjct: 259 TSVQDTASSVARTISNQPPLKFSLPSDNTQSWLLTTYLDK--DLRISRGDGGSVYVLIKE 311

BLAST of HG10001104 vs. TAIR 10
Match: AT3G26080.1 (plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 46.6 bits (109), Expect = 6.3e-05
Identity = 44/180 (24.44%), Positives = 83/180 (46.11%), Query Frame = 0

Query: 108 ENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLESLEGVRDPTSSSLIEGRWQLVFTTRP 167
           ++ L+EA+  ++ RG + S      +++    +E++   ++P  S LI G+W+L++TT  
Sbjct: 66  KHELVEAIEPLE-RGATASPDDQLLIDQLARKVEAVNPTKEPLKSDLINGKWELIYTT-- 125

Query: 168 GTASIIQRTFVGVDFFSVFQEIFLR--TNDPRVSNIVKFSDAIGELKVEAAASVKDGKRI 227
            +A+I+Q            +  FLR  TN   ++      D +   ++E           
Sbjct: 126 -SAAILQAK----------KPRFLRSLTNYQCIN-----MDTLKVQRMETWPFYNSVTGD 185

Query: 228 LFQFDTAAFSFKFFPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ 286
           L   ++   + K   FK+   +P +     A+G L+ TY+     LRISRG     F+L+
Sbjct: 186 LTPLNSKTVAVKLQVFKILGFIPVKAPDGTARGELEITYVDE--ELRISRGKGNLLFILK 224
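
The percentages in each HSP summary follow directly from the raw counts (identical or positive positions divided by alignment length). A minimal sketch of how the two summary lines could be parsed and the percentages recomputed is given below. The function name parse_hsp and the returned keys are hypothetical and not part of the database; the pattern also accepts the "Postives" spelling that some BLAST report renderings emit.

```python
import re

# Hypothetical helper, not part of the database page: parses the two summary
# lines that precede each alignment and recomputes the percentages.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"  # tolerate the 'Postives' typo
)


def parse_hsp(score_line, identity_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "alignment_length": length,
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * identical / length, 2),
        # Percent positives = identical or conservative positions / length.
        "positives_pct": round(100 * positives / length, 2),
    }


# Summary lines of the AT1G51110.1 hit, copied from the report above.
hsp = parse_hsp(
    "HSP 1 Score: 444.1 bits (1141), Expect = 1.4e-124",
    "Identity = 221/335 (65.97%), Postives = 268/335 (80.00%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Applied to the AT1G51110.1 hit, the recomputed values agree with the printed 65.97% identity and 80.00% positives.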

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_008449765.1	6.6e-190	93.44	PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Cucumis ...
XP_004142141.1	8.7e-190	93.44	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucumis ...
XP_038902721.1	1.1e-189	93.15	probable plastid-lipid-associated protein 12, chloroplastic [Benincasa hispida]
XP_022987045.1	1.1e-187	92.08	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbit...
KAG6570366.1	1.1e-187	92.35	putative plastid-lipid-associated protein 12, chloroplastic, partial [Cucurbita ...

Match Name	E-value	Identity	Description
Q8LAP6	1.9e-123	65.97	Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thali...
Q94KU6	5.0e-07	25.32	Plastid lipid-associated protein 2, chloroplastic OS=Brassica campestris OX=3711...
O49629	3.3e-06	23.08	Probable plastid-lipid-associated protein 2, chloroplastic OS=Arabidopsis thalia...
Q94FZ9	9.5e-06	25.51	Plastid lipid-associated protein 1, chloroplastic OS=Brassica campestris OX=3711...
P80471	1.6e-05	24.83	Light-induced protein, chloroplastic OS=Solanum tuberosum OX=4113 PE=1 SV=2

Match Name	E-value	Identity	Description
A0A5D3DEA0	3.2e-190	93.44	Putative plastid-lipid-associated protein 12 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3BNF7	3.2e-190	93.44	probable plastid-lipid-associated protein 12, chloroplastic OS=Cucumis melo OX=3...
A0A0A0KX56	4.2e-190	93.44	PAP_fibrillin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G2863...
A0A6J1JFQ6	5.1e-188	92.08	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurb...
A0A6J1FUZ3	1.6e-186	91.80	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurb...

Match Name	E-value	Identity	Description
AT1G51110.1	1.4e-124	65.97	Plastid-lipid associated protein PAP / fibrillin family protein
AT4G22240.1	2.3e-07	23.08	Plastid-lipid associated protein PAP / fibrillin family protein
AT3G26070.1	8.8e-07	24.35	Plastid-lipid associated protein PAP / fibrillin family protein
AT4G04020.1	3.3e-06	24.17	fibrillin
AT3G26080.1	6.3e-05	24.44	plastid-lipid associated protein PAP / fibrillin family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR006843	Plastid lipid-associated protein/fibrillin conserved domain	PFAM	PF04755	PAP_fibrillin	coord: 108..284; e-value: 7.3E-42; score: 143.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..28
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..20
None	No IPR available	PANTHER	PTHR31906:SF30	PLASTID-LIPID-ASSOCIATED PROTEIN 12, CHLOROPLASTIC-RELATED	coord: 84..315
IPR039633	Plastid-lipid-associated protein	PANTHER	PTHR31906	FAMILY NOT NAMED	coord: 84..315

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10001104.1	HG10001104.1	mRNA