Carg04833 (gene) Silver-seed gourd

NameCarg04833
Typegene
OrganismCucurbita argyrosperma (Silver-seed gourd)
Descriptionprotein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2-like
LocationCucurbita_argyrosperma_scaffold_028 : 1202562 .. 1204013 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGTACGATTCCATTCTTGCGAGGAGCTTTAGCAAGCATGAACAAAAGAAGATGGGTTATGGAGCTTTGATGGGTTGCTTGTTCATTGCATTTAGCTTCAGCATTATGCTCAAACCTCATCTGGGTCGCCATTTTCTAGCTTGTAAGTCCCTATTGACATGTTCTTTGATGTAAAAACCAAGCCAGTTCTTGAATTTCTCTGTTCATTTTGCAGTGAATCTGAGGCTTTTTCCAAGAGCTCCAGGAATCAAAATGGCTGACATTGTTACAAAGAAGCCAACGCCAACATGCAATTTGATGGATTGGACAGACTTCTGCGACATTGATATGAATGTTAGAATTCATGGAGAATCTTCCTCAGTGATGTTTGCTTCAACCGACATGGAAGAACACTTAGAACGCAACAGCACTTGGAAAATCAAGCCATATGCTAGAAAACAAGACGCCAATGCAATGAAGAACACAAGAGAATGGTCTATAAAAGCAGTAAAAAGCCCCCAAAACTTGCCTCAATGCACACAAAATCATGGCGTTCCAGCCATTCTGTTCTCCCTTGGTGGATATGCAGGAAACCATTTCCATGACTTCACAGACGTGATCATTCCACTGTTCATAACAGCAAGAAAATTCAATGGAGAGGTCCAATTCCTCATAACTGATAGAAAATCTTGGTGGGTTTTAAAGTATCAAGCAATACTATCAAAGTTGTCGAACTATGATATCATTTACATTGACAAGGAGGCGCAAGTTCTTTGCTTTCCTCATGTCATTGTGGGGCTCAAACGTGACCCAAAAGAGCTTAGCATTGATTCAAACAACCATTCATTCTCCATGAAGGATTTCAAAGAGTTCTTGAGAAGTTCTTACTCGTTGAACAGAGGCAGAGCAATGGAAAACAGAGGAAGAGGCAGAGGAAGGAAAAAGAGGGAGATTAAGCCACGGCTTCTCATAGTTGCGAGGAGGAAAACAAGGTCATTTACGAACACAGGGGAGATCATCAAGATGGCTAAGAAACTGGGTTTTCAAGTGATTGTTACAGAGCCAGATGCTAACTTGAAGAAAGTTGCAGAAACTGTGAACTCTTGTGATGTGATGATGGGAGTTCATGGCGCTGGGCTTACCAACATTGTGTTTCTTCCTGAAAGGTCAGTTTTGATTCAGATTGTACCATTTGGAGGAGCAGAATGGGTATCCACAAGGTTCTTTGGAGAGCCATCGAAGGACATGGAATTGAAGTATTTGGGGTATGATATCAGTTTGAAAGAAAGCACGTTGATTCAGCAGTACCCTAAAGAACATGTGGTTTTGAGAGATCCTGTGGCTATCCAAAAACAGGGGTGGAGTGCTTTCAAGTCCATCTATTTTGATAACCAAAATGTGAAACTTGATATCAATAGATTTAGACCCACTTTGCTTAAAGCTCTTGAGCTTCTGCACCACAGTAGCTAG

mRNA sequence

ATGTTGTACGATTCCATTCTTGCGAGGAGCTTTAGCAAGCATGAACAAAAGAAGATGGGTTATGGAGCTTTGATGGGTTGCTTGTTCATTGCATTTAGCTTCAGCATTATGCTCAAACCTCATCTGGGTCGCCATTTTCTAGCTTTGAATCTGAGGCTTTTTCCAAGAGCTCCAGGAATCAAAATGGCTGACATTGTTACAAAGAAGCCAACGCCAACATGCAATTTGATGGATTGGACAGACTTCTGCGACATTGATATGAATGTTAGAATTCATGGAGAATCTTCCTCAGTGATGTTTGCTTCAACCGACATGGAAGAACACTTAGAACGCAACAGCACTTGGAAAATCAAGCCATATGCTAGAAAACAAGACGCCAATGCAATGAAGAACACAAGAGAATGGTCTATAAAAGCAGTAAAAAGCCCCCAAAACTTGCCTCAATGCACACAAAATCATGGCGTTCCAGCCATTCTGTTCTCCCTTGGTGGATATGCAGGAAACCATTTCCATGACTTCACAGACGTGATCATTCCACTGTTCATAACAGCAAGAAAATTCAATGGAGAGGTCCAATTCCTCATAACTGATAGAAAATCTTGGTGGGTTTTAAAGTATCAAGCAATACTATCAAAGTTGTCGAACTATGATATCATTTACATTGACAAGGAGGCGCAAGTTCTTTGCTTTCCTCATGTCATTGTGGGGCTCAAACGTGACCCAAAAGAGCTTAGCATTGATTCAAACAACCATTCATTCTCCATGAAGGATTTCAAAGAGTTCTTGAGAAGTTCTTACTCGTTGAACAGAGGCAGAGCAATGGAAAACAGAGGAAGAGGCAGAGGAAGGAAAAAGAGGGAGATTAAGCCACGGCTTCTCATAGTTGCGAGGAGGAAAACAAGGTCATTTACGAACACAGGGGAGATCATCAAGATGGCTAAGAAACTGGGTTTTCAAGTGATTGTTACAGAGCCAGATGCTAACTTGAAGAAAGTTGCAGAAACTGTGAACTCTTGTGATGTGATGATGGGAGTTCATGGCGCTGGGCTTACCAACATTGTGTTTCTTCCTGAAAGGTCAGTTTTGATTCAGATTGTACCATTTGGAGGAGCAGAATGGGTATCCACAAGGTTCTTTGGAGAGCCATCGAAGGACATGGAATTGAAGTATTTGGGGTATGATATCAGTTTGAAAGAAAGCACGTTGATTCAGCAGTACCCTAAAGAACATGTGGTTTTGAGAGATCCTGTGGCTATCCAAAAACAGGGGTGGAGTGCTTTCAAGTCCATCTATTTTGATAACCAAAATGTGAAACTTGATATCAATAGATTTAGACCCACTTTGCTTAAAGCTCTTGAGCTTCTGCACCACAGTAGCTAG

Coding sequence (CDS)

ATGTTGTACGATTCCATTCTTGCGAGGAGCTTTAGCAAGCATGAACAAAAGAAGATGGGTTATGGAGCTTTGATGGGTTGCTTGTTCATTGCATTTAGCTTCAGCATTATGCTCAAACCTCATCTGGGTCGCCATTTTCTAGCTTTGAATCTGAGGCTTTTTCCAAGAGCTCCAGGAATCAAAATGGCTGACATTGTTACAAAGAAGCCAACGCCAACATGCAATTTGATGGATTGGACAGACTTCTGCGACATTGATATGAATGTTAGAATTCATGGAGAATCTTCCTCAGTGATGTTTGCTTCAACCGACATGGAAGAACACTTAGAACGCAACAGCACTTGGAAAATCAAGCCATATGCTAGAAAACAAGACGCCAATGCAATGAAGAACACAAGAGAATGGTCTATAAAAGCAGTAAAAAGCCCCCAAAACTTGCCTCAATGCACACAAAATCATGGCGTTCCAGCCATTCTGTTCTCCCTTGGTGGATATGCAGGAAACCATTTCCATGACTTCACAGACGTGATCATTCCACTGTTCATAACAGCAAGAAAATTCAATGGAGAGGTCCAATTCCTCATAACTGATAGAAAATCTTGGTGGGTTTTAAAGTATCAAGCAATACTATCAAAGTTGTCGAACTATGATATCATTTACATTGACAAGGAGGCGCAAGTTCTTTGCTTTCCTCATGTCATTGTGGGGCTCAAACGTGACCCAAAAGAGCTTAGCATTGATTCAAACAACCATTCATTCTCCATGAAGGATTTCAAAGAGTTCTTGAGAAGTTCTTACTCGTTGAACAGAGGCAGAGCAATGGAAAACAGAGGAAGAGGCAGAGGAAGGAAAAAGAGGGAGATTAAGCCACGGCTTCTCATAGTTGCGAGGAGGAAAACAAGGTCATTTACGAACACAGGGGAGATCATCAAGATGGCTAAGAAACTGGGTTTTCAAGTGATTGTTACAGAGCCAGATGCTAACTTGAAGAAAGTTGCAGAAACTGTGAACTCTTGTGATGTGATGATGGGAGTTCATGGCGCTGGGCTTACCAACATTGTGTTTCTTCCTGAAAGGTCAGTTTTGATTCAGATTGTACCATTTGGAGGAGCAGAATGGGTATCCACAAGGTTCTTTGGAGAGCCATCGAAGGACATGGAATTGAAGTATTTGGGGTATGATATCAGTTTGAAAGAAAGCACGTTGATTCAGCAGTACCCTAAAGAACATGTGGTTTTGAGAGATCCTGTGGCTATCCAAAAACAGGGGTGGAGTGCTTTCAAGTCCATCTATTTTGATAACCAAAATGTGAAACTTGATATCAATAGATTTAGACCCACTTTGCTTAAAGCTCTTGAGCTTCTGCACCACAGTAGCTAG

Protein sequence

MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGIKMADIVTKKPTPTCNLMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMENRGRGRGRKKREIKPRLLIVARRKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS
BLAST of Carg04833 vs. NCBI nr
Match: XP_022924124.1 (uncharacterized protein LOC111431653 [Cucurbita moschata])

HSP 1 Score: 901.4 bits (2328), Expect = 1.3e-258
Identity = 457/459 (99.56%), Postives = 457/459 (99.56%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI
Sbjct: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60

Query: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPY 120
           KMADIVTKKPTPTCNLMDWTDFCDIDMN RIHGESSSVMFASTDMEEHLERNSTWKIKPY
Sbjct: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMNARIHGESSSVMFASTDMEEHLERNSTWKIKPY 120

Query: 121 ARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPL 180
           ARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPL
Sbjct: 121 ARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPL 180

Query: 181 FITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD 240
           FITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD
Sbjct: 181 FITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD 240

Query: 241 PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKT 300
           PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKT
Sbjct: 241 PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKT 300

Query: 301 RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS 360
           RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS
Sbjct: 301 RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS 360

Query: 361 VLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ 420
           VLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ
Sbjct: 361 VLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ 420

Query: 421 KQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS 460
           KQGWSAFKSIYFDNQNVKLDI RFRPTLLKALELLHHSS
Sbjct: 421 KQGWSAFKSIYFDNQNVKLDIKRFRPTLLKALELLHHSS 459

BLAST of Carg04833 vs. NCBI nr
Match: XP_023001686.1 (uncharacterized protein LOC111495750 [Cucurbita maxima])

HSP 1 Score: 887.1 bits (2291), Expect = 2.5e-254
Identity = 443/459 (96.51%), Postives = 445/459 (96.95%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRA GI
Sbjct: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRASGI 60

Query: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPY 120
           KMADI+TKKPTP CN MDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPY
Sbjct: 61  KMADIITKKPTPACNWMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPY 120

Query: 121 ARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPL 180
           ARKQDANAMKNTREWS+KA KSPQ LPQCTQNH VPAILFSLGGYAGNHFHDFTDVIIPL
Sbjct: 121 ARKQDANAMKNTREWSVKAAKSPQKLPQCTQNHSVPAILFSLGGYAGNHFHDFTDVIIPL 180

Query: 181 FITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD 240
           FITARKFNGEVQFLITD KSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD
Sbjct: 181 FITARKFNGEVQFLITDSKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD 240

Query: 241 PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKT 300
           PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAM   XXXXXX   EIKPRLLIVARRKT
Sbjct: 241 PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMENRXXXXXXKKWEIKPRLLIVARRKT 300

Query: 301 RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS 360
           RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS
Sbjct: 301 RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS 360

Query: 361 VLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ 420
           V IQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ
Sbjct: 361 VFIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ 420

Query: 421 KQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS 460
           KQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS
Sbjct: 421 KQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS 459

BLAST of Carg04833 vs. NCBI nr
Match: XP_022137684.1 (protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2-like [Momordica charantia])

HSP 1 Score: 567.4 bits (1461), Expect = 4.5e-158
Identity = 290/399 (72.68%), Postives = 325/399 (81.45%), Query Frame = 0

Query: 61  KMADIVTKKPTPTCNLMDWTDFCD-IDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKP 120
           ++AD V KK TP C+L D  DFCD ID+NVRI GESSSV FASTDM    E NS+WKI+P
Sbjct: 13  EVADAVIKKTTPICSLADRADFCDIIDVNVRIDGESSSVYFASTDMGIS-EGNSSWKIRP 72

Query: 121 YARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIP 180
           YARK+D  AM NTREWS+KAVK+PQ +PQCT+NH VPAILFS GGYAGNHFHDFTDV+IP
Sbjct: 73  YARKEDEAAMGNTREWSVKAVKNPQKMPQCTRNHSVPAILFSTGGYAGNHFHDFTDVVIP 132

Query: 181 LFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKR 240
           LF+TAR+FNGEVQFLITD + WWVLKYQAI+SKLS +DII IDKE QV CFP  IVGLKR
Sbjct: 133 LFVTAREFNGEVQFLITDTQPWWVLKYQAIISKLSKFDIINIDKETQVHCFPRAIVGLKR 192

Query: 241 DPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRK 300
           D KEL ID   HS+SM+DFKEFLRS YSL R R               +KP+LLIVARR+
Sbjct: 193 DDKELRIDPKKHSYSMRDFKEFLRSCYSLKRDRG------GGRGRKKGLKPQLLIVARRR 252

Query: 301 TRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPER 360
           TRSFTNT EI KMA+KLGF+VIV EPD N+KKVAE VNSCDVMMGVHGAGL NIVFLP+ 
Sbjct: 253 TRSFTNTEEIRKMARKLGFKVIVMEPDVNVKKVAEIVNSCDVMMGVHGAGLANIVFLPKN 312

Query: 361 SVLIQIVPFGG--AEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPV 420
           +V IQIVPFG   A+W+S  FFGEPSKDMELKYL Y++S++ESTLIQQYPK+ VVLRDP 
Sbjct: 313 AVFIQIVPFGSRWAKWLSKNFFGEPSKDMELKYLEYNMSVEESTLIQQYPKDDVVLRDPQ 372

Query: 421 AIQKQ-GWSAFKSIYFDNQNVKLDINRFRPTLLKALELL 456
           AIQ Q GW AFKS+YF  QNV LDINRFRPTLLKALELL
Sbjct: 373 AIQDQGGWRAFKSVYFVKQNVNLDINRFRPTLLKALELL 404

BLAST of Carg04833 vs. NCBI nr
Match: XP_007209830.2 (uncharacterized protein LOC18777258 [Prunus persica] >XP_020420225.1 uncharacterized protein LOC18777258 [Prunus persica] >ONI09268.1 hypothetical protein PRUPE_5G227600 [Prunus persica])

HSP 1 Score: 519.2 bits (1336), Expect = 1.4e-143
Identity = 275/523 (52.58%), Postives = 342/523 (65.39%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLG--------------RHF 60
           ++YDSILARSFS+HEQKK+ YGA + CL IA  F  +LKP+L                  
Sbjct: 2   VMYDSILARSFSRHEQKKLRYGAFVCCLLIALCFCAVLKPNLNPLPALKLQRSMGVKHQI 61

Query: 61  LAL-----------NLRLFP---------------------------------------R 120
           LAL           N+  +P                                       +
Sbjct: 62  LALWETNSTQQVIKNMLEYPEINTNTTEEFENMTSQVAADSDDLQQNASTLTNPIASSSQ 121

Query: 121 APGIKMADIVTKKPTPTCNLMD-WTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTW 180
           A   K+ ++VTK   P CN M+  T+FC+++M+V +  +SSS    S+ +      N +W
Sbjct: 122 AEEAKIEEVVTKNLEPLCNTMEAKTEFCELNMDVHVDAKSSSAFVVSSQI-----GNRSW 181

Query: 181 KIKPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTD 240
            I+PYARK+D  AM  TR WS+K V     +PQC +NH VPAILFS GGY GNHFH+FTD
Sbjct: 182 SIRPYARKEDKTAMSRTRAWSVKPVIGDLEIPQCNRNHRVPAILFSNGGYTGNHFHEFTD 241

Query: 241 VIIPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIV 300
           V+IPLFIT+RK++GEVQFLI+D K +WV KYQA+L  LS YDII IDKE  V CFP + V
Sbjct: 242 VVIPLFITSRKYDGEVQFLISDIKPFWVTKYQAVLKGLSKYDIIDIDKEDVVHCFPSLTV 301

Query: 301 GLKRDPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIV 360
           GLKR  KELSID + HS+SMKDF+EFLR+S+SL +  A+              +PRLLI+
Sbjct: 302 GLKRHEKELSIDPSKHSYSMKDFREFLRNSFSLKKANAIRIKDGHQRK-----RPRLLII 361

Query: 361 ARRKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVF 420
            R++TRSFTNTGEI KMA++LGF+VIV E D NL K AE VNSCDV+MGVHGAGLTNI+F
Sbjct: 362 PRKRTRSFTNTGEISKMARRLGFKVIVAEADINLSKFAEVVNSCDVLMGVHGAGLTNILF 421

Query: 421 LPERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRD 459
           LPE +V IQI+P GG EW++T  FGEPS+DM LKYL Y IS +ESTLIQQYP +H V  D
Sbjct: 422 LPENAVFIQILPIGGFEWLATNDFGEPSQDMNLKYLEYKISNEESTLIQQYPLDHAVFTD 481

BLAST of Carg04833 vs. NCBI nr
Match: PQQ15549.1 (uncharacterized protein Pyn_28942 [Prunus yedoensis var. nudiflora])

HSP 1 Score: 518.5 bits (1334), Expect = 2.4e-143
Identity = 276/521 (52.98%), Postives = 341/521 (65.45%), Query Frame = 0

Query: 3   YDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLG--------------RHFLA 62
           YDSILARSFS+HEQKK+GYGA + CL IA  F  +LKP+L                  LA
Sbjct: 4   YDSILARSFSRHEQKKLGYGAFVCCLLIALCFCTVLKPNLNPLPALKLQRSMGVKHKILA 63

Query: 63  L-----------NLRLFP---------------------------------------RAP 122
           L           N+  +P                                       +A 
Sbjct: 64  LWETNSTQQVIKNMLEYPEINTNTTEEFENVYSQVAADSDDLQQNASTLTNPIASSSQAE 123

Query: 123 GIKMADIVTKKPTPTCNLMD-WTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKI 182
             K+A++VTK   P CN M+  T+FC+I+  VR+  +SSS    S+ +      N +W I
Sbjct: 124 EAKIAEVVTKNLEPLCNTMEAKTEFCEINKEVRVDAKSSSAFVVSSQI-----GNRSWSI 183

Query: 183 KPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVI 242
           +PYARK+D  AM +TR WS+K V     +PQC +NH VPAILFS GGY GNHFH+ TDV+
Sbjct: 184 RPYARKEDETAMSHTRAWSVKPVIGDLEIPQCNRNHRVPAILFSNGGYTGNHFHEITDVV 243

Query: 243 IPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGL 302
           IPLFIT+RK++GEVQFLI+D K +WV KY+A+L  LS YDII IDKE  V CFP + VGL
Sbjct: 244 IPLFITSRKYDGEVQFLISDIKPFWVPKYRAVLKGLSKYDIIDIDKEDVVHCFPSLTVGL 303

Query: 303 KRDPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVAR 362
           KR  KELSID + HS+SMKDF+EFLR+SYSL +  A+              +PRLLI+ R
Sbjct: 304 KRHEKELSIDPSKHSYSMKDFREFLRNSYSLKKANAIRIKDGHQRK-----RPRLLIIPR 363

Query: 363 RKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLP 422
           ++TRSFTNTGEI KMA++LGF+VIV E D NL K AE VNSCDV+MGVHGAGLTNI+FLP
Sbjct: 364 KRTRSFTNTGEISKMARRLGFKVIVAEADINLSKFAEVVNSCDVLMGVHGAGLTNILFLP 423

Query: 423 ERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPV 459
           E +V IQI+P GG EW++T  FGEPS+DM LKYL Y IS +ESTLIQQYP +H V  DP 
Sbjct: 424 ENAVFIQILPVGGFEWLATNDFGEPSQDMNLKYLEYKISNEESTLIQQYPLDHAVFTDPY 483

BLAST of Carg04833 vs. TAIR10
Match: AT3G18180.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 433.3 bits (1113), Expect = 1.8e-121
Identity = 216/479 (45.09%), Postives = 318/479 (66.39%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHL------------GRHFLA 60
           +LYD++LARSFSK +QK++  GA +  L +  +   ++KP+L            G     
Sbjct: 6   ILYDTVLARSFSKTDQKRLCCGAFIASLLLVLTLCTVVKPYLSPLPIVELQLSVGTGLRM 65

Query: 61  LNLRLFPRAPGIKMADIVT-----KKPTPTCNLMDWTDFCDIDMNVRIHGESSSVMFAST 120
           L++        I   ++++     +KP   CN +   +FCD+  +VRIHG+S++V+ A T
Sbjct: 66  LSITELTTNTTISKEEVISECNKMEKPICHCNTLGSKEFCDVSGDVRIHGKSATVLAAVT 125

Query: 121 DMEEHLERNSTWKIKPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLG 180
                   NSTW ++PYARK    AMK  REW++K V++  +L +C +NH VPAILFSLG
Sbjct: 126 FA---FSGNSTWYMRPYARKDQVPAMKRVREWTVKLVQN-ASLSRCVRNHSVPAILFSLG 185

Query: 181 GYAGNHFHDFTDVIIPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDK 240
           G++ N+FHDFTD++IPL+ TAR+F+GEVQFL+T++   W+ K++ ++ KLSNY++IYID+
Sbjct: 186 GFSLNNFHDFTDIVIPLYTTARRFSGEVQFLVTNKNLLWINKFKELVRKLSNYEVIYIDE 245

Query: 241 EAQVLCFPHVIVGLKRD---PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXX 300
           E +  CF  VIVGL R     KEL+ D +N  +SM DF++FLR +YSL            
Sbjct: 246 EDETHCFSSVIVGLNRHRDYDKELTTDPSNSEYSMSDFRKFLRDTYSLRNSAVTTRR--- 305

Query: 301 XXXXXXEIKPRLLIVARRKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCD 360
                   KPR+LI++R ++R+F N GEI + A+++GF+V+V E +  +   A TVNSCD
Sbjct: 306 --------KPRILILSRSRSRAFVNAGEIARAARQIGFKVVVAEANTEIASFAITVNSCD 365

Query: 361 VMMGVHGAGLTNIVFLPERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKES 420
           VM+GVHGAG+TN+VFLP+ +++IQI+P GG EW++   F  PSK M L+YL Y I+ +ES
Sbjct: 366 VMLGVHGAGMTNMVFLPDNAIVIQILPIGGFEWLAKMDFEYPSKGMNLRYLEYKITAEES 425

Query: 421 TLIQQYPKEHVVLRDPVAIQKQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHSS 460
           TL++QY ++H  +RDP+A+ K+GW  FKS+Y   QNV +DINRF+  L+KALELLH+ S
Sbjct: 426 TLVKQYGRDHEFVRDPLAVAKRGWGTFKSVYLVQQNVSVDINRFKLVLVKALELLHNQS 469

BLAST of Carg04833 vs. TAIR10
Match: AT3G18170.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 406.8 bits (1044), Expect = 1.8e-113
Identity = 194/392 (49.49%), Postives = 277/392 (70.66%), Query Frame = 0

Query: 72  PTCNLMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKN 131
           P C  +  T+FC+++ +VR+HG+S++V  A T        NSTW I+PYARK D  AMK 
Sbjct: 3   PICTKLARTEFCELNGDVRVHGKSATVSAAITFA---FSGNSTWHIRPYARKGDTVAMKR 62

Query: 132 TREWSIKAVKSPQ-----NLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARK 191
            REW++K  ++       N  +C +NH VPA++FSLGGY+ N+FHDFTD++IPL+ TAR+
Sbjct: 63  VREWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYSMNNFHDFTDIVIPLYTTARR 122

Query: 192 FNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDP---KE 251
           FNGEVQFL+T++   W+ K++ ++ KLSNY++IYID+E +  CF  V VGL R     KE
Sbjct: 123 FNGEVQFLVTNKSPSWINKFKELVRKLSNYEVIYIDEEDETHCFSSVTVGLTRHREYFKE 182

Query: 252 LSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKTRSF 311
           L+ID +N  +SM DF+ FLR +YSL                    +PR+LI+AR ++R+F
Sbjct: 183 LTIDPSNSEYSMSDFRSFLRDTYSLRNDAVATRQIRRR-------RPRILILARGRSRAF 242

Query: 312 TNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLI 371
            NTGEI + A+++GF+V+V E +  + K A+TVNSCDVM+GVHGAGLTN+VFLPE +V+I
Sbjct: 243 VNTGEIARAARQIGFKVVVAEANIGIAKFAQTVNSCDVMLGVHGAGLTNMVFLPENAVVI 302

Query: 372 QIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQG 431
           Q++P GG EW++   F +PS+ M L+YL Y I+++ESTL+++Y ++H ++RDP A+ K G
Sbjct: 303 QVLPIGGFEWLAKTDFEKPSEGMNLRYLEYKIAVEESTLVKKYGRDHEIVRDPSAVAKHG 362

Query: 432 WSAFKSIYFDNQNVKLDINRFRPTLLKALELL 456
           W  FKS+Y   QNV +DINRF+P L+KALELL
Sbjct: 363 WEMFKSVYLVQQNVSIDINRFKPVLVKALELL 384

BLAST of Carg04833 vs. TAIR10
Match: AT3G10320.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 249.6 bits (636), Expect = 3.7e-66
Identity = 140/383 (36.55%), Postives = 226/383 (59.01%), Query Frame = 0

Query: 80  TDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKNTREWSI-- 139
           +D C +  ++R H  SSS+ F  T  +   ++    KIKPY RK + + M+   E  +  
Sbjct: 109 SDICFMKGDIRTHSPSSSI-FLYTSNDLTTDQVLQEKIKPYTRKWETSIMETIPELKLVT 168

Query: 140 KAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGEVQFLITD 199
           K +K   +  +C   H VPA+LFS GGY GN +H+F D +IPL+IT+++FN +V F+I +
Sbjct: 169 KDMKLFGDKRKCEVIHEVPAVLFSTGGYTGNLYHEFNDGLIPLYITSKRFNKKVVFVIAE 228

Query: 200 RKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPKELSIDSN---NHSFS 259
              WW +KY  +LS+LS+Y +I  +K+ +  CF   IVGL R   EL++D +   +   +
Sbjct: 229 YHKWWEMKYGDVLSQLSDYSLIDFNKDKRTHCFKEAIVGL-RIHGELTVDPSQMQDDGTT 288

Query: 260 MKDFKEFLRSSY--SLNR-GRAMXXXXXXXXXXXXEIK-PRLLIVARRKTRSFTNTGEII 319
           + +F+  L  +Y   +NR  R              + K P+L + +R  +R  TN   ++
Sbjct: 289 INEFRNVLDRAYRPRINRLDRLEEQRFHARLAQRRKAKRPKLALFSRTGSRGITNEDLMV 348

Query: 320 KMAKKLGFQVIVTEPD--ANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPF 379
           KMA+++GF + V  PD    L K+   +NS  VM+GVHGA +T+ +F+   S+ IQI+P 
Sbjct: 349 KMAQRIGFDIEVLRPDRTTELAKIYRVLNSSKVMVGVHGAAMTHFLFMKPGSIFIQIIPL 408

Query: 380 GGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQGWSAFK 439
            G +W +  ++GEP+K + L Y GY I  +ES+L ++Y K+  +L+DP +I K+GW   K
Sbjct: 409 -GTDWAAETYYGEPAKKLGLDYNGYKILPRESSLYEKYDKDDPILKDPNSITKKGWQFTK 468

Query: 440 SIYFDNQNVKLDINRFRPTLLKA 452
            IY ++Q V+LD++RF+  L+ A
Sbjct: 469 GIYLNDQKVRLDLHRFKKLLIDA 488

BLAST of Carg04833 vs. TAIR10
Match: AT2G41640.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 248.8 bits (634), Expect = 6.4e-66
Identity = 136/382 (35.60%), Postives = 228/382 (59.69%), Query Frame = 0

Query: 80  TDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKNTREWSIKA 139
           +D C +  +VR +  SSS+   ++    + +     KIKPY RK + + M   +E ++  
Sbjct: 108 SDICVMKGDVRTNSASSSIFLFTSSTNNNTKPE---KIKPYTRKWETSVMDTVQELNLIT 167

Query: 140 VKSPQNLPQ-CTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGEVQFLITDR 199
             S ++  + C   H VPA+ FS GGY GN +H+F D IIPLFIT++ +N +V F+I + 
Sbjct: 168 KDSNKSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFITSQHYNKKVVFVIVEY 227

Query: 200 KSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPKELSIDSN--NHSFSMK 259
             WW +KY  ++S+LS+Y ++  + + +  CF    VGL R   EL+++S+    + ++ 
Sbjct: 228 HDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGL-RIHDELTVNSSLVIGNQTIV 287

Query: 260 DFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEI--KPRLLIVARR-KTRSFTNTGEIIKMA 319
           DF+  L   YS +R +++            +   KP+L+I++R   +R+  N   ++++A
Sbjct: 288 DFRNVLDRGYS-HRIQSLTQEETEANVTALDFKKKPKLVILSRNGSSRAILNENLLVELA 347

Query: 320 KKLGFQVIVTEPD--ANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPFGGA 379
           +K GF V V  P     + K+  ++N+ DVM+GVHGA +T+ +FL  ++V IQI+P  G 
Sbjct: 348 EKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKTVFIQIIPL-GT 407

Query: 380 EWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQGWSAFKSIY 439
           +W +  ++GEP+K + LKY+GY I+ KES+L ++Y K+  V+RDP ++  +GW   K IY
Sbjct: 408 DWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVIRDPDSLNDKGWEYTKKIY 467

Query: 440 FDNQNVKLDINRFRPTLLKALE 454
              QNVKLD+ RFR TL ++ +
Sbjct: 468 LQGQNVKLDLRRFRETLTRSYD 483

BLAST of Carg04833 vs. TAIR10
Match: AT3G57380.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 248.4 bits (633), Expect = 8.3e-66
Identity = 146/390 (37.44%), Postives = 226/390 (57.95%), Query Frame = 0

Query: 80  TDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKNTREWSI-- 139
           +D C +  +VR H  SSSV F  T ++   +   T KIKPY RK + + M+  +E ++  
Sbjct: 104 SDVCIMKGDVRTHSASSSV-FLFTSLKN--KTKITKKIKPYTRKWETSVMQTVQELNLVY 163

Query: 140 -------KAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGE 199
                    V S  ++  C   + VPA+ FS GGY GN +H+F D IIPLFIT+  FN +
Sbjct: 164 RDEENNSLVVSSVNDI--CDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFITSHHFNKK 223

Query: 200 VQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPKELSIDSN- 259
           V F+I +  SWW++KY  I+S+LS+Y  +  + + +  CF   IVGLK    EL+++S+ 
Sbjct: 224 VVFVIVEYHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIH-DELTVESSL 283

Query: 260 -NHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXE---IKPRLLIVARRKTRSFTN 319
              + ++ DF+  L  +Y   R   +            E    KP L+I++R  +R   N
Sbjct: 284 MLGNKTILDFRNVLDQAY-WPRIHGLIQEEELKAANKTEDGFKKPILVILSRNGSREILN 343

Query: 320 TGEIIKMAKKLGFQVIVTEPD--ANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLI 379
              ++++A+++GF V V  PD    L K+   +NS DVM+GVHGA +T+++FL  ++V I
Sbjct: 344 ESLLVELAEEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMTHLLFLKPKTVFI 403

Query: 380 QIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQG 439
           QI+P  G EW +  ++G+P+K M LKY+GY I  KES+L  +Y  +  ++RDP +  ++G
Sbjct: 404 QIIPI-GTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPIIRDPKSFTQKG 463

Query: 440 WSAFKSIYFDNQNVKLDINRFRPTLLKALE 454
           W   K IY + QNVKLD+ RFR  L +A +
Sbjct: 464 WDYTKKIYLERQNVKLDLKRFRKPLSRAYD 485

BLAST of Carg04833 vs. Swiss-Prot
Match: sp|Q5NDE4|PMGT2_TAKRU (Protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2 OS=Takifugu rubripes OX=31033 GN=pomgnt2 PE=2 SV=1)

HSP 1 Score: 47.4 bits (111), Expect = 5.1e-04
Identity = 47/224 (20.98%), Postives = 94/224 (41.96%), Query Frame = 0

Query: 168 NHFHDFTDVIIPLFITARKF-NGEVQFLITDRKSWWVLKYQAILSKLSNYDII---YIDK 227
           N  H F D ++P F T ++F + +    +   + W    +  +   LSN   +    +  
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDSDEDARLVFMEGWEEGPHFELYRLLSNKQPLLKEQLRN 221

Query: 228 EAQVLCFPHVIVGLKR-------------DPKELSIDSNNHSFSMKDFKEFLRSSYSLNR 287
             +++CF    +GL +              PK   + S N    ++ F + L    ++ R
Sbjct: 222 FGKLMCFTKSYIGLSKMTTWYQYGFVQPQGPKANILVSGN---EIRHFAKVLMEKMNITR 281

Query: 288 GRAMXXXXXXXXXXXXEIKPR---LLIVARRKTRSFTNTGEIIKMAKKLGFQ---VIVTE 347
                           + KP+   +++ +R  TR   N  E+I MA    FQ   V V+ 
Sbjct: 282 AAG----GEKDQGNAEDEKPKDEYIVVFSRSTTRLILNEAELI-MALAQEFQMRVVTVSL 341

Query: 348 PDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPF 369
            + +   + + ++   +++ +HGA L   +FLP  +V++++ PF
Sbjct: 342 EEQSFPSIVQVISGASMLVSMHGAQLITSLFLPPGAVVVELYPF 377

BLAST of Carg04833 vs. TrEMBL
Match: tr|M5WGC2|M5WGC2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa025612mg PE=4 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 6.4e-145
Identity = 267/477 (55.97%), Postives = 334/477 (70.02%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           ++YDSILARSFS+HEQKK+ YGA + CL IA  F  +LKP+L      +++ +F      
Sbjct: 2   VMYDSILARSFSRHEQKKLRYGAFVCCLLIALCFCAVLKPNLNPLPACMSICVFSEFSLC 61

Query: 61  KMADIV------------------TKKPTPTCNLMD-WTDFCDIDMNVRIHGESSSVMFA 120
           +    +                  T    P CN M+  T+FC+++M+V +  +SSS    
Sbjct: 62  QFFFFLVTSFCFIQDEIAAVNGGQTPDTEPLCNTMEAKTEFCELNMDVHVDAKSSSAFVV 121

Query: 121 STDMEEHLERNSTWKIKPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFS 180
           S+ +      N +W I+PYARK+D  AM  TR WS+K V     +PQC +NH VPAILFS
Sbjct: 122 SSQI-----GNRSWSIRPYARKEDKTAMSRTRAWSVKPVIGDLEIPQCNRNHRVPAILFS 181

Query: 181 LGGYAGNHFHDFTDVIIPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYI 240
            GGY GNHFH+FTDV+IPLFIT+RK++GEVQFLI+D K +WV KYQA+L  LS YDII I
Sbjct: 182 NGGYTGNHFHEFTDVVIPLFITSRKYDGEVQFLISDIKPFWVTKYQAVLKGLSKYDIIDI 241

Query: 241 DKEAQVLCFPHVIVGLKRDPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXX 300
           DKE  V CFP + VGLKR  KELSID + HS+SMKDF+EFLR+S+SL +  A+       
Sbjct: 242 DKEDVVHCFPSLTVGLKRHEKELSIDPSKHSYSMKDFREFLRNSFSLKKANAIRIKDGHQ 301

Query: 301 XXXXXEIKPRLLIVARRKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDV 360
                  +PRLLI+ R++TRSFTNTGEI KMA++LGF+VIV E D NL K AE VNSCDV
Sbjct: 302 RK-----RPRLLIIPRKRTRSFTNTGEISKMARRLGFKVIVAEADINLSKFAEVVNSCDV 361

Query: 361 MMGVHGAGLTNIVFLPERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKEST 420
           +MGVHGAGLTNI+FLPE +V IQI+P GG EW++T  FGEPS+DM LKYL Y IS +EST
Sbjct: 362 LMGVHGAGLTNILFLPENAVFIQILPIGGFEWLATNDFGEPSQDMNLKYLEYKISNEEST 421

Query: 421 LIQQYPKEHVVLRDPVAIQKQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLHHS 459
           LIQQYP +H V  DP +I KQGW AFKSI+ + QNVKL++NRFRPTLLKALELLH +
Sbjct: 422 LIQQYPLDHAVFTDPYSIGKQGWEAFKSIFLEKQNVKLNVNRFRPTLLKALELLHQN 468

BLAST of Carg04833 vs. TrEMBL
Match: tr|A0A251PCG8|A0A251PCG8_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G227600 PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 9.2e-144
Identity = 275/523 (52.58%), Postives = 342/523 (65.39%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLG--------------RHF 60
           ++YDSILARSFS+HEQKK+ YGA + CL IA  F  +LKP+L                  
Sbjct: 2   VMYDSILARSFSRHEQKKLRYGAFVCCLLIALCFCAVLKPNLNPLPALKLQRSMGVKHQI 61

Query: 61  LAL-----------NLRLFP---------------------------------------R 120
           LAL           N+  +P                                       +
Sbjct: 62  LALWETNSTQQVIKNMLEYPEINTNTTEEFENMTSQVAADSDDLQQNASTLTNPIASSSQ 121

Query: 121 APGIKMADIVTKKPTPTCNLMD-WTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTW 180
           A   K+ ++VTK   P CN M+  T+FC+++M+V +  +SSS    S+ +      N +W
Sbjct: 122 AEEAKIEEVVTKNLEPLCNTMEAKTEFCELNMDVHVDAKSSSAFVVSSQI-----GNRSW 181

Query: 181 KIKPYARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTD 240
            I+PYARK+D  AM  TR WS+K V     +PQC +NH VPAILFS GGY GNHFH+FTD
Sbjct: 182 SIRPYARKEDKTAMSRTRAWSVKPVIGDLEIPQCNRNHRVPAILFSNGGYTGNHFHEFTD 241

Query: 241 VIIPLFITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIV 300
           V+IPLFIT+RK++GEVQFLI+D K +WV KYQA+L  LS YDII IDKE  V CFP + V
Sbjct: 242 VVIPLFITSRKYDGEVQFLISDIKPFWVTKYQAVLKGLSKYDIIDIDKEDVVHCFPSLTV 301

Query: 301 GLKRDPKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIV 360
           GLKR  KELSID + HS+SMKDF+EFLR+S+SL +  A+              +PRLLI+
Sbjct: 302 GLKRHEKELSIDPSKHSYSMKDFREFLRNSFSLKKANAIRIKDGHQRK-----RPRLLII 361

Query: 361 ARRKTRSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVF 420
            R++TRSFTNTGEI KMA++LGF+VIV E D NL K AE VNSCDV+MGVHGAGLTNI+F
Sbjct: 362 PRKRTRSFTNTGEISKMARRLGFKVIVAEADINLSKFAEVVNSCDVLMGVHGAGLTNILF 421

Query: 421 LPERSVLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRD 459
           LPE +V IQI+P GG EW++T  FGEPS+DM LKYL Y IS +ESTLIQQYP +H V  D
Sbjct: 422 LPENAVFIQILPIGGFEWLATNDFGEPSQDMNLKYLEYKISNEESTLIQQYPLDHAVFTD 481

BLAST of Carg04833 vs. TrEMBL
Match: tr|A0A2N9I0B3|A0A2N9I0B3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS45336 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 7.8e-143
Identity = 267/506 (52.77%), Postives = 335/506 (66.21%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           M+YD +LA+SFSKHEQKK+GYGA +GCL IA SF  +  P+LG     LNL+L   A G 
Sbjct: 1   MMYDDVLAKSFSKHEQKKLGYGAFVGCLLIALSFCTVFNPYLG-PLPVLNLKLSMSA-GF 60

Query: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMN-------------------------------- 120
           KM  +     +P+  L     F + + +                                
Sbjct: 61  KMLMLRDSSGSPSPPLGPLNHFSEKNTSSSQETVKVAEIMTDKKXXXXXXXXXXXXXXXX 120

Query: 121 -----------------VRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKN 180
                             RI G SSSV   S+ M      N++W I+PYARK D  AM  
Sbjct: 121 XXXXXXXXXXXXXXXXXXRIQGNSSSVFLVSSQMNILAGNNNSWSIRPYARKGDRTAMSQ 180

Query: 181 TREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGEV 240
            REWS+K +   + +P C +NH VPAILFS GGY GNHFHDFTDVIIPL++T+R++NGEV
Sbjct: 181 VREWSVK-LSRRKEIPACNKNHSVPAILFSQGGYTGNHFHDFTDVIIPLYLTSRQYNGEV 240

Query: 241 QFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPKELSIDSNNH 300
           QFLITD++ WW+ K++AI+  LS Y++I IDKE +V CFP VIVGLKR+ KEL+ID + +
Sbjct: 241 QFLITDKRPWWITKFKAIIKNLSRYELIDIDKEEEVHCFPSVIVGLKRNVKELTIDPSKY 300

Query: 301 SFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKTRSFTNTGEIIK 360
           S+SM+DF EFLRS YSL +  A+              KPRLLI++RR+TR+FTN GEI K
Sbjct: 301 SYSMRDFTEFLRSCYSLKKANAIKLRDGQRK------KPRLLIISRRRTRAFTNIGEITK 360

Query: 361 MAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPFGGA 420
           MA KLG++VIV+EP   + K AE VNSCDV+MGVHGAGLTN VFLP+ ++ IQIVPFGG 
Sbjct: 361 MASKLGYKVIVSEPTMEVSKFAELVNSCDVLMGVHGAGLTNFVFLPKNAIFIQIVPFGGF 420

Query: 421 EWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQGWSAFKSIY 458
           EW++   F EP+KDM L YL Y I+ +ES+LIQQYP +HVVLRDP++IQKQGW AFKS+Y
Sbjct: 421 EWIARIDFEEPTKDMNLNYLDYKITKEESSLIQQYPLDHVVLRDPLSIQKQGWEAFKSVY 480

BLAST of Carg04833 vs. TrEMBL
Match: tr|A0A061G2D4|A0A061G2D4_THECC (Glycosyltransferase family 61 protein, putative OS=Theobroma cacao OX=3641 GN=TCM_012531 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 7.8e-143
Identity = 250/456 (54.82%), Postives = 327/456 (71.71%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           ++YD+I ARSFS+ +QKK+GYGA +GCL IA  F ++ KP+     + LN          
Sbjct: 5   IMYDTIFARSFSRSDQKKLGYGAFLGCLLIALCFCLVFKPYTDPRSVRLN---------- 64

Query: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMNVRIHGESSSVMFASTDMEEHLERNSTWKIKPY 120
                V K+    CN    +DFC+I+ ++RI  +SS+V+F+++  E  LE NS+  I+PY
Sbjct: 65  ----SVPKRMKLVCNSETRSDFCEINGDIRIDAKSSTVLFSASPQESILEENSSRVIRPY 124

Query: 121 ARKQDANAMKNTREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPL 180
            RK+D +AM   ++WSIK       +PQC QNHGVPA+LFSLGGY+GN++HDFTD+IIPL
Sbjct: 125 TRKEDEHAMSTVKKWSIKPAVDNNTIPQCNQNHGVPAVLFSLGGYSGNNYHDFTDIIIPL 184

Query: 181 FITARKFNGEVQFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRD 240
           + TAR F+GEV+FLITDR  WW+ K+Q IL KLSNYD++ ID E  + CF  VIVGLKR 
Sbjct: 185 YSTARLFDGEVKFLITDRNPWWIKKFQIILHKLSNYDVVDIDNEESIHCFTSVIVGLKRS 244

Query: 241 PKELSIDSNNHSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKT 300
           P ELSID+    +SMK+F++FLRS+YSLN+   +            + +PRLLIV+R +T
Sbjct: 245 PHELSIDTTKSPYSMKNFRQFLRSAYSLNKSTTI------RMEDDGKARPRLLIVSRSRT 304

Query: 301 RSFTNTGEIIKMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERS 360
           R+FTNT EI +MA+ LG+ V+V E   N+ + AE VNSCDVMMGVHGAGLTN+VFLPE +
Sbjct: 305 RTFTNTDEIARMARNLGYDVVVAEA-TNVPRFAEIVNSCDVMMGVHGAGLTNMVFLPENA 364

Query: 361 VLIQIVPFGGAEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQ 420
           +LIQI+P GG EW +   FGEPSKDM ++YL Y I  +ESTLIQQYP +H VL +P +I 
Sbjct: 365 ILIQIIPIGGVEWPARTAFGEPSKDMNIRYLDYKIKTEESTLIQQYPPQHEVLNNPSSIW 424

Query: 421 KQGWSAFKSIYFDNQNVKLDINRFRPTLLKALELLH 457
           KQGW AFK++Y DNQNV LD+NRFRPTLL+ALELLH
Sbjct: 425 KQGWLAFKAVYLDNQNVNLDVNRFRPTLLRALELLH 439

BLAST of Carg04833 vs. TrEMBL
Match: tr|A0A2P4I5E8|A0A2P4I5E8_QUESU (Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_26597 PE=4 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 3.6e-140
Identity = 261/506 (51.58%), Postives = 333/506 (65.81%), Query Frame = 0

Query: 1   MLYDSILARSFSKHEQKKMGYGALMGCLFIAFSFSIMLKPHLGRHFLALNLRLFPRAPGI 60
           M+YD +LARSFSKHE+KK+GYGA  GCL IA SF  + KP+LG    ALNL+    A G 
Sbjct: 1   MMYDDLLARSFSKHEKKKLGYGAFFGCLLIALSFCTVFKPYLG-PLPALNLKQ-SMAAGF 60

Query: 61  KMADIVTKKPTPTCNLMDWTDFCDIDMN-------------------------------- 120
           KM  +  K  + +  L   T   + + +                                
Sbjct: 61  KMLMVRDKSGSYSTPLAPLTHLSETNTSSSQETVKAAKIVXXXXXXXXXXXXXXXXXXXX 120

Query: 121 -----------------VRIHGESSSVMFASTDMEEHLERNSTWKIKPYARKQDANAMKN 180
                             RI G SSSV   S++M      N++W I+PYARK D  AM  
Sbjct: 121 XXXXXXXXXXXXXXXXXXRIQGNSSSVFLVSSEMSILARNNNSWSIRPYARKGDRAAMIQ 180

Query: 181 TREWSIKAVKSPQNLPQCTQNHGVPAILFSLGGYAGNHFHDFTDVIIPLFITARKFNGEV 240
            +EWS+K +     +P C +NH VPAILFS GGY GNHFHDF+DV+IPL++T+R++NGEV
Sbjct: 181 VKEWSVK-LSGRNEIPPCNKNHTVPAILFSQGGYTGNHFHDFSDVVIPLYLTSRQYNGEV 240

Query: 241 QFLITDRKSWWVLKYQAILSKLSNYDIIYIDKEAQVLCFPHVIVGLKRDPK-ELSIDSNN 300
           QFLITD++ WW+ K++AIL  LS Y++I ID+E +V CFP  +VGLKRD K +LSID + 
Sbjct: 241 QFLITDKRPWWIAKFKAILKNLSRYELIDIDREEEVHCFPSAVVGLKRDVKAQLSIDPSK 300

Query: 301 HSFSMKDFKEFLRSSYSLNRGRAMXXXXXXXXXXXXEIKPRLLIVARRKTRSFTNTGEII 360
           +S+SM DFKEFLRS YSL    A+              KP+LLI++R++TRSFTN GEI 
Sbjct: 301 NSYSMSDFKEFLRSCYSLKNANAIKLRDGQRK------KPQLLIISRKRTRSFTNIGEIT 360

Query: 361 KMAKKLGFQVIVTEPDANLKKVAETVNSCDVMMGVHGAGLTNIVFLPERSVLIQIVPFGG 420
           KMA +LG++VIV EP  N+ K AE VNSCDV+MGVHGAGL N+VFLP+ ++LIQ+VPFGG
Sbjct: 361 KMASRLGYKVIVAEPTMNVSKFAELVNSCDVLMGVHGAGLANVVFLPKNAILIQVVPFGG 420

Query: 421 AEWVSTRFFGEPSKDMELKYLGYDISLKESTLIQQYPKEHVVLRDPVAIQKQGWSAFKSI 457
            EW++  ++GEP+KDM +KYL Y IS KESTL+QQYP +HVV +DP +IQKQGW AFKS+
Sbjct: 421 FEWLAKTYYGEPTKDMNVKYLEYKISTKESTLVQQYPPDHVVFKDPYSIQKQGWIAFKSV 480

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022924124.11.3e-25899.56uncharacterized protein LOC111431653 [Cucurbita moschata][more]
XP_023001686.12.5e-25496.51uncharacterized protein LOC111495750 [Cucurbita maxima][more]
XP_022137684.14.5e-15872.68protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2-like [Momord... [more]
XP_007209830.21.4e-14352.58uncharacterized protein LOC18777258 [Prunus persica] >XP_020420225.1 uncharacter... [more]
PQQ15549.12.4e-14352.98uncharacterized protein Pyn_28942 [Prunus yedoensis var. nudiflora][more]
Match NameE-valueIdentityDescription
AT3G18180.11.8e-12145.09Glycosyltransferase family 61 protein[more]
AT3G18170.11.8e-11349.49Glycosyltransferase family 61 protein[more]
AT3G10320.13.7e-6636.55Glycosyltransferase family 61 protein[more]
AT2G41640.16.4e-6635.60Glycosyltransferase family 61 protein[more]
AT3G57380.18.3e-6637.44Glycosyltransferase family 61 protein[more]
Match NameE-valueIdentityDescription
sp|Q5NDE4|PMGT2_TAKRU5.1e-0420.98Protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2 OS=Takifugu ... [more]
Match NameE-valueIdentityDescription
tr|M5WGC2|M5WGC2_PRUPE6.4e-14555.97Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa025612mg PE=4 SV=1[more]
tr|A0A251PCG8|A0A251PCG8_PRUPE9.2e-14452.58Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G227600 PE=4 SV=1[more]
tr|A0A2N9I0B3|A0A2N9I0B3_FAGSY7.8e-14352.77Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS45336 PE=4 SV=1[more]
tr|A0A061G2D4|A0A061G2D4_THECC7.8e-14354.82Glycosyltransferase family 61 protein, putative OS=Theobroma cacao OX=3641 GN=TC... [more]
tr|A0A2P4I5E8|A0A2P4I5E8_QUESU3.6e-14051.58Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_26597 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016757transferase activity, transferring glycosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR007657Glycosyltransferase_61
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Carg04833-RACarg04833-RAmRNA


Analysis Name: InterPro Annotations of silver-seed gourd
Date Performed: 2019-03-07
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007657Glycosyltransferase 61PFAMPF04577DUF563coord: 168..367
e-value: 7.9E-20
score: 71.8
IPR007657Glycosyltransferase 61PANTHERPTHR20961GLYCOSYLTRANSFERASEcoord: 69..456
NoneNo IPR availablePANTHERPTHR20961:SF5GLYCOSYLTRANSFERASEcoord: 69..456

The following gene(s) are paralogous to this gene:

None