Cp4.1LG02g03930 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g03930
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Neuroguidin
Location: Cp4.1LG02 : 2764543 .. 2767899 (+)
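
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on the page); the function names `parse_location` and `span_length` are illustrative only and do not belong to any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG02 : 2764543 .. 2767899 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 3357 bp on the plus strand of Cp4.1LG02.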
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAGATTTGACGGGATTTTCTATAGAGACCTTTGCCGCCAGCGCTCCGAAGCAAGCAGTTGCCGCGAAGAGCACAGCAATATCATTATTGCTCTCACCGACGGCAACGGCGACACCGAATCCCTCTCTCAGGTTCTTTGCATTCCCTATCTCTCTTCTCTCGCTCCGTTGCAGTCGTAGTAATCTCACTGTTACTCTCATCGACGGCGAAAATATTCATCTCTGTTTCTGATCCTTGTTTCGTTTGGCTTATTGTTTTCTTTGATATATATGAATTCTCCGATACAATTGATTGAGATTTCTTTACTTTTTGCAAATACACAGTTATCTGATATTTTTCAATATTGAAGTACATTTTGTGCTTCAGACTTATTATGGAAGGTTTATGAAATCCTCGTGATATGGAATTACTTGTTTTTACATCTTATATGCCTATTTATCTCAGTTTGTGGATTGCGTTTCTCTTGGCAATGTTATTCTTTCTTAAGCTTGAGATTAGTCTTTCATAAATGAACTTTTTTTCTGATGTTCCATTGTGTTAGGTTGCTTATTTGCGGAGTGTGAGTTGCTCGAAGCTCAAACAGTTATCTCTAAGATATTGTAGCCTGAAAACAGAGTGCAGTCTAGAAATTTAGCGTTTCAGTTACTGCGATGGAGGAACACAACAACATGCGTTGTGATGATAGAACAAACAAGTAACGGAATCTTTTTTNTTTCGTAACGGATTTTTTTTTTTTTTTTTTTTGGCATAAATTGTTAATTTATCTTGGAATGGATAGTTCCAATGCTTGTCATAAGATTAAAATGCTCTGACTACTCCAATTTGTTCCATGGCAATGGATTTTTTTGGAGTGACATACTTATGTTGCCTTTTTTCCATGATTATATTTGTCTGACTTACTATGAATGAATTCTCTCAGAGAAGCTTCTCAACTAACTGCATTGTTGAAGGAAATGAAGGATGGATTGGACACAGTCACAAATAAAGTCCAGGCTTTAACTGCCAAGGTTAGAACATTCTCAGTATCTGCTAATCCCAGATAGCAAAGAGAACACTCTCGTATAAATCATAATTTTTTTCATTTTAGATATGTCAATTTAGGTAGGAAGAAATTTAGATTATCATGCTGGCATAGCTAATATGCAGGAAGAAATATGTGCAAAGAATCAGCGTCGATGTTTATTTTCACGTGCTATTGTCTCTTTCGGGTCGGTGGGATTCTATTATCTGAAGTTTGCATGCAATTCAATCTTCAAGTCTGTGAAAGTTATAGTCTGAAAGATTGATAATGAATGAGGGTTTGAAGTCTCCAGAGTCTACATTGTGTTTCCAAAGTGATGTCCTTACCTAATAAAAAATATATTAATAATGCACTTTCTCCTGTTTTCGTACACTTGACTTGCGAGTTATGTCCATCTTTAGCATGTATATAGCTATGGAAAGTTAATAATCATATGTGATTATGTGTGTTCTTTGTATTATAATCTGCAAAATATTATTGTTCTAAAAAATTTGACATTAAGTTTGTGTCAAATTTGTGTTTATTAGTGTATCAATTGCAAGTTTTCAAAATATTGAAGAGATTCTTGTGGCCAGAATCTCAACGTGTCTCCAAATTCAAAAAGTTATTTTTTGGTTAAACATGACAATTTCTCTGCAGAAAATCTTATCTTGTGCTTTTTTGACATTCGAAAGTGTATATTTATGTTCTTTATCCTATTTAGTATGTTCTAATTATAGTATCTGATTTGAGAAACTATAGTTTGCTTTAAGTGATTGCCTTTTGTGCGTATCTTGCCTTTGTTCAAGCAAGAAGTCCTGTTGTTTTAAGAGTACTCCTTGAAAAACACTGGCTGATCTTGTAGAACACCTTTTTTGGATATCCTTAACTGATCCTATGGGATGATTAACTTCTATAGCAGAAATTTGAACCTACAATTTTATTTGATAAATTCTGGATATCTCGACCGTAATCATCATAAAAACCCGTGGATGACAA
TAAGCCCACATATTTATCAAGAATATATGAGCAAAAACGTGCGAAAAATGTCTTTACCCTTTGTCACAGAATTTGGGCCGAAATATCTTATTATTTTTAGTCTTTACTTCTTTTAGCTACAAAAAATTCCAGCTTTTGATTGATGGCTTTATCTGTGTTTTGTATGTCAGGTAAAATCAAACGAGCTTCCTACATCAGATGGGATAAGCTATCTTGACGCCAAATATTTATTACTTCTTAACTATTGCTCTTCACTTGTATACTATTTGCTTCGCAAAGCCAAGGGGTTCTCAATTGAGGGCCATCCTGTTGTCAGGAGTCTTGTAGAGATAAGGTTATTTCTCGAGAAGGTTCGTTCCTTTTCCCCTACTCTTTTCCGCTTGCTTACCTGCATGGGACTTATACGTAAGCCCTGCCTTTTGGTTGATGCAGATTCGACCTATCGATAAGAAGCTTGAGTATCAGATTCAGAAGCTGACAAGAGTTTCTGTTGTTGCAAAAGAGGATGCATTCGTGGATGAAAAGGAATCAGCTACACCTCAGGCTACGGATGATCGGTTAAAATATCGTCCAAACCCTGACATGCTTGTCAGTAAATCAGAAGGGATTGCTGAGGTGAGGCATTAAGTCAAGGATGAAGTTTAATAACTCCTTTATTCCAATTGTTATGCGAAAAACTCAACAATGGCTTTTGTTTTCTGAACTGAACATTCGAATAGGATGGAGATGGTATATATCGTCCACCAAGGTTTGCCCCTACTACTATGGATGAAGATAAGAGCTCTAGGAAGGAGAGAAATTCCTCGAGGAAGGATTTAGAGACGCTCCGAAGAGCTCGACAGAGTGATTATATGAGGGAGCTAATGGATGACATGGCTGGGAAACCTGAAGAGGTTTGTTATAGATCTACTATTATATAATATGGGTTTTTTTCTAAATATCCAACGTATAGAGTACTTAAAACTTGTGTTCATGGTAACTACAGATTAAAGAAAGCATTGGACTCGAAAATAGAGAAGTTGCTAGATATGTAGCTAAAATGGACGAACGTGATCGAAGAGAAGAGGAGCTTTTCACTCGTGCACCGCTTACAAAGATGGAGAAAAAGAGAGAAAAATACCTAAAGAAGTCGAGATATGGGTACGAGAACTTTGATTCGATCACCGTACTTCTAAAACATCTAATGATTGTTTCCTTAAGTTCGGTTTTGACACACTTCATTTTATTCTCTAGGATGGGTGGCGTAACCGATAGTTTCTTTGACGAAGTAAAATCGATGCCCTTGGGAGGTGCTGATGACGAGCAACCGACCAGTTTTGGTAGTAGTAGAGGCGGAATGAGAAAATATAAGAAGCGC

mRNA sequence

CAAAGATTTGACGGGATTTTCTATAGAGACCTTTGCCGCCAGCGCTCCGAAGCAAGCAGTTGCCGCGAAGAGCACAGCAATATCATTATTGCTCTCACCGACGGCAACGGCGACACCGAATCCCTCTCTCAGGTTGCTTATTTGCGGATTACTGCGATGGAGGAACACAACAACATGCGTTGTGATGATAGAACAAACAAAGAAGCTTCTCAACTAACTGCATTGTTGAAGGAAATGAAGGATGGATTGGACACAGTCACAAATAAAGTCCAGGCTTTAACTGCCAAGGTAAAATCAAACGAGCTTCCTACATCAGATGGGATAAGCTATCTTGACGCCAAATATTTATTACTTCTTAACTATTGCTCTTCACTTGTATACTATTTGCTTCGCAAAGCCAAGGGGTTCTCAATTGAGGGCCATCCTGTTGTCAGGAGTCTTGTAGAGATAAGGTTATTTCTCGAGAAGATTCGACCTATCGATAAGAAGCTTGAGTATCAGATTCAGAAGCTGACAAGAGTTTCTGTTGTTGCAAAAGAGGATGCATTCGTGGATGAAAAGGAATCAGCTACACCTCAGGCTACGGATGATCGGTTAAAATATCGTCCAAACCCTGACATGCTTGTCAGTAAATCAGAAGGGATTGCTGAGGATGGAGATGGTATATATCGTCCACCAAGGTTTGCCCCTACTACTATGGATGAAGATAAGAGCTCTAGGAAGGAGAGAAATTCCTCGAGGAAGGATTTAGAGACGCTCCGAAGAGCTCGACAGAGTGATTATATGAGGGAGCTAATGGATGACATGGCTGGGAAACCTGAAGAGATTAAAGAAAGCATTGGACTCGAAAATAGAGAAGTTGCTAGATATGTAGCTAAAATGGACGAACGTGATCGAAGAGAAGAGGAGCTTTTCACTCGTGCACCGCTTACAAAGATGGAGAAAAAGAGAGAAAAATACCTAAAGAAGTCGAGATATGGGTACGAGAACTTTGATTCGATCACCGTACTTCTAAAACATCTAATGATTGTTTCCTTAAGTTCGGTTTTGACACACTTCATTTTATTCTCTAGGATGGGTGGCGTAACCGATAGTTTCTTTGACGAAGTAAAATCGATGCCCTTGGGAGGTGCTGATGACGAGCAACCGACCAGTTTTGGTAGTAGTAGAGGCGGAATGAGAAAATATAAGAAGCGC

Coding sequence (CDS)

CAAAGATTTGACGGGATTTTCTATAGAGACCTTTGCCGCCAGCGCTCCGAAGCAAGCAGTTGCCGCGAAGAGCACAGCAATATCATTATTGCTCTCACCGACGGCAACGGCGACACCGAATCCCTCTCTCAGGTTGCTTATTTGCGGATTACTGCGATGGAGGAACACAACAACATGCGTTGTGATGATAGAACAAACAAAGAAGCTTCTCAACTAACTGCATTGTTGAAGGAAATGAAGGATGGATTGGACACAGTCACAAATAAAGTCCAGGCTTTAACTGCCAAGGTAAAATCAAACGAGCTTCCTACATCAGATGGGATAAGCTATCTTGACGCCAAATATTTATTACTTCTTAACTATTGCTCTTCACTTGTATACTATTTGCTTCGCAAAGCCAAGGGGTTCTCAATTGAGGGCCATCCTGTTGTCAGGAGTCTTGTAGAGATAAGGTTATTTCTCGAGAAGATTCGACCTATCGATAAGAAGCTTGAGTATCAGATTCAGAAGCTGACAAGAGTTTCTGTTGTTGCAAAAGAGGATGCATTCGTGGATGAAAAGGAATCAGCTACACCTCAGGCTACGGATGATCGGTTAAAATATCGTCCAAACCCTGACATGCTTGTCAGTAAATCAGAAGGGATTGCTGAGGATGGAGATGGTATATATCGTCCACCAAGGTTTGCCCCTACTACTATGGATGAAGATAAGAGCTCTAGGAAGGAGAGAAATTCCTCGAGGAAGGATTTAGAGACGCTCCGAAGAGCTCGACAGAGTGATTATATGAGGGAGCTAATGGATGACATGGCTGGGAAACCTGAAGAGATTAAAGAAAGCATTGGACTCGAAAATAGAGAAGTTGCTAGATATGTAGCTAAAATGGACGAACGTGATCGAAGAGAAGAGGAGCTTTTCACTCGTGCACCGCTTACAAAGATGGAGAAAAAGAGAGAAAAATACCTAAAGAAGTCGAGATATGGGTACGAGAACTTTGATTCGATCACCGTACTTCTAAAACATCTAATGATTGTTTCCTTAAGTTCGGTTTTGACACACTTCATTTTATTCTCTAGGATGGGTGGCGTAACCGATAGTTTCTTTGACGAAGTAAAATCGATGCCCTTGGGAGGTGCTGATGACGAGCAACCGACCAGTTTTGGTAGTAGTAGAGGCGGAATGAGAAAATATAAGAAGCGC

Protein sequence

QRFDGIFYRDLCRQRSEASSCREEHSNIIIALTDGNGDTESLSQVAYLRITAMEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR
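
The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. This is a sketch only: it assumes the standard genetic code and reading frame 1 from the first listed base, and it translates just the first 30 nucleotides of the CDS as shown on this page. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate DNA in frame 1; unknown codons (e.g. with N) become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 nt of the CDS as listed on this page.
cds_prefix = "CAAAGATTTGACGGGATTTTCTATAGAGAC"
print(translate(cds_prefix))  # QRFDGIFYRD
```

The result matches the first ten residues of the listed protein sequence, which suggests the protein was translated from the first base of the listed sequence rather than from an ATG start codon.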
BLAST of Cp4.1LG02g03930 vs. Swiss-Prot
Match: NGDN_BOVIN (Neuroguidin OS=Bos taurus GN=NGDN PE=2 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 2.1e-27
Identity = 95/286 (33.22%), Positives = 143/286 (50.00%), Query Frame = 1

Query: 74  ALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKA 133
           ALLK +++ +  VT +VQ LT KV++   PT  G+S L+ K  LLL Y   L + +L KA
Sbjct: 16  ALLKNLQEQVMAVTAQVQTLTKKVQAKAYPTEKGLSLLEVKDQLLLMYLMDLSHLILDKA 75

Query: 134 KGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQ 193
            G S++GHP V  LVEIR  LEK+RP+D+KL+YQI KL + +V                 
Sbjct: 76  SGGSLQGHPAVLRLVEIRTVLEKLRPLDQKLKYQIDKLVKTAVTGS-------------L 135

Query: 194 ATDDRLKYRPNPDMLVSK------SEGIAEDGDG-------------IYRPPRFAPTTMD 253
           + +D L+++P+P  ++SK       E  AE+G                Y PPR  P   D
Sbjct: 136 SENDPLRFKPHPSNMMSKLSSEDEEEDEAEEGQSGASGKKSGKGTAKKYVPPRLVPVHYD 195

Query: 254 EDKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKM 313
           E ++ R+++   R      RRA  S  +REL +  +  PEEI+++    +  V R   + 
Sbjct: 196 ETEAEREKKRLER----AKRRALSSSVIRELKEQYSDAPEEIRDA---RHPHVTRQSQED 255

Query: 314 DERDRREEELFTRAPLTKMEK---KREKYLKKSRYGYENFDSITVL 338
             R   EE +  R  ++K EK   KR   +    +   +F  I+ L
Sbjct: 256 QHRINYEESMMVRLSVSKREKGRRKRANVMSSQLHSLTHFSDISAL 281

BLAST of Cp4.1LG02g03930 vs. Swiss-Prot
Match: NGDN_MOUSE (Neuroguidin OS=Mus musculus GN=Ngdn PE=1 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 3.6e-27
Identity = 94/301 (31.23%), Positives = 147/301 (48.84%), Query Frame = 1

Query: 59  MRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLL 118
           M   +    + S    LLK +++ +  VT ++QALT KV++    T  G+S+L+ K  LL
Sbjct: 1   MAAPEVLESDVSSSITLLKNLQEQVMAVTAQIQALTTKVRAGTYSTEKGLSFLEVKDQLL 60

Query: 119 LNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVA 178
           L Y   L + +L KA G S++GHP V  LVEIR  LEK+RP+D+KL+YQI KL + +V  
Sbjct: 61  LMYLMDLSHLILDKASGASLQGHPAVLRLVEIRTVLEKLRPLDQKLKYQIDKLVKTAVTG 120

Query: 179 KEDAFVDEKESATPQATDDRLKYRPNPDMLVSK------SEGIAEDGDG----------- 238
                          + +D L+++P+P  +VSK       E  AE+G             
Sbjct: 121 S-------------LSENDPLRFKPHPSNMVSKLSSEDEEESEAEEGQSEASGKKSAKGS 180

Query: 239 --IYRPPRFAPTTMDEDKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKES 298
              Y PPR  P   DE ++ R+++   +      RRA  S  +REL +  +  PEEI+++
Sbjct: 181 AKKYVPPRLVPVHYDETEAEREQKRLEK----AKRRALSSSVIRELKEQYSDAPEEIRDA 240

Query: 299 IGLENREVARYVAKMDERDRREEELFTRAPLTKMEK---KREKYLKKSRYGYENFDSITV 338
               +  V R   +   R   EE +  R  ++K EK   +R   +    +   +F  I+ 
Sbjct: 241 ---RHPHVTRQSQEDQHRVNYEESMMVRLSVSKREKGLRRRASAMSSQLHSLTHFSDISA 281

BLAST of Cp4.1LG02g03930 vs. Swiss-Prot
Match: NGDN_HUMAN (Neuroguidin OS=Homo sapiens GN=NGDN PE=1 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 6.9e-26
Identity = 93/285 (32.63%), Positives = 144/285 (50.53%), Query Frame = 1

Query: 75  LLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAK 134
           LLK +++ +  VT +V++LT KV++   PT  G+S+L+ K  LLL Y   L + +L KA 
Sbjct: 17  LLKNLQEQVMAVTAQVKSLTQKVQAGAYPTEKGLSFLEVKDQLLLMYLMDLTHLILDKAS 76

Query: 135 GFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQA 194
           G S++GH  V  LVEIR  LEK+RP+D+KL+YQI KL + +V                 +
Sbjct: 77  GGSLQGHDAVLRLVEIRTVLEKLRPLDQKLKYQIDKLIKTAVTGS-------------LS 136

Query: 195 TDDRLKYRPNPDMLVSK------SEGIAEDGD----------GI---YRPPRFAPTTMDE 254
            +D L+++P+P  ++SK       E  AED            G+   Y PPR  P   DE
Sbjct: 137 ENDPLRFKPHPSNMMSKLSSEDEEEDEAEDDQSEASGKKSVKGVSKKYVPPRLVPVHYDE 196

Query: 255 DKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMD 314
            ++ R+++   R      RRA  S  +REL +  +  PEEI+++    +  V R   +  
Sbjct: 197 TEAEREKKRLER----AKRRALSSSVIRELKEQYSDAPEEIRDA---RHPHVTRQSQEDQ 256

Query: 315 ERDRREEELFTRAPLTKMEK---KREKYLKKSRYGYENFDSITVL 338
            R   EE +  R  ++K EK   KR   +    +   +F  I+ L
Sbjct: 257 HRINYEESMMVRLSVSKREKGRRKRANVMSSQLHSLTHFSDISAL 281

BLAST of Cp4.1LG02g03930 vs. Swiss-Prot
Match: NGDN_XENTR (Neuroguidin OS=Xenopus tropicalis GN=ngdn PE=2 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 1.2e-25
Identity = 109/344 (31.69%), Positives = 156/344 (45.35%), Query Frame = 1

Query: 75  LLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAK 134
           L + ++D +  VT  VQALT KV+S    T  G+S+L+ K  LLL Y   L + +L K  
Sbjct: 18  LFRTLQDQVTKVTAHVQALTQKVRSGIYNTDKGLSFLELKDQLLLFYLQDLTHLMLEKTN 77

Query: 135 GFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQA 194
           G SI+G+P +  LVE+R  LEK+RPID+KL+YQI KL R SV                  
Sbjct: 78  GKSIKGNPGILRLVELRTVLEKMRPIDQKLKYQIDKLVRASVTGS-------------LG 137

Query: 195 TDDRLKYRPNPDMLVSK----SEGIAEDGDGI---------------YRPPRFAPTTMDE 254
            +D L+++PNP  L+SK     EG ++ G+                 Y PPR AP   D+
Sbjct: 138 ENDPLRFKPNPQNLISKLSEADEGESDSGEDCAESGNAKKPQSKVKKYIPPRLAPVHYDD 197

Query: 255 DKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMD 314
            ++ R+ R   R      + A  S  +REL +  +  PEEI+E        + R+  +  
Sbjct: 198 TEAEREHRIIER----AKKLALSSSTIRELKEQYSDAPEEIREG---RAYHMMRHDKEEQ 257

Query: 315 ERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFIL 374
            R   EE +  R  +T+ EK R+K                   + L + S  + LTHF  
Sbjct: 258 HRINHEESMMVRLNMTRKEKARKK-------------------RVLAMTSQLNSLTHFSD 309

Query: 375 FSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
            S + G               G  D+   S   SR G +K KKR
Sbjct: 318 ISALTGGE-------------GRTDDLVPSVKKSRKGPKKSKKR 309

BLAST of Cp4.1LG02g03930 vs. Swiss-Prot
Match: NGDNA_XENLA (Neuroguidin-A OS=Xenopus laevis GN=ngdn-a PE=1 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 1.2e-22
Identity = 104/344 (30.23%), Positives = 154/344 (44.77%), Query Frame = 1

Query: 75  LLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAK 134
           L   ++D +  VT  VQ LT KV+S    T  G+S+L+ K  LLL Y   L + +L K  
Sbjct: 18  LFNTLQDQITKVTAHVQDLTQKVRSGIYNTDKGLSFLELKDQLLLFYLQDLTHLMLEKTN 77

Query: 135 GFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQA 194
           G SI+G+P +  LVE+R  LEK+RPID+KL+YQI KL + +V                  
Sbjct: 78  GKSIKGNPGILRLVELRTVLEKMRPIDQKLKYQIDKLVKAAVTGS-------------LG 137

Query: 195 TDDRLKYRPNPDMLVSK--------SEGIAEDGDG-----------IYRPPRFAPTTMDE 254
            +D L+++PNP  L+SK        S+   E  +G            Y PPR AP   D+
Sbjct: 138 ENDPLRFKPNPQNLMSKLSEPDERESDSGEEGAEGGVAKKPQSKVKRYIPPRLAPVHYDD 197

Query: 255 DKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMD 314
            ++ R+ R   R      + A  S  +REL +  +  PEEI+E        + R+  +  
Sbjct: 198 TEAEREHRIVER----AKKLALSSSTIRELKEQYSDAPEEIREG---RAYHMMRHDKEEQ 257

Query: 315 ERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFIL 374
            R   EE +  R  +T+ EK R+K                   + L + S  + LTHF  
Sbjct: 258 HRINHEESMMVRLNMTRKEKARKK-------------------RVLSMTSQLNSLTHFSD 310

Query: 375 FSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
            S + G              G A+D  P+   S +G  +  KK+
Sbjct: 318 ISALTGGE------------GRAEDMVPSMKKSKKGPKKSKKKK 310

BLAST of Cp4.1LG02g03930 vs. TrEMBL
Match: A0A0A0LHN9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G892730 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.2e-139
Identity = 266/338 (78.70%), Positives = 293/338 (86.69%), Query Frame = 1

Query: 62  DDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNY 121
           ++ +NKEASQLTALLKEMK+GLDTVTNKVQALTAKVKSN+LPTSDGISYLDAKY LLLNY
Sbjct: 2   EELSNKEASQLTALLKEMKEGLDTVTNKVQALTAKVKSNQLPTSDGISYLDAKYFLLLNY 61

Query: 122 CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKED 181
           CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKL +VS+V+KE+
Sbjct: 62  CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLAKVSIVSKEN 121

Query: 182 AFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRK 241
           AF+DEK+SATPQ  DDRLKYRPNPDMLVSK+EG AEDGDG+YRPP+FAPT+M+EDK SRK
Sbjct: 122 AFMDEKDSATPQDVDDRLKYRPNPDMLVSKTEGTAEDGDGMYRPPKFAPTSMEEDKKSRK 181

Query: 242 ERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRRE 301
           ERNS RKDL+TLR+ARQ+DYMRELMDDMAGKPEEIKES+GLENREVARYVA+++ERDRRE
Sbjct: 182 ERNSMRKDLQTLRQARQNDYMRELMDDMAGKPEEIKESVGLENREVARYVARLEERDRRE 241

Query: 302 EELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFILFSRMGG 361
           EELFTRAPLTKMEKKREKYLKKSRYG                               MGG
Sbjct: 242 EELFTRAPLTKMEKKREKYLKKSRYG-------------------------------MGG 301

Query: 362 VTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
           VTDSF++EVKS+PL  ADDEQPT FGS  G MRK+KKR
Sbjct: 302 VTDSFYEEVKSLPLEVADDEQPTDFGSGSGRMRKHKKR 308

BLAST of Cp4.1LG02g03930 vs. TrEMBL
Match: A0A061FF13_THECC (Sas10/Utp3/C1D family isoform 1 OS=Theobroma cacao GN=TCM_034534 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 4.8e-103
Identity = 209/348 (60.06%), Positives = 263/348 (75.57%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE NN +  +R  KEA+QL A+LKEMK GLD VT KV+ALTAKVK+N LPT+DGISYL+
Sbjct: 1   MEETNNTKGSERLKKEANQLAAVLKEMKAGLDVVTTKVRALTAKVKANNLPTADGISYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
           AK+LLLLNYC SLVYYLLRKAKG+SIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  AKHLLLLNYCQSLVYYLLRKAKGYSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RVS  A +    +EK S  PQ T+D L YRPNPDML+SK++ ++EDG G+Y+PP+FAP  
Sbjct: 121 RVSGSATQQELSNEK-SDEPQRTEDPLNYRPNPDMLISKTDMMSEDGAGVYKPPKFAPAA 180

Query: 233 MDED-KSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYV 292
           ++ED K SR+ERN+ R++ E LR+A QS Y+RE+MDD+ G+PEE++E IG E+RE++RY+
Sbjct: 181 VEEDHKMSREERNALRREKEALRKASQSAYIREMMDDLEGRPEEVREIIGTESRELSRYM 240

Query: 293 AKMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLT 352
           AKM+ R ++EEELFTRAP+ + +KK EK+LKKSR G                        
Sbjct: 241 AKMERRAQQEEELFTRAPVARKDKKIEKHLKKSRNG------------------------ 300

Query: 353 HFILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                  + G+TDSF+DE+K++PLG    +QPTSF +S GGM K KKR
Sbjct: 301 -------LLGLTDSFYDEIKTLPLGDVAGDQPTSFSNSNGGMWKLKKR 316

BLAST of Cp4.1LG02g03930 vs. TrEMBL
Match: W9RAA8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011567 PE=4 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 9.1e-102
Identity = 207/347 (59.65%), Positives = 259/347 (74.64%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE+NN   +DR  KEA QL +LLKEMKDGLDTV +KVQALTAKV++++ PT++G+SYL+
Sbjct: 1   MEENNNSN-NDRIEKEAPQLVSLLKEMKDGLDTVRSKVQALTAKVRADQFPTAEGMSYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
            K+LLLLNYC SLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  VKHLLLLNYCQSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RV+  A  +A   EKE    Q T+D LKYRPNP+MLVSK++   +  DG+YRPP+FAPT+
Sbjct: 121 RVTATATTNADASEKEPDASQKTEDLLKYRPNPNMLVSKTDETTK--DGVYRPPKFAPTS 180

Query: 233 MDEDKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVA 292
           M+EDK SR+ERN+ RK+   LR+ARQSDY+RELMDD+ G+PEE++E +G E+RE+ RY+A
Sbjct: 181 MEEDKISRQERNAMRKEKNALRQARQSDYVRELMDDVEGRPEEVREIVGTESRELTRYLA 240

Query: 293 KMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTH 352
           KM+ R +REEELF RAP TK EKK+EK+LKKSR G                         
Sbjct: 241 KMEARAKREEELFIRAPFTKEEKKKEKHLKKSRNG------------------------- 300

Query: 353 FILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                 + G+TDSF+DE+K++ + G +D Q   FG S  GM + +KR
Sbjct: 301 ------LLGLTDSFYDEIKTLAVEGENDVQMAGFGGSSSGMGRLEKR 313

BLAST of Cp4.1LG02g03930 vs. TrEMBL
Match: A0A0D2TPJ7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G228600 PE=4 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 3.8e-100
Identity = 203/348 (58.33%), Positives = 256/348 (73.56%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE +N   ++R  +E+ QL  +LKEMK GLD VT K+QALTAKVK+N  PT+DGISYL+
Sbjct: 1   MEEISNRNENERLRRESIQLATVLKEMKSGLDVVTTKIQALTAKVKANNFPTTDGISYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
           AK+LLLLNYC SLVYYLLRKAKG+SIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  AKHLLLLNYCQSLVYYLLRKAKGYSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RVS  A +      + S   + T D L YRPNPDML+SK++ I +DG G+YRPP+FAP  
Sbjct: 121 RVSGSATQQQGASNEASDGREKTQDPLTYRPNPDMLISKADMIPDDGTGVYRPPKFAPAV 180

Query: 233 MDED-KSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYV 292
           ++ED K SR+ERN+ R++ ETLR+A +S ++RE+MDD+ GKPEE++E IG E+RE+ RY+
Sbjct: 181 VEEDQKMSREERNALRREKETLRKASRSGFIREMMDDLEGKPEEVREIIGTESRELTRYM 240

Query: 293 AKMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLT 352
            KM+ R ++EEELFTRAP+TKM+KK EK+LKKSR G                        
Sbjct: 241 EKMERRAQQEEELFTRAPVTKMDKKIEKHLKKSRNG------------------------ 300

Query: 353 HFILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                  + G+TDSF+DE+K++PLGG  +EQPT FG+   GM K KKR
Sbjct: 301 -------LLGLTDSFYDEIKTLPLGGDTNEQPTVFGNGSSGMGKIKKR 317

BLAST of Cp4.1LG02g03930 vs. TrEMBL
Match: A0A0B0Q007_GOSAR (Neuroguidin OS=Gossypium arboreum GN=F383_02124 PE=4 SV=1)

HSP 1 Score: 372.9 bits (956), Expect = 5.0e-100
Identity = 204/348 (58.62%), Positives = 255/348 (73.28%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE +N   ++R  KE++QL  +LKEMK GLD VT KVQALTAKVK+N  PT+DGISYL+
Sbjct: 1   MEEISNRNENERLRKESNQLAMVLKEMKSGLDVVTTKVQALTAKVKANNFPTADGISYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
           AK+LLLLNYC SLVYYLLRKAKG+SIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  AKHLLLLNYCQSLVYYLLRKAKGYSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RVS  A +      K S   + T D L YRPNPDML+SK++ I +DG G+YRPP+FAP  
Sbjct: 121 RVSGSATQQQGASNKASDGHEKTQDPLTYRPNPDMLISKADMIPDDGTGVYRPPKFAPAV 180

Query: 233 MDED-KSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYV 292
           ++ED K SR+ERN+ R++ ETLR+A +S Y+RE+MDD+ GKPEE++E IG E+RE+ RY+
Sbjct: 181 VEEDQKMSREERNALRREKETLRKASRSGYIREMMDDLEGKPEEVREIIGTESRELTRYM 240

Query: 293 AKMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLT 352
            KM+ R ++EEELFTRAP+TK +KK EK+LKKSR G                        
Sbjct: 241 EKMERRAQQEEELFTRAPVTKKDKKIEKHLKKSRNG------------------------ 300

Query: 353 HFILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                  + G+TDSF+DE+K++P+GG  + QPT FG+   GM K KKR
Sbjct: 301 -------LLGLTDSFYDEIKTLPMGGDTNGQPTVFGNGSSGMGKIKKR 317

BLAST of Cp4.1LG02g03930 vs. TAIR10
Match: AT1G07840.2 (AT1G07840.2 Sas10/Utp3/C1D family)

HSP 1 Score: 268.1 bits (684), Expect = 8.8e-72
Identity = 142/258 (55.04%), Positives = 196/258 (75.97%), Query Frame = 1

Query: 67  KEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLV 126
           KEA QL ++L+EMK+ LD V +KV+ALTA VK+N  PT+ GISYL+AK+LLLL+YC  LV
Sbjct: 15  KEAPQLASVLREMKNVLDVVRSKVEALTALVKANSFPTAGGISYLEAKHLLLLSYCQDLV 74

Query: 127 YYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDE 186
           YY+LRKAKG SI+GHP+VRSLVEIR+FLEKIRPIDKKL+YQIQKLT       E A  + 
Sbjct: 75  YYILRKAKGLSIDGHPLVRSLVEIRMFLEKIRPIDKKLQYQIQKLTTAGGPVTELAHSEG 134

Query: 187 KESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRKERNSS 246
           K S   Q ++D   Y+P PD+L  K +   ++ DG+YRPP+FAP +M EDK+S++ER+++
Sbjct: 135 KGSCEAQKSEDLSNYKPKPDLLADKEDD--QEDDGVYRPPKFAPMSM-EDKTSKQERDAA 194

Query: 247 RKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRREEELFT 306
           RK+    R+A ++ YM++++DD+  +PEEI++  G+E+ E  R++A+ + + + EEELFT
Sbjct: 195 RKEKHFFRQATENTYMKDVLDDLEDRPEEIRDYYGVESNEQKRFMAQYERQQKAEEELFT 254

Query: 307 RAPLTKMEKKREKYLKKS 325
           RAP TK +KKREK LK S
Sbjct: 255 RAPRTKEDKKREKRLKSS 269

BLAST of Cp4.1LG02g03930 vs. TAIR10
Match: AT2G43650.1 (AT2G43650.1 Sas10/U3 ribonucleoprotein (Utp) family protein)

HSP 1 Score: 68.2 bits (165), Expect = 1.3e-11
Identity = 74/303 (24.42%), Positives = 128/303 (42.24%), Query Frame = 1

Query: 37  GDTESLSQVAYLRITAMEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAK 96
           GD ++  +     I ++ +   M   D     A ++  LL E+ D ++ + +K+  +  K
Sbjct: 207 GDKDTHVEEIKKDINSLSKEEQM---DVVYSSAPEIVGLLSELNDAVEELESKINPVMNK 266

Query: 97  VKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEK 156
           +K  E+  +    YL+ K LLLL YC S+ +YLL K++G  I  HPV+  LVEI+  L+K
Sbjct: 267 LKEGEISLNGLARYLEVKQLLLLTYCQSITFYLLLKSEGQPIRDHPVLARLVEIKSLLDK 326

Query: 157 IRPIDKKLEYQIQK-LTR------VSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLV 216
           I+ +D++L    ++ L R      V  V KED        +  + T D  K         
Sbjct: 327 IKELDEELPPGFEESLARSIANGAVQKVVKEDQLTSPVSDSVDRITQDTAK--------- 386

Query: 217 SKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRKERNSSRKDLETLRRARQSDYMRELMDDM 276
                               P  +D  +  +K++   RK    L    QS+ M +L   +
Sbjct: 387 --------------------PMKIDNAREEKKKKGEKRKHQNDLVDV-QSEEMLKLRAAL 446

Query: 277 AG-------------KPEEIKESIGLENREVARYVAKMDERDRREEELFTRAPLTKMEKK 320
            G             K ++ ++   L NR++  +   +D+ D     + T   LTK+   
Sbjct: 447 EGKLRTNGVLGSTVSKSDKAQKRQKLANRKLETFDDYVDDADNSTHNV-TADKLTKLVST 475

BLAST of Cp4.1LG02g03930 vs. NCBI nr
Match: gi|659132329|ref|XP_008466143.1| (PREDICTED: neuroguidin isoform X1 [Cucumis melo])

HSP 1 Score: 511.5 bits (1316), Expect = 1.3e-141
Identity = 273/338 (80.77%), Positives = 294/338 (86.98%), Query Frame = 1

Query: 62  DDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNY 121
           ++  NKEASQLTALLKEMK+GLDTVTNKVQALTAKVKSN+LPTSDGISYLDAKYLLLLNY
Sbjct: 2   EELRNKEASQLTALLKEMKEGLDTVTNKVQALTAKVKSNQLPTSDGISYLDAKYLLLLNY 61

Query: 122 CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKED 181
           CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT+VSVV+KE+
Sbjct: 62  CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTKVSVVSKEN 121

Query: 182 AFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRK 241
           AF+DEK+SATPQ  DDRLKYRPNPDMLVSK+EG AEDGDGIYRPP+FAPT+M+EDK SRK
Sbjct: 122 AFMDEKDSATPQDVDDRLKYRPNPDMLVSKTEGTAEDGDGIYRPPKFAPTSMEEDKKSRK 181

Query: 242 ERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRRE 301
           ERNS RKDL+TLR+ARQSDYMRELMDDMAGKPEEIKES GLENREVARYVA+M+ERDRRE
Sbjct: 182 ERNSMRKDLQTLRQARQSDYMRELMDDMAGKPEEIKESAGLENREVARYVARMEERDRRE 241

Query: 302 EELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFILFSRMGG 361
           EELFTRAPLTKMEKKREKYLKKSRYG                               MGG
Sbjct: 242 EELFTRAPLTKMEKKREKYLKKSRYG-------------------------------MGG 301

Query: 362 VTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
           VTDSF++EVKS+PL GADDEQPT FGS  G MRK+KKR
Sbjct: 302 VTDSFYEEVKSLPLEGADDEQPTDFGSGSGRMRKHKKR 308

BLAST of Cp4.1LG02g03930 vs. NCBI nr
Match: gi|449436946|ref|XP_004136253.1| (PREDICTED: neuroguidin [Cucumis sativus])

HSP 1 Score: 503.1 bits (1294), Expect = 4.6e-139
Identity = 266/338 (78.70%), Positives = 293/338 (86.69%), Query Frame = 1

Query: 62  DDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNY 121
           ++ +NKEASQLTALLKEMK+GLDTVTNKVQALTAKVKSN+LPTSDGISYLDAKY LLLNY
Sbjct: 2   EELSNKEASQLTALLKEMKEGLDTVTNKVQALTAKVKSNQLPTSDGISYLDAKYFLLLNY 61

Query: 122 CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKED 181
           CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKL +VS+V+KE+
Sbjct: 62  CSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLAKVSIVSKEN 121

Query: 182 AFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRK 241
           AF+DEK+SATPQ  DDRLKYRPNPDMLVSK+EG AEDGDG+YRPP+FAPT+M+EDK SRK
Sbjct: 122 AFMDEKDSATPQDVDDRLKYRPNPDMLVSKTEGTAEDGDGMYRPPKFAPTSMEEDKKSRK 181

Query: 242 ERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRRE 301
           ERNS RKDL+TLR+ARQ+DYMRELMDDMAGKPEEIKES+GLENREVARYVA+++ERDRRE
Sbjct: 182 ERNSMRKDLQTLRQARQNDYMRELMDDMAGKPEEIKESVGLENREVARYVARLEERDRRE 241

Query: 302 EELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFILFSRMGG 361
           EELFTRAPLTKMEKKREKYLKKSRYG                               MGG
Sbjct: 242 EELFTRAPLTKMEKKREKYLKKSRYG-------------------------------MGG 301

Query: 362 VTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
           VTDSF++EVKS+PL  ADDEQPT FGS  G MRK+KKR
Sbjct: 302 VTDSFYEEVKSLPLEVADDEQPTDFGSGSGRMRKHKKR 308

BLAST of Cp4.1LG02g03930 vs. NCBI nr
Match: gi|659132333|ref|XP_008466145.1| (PREDICTED: neuroguidin isoform X2 [Cucumis melo])

HSP 1 Score: 488.8 bits (1257), Expect = 9.0e-135
Identity = 260/321 (81.00%), Positives = 279/321 (86.92%), Query Frame = 1

Query: 79  MKDGLDTVTNKVQALTAKVKSNELPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAKGFSI 138
           MK+GLDTVTNKVQALTAKVKSN+LPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAKGFSI
Sbjct: 1   MKEGLDTVTNKVQALTAKVKSNQLPTSDGISYLDAKYLLLLNYCSSLVYYLLRKAKGFSI 60

Query: 139 EGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTRVSVVAKEDAFVDEKESATPQATDDR 198
           EGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT+VSVV+KE+AF+DEK+SATPQ  DDR
Sbjct: 61  EGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLTKVSVVSKENAFMDEKDSATPQDVDDR 120

Query: 199 LKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTTMDEDKSSRKERNSSRKDLETLRRARQ 258
           LKYRPNPDMLVSK+EG AEDGDGIYRPP+FAPT+M+EDK SRKERNS RKDL+TLR+ARQ
Sbjct: 121 LKYRPNPDMLVSKTEGTAEDGDGIYRPPKFAPTSMEEDKKSRKERNSMRKDLQTLRQARQ 180

Query: 259 SDYMRELMDDMAGKPEEIKESIGLENREVARYVAKMDERDRREEELFTRAPLTKMEKKRE 318
           SDYMRELMDDMAGKPEEIKES GLENREVARYVA+M+ERDRREEELFTRAPLTKMEKKRE
Sbjct: 181 SDYMRELMDDMAGKPEEIKESAGLENREVARYVARMEERDRREEELFTRAPLTKMEKKRE 240

Query: 319 KYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTHFILFSRMGGVTDSFFDEVKSMPLGGA 378
           KYLKKSRYG                               MGGVTDSF++EVKS+PL GA
Sbjct: 241 KYLKKSRYG-------------------------------MGGVTDSFYEEVKSLPLEGA 290

Query: 379 DDEQPTSFGSSRGGMRKYKKR 400
           DDEQPT FGS  G MRK+KKR
Sbjct: 301 DDEQPTDFGSGSGRMRKHKKR 290

BLAST of Cp4.1LG02g03930 vs. NCBI nr
Match: gi|590596206|ref|XP_007018273.1| (Sas10/Utp3/C1D family isoform 1 [Theobroma cacao])

HSP 1 Score: 382.9 bits (982), Expect = 7.0e-103
Identity = 209/348 (60.06%), Positives = 263/348 (75.57%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE NN +  +R  KEA+QL A+LKEMK GLD VT KV+ALTAKVK+N LPT+DGISYL+
Sbjct: 1   MEETNNTKGSERLKKEANQLAAVLKEMKAGLDVVTTKVRALTAKVKANNLPTADGISYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
           AK+LLLLNYC SLVYYLLRKAKG+SIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  AKHLLLLNYCQSLVYYLLRKAKGYSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RVS  A +    +EK S  PQ T+D L YRPNPDML+SK++ ++EDG G+Y+PP+FAP  
Sbjct: 121 RVSGSATQQELSNEK-SDEPQRTEDPLNYRPNPDMLISKTDMMSEDGAGVYKPPKFAPAA 180

Query: 233 MDED-KSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYV 292
           ++ED K SR+ERN+ R++ E LR+A QS Y+RE+MDD+ G+PEE++E IG E+RE++RY+
Sbjct: 181 VEEDHKMSREERNALRREKEALRKASQSAYIREMMDDLEGRPEEVREIIGTESRELSRYM 240

Query: 293 AKMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLT 352
           AKM+ R ++EEELFTRAP+ + +KK EK+LKKSR G                        
Sbjct: 241 AKMERRAQQEEELFTRAPVARKDKKIEKHLKKSRNG------------------------ 300

Query: 353 HFILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                  + G+TDSF+DE+K++PLG    +QPTSF +S GGM K KKR
Sbjct: 301 -------LLGLTDSFYDEIKTLPLGDVAGDQPTSFSNSNGGMWKLKKR 316

BLAST of Cp4.1LG02g03930 vs. NCBI nr
Match: gi|703110347|ref|XP_010099561.1| (hypothetical protein L484_011567 [Morus notabilis])

HSP 1 Score: 378.6 bits (971), Expect = 1.3e-101
Identity = 207/347 (59.65%), Positives = 259/347 (74.64%), Query Frame = 1

Query: 53  MEEHNNMRCDDRTNKEASQLTALLKEMKDGLDTVTNKVQALTAKVKSNELPTSDGISYLD 112
           MEE+NN   +DR  KEA QL +LLKEMKDGLDTV +KVQALTAKV++++ PT++G+SYL+
Sbjct: 1   MEENNNSN-NDRIEKEAPQLVSLLKEMKDGLDTVRSKVQALTAKVRADQFPTAEGMSYLE 60

Query: 113 AKYLLLLNYCSSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLEYQIQKLT 172
            K+LLLLNYC SLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKL+YQIQKLT
Sbjct: 61  VKHLLLLNYCQSLVYYLLRKAKGFSIEGHPVVRSLVEIRLFLEKIRPIDKKLQYQIQKLT 120

Query: 173 RVSVVAKEDAFVDEKESATPQATDDRLKYRPNPDMLVSKSEGIAEDGDGIYRPPRFAPTT 232
           RV+  A  +A   EKE    Q T+D LKYRPNP+MLVSK++   +  DG+YRPP+FAPT+
Sbjct: 121 RVTATATTNADASEKEPDASQKTEDLLKYRPNPNMLVSKTDETTK--DGVYRPPKFAPTS 180

Query: 233 MDEDKSSRKERNSSRKDLETLRRARQSDYMRELMDDMAGKPEEIKESIGLENREVARYVA 292
           M+EDK SR+ERN+ RK+   LR+ARQSDY+RELMDD+ G+PEE++E +G E+RE+ RY+A
Sbjct: 181 MEEDKISRQERNAMRKEKNALRQARQSDYVRELMDDVEGRPEEVREIVGTESRELTRYLA 240

Query: 293 KMDERDRREEELFTRAPLTKMEKKREKYLKKSRYGYENFDSITVLLKHLMIVSLSSVLTH 352
           KM+ R +REEELF RAP TK EKK+EK+LKKSR G                         
Sbjct: 241 KMEARAKREEELFIRAPFTKEEKKKEKHLKKSRNG------------------------- 300

Query: 353 FILFSRMGGVTDSFFDEVKSMPLGGADDEQPTSFGSSRGGMRKYKKR 400
                 + G+TDSF+DE+K++ + G +D Q   FG S  GM + +KR
Sbjct: 301 ------LLGLTDSFYDEIKTLAVEGENDVQMAGFGGSSSGMGRLEKR 313

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
NGDN_BOVIN        2.1e-27   33.22         Neuroguidin OS=Bos taurus GN=NGDN PE=2 SV=1
NGDN_MOUSE        3.6e-27   31.23         Neuroguidin OS=Mus musculus GN=Ngdn PE=1 SV=1
NGDN_HUMAN        6.9e-26   32.63         Neuroguidin OS=Homo sapiens GN=NGDN PE=1 SV=1
NGDN_XENTR        1.2e-25   31.69         Neuroguidin OS=Xenopus tropicalis GN=ngdn PE=2 SV=1
NGDNA_XENLA       1.2e-22   30.23         Neuroguidin-A OS=Xenopus laevis GN=ngdn-a PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LHN9_CUCSA  3.2e-139  78.70         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G892730 PE=4 SV=1
A0A061FF13_THECC  4.8e-103  60.06         Sas10/Utp3/C1D family isoform 1 OS=Theobroma cacao GN=TCM_034534 PE=4 SV=1
W9RAA8_9ROSA      9.1e-102  59.65         Uncharacterized protein OS=Morus notabilis GN=L484_011567 PE=4 SV=1
A0A0D2TPJ7_GOSRA  3.8e-100  58.33         Uncharacterized protein OS=Gossypium raimondii GN=B456_009G228600 PE=4 SV=1
A0A0B0Q007_GOSAR  5.0e-100  58.62         Neuroguidin OS=Gossypium arboreum GN=F383_02124 PE=4 SV=1

Match Name        E-value   Identity (%)  Description
AT1G07840.2       8.8e-72   55.04         Sas10/Utp3/C1D family
AT2G43650.1       1.3e-11   24.42         Sas10/U3 ribonucleoprotein (Utp) family protein

Match Name                        E-value   Identity (%)  Description
gi|659132329|ref|XP_008466143.1|  1.3e-141  80.77         PREDICTED: neuroguidin isoform X1 [Cucumis melo]
gi|449436946|ref|XP_004136253.1|  4.6e-139  78.70         PREDICTED: neuroguidin [Cucumis sativus]
gi|659132333|ref|XP_008466145.1|  9.0e-135  81.00         PREDICTED: neuroguidin isoform X2 [Cucumis melo]
gi|590596206|ref|XP_007018273.1|  7.0e-103  60.06         Sas10/Utp3/C1D family isoform 1 [Theobroma cacao]
gi|703110347|ref|XP_010099561.1|  1.3e-101  59.65         hypothetical protein L484_011567 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR007146  Sas10/Utp3/C1D
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0000462      maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA)
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005730      nucleolus
cellular_component  GO:0032040      small-subunit processome
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g03930.1  Cp4.1LG02g03930.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term  Source Description                            Alignment
IPR007146  Sas10/Utp3/C1D    PFAM     PF04000      Sas10_Utp3                                    coord: 76..156; score: 1.6
None       No IPR available  unknown  Coil         Coil                                          coord: 76..96; scor
None       No IPR available  PANTHER  PTHR13237    SOMETHING ABOUT SILENCING PROTEIN 10-RELATED  coord: 67..335; score: 2.6