Volume 9
Issue 4
Veterinary Medicine
JOURNAL OF
POLISH
AGRICULTURAL
UNIVERSITIES
Available Online: http://www.ejpau.media.pl/volume9/issue4/art-46.html
SEQUENCE OF ORANGUTAN (PONGO PYGMAEUS) FOXP2 GENE.
Monika Świszczorowska1, Arleta Lebioda1, Barbara Świątek2, Grażyna Dmochowska1, Barbara Kosowska3, Tadeusz Dobosz1
1 Molecular Techniques Laboratory,
Wrocław Medical University, Poland
2 Institute of Forensic Medicine,
Wrocław Medical University, Poland
3 Department of Genetics and Animal Breeding,
Wrocław University of Environmental and Life Sciences, Poland
Our article concerns on resequencing foxp2 gene, called “speech gene”. It has been already sequenced in human, 7 species of primates, several species of birds, mammals and reptiles. Mutations in human foxp2 locus causes changes in developing of brain parts responsible for speech process. While comparing sequencing results we found some divergences in the orangutans (Pongo pygmaeus) foxp2 sequences. Three research teams published different versions of foxp2 sequence. The effects of this article are results of resequencing foxp2 gene in orangutan and its declaration to GenBank with the access number: DQ789573.
Key words: orangutan, foxp2 gene.
INTRODUCTION
In the GenBank, at the address ncbi.nlm.nih.gov [5], are collected sequencing results of many different genes from many organisms. One of the hundreds of thousands sequencing results collected in this database concerns on the foxp2 gene. In 2001, Lai et al. [2] described mutations of that gene, being effective with changes of development some brain regions responsible for speech process. This discovery was proclaim as an exploration of “gene of speech” what was a large exaggeration. Nevertheless it caused a large interest of laryngologists from many research centers because it appeared as a hope to explain some inexplicable causes of muteness and as a potential possibility to diagnose and predict this disorder. During analysis of the foxp2 mutations in children with delayed speech development [3] we also checked the status of these analysis in other organisms. On the beginning of April 2006 foxp2 gene was already sequence in 7 species of birds, alligator, danio fish, pygmy hippopotamus, bottlenose dolphin, mouse, and 7 primate species (Macaca mulata, Pongo pygmaeus, Gorilla gorilla, Hylobates lar, Pan paniscus, Pan troglotydes and Homo sapiens) [5]. While comparing these sequences we found some divergences among the sequencing results of foxp2 gene in orangutan (Pongo pygmaeus). Three research groups published three differing in details versions of the orangutan foxp2 sequence. It induced us to resequence orangutan foxp2 gene ourselves to explain those divergences.
MATERIAL AND METHODS
Orangutan’s DNA was bought in ECACC company (catalogue No. 019A). Primers for all 16 exons were designed by ourselves and their sequences are as following:
E1F 5’TAA TGG AAG TGT GAA GGT GTA ACG 3’
E1R 5’GCA CTG AGG AAA ACA AAA CTT ACC 3’
E2F 5’TGG GAG TTC TTG TAC ATT GAA GCC 3’
E2R 5’AAG GAC AGG AGT GCT TTA CTA ACC 3’
E3F 5’AAC AGA TAT TTG GTT ATG ACC ACG 3’
E3R 5’GCT AAT AGG TTG TCC TTT CAA ATG 3’
E4F 5’TAT GCC AGT CTA GAA GAG TTT AGG 3’
E4R 5’TGT TAT GTA TCA ATG AGA TAA CCG 3’
E5F 5’TTC TCT GCT GTT TAC TGG TTT GGG 3’
E5R 5’TAA GGC AGA AAG GCC ATG AAA TGG 3’
E6F 5’GTG AAG CTT TCT TTC ATT TTA TAG 3’
E6R 5’TTC TTT TTC GTT GTC TCA ATT GTG 3’
E7F 5’TGA CGT CGT GTT CTT TTG CTA CAG 3’
E7R 5’GCC TAA AAT GCC CAT ATA ATC CAG 3’
E8F 5’GTT TTG TGT CTT CTG TTT GTT TAG 3’
E8R 5’AAG AGT CCA TTA AGT GAG CAT TTA 3’
E9F 5’TCT GAT CTC ACT CTT TCT TAA CAG 3’
E9R 5’CTT TAT AAA GCA ATA TGC ACT TAC 3’
E10F 5’TCC CCT TCT CTT GCG CCT TTG CAG 3’
E10R 5’CTC TAC TGA GCA TTC ATA TGC TAC 3’
E11+12F 5’AAC ACC TAG CTT TTA TTT TTA TAG 3’
E11+12R 5’TAA AAC TGC TGG AAA AGT AGT TAC 3’
E13F 5’TTT CTT CTC TTC TGT CTG CTT TAG 3’
E13R 5’ATT ACA AAA CTA TCA CAA ACA TAC 3’
E14F 5’AGT GTG AGA CAA GCC AGA ACA TAC 3’
E14R 5’GTG TCC TTC TTG CTT CTT CAC ATC 3’
E15F 5’TTT ACA CAA TCT TCA TTT CAC TAG 3’
E15R 5’AGA GAG TCT GTT AAC CAC ACT TAC 3’
E16F 5’TTG GTG TAT CTA CAT GTT TTT CAG 3’
E16R 5’TCC ATG CTT AAT AAA TAG TCA CAG 3’
PCR reactions were performed with Qiagen Multiplex PCR Kit, strict due to the rules of manual. Sequencing PCR reactions were performed with F primers for each exon and for second thread of DNA with R primers to control. Sequencing reactions were performed with genetic analyzer ABI Prism 310 with Big DyeTerminator v 1.1 Cycle Sequencing Kit. Results analysis was executed with computer software ABI Prism Sequencing Analysis v. 3.7.
RESULTS AND DISCUSSION
We compared results of our foxp2 gene sequencing in orangutan to other author results. The differences between these results are shown in tables I to III. Exon 1 sequence is: atg1 atg2 cag3 gaa4 tct5 gtg6 aca7 gag8 aca9 ata10 agc11 aac12 agt13 tca14 atg15 aat16 caa17 aat18 gga19 atg20 agc21 act22 cta23 agc24 agc25 caa26 tta27 gat28 gct29 ggc30 agc31 aga32 gat33 gga34 aga35 tca36 agt37 ggt38 gac39 acc40 agc41 tct42 gaa43 gta44 agc45 aca46 gta47 gaa48 ctg49 ctg50 cat51 ctg52 caa53 caa54 cag55 cag56. The codon number was marked over line. Within the whole exon 1 we did not found alleles.
Exons from 2 to 5 are exactly the same both in our and previously published analysis (AF512949, AY143181). Sequences of those exons are as following:
exon 2 – gct57 ctc58 cag59 gca60 gca61 aga62 caa63 ctt64 ctt65 tta66 cag67 cag68 caa69 aca70 agt71 gga72 ttg73 aaa74 tct75 cct76 aag77 agc78 agt79 gat80 aaa81 cag82 aga83 cca84 ctg85 cag86,
exon 3 – gtg87 cct88 gtg89 tca90 gtg91 gcc92 atg93 atg94 act95 ccc96 cag97 gtg98 atc99 acc100 cct101 cag102 caa103 atg104 cag105 cag106 atc107 ctt108 cag109 caa110 caa111 gtc112 ctg113 tct114 cct115 cag116 cag117 ctc118 caa119 gcc120 ctt121 ctc122 caa123 caa124 cag125 cag126 gcc127 gtc128 atg129 ctg130 cag131 cag132,
exon 4 – caa133 caa134 cta135 caa136 gag137 ttt138 tac139 aag140 aaa141 cag142 caa143 gag144 cag145 tta146 cat147 ctt148 cag149 ctt150 ttg151 cag152 cag153 cag154 caa155 cag156 cag157 cag158 cag159 caa160 caa161 cag162 cag163 caa164 caa165 cag166 cag167 cag168 cag169 caa170 caa171 caa172 caa173 cag174 cag175 cag176 caa177 cag178 cag179 cag180 cag181 cag182 caa183 cag184 cag185 cag186 cag187 cag188 caa189 cag190 cat191 cct192 gga193 aag194 caa195 gcg196 aaa197 gag198.
During analysis of the exon 5 we explained divergences between sequences AF512949 and AY143181. Beginning of the exon 5 is similar in both: cag199 cag200 cag201 cag202 cag203 cag204. Further we have found difference showed in table no. 1.
Table 1. Differences between orangutan’s foxp2 sequences [1, 6] within exon 5 after codon 204 |
Source |
Codon number and nucleotide sequence |
NCBI AY143181 NCBI AF512949 and our results |
cag (codon addition between cag204 and caa205) --- (lack of codon) |
Due to this result correct sequence was registered by Enard et al. [1] (NCBI access number AF512949). As show in table 1, Hang et al. [6] (NCBI access number AY143181) add to sequence additional codon CAG, between codons 204 and 205. Further part of all exon 5 sequences is the same in both published sequences (AF512949, AY143181) and our results. That part of exon 5 is as following: caa205 cag206 cag207 ttg208 gca209 gcc210 cag211 cag212 ctt213 gtc214 ttc215 cag216 cag217 cag218 ctt219 ctc220 cag221 atg222 caa223 caa224 ctc225 cag226 cag227 cag228 cag229 cat230 ctg231 ctc232 agc233 ctt234 cag235 cgt236 cag237 gga238 ctc239 atc240 tcc241 att242 cca243 cct244 ggc245 cag246 gca247 gcc248 ctt249 cct250 gtc251 caa252 tcg253 ctg254 cct255 caa256.
When we were analyzing exon 6, a new (third) published foxp2 sequence has appeared in GenBank – AY064615 by Walther et al. [4]. Unfortunately it did not contain all exon 6 sequence, but only part from codon 277 till the end. Nevertheless it was included to exon 6 and further exons analysis. After resequencing exon 6 we did not found any divergences between our and published data from codon 257 to 304 inclusive. Sequence of this fragment is as following: gct257 ggc258 tta259 agt260 cct261 gct262 gag263 att264 cag265 cag266 tta267 tgg268 aaa269 gaa270 gtg271 act272 gga273 gtt274 cac275 agt276 atg277 gaa278 gac279 aat280 ggc281 att282 aaa283 cat284 gga285 ggg286 cta287 gac288 ctc289 act290 act291 aac292 aat293 tcc294 tcc295 tcg296 act297 acc298 tcc299 tcc300 acc301 act302 tcc303 aaa304.
In codon 305 we found one nucleotide polymorphism a/g. It is shown in table 2.
Table 2. SNP a/g type polymorphism in codon 306 of exon 6 resulting from comparison our results with published data [1,4,6] |
Source |
Codon number and nucleotide sequence |
AY064615, AF512949 and our results |
gca305 |
Sequence from 306 to 313 codon is equal in all analyzed data (published and our results) and is as following: tca306 cca307 cca308 ata309 act310 cat311 cat312 tcc313. In position 314 codon 6 we found SNP a/c polymorphism, which is shown in table 3.
Table 3. SNP a/c type polymorphism in codon 314 of exon 6 resulting from comparison our results with published data [1,4,6] |
Source |
Codon number and nucleotide sequence |
AY064615, AF512949 and our results |
ata314 |
From codon 315 till the end of this exon nucleotide sequence is similar in all analyzed data and is as following: gtg315 aat316 gga317 cag318 tct319 tca320 gtt321 cta322 aat323 gca324 aga325 cga326 gac327 agc328. Codon 328 was teared by intron and its last nucleotide “C” in reality begins exon 7.
Exon 7 also did not indicate any diversity between our results and those from GenBank. Sequence of exon 7 is as following: tcg329 tca330 cat331 gag332 gag333 act334 ggg335 gcc336 tct337 cac338 act339 ctc340 tat341 ggc342 cat343 gga344 gtt345 tgc346 aaa347 tgg348 cca349 ggc350 tgt351 gaa352 agc353 att354 tgt355 gaa356 gat357 ttt358 gga359 cag360 ttt361 tta362 aag363. Last nucleotide of tore codon 363 – “g”, came across in the beginning of exon 8.
Exons from 8 to 19 were exactly the same in all published results and our research, and its sequence was as following:
exon 8 – cac364 ctt365 aac366 aat367 gaa368 cac369 gca370 ttg371 gat372 gac373 cga374 agc375 act376 gct377 cag378 tgt379 cga380 gtg381 caa382 atg383 cag384 gtg385 gtg386 caa387 cag388 tta389 gaa390 ata391 cag392,
exon 9 – ctt393 tct394 aaa395 gaa396 cgc397 gaa398 cgt399 ctt400 caa401 gca402 atg403 atg404 acc405 cac406 ttg407 cac408 atg409 cga410 ccc411 tca412 gag413 ccc414 aaa415 cca416 tct417 ccc418 aaa419 cct420,
exon 10 – cta421 aat422 ctg423 gtg424 tct425 agt426 gtc427 acc428 atg429 tcg430 aag431 aat432 atg433 ttg434 gag435 aca436 tcc437 cca438 cag439 agc440 tta441 cct442 caa443 acc444 cct445 acc446 aca447 cca448 acg449 gcc450 cca451 gtc452 acc453 ccg454 att455 acc456 cag457 gga458 ccc459 tca460 gta461 atc462 acc463 cca464 gcc465 agt466 gtg467 ccc468 aat469 gtg470 gga471 gcc472 ata473 cga474 agg475 cga476 cat477 tca478 gac479 aaa480 tac481 aac482 att483 ccc484 atg485 tca486 tca487.
Similarly to exons 6 and 7, first codon of exon 11 (codon nr. 488) was amused and its first nucleotide “g” was added on the begging of exon 10. Sequences of exon 11 did not differ between all analyzed data and were as following: gaa488 att489 gcc490 cca491 aac492 tac493 gaa494 ttt495 tat496 aaa497 aat498 gca499 gat500 gtc501 aga502 cct503 cca504 ttt505 act506 tat507 gca508 act509 ctc510 ata511 agg512 cag513.
Exons 12, 13 and 14 had exactly the same sequence both in published data and our research. Sequence of these exons was as following:
exon 12 – gct514 atc515 atg516 gag517 tca518 tct519 gac520 agg521 cag522 tta523 aca524 ctt525 aat526 gaa527 att528 tac529 agc530 tgg531 ttt532 aca533 cgg534 acg535 ttt536 gct537 tac538 ttc539 agg540 cgt541 aat542 gca543 gca544 act545 tgg546 aag547,
exon 13 – aat548 gca549 gta550 cgt551 cat552 aat553 ctt554 agc555 ctg556 cac557 aag558 tgt559 ttt560 gtt561 cga562 gta563 gaa564 aat565 gtt566 aaa567 gga568 gca569 gta570 tgg571 act572 gtg573 gat574 gaa575 gta576 gaa577 tac578 cag579 aag580 cga581 agg582 tca583 caa584 aag585 ata586 aca587 gga588,
exon 14 – agt589 cca590 acc591 tta592 gta593 aaa594 aat595 ata596 cct597 acc598 agt599 tta600 ggc601 tat602 gga603 gca604 gct605 ctt606 aat607 gcc608 agt609 ttg610 cag61.
Last codon (588) of exon 13 was amused and its first nucleotide was attached to the beginning of exon 14.
Exon 15 almost among whole sequence (when we skip last codon) was similar when we compared data from published and our sequence. Sequence of this codon is as following: gct612 gcc613 ttg614 gca615 gag616 agc617 agt618 tta619 cct620 ttg621 cta622 agt623 aat624 mct625 gga626 ctg627 atc628 aat629 aat630 gca631 tcc632 agt633 ggc634 cta635 ctg636 cag637 gcc638 gtc639 cac640 gaa641 gac642 ctc643 aat644 ggt645 tct646 ctg647 gat648 cac649 att650 gac651 agc652 aat653 gga654 aac655 agt656 agt657 ccg658 ggc659 tgc660 tcg661 cct662 cag663 ccg664 cac665 ata666. Last codon (666) of exon 15 is tore and “a” nucleotide was attached to exon 16. Form of writing codon 625 was taken from AY064615, and was written as mct (m means a or c, but both in our results and sequence AF512949 instead of m is c).
The last exon 16 did not have any differences between our results and published data [1, 4, 6], and its structure is as following: cat667 tca668 atc669 cat670 gtc671 aag672 gaa673 gag674 cca675 gtg676 att677 gca678 gag679 gat680 gaa681 gac682 tgc683 cca684 atg685 tcc686 tta687 gtg688 aca689 aca690 gct691 aat692 cac693 agt694 cca695 gaa696 tta697 gaa698 gac699 gac700 aga701 gag702 att703 gaa704 gaa705 gag706 cct707 tta708 tct709 gaa710 gat711 ctg712 gaa713.
After exon 16 we found non coding part of DNA. This part of DNA did not belong to mRNA sequence – CDS, and as promoteur and introns should not be published. However cited authors published this part [1, 4 and 6] and we decided to do the same. In AY064615, AF512949, AY143181 and also our resuts it is TGA fragment. Moreover in AY064615 it is further fragment GAACTGACTTGTGAAACCTCAGCGTGAAGGGACATATCACTGACCTTCATAACCACTCCACAACAATGAATATTTGACAAATTTTTACTGTGACTATTTATT, which is similar to our results, and the last part which we did not sequence: AAGCATGGAT.
To sum up appropriate CDS sequence of foxp2 gene in Pongo pygmaeus, taking into account all already published and discovered by us alleles, written in fasta format is as following:
atgatgcaggaatctgtgacagagacaataagcaacagttcaatgaatcaaaatggaatgagcactctaagcagccaattagatgctggcagcagagatggaagatcaagtggtgacaccagctctgaagtaagcacagtagaactgctgcatctgcaacaacagcaggctctccaggcagcaag
acaacttcttttacagcagcaaacaagtggattgaaatctcctaagagcagtgataaacagagaccactgcaggtgcctgtgtcagtggccatgatgactccccaggtgatcacccctcagcaaatgcagcagatccttcagcaacaagtcctgtctcctcagcagctccaagcccttctccaacaacag
caggccgtcatgctgcagcagcaacaactacaagagttttacaagaaacagcaagagcagttacatcttcagcttttgcagcagcagcaacagcagcagcagcaacaacagcagcaacaacagcagcagcagcaacaacaacaacagcagcagcaacagcagcagcagcagcaacagcag
cagcagcagcaacagcatcctggaaagcaagcgaaagagcagcagcagcagcagcagcaacagcagttggcagcccagcagcttgtcttccagcagcagcttctccagatgcaacaactccagcagcagcagcatctgctcagccttcagcgtcagggactcatctccattccacctggccaggcag
cccttcctgtccaatcgctgcctcaagctggcttaagtcctgctgagattcagcagttatggaaagaagtgactggagttcacagtatggaagacaatggcattaaacatggagggctagacctcactactaacaattcctcctcgactacctcctccaccacttccaaagcatcaccaccaataactcatcat
tccatagtgaatggacagtcttcagttctaaatgcaagacgagacagctcgtcacatgaggagactggggcctctcacactctctatggccatggagtttgcaaatggccaggctgtgaaagcatttgtgaagattttggacagtttttaaagcaccttaacaatgaacacgcattggatgaccgaagcactgctcag
tgtcgagtgcaaatgcaggtggtgcaacagttagaaatacagctttctaaagaacgcgaacgtcttcaagcaatgatgacccacttgcacatgcgaccctcagagcccaaaccatctcccaaacctctaaatctggtgtctagtgtcaccatgtcgaagaatatgttggagacatccccacagagct
tacctcaaacccctaccacaccaacggccccagtcaccccgattacccagggaccctcagtaatcaccccagccagtgtgcccaatgtgggagccatacgaaggcgacattcagacaaatacaacattcccatgtcatcagaaattgccccaaactacgaattttataaaaatgcagatgtcagacct
ccatttacttatgcaactctcataaggcaggctatcatggagtcatctgacaggcagttaacacttaatgaaatttacagctggtttacacggacgtttgcttacttcaggcgtaatgcagcaacttggaagaatgcagtacgtcataatcttagcctgcacaagtgttttgttcgagtagaaaatgttaaaggagcagtat
ggactgtggatgaagtagaataccagaagcgaaggtcacaaaagataacaggaagtccaaccttagtaaaaaatatacctaccagtttaggctatggagcagctcttaatgccagtttgcaggctgccttggcagagagcagtttacctttgctaagtaatmctggactgatcaataatgcatccagtg
gcctactgcaggccgtccacgaagacctcaatggttctctggatcacattgacagcaatggaaacagtagtccgggctgctcgcctcagccgcacatacattcaatccatgtcaaggaagagccagtgattgcagaggatgaagactgcccaatgtccttagtgacaacagctaatcacagtccagaattag
aagacgacagagagattgaagaagagcctttatctgaagatctggaa
– and in his form was submitted to GenBank NCBI [5], where it received reference number: DQ789573.
REFERENCES
Enard W., Przeworski M., Fisher S.E., Lai C.S.L., Wiebe V., Kitano T., Monaco A.P., Pääbo S., 2002. Molecular evolution of FOXP2, a gene involved in speech and language. Nature 418: 869-872 and direct submission to NCBI (AF512949). Lai C.S.L., Fisher S.E., Hurst J.A., Varga-Khadem F., Monaco A.P., 2001. A fokhead-domain gene in mutated ina Severe speech and language disorder. Nature 413:519-523. Mielnik J., 2006. Foxp2 u dzieci z Prostym Opóznionym Rozwojem mowy. Praca doktorska, Akademia Medyczna we Wrocławiu. Walter N.A.R., Thompson J., McGoldrick D., Messier W., 2001. Direct submission to NCBI (AY064615). www.ncbi.nlm.nih.hov Zhang J., Webb D.M., Podlaha O., 2002. Direct submission to NCBI (AY143181).
Accepted for print: 7.12.2006
Monika Świszczorowska
Molecular Techniques Laboratory,
Wrocław Medical University, Poland
M.Curie -Skłodowskiej 52, Wrocław, Poland
Arleta Lebioda
Molecular Techniques Laboratory,
Wrocław Medical University, Poland
M.Curie -Skłodowskiej 52, Wrocław, Poland
Barbara Świątek
Institute of Forensic Medicine,
Wrocław Medical University, Poland
Mikulicza-Radeckiego 4, 50-368 Wrocław, Poland
Grażyna Dmochowska
Molecular Techniques Laboratory,
Wrocław Medical University, Poland
M.Curie -Skłodowskiej 52, Wrocław, Poland
Barbara Kosowska
Department of Genetics and Animal Breeding,
Wrocław University of Environmental and Life Sciences, Poland
Kożuchowska 7, 51-631 Wrocław, Poland
email: basia@gen.ar.wroc.pl
Tadeusz Dobosz
Molecular Techniques Laboratory,
Wrocław Medical University, Poland
M.Curie -Skłodowskiej 52, Wrocław, Poland
Responses to this article, comments are invited and should be submitted within three months of the publication of the article. If accepted for publication, they will be published in the chapter headed 'Discussions' and hyperlinked to the article.