Humans and Apes not closely related after all?

Job 33:6 · Sep 19, 2017

I wanted to get a bit of perspective, preferably from any biologists in the house, on the below article.

Does anyone understand what is being described? And what it means? Or does anyone have any opinions or insight they could offer about the article? Or perhaps any critique of it that could be offered?

Just curious of peoples thoughts or ideas. Preferably from a technical perspective.

Thanks,

Analysis of 101 Chimpanzee Trace Read Data Sets

Abstract
The current chimpanzee genome assembly has problems that reduce its veracity as an authentic representation. First, it has been assembled using the human genome as a reference scaffold and does not stand on its own merits. Second, given the fact that significant levels of human DNA exist in non-primate databases due to laboratory and worker contamination, the potential for human DNA in the pre-assembled chimpanzee sequencing reads is highly probable. Therefore, 101 Sanger-style publically available trace read data sets were downloaded, end-trimmed for low quality bases, and purged of vector sequence. Then, 25,000 sequences were selected at random from each of the 101 data sets and queried against the human genome using BLASTN v2.2.31 with gap extension. Results from the BLASTN analysis indicated that two different groups of chimpanzee DNA sequences could be found. Those that were completed early in the chimpanzee genome project that contributed to the initial 5-fold draft genome, were considerably more similar to human than those that were produced later in the project by a difference of about 7% overall data set identity and produced 6% less hits onto the human genome. Sequences (both alignable and non-alignable) from the seemingly less contaminated data sets indicate that the chimpanzee genome is approximately 85% identical overall to human. Extensive poor alignment of chimpanzee DNA sequences that did not have hits on the human genome that were blasted on the chimpanzee genome revealed regions of miss-assembly for the chimpanzee genome.

Keywords: comparative genomics, human-chimp DNA similarity, human genome, chimpanzee genome, primate evolution
Introduction
One of the problems with the current status of the chimpanzee genome is that it has not been constructed on its own merits through the use of an accurate integrated physical-genetic map (Tomkins 2011). Instead, all of the short DNA sequences produced by the DNA sequencing machinery (known as trace reads) have been assembled onto the human genome—using it as a framework scaffold or reference sequence (Mikkelsen et al. 2005; Prado-Martinez et al. 2013; Tomkins 2011). This was done out of budget constraints, convenience, and a healthy dose of evolutionary presupposition that humans evolved from apes.

Another serious potential problem with the chimpanzee genome is the issue of human DNA contamination that would also result in the production of a more human-like chimpanzee genome. In 2011, a very interesting study was published in which the researchers screened 2749 non-primate public DNA databases and found 492 to be contaminated with human sequence at levels of up to 10% (Longo, O’Neill, and O’Neill 2011). The contaminated DNA databases represented species ranging from bacteria to plants to fish. Ape and monkey databases were not tested, leaving the question open as to how much human DNA contamination may be present in them. The sequencing of archaic human DNA such as Neandertal has also been plagued with the problem of modern human DNA contamination—leading to the recent development of strict laboratory precautions (Skoglund et al. 2014; Thomas and Tomkins 2014). Nevertheless, modern human DNA contamination is a standard problem in earlier published ancient DNA studies (Noonan 2010; Skoglund et al. 2014; Thomas and Tomkins 2014). In light of results from these studies combined with the fact that the DNA sequencing that led to the 2005 rough draft of the chimpanzee genome was produced during an era in which the problem of human DNA contamination was not yet adequately realized or appreciated, the potential for human DNA contamination in the chimpanzee genome is a valid possibility.

Given that both a biased sequence assembly using the human genome as a framework combined with the distinct possibility of human DNA contamination may very well have led to the development of a chimpanzee genome that is more human-like than it should be, research was initiated to assess characteristics of chimpanzee Sanger-style trace read DNA sequences produced between the years 2000 and 2011. One future goal of this effort would be to identify DNA sequence datasets having indicators of reduced levels of human DNA contamination for a reassembly of the chimpanzee genome without the use of the human genome as a reference. This is called a de novo assembly, meaning that no reference genome is used (Bradnam et al. 2013; Narzisi and Mishra 2011). The end result will be a genome that is more accurate, but considerably less contiguous than one assembled using a reference sequence (Henson, Tischler, and Ning 2012).

Materials and Methods
Chimpanzee Sanger-style trace read files, their corresponding quality files, and xml files containing sequencing run information were downloaded from the NCBI trace read archive (ftp://ftp.ncbi.nlm.nih. gov/pub/TraceDB/pan_troglodytes). There were a total of 101 Sanger-style trace read file sets available. Low quality bases were end-trimmed using a Phred value of 20, vector sequence was trimmed using the comprehensive NCBI ‘univ_vec_11-4-2014.fa’ file, and empty sequences and those less than 100 bases were discarded using the Lucy2 software package (Chou and Holmes 2001; Li and Chou 2004). The xml sequence information files were queried for run date information with minimum (beginning) and maximum (ending) run dates for experiments being parsed into an SQL table using a Python script written by this author. Only 84 of the 101 xml files contained information for run date.

After processing the trace reads, the average mean number of sequences per data set was 438,213 with a range of 46,251 to 496,267 sequences, and a median value of 470,609 sequences. Because each trace read file was on average extremely large, 25,000 test sequences were extracted at random from each data set and parsed into new FASTA format files using a Python script written by this author. The sequences were then queried against the hg19 version of the human genome using BLASTN v2.2.31 with the following parameters: evalue 0.1, word_size 11, outfmt 10, qseqid, qstart, qend, mismatch, gapopen, pident, nident, length, qlen, max_target_seqs 1, max_ hsps 1, dust no, soft_masking false, perc_identity 50, gapopen 3, gapextend 3, num_threads 10. Given the fact that previous versions of the BLASTN algorithm have had problems omitting query sequences, all data sets had the non-hitting sequences reblasted onto the human genome with the same parameters. In all cases, no further hits were obtained indicating that the BLASTN algorithm was not omitting query sequences as reported in previous versions (Tomkins 2015).

Resulting BLASTN output CSV format files were analyzed for a variety of basic statistical parameters and data visualization using a Python script written by this author. The CSV and FASTA files were also concatenated and imported into SQL tables for more detailed joins, views, and queries along with the run date information mentioned above. All Python parsing and analysis scripts, SQL table/ database generation Python scripts and SQL queries created by this author have been placed at github (GitHub - jt-icr/chimp_trace_25k: Scripts from the BLASTN analysis (against human genome) of 25,000 random chimp trace reads per data set taken from 101 Sanger-style trace read data sets.). Student T-tests for two-tailed, two-sample, unequal variance comparisons of datasets were done in Excel using the T.TEST function.

Results
Overall statistics and trends
At present, there are 101 DNA sequence datasets available to the public that were produced using Sanger style sequencing technology that yielded much longer read lengths than current next generation technologies which produce a greater amount of total bulk sequence of much shorter lengths (Henson, Tischler, and Ning 2012; Mardis 2008). The longer the read, the easier it is to computationally assemble into contiguous genomic regions called sequencing contigs. Therefore, all 101 of these datasets were downloaded and the sequences end-trimmed for poor quality bases and cloning vector contamination.

After sequence trace read processing to remove low quality bases, short reads (less than 100 bases), and vector contamination, the 101 multi-fasta files were analyzed for basic statistics. The minimum file size contained 46,251 sequences while the maximum was 496,267 sequences. Sequence length varied between 100 and 2012 bases with an average (mean) of 704 bases. Given that a total of 44,259,587 sequences were available from all 101 data sets, this represents a genome coverage of about 10.4 fold (after read processing) assuming a chimpanzee genome size of about 3 billion base pairs. The chimpanzee genome publication from 2005 utilized about half this amount of coverage as they reported an initial draft genome of about 5-fold coverage (Mikkelsen et al. 2005).

To ascertain the quality of each chimpanzee end-trimmed dataset, 25,000 DNA sequences were selected from each FASTA file at random and queried against the human genome (version hg19) using the most recent version of the BLASTN algorithm (version 2.2.31+). Liberal gap extension parameters were employed to allow for the longest possible alignments. Total number of sequences examined in this study was over 2.5 million.

Because previous versions of the BLASTN algorithm are proven to omit query sequences when blasting using large query datasets (Tomkins 2015), non-hitting sequences were re-blasted to verify that the algorithm was working correctly. In all cases, none of the reblasted sequences provided hits as was the case in previous releases of the algorithm that exhibited the bug. Clearly, the bug has been fixed, perhaps due in part to complaints to the developer team at NCBI by this author.

Overall, the basic statistics for the 101 data sets as a whole were as follows: The average alignment identify was 96.3% with an average length of 677 bases and 27 bases on average not aligning. When the non-aligning bases in each read are included, the average identity for the reads that hit on human was 92.6%. These results conflict with the initially reported alignable identity of 98.5% given in the 2005 chimpanzee genome publication.

Interestingly, data analyzed as a whole taken from this study do tend to more closely agree with leading primate evolutionist Todd Preuss who stated, “It is now clear that the genetic differences between humans and chimpanzees are far more extensive than previously thought; their genomes are not 98% or 99% identical” and “One consequence of the numerous duplications, insertions, and deletions, is that the total DNA sequence similarity between humans and chimpanzees is not 98% to 99%, but instead closer to 95% to 96%” (Preuss 2012). Preuss then cites three publications supporting this claim (Britten 2002; Varka and Nelson 2007; Wetterbom et al. 2006)—a list that does not include the 2005 chimpanzee genome paper.

It is noteworthy that the alignable DNA similarity of about 96% omits sequences that are too dissimilar to align onto human and thus inflates the actual overall genome similarity between chimpanzees and human. When including all non-alignable sequence, overall chimpanzee DNA sequence identity is only 90.8% for all 101 data sets sampled. Interestingly, this estimate of about 90% overall genome similarity is similar to a previous study by this author using the chimpanzee genome assembly as a substrate (Tomkins 2015). However, upon further analysis of the data in the current study, even this estimate is shown to be suspicious and likely inflated due to fundamental problems in the assembly of the chimpanzee genome as described below.

When the 101 data sets were plotted, it was clear that a major difference existed between them for overall DNA similarity—a trend which generally corresponded with the progression of the data sets by file name (fig. 1). It was also apparent that many data sets had overall DNA identities below 90%. Therefore, the data were divided into two different bins corresponding to below 90% overall identity or above 90% overall identity. Fifty-seven of the data sets had overall identities above 90% and 44 were below 90%. The basic statistics for each are shown in Table 1. When the two data sets were compared using overall identity as the test variable in a two-sample T-test, they were significantly different from each other (P < 0.0000001).

Fig. 1. Overall data set percent identity using BLASTN for the 101 chimpanzee trace read data sets compared to the human genome. Data sets are labeled 001 to 101.

Table 1. Basic statistics for BLASTN results of 25,000 sequences from each of 101 chimpanzee trace read data sets queried against the human genome (hg19). All values are listed in percent. Total number of data sets = 101, number of high-identity data sets = 57, and number of low-identity data sets = 44.
All Data Sets High-Identity Sets Low-Identity Sets
Ave. Alignment identity 96.3 96.7 95.8
Ave. Query Seq. identity 92.2 94.3 89.5
Ave. Hit Frequency 98.1 99.7 96.1
Overall data set identity 90.6 94.1 85.6
Human-Chimp DNA Similarity and Run Date
To determine if the apparent trend in reduced sequence similarity was associated with year of sequencing, run date information was extracted from each of the corresponding XML sequence information files. However, run date information was only recorded in 86 of the 101 XML files (85%). Furthermore, the run date data in each of the XML files in which it was recorded contained a range of years. Therefore, the SQL data tables extracted from the XML files were set up to include both the beginning and ending run dates. When queries were run to return information based on the ending year of sequencing, there were eight years involved between 2000 and 2011 (table 2).

Table 2. Basic BLASTN statistics (against human) for the 86 chimpanzee trace read datasets that contained run date information in their XML sequence information files. NULL = no sequencing date information given for dataset.
Sequencing End Year No. Data Sets Ave Alignment Identity (Percent) Ave Query Seq Identity (Percent) Overall Data Set Identity (Percent) Percent Hits
NULL 15 97.0 96.0 95.3 99.0
2002 10 97.1 93.8 93.7 99.1
2003 33 96.3 93.8 93.2 98.6
2004 21 95.6 89.8 89.1 98.5
2005 2 95.0 84.1 74.4 88.0
2006 13 96.2 92.1 86.7 93.6
2007 5 96.1 90.9 85.5 93.8
2008 1 96.0 92.2 86.2 93.0
2011 1 96.3 92.0 85.4 92.0
2002–2004 64 96.2 92.5 91.9 98.7
2005–2011 22 96.1 91.1 85.2 93.1
An approximately 5-fold genome coverage would have been obtained through the range of data sets completed through 2004, which would largely correspond with the data represented in the 2005 chimpanzee genome paper and the first initial draft of the chimpanzee genome. Therefore, these data were compared with those completed after 2004. The first set of sequences contained an overall DNA sequence identity (including non-hitting sequences) of 91.9% compared to 85.2% for the sequences corresponding to those completed after the first rough draft of the chimpanzee genome. When the two data sets were compared in a two-sample T-test, they were found to be significantly different from each other (P < 0.0000001).

Interestingly, the data sets completed through 2004 contained an average of 98.7% of the sequences providing hits onto the human genome. However, the data sets with completion dates of 2005 or later, only had a hit rate of 93.1%. A difference of 5.6% which was also significantly different in a two-sample T-Test (P < 0.0000001).

For the data sets with run date information, the progression of numbers in file names generally correspond well with the completion date of the data sets. It is clear from these analyses that the initial data sets used for the chimpanzee genome initial rough draft have significantly higher levels of DNA similarity than those produced later in the project (table 2). These early data would not only inflate the level of DNA similarity for chimpanzee compared to human as initially reported in 2005, but also bias the assembly and make it more human-like than it should be—compounding the problems caused by using the human genome as an assembly scaffold.

Another interesting aspect of this study was for the data sets that lacked information for run date—an oddity and indicator of sloppiness in the process of Sanger Style DNA sequencing (table 2). The recording of run date information in an accompanying xml sequence info file is a key factor in the trouble shooting of past sequencing runs. The data sets that lacked run date information were generally more similar to human than all the others in regard to alignment characteristics and had overall hit levels onto human of 99%. The corresponding file names (containing numbers in the range of 12 to 48) suggested that these data sets contributed to the initial 5-fold chimpanzee genome assembly. Of all the data sets evaluated, these had the highest levels of indicators for human DNA contamination.

Evaluating Chimpanzee Sequences Not Hitting Onto Human
Clearly, the data sets produced later in the chimpanzee genome project in the post-2005 genome paper timeframe, had much lower levels of hit percentages onto human. One question that arises in light of these results is whether these non-hitting sequences are of chimpanzee origin—a question that is difficult to answer given the questionable nature of the chimpanzee genome as an accurate substrate onto which they could be tested. Nevertheless, the chimpanzee sequences that had no hits on the human genome were blasted against the chimpanzee genome. The results were surprising and suggestive of miss-assembly in the chimpanzee genome due to a human framework bias by which it was constructed combined with the distinct possibility of assembly integration of human DNA contamination.

A total of 47,803 DNA sequences from the 2.5 million sequences sampled from the 101 data sets tested did not hit on the human genome. I refer to these as non-hitters. Using the same liberal BLASTN extension parameters as were done with human, 29,880 of the non-hitters (62.5%) provided hits onto the most recent version of the chimpanzee genome assembly at the time of this study (PanTro4), albeit at highly reduced identities with shorter alignments— compared to chimpanzee sequences that aligned to human.

When blasting chimpanzee trace reads onto an allegedly accurate representation of the chimpanzee genome, one would expect alignment identities of 99.9 to 100%. However, the average alignment identity (excluding all non-hitting sequence), was only 85.2%. These results strongly suggest that the chimpanzee genome is miss-assembled and more human-like than it should be.

Summary
In regard to data sets that included run date information, two different sets of chimpanzee DNA sequences related to the Sanger-style data sets used to construct the chimpanzee genome exist. The sequences that were produced early on in the chimpanzee genome project that contributed to the initial five-fold coverage of the chimpanzee draft genome (Mikkelsen et al. 2005), are significantly more similar to human than those that were produced later in the project by a difference of about 5% overall data set sequence identity. Contributing to this difference is the additional fact that a 5.6% difference in the amount of sequences that hit onto the human genome also exist.

When not considering run date, but instead including all sequences, two bins of data were constructed: data sets with overall identities below 90% and those above 90%. In doing this, the difference in sequence identity between the two data sets widened to 7%. This is largely due to the fact that the sequences lacking run date information were the most highly similar to human out of all the data sets. Because these data sets all contained filename numbers between 13 and 48, it is safe to assume that they contributed to the initial rough draft of the chimpanzee genome in 2005, inflating its human-like characteristics accordingly.

It may be that greater precautions towards human DNA contamination were taken later in the project producing less contamination. If the data from these seemingly less contaminated sets are considered, the chimpanzee genome is no more than about 85% similar to human. If all the data sets taken together are considered, despite the apparent human DNA contamination, then the chimpanzee genome is no more than about 90% similar to human.

It is very probable that the current chimpanzee genome assembly suffers from two major problems that make it more human-like that it should be. First, chimpanzee DNA sequences from both Sanger-style sequencing and next generation sequencing technologies, have been assembled using the human genome as a reference framework (Mikkelsen et al. 2005; Prado-Martinez et al. 2013). In other words, the chimpanzee genome does not stand on its own merits using its own framework-based genomic resources (e.g. an accurate integrated physical-genetic map for chimpanzee) as I described in an earlier publication (Tomkins 2011). Second, given the fact that significant levels of human DNA exist in non-primate databases due to laboratory and worker contamination (Longo et al. 2011), the potential for human DNA in the pre-assembled chimpanzee sequencing reads is highly probable and could be tested for by simply comparing the chimpanzee-human BLASTN analyses of the different data sets one to another. The main questions would be, are there significant differences between data sets, and are there any obvious patterns for these differences? The answer to both questions is a resounding yes.

In determining this, 101 Sanger-style publically available trace read data sets were downloaded, providing the longest possible trace read data source, were end-trimmed for low quality bases, and purged of contaminating plasmid cloning vector sequence. Then, 25,000 sequences were selected at random from each data set and queried against the human genome using BLASTN v2.2.31 with liberal gap extension. Results from the BLASTN analysis indicated that two different groups of chimpanzee DNA sequences existed. Those that were produced early in the chimpanzee genome project that contributed to the initial chimpanzee genome publication were considerably more similar to human than those that were produced later in the project by a difference of about 5%. It may be that greater measures towards alleviating human DNA contamination were performed as the project progressed. Data from the seemingly less contaminated sets indicate that the chimpanzee genome is no more than about 85% identical to human.

Furthermore, when chimpanzee sequences that did not hit onto the human genome were blasted against the chimpanzee assembly, the average alignment identity was only 85% when 99.9 to 100% identity should have been the result if the chimpanzee genome was accurately assembled.

Devin P · Sep 19, 2017

I didn't read any of that, because, science. But, apparently (after listening to a video) I heard that they had to ignore like 40 billion pieces of genetics, and with what was left they get the 98% figure. I'm not sure if this is what your post pointed out, but I have a serious dislike for science lately. So, winner in my book regardless.

Have you looked into evolution? Apparently, they tried saying they found an entirely new evolutionary species of humans by finding ONE tooth. An entire person, out of a tooth... They taught it in schools, and then were found out that it wasn't even a human tooth, but a pig's... and then, on top of that, they KEPT teaching it in schools AFTER they knew they had been disproven. So many more examples of this are involved in evolution.

Also, look at stars. Videos of them. People who have nice cameras and such, can look up at them... they look nothing like what scientists have led us to believe they do. Idk what to make of it, but it's interesting. They look... as if they're being seen through water? Idk how to describe it, but that works pretty well.

FrumiousBandersnatch · Sep 19, 2017

Devin P said:
Also, look at stars. Videos of them. People who have nice cameras and such, can look up at them... they look nothing like what scientists have led us to believe they do. Idk what to make of it, but it's interesting. They look... as if they're being seen through water? Idk how to describe it, but that works pretty well.

Not sure if this is serious, but stars twinkle because their light is refracted by turbulence in the atmosphere. The big optical telescopes shine lasers into the sky to detect the distortions and then use this information to compensate the image.

Presbyterian Continuist · Sep 19, 2017

KomatiiteBIF said:
I wanted to get a bit of perspective, preferably from any biologists in the house, on the below article.

Does anyone understand what is being described? And what it means? Or does anyone have any opinions or insight they could offer about the article? Or perhaps any critique of it that could be offered?

Just curious of peoples thoughts or ideas. Preferably from a technical perspective.

Thanks,

Analysis of 101 Chimpanzee Trace Read Data Sets

Abstract
The current chimpanzee genome assembly has problems that reduce its veracity as an authentic representation. First, it has been assembled using the human genome as a reference scaffold and does not stand on its own merits. Second, given the fact that significant levels of human DNA exist in non-primate databases due to laboratory and worker contamination, the potential for human DNA in the pre-assembled chimpanzee sequencing reads is highly probable. Therefore, 101 Sanger-style publically available trace read data sets were downloaded, end-trimmed for low quality bases, and purged of vector sequence. Then, 25,000 sequences were selected at random from each of the 101 data sets and queried against the human genome using BLASTN v2.2.31 with gap extension. Results from the BLASTN analysis indicated that two different groups of chimpanzee DNA sequences could be found. Those that were completed early in the chimpanzee genome project that contributed to the initial 5-fold draft genome, were considerably more similar to human than those that were produced later in the project by a difference of about 7% overall data set identity and produced 6% less hits onto the human genome. Sequences (both alignable and non-alignable) from the seemingly less contaminated data sets indicate that the chimpanzee genome is approximately 85% identical overall to human. Extensive poor alignment of chimpanzee DNA sequences that did not have hits on the human genome that were blasted on the chimpanzee genome revealed regions of miss-assembly for the chimpanzee genome.

Keywords: comparative genomics, human-chimp DNA similarity, human genome, chimpanzee genome, primate evolution
Introduction
One of the problems with the current status of the chimpanzee genome is that it has not been constructed on its own merits through the use of an accurate integrated physical-genetic map (Tomkins 2011). Instead, all of the short DNA sequences produced by the DNA sequencing machinery (known as trace reads) have been assembled onto the human genome—using it as a framework scaffold or reference sequence (Mikkelsen et al. 2005; Prado-Martinez et al. 2013; Tomkins 2011). This was done out of budget constraints, convenience, and a healthy dose of evolutionary presupposition that humans evolved from apes.

Another serious potential problem with the chimpanzee genome is the issue of human DNA contamination that would also result in the production of a more human-like chimpanzee genome. In 2011, a very interesting study was published in which the researchers screened 2749 non-primate public DNA databases and found 492 to be contaminated with human sequence at levels of up to 10% (Longo, O’Neill, and O’Neill 2011). The contaminated DNA databases represented species ranging from bacteria to plants to fish. Ape and monkey databases were not tested, leaving the question open as to how much human DNA contamination may be present in them. The sequencing of archaic human DNA such as Neandertal has also been plagued with the problem of modern human DNA contamination—leading to the recent development of strict laboratory precautions (Skoglund et al. 2014; Thomas and Tomkins 2014). Nevertheless, modern human DNA contamination is a standard problem in earlier published ancient DNA studies (Noonan 2010; Skoglund et al. 2014; Thomas and Tomkins 2014). In light of results from these studies combined with the fact that the DNA sequencing that led to the 2005 rough draft of the chimpanzee genome was produced during an era in which the problem of human DNA contamination was not yet adequately realized or appreciated, the potential for human DNA contamination in the chimpanzee genome is a valid possibility.

Given that both a biased sequence assembly using the human genome as a framework combined with the distinct possibility of human DNA contamination may very well have led to the development of a chimpanzee genome that is more human-like than it should be, research was initiated to assess characteristics of chimpanzee Sanger-style trace read DNA sequences produced between the years 2000 and 2011. One future goal of this effort would be to identify DNA sequence datasets having indicators of reduced levels of human DNA contamination for a reassembly of the chimpanzee genome without the use of the human genome as a reference. This is called a de novo assembly, meaning that no reference genome is used (Bradnam et al. 2013; Narzisi and Mishra 2011). The end result will be a genome that is more accurate, but considerably less contiguous than one assembled using a reference sequence (Henson, Tischler, and Ning 2012).

Materials and Methods
Chimpanzee Sanger-style trace read files, their corresponding quality files, and xml files containing sequencing run information were downloaded from the NCBI trace read archive (ftp://ftp.ncbi.nlm.nih. gov/pub/TraceDB/pan_troglodytes). There were a total of 101 Sanger-style trace read file sets available. Low quality bases were end-trimmed using a Phred value of 20, vector sequence was trimmed using the comprehensive NCBI ‘univ_vec_11-4-2014.fa’ file, and empty sequences and those less than 100 bases were discarded using the Lucy2 software package (Chou and Holmes 2001; Li and Chou 2004). The xml sequence information files were queried for run date information with minimum (beginning) and maximum (ending) run dates for experiments being parsed into an SQL table using a Python script written by this author. Only 84 of the 101 xml files contained information for run date.

After processing the trace reads, the average mean number of sequences per data set was 438,213 with a range of 46,251 to 496,267 sequences, and a median value of 470,609 sequences. Because each trace read file was on average extremely large, 25,000 test sequences were extracted at random from each data set and parsed into new FASTA format files using a Python script written by this author. The sequences were then queried against the hg19 version of the human genome using BLASTN v2.2.31 with the following parameters: evalue 0.1, word_size 11, outfmt 10, qseqid, qstart, qend, mismatch, gapopen, pident, nident, length, qlen, max_target_seqs 1, max_ hsps 1, dust no, soft_masking false, perc_identity 50, gapopen 3, gapextend 3, num_threads 10. Given the fact that previous versions of the BLASTN algorithm have had problems omitting query sequences, all data sets had the non-hitting sequences reblasted onto the human genome with the same parameters. In all cases, no further hits were obtained indicating that the BLASTN algorithm was not omitting query sequences as reported in previous versions (Tomkins 2015).

Resulting BLASTN output CSV format files were analyzed for a variety of basic statistical parameters and data visualization using a Python script written by this author. The CSV and FASTA files were also concatenated and imported into SQL tables for more detailed joins, views, and queries along with the run date information mentioned above. All Python parsing and analysis scripts, SQL table/ database generation Python scripts and SQL queries created by this author have been placed at github (GitHub - jt-icr/chimp_trace_25k: Scripts from the BLASTN analysis (against human genome) of 25,000 random chimp trace reads per data set taken from 101 Sanger-style trace read data sets.). Student T-tests for two-tailed, two-sample, unequal variance comparisons of datasets were done in Excel using the T.TEST function.

Results
Overall statistics and trends
At present, there are 101 DNA sequence datasets available to the public that were produced using Sanger style sequencing technology that yielded much longer read lengths than current next generation technologies which produce a greater amount of total bulk sequence of much shorter lengths (Henson, Tischler, and Ning 2012; Mardis 2008). The longer the read, the easier it is to computationally assemble into contiguous genomic regions called sequencing contigs. Therefore, all 101 of these datasets were downloaded and the sequences end-trimmed for poor quality bases and cloning vector contamination.

After sequence trace read processing to remove low quality bases, short reads (less than 100 bases), and vector contamination, the 101 multi-fasta files were analyzed for basic statistics. The minimum file size contained 46,251 sequences while the maximum was 496,267 sequences. Sequence length varied between 100 and 2012 bases with an average (mean) of 704 bases. Given that a total of 44,259,587 sequences were available from all 101 data sets, this represents a genome coverage of about 10.4 fold (after read processing) assuming a chimpanzee genome size of about 3 billion base pairs. The chimpanzee genome publication from 2005 utilized about half this amount of coverage as they reported an initial draft genome of about 5-fold coverage (Mikkelsen et al. 2005).

To ascertain the quality of each chimpanzee end-trimmed dataset, 25,000 DNA sequences were selected from each FASTA file at random and queried against the human genome (version hg19) using the most recent version of the BLASTN algorithm (version 2.2.31+). Liberal gap extension parameters were employed to allow for the longest possible alignments. Total number of sequences examined in this study was over 2.5 million.

Because previous versions of the BLASTN algorithm are proven to omit query sequences when blasting using large query datasets (Tomkins 2015), non-hitting sequences were re-blasted to verify that the algorithm was working correctly. In all cases, none of the reblasted sequences provided hits as was the case in previous releases of the algorithm that exhibited the bug. Clearly, the bug has been fixed, perhaps due in part to complaints to the developer team at NCBI by this author.

Overall, the basic statistics for the 101 data sets as a whole were as follows: The average alignment identify was 96.3% with an average length of 677 bases and 27 bases on average not aligning. When the non-aligning bases in each read are included, the average identity for the reads that hit on human was 92.6%. These results conflict with the initially reported alignable identity of 98.5% given in the 2005 chimpanzee genome publication.

Interestingly, data analyzed as a whole taken from this study do tend to more closely agree with leading primate evolutionist Todd Preuss who stated, “It is now clear that the genetic differences between humans and chimpanzees are far more extensive than previously thought; their genomes are not 98% or 99% identical” and “One consequence of the numerous duplications, insertions, and deletions, is that the total DNA sequence similarity between humans and chimpanzees is not 98% to 99%, but instead closer to 95% to 96%” (Preuss 2012). Preuss then cites three publications supporting this claim (Britten 2002; Varka and Nelson 2007; Wetterbom et al. 2006)—a list that does not include the 2005 chimpanzee genome paper.

It is noteworthy that the alignable DNA similarity of about 96% omits sequences that are too dissimilar to align onto human and thus inflates the actual overall genome similarity between chimpanzees and human. When including all non-alignable sequence, overall chimpanzee DNA sequence identity is only 90.8% for all 101 data sets sampled. Interestingly, this estimate of about 90% overall genome similarity is similar to a previous study by this author using the chimpanzee genome assembly as a substrate (Tomkins 2015). However, upon further analysis of the data in the current study, even this estimate is shown to be suspicious and likely inflated due to fundamental problems in the assembly of the chimpanzee genome as described below.

When the 101 data sets were plotted, it was clear that a major difference existed between them for overall DNA similarity—a trend which generally corresponded with the progression of the data sets by file name (fig. 1). It was also apparent that many data sets had overall DNA identities below 90%. Therefore, the data were divided into two different bins corresponding to below 90% overall identity or above 90% overall identity. Fifty-seven of the data sets had overall identities above 90% and 44 were below 90%. The basic statistics for each are shown in Table 1. When the two data sets were compared using overall identity as the test variable in a two-sample T-test, they were significantly different from each other (P < 0.0000001).

Fig. 1. Overall data set percent identity using BLASTN for the 101 chimpanzee trace read data sets compared to the human genome. Data sets are labeled 001 to 101.

Table 1. Basic statistics for BLASTN results of 25,000 sequences from each of 101 chimpanzee trace read data sets queried against the human genome (hg19). All values are listed in percent. Total number of data sets = 101, number of high-identity data sets = 57, and number of low-identity data sets = 44.
All Data Sets High-Identity Sets Low-Identity Sets
Ave. Alignment identity 96.3 96.7 95.8
Ave. Query Seq. identity 92.2 94.3 89.5
Ave. Hit Frequency 98.1 99.7 96.1
Overall data set identity 90.6 94.1 85.6
Human-Chimp DNA Similarity and Run Date
To determine if the apparent trend in reduced sequence similarity was associated with year of sequencing, run date information was extracted from each of the corresponding XML sequence information files. However, run date information was only recorded in 86 of the 101 XML files (85%). Furthermore, the run date data in each of the XML files in which it was recorded contained a range of years. Therefore, the SQL data tables extracted from the XML files were set up to include both the beginning and ending run dates. When queries were run to return information based on the ending year of sequencing, there were eight years involved between 2000 and 2011 (table 2).

Table 2. Basic BLASTN statistics (against human) for the 86 chimpanzee trace read datasets that contained run date information in their XML sequence information files. NULL = no sequencing date information given for dataset.
Sequencing End Year No. Data Sets Ave Alignment Identity (Percent) Ave Query Seq Identity (Percent) Overall Data Set Identity (Percent) Percent Hits
NULL 15 97.0 96.0 95.3 99.0
2002 10 97.1 93.8 93.7 99.1
2003 33 96.3 93.8 93.2 98.6
2004 21 95.6 89.8 89.1 98.5
2005 2 95.0 84.1 74.4 88.0
2006 13 96.2 92.1 86.7 93.6
2007 5 96.1 90.9 85.5 93.8
2008 1 96.0 92.2 86.2 93.0
2011 1 96.3 92.0 85.4 92.0
2002–2004 64 96.2 92.5 91.9 98.7
2005–2011 22 96.1 91.1 85.2 93.1
An approximately 5-fold genome coverage would have been obtained through the range of data sets completed through 2004, which would largely correspond with the data represented in the 2005 chimpanzee genome paper and the first initial draft of the chimpanzee genome. Therefore, these data were compared with those completed after 2004. The first set of sequences contained an overall DNA sequence identity (including non-hitting sequences) of 91.9% compared to 85.2% for the sequences corresponding to those completed after the first rough draft of the chimpanzee genome. When the two data sets were compared in a two-sample T-test, they were found to be significantly different from each other (P < 0.0000001).

Interestingly, the data sets completed through 2004 contained an average of 98.7% of the sequences providing hits onto the human genome. However, the data sets with completion dates of 2005 or later, only had a hit rate of 93.1%. A difference of 5.6% which was also significantly different in a two-sample T-Test (P < 0.0000001).

For the data sets with run date information, the progression of numbers in file names generally correspond well with the completion date of the data sets. It is clear from these analyses that the initial data sets used for the chimpanzee genome initial rough draft have significantly higher levels of DNA similarity than those produced later in the project (table 2). These early data would not only inflate the level of DNA similarity for chimpanzee compared to human as initially reported in 2005, but also bias the assembly and make it more human-like than it should be—compounding the problems caused by using the human genome as an assembly scaffold.

Another interesting aspect of this study was for the data sets that lacked information for run date—an oddity and indicator of sloppiness in the process of Sanger Style DNA sequencing (table 2). The recording of run date information in an accompanying xml sequence info file is a key factor in the trouble shooting of past sequencing runs. The data sets that lacked run date information were generally more similar to human than all the others in regard to alignment characteristics and had overall hit levels onto human of 99%. The corresponding file names (containing numbers in the range of 12 to 48) suggested that these data sets contributed to the initial 5-fold chimpanzee genome assembly. Of all the data sets evaluated, these had the highest levels of indicators for human DNA contamination.

Evaluating Chimpanzee Sequences Not Hitting Onto Human
Clearly, the data sets produced later in the chimpanzee genome project in the post-2005 genome paper timeframe, had much lower levels of hit percentages onto human. One question that arises in light of these results is whether these non-hitting sequences are of chimpanzee origin—a question that is difficult to answer given the questionable nature of the chimpanzee genome as an accurate substrate onto which they could be tested. Nevertheless, the chimpanzee sequences that had no hits on the human genome were blasted against the chimpanzee genome. The results were surprising and suggestive of miss-assembly in the chimpanzee genome due to a human framework bias by which it was constructed combined with the distinct possibility of assembly integration of human DNA contamination.

A total of 47,803 DNA sequences from the 2.5 million sequences sampled from the 101 data sets tested did not hit on the human genome. I refer to these as non-hitters. Using the same liberal BLASTN extension parameters as were done with human, 29,880 of the non-hitters (62.5%) provided hits onto the most recent version of the chimpanzee genome assembly at the time of this study (PanTro4), albeit at highly reduced identities with shorter alignments— compared to chimpanzee sequences that aligned to human.

When blasting chimpanzee trace reads onto an allegedly accurate representation of the chimpanzee genome, one would expect alignment identities of 99.9 to 100%. However, the average alignment identity (excluding all non-hitting sequence), was only 85.2%. These results strongly suggest that the chimpanzee genome is miss-assembled and more human-like than it should be.

Summary
In regard to data sets that included run date information, two different sets of chimpanzee DNA sequences related to the Sanger-style data sets used to construct the chimpanzee genome exist. The sequences that were produced early on in the chimpanzee genome project that contributed to the initial five-fold coverage of the chimpanzee draft genome (Mikkelsen et al. 2005), are significantly more similar to human than those that were produced later in the project by a difference of about 5% overall data set sequence identity. Contributing to this difference is the additional fact that a 5.6% difference in the amount of sequences that hit onto the human genome also exist.

When not considering run date, but instead including all sequences, two bins of data were constructed: data sets with overall identities below 90% and those above 90%. In doing this, the difference in sequence identity between the two data sets widened to 7%. This is largely due to the fact that the sequences lacking run date information were the most highly similar to human out of all the data sets. Because these data sets all contained filename numbers between 13 and 48, it is safe to assume that they contributed to the initial rough draft of the chimpanzee genome in 2005, inflating its human-like characteristics accordingly.

It may be that greater precautions towards human DNA contamination were taken later in the project producing less contamination. If the data from these seemingly less contaminated sets are considered, the chimpanzee genome is no more than about 85% similar to human. If all the data sets taken together are considered, despite the apparent human DNA contamination, then the chimpanzee genome is no more than about 90% similar to human.

It is very probable that the current chimpanzee genome assembly suffers from two major problems that make it more human-like that it should be. First, chimpanzee DNA sequences from both Sanger-style sequencing and next generation sequencing technologies, have been assembled using the human genome as a reference framework (Mikkelsen et al. 2005; Prado-Martinez et al. 2013). In other words, the chimpanzee genome does not stand on its own merits using its own framework-based genomic resources (e.g. an accurate integrated physical-genetic map for chimpanzee) as I described in an earlier publication (Tomkins 2011). Second, given the fact that significant levels of human DNA exist in non-primate databases due to laboratory and worker contamination (Longo et al. 2011), the potential for human DNA in the pre-assembled chimpanzee sequencing reads is highly probable and could be tested for by simply comparing the chimpanzee-human BLASTN analyses of the different data sets one to another. The main questions would be, are there significant differences between data sets, and are there any obvious patterns for these differences? The answer to both questions is a resounding yes.

In determining this, 101 Sanger-style publically available trace read data sets were downloaded, providing the longest possible trace read data source, were end-trimmed for low quality bases, and purged of contaminating plasmid cloning vector sequence. Then, 25,000 sequences were selected at random from each data set and queried against the human genome using BLASTN v2.2.31 with liberal gap extension. Results from the BLASTN analysis indicated that two different groups of chimpanzee DNA sequences existed. Those that were produced early in the chimpanzee genome project that contributed to the initial chimpanzee genome publication were considerably more similar to human than those that were produced later in the project by a difference of about 5%. It may be that greater measures towards alleviating human DNA contamination were performed as the project progressed. Data from the seemingly less contaminated sets indicate that the chimpanzee genome is no more than about 85% identical to human.

Furthermore, when chimpanzee sequences that did not hit onto the human genome were blasted against the chimpanzee assembly, the average alignment identity was only 85% when 99.9 to 100% identity should have been the result if the chimpanzee genome was accurately assembled.

I have friends who, when I see them eating a banana, tempt me to believe that evolution is true. :sorry:

LittleLambofJesus · Sep 19, 2017

Oscarr said:
I have friends who, when I see them eating a banana, tempt me to believe that evolution is true.

I get that feeling when I watch "Planet of the Apes"

..........

........

animals-monkey-ape-chimp-giraffe-elephant-shl090430_low.jpg

Occams Barber · Sep 19, 2017

Oscarr said:
I have friends who, when I see them eating a banana, tempt me to believe that evolution is true.

Wait a second...... has a thing about bananas and evolution and comes from New Zealand. Hmmm.....

Oscarr, are you Ray Comfort's dad?
OB

Papias · Sep 19, 2017

Being that this "paper" has never been peer reviewed, as far as I can tell it's word salad arranged in a way to make it look professional for the purpose of fooling people with no biology background. I can't even see if he has any biology degrees. We'll have to see what actual biologists say about it. @sfs ?

sfs · Sep 19, 2017

KomatiiteBIF said:
Does anyone understand what is being described? And what it means? Or does anyone have any opinions or insight they could offer about the article? Or perhaps any critique of it that could be offered?

Sure. It's Jeff Tomkins, creationist, finding yet another way to do an indefensibly bad job of comparing the human and chimpanzee genomes. He's done repeated comparisons, he always finds that humans and chimpanzees aren't actually genetically similar, and he's wrong every time. The interesting thing is that he comes up with a different way of being wrong each time.

In this case, he's taken raw sequencing reads from the chimpanzee genome project and compared them to the human genome. He finds that the reads became less similar at some point during the project. From this he concludes that the chimpanzee genome data actually contains large amounts of contaminating human DNA, and that the project improved its procedures after a while, so that the later reads reflect the true genetic similarity. Thus the researchers were mistaken to conclude that the two genomes were very similar. (There's some other stuff there too, but that's the main thrust.)

This is all complete hogwash. Tomkins is using raw reads, which contain both good sequencing results and crappy sequencing results. He did trim the ends off the reads, where the sequencing quality is usually poor, but he did nothing to remove the crappy sequence from the middle. The thing is, we know that some of the results are wrong, and we have quality scores to help filter it out. There are also other, more sophisticated techniques (mostly the Neighborhood Quality Score) to help identify base calls that are likely to be wrong. These techniques were used by the people who sequenced the chimpanzee genome, but are ignored here. That guarantees meaningless results. This is Sequencing 101: there is just no excuse for not following basic quality control steps. (Someone has gone ahead and shown that the results are wrong, by the way, but I forget where I saw that described. It's really not worth the time to hunt it down.)

Another researcher has made a more detailed (and different) critique of the paper in post 150 in this thread.

sfs · Sep 19, 2017

Devin P said:
I didn't read any of that, because, science. But, apparently (after listening to a video) I heard that they had to ignore like 40 billion pieces of genetics, and with what was left they get the 98% figure.

Yeah, that didn't happen.

Devin P said:
Have you looked into evolution? Apparently, they tried saying they found an entirely new evolutionary species of humans by finding ONE tooth. An entire person, out of a tooth... They taught it in schools, and then were found out that it wasn't even a human tooth, but a pig's... and then, on top of that, they KEPT teaching it in schools AFTER they knew they had been disproven.

Yeah, that didn't happen either.

pat34lee · Sep 19, 2017

"Genome-wide, only 70% of the chimpanzee DNA was similar to human
under the most optimal sequence-slice conditions."
Human and Chimp DNA Only 70% Similar, At Least According to This Study – Proslogion

The only way to get a higher similarity is cherry picking only some of the
DNA and throwing out the rest.

pat34lee · Sep 19, 2017

sfs said:
Yeah, that didn't happen.

Yeah, that didn't happen either.

Sure it did. Gee, its nice just asserting without having to show
any thought, much less proof. Then again, I don't do that.
Nebraska Man Hoax

sfs · Sep 19, 2017

pat34lee said:
"Genome-wide, only 70% of the chimpanzee DNA was similar to human
under the most optimal sequence-slice conditions."
Human and Chimp DNA Only 70% Similar, At Least According to This Study – Proslogion

The only way to get a higher similarity is cherry picking only some of the
DNA and throwing out the rest.

That was another of Tomkins' efforts, or rather, two of them. Would you like to know why they were wrong?

sfs · Sep 19, 2017

pat34lee said:
Sure it did. Gee, its nice just asserting without having to show
any thought, much less proof.

Well, you could ask. And it's not like the original claimant provided either thought or proof.

As for the first claim, the actual procedures used to compare the human and chimpanzee genomes are laid out in the paper describing the study (that's this one). If anyone has specific questions about the approach used or arguments against it, by all means make them. It's a little hard to rebut as vague a claim as was made here. In the absence of specifics, here's a rundown of the human-chimp stats that I wrote a while ago:

Only 2.7 Gb [billions of base-pairs] of chimpanzee genome was sequenced successfully, so the maximum that could have been aligned would have been those 2.7 Gb (out of ~3 Gb, even if the genomes were identical. The first supplementary note from the paper describes what happened to the remaining ~0.3 Gb that didn't align. 0.24 Gb could be aligned, but the alignments were to many places in the human genome, so the sequence was discarded. This is perfectly sensible, since the bulk of this is certainly bad assembly in the chimpanzee genome, and there's no point in making a comparison using bad pieces of assembly. Some of this poorly aligning sequence might represent multiple rearrangements of one or both genomes, but even if it does, there is no reason to think that the pieces of sequence would themselves be any more diverged than the rest of the genome. Finally, there was 0.09 Gb of sequence that didn't align at all. Again, a small part of this might be real sequence that is unique to the chimpanzee genome, but it also includes all kinds of other junk, including sequence that's present in the human genome but that hasn't been assembled; given the small amount of sequence reliably found in insertions/deletions, it is unlikely that much of it is really unique chimp sequence.

As for the second claim, it's presumably talking about "Nebraska Man". You can read about the actual history in Wikipedia; it does not include the putative feature being included in textbooks, then or later.

Presbyterian Continuist · Sep 19, 2017

Occams Barber said:
Wait a second...... has a thing about bananas and evolution and comes from New Zealand. Hmmm.....

Oscarr, are you Ray Comfort's dad?
OB

Nope. Why; does he like bananas?
Actually, my ginger cat could have evolved from the apes, because he goes bananas sometimes!
I had a friend called Bill, but he got too close to the cat and now they call him Claude.

mark kennedy · Sep 19, 2017

For whatever reason the evolutionist isn't interested in talking about indels. I'm not sure why but I think it comes down to the mutation rate.

sfs · Sep 19, 2017

mark kennedy said:
For whatever reason the evolutionist isn't interested in talking about indels.

What "evolutionist" doesn't want to talk about indels? What does your comment have to do with this thread, or Tomkins' paper? Do you understand why Tomkins' conclusion was wrong? What do you think of his approach?

mark kennedy · Sep 19, 2017

sfs said:
What "evolutionist" doesn't want to talk about indels? What does your comment have to do with this thread, or Tomkins' paper? Do you understand why Tomkins' conclusion was wrong? What do you think of his approach?

I don't know much about his approach but I never worried much about what I got from AIG or Talk Origins for that matter. I have very straight forward problems with how AIG does an exposition of Genesis 1. When that goes out the window I'm not much interested in how they do an algorithm.

Devin P · Sep 19, 2017

sfs said:
Well, you could ask. And it's not like the original claimant provided either thought or proof.

As for the first claim, the actual procedures used to compare the human and chimpanzee genomes are laid out in the paper describing the study (that's this one). If anyone has specific questions about the approach used or arguments against it, by all means make them. It's a little hard to rebut as vague a claim as was made here. In the absence of specifics, here's a rundown of the human-chimp stats that I wrote a while ago:

Only 2.7 Gb [billions of base-pairs] of chimpanzee genome was sequenced successfully, so the maximum that could have been aligned would have been those 2.7 Gb (out of ~3 Gb, even if the genomes were identical. The first supplementary note from the paper describes what happened to the remaining ~0.3 Gb that didn't align. 0.24 Gb could be aligned, but the alignments were to many places in the human genome, so the sequence was discarded. This is perfectly sensible, since the bulk of this is certainly bad assembly in the chimpanzee genome, and there's no point in making a comparison using bad pieces of assembly. Some of this poorly aligning sequence might represent multiple rearrangements of one or both genomes, but even if it does, there is no reason to think that the pieces of sequence would themselves be any more diverged than the rest of the genome. Finally, there was 0.09 Gb of sequence that didn't align at all. Again, a small part of this might be real sequence that is unique to the chimpanzee genome, but it also includes all kinds of other junk, including sequence that's present in the human genome but that hasn't been assembled; given the small amount of sequence reliably found in insertions/deletions, it is unlikely that much of it is really unique chimp sequence.

As for the second claim, it's presumably talking about "Nebraska Man". You can read about the actual history in Wikipedia; it does not include the putative feature being included in textbooks, then or later.

I didn't have to have thought or evidence, I was sharing something with someone else who thought that topic was interesting, not trying to debate and or smash someone's opinion down.

My apologies, after I looked it up, I found many articles, and websites on the publishing of Nebraska Man in school textbooks, but nothing I felt sure enough to post about. I do remember hearing about Piltdown Man though. It's a mix of an old human skull, and an orangutang jaw bone someone put together, said it was a new species, and it actually WAS taught in schools for several years. Since I again, heard it on a video, it's likely I confused the two.

Conservapedia:Piltdown Man - RationalWiki

"This particular fraud was taught to an entire generation of students worldwide from 1912 to 1953, when it was conclusively proven to the public to be a hoax. The Piltdown Man was featured in A Civic Biology, the textbook at issue in the Scopes trial in Tennessee."

Even after it was proven fake, they still tried to teach it. Why? This isn't the only fake evolutionary find. I know of three, counting the two we've already talked about. There are undoubtably more, as evolution is ridiculously fake, brought on only to "disprove" creation. If evolution exists, then there can't be a God. But, I know all too well, that isn't the case.

I'd share some links regarding us not being even remotely close to monkeys, but someone already beat me to it. Which, they're pretty awesome for doing so.

sfs · Sep 19, 2017

Devin P said:
My apologies, after I looked it up, I found many articles, and websites on the publishing of Nebraska Man in school textbooks, but nothing I felt sure enough to post about. I do remember hearing about Piltdown Man though. It's a mix of an old human skull, and an orangutang jaw bone someone put together, said it was a new species, and it actually WAS taught in schools for several years.

No problem. Note that the find was actually altered to make it look legitimate -- Piltdown wasn't a mistake, it was outright fraud. And yes, it was accepted, mostly among English scientists, though.

Devin P said:
Even after it was proven fake, they still tried to teach it. Why?

Who tried to teach it after 1953?

Devin P said:
I know of three, counting the two we've already talked about.

You've only listed one fake: Piltdown. You mentioned a mistake, which was never accepted as a definite find.

Devin P said:
There are undoubtably more, as evolution is ridiculously fake, brought on only to "disprove" creation.

Evolution is the central organizing theory of the science of biology. Scientists, including me, because it explains and predicts a vast range of data from many different fields, and because no alternative that is remotely as successful is on offer. I suggest you read creationist Todd Wood's statement on evolution: "It is a productive framework for lots of biological research, and it has amazing explanatory power. There is no conspiracy to hide the truth about the failure of evolution. There has really been no failure of evolution as a scientific theory. It works, and it works well."

Devin P said:
If evolution exists, then there can't be a God.

Biologists who are believers overwhelmingly accept evolution, as do many denominations and many theologians. Your statement is simply wrong.

mark kennedy · Sep 19, 2017

Devin P said:
I didn't have to have thought or evidence, I was sharing something with someone else who thought that topic was interesting, not trying to debate and or smash someone's opinion down.

My apologies, after I looked it up, I found many articles, and websites on the publishing of Nebraska Man in school textbooks, but nothing I felt sure enough to post about. I do remember hearing about Piltdown Man though. It's a mix of an old human skull, and an orangutang jaw bone someone put together, said it was a new species, and it actually WAS taught in schools for several years. Since I again, heard it on a video, it's likely I confused the two.

Conservapedia:Piltdown Man - RationalWiki

"This particular fraud was taught to an entire generation of students worldwide from 1912 to 1953, when it was conclusively proven to the public to be a hoax. The Piltdown Man was featured in A Civic Biology, the textbook at issue in the Scopes trial in Tennessee."

Even after it was proven fake, they still tried to teach it. Why? This isn't the only fake evolutionary find. I know of three, counting the two we've already talked about. There are undoubtably more, as evolution is ridiculously fake, brought on only to "disprove" creation. If evolution exists, then there can't be a God. But, I know all too well, that isn't the case.

I'd share some links regarding us not being even remotely close to monkeys, but someone already beat me to it. Which, they're pretty awesome for doing so.

The Piltdown Hoax was the flagship transitional of Darwinism for nearly half a century and it was a hoax. A skull taken from a mass grave site used during the Black Plague matched up with an orangutan jawbone. Even Louis Leakey, the famous paleontologist, had said that jaw didn’t belong with that skull so people knew, long before it was exposed, that Piltdown was contrived.

Leakey mentions the Piltdown skull in his book 'Adam's Ancestors':

'If the lower jaw really belongs to the same individual as the skull, then the Piltdown man is unique in all humanity. . . It is tempting to argue that the skull, on the one hand, and the jaw, on the other, do not belong to the same creature. Indeed a number of anatomists maintain that the skull and jaw cannot belong to the same individual and they see in the jaw and canine tooth evidence of a contemporary anthropoid ape.'

He referred to the whole affair as an enigma: In By the Evidence he says 'I admit . . . that I was foolish enough never to dream, even for a moment, that the true explanation lay in a deliberate forgery.' (Leakey and Piltdown)

The problem was that there was nothing to replace it as a transitional from ape to man. Concurrent with the prominence of the Piltdown fossil Raymond Dart had reported on the skull of an ape that had filled with lime creating an endocast or a model of what the brain would have looked like. Everyone considered it a chimpanzee child since it’s cranial capacity was just over 400cc but with the demise of Piltdown, a new icon was needed in the Darwinian theater of the mind. Raymond Dart suggests to Louis Leakey that a small brained human ancestor might have been responsible for some of the supposed tools the Leaky family was finding in Africa. The myth of the stone age ape man was born.

The Scottish anthropologist Sir Arthur Keith had built his long and distinguished career on the Piltdown fossil. When it was exposed it sent Darwinians scrambling, Arthur Keith had always rejected the Taung Child (Raymond Dart’s discovery) a chimpanzee child. Rightfully so since it’s small even for a modern chimpanzee. Keith would eventually apologized to Dart and Leakey would take his suggested name for the stone age ape man, Homo habilis, but there was a very real problem. The skull was too small to be considered a human ancestor, this impasse became known as the Cerebral Rubicon and Leakey’s solution was to simply ignore the cranial capacity.

"Sir Arthur Keith, one of the leading proponents of Piltdown Man, was particularly instrumental in shaping Louis's thinking. "Sir Arthur Keith was very much Louis's father in science" noted Frida. Brilliant, yet modest and unassuming, Keith was regarded at the time of Piltdown's discovery as England's most eminent anatomist and an authority on human ancestry...a one man court of appeal for physical anthropologists from around the world....and his opinion that assured Piltdown a place on every drawing of humankinds family tree." (Ancestral Passions, Virginia Morell)

Humans and Apes not closely related after all?

Well-Known Member

Well-Known Member

Well-Known Member

Senior Veteran

Hebrews 2:14.... Pesky Devil, git!

Newbie

Listening to TW4

Senior Member

Senior Member

Messianic

Messianic

Senior Member

Senior Member

Senior Veteran

Natura non facit saltum

Senior Member

Natura non facit saltum

Well-Known Member

Senior Member

Natura non facit saltum

Similar threads