README.md

An R Package of datasets to help predict the timeline preceding an Intelligence Explosion.

Installation

install.packages("https://github.com/AABoyles/AIPredict/archive/master.tar.gz", type = "source")

Data

This package contains the following datasets:

Moore's Law

Roughly stated, Moore's law predicts that the density of transistors in a single processor core grows exponentially. (Moore and Fellow 1998) It is a widely used and cited metric in predictions about the development of Artificial General Intelligence. Perhaps the best-known of these is futurist Ray Kurzweil's projections in The Singularity is Near (2005), which are based on simple, linear extrapolations of Moore's law.

library(AIPredict)
library(dplyr)
library(ggplot2)

ai_moores_law %>%
  ggplot(aes(Year, Transistors)) +
  geom_point() +
  scale_y_log10() +
  stat_smooth(method="lm")

Koomey's Law

Koomey's law states that the electricty required to execute some number of computations declines exponentially over time (Koomey et al. 2011). While less well-known than Moore's law, it offers another critical benchmark for comparison to the human brain, which computes a mind on a metabolic budget of approximately 10 watts.

ai_koomeys_law %>%
  ggplot(aes(Year, WattsPerMCPS)) +
  geom_point() +
  scale_y_log10() +
  stat_smooth(method="lm")

Every decade, the energy cost of computing falls approximately two orders of magnitude.

Top 500 Supercomputers

The Top 500 Supercomputers. Another good source for demonstrating Moore's and Koomey's laws.

ai_top500 %>%
  filter(Rank == 1) %>%
  mutate(RMAX=ifelse(is.na(RMax), Rmax, RMax)) %>%
  filter(!is.na(RMAX)) %>%
  ggplot(aes(Year, RMAX)) +
  geom_jitter(width = 1, height = 0, alpha=.3, size=5) +
  scale_y_log10() +
  stat_smooth(method = "lm")

While generally accessible, these data currently require a good deal of cleaning, which I'll perform and document in due course.

Bitcoin Hashrate

The Hashrate of the Bitcoin network provides a useful insight into the growth of financially-motivated expenditure of computing resources to compute the solution of a single problem. I suspect that this will be a useful point of comparison as the network's exercised capacity approaches levels comparable to the the human brain.

ai_bitcoin_hashrate %>%
  ggplot(aes(Date, GHPS)) +
  geom_line()

Animal Brains

As our computational capacity climbs through the ranks of the animal kingdom, there are a variety of metrics which would be useful for comparison. The Number of Neurons, Cortical Neurons (in mammals), Synapses, Brain size, Brain-to-Body Mass Ratio, Encephalization Quotient and Cranial Capacity might all be useful in this line of research. Sadly, I've not yet found any sources (let alone reliable ones) for more than a few species. This dataset was scraped from the Wikipedia's List of Animals by Number of Neurons. Those, in turn, were assembled from a variety of sources, most prominently Herculano-Houzel et al. (2007b). (This last set is on my list of sources to add.)

ai_animal_neurons %>%
  ggplot(aes(Neurons, Synapses)) +
  geom_point() +
  scale_y_log10() +
  scale_x_log10() +
  stat_smooth(method = "lm")
Warning: Removed 47 rows containing non-finite values (stat_smooth).

Warning: Removed 47 rows containing missing values (geom_point).

The logarithmic scale in both dimensions suggests a power-law relationship, but this is derived from a very small, very noisy sample.

Future of Life Institute Winning Grants

I don't know if there's anything interesting to be inferred from the Future of Life Institute Grant Recipients, but I collected this data when it was first published and this seems as appropriate a venue for its dissemination as any.

ai_fli_winners %>%
  group_by(Institution) %>%
  summarise(Total = sum(Amount)) %>% 
  ggplot(aes(Institution, Total)) +
  geom_bar(stat = "identity") +
  theme(axis.text.x = element_text(angle=90, hjust = 1, vjust = .5))

Data to be added

This is an obviously incomplete project. Besides the sources demonstrated above, I have the following known sources to be prepared use in this package:

Desired Data

This notably misses estimates about the computational capacities of the biological brains. Other types of estimates about the growth of computing power (e.g. global computing power, specs on the most powerful supercomputer at any given time, size of various commercial cloud infrastructure providers, etc.) are also needed. If you know of any reliable sources of these or related to these topics, please email me.

References

“Brief Introduction.” n.d. http://www.graph500.org/. http://www.graph500.org/.

Herculano-Houzel, Suzana, Christine E Collins, Peiyan Wong, and Jon H Kaas. 2007a. “Cellular Scaling Rules for Primate Brains, Table 2.” http://www.pnas.org/content/104/9/3562/suppl/DC1.

———. 2007b. “Cellular Scaling Rules for Primate Brains.” Proceedings of the National Academy of Sciences of the United States of America 104 (9): 3562–7. doi:10.1073/pnas.0611396104.

Koomey, J G, S Berard, M Sanchez, and H Wong. 2011. “Implications of Historical Trends in the Electrical Efficiency of Computing.” IEEE Annals of the History of Computing 33 (3): 46–54. doi:10.1109/MAHC.2010.28.

Kurzweil, Ray. 2005. The Singularity Is Near: When Humans Transcend Biology. Penguin. https://books.google.com/books?hl=en&lr=&id=9FtnppNpsT4C&oi=fnd&pg=PT22&dq=The%2BSingularity%2Bis%2BNear&ots=K29jXGYYAz&sig=wWisXrrCfssdSBBEGDMAsjgPrBk.

Moore, Gordon E, and Life Fellow. 1998. “Cramming More Components onto Integrated Circuits.” Proceedings of the IEEE 86 (1). http://www.cs.utexas.edu/~fussell/courses/cs352h/papers/moore.pdf.

Wikipedia contributors. 2016. “List of Animals by Number of Neurons.” https://en.wikipedia.org/w/index.php?title=List_of_animals_by_number_of_neurons&oldid=716197676. https://en.wikipedia.org/w/index.php?title=List_of_animals_by_number_of_neurons&oldid=716197676.



AABoyles/AIPredict documentation built on Nov. 15, 2019, 9:20 a.m.