DiEbLog

Posts at Uncommon Descent

2019-02-27T07:18:00.002-08:00

Here are the posts and comments which Uncommon Descent received for each month from Apr 2005 until Dec 2018. The area of the circles is proportional to the number of views those posts gathered until mid-February 2019 (with counting most probably starting sometime in 2011...). The years can be seen more clearly in the faceted view:

Views, Comments, and Articles per Year

Year	Articles	Comments	Views
2005	600	9000	125000
2006	1100	23000	274000
2007	900	23000	527000
2008	800	23000	413000
2009	900	41000	446000
2010	900	25000	359000
2011	2900	42000	1701000
2012	2000	28000	1407000
2013	1700	43000	1503000
2014	2300	58000	1784000
2015	1900	51000	1038000
2016	1900	27000	812000
2017	1600	24000	719000
2018	1700	22000	844000

Articles with the most Comments per Year

Year	Article	Views	Comments
2005	Stephen Jay Gould’s Contempt for the John Templeton Foundation		453	259
2006	Gil Has Never Grasped the Nature of a Simulation Model		398	200
2007	Kevin Padian: The Archie Bunker Professor of Paleobiology at Cal Berkeley		3718	387
2008	Complex speciation of humans and chimpanzees		1065	571
2009	Answers for Judge Jones		892	768
2010	Intelligent Design and the Demarcation Problem		868	712
2011	Science and Freethinking		8275	686
2012	UB Sets It Out Step-By-Step		14008	1432
2013	Please Take the Time to Understand Our Arguments Before You Attack Them		6874	856
2014	Mystery at the heart of life		8527	3500
2015	Bad math: Why Larry Moran’s “I’m not a Darwinian” isn’t a valid reply to Meyer’s argument		4212	670
2016	Durston and Craig on an infinite temporal past . . .		7162	1416
2017	FFT: Gender as a social construct — what is the vid below telling us on where our intellectual culture has now reached?		4977	576
2018	The Ubiquitin System: Functional Complexity and Semiosis joined together.		9705	953

Articles with the most Views per Year

Year	Article	Views	Comments
2005	Why Scientists Should NOT Dismiss Intelligent Design		4836	30
2006	Respected Cornell geneticist rejects Darwinism in his recent book		6258	106
2007	Icon of Evolution “Lucy” Bites the Dust		15504	28
2008	Lactose digestion in E. coli		8155	20
2009	Darwin reader: Darwin’s racism		10603	141
2010	The Bacterial Flagellum – Truly An Engineering Marvel!		10336	3
2011	A Whale of a Problem for Evolution: Ancient Whale Jawbone Found in Antartica		26907	180
2012	Seven Nobel Laureates in science who either supported Intelligent Design or attacked Darwinian evolution		26361	10
2013	Macroevolution, microevolution and chemistry: the devil is in the details		17239	129
2014	A world-famous chemist tells the truth: there’s no scientist alive today who understands macroevolution		374312	489
2015	Physicist Paul Davies’ killer argument against the multiverse		9811	29
2016	Durston and Craig on an infinite temporal past . . .		7162	1416
2017	FFT*: Charles unmasks the anti-ID trollish tactic of attacking God, Christian values and worldview themes		5691	515
2018	The Ubiquitin System: Functional Complexity and Semiosis joined together.		9705	953

Somewhat Amusing Oddity

The article with the fewest eyeballs was 2011's Cash awards available for research or essays on the uses and abuses of biology: though two users commented (one of them BornAgain77, with an off-topic remark), it was visited only 17 times until Feb 25, 2019 (the average number of views for articles in 2011 was 660). I suppose that most robots will not follow links which are promising "cash awards"!

Conclusion

Not much to see, really. UD is soldiering on after its peak in 2014. The lack of month-to-month volatility over the last couple of years indicates that traffic at UD does not depend so much on outside influences (like news), but on the same set of actors...

An Amazon Review: Still waiting for the ultimate book on Intelligent Design

2018-02-23T15:08:00.000-08:00

I wrote a review on Amazon for Dr. Robert J. Marks II's, Dr. Dr. William A. Dembski's, and Dr. Winston Ewert's book Introduction to Evolutionary Informatics (1st Edition):

We are all waiting for the ultimate book on Intelligent Design, written by R. Marks and W. Dembski. Instead we get a "textbook", another attempt to explain the concepts to laymen. I got the impression that the authors used this setting to avoid the necessary rigour: they just do not define terms like "search" which they use hundreds of times. This allows for a lot of hand-waving, like the following sentence on p. 174:

"We note, however, the choice of an algorithm along with its parameters and initialization imposes a probability distribution over the search space"

That unsubstantiated claim is essential for their following proofs on "The Search for a Search"!

And then there are details like this one:

p. 130: "For the Cracker Barrel puzzle [we got] an endogenous information of I = 7.15 bits"
p. 138: "We return now to the Cracker Barrel puzzle. We showed that the endogenous information [...] is I = 7.4 bits"

I tried to solve this conundrum, but I came up with I = 7.8 bits. I contacted the authors, but got no reply.

Not surprisingly, I gave it only two stars.

Some Details on the Cracker Barrel Puzzle

A more complete quote from p. 130 is:

For the Cracker Barrel puzzle, all of the 15 holes are filled with pegs and, at random, a single peg is removed. This starts the game. Using random initialization and random moves, simulation of four million games using a computer program resulted in an estimated win probability $p = 0.007\,0$ and an endogenous information of $$I_\Omega = -\log_2 p = 7.15\ \text{bits}.$$

They didn't calculate the exact value; instead, they simulated the puzzle 4,000,000 times. A simulation is the easiest way to program a result - but how good is it? It should be pretty good: performing one simulation is a Bernoulli trial with a probability of success $p_t$, the theoretical probability of winning a single game by chance. Repeating 4,000,000 Bernoulli trials leads to a binomial experiment $B(4,000,000; p_t)$, so the standard deviation of the estimate of $p_t$ is $\sigma = \sqrt{p_t(1-p_t)/4,000,000} \approx 0.000\,042$ - that's why stating four positions after the decimal point isn't overconfident: assuming that there is no systematic error, the probability that the actual value $p_t$ lies within $0.007\,00 \pm 0.000\,05$ is $77\%$.
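The error estimate can be checked with a few lines of Python. This is only a sketch: it assumes a normal approximation to the binomial distribution and plugs in the estimate $p = 0.007\,0$ in place of the unknown $p_t$; the function name `prob_within` is my own.

```python
from math import erf, sqrt

n = 4_000_000      # number of simulated games, as quoted from p. 130
p_hat = 0.0070     # estimated win probability, as quoted from p. 130

# Standard deviation of an estimated proportion in a binomial experiment.
sigma = sqrt(p_hat * (1 - p_hat) / n)

def prob_within(half_width, sigma):
    """P(|estimate - p_t| <= half_width) under a normal approximation."""
    return erf(half_width / (sigma * sqrt(2)))

print(f"sigma = {sigma:.6f}")
print(f"P(within +-0.00005) = {prob_within(0.00005, sigma):.2f}")
```

Under these assumptions the two printed values should agree with the $0.000\,042$ and the $77\%$ given above.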

Giving three significant digits for $I_\Omega$ oversells the power of their experiment slightly: this implies that they expect $p_t$ to be in the interval $[0.007\,017;0.007\,065]$ (that is, $I_\Omega$ between $7.145$ and $7.155$ bits) with a reasonable probability - but the probability is at best about $44\%$.

Confining themselves to only two significant digits on p. 138, $I_\Omega = 7.4\;bits$, yields much more reliable results: again, assuming that there is nothing systematically wrong with their calculation, they can say that $p_t$ is in $[0.005\,72;0.006\,13]$ (that is, $I_\Omega$ between $7.35$ and $7.45$ bits) with a probability of more than $99.999\,99\%$! Well done...
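How a rounded value of $I_\Omega$ translates into an interval for $p_t$ can be sketched as follows. The assumptions are mine: a value stated with $d$ digits after the decimal point is read as "true value within half a unit of the last digit", and the probability uses the same normal approximation with $\sigma$ taken from $p = 0.007\,0$ and 4,000,000 games. The helper name `probability_interval` is hypothetical.

```python
from math import erf, sqrt

def probability_interval(bits, digits_after_point):
    """Interval of p = 2**(-I) compatible with I rounded to the given digits."""
    half = 0.5 * 10 ** (-digits_after_point)
    return 2 ** -(bits + half), 2 ** -(bits - half)

lo_715, hi_715 = probability_interval(7.15, 2)   # value from p. 130
lo_74, hi_74 = probability_interval(7.4, 1)      # value from p. 138

# Best case for the narrow interval: the estimate sits exactly in its middle.
sigma = sqrt(0.0070 * (1 - 0.0070) / 4_000_000)
best_case = erf((hi_715 - lo_715) / 2 / (sigma * sqrt(2)))

print(f"7.15 bits -> [{lo_715:.6f}; {hi_715:.6f}]")
print(f"7.4 bits  -> [{lo_74:.5f}; {hi_74:.5f}]")
print(f"best-case coverage of the narrow interval: {best_case:.2f}")
```

Note that the two intervals do not overlap, which is the quantitative core of the contradiction between p. 130 and p. 138.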

Or not: it is very improbable that both values are correct. Very, very, very, very improbable - using the most favourable estimates, the second result should occur with a probability of less than $10^{-98}$ if the first experiment was correctly implemented. It is even worse the other way around: $10^{-112}$.

Which value is correct?

Not surprisingly, the answer is: both are wrong - the three authors somehow botched the implementation of even the easiest way to approach the question - a simulation. How can I be so cocksure? I simulated it myself - 4,000,000 times - and got a value of $p = 0.004\,5$. Then I calculated the theoretical value by enumerating all possible games and their respective probabilities: again, $p = 0.004\,5$. Then I published part of my code at The Skeptical Zone, and thankfully, Roy and Corneel also implemented simulations - which got compatible results. Lastly, Tom English programmed the problem much more cleverly, getting exactly the same results as I did (I just had to wait much longer for mine...)
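For readers who want to repeat the check, here is a minimal sketch of such an enumeration in Python. It is not the code published at the time. It assumes the usual triangular 15-hole board, that the initially empty hole is chosen uniformly, and that "random moves" means choosing uniformly among all legal jumps; a game is won when exactly one peg remains. All names are my own.

```python
from fractions import Fraction
from functools import lru_cache
from math import log2

ROWS = 5
# Map (row, col) coordinates of the triangular board to hole indices 0..14.
INDEX = {(r, c): r * (r + 1) // 2 + c for r in range(ROWS) for c in range(r + 1)}
DIRECTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0), (1, 1), (-1, -1)]

# Every possible jump as a (from, over, to) triple of hole indices.
JUMPS = [
    (INDEX[(r, c)], INDEX[(r + dr, c + dc)], INDEX[(r + 2 * dr, c + 2 * dc)])
    for (r, c) in INDEX
    for dr, dc in DIRECTIONS
    if (r + 2 * dr, c + 2 * dc) in INDEX
]

def legal_moves(board):
    """Boards reachable by one jump; a board is a 15-bit mask of pegs."""
    result = []
    for src, over, dst in JUMPS:
        if (board >> src) & 1 and (board >> over) & 1 and not (board >> dst) & 1:
            result.append((board & ~(1 << src) & ~(1 << over)) | (1 << dst))
    return result

@lru_cache(maxsize=None)
def win_probability(board):
    """Exact probability of ending with one peg under uniformly random jumps."""
    moves = legal_moves(board)
    if not moves:
        return Fraction(1 if bin(board).count("1") == 1 else 0)
    return sum(win_probability(b) for b in moves) / len(moves)

FULL = (1 << 15) - 1
# Average over the 15 equally likely choices of the initially empty hole.
p_exact = sum(win_probability(FULL & ~(1 << hole)) for hole in range(15)) / 15

print(f"p = {float(p_exact):.5f}, I = {-log2(p_exact):.2f} bits")
```

The printed value can be compared with the $p = 0.004\,5$ reported above; if "random moves" is interpreted differently (for example, first picking a peg, then a direction), the number may change.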

Why didn't the authors do the same?

The Search Problem of William Dembski, Winston Ewert, and Robert Marks

2018-01-29T15:18:00.002-08:00

Introduction to Evolutionary Informatics, by Robert J. Marks II, the “Charles Darwin of Intelligent Design”; William A. Dembski, the “Isaac Newton of Information Theory”; and Winston Ewert, the “Charles Ingram of Active Information.” World Scientific, 332 pages.

Classification: Engineering mathematics. Engineering analysis. (TA347)
Subjects: Evolutionary computation. Information technology–Mathematics.¹

Search is a central term in the work of Dr. Dr. William Dembski jr, Dr. Winston Ewert, and Dr. Robert Marks II (DEM): it appears in the title of a couple of papers written by at least two of the authors, and it is mentioned hundreds of times in their textbook "Introduction to Evolutionary Informatics". Strangely - and in contrast to the other central term, information - it is not defined in this textbook, and neither is search problem or search algorithm. Luckily, dozens of examples of searches are given. I took a closer look to find out what DEM see as the search problem in the "Introduction to Evolutionary Informatics" and how their model differs from those used by other mathematicians and scientists.

A Smörgåsbord of Search Algorithms

In their chapter 3.8 "A Smörgåsbord of Search Algorithms", DEM present

Table 3.2. A list of some search algorithms.

active set method³⁸
adaptive coordinate descent³⁹
alpha–beta pruning⁴⁰
ant colony optimization⁴¹
artificial immune system optimization⁴²
auction algorithm⁴³
Berndt–Hall–Hall–Hausman algorithm⁴⁴
blind search
branch and bound⁴⁵
branch and cut⁴⁶
branch and price⁴⁷
Broyden–Fletcher–Goldfarb–Shanno (BFGS) method⁴⁸
Constrained optimization by linear approximation (COBYLA)⁴⁹
conjugate gradient method⁵⁰
CMA-ES (covariance matrix adaptation evolution strategy)⁵¹
criss-cross algorithm⁵²
cross-entropy optimization⁵³
cuckoo search⁵⁴
Davidon’s variable metric method⁵⁵
differential evolution⁵⁶
eagle strategy⁵⁷
evolutionary programs⁵⁸
evolutionary strategies
exhaustive search
Fibonacci search⁵⁹˒⁶⁰
firefly algorithm⁶¹
Fletcher–Powell method⁶²
genetic algorithms⁶³

glowworm swarm optimization⁶⁴
golden section search⁶⁵˒⁶⁶
gradient descent⁶⁷
great deluge algorithm⁶⁸
harmony search⁶⁹
imperialist competitive algorithm⁷⁰
intelligent water drop optimization⁷¹
Karmarkar’s algorithm⁷²
Levenberg–Marquardt algorithm⁷³
Linear, Quadratic, Integer and Convex Programming⁷⁴
Nelder–Mead method⁷⁵
Newton–Raphson method⁷⁶
one-at-a-time search⁷⁷
particle swarm optimization⁷⁸
pattern search⁷⁹
POCS (alternating projections onto convex sets)⁸⁰
razor search⁸¹
Rosenbrock methods⁸²
sequential unconstrained minimization technique (SUMT)⁸³
shuffled frog-leaping algorithm⁸⁴
simplex methods⁸⁵
simulated annealing⁸⁶
social cognitive optimization⁸⁷
stochastic gradient search⁸⁸
stochastic hill climbing⁸⁹
Tabu search⁹⁰
Tree search⁹¹
Zionts–Wallenius method⁹²

A smörgåsbord indeed - and to me it is not at all clear how this list was constructed: DEM just write "[a]n incomplete list of search algorithms³⁷ is provided in Table 3.2." and give as a footnote the third volume of Donald Knuth's "The Art of Computer Programming: Sorting and Searching". But obviously, this list is not taken from the book, as

Knuth's definition of a search covers only a finite search space:
"In general, we shall suppose that a set of N records has been stored, and the problem is to locate the appropriate one. As in the case of sorting, we assume that each record includes a special field called its key."
some of the methods were developed after 1973, the year Knuth's book was published according to DEM

I assume it never hurts to mention Donald Knuth. Fortunately, the footnotes in the table (which are listed at the end of the chapter) are a little bit more to the point. To save jumping back and forth, I added the given source to every item in the list in a second column. I looked up some of them and tried to find out which kind of search problem the authors of each paper have in mind - this I put into the third column².

method	source	search problem
active set method	J. Nocedal and S. Wright, Numerical Optimization (Springer Science & Business Media, 2006).	optimization problem: $\min_{x \in \mathbb{R}^n}f(x)$ subject to $\begin{cases}c_i(x) = 0, &i \in \mathcal{E}\\c_i(x) \ge 0,& i \in \mathcal{I} \end{cases}$
adaptive coordinate descent	I. Loshchilov, M. Schoenauer, and M. Sebag, “Adaptive Coordinate Descent.” In Proceedings of the 13th Annual Conference on Genetic and Evolutionary Computation (ACM, 2011), pp. 885–892.	(separable) continuous optimization problems
alpha–beta pruning	Donald E. Knuth and Ronald W. Moore, “An analysis of alpha-beta pruning.” Artif Intel, 6(4), pp. 293–326 (1976).	"searching a large tree of potential continuations" (p. 234)
ant colony optimization	M. Dorigo, V. Maniezzo, and A. Colorni, “Ant system: optimization by a colony of cooperating agents.” IEEE Transactions on Systems, Man, and Cybernetics — Part B, 26(1), pp. 29–41 (1996).	stochastic combinatorial optimization
artificial immune system optimization	Leandro N. de Castro and J. Timmis, Artificial Immune Systems: A New Computational Intelligence Approach (Springer, 2002), pp. 57– 58.
auction algorithm	Dimitri P. Bertsekas, “A distributed asynchronous relaxation algorithm for the assignment problem.” Proceedings of the IEEE International Conference on Decision and Control, pp. 1703–1704 (1985).
Berndt–Hall–Hall–Hausman algorithm	Ernst R. Berndt, Bronwyn H. Hall, Robert E. Hall, and Jerry A. Hausman, “Estimation and inference in nonlinear structural models.” Annals of Economic and Social Measurement, 3(4), pp. 653–665 (1974).	non-linear least squares problems
blind search
branch and bound	Patrenahalli M. Narendra and K. Fukunaga, “A branch and bound algorithm for feature subset selection.” IEEE Transactions on Computers, 100(9), pp. 917–922 (1977).
branch and cut	M. Padberg and G. Rinaldi, “A branch-and-cut algorithm for the resolution of large-scale symmetric traveling salesman problems.” SIAM Rev, 33(1), pp. 60–100 (1991).
branch and price	Cynthia Barnhart, Ellis L. Johnson, George L. Nemhauser, Martin W.P. Savelsbergh, and Pamela H. Vance, “Branch-and-price: Column generation for solving huge integer programs.” Operations Research, 46(3), pp. 316–329 (1998).
Broyden–Fletcher–Goldfarb–Shanno (BFGS) method	J. Nocedal and Stephen J. Wright, Numerical Optimization, 2nd edition (Springer-Verlag, Berlin, New York, 2006).
Constrained optimization by linear approximation (COBYLA)	Thomas A. Feo and Mauricio G.C. Resende, “A probabilistic heuristic for a computationally difficult set covering problem.” Op Res Lett, 8(2), pp. 67–71 (1989).
conjugate gradient method	A.V. Knyazev and I. Lashuk, “Steepest descent and conjugate gradient methods with variable preconditioning.” SIAM J Matrix Anal Appl, 29(4), pp. 1267–1280 (2007).	linear system with a real symmetric positive definite matrix of coefficients A
CMA-ES (covariance matrix adaptation evolution strategy)	Y. Akimoto, Y. Nagata, I. Ono, and S. Kobayashi. “Bidirectional relation between CMA evolution strategies and natural evolution strategies.” Parallel Problem Solving from Nature, PPSN XI, pp. 154–163 (Springer, Berlin Heidelberg, 2010).
criss-cross algorithm	Dick den Hertog, C. Roos, and T. Terlaky, “The linear complementarity problem, sufficient matrices, and the criss-cross method.” Linear Algebra Appl, 187, pp. 1–14 (1993).
cross-entropy optimization	R.Y. Rubinstein, “Optimization of computer simulation models with rare events.” Eur J Ops Res, 99, pp. 89–112 (1997). R.Y. Rubinstein and D.P. Kroese, The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning (Springer-Verlag, New York, 2004).	given a set $\mathcal{X}$ and an $\mathbb{R}$-valued function $S$ on $\mathcal{X}$, determine $\max_{\textbf{x} \in \mathcal{X}} S(\textbf{x})$ (p. 4)
cuckoo search	X.S. Yang and S. Deb, “Cuckoo search via Lévy flights.” World Congress on Nature & Biologically Inspired Computing (NaBIC 2009). IEEE Publications, pp. 210–214. arXiv:1003.1594v1.
Davidon’s variable metric method	W. C. Davidon, “Variable metric method for minimization.” AEC Research Development Rept. ANL-5990 (Rev.) (1959).
differential evolution	P. Rocca, G. Oliveri, and A. Massa, “Differential evolution as applied to electromagnetics.” Antennas and Propagation Magazine, IEEE, 53(1), pp. 38–49 (2011).
eagle strategy	Xin-She Yang and Suash Deb, “Eagle strategy using Lévy walk and firefly algorithms for stochastic optimization.” Nature Inspired Cooperative Strategies for Optimization (NICSO 2010) (Springer Berlin Heidelberg, 2010), pp. 101–111.
evolutionary programs	Jacek M. Zurada, R.J. Marks II and C.J. Robinson; Editors, Computational Intelligence: Imitating Life (IEEE Press, 1994). M. Palaniswami, Y. Attikiouzel, Robert J. Marks II, D. Fogel, and T. Fukuda; Editors, Computational Intelligence: A Dynamic System Perspective (IEEE Press, 1995).
evolutionary strategies
exhaustive search
Fibonacci search	David E. Ferguson, “Fibonaccian searching.” Communications of the ACM, 3(12), p. 648 (1960). J. Kiefer, “Sequential minimax search for a maximum.” Proceedings of the American Mathematical Society, 4(3), pp. 502–506 (1953).
firefly algorithm	Xin-She Yang, “Firefly algorithms for multimodal optimization.” In Stochastic Algorithms: Foundations and Applications (Springer Berlin Heidelberg, 2009), pp. 169–178.
Fletcher–Powell method	R. Fletcher and M.J.D. Powell, “A rapidly convergent descent method for minimization.” Computer J. (6), pp. 163–168 (1963).
genetic algorithms	David E. Goldberg, Genetic Algorithms in Search Optimization and Machine Learning (Addison Wesley, 1989). R. Reed and R.J. Marks II, “Genetic Algorithms and Neural Networks: An Introduction.” Northcon/92 Conference Record (Western Periodicals Co., Ventura, CA, Seattle WA, October 19–21, 1992), pp. 293–301.
glowworm swarm optimization	K.N. Krishnanand and D. Ghose. “Detection of multiple source locations using a glowworm metaphor with applications to collective robotics.” Proceedings of the 2005 IEEE Swarm Intelligence Symposium (SIS 2005), pp. 84–91 (2005).
golden section search	A. Mordecai and Douglass J. Wilde. “Optimality proof for the symmetric Fibonacci search technique.” Fibonacci Quarterly, 4, pp. 265–269 (1966). A. Mordecai and Douglass J. Wilde. “Optimality proof for the symmetric Fibonacci search technique.” Fibonacci Quarterly, 4, pp. 265–269 (1966).
gradient descent	Jan A. Snyman, Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms (Springer Publishing, 2005).	constrained optimization problem $\operatorname{minimize}_{\text{w.r.t. } \mathbf{x}} f(\mathbf{x})$, $\mathbf{x} = [x_1, x_2,\ldots, x_n]^T \in \mathbb{R}^n$ subject to $\begin{cases}g_j(\mathbf{x}) \le 0, & j=1,2, \ldots, m \\ h_j(\mathbf{x})=0,& j=1,2,\ldots,r\end{cases}$
great deluge algorithm	Gunter Dueck, “New optimization heuristics: the great deluge algorithm and the record-to-record travel.” J Comp Phys, 104(1), pp. 86–92 (1993).
harmony search	Zong Woo Geem, “Novel derivative of harmony search algorithm for discrete design variables.” Applied Mathematics and Computation, 199, (1), pp. 223–230 (2008).
imperialist competitive algorithm	Esmaeil Atashpaz-Gargari and Caro Lucas, “Imperialist competitive algorithm: an algorithm for optimization inspired by imperialistic competition.” 2007 IEEE Congress on Evolutionary Computation (CEC 2007), pp. 4661– 4667 (2007).
intelligent water drop optimization	Shah-Hosseini Hamed, “The intelligent water drops algorithm: a nature-inspired swarm-based optimization algorithm.” Int J Bio-Inspired Comp, 1(1/2), pp. 71–79 (2009).
Karmarkar’s algorithm	Karmarkar Narendra, “A new polynomial-time algorithm for linear programming.” Proceedings of the Sixteenth Annual ACM Symposium on Theory of Computing, pp. 302–311 (1984).
Levenberg–Marquardt algorithm	Kenneth Levenberg, “A Method for the Solution of Certain Non-Linear Problems in Least Squares.” Quart App Math, 2, pp. 164–168 (1944).
Linear, Quadratic, Integer and Convex Programming	Alexander Schrijver, Theory of Linear and Integer Programming (John Wiley & Sons, 1998). Yurii Nesterov, Arkadii Nemirovskii, and Yinyu Ye, “Interior-point polynomial algorithms in convex programming.” Vol. 13. Philadelphia Society for Industrial and Applied Mathematics (1994).	given a subset $\Pi \subset \Sigma^* \times \Sigma^*$, where $\Sigma$ is some alphabet, then the search problem is: given string $z \in \Sigma^*$, find a string $y$ such that $(z,y) \in \Pi$, or decide that no such string $y$ exists.
Nelder–Mead method	K.I.M. McKinnon, “Convergence of the Nelder–Mead simplex method to a non-stationary point.” SIAM J Optimization, 9, pp. 148–158 (1999).
Newton–Raphson method	E. Süli and D. Mayers, An Introduction to Numerical Analysis (Cambridge University Press, 2003).
one-at-a-time search	A.H. Boas, “Modern mathematical tools for optimization,” Chem Engrg (1962).
particle swarm optimization	J. Kennedy and R. Eberhart, “Particle Swarm Optimization.” Proceedings of IEEE International Conference on Neural Networks IV, pp. 1942–1948 (1995).
pattern search	A. W. Dickinson, “Nonlinear optimization: Some procedures and examples.” Proceedings of the 19th ACM National Conference (ACM, 1964), pp. 51–201.
POCS (alternating projections onto convex sets)	Robert J. Marks II, Handbook of Fourier Analysis & its Applications (Oxford University Press, 2009).
razor search	J.W. Bandler and P.A. Macdonald, “Optimization of microwave networks by razor search.” IEEE Trans. Microwave Theory Tech., 17(8), pp. 552–562 (1969).
Rosenbrock methods	H.H. Rosenbrock, “An automatic method for finding the greatest or least value of a function.” Comp. J., 3, pp. 175–184 (1960).
sequential unconstrained minimization technique (SUMT)	John W. Bandler, “Optimization methods for computer-aided design.” IEEE Transactions on Microwave Theory and Techniques, 17(8), pp. 533–552 (1969).
shuffled frog-leaping algorithm	Muzaffar Eusuff, Kevin Lansey, and Fayzul Pasha, “Shuffled frog-leaping algorithm: a memetic meta-heuristic for discrete optimization.” Engineering Optimization, 38(2), pp. 129–154 (2006).
simplex methods	M.J. Box, “A new method of constrained optimization and a comparison with other methods.” Computer J., (8), pp. 42–52 (1965). J.A. Nelder and R. Mead, “A simplex method for function minimization.” Computer J., 7, pp. 308–313 (1965).
simulated annealing	S. Kirkpatrick, C.D. Gelatt, and M.P. Vecchi, “Optimization by simulated annealing.” Science, 220(4598), pp. 671–680 (1983).
social cognitive optimization	X.-F. Xie, W. Zhang, and Z. Yang, “Social cognitive optimization for nonlinear programming problems.” Proceedings of the First International Conference on Machine Learning and Cybernetics, 2, pp. 779–783 (Beijing, 2002).
stochastic gradient search	James C. Spall, Introduction to Stochastic Search and Optimization (2003).	Find the value(s) of a vector $\mathbf{\theta} \in \Theta$ that minimize a scalar-valued loss function $L(\mathbf{\theta})$ or: Find the value(s) of $\mathbf{\theta} \in \Theta$ that solve the equation $\mathbf{g}(\mathbf{\theta}) = \mathbf{0}$ for some vector-valued function $\mathbf{g}(\mathbf{\theta})$
stochastic hill climbing	Brian P. Gerkey, Sebastian Thrun, and Geoff Gordon, “Parallel stochastic hillclimbing with small teams.” Multi-Robot Systems. From Swarms to Intelligent Automata, Volume III, pp. 65–77. (Springer Netherlands, 2005).
Tabu search	F. Glover, “Tabu Search — Part I.” ORSA J Comput, 1(3), pp. 190–206 (1989). “Tabu Search — Part II”, ORSA J Comput, 2(1), pp. 4–32 (1990).
Tree search	Athanasios K. Sakalidis, “AVL-Trees for Localized Search.” Inform Control, 67, pp. 173–194 (1985). R. Seidel and C.R. Aragon, “Randomized search trees.” Algorithmica, 16(4–5), pp. 464–497 (1996).
Zionts–Wallenius method	S. Zionts and J. Wallenius, “An interactive programming method for solving the multiple criteria problem.” Manage Sci, 22(6), pp. 652–663 (1976).

Often the quoted texts are scientific papers: those expect their readers to be accustomed to the general framework of their specific problems, and will not define the term "search problem" from scratch. Instead, they will just mention the specific problem which they tackle - like stochastic combinatorial optimization.

But there are some textbooks, too: I tried to look them up and to quote what their authors define as their search problem. Nocedal and Snyman both describe the classical optimization problem: here, the search space is a subset of an $n$-dimensional vector space $V$ over $\mathbb{R}$, as $V$ is restricted by some (in)equations. Finding the target means minimizing an $\mathbb{R}$-valued function on this set.

On the other hand, Wolpert and Macready - in their classic paper "No Free Lunch Theorems for Optimization"¹ - look at two finite sets, $\mathcal{X}$ and an ordered $\mathcal{Y}$, and wish to minimize a function $f: \mathcal{X} \rightarrow \mathcal{Y}$ by finding the $x \in \mathcal{X}$ such that $f(x)$ is minimal.

What do all these approaches have in common? A set to search on and a function which can be optimized. In most cases, the range of the function is $\mathbb{R}$, $\mathbb{Z}$, or $\mathbb{N}_0$, but for some problems (as mentioned in section 3.7.2. Pareto optimization and optimal sub-optimality of Introduction to Evolutionary Informatics), a partially ordered set will be used instead. I will ignore this for the time being and just choose an ordered set for my definition of an optimization problem, which should cover virtually all the cases discussed above:

General Optimization Problem
given:
a set $\Omega$
an ordered set $\mathcal{Y}$ and
a function $f: \Omega \rightarrow \mathcal{Y}$
find $x \in \Omega$ such that $f(x) = \min f$

As said before, optimizing and searching are virtually the same, but to stress the character of a search I introduce a target - something, which is mentioned in all the searches of DEM. So, my search problem is:

General Search Problem
given:
a set $\Omega$
an ordered set $\mathcal{Y}$
a target $T \subset \Omega$ and
a function $f: \Omega \rightarrow \mathcal{Y}$ such that $T = \{\tau \in \Omega | f(\tau)=\min f \}$
find $x \in T$
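For a finite $\Omega$, this definition can be written down directly. The following Python sketch is only an illustration: the function name `exhaustive_search` and the toy choices of $\Omega$ and $f$ are my own assumptions. The point it shows is that the target $T$ is not an extra input, it is computed from $f$.

```python
def exhaustive_search(omega, f):
    """Evaluate f on every element of a finite omega.

    Returns the minimal value of f and the set of all minimisers,
    i.e. the target T as defined by f itself.
    """
    values = {x: f(x) for x in omega}
    best = min(values.values())
    return best, {x for x, y in values.items() if y == best}

# Toy instance (hypothetical): integers from -5 to 5, squared distance to 2.
omega = range(-5, 6)
f = lambda x: (x - 2) ** 2

best, target = exhaustive_search(omega, f)
print(best, target)
```

Any element returned in `target` solves both the optimization problem and the search problem, so an exhaustive search cannot miss.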

Nothing substantial has changed, the definition just became a little more verbose. I am quite sure that most authors of the papers in the table would accept this as a good attempt at a definition - but is it the search problem which DEM have in mind?

On page 48, they provide an example of a search credited to Walter Bradley:

Kirk is an armadillo foraging for grubs when he is bitten by a spider that makes him blind. Kirk wants to return to his armadillo hole, but is disoriented. He knows, though, that his hole is at the lowest elevation in the immediate area, so he balls up and rolls downhill to his hole. When Kirk does this, he is not explicitly seeking his hole. His surroundings are fortuitously designed to take him there. Kirk’s target is thus implicit in the sense it is not specifically sought, but is a result of the environment’s action on him. He can bounce off of trees and be kicked around by playful kids. And repeated trials of rolling down the hill might take drastically different paths. But ultimately, Kirk will end up in his hole at the bottom of the hill. Kirk reaches his home because of information he acquires from his environment. The environment must be designed correctly for this to happen.

Here, $\Omega$ is Kirk's habitat, and $f$ is given as the elevation. What is surprising is that DEM make a distinction between the minimum of the function $f$ and Kirk's intended target $T$, his burrow. Luckily, both coincide, but DEM imply that this is not necessarily the case!

Next, they revisit their "pancake search example": here, the taste of the pancake as a function depends smoothly on a variety of factors like amount of ingredients, oven temperature, baking time, etc. - the possible combinations of which make up $\Omega$. On this $\Omega$, a cook looks for the best taste by optimizing the taste function. Now, they restrict $\Omega$ by additional conditions to $\Omega'$, such that the original extreme of $f$ does not lie in the new restricted set.

For the definitions of optimization/search problem above, this does not pose a problem: there is now the set $\Omega'$ to search on, looking for the optimum of $f|_{\Omega'}$. Though the new solution will taste worse than the original one, the new target is the solution of the new restricted problem.

Not so for DEM: "If, however, the environment is constrained in a negative way, the target may never be found even if it was available prior to the alteration."

That is the great difference between the problems which all other scientists discuss and the ones of DEM: DEM have decoupled the optimum of the function and the target, arriving at quite another version of a search problem:

DEM's Search Problem
given:
a set $\Omega$
an ordered set $\mathcal{Y}$
a target $T \subset \Omega$ and
a function $f: \Omega \rightarrow \mathcal{Y}$
find $x \in T$

The Problems with DEM's Search Problem

First there is of course the problem of applicability: it is not clear how any of DEM's results is relevant for the problems in the table as those concern fundamentally different problems.

Then there is a problem of procedure: for an algorithm for a search or optimization, generally some information about $\Omega$ is given and (a finite number of) values of $f$ can be obtained. If $T$ is independent of $f$, how is it ever possible to say that a target was hit? This additional information can only be given ex cathedra afterwards!

Not every one of the search algorithms stated in the table will always identify the target, but in many cases, this is possible - at least theoretically: if possible, an exhaustive search will always give you the target. Not so for DEM: even if you have calculated $f$ for all elements of $\Omega$ and found the optimum, this does not have to be the intended target which still has to be revealed.
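The difference can be made concrete with a small Python sketch. Everything in it is hypothetical: the place names and elevations are invented, loosely following the armadillo story, and the target is declared independently of $f$, as in DEM's version of the problem.

```python
def minimisers(omega, f):
    """Return all elements of a finite omega at which f is minimal."""
    values = {x: f(x) for x in omega}
    best = min(values.values())
    return {x for x, y in values.items() if y == best}

# Invented elevations (in metres) for a tiny habitat.
elevation = {"hilltop": 30, "meadow": 12, "burrow": 5, "ditch": 2}
omega = list(elevation)
f = elevation.get

# DEM-style target: given separately, not derived from f.
target = {"burrow"}

found = minimisers(omega, f)   # everything an exhaustive search can learn from f
hit = bool(found & target)     # can only be judged by someone who knows the target

print(found, hit)
```

The exhaustive search has evaluated $f$ on all of $\Omega$ and returns the ditch; whether that counts as a success cannot be decided from $f$ alone.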

Why do DEM use their definition?

I would like to answer this question using Dawkins's weasel. Then

$\Omega$ is the set of strings consisting of 28 letters chosen from the alphabet ABCDEFGHIJKLMNOPQRSTUVWXYZ plus * as a sign indicating a space
$T=$METHINKS*IT*IS*LIKE*A*WEASEL
$f$ is given by the number of correct letters - a number from 0 to 28.

Imagine someone has programmed an algorithm using $f$ which will find the target in 100% of all runs. The big question: How will it fare for the target string I*REALLY*DO*NOT*LIKE*WEASELS?

My answer would be "fantastic": if I*REALLY*DO*NOT*LIKE*WEASELS is the target, then it is the optimum of $f$, so $f$ for finding this phrase is the number of common letters with I*REALLY*DO*NOT*LIKE*WEASELS...
DEM's answer would be "abysmal": though the target is I*REALLY*DO*NOT*LIKE*WEASELS, $f$ is still defined as the number of common letters with METHINKS*IT*IS*LIKE*A*WEASEL. The algorithm would result in METHINKS*IT*IS*LIKE*A*WEASEL.
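The difference between the two answers can be sketched in a few lines of R. This is only an illustration: the helper names matches, f.coupled and f.dem are hypothetical, and only the two phrases and the idea of counting correct letters come from the text above.

```r
# Hypothetical helper names; only the two phrases and the letter-counting f are from the post.
weasel  <- "METHINKS*IT*IS*LIKE*A*WEASEL"
dislike <- "I*REALLY*DO*NOT*LIKE*WEASELS"

# number of positions at which the candidate x agrees with a reference phrase
matches <- function(x, ref) {
  sum(strsplit(x, "")[[1]] == strsplit(ref, "")[[1]])
}

# usual setting: the target is the optimum of f, so f changes with the target
f.coupled <- function(x) matches(x, dislike)

# DEM's setting: the target changes, but f still rewards the old phrase
f.dem <- function(x) matches(x, weasel)

f.coupled(dislike)  # 28: the new target is the optimum of its own f
f.dem(weasel)       # 28: the optimum of the unchanged f is still the old phrase
f.dem(dislike)      # 2: the new target scores poorly under the old f
```

Any algorithm that climbs f.dem is pulled towards the old phrase, whatever the declared target may be.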

The advantage for DEM is stated on p. 173 "We note, however, the choice of an algorithm along with its parameters and initialization imposes a probability distribution over the search space." Indeed, it does in their case - and it will not work with my definition. This probability distribution may appear absolutely counterintuitive to any practitioner of optimization problems, but it is the basic building block for many of DEM's most important results.

How does DEM's search problem work for evolution?

Some interesting characters make a cameo in DEM's textbook: not only Bob and Monica, the pirates X, Y, and Z, Melody and Maverick, but also God and Abraham. In this spirit I would like to invent a dialogue between God and Darwin's Bulldog:

Bulldog:	"The horse is a marvellous creature: fully adapted to its niche, really, survival of the fittest at play"
God:	"Oh no, that one is a total disaster - it may function better than any other creature in its environment, but I was aiming for pink unicorns"

In short: I think that DEM's model does not work for the usual optimization and search problems in mathematics. It is even worse as a model applied to the real world.

Perhaps these are all strawmen?

It could be that I have erected an elaborate strawman, and that the search problem which I attributed to DEM has nothing to do with their ideas. In this case, it should be easy for DEM - or their apologists - to come forward with their definition. Or perhaps - if I am right - they may just wish to explain why their model is not horrible.

1. Still thankful for the nice header, Tom!
2. Obviously, the work is not completed yet. I will look up more in the future - and I will be grateful for any contribution to this project!
3. David H. Wolpert, William G. Macready, "No Free Lunch Theorems for Optimization", IEEE Transactions on Evolutionary Computation, Vol. 1, No. 1, April 1997, p. 68

Prof. Marks gets lucky at Cracker Barrel

2018-01-18T05:13:00.002-08:00

Classification: Engineering mathematics. Engineering analysis. (TA347)
Subjects: Evolutionary computation. Information technology–Mathematics.¹

Yesterday, I was looking through "Introduction to Evolutionary Informatics" again when I spotted the Cracker Barrel puzzle in section 5.4.1.2, "Endogenous information of the Cracker Barrel puzzle" (p. 128). The rules of this variant of triangular peg solitaire are described in the text (or can be found in Wikipedia's article on the subject). The humble authors then describe a simulation of the game to calculate how probable it is to solve the puzzle using moves at random:

A search typically requires initialization. For the Cracker Barrel puzzle, all of the 15 holes are filled with pegs and, at random, a single peg is removed. This starts the game. Using random initialization and random moves, simulation of four million games using a computer program resulted in an estimated win probability p = 0.0070 and an endogenous information of $$I_\Omega = -\log_2 p = 7.15 \text{ bits}.$$ Winning the puzzle using random moves with a randomly chosen initialization (the choice of the empty hole at the start of the game) is thus a bit more difficult than flipping a coin seven times and getting seven heads in a row.

Naturally, I created such a simulation in R for myself: I encoded all thirty-six moves that could occur in a matrix cb.moves, each row indicating the jumping peg, the peg which is jumped over, and the place on which the peg lands. And here is the little function which simulates a single random game:

    cb.simul <- function(pos){
      # pos: boolean vector of length 15 indicating position of pegs
      # a move is allowed if there is a peg at the start position & on the field which is
      # jumped over, but not at the final position
      allowed.moves <- pos[cb.moves[,1]] & pos[cb.moves[,2]] & (!pos[cb.moves[,3]])
      # if no move is allowed, return number of pegs left
      if(sum(allowed.moves)==0) return(sum(pos))
      # otherwise, choose an allowed move at random
      number.of.move <- ((1:36)[allowed.moves])[sample(1:sum(allowed.moves),1)]
      pos[cb.moves[number.of.move,]] <- c(FALSE,FALSE,TRUE)
      return(cb.simul(pos))
    }
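The matrix cb.moves itself is not listed in this post. The following is a minimal sketch of how such a matrix could be built. It assumes that the holes are numbered 1 to 15 row by row, starting at the tip of the triangle. The helpers idx and on.board and the table dirs are hypothetical names used only for this illustration.

```r
# Assumption: holes are numbered row by row from the tip, so the hole in
# row r, column k (1 <= k <= r <= 5) has the number r*(r-1)/2 + k.
idx <- function(r, k) r * (r - 1) / 2 + k
on.board <- function(r, k) r >= 1 && r <= 5 && k >= 1 && k <= r

# the six directions in which a peg can jump on a triangular board
dirs <- rbind(c(0, 1), c(0, -1), c(1, 0), c(-1, 0), c(1, 1), c(-1, -1))

cb.moves <- NULL
for (r in 1:5) for (k in 1:r) for (d in 1:6) {
  r2 <- r + 2 * dirs[d, 1]   # landing row
  k2 <- k + 2 * dirs[d, 2]   # landing column
  if (on.board(r2, k2)) {
    # row of the matrix: jumping peg, peg jumped over, landing place
    cb.moves <- rbind(cb.moves,
                      c(idx(r, k), idx(r + dirs[d, 1], k + dirs[d, 2]), idx(r2, k2)))
  }
}

nrow(cb.moves)  # 36 moves in total
```

With this numbering, the three corners of the board are the holes 1, 11 and 15.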

I ran the simulation 4,000,000 times, changing the start position at random. But as a result, my estimated win probability was $p_e=0.0045$ - only two thirds of the number in the text. How can this be? Why were Prof. Marks et al. so much luckier than I? I re-ran the simulation, checked the code, washed, rinsed, repeated: no fundamental change. So, I decided to take a look at all possible games and at the probability with which they occur. The result was this little routine:

    cb.eval <- function(pos, prob){
      # pos: boolean vector of length 15 indicating position of pegs
      # prob: the probability with which this state occurs
      # a move is allowed if there is a peg at the start position & on the field which is
      # jumped over, but not at the final position
      allowed.moves <- pos[cb.moves[,1]] & pos[cb.moves[,2]] & (!pos[cb.moves[,3]])
      if(sum(allowed.moves)==0){
        # end of a game: prob now holds the probability that this game is played
        nr.of.pecks <- sum(pos)  # number of remaining pegs
        # count this game under its number of remaining pegs (global variable)
        cb.number[nr.of.pecks] <<- cb.number[nr.of.pecks]+1
        # the probability of this game is added to the appropriate place of the global variable
        cb.prob[nr.of.pecks] <<- cb.prob[nr.of.pecks] + prob
        return()
      }
      for(k in 1:sum(allowed.moves)){
        # moves are still possible, for each move the next stage will be calculated
        d <- pos
        number.of.move <- ((1:36)[allowed.moves])[k]
        d[cb.moves[number.of.move,]] <- c(FALSE,FALSE,TRUE)
        cb.eval(d,prob/sum(allowed.moves))
      }
    }

I now calculated the probabilities for solving the puzzle for each of the fifteen possible starting positions. Averaged over the starting positions, the result was $$p_s=0.0045 .$$ This fits my simulation, but not the one of our esteemed and humble authors! What had happened?
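For readers who want to check the exact value without walking through every single game, here is a self-contained sketch of a different technique: it stores the win probability of each board position once and reuses it (memoisation), instead of enumerating all games. The numbering of the holes (row by row from the tip) and all names in the block (idx, dirs, moves, memo, win.prob, p.start) are assumptions made for this illustration, not the code used for the numbers in this post.

```r
# Assumed numbering: holes 1..15 row by row from the tip of the triangle
idx <- function(r, k) r * (r - 1) / 2 + k
dirs <- rbind(c(0, 1), c(0, -1), c(1, 0), c(-1, 0), c(1, 1), c(-1, -1))
moves <- NULL
for (r in 1:5) for (k in 1:r) for (d in 1:6) {
  r2 <- r + 2 * dirs[d, 1]; k2 <- k + 2 * dirs[d, 2]
  if (r2 >= 1 && r2 <= 5 && k2 >= 1 && k2 <= r2)
    moves <- rbind(moves, c(idx(r, k), idx(r + dirs[d, 1], k + dirs[d, 2]), idx(r2, k2)))
}

# memo[code] caches the win probability of a position, where
# code = 1 + sum of 2^(i-1) over all holes i which hold a peg
memo <- rep(NA_real_, 2^15)

win.prob <- function(pos) {
  code <- sum(2^(which(pos) - 1)) + 1
  if (!is.na(memo[code])) return(memo[code])
  allowed <- pos[moves[, 1]] & pos[moves[, 2]] & !pos[moves[, 3]]
  if (!any(allowed)) {
    p <- as.numeric(sum(pos) == 1)   # the game is won if exactly one peg is left
  } else {
    p <- 0
    for (m in which(allowed)) {
      nxt <- pos
      nxt[moves[m, ]] <- c(FALSE, FALSE, TRUE)
      p <- p + win.prob(nxt) / sum(allowed)   # every allowed move is equally likely
    }
  }
  memo[code] <<- p
  p
}

# win probability for each of the fifteen possible empty holes
p.start <- sapply(1:15, function(h) {
  pos <- rep(TRUE, 15)
  pos[h] <- FALSE
  win.prob(pos)
})
mean(p.start)   # overall probability for a uniformly chosen starting hole
```

Because each position is evaluated only once, this runs in a moment, and p.start lists the probability for every starting hole separately.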

An educated guess

I found it odd that the authors ran 4,000,000 simulations - 1,000,000 or 10,000,000 seem to be more commonly used numbers. But when you look at the puzzle, you see that it was not necessary for me to look at all fifteen possible starting positions - whether the first peg is missing in position 1 or position 11 does not change the nature of the game: you could rotate the board and perform the same moves. Using symmetries, you find that there are only four essentially different starting positions: the black, red, and blue groups with three positions each, and the green group with six positions. For each group, you get a different probability of success:

group	black	green	red	blue
prob. of choosing this group	.2	.4	.2	.2
prob. of success	.00686	.00343	.00709	.001726
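As a cross-check, weighting the probabilities of success in the table by the probability of choosing each group reproduces the overall value: $$0.2 \cdot 0.00686 + 0.4 \cdot 0.00343 + 0.2 \cdot 0.00709 + 0.2 \cdot 0.001726 \approx 0.0045 .$$ Expressed as endogenous information, this is $-\log_2 0.0045 \approx 7.80$ bits instead of the 7.15 bits given in the book.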

One quite obvious explanation for the result of the authors is that they did not run one simulation using a random starting position 4,000,000 times, but simulated the game 1,000,000 times for each of the four groups. Unfortunately, they then either did not combine their results, but took only the result of the black or the red group (or both), or they only thought they switched starting positions from one group of simulations to the next, but in fact always used the black or the red one.

Is it a big deal?

It is easily corrected: instead of "For the Cracker Barrel puzzle, all of the 15 holes are filled with pegs and, at random, a single peg is removed." they could write "For the Cracker Barrel puzzle, all of the 15 holes are filled with pegs and one peg at the tip of the triangle is removed." If the book were actually used as a textbook, the simulation of the Cracker Barrel puzzle would be an obvious exercise. I doubt that it is used that way anywhere, so no pupils were annoyed. It is somewhat surprising that such an error occurs: it seems that the program was written by a single contributor and not checked. That seems to have been the case in previous publications, too. Perhaps the authors thought that the program was too simple to be worthy of their full attention - and the more complicated stuff is properly vetted. OTOH, it could be a pattern.... Well, it will certainly be changed in the next edition.

UD in 2017

2018-01-08T15:46:00.001-08:00

Just a few pics:

A letter to Winston Ewert

2017-07-17T16:22:00.001-07:00

Winston Ewert, William Dembski, and Robert Marks have written a new book, "Introduction to Evolutionary Informatics". Fair to say, I do not like it very much - so I wrote a letter to Winston Ewert, the most accessible of the "humble authors"...

Dear Winston,
congratulations on publishing your first book! It took me some time to get to read it (though I'm always interested in the output of the Evo Lab). Over the last couple of weeks I've discussed your oeuvre on various blogs. I assume that some of you are aware of the arguments at UncommonDescent and TheSkepticalZone, but as those are not peer-reviewed papers, the debates may have been ignored. Fair to say, I'm not a great fan of your new book. I'd like to highlight my problems by looking into two paragraphs which irked me during the first reading: In your section about "Loaded Die and Proportional Betting", you write on page 77:

The performance of proportional betting is akin to that of a search algorithm. For proportional betting, you want to extract the maximum amount of money from the game in a single bet. In search, you wish to extract the maximum amount of information in a single query. The mathematics is identical.

This is at odds with the previous paragraphs: proportional betting doesn't optimize a single bet, but a sequence of bets - as you have clearly stated before. I'm well aware of Cover's and Thomas's "Elements of Information Theory", but I fail to see how their chapter on "Gambling and Data Compression" is applicable to your idea of a search. I tried to come up with an example, but if I have to search two equally sized subsets $\Omega_1$ and $\Omega_2$, and the target is to be found in $\Omega_1$ with a probability bigger than to be found in $\Omega_2$, proportional betting isn't the optimal way to go! Does proportional betting really extract the maximum of information in a single guess?
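A small worked example of this, under assumptions of my own ($p$ denotes the probability that the target lies in $\Omega_1$, and a guess merely names one of the two subsets): guessing $\Omega_1$ outright succeeds with probability $p$, while guessing each subset in proportion to its probability succeeds with probability $p^2+(1-p)^2$. For $p > 1/2$, $$p - \left(p^2 + (1-p)^2\right) = (1-p)(2p-1) > 0,$$ so the proportional guess is strictly worse. For the illustrative value $p = 0.7$, it succeeds with probability $0.49 + 0.09 = 0.58$ instead of $0.7$.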

Then there is this following paragraph on page 173:

One’s first inclination is to use an S4S search space populated by different search algorithms such as particle swarm, conjugate gradient descent or Levenberg-Marquardt search. Every search algorithm, in turn, has parameters. Search would not only need to be performed among the algorithms, but within the algorithms over a range of different parameters and initializations. Performing an S4S using this approach looks to be intractable. We note, however, the choice of an algorithm along with its parameters and initialization imposes a probability distribution over the search space. Searching among these probability distributions is tractable and is the model we will use. Our S4S search space is therefore populated by a large number of probability distributions imposed on the search space.

Identifying/representing/translating/imposing a search and a probability distribution is central to your theory. It's quite disappointing that you are glossing over it in your new book! While you generally give a quite extensive bibliography, it is surprising that you do not quote any mechanism which translates the algorithm into a probability distribution.

Therefore I do not know whether you are thinking about the mechanism as described in "Conservation of Information in Search: Measuring the Cost of Success": this one results in every exhaustive search finding its target. Or are you talking about the "representation" in "A General Theory of Information Cost Incurred by Successful Search": here, all exhaustive searches will do on average at best as well as a single guess (and yes, I think that this is counter-intuitive). As you are talking about $\Omega$ and not any augmented space, I suppose you have the latter in mind...

But if two of your own "representations" result in such a difference between probabilities ($1$ versus $1/|\Omega|$), how can you be comfortable with making such a wide-reaching claim like "each search algorithm imposes a probability distribution over the search space" without further corroboration? Could you - for example - translate the damping parameters of the Levenberg-Marquardt search into such a probability distribution? I suppose that any attempt to do so would show a fundamental flaw in your model: the separation between the optimum of the function and the target....

I'd appreciate it if you could address my concerns - at UD, TSZ, or my blog.

Thanks,
Yours Di$\dots$ Eb$\dots$

P.S.: I have to add that I find the bibliographies quite annoying: why can't you add the number of the page if you are citing a book? Sometimes the terms which are accompanied by a footnote cannot be found at all in the given source! It is hard to imagine what the "humble authors" were thinking when they sent their interested readers on such a futile search!

Some Pies for "The Skeptical Zone"

2016-02-02T02:53:00.000-08:00


In 2015, some 45,000 comments were made at The Skeptical Zone. Here are the top ten of the commentators (just a quantitative, not a qualitative judgement.) I'll stick to the color scheme for all of the figures in this post...

"The Skeptical Zone" has a handy "reply to"-feature, which allows you to address a previous comment (with or without inline quotation.) It is used to varying degrees - and though some don't use it at all, nearly 50% of all comments were replies.

While the previous figure showed who made replies, this one shows who receives them.

Editors at "The Skeptical Zone" are also allowed to make postings and create new threads.

How popular are these threads? Here is the number of comments editors gathered with their threads.

Quite another question: A comment can be a short remark, a well-thought-out argument, or just an orgy of copying-and-pasting. How much text did the commentators write? Here is the length of the plain texts given in the comments - again, just a quantitative, not a qualitative deliberation.

This figure gives an impression of how many comments were attracted over time by threads sorted by the editors who had created them.

And here is the network of those who created - or received - at least 50 replies.

"Uncommon Descent" and "The Skeptical Zone" in 2015

2016-01-27T04:55:00.002-08:00

Since 2005, Uncommon Descent (UD) - founded by William Dembski - has been the place to discuss intelligent design. Unfortunately, the moderation policy has always been one-sided (and quite arbitrary at the same time!) Since 2011, the statement "You don't have to participate in UD" is no longer answered with gritted teeth only, but with a real alternative: Elizabeth Liddle's The Skeptical Zone (TSZ). So, how were these two sites doing in 2015?

Number of Comments 2005 - 2015

year	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014	2015
UD	8,400	23,000	22,400	23,100	41,100	24,800	41,400	28,400	42,500	53,700	53,100
TSZ	-	-	-	-	-	-	2,200	15,100	16,900	20,400	45,200

In 2015, there were still 17% more comments at UD than at TSZ.

Though UD is still going strong, there is a slight downward trend (yellow line) in the daily number of comments.

The upward trend at TSZ is much stronger, but is fuelled by the very weak participation in the first couple of months of 2015. This can be seen when comparing the number of comments on a monthly basis, too:

There are many ways in which the two sites interact with each other: the editors on both blogs may react to the same event, raising the number of comments on both sites. Or an editor, disgruntled with one site, may take his energy to the other one. Overall there is a slightly negative correlation (adj. R²=.256) between the number of comments per week:

There is one big difference between both sites: the number of posts. On TSZ, there have been 265 threads with comments, while this number was 1741 at UD (there were another 200 without any comments). Therefore, the number of comments per thread is smaller at UD than at TSZ:

At UD, most of the posts (16% or 271 out of 1741) gathered between five and eight comments (or 1,700 - 3.2% - of the 53,100 total comments in 2015), while at TSZ, most of the threads (20% or 56 out of 265) have between 65 and 128 comments (or 5,000 - 11% - of the 45,200 comments).
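The averages behind this comparison follow directly from the totals given above. A short sketch in R (the variable names are mine, the numbers are the ones stated in this post):

```r
comments <- c(UD = 53100, TSZ = 45200)  # comments in 2015
threads  <- c(UD = 1741,  TSZ = 265)    # threads with at least one comment

# average number of comments per thread
per.thread <- round(comments / threads, 1)
per.thread   # UD: 30.5, TSZ: 170.6

# by how many percent the number of comments at UD exceeds that at TSZ
ud.surplus <- round(100 * (comments[["UD"]] / comments[["TSZ"]] - 1))
ud.surplus   # 17
```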

This difference is shown in this stream of comments. With the notable exception of the thread Mystery at the Heart of Life, even the busiest posts aren't active for longer than a month at UD:

In fact, on average, an article at UD will get comments over a period of 5.3 days. This average is 23.7 days for TSZ. Certainly eternal threads like Moderation Rules and Noyau play a role here, but other mainly philosophical topics are discussed over long periods of time, too.

My personal favourites of 2015 unfortunately got very few comments: Winston Ewert's offer to Ask Dr. Ewert at UD, Tom English's excellent reply A Question for Winston Ewert at TSZ, and then Dr. Ewert Answers, again at UD - which were commented on less than eighty times in total. I had hoped for a discussion about the mathematical aspects of Intelligent Design (see my posts). Unfortunately, the design side didn't show any interest in anything other than a token interaction. Another chance missed.

Note: UD and TSZ both use WordPress, so they should have numerous ways to get statistics for their sites. I could look only from the outside, crawling the threads and comments. Though I'm fairly sure that I got all the visible data, I cannot guarantee that I paint the real picture absolutely accurately.

The "Discovery Institute" trembles before the mighty powers of DiEbLog!

2016-01-26T06:37:00.000-08:00

Just kidding. It isn't. But they published some of the pages the absence of which I had criticized in my previous post: John G. West wrote an article on Dennis Prager Was Right: Atheists Are More Open-Minded on ID than Some United Methodist Officials, in which he included further pages from the poll which the Discovery Institute (DI) had ordered on the subject of being snubbed by the United Methodist Church.

I assume that this little blog mainly flies under the RADAR of the DI, but they most probably follow astutely the very amusing Sensuous Curmudgeon, where I raised the problem earlier.

So, as I had guessed, there was a question Q9 regarding the religious beliefs of the participants of the study. Why did the DI need an extra day to put a spin on the answers to this question? Did they think it to be especially juicy, so that they were able to get yet another article from it? Or were they annoyed that one third of the participants of the poll identified themselves as agnostics or atheists?

Let's wait for Q8 - the question about the level of education. Perhaps some scientists named Steve were involved; that result could be unpleasant...

OMG - The Discovery Institute is Committing Censorship!!!11!!1!

2016-01-26T01:11:00.001-08:00

Does the Discovery Institute (DI) want to keep its much coveted Censor of the Year Award for itself this year?

If you are interested in this kind of thing, you will have noticed the tantrum John G. West and his friends are collectively throwing over at Evolution News & Views (EN&V) because they were somewhat rebuffed by the United Methodist Church (UMC). Here is some background as it presents itself to me (EN&V's viewpoint may differ): The UMC holds its General Conference once every four years. In May 2016, it will take place at the Oregon Convention Center. Sponsors and exhibitors may rent booths at the center to present themselves to the estimated 6,500 participants of the event. The DI was willing to pay the 900 to 1,200 dollar fee to become an exhibitor, but their application was turned down. There may have been various problems, but unfortunately for them, it did not seem to match the fourth criterion for eligibility:

Proven Business Record: Purchasers must have a proven business record with their products/services/resources. Exhibits are not to provide a platform to survey or test ideas; rather, to provide products/services/resources which are credible and proven.

It is fair to say that the DI has not recovered from this blow yet - over the last eight days, at least fourteen articles have been published on this matter at EN&V. One of the highlights was this New Poll: Most Americans Turn Thumbs Down on United Methodist Ban on Intelligent Design: The DI spent the money it had saved on the booth to have a survey performed by SurveyMonkey. It asked:

The United Methodist Church recently banned a group from renting an information table at the Church’s upcoming general conference because the group supports intelligent design—the idea that nature is the product of purposeful design rather than an unguided process. Some have criticized the ban as contrary to the United Methodist Church’s stated commitment to encourage “open hearts, open minds, open doors.” Rate your level of agreement or disagreement with the following statements:

1. The United Methodist Church should not have banned an intelligent design group from renting an information table at its conference.

2. The United Methodist Church’s ban on the intelligent design group seems inconsistent with the Church’s stated commitment to encourage “open hearts, open minds, open doors.”

What surprised me: though the question was obviously leading, still 30% didn't agree with the first statement and 22% didn't agree with the second one! Or, as the DI describes it:

More than 70% of the 1,946 respondents to the nationwide survey agreed that “the United Methodist Church should not have banned an intelligent design group from renting an information table at its conference.” More than 78% of respondents agreed that “the United Methodist Church’s ban on the intelligent design group seems inconsistent with the Church’s stated commitment to encourage ‘open hearts, open minds, open doors.’”

But here is the catch: Though EN&V announced that the "full report" can be downloaded from here, it is obvious from the pagination that at least two pages are missing!

Enter panic mode: OMG! The Discovery Institute is censoring its report! What are they covering up? Are they beating puppies? Like Darwin! They should get their own Censorship Award!!!!11!!1

The truth is a little bit less sinister: SurveyMonkey asks you about your age (Q11), your gender (Q12), your income (Q13), your party affiliation (Q10) and the region you are living in (Q14). What is surprisingly missing are questions about your religious orientation and your education. These two characteristics are of obvious interest for a poll like this one - so, I am guessing that the questions Q8 and Q9 were about these matters. Maybe the results did not please the DI and thus were omitted from the final report.

Edit: Instead of trying to claim that it was meant to be ironic, I just corrected an embarrassing spelling mistake in the headline...

Uncommon Descent in Numbers - 2nd edition

2015-05-31T03:01:00.000-07:00

Three years ago, I put up some pictures showing the number of comments and threads at Uncommon Descent. Now seems to be a good occasion to update some of this information.

1. Google Trends

Look for yourself: The phrase Uncommon Descent was most searched for in 2008. After that, everybody had bookmarked the site, so further googling became unnecessary. The same holds true for The Panda's Thumb - both sites are equally popular...

2. Threads per Month

The number of new threads per month peaked in 2011, but is still on a high level - though it seems to be decreasing. What makes all the difference is "News" - a.k.a. Denyse O'Leary - adding her news items. While in 2011/2012, those often were left uncommented, since 2013, they attract the attention of her fellow editors (though I got the impression that some commentators use them for their off-topic-remarks, while others just cannot let the copious factual inaccuracies stand uncommented.)

3. Threads per Author and Year

Over the last four-and-a-half years, Denyse O'Leary contributed the majority of new threads (as "O'Leary" and "News"). Cornelius Hunter uses Uncommon Descent regularly to raise attention for his blog, while the president and chief enforcer Barry Arrington delights us more and more with his insights.

4. Edits per month

The public interest in Uncommon Descent may be decreasing, but the interest in debate isn't. It peaked in Nov 2014 with nearly 9,300 comments in a single month, discussing topics like An attempt at computing dFSCI for English language, HeKS suggests a way forward on the KS “bomb” argument, and Evolution driven by laws? Not random mutations?. This spike was probably a result of the general amnesty, which allowed free contribution without throttling by the moderation queue (see next section.)

5. Editors per month

In Oct 2014, Barry Arrington announced a general amnesty for all banned editors, a step which perhaps didn't increase the number of commentators per month as much as hoped. Furthermore, the policy was quickly (and silently) revoked, and the banning returned to a "normal" level.

6. Mathematics at Uncommon Descent

Uncommon Descent was founded by William A. Dembski, the "Isaac Newton of Information Theory". Though it is the premier blog in favour of intelligent design, there isn't much mathematics happening over there. One practical reason for this is that not only is there no $\LaTeX$ extension, but Uncommon Descent doesn't allow anything but ASCII in the comments: even an Ω will be replaced by a "?" when the comment appears, and basic HTML tags like <sup></sup> or <sub></sub> cannot be used either. But there aren't any mathematicians in the current list of authors, either - though, when William A. Dembski edited Uncommon Descent regularly, even he addressed questions of a mathematical nature very seldom. Hopefully, this will change with Dr. Winston Ewert...

7. Personal Note

I started editing Uncommon Descent in 2008, and have contributed some 500 edits. For most of the time, I tried to contribute to the mathematical aspects of intelligent design. You have to be quite determined to do so: until Barry Arrington's general amnesty, my comments didn't appear directly, but had to be vetted by one of the moderators - a process which could take days! What it took to get even some indisputable facts to be recognized by the "other side" can be seen in this thread: Evolutionary Informatics Lab website receives facelift... Currently, I'm blocked: I had asked about the disappearance of the numerous comments of Aurelio Smith. I did so three times in a row, as I thought it was a technical glitch which made my question disappear - but it turned out to be design, or better: the will of the designer. Perhaps it is fitting that this is my last conversation at Uncommon Descent:

8. Shout-out to kairosfocus

The crime that got me blocked was asking about Aurelio Smith's comments. I know that your line of reasoning is "He was blocked, therefore he must be guilty of a nefarious crime", but as so often, you are wrong.

Update: I wrote an email to Barry Arrington, linking to my blog and telling him that I'd like to interact with Winston Ewert on Uncommon Descent. Shortly after, Barry Arrington informed me that my email address had been taken off the block list.

The Natural Probability on M(Ω)

2015-05-25T00:59:00.001-07:00

Two weeks ago, Dr. Winston Ewert announced a kind of open mike at Uncommon Descent. He put up a page at Google Moderator and asked for questions. Unfortunately, not many took advantage of this offer, but I added three questions off the top of my head. The experience made me revisit the paper A General Theory of Information Cost Incurred by Successful Search, and when I tried - as usual - to construct simple examples, I ran into further questions - so, here is another one:

In their paper, the authors W. Dembski, W. Ewert, and R. Marks (DEM) talk about something they call the natural probability:

Processes that exhibit stochastic behavior arise from what may be called a natural probability. The natural probability characterizes the ordinary stochastic behavior of the process in question. Often the natural probability is the uniform probability. Thus, for a perfect cube with distinguishable sides composed of a rigid homogenous material (i.e., an ordinary die), the probability of any one of its six sides landing on a given toss is 1/6. Yet, for a loaded die, those probabilities will be skewed, with one side consuming the lion’s share of probability. For the loaded die, the natural probability is not uniform.

This natural probability on the search space translates through their idea of lifting to the space of measures $\mathbf{M}(\Omega)$:

As the natural probability on $\Omega$, $\mu$ is not confined simply to $\Omega$ but lifts to $\mathbf{M}(\Omega)$, so that its lifting, namely $\overline{\mu}$, becomes the natural probability on $\mathbf{M}(\Omega)$ (this parallels how the uniform probability $\mathbf{U}$, when it is the natural probability on $\Omega$, lifts to the uniform probability $\overline{\mathbf{U}}$ on $\mathbf{M}(\Omega)$, which then becomes the natural probability for this higher-order search space).

As usual, I look at an easy example: a loaded coin which always shows heads. So $\Omega=\{H,T\}$ and $\mu=\delta_H$ is the natural measure on $\Omega$. What happens on $\mathbf{M}(\Omega)= \{h\cdot\delta_H + t\cdot\delta_T|0 \le h,t \le 1; h+t=1 \}$? Luckily, $$(\mathbf{M}(\{H,T\}),\mathbf{U}) \cong ([0,1],\lambda).$$ Let's jump through the hoops:

The Radon-Nikodym derivative of $\delta_H$ with respect to $\mathbf{U}$ is $f(H) = \frac{d\delta_H}{d\mathbf{U}}(H) = 2$, $f(T) = \frac{d\delta_H}{d\mathbf{U}}(T) = 0$
Let $\theta \in \mathbf{M}(\{H,T\})$, i.e., $\theta= h\delta_H + t\delta_T$. Then$$\overline{f}{(\theta)} = \int_{\Omega} f(x)d\theta(x)$$ $$=f(H)\cdot\theta(\{H\}) + f(T) \cdot\theta(\{T\})$$ $$=2 \cdot h$$

Here, I have the density of my natural measure on $\mathbf{M}(\Omega)$ with regard to $\overline{\mathbf{U}}$, $$d\overline{\delta_H}(h\cdot\delta_H + t\cdot\delta_T) = 2 \cdot h \cdot d\overline{\mathbf{U}}(h\cdot\delta_H + t\cdot\delta_T).$$ But what is it good for? For the uniform probability, DEM showed the identity $$\mathbf{U}=\int_{\mathbf{M}(\Omega)}\theta d\overline{\mathbf{U}} .$$ Unfortunately, for $\int_{\mathbf{M}(\Omega)}\theta d\overline{\delta_H}$, I get nothing similar: $$\int_{\mathbf{M}(\Omega)}\theta d\overline{\delta_H} = \frac{2}{3}\delta_H + \frac{1}{3}\delta_T$$
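The last integral can be checked by hand. Under the identification of $\mathbf{M}(\{H,T\})$ with $[0,1]$ used above (so that $\theta = h\cdot\delta_H + (1-h)\cdot\delta_T$ and $\overline{\mathbf{U}}$ corresponds to the Lebesgue measure $\lambda$), the density $2h$ gives $$\int_{\mathbf{M}(\Omega)}\theta d\overline{\delta_H} = \int_0^1 \left(h\cdot\delta_H + (1-h)\cdot\delta_T\right) 2h \, dh = \left(\int_0^1 2h^2 \, dh\right)\delta_H + \left(\int_0^1 2h(1-h) \, dh\right)\delta_T = \frac{2}{3}\delta_H + \frac{1}{3}\delta_T .$$ With the constant density $1$ instead of $2h$, the same calculation yields $\frac{1}{2}\delta_H + \frac{1}{2}\delta_T = \mathbf{U}$, in line with DEM's identity for the uniform probability.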

So, again, what does this mean? Wouldn't the Dirac delta function be a more natural measure on $\mathbf{M}(\Omega)$?

I hope that Dr. Winston Ewert reacts to all of the questions before Google Moderator shuts down for good on June 30, 2015...

Five Years of "The Search for a Search"

2015-05-11T10:46:00.002-07:00

The Journal of Advanced Computational Intelligence and Intelligent Informatics published the paper The Search for a Search: Measuring the Information Cost of Higher Level Search by William A. Dembski and Robert J. Marks II (DM) in its July edition in 2010. With the five-year anniversary of the publication coming up, it seems appropriate to revisit a pet peeve of mine...

(Shell game performed on Karl-Liebknecht-Straße in Berlin, photograph by E.asphys)

Imagine a shell game. You have observed the con artist for a while, and now you know:

The pea ends up under each of the three shells (left, middle, and right) with the same probability, i.e., $$P(Pea=left)=P(Pea=middle)=P(Pea=right)=1/3$$
If the pea ends up under the left or the middle shell, you are able to track its way. So, in these cases, you will find the pea with probability 1 $$P(Finding\,Pea|Pea=left)=P(Finding\,Pea|Pea=middle)=1$$
However, if the pea ends up under the right shell, 999 times out of 1000 you make a mistake during your tracking and are convinced that it is under the left or the middle shell - the probability of finding this pea is 1/1000$$P(Finding\,Pea|Pea=right)=1/1000$$

You are invited to play the game. Should you use your knowledge (method $M_1$), or should you choose a shell at random (method $M_2$)? Let's calculate the average probability for finding the pea using your knowledge $$AM_1= P(Pea=left) \cdot P(Finding\,Pea|Pea=left)$$ $$+ P(Pea=middle) \cdot P(Finding\,Pea|Pea=middle)$$ $$+ P(Pea=right) \cdot P(Finding\,Pea|Pea=right)$$ $$AM_1 = \frac{1}{3} \cdot 1 + \frac{1}{3} \cdot 1 + \frac{1}{3} \cdot \frac{1}{1000} = \frac{2001}{3000} \approx \frac{2}{3} $$

What is the average probability when choosing a shell at random? As the pea is in the left, middle or right position with the same probability, we get $$AM_2 = \frac{1}{3} $$

So, $M_1$ or $M_2$? The answer seems to be obvious: you should stick to the first method, as it wins twice as often. Not so fast, say the Drs Dembski and Marks. You should calculate the average of the active information - the active entropy: $$H_1 = \frac{1}{3} \cdot \log_2 \frac{1}{1/3} + \frac{1}{3} \cdot \log_2 \frac{1}{1/3} +\frac{1}{3} \cdot \log_2 \frac{1/1000}{1/3} $$ $$= \log_2(\sqrt[3]{1 \cdot 1 \cdot \frac{1}{1000}}) - \log_2(\sqrt[3]{\frac{1}{3}\cdot \frac{1}{3} \cdot \frac{1}{3}})$$ $$=-\log_2(10) + \log_2(3)\approx -1.737$$ And their conclusion (p. 477): as on average the calculation results in negative active information, the search performance is rendered worse than random search. This is obviously false.

What went wrong? To calculate the overall performance, it is appropriate to use the arithmetic mean, as done for $AM_1$ and $AM_2$. By calculating the active entropy, you are de facto comparing the geometric means of your method and of the random method. The geometric mean favors equidistribution, thereby preferring the inferior random method over your more successful method, which tends to fail in only one case (pea under the third shell). This is a usual scenario: you often find algorithms which do well on most of the cases, but fail in some - often specially manufactured - instances.
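For those who like to see the numbers side by side, here is a small sketch that recomputes both kinds of mean for the shell game. The variable names are mine; the probabilities are the ones given in the list above.

```python
from math import log2, prod

p_pea = [1/3, 1/3, 1/3]           # pea under left, middle, right shell
p_find_m1 = [1.0, 1.0, 1/1000]    # M1: tracking the pea
p_find_m2 = [1/3, 1/3, 1/3]       # M2: picking a shell at random

def arithmetic_mean(p_find):
    """Overall probability of finding the pea."""
    return sum(w * q for w, q in zip(p_pea, p_find))

def geometric_mean(p_find):
    """Weighted geometric mean of the success probabilities."""
    return prod(q ** w for w, q in zip(p_pea, p_find))

am1, am2 = arithmetic_mean(p_find_m1), arithmetic_mean(p_find_m2)
gm1, gm2 = geometric_mean(p_find_m1), geometric_mean(p_find_m2)

# Active entropy: the average of log2(q1/q2), which equals log2(gm1/gm2).
active_entropy = sum(w * log2(q1 / q2)
                     for w, q1, q2 in zip(p_pea, p_find_m1, p_find_m2))

print(f"arithmetic means: M1 = {am1:.4f}, M2 = {am2:.4f}")
print(f"geometric means:  M1 = {gm1:.4f}, M2 = {gm2:.4f}")
print(f"active entropy:   {active_entropy:.3f} bits")
```

The arithmetic means rank $M_1$ above $M_2$, the geometric means rank them the other way round, and the active entropy is just the logarithm of the ratio of the geometric means.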

Conclusion: The average of the active information is not a good method to describe the performance of a search. If you think otherwise, maybe I can interest you in buying a bridge?

(edited to clarify the search process)

Conservation of Information in Evolutionary Search - Talk by William Dembski - part 5

2014-09-28T13:06:00.000-07:00

For an introduction to this post, take a look here. As I ended part 4 quite abruptly, this section starts in the middle of things....

Part 5: 45' 00" - 52' 50"

Topics: What is Conservation of Information? Example continued.

William Dembski: These tickets have probability 1/2, 1/2, 1/2, 1/2, and this one ticket has probability 1. If I happen to get this ticket, I have probability 1/2 of choosing curtain 1, but it is also probability 1/9 of getting that ticket. When you run the numbers, at the end of the day, by using these tickets, I'm not better off than I was originally. It is still only a probability of 1/3 of finding curtain 1, of finding the prize there. Once one factors in how did I limit myself to these tickets in the first place. Going from this whole space to this, that is information intensive. I have ruled out certain possibilities, that incurs an information cost. As I said, the cost is 5/9. It is really just an accounting thing. That is what conservation of information is. Once you factor in the information that it takes to get the search, get a search which has improved the probability for finding your original target, we haven't gained anything. It is called Conservation of Information, as the problem can even get worse. At this case, we have broken even, we are back to 1/3 for the probability of getting the prize, but let's say, you really want to improve the probability, you want to guarantee that you get that prize with these tickets. Well, then you have got only one ticket that will work for you.

William Dembski: This one. If you get this ticket, you are guaranteed to say "Curtain 1", and you get the prize behind it, but this is one of nine possible tickets. Once you have factored that in, your probability of doing a search for this ticket, and then, with this ticket, find the prize, ends up being 1/9, so you are actually going down. The reason that it is called conservation of information is that conservation is the best you can do, that you can break even. Often times, with these search-for-a-search spaces, they grow exponentially, and your probability of finding the target by going to the search-for-a-search ends up being worse than doing a blind search on the original space. Let me give you one last example, and then we can open this up for some questions. That example:

William Dembski: Find some buried treasure. You have this huge island which is very, very big, so that exhaustive search is impossible. The query limit is very small, there are only a few places that you can check. How do you find the treasure which is hidden inside? You go to a map room.

William Dembski: This map room is actually a bar in Cleveland, but let's imagine that it is a room with maps. What you are going to do is to find a map that has got an X marking where that treasure is. You have displaced the problem of finding the treasure to finding the map in the map room which will take you to the treasure. But how do you know that the map is the right map?

William Dembski: What if there are a lot of maps? And you are Rand McNally. For every place with an X mark, there will be another map with another X marked. The problem of finding the treasure on the island now becomes displaced to finding the search-for-the-search, finding the right map. And the problem is, when you try to represent this mathematically, the search-for-the-search is much less tractable than the original search problem, because, I skipped over a few slides, but these are actually theorems which we have proven on conservation of information. You represent the search-for-a-search, and you find that the information problem has actually intensified.

William Dembski: With the search-for-a-search, searches are as real as the things being searched. I think that is what the Darwinists like Richard Dawkins fail to recognize. By handing us a Darwinian search, when it works, it works because it has been carefully crafted, fine-tuned to work. That is what he bets on.

William Dembski: Let me just finally speak to the question that came up about targets. I had a correspondence with Dawkins. This goes back fourteen years, we have been playing with these ideas for a long time. I was challenging him on his METHINKS IT IS LIKE A WEASEL example, and he wrote: "In real life of course, the criterion for optimisation is not an arbitrarily chosen distant target but SURVIVAL. It's as simple as that. This is non-arbitrary." What is survival? In which context does survival happen? Let us say biology does have targets.

William Dembski: Actually, it is not that simple. The targets that biology presents us with are teleological systems/agents. If you will, the teleology of evolutionary search is to produce teleology. (James Shapiro might refer to these as systems that do their own "natural genetic engineering.") I'd say that even Dawkins' makes a tacit admission of targets in biological evolution.

William Dembski: This is also from his "Blind Watchmaker": "Complicated things have some quality, specifiable in advance, that is highly unlikely to have been acquired by random chance alone. In the case of living things, the quality that is specified in advance is ... the ability to propagate genes in reproduction." That is the [???], but it is still specified in advance, that is the teleology he even admits to. Let me give you one other statement of the conservation of information:

William Dembski: To increase the probability of success of a search from p to q requires a search for a search, where the higher level search incurs an information cost of at least p/q. This means that the probability of finding a search with probability of success q is no more than p/q, which in turn means that the probability of finding the original target by first finding the successful search and then applying that search is less than or equal to p.
The search-for-a-search requires that there is an information cost [???] This implies a regress. I can do a search for the search for the search and so on. At every point you have not [???] the probabilities. When you work everything out, when you do all the commutative operations [?], I have this search-for-a-search, I get this search, and with this search I get a certain probability to find the target. When you do that, and you can regress back as far as you want, the probabilities never get any better. If anything, the information cost does either stay constant or it becomes worse. Which then raises a question: if evolutionary processes, evolutionary search, is not able to create new information but only redistributes already existing information - that is what the conservation of information shows - what then is the ultimate source of that information? I just leave it with that. So, thank you, and we have got a few minutes for questions.

Here endeth the lesson. The following Q&A section was harder to understand, but I'll try my best - and I will think of some questions which should have been asked. But back to the example. Obviously,
$\frac{1}{9}$=$P("Choosing\,Curtain\,1"|"using\,the\,first\,ticket")\cdot P("using\,the\,first\,ticket") \le$ $P("Choosing\,Curtain\,1"|"using\,the\,first\,ticket")\cdot P("using\,the\,first\,ticket")$ + $ P("Choosing\,Curtain\,1"|"using\,another\,ticket")\cdot P("using\,another\,ticket")$ = $P("Choosing\,Curtain\,1")$=$\frac{1}{3}$
Again, just a thought: imagine nine guys, each one having one of the tickets. Will those who win the prize without holding ticket 1 be any less of a winner? Repeat the game a couple of times, and each time someone wins with his ticket, another contestant with the same ticket enters the game. After ten generations, you have more than 1000 holding ticket 1, more than 150 having another ticket mentioning curtain 1, and only 4 guys still having a ticket which always loses....
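The head counts in this thought experiment can be sketched as expected values. I am assuming here that the nine tickets are the ordered pairs $(i,j)$ with $i,j \in \{1,2,3\}$, that a ticket names each of its two entries with probability 1/2, that every holder plays once per round, and that each win adds exactly one new holder of the same ticket. Under these assumptions the expected number of holders of a ticket grows by the factor $1 + P(win)$ per round. The names in the code are mine.

```python
from fractions import Fraction
from itertools import product

ROUNDS = 10
# Assumed ticket model: the nine ordered pairs (i, j) over the three curtains.
tickets = list(product((1, 2, 3), repeat=2))

def p_win(ticket):
    """Probability that the ticket names curtain 1, where the prize is."""
    return Fraction(ticket.count(1), 2)

# Expected number of holders per ticket after ROUNDS rounds, starting from one.
expected = {t: (1 + p_win(t)) ** ROUNDS for t in tickets}

always = expected[(1, 1)]
sometimes = sum(n for t, n in expected.items() if p_win(t) == Fraction(1, 2))
never = sum(n for t, n in expected.items() if p_win(t) == 0)

print("ticket (1,1):              ", always)
print("other tickets with a 1:    ", float(sometimes))
print("tickets without curtain 1: ", never)
```

In expectation this gives $2^{10} = 1024$ holders of ticket 1, about 231 holders of the four tickets which name curtain 1 half of the time, and still only the four original holders of the losing tickets.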

Conservation of Information in Evolutionary Search - Talk by William Dembski - part 4

2014-09-28T07:33:00.000-07:00

For an introduction to this post, take a look here.

Part 4: 31' 25" - 45' 00"

(I had to pause at 45', there is such an elementary mistake in Dembski's math, it was just too funny...)

Topics: What is Conservation of Information?

William Dembski: Now let us get to the heart of things "Conservation of Information". What is that conservation? Let me put on the next slide.

William Dembski: This is probably the most gem-packed slide in this talk. I want to make a distinction between - what I call - probable and improbable events, and probable and improbable searches. An improbable event is just something that is high in improbability: flip a coin a thousand times, get a thousand heads in a row. Highly improbable. It happens: if you believe in a multi-universe, then there is a universe where this is happening, where someone like me is speaking, my doppelgänger flips a coin over the next hour and sees 1000 heads in a row. Probable and improbable search, that is about the probability that a search is successful. It is not so much asking whether it actually succeeds, it is not concerned with the result. It is concerned with the probability distribution associated with the search. This is an important distinction because so many intelligent design arguments look for a discontinuity in the evolutionary process. We look for highly improbable events. Such as the intelligent design people: you get for instance Thomas Nagel's "Mind and Cosmos". He is basically looking at probabilistic miracles. Think how the origin of life undercuts a materialistic understanding of biology. So he is looking into improbable events. That is what we do when we try to find evidence for a discontinuity. What I'm doing in this talk is saying, look, I'm going to give you evolution, give you common ancestry, all of that. That is no problem. What I'm interested in, though, is the probability of success for a search.

member of the audience: What are we searching for?

William Dembski: It is whatever the target happens to be.

member of the audience: [???] Can you give an example? [???]

William Dembski: I think that is what I would challenge you on. Actually, you are jumping ahead. I will address this a little bit later. Someone like Richard Dawkins will say that the problem with this METHINKS IT IS LIKE A WEASEL example is that it introduces a target, while real biology does not give us targets - and then he takes that back. I will give you a quote from that later. But I would say that the target in biology is teleology. Biological systems are teleological systems, teleological agents, that is what they produce, that is what needs to be explained. If you want to put it in terms of philosophy: there is a natural kind that becomes the target, and that is teleological agents. In fact, one of my good friends and colleagues also is here, James Barham [?], if you want to talk with him, that would be good. Give me a moment, because I want to speak to that, it will really come up.
In the computational context, it is never a problem, you are trying to solve something. Even the people who are writing these AVIDA and ev programs: for instance, in AVIDA, if you saw the article in "Nature" back in 2003 where they were arguing that this program was evolving irreducibly complex systems, they were specifically trying to get Boolean operators of a certain complexity. That was what they were rewarding. That was their target. What I describe to you now is Conservation of Information in a theoretical [???]. What we then do is we go and we look at these actually evolving systems - usually in silico - and then show where the information was put in. We have a theory, and then we show how the theory applies to these specific cases. Give me a moment - I know what you are asking. This is commonly how evolution is built, that there is supposed to be absent teleology. In fact, what I think that they do is, they are slipping it in.
Improbable search. Think of it this way: You have got a disease and two procedures you can take to get well. One has a higher probability, but maybe is more expensive. Which procedure do you want to use? You want to use the high-probability one. The actual outcomes may vary: someone who takes the low-probability procedure, it may be successful, he may get lucky. And the high-probability one, he may be unlucky. But the concern is: how likely is the search to find the target. That is what we are interested in in science. Getting lucky is not a good scientific explanation. If you are doing a needle-in-the-haystack problem, try to find that needle, what are you going to do? You try to find a better search which does not make it a needle-in-the-haystack, that provides you with a high probability. That is what Dawkins does.
In METHINKS IT IS LIKE A WEASEL, he does not solve it with randomly shaking out scrabble pieces - that would be $27^{28}$, that would be $10^{40}$, that would be your waiting time on average to get to that target sequence. That becomes your waiting time, waiting time and probability are interchangeable. That would be your average waiting time to get there. Because he substitutes for blind search his Darwinian search, he gets there much faster. But the question then is: what justifies him substituting that search?... The sense that I'm getting of my presentation is that time is running, and I think, this is a good place to come in with this.
But what Dawkins does is essentially, he says: Look, there is this blind search that is hopeless, it is needle-in-the-haystack, a highly improbable search. What I'm going to do - and that is why Darwin is so great - I give you a high probability search that is going to get you there. Then he says, see, Darwin has solved all our problems. Now I think we have somebody on faculty here who has a blog "Why Evolution is True". I'd say it should be probably renamed "How Evolution is True", because the question "why evolution is true", why does this work so well, what did Dawkins do to give us this search, this Darwinian search which is supposed to work, why does it work? Because he infused it with information. That is why it works. That is where I'm going with it...
So, the distinction between probable and improbable search. We can think then of a p-search as a search that has probability p of finding the target. Next, consider that a search can itself be an object of search. What did Dawkins do with METHINKS IT IS LIKE A WEASEL? He did a search for the search. He gave us a search which then with high probability found the target sequence he is after.
This is something people in optimization do, one name for it is "hyper heuristics". You are looking at heuristics, searches, and then it is about how you choose among your heuristic. Or, if you are choosing among heuristics, you are doing a search for a search. We abbreviate that as S4S. Conservation of Information - usually abbreviated as CoI - this is probably the purpose of this talk, it is as clear a statement as you can get: If you have $p < q$ and you want to improve a p-search to a q-search - the p in Dawkins' weasel is about $1:10^{40}$ - now you are going to improve this to, well, if you allow yourself 50 or 60 queries, then q is to be close to 1, that improvement requires a $p/q$-search for a q-search. What you have done is: the search for the search has become difficult. If p is very small, and q is large, then $p/q$ becomes pretty small. The search for a search becomes difficult, the search for a good search becomes difficult. If you think of Dawkins' weasel, the unimodal distribution is one of many other unimodal distributions.

William Dembski: Let me give you an example. You have got an Easter Egg hunt. Standard Easter Egg. An Easter Egg that is well hidden, but it is hidden in a huge field. Blind search is being highly unlikely to find you that Easter Egg. What you are going to want is a directed search, a search which is [assisted?]. Blind search would be a lot of sampling, you may try to do an exhaustive search but you are not able to exhaust things, because your query limit does not allow you to exhaust the search space.

William Dembski: So you are going to do a directed search. What does a directed search look like? You are walking along the field, and somebody is telling you "warm", "warmer", "cold", "warmer", "warmer", "hot", "you are burning up" - and there it is. That sort of direction - "warm", "warmer", "hot" - what is that? That is information. It is information that is going into the search. Here is the question: Where is this information source? Does the information source know where it is? Is it a search for the information search? Perhaps not a search for the information search. The information source knows the answer, but the process - in this case me meandering about - is getting information. I am doing a search. Let me give you another angle on conservation of information, because I have described information as something that increases the probability [???]. Usually, you are doing negative-logarithmic transformation and then you turn information into something that becomes additive and looks more like money, which is convenient. But let us think of it probabilistically. But we do pay to increase probabilities all the time.

William Dembski: If I'm playing a lottery, the more lottery tickets, the more likely I am to win.

William Dembski: But, this is in the case of a fair lottery (unlike the lotteries that the state runs), where everything that was paid in gets paid out under proper probabilistic principles, by buying more tickets, I will increase my probability of winning the lottery. But have I increased my expected gain? No. I can pay more to increase the probability of winning, but in the end, I did not gain anything. Conservation of information works like that. Let me give you perhaps the simplest example, and actually do the numbers for you.

William Dembski: We all remember "Let's Make a Deal" with Monty Hall.

William Dembski: There are three curtains with a prize behind one of the curtains. Let us say the prize is behind curtain 1. What is the probability of winning? I'm going to do this search. I have got one opportunity. That is my query limit. One opportunity, so I have got a probability of 1/3 to win this thing. But now let's say that someone comes to me and gives me a ticket:

William Dembski: It is one of these tickets. This ticket (1,1) will say "it is behind curtain 1", this one (1,2) will say "it is behind curtain 1 or curtain 2 with equal probability". From the nine possible tickets, these five will increase my probability of getting to curtain 1 and thus winning the prize. But the thing is: only five of these tickets! p is 1/3, that is the original probability, I'm now trying to bump it up to 1/2, that is q, but the actual probability of finding one of these tickets is less than that, it is 5/9, the probability is going down, it is less than p/q. This is typical for these search-for-a-search situations:

So, Dr. Dr. William Dembski does the numbers for us, for this, the simplest of all examples. $p = 1/3$ and $q= 1/2$. What? Wasn't q the probability of finding the prize while using our search strategy, i.e., $P(Choosing\,curtain\,1|Using\,one\,of\,the\,five\,tickets)$? But that is not $1/2$ as he says, it is actually $\frac{4}{5}\cdot \frac{1}{2} + \frac{1}{5} \cdot 1 = \frac{3}{5}$! And therefore, $p/q$ = $\frac{1}{3} / \frac{3}{5} = \frac{5}{9}$, exactly the probability of finding a circled ticket. No surprise here, that is how conditional probabilities work:

$p = \frac{1}{3}$=$P(Choosing\,curtain\,1)$ = $P(Choosing\,curtain\,1|Using\,one\,of\,the\,five\,tickets) \cdot P(Using\,one\,of\,the\,five\,tickets)$ + $P(Choosing\,curtain\,1|Using\,one\,of\,the\,other\,tickets) \cdot P(Using\,one\,of\,the\,other\,tickets)$ = $ \frac{3}{5} \cdot \frac{5}{9} + 0 \cdot \frac{4}{9}$=$q \cdot \frac{5}{9}$
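The bookkeeping can be redone by brute force over all nine tickets. This is a sketch under my reading of the slide: the tickets are the ordered pairs $(i,j)$ with $i,j \in \{1,2,3\}$, each ticket is drawn with probability 1/9, and a ticket names each of its two entries with probability 1/2. The function and variable names are mine.

```python
from fractions import Fraction
from itertools import product

# Assumed ticket model: the nine ordered pairs (i, j), each equally likely.
tickets = list(product((1, 2, 3), repeat=2))
p_ticket = Fraction(1, len(tickets))

def p_curtain1(ticket):
    """Probability of choosing curtain 1 when following this ticket."""
    return Fraction(ticket.count(1), 2)

p = Fraction(1, 3)                                    # blind search
helpful = [t for t in tickets if p_curtain1(t) > p]   # the five circled tickets
p_helpful = len(helpful) * p_ticket                   # P(using one of the five)
q = sum(p_curtain1(t) for t in helpful) / len(helpful)

# Law of total probability over all nine tickets.
total = sum(p_curtain1(t) * p_ticket for t in tickets)

print("helpful tickets:", helpful)
print("q =", q, " P(helpful) =", p_helpful, " p/q =", p / q)
print("total probability of choosing curtain 1:", total)
```

With this model, $q$ comes out as $\frac{3}{5}$, the probability of drawing a helpful ticket equals $p/q = \frac{5}{9}$ exactly, and the total probability stays at $\frac{1}{3}$.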

This error is so elementary that the audience wasn't able to spot it...

I have to agree with Dembski, though: This is typical for these search-for-a-search-situations

Conservation of Information in Evolutionary Search - Talk by William Dembski - part 3

2014-09-27T13:54:00.002-07:00

For an introduction to this post, take a look here. There is some interaction with the audience (15'30" - 18'00") which I wasn't able to understand fully. Any help is appreciated!

Part 3: 12' 45" - 31' 25"

Topics: What is an evolutionary search?

William Dembski: Now let's add this next term evolutionary. What does evolutionary - when we put it in front of search - add to the discussion? I think it changes one key aspect here. Whereas we were looking at some query feedback, now this query feedback takes the form of fitness: how good is it? Query feedback can be quite general. Maybe the query feedback is nothing, when we examine it. Or maybe the query feedback may just say "I'm in the target" or "I'm not in the target". That would be very simple. Fitness is going to give some sort of range of values that ideally identify how close am I to the target.

William Dembski: There are examples of evolutionary search. There is the Dawkins' weasel example from his book "The Blind Watchmaker", that is the one I'm going to focus on here. Then there are various - what I would regard as - embellishments of that, because I don't think that there is anything fundamentally new about them. There is MSU's Avida program, Tom Ray's Tierra, Schneider's ev. What is at the heart of these programs is that these are computer programs which mimic - try to mimic - Darwinian evolutionary processes. What are they supposed to show? That is interesting. Look at the history of this field of evolutionary computing and there is a reason why people wanted to do evolution in the computer. That is because the computer would allow evolution to be done in real time, because we cannot really see it in real time in the wild.

William Dembski: Nils Barricelli in 1962: "The Darwinian idea that evolution takes place by random hereditary changes and selection has from the beginning been handicapped by the fact that no proper test has been found to decide whether such evolution was possible and how it would develop under controlled conditions".

William Dembski: J. L. Crosby says substantially the same thing in '67.

William Dembski: Heinz Pagels in a popular book in 1989 wrote "The only way to see evolution in action is to make computer models because in real time these changes take aeons, and experiment is impossible".
Now, there is Richard Lenski at Michigan who - I think - has run 30 - 40,000 generations of E. coli, which probably corresponds to a million or so years of primate evolution. But I'd say that he has not seen a whole lot of changes, at the end of the game E. coli is still E. coli. So if you want to see some massive saltations, I think what Heinz Pagels says does still apply.

member of the audience: Can I ask you something?

William Dembski: Yes.

member of the audience: The Times had a very interesting article, very recently, exactly about this point. It was about a book that was written by Peter and Rosemary Grant. They looked at finches. And the claim is that they actually did exert evolution in forty years of time. They were basically looking at the evolution of finches in the Galapagos Islands. So, can you speak to it?

William Dembski: Finch beak variation, yes, in this case it was [???], they saw some. There were some changes which Richard Lenski saw in E. coli, but I think what is supposed to make evolution interesting is not how finches' beaks vary, but how you get beaks in the first place, how you get birds in the first place. That is the sort of evolution that I think these people who are talking about evolution in silico are thinking about: that we can really speed it up, so that we can see some of these big, impressive evolutionary changes.

member of the audience: So small evolutionary changes don't bother you.

William Dembski: It's not a question of bothering me. They are there. I mean, the evidence for them is clear. I think there is even evidence for large-scale evolutionary changes. The question is: what is driving them? For Darwinians, it is natural selection. For non-Darwinians, those mechanisms seem to be insufficient.

William Dembski: You are standing up [???]

member of the audience: [???] About two plants growing together and their cells fusing. [???] You get new species. And that's how we make new species in real-time. So, evolution can occur. There is a 1954 [???] Scientific American [???] cataclysmic variation [???]

William Dembski: That happens only in plants. I don't know of any case like that in animals.

Leo Kadanoff: Okay, you made your point. [???] Go ahead. I'd build my argument for example [???] two plants [???].

William Dembski: You might argue better by using other plants. Let's look at this example. I don't know how many of you have read the book "The Blind Watchmaker". This is an example that [???] worked countlessly, even in literature trying to justify the power of Darwinian processes to create information. Underscore that word "create", because that is what it is about: is it creating or is it shuffling about already existing information?

William Dembski: Let's look at this example from the vantage of search - I had these seven key components. What is that example? What you are trying to do, you take a random string of 28 letters and spaces - that's the reference class, that's the search space: letters and spaces. So there are $27^{28}$ possibilities. Start out with a random sequence - that is the initialization. Your target is "METHINKS_IT_IS_LIKE_A_WEASEL", this is a line from Shakespeare's Hamlet. You have a fitness that is going to measure how many letters correspond in a given sequence to the target sequence, so, that is basically a Hamming measure. You are going to have an update rule which is going to say "take an existing sequence and then - one possibility would be - generate 50 offspring by some sort of random mutation process and then take the one that is closest, and that becomes the next one", so that becomes the update rule. Stop criterion is "you stop when you hit the target sequence". And then the query limit is going to be whatever your computational resources allow. The thing is, with this setup, you are going to evolve to this final target sequence very, very quickly.

William Dembski: I'm just trying to give you a sense that all the components are there in this example. The fitness function in this case is a unimodal fitness where basically you are counting the distance letter by letter from the target sequence. For instance, here we have a score of 27, because you have a "J" where there should be a space. So, when the "J" disappears, then we are actually there. That's the example. I will talk about it a bit more.
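(An aside from me, not part of the talk: the components listed above fit into a few lines of code. This is only a sketch of the setup as Dembski describes it, not Dawkins' original program. The mutation rate of 5% per character, the query limit, the fixed seed, and the choice to keep the parent in the pool are my assumptions; the talk specifies none of them.)

```python
import random
import string

ALPHABET = string.ascii_uppercase + " "    # 26 letters plus the space
TARGET = "METHINKS IT IS LIKE A WEASEL"    # 28 characters

def fitness(candidate):
    """Number of positions agreeing with TARGET (28 minus the Hamming distance)."""
    return sum(a == b for a, b in zip(candidate, TARGET))

def mutate(parent, rate, rng):
    """Replace each character by a random one with probability `rate`."""
    return "".join(rng.choice(ALPHABET) if rng.random() < rate else c
                   for c in parent)

def weasel(offspring=50, rate=0.05, query_limit=100_000, seed=0):
    """Return (final string, best score after each generation)."""
    rng = random.Random(seed)
    current = "".join(rng.choice(ALPHABET) for _ in TARGET)   # initialization
    history = [fitness(current)]
    queries = 0
    # Stop criterion: target hit. Query limit: an assumed cap on evaluations.
    while current != TARGET and queries + offspring <= query_limit:
        brood = [mutate(current, rate, rng) for _ in range(offspring)]
        queries += offspring
        # Assumption: the parent stays in the pool, so the score never drops.
        current = max(brood + [current], key=fitness)
        history.append(fitness(current))
    return current, history

final, history = weasel()
print("size of the search space:", len(ALPHABET) ** len(TARGET))
print("generations:", len(history) - 1, "final:", final)
```

The first printed number is $27^{28}$, a bit more than $10^{40}$, which is the size of the space a blind search would have to sample from.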

William Dembski: I'm throwing that in as a type of digression. There is a kind of lunatic vitality to this example. I keep seeing it in places, and people keep challenging me on the Internet because I come back to this example as though this somehow misses something fundamental or that it is too simplified. But in fact this example just keeps getting reworked. Most recently - I thank a member of the audience for pointing that out to me - Michael Yarus in his 2010 book [???]. The target phrase for him is NOTHING IN BIOLOGY MAKES SENSE EXCEPT IN THE LIGHT OF EVOLUTION. There is a popular book by Jeffrey Satinover, in the "The Quantum Brain" MONKEYS WROTE SHAKESPEARE. Bern-Olaf Küppers in the 1990s, his target phrase was EVOLUTION THEORY. This type of example, where you are evolving symbol strings to some target, keeps getting used in the evolutionary literature to justify biological evolution. That is where we want to go with this. The question is: evolutionary search as I've described it to you, this is widely done, in some ways it is part of computational intelligence, in the sense of evolutionary computing, genetic algorithms, even falls under operations research as some kind of optimization procedure. How does this compare to real life evolution? Now, there are people who think that actually the computational case does provide justification for real life.

William Dembski: Robert Pennock for instance, who worked on this AVIDA program, he says: "I do scientific research on experimental evolution and evolutionary design using evolving computer organisms, including work showing how evolutionary mechanism can produce the kinds of complex features creationists say is impossible... My colleagues and I have demonstrated experimentally that a Darwinian mechanism can discover irreducibly complex systems." I think he is overstating his case, there are some details he leaves behind. The thing to get from this is that he is using what is happening in computational evolutionary searches to justify biological evolution.

William Dembski: Ken Miller in his 2008 book "Only a Theory" - he is a biologist at Brown University - says what is needed to drive biological evolution (that is the question he poses): "Just three things: selection, replication, and mutation... Where the information 'comes from' is, in fact, from the selective process itself." I would say that this is actually the received view, that the Darwinian mechanism is able to produce all these nifty things that you see, that all this biological information can be handed over to Darwinian mechanisms, and there we go. I want to address this from the vantage of what I call the "Conservation of Information", but before I do this, I want to create some doubts for you that this can be the whole story. Not by invoking anything like "Conservation of Information", but by actually going back to somebody at the time of Darwin who was looking at the logic of induction, and raised a method of induction, that actually - I think - undercuts this kind of Darwinian mechanism to produce, to create biological information.

William Dembski: This is Mill's method of difference. He formulated this in his "System of Logic" in 1843. It ran to eight editions, the last edition was 1882, so he is a contemporary of Darwin. Mill's method of difference shows that the Darwinian mechanism by itself cannot generate biological information. How does that work?

William Dembski: The method of difference says: "To explain a difference in effects, one must identify a difference in causes." What does that mean?

William Dembski: Common causes cannot explain differences in effects. Imagine, here is a difference in effect: Slowed reflexes versus ordinary reflexes.

William Dembski: Watching television, combing hair, o-oh, consuming alcohol. Alcohol is the difference maker. One person consumed it, the other person didn't. You have people watching television or not watching television, that is not making any difference. The difference maker which accounts for the slowed reflexes versus the ordinary reflexes is consuming the alcohol. Now let's look at the Darwinian mechanism.

William Dembski: We have replication, heritability, random variation, natural selection, all these basic components of the Darwinian mechanism. When you run a Darwinian mechanism, if you are a Darwinist, then you would say in a cellular context it is going to produce, we are going to see a lot of interesting evolution. But there are cases - for instance, Sol Spiegelman had an experiment back in the sixties in which he looked at polynucleotide synthesis and found instead of these evolving polynucleotides becoming more and more complex and more interesting, in fact, they tended towards simplicity, where the replicators would replicate as quickly as possible. What is supposed to make evolution interesting is that we go from monad to man, right? It is not that we go from cave-fish or cave-fishes that have working eyes to cave-fishes with eye-knobs, because in a case of use it or lose it, in this dark environment they have lost it and now they have eye-knobs. That is evolution, but that is not interesting evolution. It is how you get these eyes in the first place, how you get the beaks in the first place, how you get the birds.
Cellular automata: You can have cellular automata that follow Darwinian principles and never go anywhere. And artificial life, [???] the same thing. You can have cases of interesting evolution and evolution that goes in a simplifying direction, that goes nowhere, with all these features. If this is the case, if the Darwinian mechanism is common to cases where you have interesting evolution and evolution that is not going anywhere, then something besides the Darwinian mechanism must be involved. That is the logic. It seems to me that this should be uncontroversial.

William Dembski: But Stuart Kauffman, a complexity theorist who is not Darwinian, and not an Intelligent Design guy like me, has seen this problem. I think he puts it very well in his book "Investigations". He says: "In the absence of any knowledge, or constraint, on the fitness landscape, on average, any search procedure is as good as any other."
This is a no-free-lunch theorem, which actually really upset people. John Holland and the evolutionary [???] community back in the nineties - I have a colleague who was there at one of their meetings when this happened.
"But life uses mutation, recombination, and selection. These search procedures seem to be working quite well. Your typical bat or butterfly has managed to get itself evolved and seems a rather impressive entity.... If mutation, recombination, and selection only work well on certain kinds of fitness landscapes, yet most organisms are sexual, and hence use recombination, and all organisms use mutation as a search mechanism, where did these well-wrought fitness landscapes come from, such that evolution manages to produce the fancy stuff around us?... No one knows"
When I pose this to Darwinians, they often say: "Well, it is just the environment. That is where we get the fitness." I will revisit that. I think Kauffman asked the right question here, it is a question that many people do not even see is a question. Let's go back: there are seven key components of our evolutionary search. Question is: where is the information coming from? We do this in a computational context, this is usually where it is, it is put there in the fitness, it is put in the update rule. My friend Bob Marks had a colleague at Boeing who called himself a "penalty function artist". If you had the right penalty function, the optimization problem was solved. What is a penalty function? That is basically the inverse of a fitness. [???] That is usually where it comes in. Where does the information come in in this METHINKS IT IS LIKE A WEASEL? It came in obviously in setting up the fitness. You have a unimodal fitness function which measures how close you are to this METHINKS IT IS LIKE A WEASEL target phrase. You could have set up a fitness for any other phrase, for gibberish, and it would have evolved there. It was by choosing that fitness that you got it to evolve where it did. By the way, there are about $10^{40}$ ($27^{28}$) sequences of length 28 having 27 possible characters. Any idea how many unimodal Hamming-distance fitness landscapes there are over that space? It is the same: $10^{40}$. For every possible element there you got a unimodal fitness landscape. What he has done there is to say "I evolve this thing to the target sequence", but what he has not told you is "In doing that, I had a fitness landscape which I have carefully adapted". The search for the target phrase became the search for the right unimodal fitness landscape. This is an expression Paul Nelson - a good friend and colleague of mine - gave to me, which I use over and over again: "Filling one hole by digging another".
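The claim that the fitness function fixes where the string ends up can be tried out with a small sketch. This is not Dawkins's original program (which is not reproduced in the talk) but a minimal hill climber under assumed details: one random character is mutated per step, and a mutant is kept if it matches the target in at least as many positions. The names `hamming_fitness`, `evolve` and the gibberish phrase are made up for this illustration.

```python
import random
import string

ALPHABET = string.ascii_uppercase + " "  # 26 letters plus the space: 27 characters


def hamming_fitness(candidate: str, target: str) -> int:
    """Number of positions in which candidate agrees with target (higher is better)."""
    return sum(a == b for a, b in zip(candidate, target))


def evolve(target: str, seed: int = 0, max_steps: int = 200_000) -> tuple[str, int]:
    """Hill-climb from a random string towards target; return (final string, steps used)."""
    rng = random.Random(seed)
    current = "".join(rng.choice(ALPHABET) for _ in target)
    for step in range(max_steps):
        if current == target:
            return current, step
        # Mutate a single randomly chosen position.
        pos = rng.randrange(len(target))
        mutant = current[:pos] + rng.choice(ALPHABET) + current[pos + 1:]
        # Keep the mutant unless it is worse under the chosen fitness.
        if hamming_fitness(mutant, target) >= hamming_fitness(current, target):
            current = mutant
    return current, max_steps


WEASEL = "METHINKS IT IS LIKE A WEASEL"
GIBBERISH = "QZ XKJ VWPLM RTYU BNHG DFSCA"  # arbitrary 28-character phrase

# One Hamming-distance landscape per possible target, i.e. 27**28 of them.
print(len(ALPHABET) ** len(WEASEL))

for phrase in (WEASEL, GIBBERISH):
    result, steps = evolve(phrase)
    print(f"{result!r} reached after {steps} steps")
```

The same mutation and selection code reaches either phrase; only the `target` argument, and with it the fitness landscape, differs between the two runs.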

A longer excerpt this time, and one which includes a few gems, though I apologize for not getting everything which was said. My thoughts:

"At the end of the game E. coli is still E. coli." Yes, William Dembski really did say this.
The audience seems to have expected to be confronted with a creationist like Ken Ham, but Dembski has no problem with evolution, neither on the small scale nor on the large scale.
However, he uses the term "interesting evolution" as a kind of straw-man: things have to get more complex and considerably diverse. Is the creation of a tiny bit of information by a Darwinian process unproblematic for him? I doubt it...
I'm not totally convinced by Dembski's application of the method of difference; he seems to ignore the influence of chance altogether: neglecting the influence of chance, two guys playing Russian Roulette should end up either both dead or both alive...
BTW, at 30'15'', there is an impressive animation illustrating how big the number 40 is....

Conservation of Information in Evolutionary Search - Talk by William Dembski - part 2

2014-09-26T13:48:00.000-07:00

For an introduction to this post, take a look here. This is quite a short section, with some annotations from me.

Part 2: 09' 40" - 12' 45''

Topics: What is a search?

William Dembski: We talked about information. Let's now look at that second key term "Search". What is a search? There are seven key components in a search.

William Dembski: You have a search space, you have a target - we are looking for something in the search space. There is initialization - where do we start off? There is a query limit - how many things in the search space can we check out? There is query feedback - when we have checked out, when we have located some item - what is it telling us about itself in terms of how it relates to the target? There is an update rule - once we have queried something, what do we query next? And then finally a stop criterion - when do we stop? How do we know that we have done enough? This is very general.
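To make the list concrete, here is a minimal sketch of how the seven components could be written down as a data structure. The talk gives no formal definitions at this point, so the class `Search`, its field names and the linear-scan example are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Set, Tuple


@dataclass
class Search:
    """Hypothetical container for the seven components named in the talk."""
    space: Sequence[int]                 # 1. search space
    target: Set[int]                     # 2. target
    initialize: Callable[[], int]        # 3. initialization: where do we start?
    query_limit: int                     # 4. query limit: how many items may be checked?
    feedback: Callable[[int], float]     # 5. query feedback for a queried item
    update: Callable[[int, float], int]  # 6. update rule: what do we query next?
    stop: Callable[[int, float], bool]   # 7. stop criterion

    def run(self) -> Tuple[Optional[int], int]:
        """Return (item at which the search stopped, or None; number of queries used)."""
        item = self.initialize()
        for used in range(1, self.query_limit + 1):
            score = self.feedback(item)
            if self.stop(item, score):
                return item, used
            item = self.update(item, score)
        return None, self.query_limit


# Example: a linear scan over ten numbers, looking for 7.
space = list(range(10))
target = {7}
linear_scan = Search(
    space=space,
    target=target,
    initialize=lambda: space[0],
    query_limit=len(space),
    feedback=lambda x: 1.0 if x in target else 0.0,
    update=lambda x, score: x + 1,
    stop=lambda x, score: score == 1.0,
)
found, queries = linear_scan.run()
print(found, queries)
```

In this sketch the target only enters through the feedback function, which is how an optimization problem would usually be set up; whether the talk intends the target and the feedback to be tied together in this way is not stated.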

William Dembski: Let me say something about the query limit, because that will always be involved. Fact is, even though there may be multiple universes, our own universe is very small, there is not a whole lot of computational power in it. The best supercomputers now are operating in petaflops, $10^{15}$ to $10^{16}$, there are less than $10^{18}$ seconds in the year, no research group that I know has ever operated for more than $10^2$ or one hundred years. The number of researchers seems to be bounded by $10^{10}$. Actually, those numbers I gave you add up to $10^{45}$. So, $m$ for all practical purposes is always to be bounded by $10^{40}$, I think that is safe to say. If you are unhappy with that, if you are a really theory based person thinking what is the absolute limit, Seth Lloyd, a quantum computational theorist at MIT, sets the absolute computational limit of the universe to $10^{120}$. That is the most computations that can ever be done. A computation is going to be involved in search, that is the assumption that I make. Especially if you are representing search in silico [???] about the limit [???] anything that we are looking at in our lifetime, even with Moore's law.
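Multiplying powers of ten amounts to adding their exponents, so the "add up to" remark can be checked in a few lines. The sketch below only uses the exponents as they appear in the transcript; the dictionary keys are descriptive labels chosen here, not terms from the talk.

```python
# Exponents (base 10) of the bounds as quoted in the transcript.
quoted_exponents = {
    "operations per second (lower petaflop figure)": 15,
    "seconds (as quoted)": 18,
    "years of operation": 2,
    "researchers": 10,
}

# The product of the bounds is 10 raised to the sum of the exponents.
low_total = sum(quoted_exponents.values())
high_total = low_total + 1  # using 10**16 instead of 10**15 operations per second

print(f"product of the quoted bounds: 10^{low_total} to 10^{high_total}")
```

With the lower petaflop figure the sum matches the $10^{45}$ mentioned in the talk.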

William Dembski: These are the seven key components of search. There is a connection with information, obviously: in finding a target, a search produces information. It gets to the target and rules out things that are not in the target, and thereby realizes one possibility to the exclusion of others. So searches produce information in the sense I have just described.

William Dembski's definition of a search differs crucially from the definitions of virtually everybody else - for whom searches just form a subset of optimization problems. Dembski and his collaborators separate the target and the feedback. While everybody else is trying to find the optimum of a function (e.g., the characteristic function of a subset $T$ of the search space $\Omega$), and will say that elements in the inverse image of the optimum are in the target, this kind of feedback isn't enough for Dembski: you may have found an element of $\Omega$ with an optimal feedback, but this may or may not lie in the target. In a game of Hangman with Dembski, guessing the letters F and O for a three letter word and writing them down as a solution, you may think that FOO is the solution, but even after writing the word out, Dembski would inform you that the real target was BAR. Or in evolutionary terms: some Darwin's finch may have quite a good beak for its purpose, and its species may flourish, but in reality, its niche should be occupied by a unicorn.
More interestingly, the given seven elements of a search are quite different from the description of a search in their paper "A General Theory of Information Cost Incurred by Successful Search", which Dembski announced as one of the three key theoretical publications on CoI! At least, the new elements of a search don't sound as pompous as the former arrangement of the initiator, the terminator, the inspector, the navigator, the nominator, and the discriminator. Now, the search space $\Omega$ and the target $T$ made the list, the initialization is the former initiator, the query limit $m$ and the stop criterion are the terminator, query feedback is the inspector, and the update rule seems to supplant navigator and nominator. Most importantly, the discriminator is gone.
I had an interesting exchange with Winston Ewert - one of the authors of the paper - at my blog and at a thread at Uncommon Descent: Questioning Information Cost. In fact, I think that was one of the most fruitful discussions I had with a proponent of Intelligent Design for quite a while.
Winston Ewert was able to clear up some of my misconceptions on their concept, and replace them with new objections. One of my main problems was that in their model even exhaustive searches do not necessarily find the target; in fact, on average, all exhaustive searches perform only as well as a single random guess.
I can only assume that Dembski, Marks, and Ewert finally recognized that this is indeed a problem for their framework, and perhaps have dropped the poor discriminator unceremoniously. At least, that would answer the question from my exchange with Ewert, "I’d like to know whether this “general framework” is still in use", with a no.
I don't think much of those calculations of computational limits of the universe. Combinatorics leads to big numbers without great fuss: There are $52! \approx 8.07 \times 10^{67}$ ways to arrange a single deck of cards, many more than can be computed using Dembski's limit of $10^{40}$. With two identical decks, I can find $\frac{104!}{2^{52}} \approx 2.29 \times 10^{150}$ ways to arrange them, more than Seth Lloyd's limit of $10^{120}$. And still, card games are played - even solitaire...
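The two card-deck counts are easy to check with exact integer arithmetic. A small sketch follows; the helper name `magnitude` is made up here, and the division by $2^{52}$ accounts for each of the 52 card faces appearing twice in the doubled deck.

```python
from math import factorial


def magnitude(n: int) -> int:
    """Base-10 exponent of a positive integer, i.e. its number of digits minus one."""
    return len(str(n)) - 1


single_deck = factorial(52)
# Two identical decks: 104 cards, each of the 52 faces occurring twice.
double_deck = factorial(104) // 2**52

print(f"52!        has magnitude 10^{magnitude(single_deck)}")
print(f"104!/2^52  has magnitude 10^{magnitude(double_deck)}")
print(single_deck > 10**40, double_deck > 10**120)
```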

Previous: Part 1 - Introduction, What is information?

Conservation of Information in Evolutionary Search - Talk by William Dembski - part 1

2014-09-25T13:47:00.000-07:00

For an introduction to this post, take a look here.

Part 1: 00' 00" - 09' 40''

Topics: Introduction, What is information?

Leo Kadanoff: [???] He went on to broader interests in subjects including information theory, philosophy and parts of biology. The best write-up I could find about him was the Discovery Institute's write-up on the web: "mathematician philosopher William A. Dembski is senior fellow with the Discovery Institute. He has taught at the Northwestern University, the University of Notre Dame, and the University of Dallas. He has done postdoctoral work in mathematics at MIT, in physics in Chicago, and in computer science at Princeton. He is a graduate of the University of Illinois, of the University of Chicago, and of Princeton.
His fields include mathematics, physics and philosophy, as well as theology. We probably hear only a fraction of those interests today in his talk about the "Creation of Information in Evolutionary Search".

William Dembski: Okay, well, Leo, it is a pleasure to be back here. Leo was my adviser back in 87/88, along with Patrick Billingsley and [???]. The topic is actually "Conservation of Information in Evolutionary Search". I want to speak about that.

Leo Kadanoff: I said creation! [???]

William Dembski: I'm called a creationist enough, so I make that distinction when I can. What I will describe is the work that I have done with the Evolutionary Informatics Lab - this is their website.

William Dembski: The key person there who runs the lab is Robert Marks. He was for twenty-five years on the faculty of the University of Washington. His field was computational intelligence, he is one of the creators of that field which includes evolutionary computing, neural networks, and fuzzy logic. So, he has been at Baylor for about ten years and we started collaborating about a decade ago but it really came to a head about 2007 and we have been publishing since about 2009 in this area. So, what I will describe is really in this talk the theoretical work which came out of these three papers.

William Dembski: "Conservation of Information in Search: Measuring the Cost of Success", that was an IEEE publication, then the next paper "The Search for a Search", that was a Japanese journal on computational intelligence, and the last that is [???], that was a conference proceeding. So, anyway, what I would like to do is talk about, just go through the key-words in the titles. Let's start with information.

William Dembski: What is information? We live in the information age, right?

William Dembski: But the statement that I came across years ago - actually in a philosophy course - which to me really puts it best is the following quote from a philosopher at MIT Robert Stalnaker, that is in his book "Inquiry", 1984, "To learn something, to acquire information, is to rule out possibilities. To understand the information conveyed in a communication is to know what possibilities would be excluded by its truth." This for me has captured what is most crucial about information. So, if you want a definition here is how I would define it: "Information is the realization of one possibility to the exclusion of others within a reference class of possibilities" [???] I want to round this up.

William Dembski: I just want to add: it is one thing to say, "okay, this is what information is", but if you want to do science, especially if you want to do exact science, you got to have to measure information. And how do you measure information? Well, you measure it by probabilities. The smaller the probability, the greater the information. Now, information theory adds to that, it takes the log, it usually does logarithmic transformation of probabilities, it takes averages, that is very common in communication theory, [???] it does other transformations as well, integrals, powers and things like that. But at its core, information is measured in probabilities, so let me say something about that: but before I elaborate on the definition of some measurements, I want to give you another way of thinking about information as a decision.

William Dembski: Decision and homicide come from the same Latin word, they come from "caedere", to kill, to slay or to cut off. Just as a homicide kills somebody, a decision withdraws options, rules out possibilities. The reason I give this is, I'm trying to massage your intuitions, but a decision is something active. Often, when we think of information we point to something, we say there is an item of information. There is a sense in which items of information have validity, but information fundamentally I think is more of a verb than a noun. I show this in my next slide. We think of information as a decision, then information becomes in the first instance of [???] an act rather than an item. That's when we speak about an item of information we keep in mind the act that produced it. Let's give you some examples...

William Dembski: Let's say I tell you it is raining outside. What have I done? Well, I've excluded that it is not raining outside. So I have actually given you some information. If I say it is raining outside or it is not raining outside, have I given you any information? Well, I haven't ruled anything out. But what is the reference class there? It is the weather, it is the weather that is outside. Now, what if I put that in quotes "it is raining outside". Now it is a symbol string, that is being communicated across a communication channel. In that case the reference class is going to be other symbol strings that might be competing with it. In that case "It is raining outside or it is not raining outside" - now with the quotes - becomes another symbol string, that could be put across a communication channel.

William Dembski: It would actually contain more information, because it is longer, it is more improbable, it is harder to reproduce the same symbol string. So what constitutes information is going to be in a sense context [???], context is the reference class in which you are considering it. If I say "it is raining outside", what about measuring that probability? If I say that in Chicago - it rains here some, maybe with a certain probability. If I tell you in the Sahara desert "it is raining outside", that is going to be much more improbable, there will be much more information conveyed in that. In terms of the measurement of information, this is how information theorists do it: think of - for instance - a poker hand. If I tell you "this is a hand which has a pair", or "two pairs", there are a lot of different poker hands, about 2.5 million poker hands. But if I tell you "Royal flush", that narrows it down quite a bit. The range of possibilities becomes more constricted, it is more improbable and there is more information. We are doing some basics here, but this is at a more general level than you would be getting it in an information theory book, which tends to look at symbols, strings, and trying to get them [???] across a communication channel. Now, what is communication in that case?
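The rule "the smaller the probability, the greater the information" is usually made precise as the self-information $-\log_2 p$, measured in bits. The sketch below applies it to the poker example; the hand counts are the standard combinatorial ones for five-card hands, and the function name `surprisal_bits` is chosen here for illustration.

```python
from math import comb, log2


def surprisal_bits(p: float) -> float:
    """Self-information of an event with probability p, in bits (log2(1/p) = -log2(p))."""
    return log2(1 / p)


TOTAL_HANDS = comb(52, 5)  # all five-card hands
# One pair: rank of the pair, its two suits, three other ranks, one suit for each of them.
ONE_PAIR = 13 * comb(4, 2) * comb(12, 3) * 4**3
ROYAL_FLUSH = 4  # one per suit

for name, count in [("one pair", ONE_PAIR), ("royal flush", ROYAL_FLUSH)]:
    p = count / TOTAL_HANDS
    print(f"{name}: p = {p:.6f}, information = {surprisal_bits(p):.2f} bits")

# "It is raining or it is not raining" rules nothing out: probability 1, zero bits.
print(surprisal_bits(1.0))
```

Being told "royal flush" narrows the range of possible hands far more than being told "a pair", and the measure reflects that with a much larger number of bits.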

William Dembski: I would define communication as the coincidence or correlation of two acts or items of information. Look at Shannon's original diagram in his "Mathematical Theory of Communication" from 1949, you have basically a source and a receiver, and then you have some act of information here which will be mirrored in some way over there. We do this all the time: we see this sort of set-up when I am sending an email communication, there will be some simple strings from my keyboard, that are getting coded in a certain way, and there will be some transport protocols, and there will be use of error correction, and it will be moved until it ends up on your computer. This process is happening several times, there will be multiple - if you will - acts of information that are going to happen.

William Dembski: It is interesting to look at the history: Shannon's original concern in coming up with the communication of information was the transmission of intelligence. That is an exact quote, released [???]. I think that was even in his undergraduate papers.

In my opinion, there are some problems already in this part of the talk. Some can only be spotted with some knowledge of William Dembski's publications, others should be spotted by an audience just generally interested in information theory, e.g.:

William Dembski is talking only about information of Shannon's type. This seems to be a very narrow approach.
William Dembski is well aware of the problems with his paper "The Search for a Search: Measuring the Information Cost of Higher Level Search", see for example Tom English's The theorem that never was: Diversionary “erratum” from Dembski and Marks. Dembski knows that there is no valid proof for one of his main theorems in this paper (his grandiosely named Horizontal No Free Lunch Theorem), but he chose to ignore this fact, and even to delete an erratum without further comment. And then he presents this paper to a less informed audience as one of the three "Key Publications on CoI"!
And one amusing thought: "It is raining outside". Who creates this information? The intelligent observer William Dembski or the unintelligent weather in Chicago, which realized the possibility of raining?

Next: Part 2 - What is a search?

William Dembski's talk at the University of Chicago

2014-09-25T06:30:00.000-07:00

Invited by Leo Kadanoff, William Dembski spoke on Aug 15, 2014 at the University of Chicago's "Computations in Science" seminar. Jerry A. Coyne - a professor in the department of ecology and evolution at the same university - questioned the judgement of the seminar's organizers. Afterwards, the Discovery Institute was very pleased with its paladin William Dembski.

"The talk itself and the Q&A afterward, which were at a pretty high level, went very well."

They also loved a concluding remark by Leo Kadanoff:

I think the ball is in the court of people who believe in evolution. They have to deal with these questions. ...Bill has made his case and we should all go home and think.

At William Dembski's former blog Uncommon Descent, a video of the talk-cum-questions was posted on Sep 14, 2014:

This video has gotten very little resonance. To make it easier to access, I have created a transcript, which I will publish on this blog in a short series of posts. Obviously, the usual caveats apply: I'm not a native speaker, but I tried my best to understand and reproduce the talk as truthfully as possible. I apologize in advance for my errors, which inevitably have occurred, and I'm grateful for any correction.

How "official" is the video?

The question arose: who actually taped the talk? Some student, who then put it up on youtube? I think that it is a work of members of the Discovery Institute:

The youtube channel MissIngaNiball on which the video is presented seems to belong to Robert Marks (wikipedia, American Loons), or at least a member of his family (in which case a predilection for feeble puns would be hereditary).
Two stills taken from the video are credited to Paul Nelson (wikipedia, American Loons) in the Discovery Institute's article.

Dembski's talk: Part 1 - 5

Dembski's, Ewert's and Marks's Concept of a Search Applied to Exhaustive Searches

2013-07-14T01:25:00.000-07:00

At Uncommon Descent, Winston Ewert, co-author of the paper A General Theory of Information Cost Incurred by Successful Search, writes:

"The search is defined to be a six-tuple consisting of the initiator, terminator, inspector, navigator, nominator, and discriminator. The paper studies the question of picking a search at random, and that would imply picking each of the six components at random. We did not consider it necessary to specifically state that each individual component was also selected at random. That would seem to be implied."

So, let $\Omega = \{\omega_1, \omega_2, \dots, \omega_N\}$ be our finite search space with $N$ elements. We are looking for a single element $\omega_k$, so we try to maximize the fitness function $f = \chi_{\omega_k}$. To keep everything finite, we don't allow repetitions, i.e., in our search each place can only be visited once. This is - as Macready and Wolpert observed - always possible by keeping a look-up table and thus doesn't change the set-up. Therefore, our search is completed in at most $N$ steps.
(BTW: The claim that "each of the six components [is picked] at random" seems not to apply to the inspector: this is a fixed function for a search - in our case, the inspector returns the value of the fitness function. Of course, you can say that we pick the inspector at random out of the set of the one possible inspector.)
Let's take a look at all the searches which are ended by their terminator only after the $N$-th step, i.e., the subset of all exhaustive searches. The prize question: What is the probability of finding the target in such an exhaustive search? Until now, everyone looking at such problems would have thought that this probability is one: we certainly visited $\omega_k$ and spotted that the function $f$ takes its maximum there. But in the world of Dembski, Ewert, and Marks it is not, as a random discriminator takes its toll - and discriminators aren't obliged to return the target if it was found and identified...
Counterintuitive? That is a flattering description: the discriminator's purpose seems to be to turn even a search which is successful by all human standards into a guess to fit the idée fixe that each search can be "represented" by a measure on the search space.
Addendum: We can drop the condition of not having repetitions in our searches and just look at those searches which are terminated only after the whole search space was visited: terminators with this property exist. Such searches may have length $N$, but can be much longer. The result is the same: the probability of finding the target during a complete enumeration of the search space is (much) less than one. I have to ask: What good is a model in which an exhaustive search doesn't fare much better than a single guess?
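A toy calculation may illustrate the effect. It does not reproduce the paper's formal definitions; it merely assumes, as a stand-in for "a discriminator picked at random", a discriminator that ignores the inspector's feedback and returns one of the visited elements uniformly at random, and compares it with a discriminator that returns the target whenever it was visited. The function name and the choice $N = 5$ are arbitrary.

```python
from fractions import Fraction
from itertools import permutations


def success_probability(n: int, target: int, informed: bool) -> Fraction:
    """Average success probability over all exhaustive visiting orders of range(n)."""
    orders = list(permutations(range(n)))
    total = Fraction(0)
    for order in orders:
        if informed:
            # Discriminator that uses the feedback: succeeds iff the target was visited.
            total += Fraction(1) if target in order else Fraction(0)
        else:
            # Assumed "random" discriminator: uniform pick among the visited elements.
            total += Fraction(order.count(target), len(order))
    return total / len(orders)


N = 5
print(success_probability(N, target=2, informed=True))   # exhaustive search, sensible discriminator
print(success_probability(N, target=2, informed=False))  # exhaustive search, random discriminator
```

Under this assumption the exhaustive search with the random discriminator succeeds with probability $1/N$, which is exactly the chance of a single uniform guess.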

Questioning Information Cost - A reply to Winston Ewert

2013-07-13T03:34:00.000-07:00

Over at Uncommon Descent, Winston Ewert (one of the three authors of the paper A General Theory of Information Cost Incurred by Successful Search) answers, in the article "Questioning Information Cost", "a number of questions and objections to the paper" that I raised. He states five points, which I will address in this post. Obviously, I'll give my reply at Uncommon Descent, too, but their format doesn't allow for mathematical formulas, so it is easier to make a first draft here. I thank Winston Ewert for his answers, but I'd appreciate some further clarifications.

Firstly, Dieb objects that the quasi-Bayesian calculation on Page 56 is incorrect, although it obtains the correct result. However, the calculation is called a quasi-Bayesian calculation because it engages in hand-waving rather than presenting a rigorous proof. The text in question is shortly after a theorem and is intended to explicate the consequences of that theorem rather than rigorously prove its result. The calculation is not incorrect, but rather deliberately oversimplified.

Fair enough. So it's not a quasi-Bayesian calculation, but a Bayesian quasi-calculation. I will amend my post (Please show all your work for full credit...) with Winston Ewert's explanation.

Secondly, Dieb objects that many quite different searches can be constructed which are represented by the same probability measure. However, if searches were represented as a mapping from the previously visited points to a new point (as in Wolpert and Macready’s original formulation), algorithms which derive the same queries in different ways will be represented the same way. Giving multiple searches the same representation is neither avoidable nor inherently problematic.

The problem is that Dembski's, Ewert's and Marks's construction of the representation does not only depend on the discriminator (see the next point), but on the target, too. Take $\Omega = \{1,2,3,4\}$ and two searches with two steps:

The first search consists of just two random guesses, i.e., at each step, each of the numbers is guessed with probability $1/4$.
The second search has two guesses, too. But at the first step, $1$ is taken with probability $7/16$ and each other number with $3/16$, while at the second step, $1$ is omitted from the guess and each other number is guessed with a probability of $1/3$.

These two searches are quite different: the first may produce a query $(1,1)$ with probability $1/16$, while the second never will. Now take a discriminator $\Delta$ which returns the target if it is in the query and otherwise another element in the query at random. Such a discriminator seems to be quite natural and it is certainly within the range of the definition on pages 35--36.
Now, the distribution which $\Delta$ induces on $\Omega$ depends on the target: if we are looking for $\{1\}$, we get:

First search: $\mu_{\{1\}}^1$ given by $\mu_{\{1\}}^1(\{1\}) = 7/16$, $\mu_{\{1\}}^1(\{2\})= \mu_{\{1\}}^1(\{3\})= \mu_{\{1\}}^1(\{4\})=3/16$
Second search: $\mu_{\{1\}}^2 = \mu_{\{1\}}^1$

These are two algorithms which don't derive the same queries albeit in different ways, but nonetheless they will be represented the same way!
In fact, if our target is $\{2\}$, we get other distributions:

First search: $\mu_{\{2\}}^1$ given by $\mu_{\{2\}}^1(\{2\}) = 7/16$, $\mu_{\{2\}}^1(\{1\})= \mu_{\{2\}}^1(\{3\})= \mu_{\{2\}}^1(\{4\})=3/16$
Second search: $\mu_{\{2\}}^2(\{1\}) = 14/96$, $\mu_{\{2\}}^2(\{2\}) = 44/96$, $\mu_{\{2\}}^2(\{3\}) = \mu_{\{2\}}^2(\{4\}) = 19/96$.

Frankly, this seems to be "inherently problematic".
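The four distributions can be recomputed by enumerating all two-step queries with exact fractions. The sketch below encodes the two searches and the discriminator $\Delta$ as described in this post; it assumes that the second search's second guess is uniform on $\{2,3,4\}$ regardless of the first guess, and that "another element in the query at random" means one of the two queried entries with probability $1/2$ each. The function and variable names are made up.

```python
from fractions import Fraction
from itertools import product

OMEGA = (1, 2, 3, 4)

# Search 1: two independent uniform guesses.
search1 = [
    {x: Fraction(1, 4) for x in OMEGA},
    {x: Fraction(1, 4) for x in OMEGA},
]
# Search 2: the first guess favours 1, the second guess never picks 1.
search2 = [
    {1: Fraction(7, 16), 2: Fraction(3, 16), 3: Fraction(3, 16), 4: Fraction(3, 16)},
    {1: Fraction(0), 2: Fraction(1, 3), 3: Fraction(1, 3), 4: Fraction(1, 3)},
]


def induced_measure(search, target):
    """Distribution on OMEGA obtained by applying the discriminator to every query."""
    mu = {x: Fraction(0) for x in OMEGA}
    for first, second in product(OMEGA, repeat=2):
        p = search[0][first] * search[1][second]
        if target in (first, second):
            mu[target] += p  # the target was queried, so it is returned
        else:
            # No hit: return one of the two queried entries, each with probability 1/2.
            mu[first] += p / 2
            mu[second] += p / 2
    return mu


for target in (1, 2):
    for name, search in (("first", search1), ("second", search2)):
        print(f"target {{{target}}}, {name} search:", induced_measure(search, target))
```

For the target $\{1\}$ both searches yield the same measure, while for the target $\{2\}$ they do not, although neither search was changed in between.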

Thirdly, Dieb objects that a search will be biased by the discriminator towards selecting elements in the target, not a uniform distribution. However, Dieb’s logic depends on assuming that we have a good discriminator. As the paper states, we do not assume this to be the case. If choosing a random search, we cannot assume that we have a good discriminator (or any other component). The search for the search assumes that we have no prior information, not even the ability to identify points in the target.

This seems to be a little absurd. Shouldn't your representation work for any discriminator - even a good one? If we are following Wolpert's and Macready's formulation, a blind search means that we try to maximize a characteristic function. So, the natural discriminator should return this maximum if it is found in a query. If it doesn't, we build a discriminator which does: we have the output of the inspector, so why not use it? If you are telling us that the output of the inspector may be false, then I'd use another inspector, one which gives us the output of the fitness function. If you say now that the output of the fitness function may be dubious, I'd say "tough luck: I maximize this function whether the function is right or wrong - what else is there to do?". These added layers of entities which have a hidden knowledge about the target which isn't inherent to the fitness function seem to be superfluous.

Fourthly, Dieb doesn’t see the point in the navigator’s output as it is can be seen as just the next element of the search path. However, the navigator produces information like a distance to the target. The distance will be helpful in determining where to query, but it does not determine the next element of the search path. So it cannot be seen as just the next element of the search path.

So, what is the difference between the inspector and the navigator? The navigator may take the output of the inspector into account, but nonetheless one could conflate both into a single pair of values - especially as you allow "different forms" for the inspector. So you could get rid of the third row of the search matrix.

Fifthly, Dieb objects that the inspector is treated inconsistently. However, the output of the inspector is not inconsistent but rather general. The information extracted by the inspector is the information relevant to whether or not a point is in the target. That information will take different forms depending on the search, it may be a fitness value, a probability, a yes/no answer, etc.

Sorry, I may have been confused by the phrase "The inspector $O_{\alpha}$ is an oracle that, in querying a search-space entry, extracts information bearing on its probability of belonging to the target $T$": if we look at Dawkins's Weasel and take the Hamming distance as the fitness function, each returned value other than $0$ tells us that the probability of belonging to the target $T$ is itself zero for that element, whether it is "METHINKS IT IS LIKE A WEASER" or "AAAAAAAAAAAAAAAAAAAAAAAAAAAA". I understand that you want to avoid the notion of proximity to a target, but your phrasing is misleading, too. Do you have any example of a problem where the inspector returns a probability other than 0 or 1? In your examples, it always seems to be the output of a fitness function.
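To make the Weasel point concrete, here is a minimal sketch with the Hamming distance as fitness function. The names `hamming_distance` and `in_target` are hypothetical helpers of mine, and treating the target as the single phrase is my assumption for this illustration.

```python
def hamming_distance(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(ch_a != ch_b for ch_a, ch_b in zip(a, b))


TARGET = "METHINKS IT IS LIKE A WEASEL"


def in_target(candidate: str) -> bool:
    """A candidate belongs to the target only if its distance is 0."""
    return hamming_distance(candidate, TARGET) == 0


near_miss = "METHINKS IT IS LIKE A WEASER"
far_miss = "A" * len(TARGET)

for candidate in (TARGET, near_miss, far_miss):
    print(candidate, hamming_distance(candidate, TARGET), in_target(candidate))
```

The fitness values of the two misses differ widely, yet both strings have probability zero of belonging to the target: the fitness value carries information about proximity, not about membership.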

The authors of the paper conclude that Dieb’s objections derive from misunderstanding our paper. Despite five blog posts related to this paper, we find that Dieb has failed to raise any useful or interesting questions. Should Dieb be inclined to disagree with our assessment, we suggest that he organize his ideas and publish them as a journal article or in a similar venue.

It's always possible that I've misunderstood certain aspects of the paper. I would be grateful if you helped to clear up such misunderstandings. I hope that my comments above count as useful and at least a little bit interesting. I'm preparing an article, as I've promised earlier, but the work is quite tedious, and any clarification of the matters above would be welcome. Furthermore, I'd like to know whether this "general framework" is still in use, or whether you have tried another way of representing searches as measures. Again, thank you Winston Ewert!

BI:NP - A General Theory of Information Cost Incurred by Successful Search

2013-07-04T02:53:00.000-07:00

(This is an email I wrote to William Dembski, Winston Ewert and Robert Marks II)

Hi,

it's nice to be able to read the proceedings of the conference on Biological Information – New Perspectives for free. However, I have a few questions regarding your contribution "A General Theory of Information Cost Incurred by Successful Search":

1) Your quasi-Bayesian calculation on p. 56 gets the right result, but IMO it isn't correct: Please see http://dieben.blogspot.de/2013/07/please-show-all-your-work-for-full.html for details.

2) You claim that you have found a representation for searches as measures on the original space. Again, this works for guesses, but seems to be quite problematic when it comes to searches: here, many quite different searches can be constructed which are "represented" by the same $\mu$ in $M(\Omega)$!

3) You are using the uniform measure on $M(\Omega)$. Again, fine with guesses - but when it comes to searches, this becomes questionable: if $\mu_{(X_1, X_2, \dots, X_n)}$ are measures representing searches $S(X_1, X_2, \dots, X_n)$, where at each step an element of $\Omega$ is chosen according to a measure $\theta_k$ that is itself chosen uniformly at random, then the measures induced by a "discriminator" (which returns an element of $T$ if it was found, otherwise a random element of the first line of the search matrix) aren't again uniformly distributed on $M(\Omega)$. In fact, for $n$ tending to infinity, the measures approach $\delta_T$!

4) For me, your description of a search is quite convoluted: I don't see the point of the "navigator"'s output, as this can be seen just as the next element of your search path. And then there is the output of the "inspector": you are treating it quite inconsistently - once, it is the probability of an element to be a member of the target, the next time it is the output of a fitness function...

I'd like to see you addressing these issues above. Denyse O'Leary promised a series of posts at Uncommon Descent, each one dedicated to an article of the proceedings. If you don't wish to answer via mail - or comment on my blog - perhaps we can discuss these questions there?

Yours

Di…Eb…
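As an illustration of point 3) in the email above, here is a minimal sketch of how the induced measures pile up on the target. It rests on assumptions of my own: $\Omega$ has four elements, the target is the first of them, the measures $\theta_k$ are drawn independently and uniformly from the simplex, and the discriminator returns the target whenever at least one query hit it. Under these assumptions the induced measure puts mass $1 - \prod_k (1 - \theta_k(T))$ on the target.

```python
import random


def random_measure(size, rng):
    """Draw a probability vector uniformly from the simplex (Dirichlet(1,...,1))."""
    weights = [rng.expovariate(1.0) for _ in range(size)]
    total = sum(weights)
    return [w / total for w in weights]


def target_mass(thetas, target_index=0):
    """Mass which the induced measure puts on the target.

    The target is missed only if every independent query X_k ~ theta_k
    misses it, so the mass is 1 - prod_k (1 - theta_k(T)).
    """
    miss = 1.0
    for theta in thetas:
        miss *= 1.0 - theta[target_index]
    return 1.0 - miss


rng = random.Random(2013)
thetas = [random_measure(4, rng) for _ in range(200)]
masses = {n: target_mass(thetas[:n]) for n in (1, 5, 20, 50, 200)}

for n, mass in masses.items():
    print(f"n = {n:3d}: induced mass on T = {mass:.6f}")
```

The mass on the target can only grow with the number of queries, which is why the induced measures cannot stay uniformly distributed on $M(\Omega)$.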

Please show all your work for full credit...

2013-07-03T15:38:00.000-07:00

(I promised a chapter-by-chapter critique of "A General Theory of Information Cost Incurred by Successful Search". This is quite tedious work, so I wanted to make this little point up front - for full reading pleasure you have to be acquainted with (some of) the definitions used in the paper.)

(Nota Bene: In a reply to this post, Winston Ewert wrote on Uncommon Descent: "Dieb objects that the quasi-Bayesian calculation on Page 56 is incorrect, although it obtains the correct result. However, the calculation is called a quasi-Bayesian calculation because it engages in hand-waving rather than presenting a rigorous proof. The text in question is shortly after a theorem and is intended to explicate the consequences of that theorem rather than rigorously prove its result. The calculation is not incorrect, but rather deliberately oversimplified.")

On page 55 of their article "A General Theory of Information Cost Incurred by Successful Search" (free download as pdf), the authors W. A. Dembski, W. Ewert and R. J. Marks II (in future I'll refer to them as DEM) write:

To see how the probability costs associated with null and alternative searches relate, it is instructive to consider the following two quasi-Bayesian ways of reckoning these costs:
$\mathbf{P}$(locating $T$ via null search) = $\mathbf{P}$(null search locates $T$ & null search is available)

= $\mathbf{P}$(null search locates $T$ | null search is avail.) × $\mathbf{P}$(null search is avail.)

= $\mathbf{U}(T) \times 1$ [because the availability of null search is taken for granted]

= $p$.

$\mathbf{P}$(locating $T$ via alt. search) = $\mathbf{P}$(alt. search locates $T$ & alt. search is available)

= $\mathbf{P}$(alt. search locates $T$ | alt. search is avail.) × $\mathbf{P}$(alt. search is avail.)

= $\mu(T) \times \overline{\mathbf{U}}(\overline{T}_q)$

$\le q\,\times\,p/q$

= $p$.

I have no problems with the results - at least if we can assume that the uniform measure is apt to be used on $\mathbf{M}(\Omega)$. But the equations seem to be a little bit fishy. Let me explain what I mean, using the simplest setting possible: Let $\Omega = \{0,1\}$, a set with two elements, and let $T=\{1\}$ be our target. Then $\mathbf{M}(\Omega)$ can be represented by the interval $[0;1]$: for $x \in [0;1]$, $\mu_x = (1-x) \delta_0 + x \delta_1$ is the measure with $\mu_x(\{1\}) = x$. We can even introduce an associated search $S_x := S(\mu_x)$, which is in fact just a single guess on $\Omega$ distributed according to $\mu_x$. Ergo $\mathbf{E}(S_x) = x$. Now we can perform an experiment in two steps:

Choose a measure $\mu_x \in \mathbf{M}(\Omega)$ at random.
Try to locate $T$ using $S_x$.

This experiment can be represented by choosing $(X,Y)$ on $[0;1] \times [0;1]$ according to the uniform distribution on the unit square: we draw a number $x$, which represents our measure, then a number $y$: if $y \le x$, we have located our target using $S_x$, otherwise not.

The picture displays the situation and allows us to answer some questions easily:

What's the probability of locating our target using the process above? Well, it's $p = 1/2$, represented by the whole green area.
For a fixed $q$, what is the probability of choosing exactly $\mu_q$ and finding our target? That's zero (or nil): the red line symbolizes this event, which is a null set.
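The two-step experiment is easy to simulate. The following is a minimal sketch, assuming the two-element $\Omega$ and the uniform choice of $x$ described above; the function name `run_experiment`, the seed and the sample size are arbitrary choices of mine.

```python
import random


def run_experiment(rng):
    """One run: choose mu_x uniformly, then make a single guess with S_x."""
    x = rng.random()   # the measure mu_x, i.e. mu_x({1}) = x
    y = rng.random()   # auxiliary uniform number which decides the guess
    return x, y <= x   # the guess locates the target iff y <= x


rng = random.Random(0)
runs = [run_experiment(rng) for _ in range(200_000)]
p_estimate = sum(found for _, found in runs) / len(runs)

print(f"estimated p = {p_estimate:.3f}")
```

The estimate should come out close to $p = 1/2$, the area below the diagonal of the unit square.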

Dembski, Ewert, and Marks (DEM) obviously don't want this; that's why they don't look at $\{\theta \in \mathbf{M}(\Omega) \mid \theta(T) = q \}$, but at $\overline{T}_q = \{\theta \in \mathbf{M}(\Omega) \mid \theta(T) \ge q \}$ (pp. 53-54).

What's the probability of choosing a measure for which the associated guess finds the target with a probability of at least $q$, i.e., $\overline{\mathbf{U}}(\overline{T}_q)$? That would be $1-q$, which is easy to see in our case, but much more difficult to calculate for more complicated arrangements. DEM give an upper bound of $p/q$ for this probability.
What is the probability of finding our target when we have chosen a measure in $\overline{T}_q$? That depends on the measure, but it is at least $q$. On average, it is $\frac{1+q}{2}$: we get this by examining the darker green area...
What is the probability of choosing a measure in $\overline{T}_q$ and finding the target? This is given by the darker green area, ergo $(1-q)\frac{1+q}{2}=\frac{1-q^2}{2}$.
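The three answers can be checked numerically. Here is a minimal sketch under the same assumptions as before (two-element $\Omega$, uniform choice of $x$); the value $q = 0.6$, the seed and the function name `estimate` are arbitrary choices of mine for illustration.

```python
import random


def estimate(q, samples=200_000, seed=1):
    """Estimate the three probabilities discussed above for a given q."""
    rng = random.Random(seed)
    chosen = 0   # runs with x >= q, i.e. mu_x lies in T_q
    joint = 0    # runs with x >= q in which the target was also found
    for _ in range(samples):
        x, y = rng.random(), rng.random()
        if x >= q:
            chosen += 1
            if y <= x:
                joint += 1
    return chosen / samples, joint / chosen, joint / samples


q = 0.6
prob_chosen, prob_found_given_chosen, prob_joint = estimate(q)

print(f"P(measure in T_q)           ~ {prob_chosen:.3f}  (1 - q       = {1 - q:.3f})")
print(f"P(found | measure in T_q)   ~ {prob_found_given_chosen:.3f}  ((1 + q)/2   = {(1 + q) / 2:.3f})")
print(f"P(measure in T_q and found) ~ {prob_joint:.3f}  ((1 - q^2)/2 = {(1 - q * q) / 2:.3f})")
```

Note that the conditional probability of success sits noticeably above $q$, which is the crux of the objection that follows.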

Now, the darker green area will always be smaller than the whole green area, not only in this simple example, but in all others, too. Therefore the statement $$\mathbf{P}\text{(locating }T\text{ via alt. search}) \le p$$ is absolutely (and trivially) correct, as $\mathbf{P}\text{(locating }T\text{ via alt. search})$ is the probability of choosing an element of $\overline{T}_q$ and finding the target using that element. But there is a problem in the inequality $$\mu(T) \times \overline{\mathbf{U}}(\overline{T}_q) \le q\,\times\,p/q.$$ While $\overline{\mathbf{U}}(\overline{T}_q) \le p/q$, we find that $\mu(T) \ge q$: a measure taken from $\overline{T}_q$ will result in a search which finds the target with a probability of at least $q$. Above, we have seen that the probability is on average $\frac{1+q}{2} > q$. The two bounds point in opposite directions, so from them alone we cannot say anything about the size of $\mu(T) \times \overline{\mathbf{U}}(\overline{T}_q)$! The shaded area in the picture shows $q\,\times\,p/q$: it has nothing to do with the probabilities which one can see so neatly in the graphic, it just happens to have the right area of $p$...
BTW: I don't think that we can split $\mathbf{P}$(alt. search locates $T$ & alt. search is available) neatly into the product used by DEM; a little integration would be necessary...
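For the two-element example, that integration can be sketched as follows. This is my own working, not DEM's, and it assumes that the uniform measure on $[0;1]$ stands in for $\overline{\mathbf{U}}$:

$$\begin{aligned}
\mathbf{P}(\mu_x \in \overline{T}_q \text{ and } S_x \text{ locates } T) &= \int_q^1 \mu_x(T)\,dx = \int_q^1 x\,dx = \frac{1-q^2}{2}, \\
\overline{\mathbf{U}}(\overline{T}_q) &= \int_q^1 dx = 1-q, \\
\frac{1-q^2}{2} &= \frac{1+q}{2} \times (1-q) \le \frac{1}{2} = p.
\end{aligned}$$

The factor $\frac{1+q}{2}$ is the average of $\mu_x(T)$ over $\overline{T}_q$, not a single value $\mu(T)$, and it is bounded from below by $q$, not from above.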
So, I wouldn't give full marks for this exercise, but perhaps I'm wrong?

Review of "A General Theory of Information Cost Incurred by Successful Search" - Introduction

2013-06-23T23:26:00.003-07:00

(For some background information, go here)

There are two main ways to apply mathematics: the first is to shed light on a subject and look for a deeper understanding, the second just wants to create the impression that something important is happening somehow. After looking into the article "A General Theory of Information Cost Incurred by Successful Search" (free download as pdf) I became convinced that the authors are following the second path.

The abstract states:

This paper provides a general framework for understanding targeted search. It begins by defining the search matrix, which makes explicit the sources of information that can affect search progress. The search matrix enables a search to be represented as a probability measure on the original search space. This representation facilitates tracking the information cost incurred by successful search (success being defined as finding the target). To categorize such costs, various information and efficiency measures are defined, notably, active information. Conservation of information characterizes these costs and is precisely formulated via two theorems, one restricted (proved in previous work of ours), the other general (proved for the first time here). The restricted version assumes a uniform probability search baseline, the general, an arbitrary probability search baseline. When a search with probability q of success displaces a baseline search with probability p of success where q > p, conservation of information states that raising the probability of successful search by a factor of q/p(>1) incurs an information cost of at least log(q/p). Conservation of information shows that information, like money, obeys strict accounting principles.
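To put a number on the abstract's central claim, here is a minimal sketch of the quantity $\log(q/p)$. Taking the logarithm to base 2, so that the cost is measured in bits, is my assumption; the function name `active_information` and the example probabilities are likewise only illustrative.

```python
import math


def active_information(p: float, q: float) -> float:
    """log2(q/p): the claimed minimal information cost, in bits (base 2 assumed)."""
    if not (0.0 < p <= 1.0 and 0.0 < q <= 1.0):
        raise ValueError("p and q must be probabilities in (0, 1]")
    return math.log2(q / p)


# Hypothetical example: a baseline search succeeds with p = 1/16,
# an alternative search with q = 1/2.
p, q = 1 / 16, 1 / 2
cost_bits = active_information(p, q)
print(f"raising {p} to {q} costs at least {cost_bits} bits")
```

Whether this bound says anything about actual searches depends entirely on whether searches can be represented as measures in the first place.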

The general framework is introduced on pp. 26-38. In my next post, I'll try to relate it to the usual definitions, but I fail to see how this new framework significantly improves on, e.g., the ideas of David Wolpert and William G. Macready (NFLT at Wikipedia). Pp. 38-45 provide examples, interestingly without applying the new framework to them. Then follow a couple of pages with sound math (pp. 45-61); it is just not clear what they have to do with the claims the authors are making. For their mathematics to work, they have to show that searches can be represented as measures. Indeed, the authors write:

"This representation will be essential throughout the sequel." (p. 37)

I will elaborate how I think that the authors failed to do so, and that the "representation" is at least a misnomer... Another point will be the subject of "Information Cost": this term isn't defined in the paper...

The Ithaca Papers

2013-06-23T13:02:00.000-07:00

William A. Dembski announces in his CV/Resumé on his web site Design Inference - Education in Culture and Worldview some books which are still in preparation. Top of the list is

Biological Information: New Perspectives (co-edited with Robert J. Marks II, John Sanford, Michael Behe, and Bruce Gordon). Under contract with Springer Verlag.

Well, rejoice, the electronic version of this book has been published (and is free to download!), and the hard copy is announced for August 2013. Although the publisher switched from Springer to World Scientific, the announcement hasn't changed:

In the spring of 2011, a diverse group of scientists gathered at Cornell University to discuss their research into the nature and origin of biological information. This symposium brought together experts in information theory, computer science, numerical simulation, thermodynamics, evolutionary theory, whole organism biology, developmental biology, molecular biology, genetics, physics, biophysics, mathematics, and linguistics. This volume presents new research by those invited to speak at the conference.

While the publication of Stephen C. Meyer's new book Darwin's Doubt is hailed with great fanfare at the Discovery Institute's news-outlet Evolution News, the appearance of this volume hasn't made their news yet - though Dembski and Meyer are both fellows of the Discovery Institute's Center for Science and Culture (granted, Meyer is its director). Only at Dembski's (former) blog, Uncommon Descent, there are two posts about the book:

Instantly, there arose a discussion about Denyse O'Leary's (commenting under the nom de guerre "News") choice of title, where the usual combatants switched sides: the evolutionists claimed the title was designed to mislead the average reader into thinking that Cornell University was somehow involved in the conference, while the apologists of Intelligent Design argued that this was just chance. Unfortunately, no one answered my comment:

In the interest of discussing the data and the evidence, could we have posts on various articles of the book? I’d be quite interested in a thread on Chapter 1.1.2 “A General Theory of Information Cost Incurred by Successful Search” by William A. Dembski, Winston Ewert and Robert J. Marks II.
I hope that the authors are still reading this blog: this way, we could have a productive discussion, and perhaps some questions could be answered by the people involved!
And for the sake of a swift exchange of ideas: could someone please release me from the moderation queue?

Maybe there is no interest in such a discussion at Uncommon Descent. Maybe no one read the comment - it was held in the moderation queue for five days, and when it appeared, the article was no longer on the front page. Therefore I'll start a number of posts on “A General Theory of Information Cost Incurred by Successful Search” here at my blog: I just can't believe that this peer-edited article would have been successfully peer-reviewed by Springer....