Author

Oren Bochman

Published

Tuesday, September 17, 2024

Keywords

language evolution, signaling systems, compositionality, lewis signaling game, partial pooling equilibria, categories of signals, morphology, bayesian game, types, knights and knaves, partially observable markov decision process, POMDP, error detection, error correction, source coding, compression, easy decoding, logic, information theory, entropy, marco baroni, deep networks, neural networks, generalization, reinforcement learning, transfer learning, capsules, entanglement

I was introduced to the subject of language evolution by Brian Skyrms's book “Signals: Evolution, Learning, and Information”, in which he discusses the evolution of signaling systems and the emergence of language. He treats compositionality as a key feature of human language and examines its role in that emergence. Signals offers a coherent yet multifaceted view of the problem: philosophy, and the creation and assimilation of signaling systems via evolution or reinforcement learning. Skyrms also considers logic and complex signaling systems. A unifying theme of the work is its reductionist stance: he tries to reduce each problem to a model that closely follows the Lewis signaling game.

I like this reductionist approach, but I also like to turn it on its head. Looking at how the problem takes form in more challenging and realistic settings can often uncover its true nature. Since language emergence is so open-ended, one might also use it to consider how language empowers agents to coordinate on better decisions in ever more challenging problems and settings.

I first became frustrated with complex signaling systems when I read the relevant chapters in Signals and realized that, unlike in the other chapters, Skyrms had not summarized how researchers in the field had arrived at a definitive solution to the problem. I reread them a couple of times and finally realized that, although he makes some very interesting claims, the topic is still unresolved. There are many interesting results, but there are at least as many open questions.

The second time I became frustrated was when I tried to convert the simple signaling RL games into complex ones. The material in the book alone has versions with multiple agents signaling in parallel, one agent signaling without sequence, and agents signaling in sequence. The book also hints at cases where agents may make mistakes, and suggests that this is important for the evolution of signaling systems.

I was also coming across more and more research, not covered in the book, that looks at morphology and syntax in the emergence of language. Furthermore, people were using deep learning to overcome the Lewis signaling game's inability to arrive at decoders for complex signals.

At this point I realized that there might be three problems that are being conflated in nature and that we might want to consider them separately as well as together.

  1. the coordination problem - how agents learn a common convention for signaling, and what the most effective form of the solution is.
  2. the serialization problem - how the medium, e.g. a noisy channel, can introduce additional desirable constraints such as shorter signals, saliency, and early decoding (compression, error detection and correction, easy decoding, signal distributions). This problem is one that a decision tree solves, but the different options for the setting lead to different optimal solutions. These differences are hidden by the symmetric form of the rewards in the Lewis game.
  3. the signal composition problem - given a simple signaling system and an encoder-decoder for the channel, how can we add aggregation to the signaling system to make it more efficient (more expressive, easier to learn, easier to extend, more robust to different errors)?
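To make the first of these problems concrete, here is a minimal sketch of the coordination problem as a plain Lewis signaling game learned with Roth-Erev (urn) reinforcement, one common choice of learning rule for this game. The function names, the initial urn weights of 1.0, the uniform state distribution, and the round count are all assumptions made for illustration, not a reference implementation.

```python
import random

def train_lewis_game(n_states=2, n_rounds=20000, seed=0):
    """Roth-Erev (urn) learning in a Lewis game with n states, signals and actions."""
    rng = random.Random(seed)
    options = range(n_states)
    # sender_urns[state][signal] and receiver_urns[signal][action] hold ball counts.
    # Assumption: every urn starts with one ball per option.
    sender_urns = [[1.0] * n_states for _ in options]
    receiver_urns = [[1.0] * n_states for _ in options]
    for _ in range(n_rounds):
        state = rng.randrange(n_states)  # nature picks a state uniformly
        signal = rng.choices(options, weights=sender_urns[state])[0]
        action = rng.choices(options, weights=receiver_urns[signal])[0]
        if action == state:
            # Symmetric reward: both agents reinforce the choice that just worked.
            sender_urns[state][signal] += 1.0
            receiver_urns[signal][action] += 1.0
    return sender_urns, receiver_urns

def success_rate(sender_urns, receiver_urns):
    """Expected probability that the action matches the state (uniform states)."""
    n = len(sender_urns)
    total = 0.0
    for state in range(n):
        sender_total = sum(sender_urns[state])
        for signal in range(n):
            p_signal = sender_urns[state][signal] / sender_total
            p_action = receiver_urns[signal][state] / sum(receiver_urns[signal])
            total += p_signal * p_action
    return total / n

if __name__ == "__main__":
    sender, receiver = train_lewis_game()
    print(f"expected success: {success_rate(sender, receiver):.3f}")
```

Note that the reward only checks whether the action matched the state. Signal length, channel noise, and decoding cost never enter it, which is why the serialization and composition problems have to be modeled separately. With more than two states this kind of learner can also settle into a partial pooling equilibrium rather than a full signaling system.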

This might help answer questions like: why does English use just 39-44 phonemes, when we could have languages making full use of the human phoneme inventory (600 consonants and 200 vowels)?
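A toy counting argument hints at why a small inventory can be enough. The sketch below computes the shortest fixed-length signal that can distinguish a given number of meanings for a given alphabet size. The lexicon size of 100,000 is a hypothetical figure chosen only for illustration, and the function name is my own.

```python
def min_signal_length(n_meanings, alphabet_size):
    """Smallest length L such that alphabet_size ** L >= n_meanings."""
    length = 1
    while alphabet_size ** length < n_meanings:
        length += 1
    return length

# Hypothetical lexicon of 100,000 meanings, encoded with fixed-length signals.
for alphabet_size in (2, 40, 800):
    print(alphabet_size, min_signal_length(100_000, alphabet_size))
# prints:
# 2 17
# 40 4
# 800 2
```

Going from 40 to 800 symbols only shortens the signal from 4 symbols to 2, while the receiver has to discriminate between twenty times as many symbols over a possibly noisy channel. This ignores variable-length codes, phonotactics, and everything else that matters in real languages, so it is a hint at the trade-off rather than an answer.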

What became apparent to me is that the nature of a complex signaling system depends very much on the game being played by the agents.

Metrics:

has an extensive bibliography and I have been following up on some of the references.

This is a quick summary of a talk by Marco Baroni on the topic of compositionality in language. In it he outlines some of his own work, and that of his colleagues and students, on the topic, along with the conclusions he has drawn from it.

Citation

BibTeX citation:
@online{bochman2024,
  author = {Bochman, Oren},
  title = {Rethinking {Signaling} Systems via the Lens of
    Compositionality},
  date = {2024-09-17},
  url = {https://orenbochman.github.io/posts/2024/2024-10-10-marco-baoni-composionality/lewis.html},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2024. “Rethinking Signaling Systems via the Lens of Compositionality.” September 17, 2024. https://orenbochman.github.io/posts/2024/2024-10-10-marco-baoni-composionality/lewis.html.