Search Results for author: Emil Carlsson

Found 7 papers, 1 papers with code

Active Preference Learning for Ordering Items In- and Out-of-sample

no code implementations5 May 2024 Herman Bergström, Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

Learning an ordering of items based on noisy pairwise comparisons is useful when item-specific labels are difficult to assign, for example, when annotators have to make subjective assessments.

Active Learning

Pure Exploration in Bandits with Linear Constraints

1 code implementation22 Jun 2023 Emil Carlsson, Debabrota Basu, Fredrik D. Johansson, Devdatt Dubhashi

Both these algorithms try to track an optimal allocation based on the lower bound and computed by a weighted projection onto the boundary of a normal cone.

Cultural evolution via iterated learning and communication explains efficient color naming systems

no code implementations17 May 2023 Emil Carlsson, Devdatt Dubhashi, Terry Regier

It has been argued that semantic systems reflect pressure for efficiency, and a current debate concerns the cultural evolutionary process that produces this pattern.

Pragmatic Reasoning in Structured Signaling Games

no code implementations17 May 2023 Emil Carlsson, Devdatt Dubhashi

In this work we introduce a structured signaling game, an extension of the classical signaling game with a similarity structure between meanings in the context, along with a variant of the Rational Speech Act (RSA) framework which we call structured-RSA (sRSA) for pragmatic reasoning in structured domains.

Multi-agent Reinforcement Learning reinforcement-learning

Towards Learning Abstractions via Reinforcement Learning

no code implementations28 Dec 2022 Erik Jergéus, Leo Karlsson Oinonen, Emil Carlsson, Moa Johansson

In this paper we take the first steps in studying a new approach to synthesis of efficient communication schemes in multi-agent systems, trained via reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Thompson Sampling for Bandits with Clustered Arms

no code implementations6 Sep 2021 Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

We propose algorithms based on a multi-level Thompson sampling scheme, for the stochastic multi-armed bandit and its contextual variant with linear expected rewards, in the setting where arms are clustered.

Clustering Thompson Sampling

Learning Approximate and Exact Numeral Systems via Reinforcement Learning

no code implementations28 May 2021 Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

The agents gradually learn to communicate using reinforcement learning and the resulting numeral systems are shown to be efficient in the information-theoretic framework of Regier et al. (2015); Gibson et al. (2017).

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.