Michael Littman

Navedeno

	Vse	Od leta 2019
Navedbe	59158	22260
indeks h	94	62
indeks i10	248	166

4900

2450

1225

3675

19961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024219 335 480 492 594 644 833 1089 1265 1510 1497 1856 1849 1939 2076 2122 2244 2350 2214 2254 2290 2476 2980 3399 4002 4204 4342 4886 1414

Javni dostop

Prikaži vse

49 člankov

0 člankov

na voljo

ni na voljo

Na podlagi zahtev v povezavi s financiranjem

Spremljaj

Michael Littman

Brown University

Preverjeni e-poštni naslov na brown.edu - Domača stran

reinforcement learning machine learning artificial intelligence


Naslov Razvrsti po navedbah Razvrsti po letniku Razvrsti po naslovu	Navedeno Navedeno	Leto
Reinforcement learning: A survey LP Kaelbling, ML Littman, AW Moore Journal of artificial intelligence research 4, 237-285, 1996	11479	1996
Planning and acting in partially observable stochastic domains LP Kaelbling, ML Littman, AR Cassandra Artificial intelligence 101 (1-2), 99-134, 1998	5585	1998
Markov games as a framework for multi-agent reinforcement learning ML Littman Machine learning proceedings 1994, 157-163, 1994	3922	1994
Measuring praise and criticism: Inference of semantic orientation from association PD Turney, ML Littman acm Transactions on Information Systems (tois) 21 (4), 315-346, 2003	2385	2003
Activity recognition from accelerometer data N Ravi, N Dandekar, P Mysore, ML Littman Aaai 5 (2005), 1541-1546, 2005	2266	2005
Packet routing in dynamically changing networks: A reinforcement learning approach J Boyan, M Littman Advances in neural information processing systems 6, 1993	1191	1993
Learning policies for partially observable environments: Scaling up ML Littman, AR Cassandra, LP Kaelbling Machine Learning Proceedings 1995, 362-370, 1995	1006	1995
Acting optimally in partially observable stochastic domains AR Cassandra, LP Kaelbling, ML Littman Aaai 94, 1023-1028, 1994	1004	1994
Convergence results for single-step on-policy reinforcement-learning algorithms S Singh, T Jaakkola, ML Littman, C Szepesvári Machine learning 38, 287-308, 2000	989	2000
Friend-or-foe Q-learning in general-sum games ML Littman ICML 1 (2001), 322-328, 2001	926	2001
Graphical models for game theory M Kearns, ML Littman, S Singh arXiv preprint arXiv:1301.2281, 2013	811	2013
On the complexity of solving Markov decision problems ML Littman, TL Dean, LP Kaelbling arXiv preprint arXiv:1302.4971, 2013	748	2013
Interactions between learning and evolution D Ackley, M Littman Artificial life II 10, 487-509, 1991	735	1991
Predictive representations of state M Littman, RS Sutton Advances in neural information processing systems 14, 2001	715	2001
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes AR Cassandra, ML Littman, NL Zhang arXiv preprint arXiv:1302.1525, 2013	683	2013
Computerized cross-language document retrieval using latent semantic indexing TK Landauer, ML Littman US Patent 5,301,109, 1994	641	1994
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006	622	2006
An analysis of model-based interval estimation for Markov decision processes AL Strehl, ML Littman Journal of Computer and System Sciences 74 (8), 1309-1331, 2008	611	2008
Algorithms for sequential decision-making ML Littman Brown University, 1996	590	1996
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006	582	2006

Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.

Članki 1–20

Št. navedb na leto

Podvojene navedbe

Združene navedbe

Dodajanje soavtorjevSoavtorji

Spremljaj

Navedeno