Mathematics - Microsoft Research

Video

Counterfactual Multi-Agent Policy Gradients

July 6, 2017 | Shimon Whiteson

Many real-world problems, such as network packet routing and the coordination of autonomous vehicles, are naturally modelled as cooperative multi-agent systems. In this talk, I give an overview of some of the key challenges in developing reinforcement learning…

0:48:42

Video

Policy Gradient Methods: Tutorial and New Frontiers

July 3, 2017 | John Schulman

In this tutorial we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion,…

1:09:15

Publication

Microstructural transition in an ordered set of magnetic spheres immersed in a carrier liquid

R.G. Gontijo, Sara Malvar

Mechanics Research Communications | June 2017, Vol 83: pp. 12-17

Publication

Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU

Zeyuan Allen-Zhu, Yuanzhi Li

June 2017

Publication

Foundations of Data Science

Avrim Blum, John Hopcroft, Ravi Kannan

June 2017

Video

Life without CONS

June 5, 2017 | Neil D. Jones

Can higher-order functional programs solve more problems than first-order programs? Answer: NO, since both program classes are Turing complete. The reason is that higher-order values can be simulated by first-order values: use function “closures” built…

1:03:19

Video

Streaming Lower Bounds for Approximating MAX-CUT

April 17, 2017 | Michael Kapralov

We consider the problem of estimating the value of MAX-CUT in a graph in the streaming model of computation. We show that there exists a constant $\epsilon_* > 0$ such that any randomized streaming algorithm…

1:03:31

Video

Information-Performance Tradeoffs in Control

April 13, 2017 | Victoria Kostina

Consider a flying drone controlled from the ground by an observer who communicates with it via wireless. We are interested in how well the drone can be controlled via a channel that accepts r bits/sec.…

1:01:52

Publication

Local Max-Cut In Smoothed Polynomial Time

Omer Angel, Sébastien Bubeck, Yuval Peres, Fan Wei

April 2017

Publication

Assessing Percolation Threshold Based on High-Order Non-Backtracking Matrices

Yuan Lin, Wei Chen, Zhongzhi Zhang

26th International World Wide Web Conference (WWW’2017), Perth, Australia | April 2017