Abhishek Gupta

chapter

Learning modular neural network policies for multi-task and multi-robot transfer

Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, et al.

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2169 - 2176

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general-purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy...

chapter

Information structures and values in zero-sum stochastic games

Ashutosh Nayyar, Abhishek Gupta

2017 American Control Conference (ACC) > 3658 - 3663

We consider a zero-sum stochastic game where two players have a common observation of a global state, and each player makes a private observation of its local state at every time step. This asymmetry of information among the players makes it difficult to compute the equilibrium cost (called the value of the zero-sum game). To help us determine the value of such a game, we first consider a game...

chapter

Privacy-aware stochastic control with a “snoopy” adversary: A game-theoretic approach

Abhishek Gupta

2016 Annual Conference on Information Science and Systems (CISS) > 187 - 191

We consider a scenario in which a controller and an adversary dynamically act on a system over a finite or infinite horizon. The controller and the adversary do not want to reveal their actions to each other, and at the same time, the controller acts to minimize an expected cost, and the adversary acts to maximize it. We model this scenario as a dynamic zero-sum game, prove the existence of a unique...

chapter

Equilibria in two-stage electricity markets

Abhishek Gupta, Rahul Jain, Kameshwar Poolla, Pravin Varaiya

2015 54th IEEE Conference on Decision and Control (CDC) > 5833 - 5838

Most electricity markets have multiple stages, which include one or more forward markets and the spot market. We consider two stages - a day-ahead market and a real-time market. We study equilibrium outcomes in such markets assuming demand to be deterministic. We show via counterexamples that in such two-stage electricity markets, (i) a Nash equilibrium may not exist, or (ii) there may be multiple...

chapter

A three-stage Colonel Blotto game with applications to cyberphysical security

Abhishek Gupta, Galina Schwartz, Cedric Langbort, S. Shankar Sastry, et al.

2014 American Control Conference > 3820 - 3825

2014 American Control Conference - ACC 2014

We consider a three-step three-player complete information Colonel Blotto game in this paper, in which the first two players fight against a common adversary. Each player is endowed with a certain amount of resources at the beginning of the game, and the number of battlefields on which a player and the adversary fight is specified. The first two players are allowed to form a coalition if it improves...

chapter

A dynamic transmitter-jammer game with asymmetric information

Abhishek Gupta, Ashutosh Nayyar, Cedric Langbort, Tamer Basar

2012 IEEE 51st IEEE Conference on Decision and Control (CDC) > 6477 - 6482

2012 IEEE 51st Annual Conference on Decision and Control (CDC)

We consider a jamming attack on a transmitter-receiver pair, in which the transmitter wants to transmit the state of an i.i.d. Gaussian process across an unsecured communication channel to the receiver while minimizing its cost functional. The transmitter decides whether or not to transmit the current state of the random process. The jammer disrupts the transmission on the channel strategically in...

chapter

One-stage control over an adversarial channel with finite codelength

Abhishek Gupta, Cedric Langbort, Tamer Basar

IEEE Conference on Decision and Control and European Control Conference > 4072 - 4077

2011 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC 2011)

We consider a model of stealthy attack on a networked control system by formulating a static zero-sum game among four players. Three of the players constitute a team of encoder, decoder and controller for a scalar discrete-time linear plant, while the fourth player is a jammer, who acts to flip the bits of the binary encoded observation signal of the communication channel between the plant and the...

INFONA - science communication portal

Search results for: Abhishek Gupta
