
Learning Physically-Instantiated Game Play Through Visual Observation

Andrei Barbu, Siddharth Narayanaswamy, and Jeffrey Mark Siskind


School of Electrical and Computer Engineering
Introduction

- goal is to emulate a 2-year-old child

Task

- an integrated robotics system for learning to play board games
- learn a full set of rules; learn to play, not to play well
- learn the initial board, the legal-move generator, and the outcome predicate
- two robots play a board game, while a third watches and takes over
- fully automatic, with no human intervention
- no communication between the robots

Results

- reliable, robust operation was achieved over 62 games and approximately 2000 pick-up and put-down operations, with fewer than 20 interventions
- robotic manipulation is based on dead reckoning, owing to the non-linear relationship between servo control and angular position
- error detection by interleaving manipulation with visual reconstruction of board states
- learned Hexapawn rule set (interpreted in the sketch after the listing):

initial_board([[x,x,x],[none,none,none],[o,o,o]], player_x).
legal_move(A,B,C) :- row(D), col(E), owns(A,F), empty(G),
    forward(A,H,D), at(H,E,B,F,I), at(H,E,C,G,J),
    at(D,E,B,G,K), at(D,E,C,F,L), frame_obj(I,K,J,L,B,C).
legal_move(A,B,C) :- row(D), col(E), opponent(A,F),
    owns(A,G), empty(H), forward(A,I,D), owns(F,J),
    sideways(E,K), at(D,K,C,G,L), at(I,E,B,G,M),
    at(I,E,C,H,N), at(D,K,B,J,O), frame_obj(L,N,O,M,C,B).
outcome(A,B,C) :- row(D), opponent(A,E), forward(E,D,F),
    forward(E,F,G), owns_outcome(E,C), owns_piece(C,H),
    at(G,I,B,H,J).
outcome(A,B,C) :- opponent(A,D), has_no_move(A,B),
    owns_outcome(D,C).
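
These clauses can be read as follows: the first legal_move clause describes a one-square forward move into an empty square; the second a forward diagonal capture of an opponent's piece; the first outcome clause a win by queening (a piece reaching the far row); and the second a win when the player to move has no legal move. As a purely illustrative sketch (not part of the poster), and assuming the background predicates referenced above (at/5, frame_obj/6, and so on) are defined, the learned rules could be queried from standard Prolog as follows; possible_moves/3 and game_over/3 are hypothetical helper names:

% Illustrative only.  Argument order follows a reading of the learned
% clauses: legal_move(Player, BoardBefore, BoardAfter) and
% outcome(Player, Board, Outcome).

% All boards reachable in one legal move by Player from Board.
possible_moves(Player, Board, Successors) :-
    findall(After, legal_move(Player, Board, After), Successors).

% Succeeds when the game has ended; Outcome names the winner.
game_over(Player, Board, Outcome) :-
    outcome(Player, Board, Outcome).

For example, possible_moves(player_x, [[x,x,x],[none,none,none],[o,o,o]], Moves) would enumerate player_x's opening moves.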

Why games, and why learning?


- games are an idealized version of the real world
- most AI cannot deal with real-world complexity
- children learn from observation
- not only when rules are unavailable, cf. Social Learning Theory
- have you read the rules for the board games you've played?


Experimental setup
[Figure: two robots, the protagonist and the antagonist, play the game; a third robot, the wannabe, watches and then takes over as a player.]

Games
- off-the-shelf game hardware, but judiciously chosen to simplify robotic manipulation
- depressions in the board provide for easy piece placement
- large, easy-to-grab pieces
- Tic-Tac-Toe with standard rules learned
- Hexapawn: three pawns on opposing sides; win by queening, capture, or forcing an inability to move
- learned 5 variants of Hexapawn: regular, forward diagonal moves, forward and backward diagonal moves, vertical backward moves, vertical backward and sideways moves


Robots
- custom robots with a 4-DOF arm, two fingers, and two eyes
- eyes on a 1-DOF pendulum arm that rotates around the game
- each eye can pan/tilt independently
- mounted in a custom housing
- parts primarily from Lynxmotion, enhanced with custom parts to provide greater support for the arm and eyes and increased efficacy of operation
Computer Vision
- reconstruct the game state from visual information
- must detect the board itself; this is a calibration step done once at startup, where 9 ellipses arranged in a grid are found
- the OpenCV ellipse finder is used with multiple thresholds and voting in order to detect Xs, Os, and empty board positions (see the sketch below)
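
To make the hand-off from vision to the rule representation concrete, here is a purely illustrative sketch (the poster's detector is built on OpenCV; this is not its implementation) of two steps written in Prolog: majority voting over the labels produced at different thresholds for one square, and assembling nine per-square labels into the board term used by the learned rules. The names vote_label/2, run_counts/2, count_run/5, and reconstruct_board/2 are hypothetical:

% Illustrative only: majority vote over the labels (x, o, or none)
% obtained at different thresholds for a single board square.
vote_label(Labels, Label) :-
    msort(Labels, Sorted),                 % sort, keeping duplicates
    run_counts(Sorted, Counts),            % e.g. [none-1, x-4]
    sort(2, @>=, Counts, [Label-_ | _]).   % most frequent first (sort/4 as in SWI-Prolog)

% Count consecutive runs in a sorted list: [o,x,x] -> [o-1, x-2].
run_counts([], []).
run_counts([X|Xs], [X-N|Rest]) :-
    count_run(X, Xs, 1, N, Tail),
    run_counts(Tail, Rest).

count_run(X, [X|Xs], N0, N, Tail) :- !, N1 is N0 + 1, count_run(X, Xs, N1, N, Tail).
count_run(_, Tail, N, N, Tail).

% Nine per-square labels in row-major order become the board term,
% e.g. [[x,x,x],[none,none,none],[o,o,o]].
reconstruct_board([A,B,C, D,E,F, G,H,I], [[A,B,C],[D,E,F],[G,H,I]]).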

- learned similar rule sets for all 6 games
Lincoln Logs & language



- assembly task using assembly toys, e.g. Lincoln Logs
- novel computer-vision system to recognize block assemblies from grammars by extracting features, fitting them to known grammars, and searching for implied necessary or possible blocks
- novel language component to describe assemblies in terms of walls and windows, and reconstruct the same structure out of different assembly toys
- more advanced robotic system, with custom grippers, farther reach, tactile sensors, and a palm-mounted camera
- more robust robotic manipulation using visual servoing

Rules
- Progol, an inductive-logic-programming (ILP) system, is used to learn the initial board, the legal-move generator, and the outcome predicate
- rules are represented as logical formulae, i.e. Horn clauses
- learned rules are then directly executed with Prolog
- the system is given background knowledge about the world, such as: a board exists, pieces can be on the board, players own pieces, the concepts of linearity, forwards, and sideways, and the frame axiom (a sketch of what such facts might look like follows this list)
- the background knowledge is of the type any child would have
- search the space of possible initial boards, legal-move generators, and outcome predicates for 3×3 games, given the evidence of n games, and find the most compressed rule set which best explains the observed games
- learn the initial board first, then the legal-move generator, and finally the outcome predicate
- use the previously acquired knowledge to learn the next item
- can learn a full game description in a modest number of games, typically 3–6
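
For concreteness, here is a minimal, purely illustrative sketch of what such background facts might look like for a 3×3 board, written against the piece and player names appearing in the learned rule set above; the poster does not list the actual definitions, and at/5, frame_obj/6, owns_outcome/2, owns_piece/2, and has_no_move/2 are omitted here:

% Illustrative only: plausible background facts for a 3x3 board.
row(r1). row(r2). row(r3).
col(c1). col(c2). col(c3).

opponent(player_x, player_o).
opponent(player_o, player_x).

owns(player_x, x).
owns(player_o, o).
empty(none).

% forward(Player, FromRow, ToRow): one row toward the opposing side,
% consistent with player_x starting on the first row of the learned
% initial board.
forward(player_x, r1, r2). forward(player_x, r2, r3).
forward(player_o, r3, r2). forward(player_o, r2, r1).

% sideways(Col, AdjacentCol): horizontally adjacent columns.
sideways(c1, c2). sideways(c2, c1).
sideways(c2, c3). sideways(c3, c2).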


Future work
- complete the Lincoln Log task and move on to other assembly toys
- expand upon the current game learning and scale up to games of higher complexity, e.g. checkers
- learn the mapping from world state to game state
- learn Lincoln Log and other assembly-toy grammars
- integrate more sensors, e.g. a laser pointer and an ultrasonic range finder
- stochastic ILP for fault tolerance
- a custom ILP system with better heuristics for learning games
