1. Introduction

1. Introduction

Initial approach to create a controller for TORCS by learning how another controller

First, each kind of controller is imitated separately, then a mix of data is used to

Human players realize they are not playing vs. another human – and finds a way to beat the NPC (non-player character).

Create opponents as intelligent as a human player.

Create competitive NPCs that imitates the human behavior.

In all the experiments the controller created is a feed-forward ANN (Artificial Neural Networks) that was trained with data generated by the controllers.

1. Introduction

Wide area of researching is to create computational intelligence in games with ANN.

An approaches to adapt the AI of the game to the player

One researcher clone the behavior of RoboCup player using case base reasoning (solving new problems based on the solutions of similar past problems)

Other researcher program robosoccer agents by modelling human behaviors with successful results.

1. Introduction

Very realistic simulator that has a sophisticated physic engine that takes into account many aspects of the racing such as fuel consumption, collisions or traction.

Info. Provided:

1. Introduction

In the experiments we use the data obtained from three different controllers:

The information that the human gets from watching the game monitor is much richer.

Created my Matt Simmerson. This controller was the winner of the WCCI 2008 simulated car racing competition.

The OUTPUTs of the ANN were:

The idea of creating another controller was due to the human controller sometimes make mistakes and the Simmerson’s controller does not perform one complete lap in all the tracks and sometimes gets out from the track.

To calculate the values for the acceleration and the brake, we calculate the speed the car should have (estimated speed).

The acceleration and brake values are proportional to the absolute value of the difference of the actual speed and the estimated speed:

The steering value calculation:

Finally the gear is calculated by:

1. Introduction

For the goal of learning the behavior of the controller we have used an ANN.

Inputs:

For all the experiments the ANN has 3 hidden layers of 28 neurons each one, were trained during 1000 cycles and the learning rate starts in 0.9 and finished in 0.0001.

That have been obtained per each controller:

1. Introduction

For the “controllers learning by imitation” we used the data of all tracks to train the ANN of the controller and then test it in each track.

The time obtained by the controllers described before each track

The results of the learnt controllers for all the 3-controllers.

A controller created with the data of human, Simmersons and handcoded controllers was not created because they did not get good result with mixed configurations, as shown in the last 3 tables.

1. Introduction

It is very complicated to learn the human behavior in a video game:

The gear problem: the gear change has not been learned, despite it is probably the easiest output to learn. Maybe it is because of the high amount of data and due to the gear is also an input of the ANN.

1. Introduction

Pre-process of the data before use it to train the neural network. Two ideas:

Train the controllers with some data of one controller.

If we want to imitate the human behavior:

The ANN:

Dostları ilə paylaş:

1. Introduction

1. Introduction

1. Introduction

2. Related work

3. TORCS competition

4. Controllers

5. Controller learning by imitation

6. Results

7. Conclusions

8. Future works

Initial approach to create a controller for TORCS by learning how another controller

Initial approach to create a controller for TORCS by learning how another controller

or humans play the game.

The data obtained from 3 controllers.

One human player and two controllers:

First, each kind of controller is imitated separately, then a mix of data is used to

First, each kind of controller is imitated separately, then a mix of data is used to

create new controllers.

The imitation is performed by means of training a feed forward neural network with the data, using the backpropagation algorithm for learning.

Human players realize they are not playing vs. another human – and finds a way to beat the NPC (non-player character).

Human players realize they are not playing vs. another human – and finds a way to beat the NPC (non-player character).

NPC sometimes cheats for winning humans.

Another option is to play in Internet against other human players

With a lot of cheats or playing versus experienced human make you lose in every game – boring!

Create opponents as intelligent as a human player.

Create opponents as intelligent as a human player.

The AI must be able to adapt its behavior depending on the opponent (play in the same level of the human)

In this way the AI will provide a better entertainment for the player!

Create competitive NPCs that imitates the human behavior.

Create competitive NPCs that imitates the human behavior.

The controller can play as well as its opponent – in the same level.

NPC can adapt its adapt its behavior when the human improve his/her player skills to remain competitive.

Realistic game where human plays versus one or more NPC.

There is option to compare the results with other researchers.

Allows to analyze behaviors that take place in a short period of time.

In all the experiments the controller created is a feed-forward ANN (Artificial Neural Networks) that was trained with data generated by the controllers.

In all the experiments the controller created is a feed-forward ANN (Artificial Neural Networks) that was trained with data generated by the controllers.

The learning algorithm for the ANN was backpropagation.

1. Introduction

1. Introduction

2. Related work

3. TORCS competition

4. Controllers

5. Controller learning by imitation

6. Results

7. Conclusions

8. Future works

Wide area of researching is to create computational intelligence in games with ANN.

Wide area of researching is to create computational intelligence in games with ANN.

NEAT – NeuroEvolution of Augmenting Topologies

NEAT is an effective method (algorithm) to create ANN - it alters both the weighting parameters and structures of networks.

It starts with a small population of random ANN (with only input & output layers) that evolves to the problem.

An approaches to adapt the AI of the game to the player

An approaches to adapt the AI of the game to the player

Examples:

Rapidly adaptive game AI – method that applies continuously small adaptations to the AI based on observations and evaluation of the user actions.

Dynamic Scripting – based on a set of rules that are used for the game, whose weights to select one or another rule are modified through a machine learning algorithm.

One researcher clone the behavior of RoboCup player using case base reasoning (solving new problems based on the solutions of similar past problems)

One researcher clone the behavior of RoboCup player using case base reasoning (solving new problems based on the solutions of similar past problems)

Other researcher program robosoccer agents by modelling human behaviors with successful results.

Other researcher program robosoccer agents by modelling human behaviors with successful results.

1. Introduction

1. Introduction

2. Related work

3. TORCS competition

4. Controllers

5. Controller learning by imitation

6. Results

7. Conclusions

8. Future works

Very realistic simulator that has a sophisticated physic engine that takes into account many aspects of the racing such as fuel consumption, collisions or traction.

Very realistic simulator that has a sophisticated physic engine that takes into account many aspects of the racing such as fuel consumption, collisions or traction.

Provides a lot of tracks, cars with different feature

TORCS is open software and that allows the researchers to make modifications to the game and adapt it to their requirements.

Info. Provided:

Info. Provided:

The lap (current lap time, best lap time, distance raced, race position)

The car status (damage, fuel, actual gear, speed, lateral speed and R.P.M)

Distanse between the car and the track edges

More..