Feb 21 - March 7 Journal

Genetic Algorithm:
After briefly looking into SARSA learning, I decided to switch and explore genetic algorithms after receiving a research paper Connie sent me. In this paper, researchers who were teaching an AI to play dots and boxes explained that they had used a genetic algorithm to create their neural network. As a result, I looked into genetic algorithms and was quite excited. This was because I could really connect with the algorithm as it was based on biology that I have studied. The close comparison made this research more tangible to me and I felt that I could relate more to it. The algorithm is explained in my Concept Map video below but it is basically an algorithm that simulates natural selection to choose the most "fit" neural network. 


Caltech Visit:
We visited Caltech on Thursday though we were short on people. This was because Edmond and Will were at a Robotics competition and Puja was stuck in Chemistry. Nevertheless, Robert, Connie, and I talked with the grad students about which algorithm would be best for the project as the game had already been coded. Robert and Connie focused on Q learning while I talked about the genetic algorithm and showed them the paper. In the end, they decided that Q learning would be better and that it would be better if we shrunk the game space to a 4x4 dots and boxes game. Mr. Lee brought up Will's Monte Carl search tree idea but they thought it would be too complicated.


Tic Tac Toe:
As a tutorial for our applying of Q learning and supervised learning (backup) to dots and boxes, we were tasked with looking into tic tac toe first. This was because tic tac toe was simpler and likely had more data available. I looked around and found for the group three datasets from UVA with tic tac toe game states. However, we were unsure about some values within those datasets and also how to convert this dataset into input. Today, Will showed us a tic tac toe game that ran on the Monte Carlo search tree (depicted below) after finding code for it. It seemed to not be overly complicated and was also quite interested in it as it is similar to the genetic algorithm.



Next:
On Monday, we were still not very sure as to what we would be doing next. After filling in Puja, Edmond, and Will though, I am looking forward to exploring Q learning with Robert and Connie. We found a tutorial for Q learning that involved the Fifa soccer game and I am currently working on part of that tutorial. The tutorial was split amongst us three, but I'm a bit unsure as to converting the Fifa aspects of the code to dots and boxes. I am also looking into a bit on how to save game states though, I do feel that my coding skills need some more work for editing this code. 


Comments

Popular posts from this blog

April 10 Technical Journal

March 13 Technical Journal

Start of School - Sep 12 Journal