is_assignments/a1/report/solution.tex

\documentclass[A4]{article}
\usepackage{amsmath}
\usepackage{graphicx}
\usepackage{fancyvrb}

\begin{document}
\title{%
Seminar Assignment 1\\
\small Intelligent Systems - FRI}
\author{Gasper Spagnolo}
\maketitle \section{Introduction}
In the first seminar assignment, your goal is to use genetic algorithms to find a path out of a maze,
represented as a vector of strings, where $\#$ characters represent walls, $.$ represent empty spaces, and
S and E represent the starting and ending points, as in a given example below:
\begin{center}
\begin{BVerbatim}
	maze = c("####E######",
		"##...#.####",
		"#..#.#.####",
		"#.##...####",
		"#.##.#..S##",
		"###########")
\end{BVerbatim}
\end{center}
You can move through the maze in four directions, left, right, up, and down. In the example above,
the shortest path from the starting position S to the exit E is composed of the following moves: left,
left, up, left, left, up, up, up. In your solution, this should be represented as a string ”LLULLUUU”.
Your task is to create a function that will be able to find path as short as possible out of any maze
represented in such a way

\section{Solution}
\subsection{Task 1}
I decided to write this assignment in python using the pygad library becouse I am more familiar with this programming language. 
\subsubsection{Task description}
Create a function that reads the 2D representation of a maze and returns the shortest path found by
a genetic algorithm. To do this, you will need to:
\begin{itemize}
	\item Read the map into a suitable format (for example, a matrix).
	\item choose a suitable representation of your solutions (the path). Hint: you don’t need to use strings
when working with the genetic algorithm. You can use numeric or binary representations for the
GA function and then convert the result to a string as the final result.
	\item Define the fitness function. Make sure to penalise paths through walls - those are invalid solutions
	\item Run the genetic algorithm with suitable settings.
\end{itemize}

\subsubsection{Read the map into a suitable format}
I decided to read all the maps provided in the assignment into a list of lists. Each list represents a
maze and each element of the list represents a row of the maze.

\subsubsection{Choose a suitable representation of your solutions}
I decided to use a binary representation of the solution same size as an original maze. Each bit represents a move. 0 means that the agent did
not visit the cell and 1 means that the agent visited the cell. So if maze is of size N x M, the solution will be of size N x M. But there is no
such thing as N-dimensional array that GA accepts. So I reshaped the matrix into a vector of size N * M and worked with that kind of solution.

\subsubsection{Define the fitness function}
This part was the most difficult for me. I maybe overcomplicated that part but at least it yields good results. 
Before runing the algorithm I have decided to construct a punish matrix, which is a matrix of the same size as the maze. Each cell in the punish matrix is
evaluated before the algorithm starts. The evaluation is based on the position of walls and valid paths. So if there is a wall in the cell, the fitness value in that cell is set to some low scalar. 
If there is a valid move then the fitness value in that cell is high. So everytime the fitness function is called, the matrix product will be executed and some initiall fitness value will be computed asfollows:
\begin{center}
\begin{BVerbatim}
    fitness = np.sum(path * maze.punish_matrix.reshape(-1))
\end{BVerbatim}
\end{center}

But though experimentation I found that this approach was not good enough so I modified the function by adding punsihment if the agent did not start at the starting position and
if the agent did not end at the ending position. Still the results were not good enough so I decided to check if there is a valid path from the starting position to the ending position. 
If there is no valid path then I would punish the agent otherwise I would give him some reward. This approach yielded better results. 
But still I was not satisfied with the results so I decided to add some more punishes and rewards:
\begin{itemize}
	\item Add a reward if agent finds a shorter path than the best path found so far.
	\item Update weights in punish matrix so that the agent will prefer to move on best path found so far.
	\item If the agent does not find any valid path until 80\% of the GA iterations then activate critical search phase. That means that the agent
		will be rewarded if he finds \textbf{any} path from start to end, even if it maybe isn't the correct one. This way the weights are updated
		so that it converges to the correct path.
\end{itemize}

\subsubsection{Run the genetic algorithm with suitable settings}
I used the following settings wen running the algorithm:
\begin{small}
\begin{itemize}
	\item \begin{verbatim}number_of_genes = N * M \end{verbatim}(if the maze is of size N x M)
		So the solution is a vector of size N * M.
	\item \begin{verbatim} num_of_generations = 1000 \end{verbatim}
		How many generations will the algorithm run.
	\item \begin{verbatim} sol_per_pop = 2 \end{verbatim}
		Number of solutions in the population.
	\item \begin{verbatim} num_parents_mating = 2 \end{verbatim}
		Number of solutions to be selected as parents in the mating pool.
	\item \begin{verbatim} keep_parents = -1 \end{verbatim}
		If -1, this means all parents in the current population will be used in the next population
	\item \begin{verbatim} allow_duplicate_genes = True \end{verbatim}
		If True, then a solution/chromosome may have duplicate gene values.
	\item \begin{verbatim} mutation_type = "random" \end{verbatim}
		Mutation type is random.
	\item \begin{verbatim} crossover_type = "two_point" \end{verbatim}
		Applies the 2 points crossover. It selects the 2 points randomly at which crossover takes place between the pairs of parents
	\item \begin{verbatim} parent_selection = "tournament" \end{verbatim}
		Selects the parents using the tournament selection technique. Later, these parents will mate to produce the offspring.
	\item \begin{verbatim} gene_type = int \end{verbatim}
		We will be predicting integer values.	
	\item \begin{verbatim} gene_space = [0,1] \end{verbatim}
		Define binary subset to be gene space.
	\item \begin{verbatim} fitness_func = fitness_func \end{verbatim}
		Specify fitness function.
	\item \begin{verbatim} parallel_processing = 4 \end{verbatim}
		Spawn 4 additional threads to speed up computing.
\end{itemize}
\end{small}

\subsubsection{Results}

\begin{enumerate}
	\item On first maze I got a perfect score:
\textit{The shortest path is [(3, 1), (2, 1), (2, 2), (1, 2), (0, 2)]}
	
	\begin{figure}[h]
		\centering
		\includegraphics[width=1cm]{./images/task_1_maze_1.png}
		\caption{Solution to the first maze}
		\label{image:task_1_maze_1}
	\end{figure}
	\item Same for the second one:
\textit{The shortest path is [(4, 5), (4, 4), (4, 3), (4, 2), (3, 2), (2, 2), (2, 3), (2, 4), (2, 5), (1, 5), (0, 5)]}
	
	
	\begin{figure}[h]
		\centering
		\includegraphics[width=2cm]{./images/task_1_maze_2.png}
		\caption{Solution to the second maze}
		\label{image:task_1_maze_4}
	\end{figure}
	\item  The third one had many problems and it did not want to converge to propper soluition.
	\item  The fourth one also found the solution pretty quickly.
\textit{The shortest path is [(5, 5), (4, 5), (3, 5), (3, 6), (3, 7), (3, 8), (2, 8), (1, 8), (1, 7), (1, 6), (1, 5), (0, 5)]}
\end{enumerate}
	
	
	\begin{figure}[h]
		\centering
		\includegraphics[width=3cm]{./images/task_1_maze_4.png}
		\caption{Solution to the fourth maze}
		\label{image:task_1_maze_2}
	\end{figure}

Other mazes found also found some solutions, but they were not optimal. Or they were trying to go through a wall becouse the critical section was activated. I think that the problem is that the mutation and crossover operators are not good enough.

	\begin{figure}[h]
		\centering
		\includegraphics[width=4cm]{./images/task_1_broken_solution.png}
		\caption{Example of solution using the critical section}
		\label{image:task_1_broken_solution.png}
	\end{figure}


So I will try to improve them in the following sections.


\end{document}