is_assignments/a1/report/solution.tex

\documentclass[A4]{article}
\usepackage{amsmath}
\usepackage{graphicx}
\usepackage{fancyvrb}

\begin{document}
\title{%
Seminar Assignment 1\\
\small Intelligent Systems - FRI}
\author{Gasper Spagnolo}
\maketitle
\pagebreak
\section{Introduction}
In the first seminar assignment, your goal is to use genetic algorithms to find a path out of a maze,
represented as a vector of strings, where $\#$ characters represent walls, $.$ represent empty spaces, and
S and E represent the starting and ending points, as in a given example below:
\begin{center}
\begin{BVerbatim}
	maze = c("####E######",
		"##...#.####",
		"#..#.#.####",
		"#.##...####",
		"#.##.#..S##",
		"###########")
\end{BVerbatim}
\end{center}
You can move through the maze in four directions, left, right, up, and down. In the example above,
the shortest path from the starting position S to the exit E is composed of the following moves: left,
left, up, left, left, up, up, up. In your solution, this should be represented as a string ”LLULLUUU”.
Your task is to create a function that will be able to find path as short as possible out of any maze
represented in such a way

\section{Solution}
\subsection{Task 1}
I decided to write this assignment in python using the pygad library becouse I am more familiar with this programming language.
\subsubsection{Task description}
Create a function that reads the 2D representation of a maze and returns the shortest path found by
a genetic algorithm. To do this, you will need to:
\begin{itemize}
	\item Read the map into a suitable format (for example, a matrix).
	\item choose a suitable representation of your solutions (the path). Hint: you don’t need to use strings
when working with the genetic algorithm. You can use numeric or binary representations for the
GA function and then convert the result to a string as the final result.
	\item Define the fitness function. Make sure to penalise paths through walls - those are invalid solutions
	\item Run the genetic algorithm with suitable settings.
\end{itemize}

\subsubsection{Read the map into a suitable format}
I decided to read all the maps provided in the assignment into a list of lists. Each list represents a
maze and each element of the list represents a row of the maze.

\subsubsection{Choose a suitable representation of your solutions}
I decided to use a binary representation of the solution same size as an original maze. Each bit represents a move. 0 means that the agent did
not visit the cell and 1 means that the agent visited the cell. So if maze is of size N x M, the solution will be of size N x M. But there is no
such thing as N-dimensional array that GA accepts. So I reshaped the matrix into a vector of size N * M and worked with that kind of solution.

\subsubsection{Define the fitness function}
This part was the most difficult for me. I maybe overcomplicated that part but at least it yields good results.
Before runing the algorithm I have decided to construct a punish matrix, which is a matrix of the same size as the maze. Each cell in the punish matrix is
evaluated before the algorithm starts. The evaluation is based on the position of walls and valid paths. So if there is a wall in the cell, the fitness value in that cell is set to some low scalar.
If there is a valid move then the fitness value in that cell is high. So everytime the fitness function is called, the matrix product will be executed and some initiall fitness value will be computed asfollows:
\begin{center}
\begin{BVerbatim}
    fitness = np.sum(path * maze.punish_matrix.reshape(-1))
\end{BVerbatim}
\end{center}

But though experimentation I found that this approach was not good enough so I modified the function by adding punsihment if the agent did not start at the starting position and
if the agent did not end at the ending position. Still the results were not good enough so I decided to check if there is a valid path from the starting position to the ending position.
If there is no valid path then I would punish the agent otherwise I would give him some reward. This approach yielded better results.
But still I was not satisfied with the results so I decided to add some more punishes and rewards:
\begin{itemize}
	\item Add a reward if agent finds a shorter path than the best path found so far.
	\item Update weights in punish matrix so that the agent will prefer to move on best path found so far.
	\item If the agent does not find any valid path until 80\% of the GA iterations then activate critical search phase. That means that the agent
		will be rewarded if he finds \textbf{any} path from start to end, even if it maybe isn't the correct one. This way the weights are updated
		so that it converges to the correct path.
\end{itemize}

The critical section evaluation in code is done as follows:
\begin{small}
\begin{verbatim}
    def walk_through_maze(self, solution_matrix, critical_situation):
        queue = [[self.start_pos]]

        def add_to_queue(full_path, x, y):
            if (x,y) not in full_path:
		full_path = full_path.copy()
                full_path.append((x, y))
                queue.append(full_path)

        while queue != []:
            full_path = queue.pop()
            x, y = full_path[-1]
            if(self.maze[x][y] == 'E'):
                return full_path
            if x + 1 < len(self.maze) :
                if solution_matrix[x+1, y] == 1 and
			(critical_situation or (self.maze[x+1][y] == "." or self.maze[x+1][y] == "E")):
                    add_to_queue(full_path, x+1, y)
            if x - 1 >= 0:
                if solution_matrix[x-1, y] == 1 and
			(critical_situation or (self.maze[x-1][y] == "." or self.maze[x-1][y] == "E")):
                    add_to_queue(full_path, x-1, y)
            if y + 1 < len(self.maze) :
                if solution_matrix[x, y+1] == 1 and
			(critical_situation or(self.maze[x][y+1] == "." or self.maze[x][y+1] == "E")):
                    add_to_queue(full_path, x, y+1)
            if y - 1 >= 0:
                if solution_matrix[x, y-1] == 1 and
			(critical_situation or (self.maze[x][y-1] == "." or self.maze[x][y-1] == "E")):
                    add_to_queue(full_path, x, y-1)
        return []
\end{verbatim}
\end{small}


\subsubsection{Run the genetic algorithm with suitable settings}
I used the following settings wen running the algorithm:
\begin{small}
\begin{itemize}
	\item \begin{verbatim}number_of_genes = N * M \end{verbatim}(if the maze is of size N x M)
		So the solution is a vector of size N * M.
	\item \begin{verbatim} num_of_generations = 1000 \end{verbatim}
		How many generations will the algorithm run.
	\item \begin{verbatim} sol_per_pop = 20 \end{verbatim}
		Number of possible solutions in the population.
	\item \begin{verbatim} num_parents_mating = 15 \end{verbatim}
		Number of solutions to be selected as parents in the mating pool.
	\item \begin{verbatim} keep_parents = -1 \end{verbatim}
		If -1, this means all parents in the current population will be used in the next population
	\item \begin{verbatim} allow_duplicate_genes = True \end{verbatim}
		If True, then a solution/chromosome may have duplicate gene values.
	\item \begin{verbatim} mutation_type = "random" \end{verbatim}
		Mutation type is random.
	\item \begin{verbatim} crossover_type = "two_point" \end{verbatim}
		Applies the 2 points crossover. It selects the 2 points randomly at which crossover takes place between the pairs of parents
	\item \begin{verbatim} parent_selection = "tournament" \end{verbatim}
		Selects the parents using the tournament selection technique. Later, these parents will mate to produce the offspring.
	\item \begin{verbatim} gene_type = int \end{verbatim}
		We will be predicting integer values.
	\item \begin{verbatim} gene_space = [0,1] \end{verbatim}
		Define binary subset to be gene space.
	\item \begin{verbatim} fitness_func = fitness_func \end{verbatim}
		Specify fitness function.
	\item \begin{verbatim} parallel_processing = 4 \end{verbatim}
		Spawn 4 additional threads to speed up computing.
\end{itemize}
\end{small}

\subsubsection{Results}

\begin{enumerate}
	\item On first maze I got a perfect score:
\textit{The shortest path is [(3, 1), (2, 1), (2, 2), (1, 2), (0, 2)]}

	\begin{figure}[h]
		\centering
		\includegraphics[width=1cm]{./images/task_1_maze_1.png}
		\caption{Solution to the first maze}
		\label{image:task_1_maze_1}
	\end{figure}
	\item Same for the second one:
\textit{The shortest path is [(4, 5), (4, 4), (4, 3), (4, 2), (3, 2), (2, 2), (2, 3), (2, 4), (2, 5), (1, 5), (0, 5)]}


	\begin{figure}[h]
		\centering
		\includegraphics[width=2cm]{./images/task_1_maze_2.png}
		\caption{Solution to the second maze}
		\label{image:task_1_maze_4}
	\end{figure}
	\item  The third one had many problems and it did not want to converge to propper soluition.
	\item  The fourth one also found the solution pretty quickly.
\textit{The shortest path is [(5, 5), (4, 5), (3, 5), (3, 6), (3, 7), (3, 8), (2, 8), (1, 8), (1, 7), (1, 6), (1, 5), (0, 5)]}
\end{enumerate}


	\begin{figure}[h]
		\centering
		\includegraphics[width=3cm]{./images/task_1_maze_4.png}
		\caption{Solution to the fourth maze}
		\label{image:task_1_maze_2}
	\end{figure}

Other mazes found also found some solutions, but they were not optimal. Or they were trying to go through a wall becouse the critical section was activated. I think that the problem is that the mutation and crossover operators are not good enough.

	\begin{figure}[h]
		\centering
		\includegraphics[width=3cm]{./images/task_1_broken_solution.png}
		\caption{Example of solution using the critical section}
		\label{image:task_1_broken_solution.png}
	\end{figure}


So I will try to improve them in the following sections.

\pagebreak
\subsection{Task 2}

\subsubsection{Task description}
The default mutation and crossover functions in R are not well-suited for this task because they do not
necessarily return valid paths (for example, the mutation might introduce a move that goes through a
wall). To fix this, modify the mutation and selection functions so that they take the walls into account.
Additionally, try to create a starting population in a way that takes walls into account.
You can base your crossover and mutation functions on existing GA library functions.
Modify at least one crossover or mutation function in a way that makes them more suitable for this task.

\subsubsection{Mutation function}
I initially used the random mutation type provided by library pygad. It was not good enough for this task, because it was not taking into account the walls.
So I redefined mutation function in such way, that we add random bits where there is no wall. If there is a wall, we set random number of bits to 0.

The function is defined as follows:
\begin{small}
	\begin{center}
		\begin{verbatim}
def on_mutation(generations, ga_instance):
    maze = mazes[maze_ix]

    # Firtly find the instances where there are no walls
    no_wall_instances = np.where(maze.mutation_matrix.reshape(-1) == 1)[0]
    wall_instances = np.where(maze.mutation_matrix.reshape(-1) == 0)[0]

    # Loop through the population
    for i in range(len(generations)):
        #  select random number of the instances where there are walls
        random_false_instances = np.random.choice(wall_instances,
		size=int(len(wall_instances)* random.uniform(0.01, 1.0)), replace=False)
        # Then randomly select random number of the instances where there are no walls
        random_true_instances = np.random.choice(no_wall_instances,
		size=int(len(no_wall_instances)* random.uniform(0.01, 1.0)), replace=False)
        # Then apply those values to generation
        generations[i][random_true_instances] = 1
        generations[i][random_false_instances] = 0

    return generations

		\end{verbatim}
	\end{center}
\end{small}
I also generated the initial population using the same function, but firstly I generated some random bitarrays and then applied the same function to them.
\begin{small}
	\begin{center}
		\begin{verbatim}
        initial_population = np.random.choice([0, 1],
		size=(self.punish_matrix.size, self.initial_population_size))
		\end{verbatim}
	\end{center}
\end{small}

The results I got using this approach were suprising! I got a perfect score on all mazes. I think that the reason for this is that the mutation function is not only taking into account the walls, but also the previous solution. The algorithm converges really fast now. In 5 generations we get a shortest path! I even generated a 10000 x 10000 maze and it solved it!


\subsection{Task 3}

\subsubsection{Task description}
In Task 3, mazes also contain treasure (marked with T). For example:

\begin{center}
\begin{BVerbatim}
	maze2 = c("####E######",
	"##...#.####",
	"#..#.#.####",
	"#.##...####",
	"#T##T#..S##",
	"###########")
\end{BVerbatim}
\end{center}
Your task is to modify your approach so that the solution returns as short a path as possible that also
collects all the treasure.

\subsection{Task 4}

\subsubsection{Task description}
Present a report that describes your approach, shows highlights of your code, and presents the results.
The results have to include performance comparisons between different settings of the genetic algorithm
(different mutation, crossover and selection functions, different starting populations and so on). Make
sure to evaluate your approach on different mazes, the one in the instructions is just an example. The
mazes.r file on ucilnica contains several additional examples of various sizes and complexities. Find
the largest size of a maze that can still be solved with your approach - feel free to create your own
mazes if the example mazes are too small. Produce a graph to show how the maze size affects the
running time of the genetic algorithm
\end{document}