With the advancement of remote sensing equipment, we have access to high frequency high resolution multi-spectral images of the world. This is a key component in up-to-date precision monitoring for changes in land use and land cover. But such feats can only be achieved if we can process the acquired data fast enough. This is why we require appropriate machine learning and computer vision algorithms. Nowadays these fields are dominated by deep learning and in our work we asses the use of deep neural networks for crop classification. An important component in this scenario is the use of temporal information. We experiment with two types of architectures that are capable of using temporal data. Despite the availability of satellite imagery, the constraints on high quality labeled data make training machine learning methods difficult. For the purpose of experiments in this work, we have prepared a dataset of crops in Slovenia during the year 2017. The access to the data was granted within the EU project Perceptive Sentinel. Similar studies were limited to smaller regions and to the best of our knowledge this will be the first one done on a whole country. We first evaluate the importance of the time series frequency and compare the importance of spectral information. The chosen method is then further improved with the use of spatial information, which enables us to include the context around the observed cell. The method achieves better average performance on the selected domain, but due to the high fragmentation of the fields in the dataset the improvement is not as large as one would have expected, however, the resulting regions are more homogeneous.