In the advertising industry the understanding of different advertising campaigns' parameters is key for workflow optimization. One of these parameters is the number of master designs used to prepare image based adverts, which is a crucial for determining the complexity of a campaign. Adverts which originate from the same master design typically use similar typographies, graphical elements and compositions.
In the experimental part of this thesis, we develop two predictive pipelines for the task of predicting the number of master designs in sets of image based adverts.
Both pipelines use convolutional neural networks for feature extraction and clustering algorithms. The main difference between the two is in the way that the similarity between individual adverts is computed.
Both developed models achieve better results than our baseline approach which, based on the distribution of data, randomly predicts the number of master designs. Our predictive models achieve a 5.2% and 1.2% classification accuracy improvement respectively over the baseline when tested on a sample of 50 campaigns. Our second model, which is based on the similarity of regions between adverts, achieves qualitatively better results than our first model, which is based on simple comparisons of the adverts' features.
|