In this thesis we present methods for solving problems of high-dimensional biological data imputation collected by sequencing individual cells. We try to assign values to the missing data, replacing them with estimations. We tried several imputation methods. We have implemented imputation methods as a module in programming language Python. Then we tested them using synthetic data and real biological data. The evaluation showed that all methods achieve good results. The pCMF method performed the best.
|