There is a close connection between machine learning and compression. A system that predicts the posterior probabilities of a sequence given its entire history can be used for optimal data compression (by applying arithmetic coding to the output distribution).

Figure: illustration of linear regression on a data set (regression analysis).
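The link between prediction and compression can be made concrete with a small sketch. The code below is only an illustration under stated assumptions, not a production coder: `predict` is a hypothetical stand-in for a learned model (Laplace-smoothed symbol counts over the history seen so far), the alphabet is assumed to be known to both sides, the message length is assumed to be transmitted separately, and exact `Fraction` arithmetic is used for clarity where a practical arithmetic coder would use fixed-precision integers.

```python
from fractions import Fraction
from math import ceil, log2


def predict(history, alphabet):
    """Hypothetical predictor: P(next symbol | history) from Laplace-smoothed counts."""
    total = len(history) + len(alphabet)
    return {s: Fraction(history.count(s) + 1, total) for s in alphabet}


def encode(message, alphabet):
    """Arithmetic-code `message` into a bit string using the predictor's distribution."""
    low, width = Fraction(0), Fraction(1)
    for i, sym in enumerate(message):
        probs = predict(message[:i], alphabet)
        # Narrow [low, low + width) to the sub-interval assigned to `sym`.
        for s in alphabet:
            span = width * probs[s]
            if s == sym:
                width = span
                break
            low += span
    # Emit the shortest binary fraction k / 2**n that lies inside the final interval.
    n = 0
    while True:
        k = ceil(low * 2**n)
        if Fraction(k, 2**n) < low + width:
            return format(k, "b").zfill(n) if n > 0 else ""
        n += 1


def decode(bits, length, alphabet):
    """Invert `encode`; assumes the message length is known to the decoder."""
    value = Fraction(int(bits, 2), 2 ** len(bits)) if bits else Fraction(0)
    low, width = Fraction(0), Fraction(1)
    out = []
    for _ in range(length):
        probs = predict(out, alphabet)
        # Pick the symbol whose sub-interval contains the encoded value.
        for s in alphabet:
            span = width * probs[s]
            if value < low + span:
                out.append(s)
                width = span
                break
            low += span
    return "".join(out)


def ideal_bits(message, alphabet):
    """Information content of the message under the predictor: -sum log2 p."""
    return sum(
        -log2(float(predict(message[:i], alphabet)[sym]))
        for i, sym in enumerate(message)
    )


message, alphabet = "abracadabra", "abcdr"
bits = encode(message, alphabet)
print(len(bits), "bits emitted; ideal is", round(ideal_bits(message, alphabet), 2))
print(decode(bits, len(message), alphabet) == message)
```

The number of emitted bits stays within about one bit of the message's information content under the predictor, so a model that assigns higher probability to what actually occurs yields a shorter encoding.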