Session

Categorical Data Encoding for Regression Models

The session explores the usage and comparison of 5 different text encoding techniques: one-hot, target, hashing, binary and entity embeddings. The methods are needed in cases when text data must be represented numerically before used in a predictive model or neural network. A dataset from a support system is used where the aim is predicting case duration.

Hristo Hristov

MCSE Data Management and Analytics, Project Engineer at Atlas Copco Airpower

Antwerpen, Belgium

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top