Session
Categorical Data Encoding for Regression Models
The session explores the usage and comparison of 5 different text encoding techniques: one-hot, target, hashing, binary and entity embeddings. The methods are needed in cases when text data must be represented numerically before used in a predictive model or neural network. A dataset from a support system is used where the aim is predicting case duration.
Hristo Hristov
MCSE Data Management and Analytics, Project Engineer at Atlas Copco Airpower
Antwerpen, Belgium
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top