Quantcast
Channel: Active questions tagged python - Stack Overflow
Viewing all articles
Browse latest Browse all 14011

How to run clustering with categorical variables

$
0
0

I'm trying to run clustering only with categorical variables. Since Kmeans is applicable only for Numeric data, are there any clustering techniques available?

I have 30 variables like zipcode, age group, hobbies, preferred channel, marital status, credit risk (low, medium, high), education status, etc. If I convert each of these variable in to dummies and run kmeans, I would be having 90 columns (30*3 - assuming each variable has 4 factors). Is this correct?


Viewing all articles
Browse latest Browse all 14011

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>