Channel: SCN : All Content - All Communities

Limit in number of Parameters in PAL Clustering algorithms


Hi experts,

 

We are currently working on a predictive analytics use case in which we want to cluster persons based on the attributes assigned to them. The basic data we have looks schematically like this:

 

ID  Attribute Value
1   A
1   B
1   C
1   E
2   A
2   B
2   C
3   D
3   E
4   A

 

In order to perform the clustering, we transform this representation into a matrix of boolean entries (0/1), with one row per ID and one column per attribute:

 

ID  Attribute A  Attribute B  Attribute C  Attribute D  Attribute E
1   1            1            1            0            1
2   1            1            1            0            0
3   0            0            0            1            1
4   1            0            0            0            0

Let's call this table the input table.
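For illustration, the transformation from the ID/attribute list into the 0/1 input table can be sketched in plain Python. This is only a minimal sketch of the pivot step, not the way it is done inside HANA; the function name `to_indicator_matrix` and the variable names are made up for this example, and the sample data is the schematic data from above.

```python
from collections import defaultdict


def to_indicator_matrix(pairs):
    """Pivot (id, attribute) pairs into a 0/1 matrix with one row per ID.

    Hypothetical helper for illustration only; not part of PAL or HANA.
    """
    # One column per distinct attribute, in a stable (sorted) order.
    attributes = sorted({attr for _, attr in pairs})

    # Collect the set of attributes assigned to each ID.
    owned = defaultdict(set)
    for person_id, attr in pairs:
        owned[person_id].add(attr)

    header = ["ID"] + [f"Attribute {a}" for a in attributes]
    rows = [
        [pid] + [1 if a in owned[pid] else 0 for a in attributes]
        for pid in sorted(owned)
    ]
    return header, rows


# Schematic data from the post.
pairs = [
    (1, "A"), (1, "B"), (1, "C"), (1, "E"),
    (2, "A"), (2, "B"), (2, "C"),
    (3, "D"), (3, "E"),
    (4, "A"),
]

header, rows = to_indicator_matrix(pairs)
print(header)
for row in rows:
    print(row)
```

Each distinct attribute becomes one column, so the width of the input table grows with the number of attributes, not with the number of persons.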

 

To call a PAL algorithm (such as hierarchical clustering), we have to persist this input table as a column table that we hand over to the PAL procedure. Here we think we face a limitation: according to the HANA guides, the number of columns in a table is limited to 1000. If we created the input table across all attributes, we would have more than 1000 attributes and hence more than 1000 columns, which would not be possible.
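To make the concern concrete, here is a small sketch that counts how many columns the wide input table would need, given the ID/attribute list, and compares that with the limit. The value 1000 is simply the figure we read in the guides and may differ by HANA revision; the names `HANA_MAX_COLUMNS`, `wide_table_column_count` and `fits_column_limit` are hypothetical and exist only in this example.

```python
# Limit as we understood it from the HANA guides; an assumption, please verify.
HANA_MAX_COLUMNS = 1000


def wide_table_column_count(pairs):
    """Columns needed by the 0/1 input table: one ID column plus one per distinct attribute."""
    return 1 + len({attr for _, attr in pairs})


def fits_column_limit(pairs, limit=HANA_MAX_COLUMNS):
    """True if the wide input table would stay within the assumed column limit."""
    return wide_table_column_count(pairs) <= limit


# The schematic example: 5 attributes plus the ID column.
small = [(1, "A"), (1, "B"), (1, "C"), (1, "E"), (2, "A"),
         (2, "B"), (2, "C"), (3, "D"), (3, "E"), (4, "A")]

# A made-up case with 1000 distinct attributes, which needs 1001 columns.
large = [(1, f"ATTR_{i}") for i in range(1000)]

print(wide_table_column_count(small), fits_column_limit(small))
print(wide_table_column_count(large), fits_column_limit(large))
```

Note that the ID column counts as well, so under this assumption the table can hold at most 999 attribute columns.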

 

Now my question(s):

  • Am I completely wrong about this limitation, or did I misunderstand something in the PAL guide?
  • Has anybody faced the same restriction and, if so, how did you overcome it?


Any hints are appreciated.


BR

Christian

