Hi experts,
When I want to combine the input data with the results of an algorithm, I usually join the two tables. For example, with DBSCAN clustering:
Input:
ID --- Value
1 --- 5
2 --- 10
3 --- 15
4 --- 10
5 --- 200
Output/Result:
ID --- Cluster
1 --- 1
2 --- 1
3 --- 1
4 --- 1
5 --- -1
I then join them on ID to get this result:
ID --- Cluster --- Value
1 --- 1 --- 5
2 --- 1 --- 10
3 --- 1 --- 15
4 --- 1 --- 10
5 --- -1 --- 200
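For illustration, a minimal sketch of the join described above, written in plain Python rather than in any particular database or clustering library. The function name `join_on_id` and the list-of-dicts layout are assumptions made for this example only; the data is copied from the tables above.

```python
# Input rows and algorithm output, copied from the example tables above.
input_rows = [
    {"ID": 1, "Value": 5},
    {"ID": 2, "Value": 10},
    {"ID": 3, "Value": 15},
    {"ID": 4, "Value": 10},
    {"ID": 5, "Value": 200},
]
result_rows = [
    {"ID": 1, "Cluster": 1},
    {"ID": 2, "Cluster": 1},
    {"ID": 3, "Cluster": 1},
    {"ID": 4, "Cluster": 1},
    {"ID": 5, "Cluster": -1},
]


def join_on_id(input_rows, result_rows):
    """Inner join of input and result on ID (hypothetical helper)."""
    # Build the lookup once so each input row is matched in constant time.
    cluster_by_id = {row["ID"]: row["Cluster"] for row in result_rows}
    return [
        {"ID": row["ID"], "Cluster": cluster_by_id[row["ID"]], "Value": row["Value"]}
        for row in input_rows
        if row["ID"] in cluster_by_id
    ]


joined = join_on_id(input_rows, result_rows)
for row in joined:
    print(row["ID"], "---", row["Cluster"], "---", row["Value"])
```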
Is there a better way to do this? Is it possible to have the algorithm return the input data together with its results, so that no join is needed? I am asking because the join takes time and is not straightforward when an ID has multiple rows (with different secondary or composite keys), since the algorithm uses only the ID and not the secondary key columns.
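To make the multi-row case concrete, here is a rough sketch in plain Python. It assumes the result contains exactly one cluster label per ID; the `SubKey` column, the helper name `attach_clusters`, and the sample values are invented for illustration and do not come from any real dataset or library.

```python
def attach_clusters(input_rows, result_rows, key="ID"):
    """Copy the cluster label onto every input row that shares the key."""
    # Assumption: the result has one row per key value.
    cluster_by_key = {row[key]: row["Cluster"] for row in result_rows}
    # Rows whose key is missing from the result get Cluster = None.
    return [{**row, "Cluster": cluster_by_key.get(row[key])} for row in input_rows]


# Made-up input in which ID 1 appears twice with different secondary keys.
rows = [
    {"ID": 1, "SubKey": "a", "Value": 5},
    {"ID": 1, "SubKey": "b", "Value": 7},
    {"ID": 5, "SubKey": "a", "Value": 200},
]
clusters = [{"ID": 1, "Cluster": 1}, {"ID": 5, "Cluster": -1}]

labelled = attach_clusters(rows, clusters)
for row in labelled:
    print(row)
```

Because the lookup is keyed on ID alone, every row with that ID receives the same label and the secondary key columns pass through unchanged. This is still a join, just expressed as a lookup.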
Thanks!