I have 5 rows like this:
userid, col
----------------
1, a
1, b
2, c
2, d
3, e
How would I write a query so the result looks like this?
userid, combined
1, a b
2, c d
3, e
In Hive you can use:
SELECT userid, collect_set(col) FROM your_table GROUP BY userid;
collect_set removes duplicates and returns an array. If you need to keep the duplicates, you can check this post:
COLLECT_SET() in Hive, keep duplicates?
In MySQL, use the GROUP_CONCAT aggregate function:
SELECT yt.userid,
GROUP_CONCAT(yt.col SEPARATOR ' ') AS combined
FROM YOUR_TABLE yt
GROUP BY yt.userid
The default separator is a comma (","), so you need to specify the SEPARATOR of a single space to get the output you desire.
If you want to ensure the order of the values in the GROUP_CONCAT, use:
SELECT yt.userid,
GROUP_CONCAT(yt.col ORDER BY yt.col SEPARATOR ' ') AS combined
FROM YOUR_TABLE yt
GROUP BY yt.userid
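If you want to try the idea without a MySQL server, here is a minimal sketch using Python's built-in sqlite3 module as a stand-in. Note the assumptions: SQLite is not MySQL, it takes the separator as a second argument to group_concat instead of the SEPARATOR keyword, and the table name your_table and the helper combine_per_user are made up for illustration. The order of values inside each group is not guaranteed here.

```python
import sqlite3


def combine_per_user(rows):
    """Return {userid: "col col ..."} using SQLite's group_concat.

    Hypothetical helper: SQLite stands in for MySQL here.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE your_table (userid INTEGER, col TEXT)")
        conn.executemany("INSERT INTO your_table VALUES (?, ?)", rows)
        # SQLite spells the separator as a second argument,
        # where MySQL would use SEPARATOR ' '.
        query = (
            "SELECT userid, group_concat(col, ' ') AS combined "
            "FROM your_table GROUP BY userid"
        )
        return dict(conn.execute(query).fetchall())
    finally:
        conn.close()


# The five rows from the question.
rows = [(1, "a"), (1, "b"), (2, "c"), (2, "d"), (3, "e")]
combined = combine_per_user(rows)
for userid in sorted(combined):
    print(userid, combined[userid])
```

Each userid ends up with a single space-separated string, which is the shape the question asks for.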
SELECT
userid,
concat_ws(" ", collect_set(col)) AS combined
FROM your_table
GROUP BY userid
MySQL, with duplicates:
select col1, group_concat(col2) from table1 group by col1
MySQL, without duplicates:
select col1, group_concat(distinct col2) from table1 group by col1
Hive, with duplicates:
select col1, collect_list(col2) from table1 group by col1
Hive, without duplicates:
select col1, collect_set(col2) from table1 group by col1
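To see the difference between keeping and dropping duplicates, here is a rough sketch that again uses Python's sqlite3 module as a stand-in (an assumption: this is SQLite, not MySQL or Hive, though its group_concat accepts DISTINCT in the same way as the MySQL queries above). The sample rows and the helper grouped are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table1 (col1 INTEGER, col2 TEXT)")
# Illustrative data: the value "a" appears twice for col1 = 1.
conn.executemany(
    "INSERT INTO table1 VALUES (?, ?)",
    [(1, "a"), (1, "a"), (1, "b"), (2, "c")],
)


def grouped(expr):
    """Run one aggregate expression per col1; return {col1: sorted values}."""
    sql = f"SELECT col1, {expr} FROM table1 GROUP BY col1"
    # group_concat joins with a comma by default, so split on it.
    return {key: sorted(value.split(",")) for key, value in conn.execute(sql)}


with_duplicates = grouped("group_concat(col2)")
without_duplicates = grouped("group_concat(DISTINCT col2)")
print(with_duplicates)
print(without_duplicates)
```

With duplicates, col1 = 1 keeps both copies of "a"; with DISTINCT it keeps only one.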