bash - R: excluding all duplicate rows (both of each pair) based on one column
I have a file (called example.txt) that looks like the following:
a b c d e f h c z b y b c t e f w o f
Based on column 2, I would like to identify the duplicate rows and exclude all of them (both rows of each pair), to obtain the following file:
h c
w o f
You can compute the values that occur only once and then select those rows, like this:
tab = table(df$v2)
vals = unlist(attr(tab, "dimnames"))[which(tab == 1)]
df[df$v2 %in% vals, ]
  v1 v2 v3
3 h c
7 w o f
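Since the question is also tagged bash, the same "count first, then keep the values seen once" idea can be sketched directly in the shell. This is only a minimal illustration, not the answer's method: it assumes whitespace-separated columns, and the sample data below is hypothetical (it is not the original example.txt). awk reads the file twice: the first pass counts each column 2 value, the second prints the rows whose value was counted exactly once.

```shell
# Hypothetical sample data (NOT the original example.txt): three
# whitespace-separated columns, with "b" and "e" repeated in column 2.
input=$(mktemp)
cat > "$input" <<'EOF'
a b c
d e f
h c g
z b y
t e f
w o f
EOF

# Pass 1 (NR == FNR holds only while reading the first file argument):
#   count how often each column-2 value occurs.
# Pass 2 (same file given again): print rows whose column-2 value
#   occurred exactly once, which drops both rows of every duplicate pair.
result=$(awk 'NR == FNR { count[$2]++; next } count[$2] == 1' "$input" "$input")

printf '%s\n' "$result"
rm -f "$input"
```

With this sample data the two surviving rows are "h c g" and "w o f". Passing the file name twice costs a second read, but it keeps the original row order and never needs the input to be sorted.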