I have a l开发者_开发知识库arge dataset with the lifespan of threads on an discussion board. I want a histogram that shows the distribution of lifespan, so I did this:
dall <- read.csv("lifespan.csv")
colnames(dall) <- c("thread.id", "seconds.alive", "start.time")
hist(dall$seconds.alive)
which generated this hard to read image:
My questions are a) is changing y-axis to a log-scale a good way to make it more readable? Apparently some people think is a bad idea to change y-axis to log.
b) how do I do that?
I would try using hist(log10(dall$seconds.alive))
instead.
Also try specifying breaks=100
or smaller/larger number:
hist(log10(dall$seconds.alive), breaks=100)
精彩评论