Entries tagged data-analysis | Hugonweb Annotated Link Bibliography

Powerful data analysis and composition with the UNIX-shell

https://doi.org/10.1007/978-3-030-19274-7_49

Highlights commands that can be used for command-line data analysis using UNIX pipes.

Including:

  • less
  • head
  • tail
  • cut
  • grep
  • sed
  • awk
  • sort
  • uniq
  • comm
  • join
  • tr
  • wc
  • diff
  • comm

I'd add seq for testing things out.

Idiomatic awk

https://backreference.org/2010/02/10/idiomatic-awk/index.html

I've wanted to learn awk for a long time. I can't decide if it's too cryptic to learn and maintain or would be useful.

I have this idea of using old-fashioned text processing Unix tools for data analysis. Awk would be helpful, since it is a bit like sed but can do math (among other things).

Another resource is the second edition of the book The AWK Programming Language.

Visidata

https://www.visidata.org

A terminal-based spreadsheet data exploration program. It leverages Python's data analysis ecosystem, in particular Pandas.