Skip to Main Content

Research Data Management

General Data Processing and Analysis Tools

R & RStudio
R is a free programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. RStudio is an Integrated Development Environment (IDE) for R. It includes a console, syntax-highlighting editor that supports direct code execution, as well as tools for plotting, history, debugging and workspace management.

Python & Anaconda
Python is a free and open source, interpreted, general purpose programming language. With an approachable syntax, Python is an easy language to learn for both beginners as well as experienced programmers. The language is also popular within the scientific community, with libraries written specifically to assist in the task of data analysis of large and complex scientific datasets. Anaconda is a distribution of the Python and R programming languages for scientific computing, that aims to simplify package management and deployment. The distribution includes data-science packages suitable for Windows, Linux, and macOS.

OpenRefine
OpenRefine is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data. OpenRefine always keeps your data private on your own computer until you want to share or collaborate.

Excel
Excel works great for simple data processing and statistical analysis. However, it's not very suitable for large scale data manipulation, advanced statistical analysis or large amounts of data.

Commonly Used Tools for Data Processing and Analysis

Name

Purpose

AntConc

Text analysis

ArcGIS (various products available)

Geospatial data (GIS)

Atlas.ti

Qualitative analysis

BaseX

XML processing

CARTO

Geospatial data (GIS)

Cytoscape

Network analysis

Gale Digital Scholar Lab

Text analysis

Gephi

Network analysis

Google Earth Pro

Geospatial data (GIS)

GRASS GIS

Geospatial data (GIS)

jamovi

Quantitative analysis

Lexos

Text analysis

MALLET

Text analysis

MATLAB

Quantitative analysis

MAXQDA

Qualitative analysis

Notepad++

Editor

NVivo

Qualitative analysis

OpenRefine

Data cleanup

Oxygen XML

XML processing

Palladio

Network analysis

PowerBI

Data visualization

Python (Anaconda)

Programming and scripting

QGIS

Geospatial data (GIS)

R (R Studio)

Programming and scripting

SAS

Quantitative analysis

Sci2 Tool

Network analysis

SPSS

Quantitative analysis

Stata

Quantitative analysis

Tableau

Data visualization

Taguette

Qualitative analysis

Voyant

Text analysis

Recommended Books for Data Processing and Analysis