Tools

Interactive browser-based tools for text analysis — no installation required

Great Court, The University of Queensland

LADAL tools are self-contained interactive tools that run entirely in your browser — no installation of R or any other software is required.

LADAL currently offers two types of tools: Jupyter Notebook tools and Shiny tools.


Jupyter Notebook Tools

What are Jupyter Notebook tools? These are interactive R notebooks that run in a JupyterLab environment in your browser — no installation required, and available to everyone worldwide.

Each tool can be launched on two platforms:
🌐 MyBinder — a community-run open platform. Free for everyone worldwide, no account needed. Startup typically takes 2–3 minutes.
🇦🇺 ARDC BinderHub — hosted by the Australian Research Data Commons. Requires an AAF or Tuakiri login (Australian/NZ universities). May offer faster startup for Australian users.

Text Preparation Notebooks

Notebook-based tools for cleaning and annotating text data before analysis.

🔍

Concordance Explorer

KWIC concordances — find any word or phrase in context

Search for words or phrases across your uploaded texts and display them in KWIC format. Results are sortable by left or right context and downloadable as Excel or CSV.

→ Concordancing Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
🧹

Text Cleaner

Remove or replace words, tags, URLs, and patterns

Remove or replace specific words, XML/HTML tags, URLs, and text patterns from your files using pre-built options or custom regular expressions.

→ String Processing Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
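As a rough idea of what pattern-based cleaning involves, the sketch below removes tags and URLs and then applies user-supplied regular expressions. It is written in Python purely for illustration and does not reproduce the Text Cleaner notebook. The patterns, the `clean_text` function, and the sample string are assumptions, and the tag pattern is a simple heuristic rather than a full XML/HTML parser.

```python
import re

# Assumed patterns for illustration; real markup can need a proper parser.
TAG = re.compile(r"<[^>]+>")                 # XML/HTML tags
URL = re.compile(r"https?://\S+|www\.\S+")   # web addresses
SPACES = re.compile(r"\s+")                  # runs of whitespace

def clean_text(text, custom_patterns=None):
    """Strip tags and URLs, then apply optional (pattern, replacement) pairs."""
    text = TAG.sub(" ", text)
    text = URL.sub(" ", text)
    for pattern, replacement in (custom_patterns or []):
        text = re.sub(pattern, replacement, text)
    # Collapse the whitespace left behind by the removals.
    return SPACES.sub(" ", text).strip()

raw = "<p>Visit https://example.org for <b>more</b> info!!!</p>"
cleaned = clean_text(raw, custom_patterns=[(r"!{2,}", "!")])
print(cleaned)
```

Tags are removed before URLs so that a URL directly followed by a closing tag is not merged with the tag text.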
🏷️

Part-of-Speech Tagger

POS tagging and dependency parsing in 65+ languages

Tokenise, lemmatise, and part-of-speech tag texts in more than 65 languages using the UDPipe toolkit. Results download as a tidy table.

→ Tagging and Parsing Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide

Text Analysis Notebooks

Notebook-based tools for linguistic and statistical analysis of text collections.

🔗

Collocation Analyser

Association measures showing which words attract each other

Calculate MI, t-score, log-likelihood, and other association measures to identify which words significantly collocate with a target word in your corpus.

→ Collocation Analysis Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
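To make two of the association measures concrete, here is a minimal Python sketch of MI and the t-score for adjacent word pairs. It assumes the common textbook definitions, with expected frequency E = f(node) × f(collocate) / N, MI = log2(O / E), and t = (O − E) / √O, and it ignores window-size adjustments. The notebook itself may compute these differently; the function name and the toy token list are illustrative only.

```python
import math
from collections import Counter

def bigram_association(tokens, node, collocate):
    """MI and t-score for `node` immediately followed by `collocate`.

    Assumption: expected frequency is f(node) * f(collocate) / N,
    with no adjustment for span or window size.
    """
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    observed = bigrams[(node, collocate)]
    expected = unigrams[node] * unigrams[collocate] / n if n else 0.0
    if observed == 0:
        # Both measures are undefined when the pair never occurs.
        return {"observed": 0, "expected": expected, "mi": None, "t_score": None}
    return {
        "observed": observed,
        "expected": expected,
        "mi": math.log2(observed / expected),
        "t_score": (observed - expected) / math.sqrt(observed),
    }

tokens = "strong tea and strong coffee but weak tea".split()
result = bigram_association(tokens, "strong", "tea")
print(result)
```

MI tends to reward rare but exclusive pairs, while the t-score favours frequent pairs, which is one reason to report several measures side by side.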
🔑

Keyword Finder

Over- and under-represented words vs. a reference corpus

Identify vocabulary that is statistically distinctive in your target corpus compared to a reference corpus using keyness measures including G², chi-squared, and log-ratio.

→ Keyness and Keyword Analysis Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
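Two of the keyness measures named above can be sketched in a few lines of Python. The example assumes the widely used two-term form of the log-likelihood statistic (G²) and defines log-ratio as the binary logarithm of the ratio of relative frequencies. Whether the notebook uses exactly these variants is not stated in this page, and the function name and the input figures are made up for illustration.

```python
import math

def keyness(freq_target, freq_ref, size_target, size_ref):
    """Return (G2, log_ratio) for one word.

    Assumptions: two-term log-likelihood; log-ratio is left undefined
    (None) when the word is absent from either corpus.
    """
    total_freq = freq_target + freq_ref
    total_size = size_target + size_ref
    # Expected frequencies if the word were equally common in both corpora.
    exp_target = size_target * total_freq / total_size
    exp_ref = size_ref * total_freq / total_size

    def term(observed, expected):
        # A zero observed frequency contributes nothing to G2.
        return observed * math.log(observed / expected) if observed > 0 else 0.0

    g2 = 2 * (term(freq_target, exp_target) + term(freq_ref, exp_ref))
    if freq_target > 0 and freq_ref > 0:
        log_ratio = math.log2((freq_target / size_target) / (freq_ref / size_ref))
    else:
        log_ratio = None
    return g2, log_ratio

# Hypothetical counts: 20 hits in a 1,000-token target corpus,
# 10 hits in a 1,000-token reference corpus.
g2, log_ratio = keyness(20, 10, 1000, 1000)
print(round(g2, 3), log_ratio)
```

G² indicates how confident one can be that a difference exists, while log-ratio indicates how large the difference is, so the two are usually read together.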
🕸️

Network Visualiser

Network graphs from structured edge-list data

Create and explore network visualisations from structured edge-list data. Customise layout, node size, and colour, and download the resulting network graph.

→ Network Analysis Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
📊

Topic Explorer

LDA topic discovery across text collections

Discover latent themes across a text collection using LDA topic modelling. Adjust topic numbers interactively and explore topic-document distributions.

→ Topic Modelling Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
💬

Sentiment Explorer

Polarity scoring and eight basic emotion categories (NRC lexicon)

Score uploaded texts for positive/negative polarity and eight basic emotion categories using the NRC lexicon. Visualise sentiment over time or across documents.

→ Sentiment Analysis Tutorial
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
📋

Frequency Analyser

Word and n-gram lists, TTR, and Zipf distribution

Generate ranked word or n-gram frequency lists with normalised counts, type-token ratio, hapax legomena, and a Zipf law plot. Supports unigrams, bigrams, and trigrams with optional stopword filtering.

→ Tools overview
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
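The quantities listed for the Frequency Analyser can be illustrated with a small Python sketch that builds a ranked n-gram list with normalised counts, the type-token ratio, and the hapax legomena. It is an illustration under simple assumptions (lower-casing, a basic letters-and-apostrophes tokeniser, counts normalised per 1,000 items) and not the notebook's implementation; `frequency_profile` is a hypothetical name.

```python
import re
from collections import Counter

def frequency_profile(text, n=1, per=1000):
    """Return (ranked table, type-token ratio, hapax legomena) for n-grams.

    Assumptions: text is lower-cased and tokens are runs of letters
    and apostrophes; no stopword filtering is applied.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    grams = [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    counts = Counter(grams)
    total = len(grams)
    # Each row: (item, raw count, count normalised per `per` items).
    table = [(gram, count, count / total * per)
             for gram, count in counts.most_common()]
    ttr = len(counts) / total if total else 0.0
    hapax = sorted(gram for gram, count in counts.items() if count == 1)
    return table, ttr, hapax

table, ttr, hapax = frequency_profile("The cat and the dog and the bird")
for gram, count, rate in table:
    print(f"{gram:<6} {count:>3} {rate:>8.1f}")
print("TTR:", ttr, "hapax:", hapax)
```

Passing `n=2` or `n=3` produces bigram or trigram lists from the same function.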
📖

Readability Analyser

Reading level, lexical density, and text complexity scores

Score texts on Flesch Reading Ease, Flesch-Kincaid Grade, Gunning Fog, SMOG, and ARI readability indices. Also reports type-token ratio, lexical density, average sentence length, and average word length.

→ Tools overview
MyBinder Launch
ARDC Launch
🌐 Open access No login required — available worldwide
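Of the indices listed above, ARI (Automated Readability Index) is the simplest to show because it needs no syllable counts: ARI = 4.71 × (characters / words) + 0.5 × (words / sentences) − 21.43. The Python sketch below computes it together with average sentence and word length. The sentence splitter and tokeniser are crude assumptions for illustration, and the notebook's own counts may differ.

```python
import re

def ari_profile(text):
    """Return ARI, average sentence length, and average word length.

    Assumptions: sentences end at '.', '!' or '?'; words are runs of
    word characters with an optional internal apostrophe; only letters
    and digits count as characters.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"\w+(?:'\w+)?", text)
    if not sentences or not words:
        return None
    chars = sum(ch.isalnum() for word in words for ch in word)
    avg_word_len = chars / len(words)
    avg_sent_len = len(words) / len(sentences)
    ari = 4.71 * avg_word_len + 0.5 * avg_sent_len - 21.43
    return {"ari": ari,
            "avg_sentence_length": avg_sent_len,
            "avg_word_length": avg_word_len}

profile = ari_profile("The cat sat. The dog ran.")
print(profile)
```

Very short, simple texts like this sample can yield negative ARI values; the index is intended for longer passages of running prose.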

Shiny Tools

⚠️ Shiny Tools temporarily unavailable
The LADAL Shiny tools are currently offline due to ongoing server updates on the ARDC BinderHub infrastructure. We are working to resolve this as quickly as possible and will restore full access once the updates are complete. We apologise for the inconvenience.

The ARDC BinderHub launch links below are preserved for administrative purposes (status checking). Regular users are encouraged to use the Jupyter Notebook tools above in the meantime, which cover the most common text analysis tasks and are available to everyone worldwide via MyBinder.
Tool | Description | Related tutorial | Status | ARDC link
--- | --- | --- | --- | ---
✏️ FileRenamer | Batch rename plain-text files using find-and-replace, numbering, or prefix/suffix rules | String Processing | Unavailable | ARDC
🧹 TextCleaner | Remove XML/HTML tags, punctuation, URLs, and more; preview and download cleaned texts | String Processing | Unavailable | ARDC
🏷️ POSTagger | Part-of-speech tagging and dependency parsing in 65+ languages via UDPipe | Tagging and Parsing | Unavailable | ARDC
🔍 WordFinder | KWIC concordancing: search for words or phrases and sort by context | Concordancing | Unavailable | ARDC
🔑 KeywordExtractor | Keyness analysis: identify statistically distinctive vocabulary vs. a reference corpus | Keyword Analysis | Unavailable | ARDC
🕸️ WordWebber | Word co-occurrence networks scored by Mutual Information; download as PNG or CSV | Network Analysis | Unavailable | ARDC
💬 SentimentExplorer | Polarity and eight NRC emotion categories scored across uploaded texts | Sentiment Analysis | Unavailable | ARDC
🔗 CollocationCalculator | MI, t-score, log-likelihood, and more for collocations in your corpus | Collocation Analysis | Unavailable | ARDC
📊 TopicDetector | Unsupervised and seeded LDA topic modelling with interactive topic and term controls | Topic Modelling | Unavailable | ARDC

Reporting Errors

🔧

Something not working?

If a Jupyter tool fails to launch or behaves unexpectedly, please email Martin at m.schweinberger@uq.edu.au. When reporting an error, please include the tool name and a brief description of what happened. We will investigate and resolve issues as quickly as we can — thank you for letting us know.

