This site uses cookies

Some of these cookies are essential, while others help us to improve your experience by providing insights into how the site is being used.

For more detailed information please check our Cookie notice


Necessary cookies

Necessary cookies enable core functionality. This website cannot function properly without these cookies.


Cookies that measure website use

If you provide permission, we will use Google Analytics to measure how you use the website so we can improve it based on our understanding of user needs. Google Analytics sets cookies that store anonymised information about how you got to the site, the pages you visit, how long you spend on each page and what you click on while you’re visiting the site.

Coding Demonstrations: Text Mining in Python

2 Sep 2020 - 30 Sep 2020 3:00 pm - 4:00 pm
Online
Training
Data skills
Other
Social scientists may find that working with semi-unstructured data, such as natural language text, is an essential but time consuming and difficult part of a research project. However, there are some simple computational techniques that researchers can learn that can improve how they work with text data. These techniques can help researchers speed up and simplify their text analysis as well as make their research methods more transparently documented and reproducible.
This free series, organised by the UK Data Service, introduces core text-mining concepts and demonstrates some basic and advanced methods that can be customised to the needs of individual research projects. 
Led by Dr Diarmuid McDonnell and Dr Julia Kasmire of the UK Data Service, September’s demonstrations address the following topics:
  • Importing text data and basic preparations – 02 September 2020 - recording
  • Basic Natural Language Processing – 16 September 2020 - recording
  • Training a classifier for sentiment analysis  – 23 September 2020 - recording
  • Extracting named entities and creating a social network – 30 September 2020 - recording
Each code demo works through coding examples line-by-line, explaining the logical and programmatic aspects of the methods being demonstrated. During each session you will be able to run the code yourself in real time without any installation on your machine. Each demonstration uses the popular Python programming language and lasts for roughly 30 minutes, followed by up to 30 minutes for Q&A and other interaction. 
This series is part of a wider training programme on new forms of data for social science research
View recordings of past UK Data Service events on our YouTube channel.