Data Science Institute

We aim to set the global standard for a truly interdisciplinary approach to contemporary data-driven research challenges. Established in 2015, the Data Science Institute (DSI) has over 300 members and has raised over £40 million in research grants.

An abstract diagram of networks

About us

We are working to create a world-class Data Science Institute at Lancaster (DSI@Lancaster) that sets the global standard for a truly interdisciplinary approach to contemporary data-driven research challenges. DSI@Lancaster aims to have an internationally recognised and distinctive strength in being able to provide an end-to-end interdisciplinary research capability - from infrastructure and fundamentals through to globally relevant problem domains and the social, legal and ethical issues raised by the use of Data Science.

The Institute is initially focusing on the fundamentals of Data Science including security and privacy together with cross-cutting theme areas consisting of environment, resilience and sustainability;health and ageing, data and society and creating a world-leading institute with over 300 affiliated academics, researchers, and students.

Our data science, health data science and business analytics programmes have launched the careers of hundreds of data professionals over the last 10 years. Students from our programmes have progressed to data science roles at Amazon, PWC, Ernst & Young, Hawaiian Airlines, eBay, Zurich Insurance, the Co-operative Group, N Brown, the NHS and many others - please look at our Education pages for further details of the courses on offer.

Research Retreat Report 

Geospatial Research Retreat 20th & 21st June

On 20th June twenty academics from departments and research centres across the university with interests in mapping and analyzing geographical data travelled to Low Wood Hotel, Ambleside, for a 2-day research retreat organized by the Data Science Institute. The aims of the retreat were twofold – (i) to get to know each other and identify areas of common interest and expertise and (ii) to begin to think about ways we could collaborate in exciting new forms of research by exploiting discipline strengths and interdisciplinary connections.

The retreat began with a keynote presentation from David Bodenhamer (Indiana University) on ‘Connecting Matter, Meaning, and Experience: Data Science, Spatial Humanities, and Deep Maps’. Drawing on his extensive experience as both an academic and leader of a highly influential interdisciplinary research centre, David talked with authority about the ways in which geospatial technologies have influenced, and continue to influence, humanities research. His presentation literally had something for everyone in the room, and many notes were taken!

This was followed by a series of informal ‘getting to know’ activities facilitated by Paul Mahony (Countryscape), leading to the formation of embryonic groups representing different discipline strengths and areas of application. The afternoon session began with a presentation on partnerships and knowledge exchange by Charlotte Stewart (RES) before returning to group work to sketch out initial project ideas. The first day ended with an evening meal and further informal networking as the sun set over the beautiful Low Wood bay.

The second day began with groups ‘walking and talking’ through their project ideas in the grounds of the hotel and along the shores of Windermere. It was liberating to be away from the conference room, flip chart and maker pen, albeit briefly! Mandy Dillon (RES) then gave a presentation on research funding, giving examples of recent and up-and-coming UK and EU calls with an emphasis on geospatial problems or solutions. The remainder of the day was spent working up the detail of the project proposals in readiness for a Dragons’ Den event in which one group member had 2 minutes to pitch the project idea to the assembled dragons – David, Mandy and Charlotte. The tension was palpable, with some groups avoiding the flames whilst others were well and truly singed!

Looking back, there is a lot to be said for getting away from campus, albeit briefly, to get to know colleagues from other departments and research centres and develop new interdisciplinary connections that may ultimately lead to exciting new collaborations. The research retreat has already been a success in this respect.

Over the next 12 months we hope to (i) grow the membership of the group and (ii) organize a series of events designed to further stimulate the development of exciting multidisciplinary research and scholarship in geospatial data science. If you would like to be part of this group, please do get in touch!

Duncan Whyatt, Lancaster Environment Centre

Patricia Murrieta-Flores, Digital Humanities

Barry Rowlingson, Health and Medicine (CHICAS)

Latest News

First International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2024)

On the 29 to 30 July 2024, 向日葵视频 is hosting the first international conference on Natural Language Processing and Artificial Intelligence for Cyber Security.

This is the very first event of its kind and it could not come at a more critical time. In today’s digital age, cyber security has emerged as a heightened priority for both individual users and organisations. With the surge in online information, new innovative solutions are required to address the challenge of traditional security measures falling short against evolving threats.

The conference will also have a special theme track with the goal of stimulating discussion on the future of cyber security in the era of Large Language Models (LLMs) and generative AI. Delegates will examine the challenges, risks and safety issues associated with employing these models in everyday tasks, focusing on aspects including fairness, ethics, and responsibility.

Lancaster staff and students who want to attend, please fill out the or contact Julia Carradus if you have any questions.

For more information on the conference please see the or email info@nlpaics.com.

NLPAICS 2024 will bring researchers, academics, and business industry leaders together as they hear the latest solutions to address risks in processing digital information. It will explore a range of themes around the employment of NLP and AI for cyber security training including:

  • Societal and human security and safety
  • Anomaly detection and threat intelligence
  • Systems and infrastructure security
  • Ethics, bias, and legislation in cyber security

The event is being organised by the 向日葵视频 UCREL NLP research group, the Data Science Institute and Security Lancaster

NLPAICS logo

Workshop on time-series analysis of noisy data at 向日葵视频 - 25th to 27th September 2024

Workshop on time-series analysis of noisy data

The aim of the workshop is to review recent progress in discerning cyclic processes in noisy background, focusing especially on the widespread case of oscillations with time-varying frequencies.

This topic will be discussed in the context of linear, stationary, non-stationary, nonlinear, chaotic, stochastic, autonomous and non-autonomous processes and systems. Practical problems in all areas of human endeavour where data are being collected will be addressed. These include, but are not limited to, living systems, medicine, neuroscience, chronobiology, ecology, climate, economics, space science, astrophysics, lasers, optics and photonics, semiconductors, battery lifecycles, classical and quantum turbulence, engineering and oceanography.

More details can be find on the conference website:

DSI Workshops

All these workshops have now taken place - for the latest news please visit the workshops page

Katie McDonough (DH History) & Daniel Wilson (Turing Institute) Advertising Machines - 21st & 22nd March 2024

Elisa Rubegni, PhD (SCC) & Kate Cain (Psychology) Empowering Tomorrow: Bridging Disciplines for Inclusive Child-Tech - 2nd & 3rd May 2024

Kate Cain (Psychology) and Anastasia Ushakova (CHICAS/FHM) T-READS Tracking Reading and Educational Attainment through Data Science - 3rd & 4th June

Jess Bridgen (Mathematical Sciences) and Jon Read (Lancaster Medical School) Real-time Modelling of Nosocomial Transmission: the unanswered Questions - 11th June 2024

David Parkes (LEC), Luke Rhodes-Leader (Management Science), Paul Cureton (LICA), Eduard Campillo-Funollet (Mathematical Sciences). Calibration and validation for complex system modelling - 11th & 12th June

Paul Rayson (SCC), Jo Knight (CHICAS/FHM), Daisy Harvey and Nick King HealTAC 2024 Healthcare Text Analysis Conference 12th - 14th June 2024

Paul Smith (Mathematical Sciences), Alex Bush (LEC), Emma Eastoe (Mathematical Sciences) Environmental and Ecological Statistics Group - 1st-3rd July 2024 at 向日葵视频

Zoe Alker (History) and Esmorie Miller (Sociology), The Technologies of Everyday Bordering, Past, Present and Future - 5th July at The Storey

Barbara Shih (Biomedical and Life Science) & Richard Mort (Biomedical & Life Science) Disentangling the genes contributing to Dalmatian spotting and associated disease - 10th July, 向日葵视频

DSI ECR Showcase Talks

These talks are to showcase the incredible research our ECR community is involved in, it also gives speakers the opportunity to practice for conference presentations and get feedback on unpublished research.

Get in touch with any questions - we can do these talks in the Lent and Summer term too - always happy to talk about your research and how to showcase your work.

If you are interested in giving a talk, please email David Parkes (d.parkes@lancaster.ac.uk) to discuss dates.

Get in touch with any questions - we can do these talks in the Lent and Summer term too - always happy to talk about your research and how to showcase your work.

Research Themes

Data Science at Lancaster was founded in 2015 on Lancaster’s historic research strengths in Computer Science, Statistics and Operational Research. The environment is further enriched by a broad community of data-driven researchers in a variety of other disciplines including the environmental sciences, health and medicine, sociology and the creative arts.

  • Foundations

    Foundations research sits at the interface of methods and application: with an aim to develop novel methodology inspired by the real-world challenge. These could be studies about the transportation of people, goods & services, energy consumption and the impact of changes to global weather patterns.

  • Health

    The Health theme has a wide scope. Current areas of strength include spatial and spatiotemporal methods in global public health, design and analysis of clinical trials, epidemic forecasting and demographic modelling, health informatics and genetics.

  • Society

    Data Science has brought new approaches to understanding long-standing social problems concerning energy use, climate change, crime, migration, the knowledge economy, ecologies of media, design and communication in everyday life, or the distribution of wealth in financialised economies.

  • Environment

    The focus of the environment theme has been to seek methodological innovations that can transform our understanding and management of the natural environment. Data Science will help us understand how the environment has evolved to its current state and how it might change in the future.

  • Data Engineering

    The Data Engineering theme aims to explore how we can utilise digital technologies to accelerate and enhance our research processes across the University.

Research Software Engineering

Within the Data Science Institute, our aim is to improve the reproducibility and replicability of research by improving the reusability, sustainability and quality of research software developed across the University. We are currently funded by the N8CIR, and work closely with our partner institutions across N8 Research.

Research Software Engineering

Upcoming Events