Working with Messy Data in OpenRefine

So you found a dataset, but it doesn’t quite work for you; now what? Never fear! We have lots of tips and tricks that you can use to clean up a messy dataset, adjust the data to line up with what you already have, or reduce the dataset to only what you need. We will look at how you can use Excel and OpenRefine to evaluate and work with your data.

Friday, February 22, 2019 - 1:00pm to 2:00pm
Location: Tisch Library, Data Lab, Room 203

Workshops This Month And Beyond

Wednesday, March 27, 2019 - 10:00am to 11:30am
Location: Tisch Library, Data Lab, Room 203

Designed to replace paper notebooks, a LabArchives Digital Notebook can be used to document ideas, notes, research, experiments, and procedures. The notebooks are designed for intuitive use, ease of search, copying, and versioning. They support collaboration among many users and include access controls. In this workshop you will get an overview of LabArchives Digital Research Notebooks. We will go over the main features and tools and show you the benefits of moving from a paper notebook to a digital one.

Computers will be provided during the training session, but feel free to use your own laptop. A Tufts account is required to log in to the LabArchives Digital Notebook solution.

Wednesday, March 27, 2019 - 11:30am to 1:00pm
Location: Tisch Library, Data Lab, Room 203

In this workshop, we will explore advanced sharing with group management, as well as widgets for creating customized data-entry forms. The session focuses on creating research-group-specific forms and databases for Chemical Inventory and Freezer Box.

Computers will be provided during the training session, but feel free to use your own laptop. A Tufts account is required to log in to the LabArchives Digital Notebook solution.

Wednesday, March 27, 2019 - 3:00pm to 4:00pm
Location: Tisch Library, Data Lab, Room 203

Charts, graphs, maps, and infographics are a great way to communicate research, but not all visualizations are created equal. In this workshop we will talk about the ways that you can create truthful and effective visualizations and look at a series of examples to see how visualizations can go terribly wrong. We will also start to work with Tableau to make attractive visualizations.

Thursday, March 28, 2019 - 1:30pm to 3:00pm
Location: CLIC 114

This is an introductory-level workshop on basic Linux (the command-line environment and some useful commands).

**Required** This is a hands-on workshop, and we will be using the Tufts HPC cluster for demonstration purposes. If you don't already have access to the Tufts HPC cluster, please go to https://research.uit.tufts.edu/ and fill out the "Request account for Research Computing Cluster" form at least 2 days before the workshop.

No previous Linux experience is required.

**Required** Please bring your own laptop!

Friday, March 29, 2019 - 1:30pm to 3:00pm
Location: CLIC 114

This is an introductory-level workshop on basic Linux (the command-line environment and some useful commands).

**Required** This is a hands-on workshop, and we will be using the Tufts HPC cluster for demonstration purposes. If you don't already have access to the Tufts HPC cluster, please go to https://research.uit.tufts.edu/ and fill out the "Request account for Research Computing Cluster" form at least 2 days before the workshop.

No previous Linux experience is required.

**Required** Please bring your own laptop!

Friday, March 29, 2019 - 2:00pm to 3:00pm
Location: Tisch Library, Data Lab, Room 203

Just when you finally get ArcMap down, they develop a new platform for ArcGIS! A new ribbon-based, map-centric product, ArcGIS Pro has all the tools you’ve come to know and love, plus it’s easier to use than ArcMap (once you get used to it). Come learn the future of Esri… you might like it more than you think!

Monday, April 1, 2019 - 9:00am to 12:00pm
Location: SEC 117

This is a two-day workshop that will run on both Monday, April 1 and Tuesday, April 2, from 9:00am to 12:00pm. Please bring your own computer.

High-throughput RNA sequencing allows genome-wide investigation of gene expression and regulation. However, designing an experiment and choosing the right tools for analysis can be challenging. This two-day course aims to teach the basics of obtaining and analyzing RNA-seq data: experimental design, quality control, alignment and visualization of reads, and differential expression analysis. Analysis will be done with tools available on the Tufts High Performance Compute Cluster, and example code and tutorials will be available at https://sites.tufts.edu/biotools/tutorials/.

Wednesday, April 3, 2019 - 10:30am to 12:00pm
Location: Tisch Library, Data Lab, Room 203

This is a hands-on workshop designed to get you started analyzing data using Python and Jupyter Notebooks. We will provide a whirlwind overview of what Python is (and isn't) and why you might want to use it to look at your data. Then we'll dive into some programming basics and discuss:

  • What are variables and how do I use them?
  • What other data structures are available?
  • Essential Python packages for data analysis (NumPy, Matplotlib, Pandas, and more) and how to import them
  • The basics of visualization with Matplotlib
  • How do I import my data?

No previous programming experience is required for this workshop, and computers will be provided through the Data Lab. If you already have some data that you would like to explore, there will be time to do so at the end, so please feel free to bring it along (preferably in Excel or CSV format). Sample datasets will also be provided.

Thursday, April 4, 2019 - 1:30pm to 3:00pm
Location: CLIC 114

This workshop is a brief introduction to the structure of the Tufts HPC cluster, as well as the basic usage of its scheduler, "SLURM".

**Required** This is a hands-on workshop, and we will be using the Tufts HPC cluster for demonstration and practice. If you don't already have access to the Tufts HPC cluster, please go to https://research.uit.tufts.edu/ and fill out the "Request account for Research Computing Cluster" form at least 2 days before the workshop.

Basic Linux knowledge is required. If you are not familiar with Linux, please check out our "Introduction to Basic Linux Workshop".

**Required** Please bring your own laptop!

Friday, April 5, 2019 - 1:00pm to 2:00pm
Location: Tisch Library, Data Lab, Room 203

Charts, graphs, maps, and infographics are a great way to communicate research, but not all visualizations are created equal. In this workshop we will talk about the ways that you can create truthful and effective visualizations and look at a series of examples to see how visualizations can go terribly wrong. We will also start to work with Tableau to make attractive visualizations.

Monday, April 8, 2019 - 10:00am to 11:00am
Location: Tisch Library, Data Lab, Room 203

Description Pending

Wednesday, April 10, 2019 - 1:00pm to 2:30pm
Location: Tisch Digital Design Studio (DDS)

In this workshop you will learn editing tools and strategies using found media with the aim of understanding different ways one could approach the footage. We will cover media management, starting a new project, importing, editing techniques, and exporting. Demo media will be provided. 

Thursday, April 11, 2019 - 1:30pm to 3:00pm
Location: CLIC 114

This is an introductory-level workshop on basic Linux (the command-line environment and some useful commands).

**Required** This is a hands-on workshop, and we will be using the Tufts HPC cluster for demonstration purposes. If you don't already have access to the Tufts HPC cluster, please go to https://research.uit.tufts.edu/ and fill out the "Request account for Research Computing Cluster" form at least 2 days before the workshop.

No previous Linux experience is required.

**Required** Please bring your own laptop!

Friday, April 12, 2019 - 2:00pm to 3:00pm
Location: Tisch Library, Data Lab, Room 203

Description Pending

Wednesday, April 17, 2019 - 3:00pm to 4:00pm
Location: Tisch Library, Data Lab, Room 203

Data is an essential part of your research, but keeping track of it can be tricky, and data management plans are an increasingly important component of project planning. In this workshop we will talk about the tools and strategies that you can use to manage your data from collection all the way through to preservation.

Thursday, April 18, 2019 - 10:30am to 11:30am
Location: Tisch Digital Design Studio (DDS)

An interactive session on the elements and principles of poster design. While these principles apply regardless of the software you design in (or even to non-digital design), we’ll make use of Canva, an easy-to-use, free web-based application.

Thursday, April 18, 2019 - 1:30pm to 3:00pm
Location: CLIC 114

This workshop is a brief introduction to the structure of the Tufts HPC cluster, as well as the basic usage of its scheduler, "SLURM".

**Required** This is a hands-on workshop, and we will be using the Tufts HPC cluster for demonstration and practice. If you don't already have access to the Tufts HPC cluster, please go to https://research.uit.tufts.edu/ and fill out the "Request account for Research Computing Cluster" form at least 2 days before the workshop.

Basic Linux knowledge is required. If you are not familiar with Linux, please check out our "Introduction to Basic Linux Workshop".

**Required** Please bring your own laptop!

Friday, April 19, 2019 - 2:00pm to 3:00pm
Location: Tisch Library, Data Lab, Room 203

Sick of using only PowerPoint to make presentations? Esri Story Maps are web applications that let authors combine beautiful, interactive maps with narrative text, striking images, and multimedia, including video. Impress your professors, bosses, or friends with these attractive and easy-to-use applications while bringing your story to life. Story maps can be embedded in web pages and truly up your presentation game!