Search
Loading

Purdue Libraries and School of Information Studies News

Publishing Open Access and Transformed Datasets

Publishing Open Access and Transformed Datasets

October 24th, 2019

Oct. 21-27, 2019, is International Open Access Week. This is part of a series — written by Purdue faculty and staff — that demonstrates the benefits of open access scholarly publishing. For the entire series, visit https://blogs.lib.purdue.edu/news/category/oaweek19/.


Sandi Caldrone, Purdue University Libraries and School of Information Studies
Sandi Caldrone

by Sandi Caldrone, Data Repository Outreach Specialist

Publishing open access data requires imagination. When I review datasets submitted for publication in the Purdue University Research Repository (PURR), I try to put myself in the shoes of a scholar hoping to reuse this dataset, and I try to imagine every question the scholar might have. When you share your data with the world, you open it up to new possibilities—possibilities that are hard to anticipate.

On November 10, 1981, French philosopher Gilles Deleuze gave a lecture on cinema in a Paris university. When he prepared his notes for class that day, he could have had no way of knowing that a student’s audio recording of that lecture, along with dozens of his other lectures, would eventually find their way to the French National Library, and from there to PURR, where anyone can download it to hear his words or text mine the transcriptions.

When Deleuze gave this lecture a little less than 40 years ago, that tape recorder was the most advanced technology in the room. Now, digital humanities students can plug his words into online tools that spin out word clouds, bubble charts, and network graphs. That’s why data curators are always pushing for richer descriptions of data. We want to give future researchers everything they might need to conduct analyses we can’t even imagine yet.

The cycle of imaginative reuse doesn’t have to take forty years. In PURR, we’re already starting to see second-generation open access data—open access data that has been combined, transformed, and republished as a new open access dataset.

As it was in Deleuze’s classroom, it is students who are in the vanguard.

In 2019, PURR has started to see examples of student-faculty collaborations in which students collect data from various open access datasets and put in the labor required to prepare those data for analysis. By publishing their transformed data, they give other researchers the opportunity to pick up where they left off and push scholarship forward, instead of reinventing the wheel. See two excellent examples:

It’s hard to imagine what students might do with data 40 years from now, but I’m really looking forward to finding out.


Explore the Purdue University Research Repository at https://purr.purdue.edu/.

Learn more about Purdue’s Open Access resources, including Purdue e-Pubs, Purdue’s open access digital repository, at www.lib.purdue.edu/openaccess.


Thinking Creatively About Theses

May 29th, 2019

On Thursday, May 23, a Symposium on Electronic Theses and Dissertations was held at Purdue University. Ashlee Messersmith, manager, thesis/dissertation, The Graduate School at Purdue University, and Michael Witt, associate professor, Purdue Libraries and School of Information Studies, organized the event, with support from the United States Electronic Thesis and Dissertation Association (USETDA).

By Michael Witt, Head, Distributed Data Curation Center (D2C2), and Associate Professor of Library Science

"Purdue Graduate School Thesis and Dissertation Policy Changes: Giant Leaps Forward" at the Symposium on Electronic Dissertations and Theses May 23 in Purdue's Wilmeth Active Learning Center.
Ashlee Messersmith (far left) and James L. Mohler, deputy chair, The Graduate School at Purdue, and professor in computer graphics technology (CGT), presented “Purdue Graduate School Thesis and Dissertation Policy Changes: Giant Leaps Forward” at the Symposium on Electronic Theses and Dissertations May 23 in Purdue’s Wilmeth Active Learning Center.

The presenters at the Symposium on Electronic Theses and Dissertations (ETD) highlighted a wide variety of creative works produced by graduate students in earning their degrees, such as:

  • a newly discovered chemical structure with directions for building your own model of it using a 3D printer;
  • training materials for board game designers to help them write better instructions for teaching people how to play their games; and
  • an online map of the state of Indiana with embedded ecological data to improve natural resource management.

Graduate students will typically prepare and defend a written thesis, even if their research can be communicated in a more meaningful or impactful format than a document. There are other examples, such as software source code and research data, videos and photos from exhibits and performances, mixed media, dynamic websites, and much more produced by students; but this type of content is often left out of a traditional thesis.

Professor and Dean of Libraries at the University of North Carolina at Greensboro Martin Halbert addressed the landscape and life cycle of electronic theses and dissertations, as well as the ETDPlus resource Symposium on Electronic Theses and Dissertations on May 23 at Purdue University.
On May 23, Professor and Dean of Libraries at the University of North Carolina at Greensboro Martin Halbert addressed the landscape and life cycle of electronic theses and dissertations, as well as the ETDPlus resource, at the Symposium on Electronic Theses and Dissertations at Purdue University.

In some cases, these non-traditional works could be considered as the primary product of the students’ scholarship — without the need for a written thesis.

Recent changes to the policies of Purdue’s Graduate School reflect a progressive approach and support for non-traditional theses, embracing both the opportunities and challenges they present for the Purdue’s faculty, thesis office, and libraries.

“As emerging technologies continue to influence higher education, we needed to set a precedent through which students are permitted to express their creativity,” Messersmith explained. “Exploring these influences and their implications was the focus of the symposium, which was held in the Wilmeth Active Learning Center. We invited experts to share ideas and brainstorm with participants who supervise theses and manage the processes and platforms for producing and archiving them.”

Guiding Graduate Students in Data Management in Practice
Michael Witt presented “Guiding Graduate Students in Data Management in Practice” at the ETD Symposium May 23 at Purdue. Witt’s presentation covered the Purdue University Research Repository (PURR), which helps university researchers plan and implement effective data management plans, share and manage their data with collaborators while the research is taking place, publish their data in a scholarly context, archive data for the long-term, and measure the impact of sharing their data.

The opening keynote presentation by Professor and Dean of Libraries at the University of North Carolina at Greensboro Martin Halbert addressed the landscape and life cycle of electronic theses and dissertations, as well as the ETDPlus resource.

The closing keynote, delivered by Jean-Pierre Hérubel, professor, Purdue Libraries and School of Information Studies, dove into the history and culture of the doctoral dissertation, as well as variations and transformations of its purpose and form.

Other presentations from Purdue faculty and staff explored issues related to student perspectives, digital humanities, graduate college policies, research data management, digital preservation, and scholarly publishing. Throughout the symposium, participants discussed important questions related to sharing current practices; interfacing with faculty to observe and respect local cultures related to credentialing students; identifying concerns and opportunities for graduate colleges, libraries, and technology providers; and increasing collaboration within the University and among universities. A lively round of lightning talks in the afternoon featured specific examples of theses that challenge conventions from other universities.

Presentation slides and collaborative notes from the symposium are available on Purdue Libraries and School of Information Studies’ e-Pubs repository at https://docs.lib.purdue.edu/etdgiantleaps/.

 


PURR and Libraries: Providing Purdue Researchers with Data-Management and Archiving Tool

September 7th, 2018

Data Repository Outreach Specialist Sandi Caldrone (Research Data, Purdue Libraries)
Data Repository Outreach Specialist Sandi Caldrone (Research Data, Purdue Libraries)

Last week, the Washington Post published an article about the data a Purdue University professor (and two of his research colleagues) gathered on “every confirmed, line-of-duty police killing a civilian in 2014 and 2015.Logan Strother, assistant professor in the Purdue Department of Political Science, used the Purdue University Research Repository, or PURR, to publish the dataset of police shootings he references in the piece. (Co-authors include Charles Menifield and Geiguen Shin, both at Rutgers University, Newark.) According to Data Repository Outreach Specialist (Research Data, Purdue University Libraries) Sandi Caldrone, by using PURR to publish the dataset, Strother is promoting transparency in scholarship.

“It also allows others researchers to replicate or build upon his work,” she noted.

She said the dataset referenced in the Washington Post piece is freely available for public download on the PURR website at doi.org/10.4231/R70G3HCR. It is an example of how one Purdue faculty member uses the valuable PURR research data-management tool.

“PURR is available to anyone at Purdue—faculty, staff, and students,” Caldrone said. “We support researchers throughout the research data-management lifecycle, providing help with data-management planning, online file storage for ongoing projects, data-publication services, and data preservation and archiving.”

According to Vikki Weake, assistant professor in biochemistry at Purdue, she and her lab team members have used PURR extensively to archive datasets associated with their published studies.

“Data management and archiving are becoming increasingly important in the life sciences,” Weake noted. “This is really important, as other researchers have access to the raw data, so they can replicate our analyses and results. The National Institutes of Health have recognized that we need efforts to improve rigor and reproducibility in biomedical science, and services that make raw data freely available are a great way for labs to be transparent about the work that they are doing. Ideally, other groups should be able to take our data and replicate our findings, or if new knowledge becomes available—they might use our data to gain novel insight into a biological process.”

In a brief Q&A below, Caldrone shares how PURR fits into the work that researchers at Purdue University perform and how she and Libraries’ faculty and staff can support them via PURR.

Q. How does PURR fit into the resources and services provided to campus by the Purdue Libraries?

Caldrone: Most of our resources are available online at purr.purdue.edu, but what really sets us apart from other data-management tools is that we have a team on campus to help every step of the way. We’re part of the Research Data unit, which provides consultations and support to help Purdue researchers plan, describe, disseminate, steward, and archive datasets.

Q. Why would faculty and students want to use PURR for their research needs?

Caldrone: Data is a valuable research product, and increasingly funders and publishers expect that product to be shared with the public. We provide the support to meet those funder and publisher requirements. There are lots of other places to publish data online. Our advantage is that we have support staff on campus to help with the process.

Since we are part of the Libraries, we also take preservation seriously, and we carefully archive all of our published datasets. During data collection, many researchers also take advantage of our online file storage space. It’s accessible anywhere on the web and is a simple, easy option for sharing files with off-campus collaborators.

Students learning about data should also look to PURR for sample datasets. See what data looks like in your discipline, download data files, and use them to test data analysis and visualization tools. Or, just explore our collections.

Q. Recently, PURR was redesigned. Why it was needed? What changed about it?

Caldrone: Our look hadn’t changed much since we started in 2011, so we were definitely due for a visual redesign. We took that opportunity to make functional improvements, as well. We increased our storage space, streamlined the registration process, and really expanded our collection of help resources.

Home page of the Purdue University Research Repository. Images that appear on the home page are part of datasets stored in PURR. This image is from "Biological, chemical and flow characteristics of five river sampling sites in the Wabash River watershed near Lafayette, Indiana – 2014."
Home page of the Purdue University Research Repository. Images that appear on the home page are part of datasets stored in PURR. This image is from “Biological, chemical and flow characteristics of five river sampling sites in the Wabash River watershed near Lafayette, Indiana – 2014.”

Q. When in the research process should a researcher at Purdue begin to think about using PURR?

Caldrone: We’re happy to help researchers at any stage, but ideally we hope people will think about PURR early in the planning process. We provide helpful resources and in-person guidance for researchers writing data-management plans, whether or not they decide to publish their data in PURR. Having sound data-management practices in place before data collection starts saves a lot of work and stress down the road.

Q. How should a researcher reach out to you and your team members about using PURR? What kind of customer service help can you provide them to help get them started?

Caldrone: We have written instructions and video demos online showing how to use the PURR (see purr.purdue.edu/guides). We also provide one-on-one or group training sessions and consultations. Researchers can reach out to us at purr@purdue.edu or submit a support ticket on the website. You can also reach the entire Libraries Research Data team at researchdata@purdue.edu.

Q. Any other information you would like to impart to the audience at Purdue?

Caldrone: We’ve had some exciting data collections published recently. Standa Pejsa, PURR’s data curator, worked closely with Professor Nicholas Rauh in classics to publish an image database of hundreds of pottery sherds from Dr. Rauh’s archaeological work in the Cilicia region in what is now Turkey. Their publication is the result of years of hard work and can be found at https://purr.purdue.edu/publications/2924/1.

We’re also working with the philosophy department to publish audio recordings and transcripts of lectures given by French philosopher Gilles Deleuze. This work is still underway, but we have several semesters’ worth of lectures already published. Anyone who would like to hear what it was like to take a course with Deleuze can check out The Movement-Image: Bergsonian Lessons on Cinema.


Purdue University Research Repository Gets Makeover

March 6th, 2018

Home page of the Purdue University Research Repository. Images that appear on the home page are part of datasets stored in PURR. This image is from "Biological, chemical and flow characteristics of five river sampling sites in the Wabash River watershed near Lafayette, Indiana – 2014."
Home page of the Purdue University Research Repository (PURR). Images that appear on the home page are part of datasets stored in PURR. This image is from “Biological, chemical and flow characteristics of five river sampling sites in the Wabash River watershed near Lafayette, Indiana – 2014.”

The Purdue University Research Repository, a research data-management resource for Purdue faculty, staff, and students, has been updated recently to better serve the University community.

PURR provides an online, collaborative workspace for Purdue researchers with tools for learning about data-management best practices, privately sharing research files, and publishing completed data sets. Its goal is to help researchers meet funders’ data-sharing requirements and to move science forward by publishing and preserving open data, says Sandi Caldrone, data repository outreach specialist in the Research Data unit of Purdue Libraries.

The improvements made to PURR include:

  • 10 times more storage space for private files;
  • full-featured datasets gallery on the home page;
  • consolidation of all instructions, guidelines, tips, and tricks to one “Help” section;
  • new videos and tutorials; and
  • easier access for graduate students.

The design was based on results from a usability study conducted in 2017. The website also has been streamlined so that it is much easier to navigate and use. It also is mobile-friendly.

Caldrone says that if someone has used PURR before, all of their data is still there. She also notes that published data sets are now called “Datasets,” instead of “Publications.”

The account registration process also has been streamlined for easier use. Purdue users can log in using their Purdue career accounts. Non-Purdue researchers can register for an account if they are invited by a Purdue researcher to collaborate on a project.

PURR is a collaboration among Purdue Libraries, ITaP, and the Office of the Executive Vice President for Research and Partnerships.

For more information, contact Caldrone at scaldron@purdue.edu or email PURR at purr@purdue.edu.

 

PURR (Purdue University Research Repository) enables users to explore the open data behind Purdue University's world-class research. PURR publishes and archives digital datasets from researchers across campus and welcomes all kinds of open data, from images and videos to spreadsheets and source code.
PURR (Purdue University Research Repository) enables users to explore the open data behind Purdue University’s world-class research. PURR publishes and archives digital datasets from researchers across campus and welcomes all kinds of open data, from images and videos to spreadsheets and source code.

Article courtesy of Megan Huckaby, Purdue University Marketing and Media


Garimella Honored with Leadership in Open Access Award by Purdue Libraries, Office of Provost

October 26th, 2017

Suresh V. Garimella, Executive Vice President for Research and Partnerships and the R. Eugene and Susie E. Goodson Distinguished Professor in the School of Mechanical Engineering at Purdue University, was honored with the 2017 Leadership in Open Access Award from Purdue University Libraries and the Office of the Provost Monday, Oct. 23.

This week (Oct. 23-29) academic institutions and libraries across the globe are celebrating the benefits of Open Access for research and scholarship during the 10th annual International Open Access Week commemoration.

Purdue University’s Suresh V. Garimella (seated in the photo), Executive Vice President for Research and Partnerships and the Goodson Distinguished Professor of Mechanical Engineering, was honored with the Leadership in Open Access Award for 2017 from the Office of the Provost and Purdue Libraries. Pictured, L to R: Jay T. Akridge, interim Provost and Executive Vice President for Academic Affairs and Diversity; Garimella; James L. Mullins, Dean of Libraries and Esther Ellis Norton Professor; and Nina Collins, Scholarly Publishing Specialist, Purdue Scholarly Publishing Division.

According to Dean of University Libraries James L. Mullins, Garimella was selected to receive the recognition this year for leading by example in the Open Access movement at Purdue University. Garimella has more than 400 works posted in the Purdue e-Pubs repository, which have been downloaded close to 256,500 times.

“Dr. Garimella has demonstrated leadership in Open Access to Scholarly Publications by depositing his numerous papers and articles, consistent with copyright and contractual agreements, into Purdue e-Pubs. Therefore, we present the 2017 Leadership in Open Access Award to him in recognition of his outstanding leadership and continued partnership with Purdue e-Pubs to increase visibility of scholarship at Purdue,” Mullins noted.

“It is a great honor to be recognized for our research group’s commitment to Open Access. I am deeply thankful to the scores of students in my group who, over the years, have contributed to the impactful publications that have been eagerly downloaded through the University’s excellent Purdue e-Pubs portal,” Garimella said.

Since 2012, Purdue e-Pubs has close to 15,153,000 downloads from users all over the world, with the average download rate of 2,256,893 per year.

“Dr. Garimella embodies the spirit of the land-grant institution through his work to make scholarly research widely available,” said Jay Akridge, interim Provost and Executive Vice President for Academic Affairs and Diversity. “I congratulate him and all of the students in his group who contribute to global learning by broadening the reach of scholarship.”

For more information about Open Access at Purdue, visit www.lib.purdue.edu/openaccess. Learn more about Purdue e-Pubs at http://docs.lib.purdue.edu/.


Faculty Spotlight: Purdue Libraries Professor Recognized for International Research Data Initiative

March 31st, 2017

Purdue University Libraries Associate Professor Michael Witt has been recognized by the American Library Association (ALA) with the Oberly Award for Bibliography in the Agricultural or Natural Sciences. He shares the award with Frank Scholze, from the Karlsruhe Institute of Technology, for their collaboration on re3data, which is an online registry of research data repositories.

Michael Witt, Purdue University Libraries
Michael Witt

“Researchers are being required by funding agencies to share the data generated by grant-funded research,” Witt said. “They can search re3data to identify the best repository for them to deposit their datasets to satisfy these requirements and to increase the impact of their research by sharing their data.”

re3data has been widely adopted and is referenced in guidelines by such funders as the European Commission, National Science Foundation, National Institutes of Health, and Department of Transportation, as well as publishers, including Nature, PeerJ, Copernicus, and the Public Library of Science.

The registry also helps students, publishers, research administrators, and librarians find research data and track the impact of data-sharing. Users can browse and search for data repositories in re3data by subjects, keywords, policies, locations, and a variety of other features.

Repositories are identified and entered into the registry by an international editorial board of librarians working toward comprehensive coverage of every discipline from repositories around the world.

“All entries in re3data.org are reviewed twice by the editorial board for accuracy. The search, browse, and filtering options make this registry quite useful, and the icons and other metadata indicating important repository characteristics, such as access restrictions and persistent identifiers, add considerable value,” said Sara Scheib, chair of the award committee.

Early work on re3data was supported in the United States by the Institute of Museum and Library Services and the Deutsche Forschungsgemeinschaft in Germany, and it is currently managed in collaboration with DataCite, an international, nonprofit organization that promotes data citation.

Purdue University Libraries is a founding organizational member of DataCite, and re3data complements other campus services the Purdue Libraries offer, such as consulting on data management with researchers, teaching data literacy, and providing the Purdue University Research Repository (PURR).

Witt is also the director of PURR, a university core research facility, provided by the Libraries, the Office of the Executive Vice President for Research and Partnerships, and Information Technology at Purdue (ITaP).

The Oberly Award is given every other year by the Science and Technology Section of the Association of College and Research Libraries (ACRL), which is a division of ALA. It will be presented in June at the ALA Annual Conference in Chicago.