Artificial Intelligence Research Data Science Specialist at Dartmouth College

Posted in Other about 15 hours ago.

Location: Hanover, New Hampshire





Job Description:

Posting date:

12/18/2024


Open Until Filled:

Yes


Position Number:

1129150


Position Title:

Artificial Intelligence Research Data Science Specialist


Department this Position Reports to:

Research Data Services


Hiring Range Minimum:

$108,700


Hiring Range Maximum:

$125,000


Union Type:

DCLWU


SEIU Level:

Not an SEIU Position


FLSA Status:

Exempt


Employment Category:

Regular Full Time


Scheduled Months per Year:

12


Scheduled Hours per Week:

40


Location of Position:

Hanover, NH


Remote Work Eligibility?:

Hybrid


Is this a term position?:

No


If yes, length of term in months.:

NA


Is this a grant funded position?:

No


Position Purpose:

This position works as part of the Dartmouth Libraries Research Data Services team to support research, curricular, and applied artificial intelligence work on campus. The person in this role will bring data science skills together with necessary expertise in information curation and knowledge management to support a variety of generative artificial intelligence applications, such as semantic search, retrieval augmented generation, and information/data retrieval application development. Working alongside campus partners engaged in data science and generative artificial intelligence work, this role will focus on database creation, data ingestion, information preprocessing and embedding, vector database management, and system optimization.This position is hybrid work location eligible.


Required Qualifications - Education and Yrs Exp:

Bachelors plus 3-5 years' experience or equivalent combination of education and experience


Required Qualifications - Skills, Knowledge and Abilities:



  • BA in quantitative or related field + 3-5 years experience, or; MA in a quantitative or related field + 1-3 years, or; PhD in a quantitative or related field; or MLIS + 1-3 years

  • 1-3 years of relevant education or work experience in research or applied AI environments

  • Demonstrated knowledge of programming/ scripting languages and analysis applications (e.g., R, Python, SAS, SPSS)

  • Experience with using GenAI, Deep Learning frameworks, and Natural Language Processing (NLP) for projects; or, experience with database design and development

  • Experience with preparing data for analysis, visualization, and other procedures

  • Demonstrated ability to work independently and as a team member to solve problems

  • Excellent oral and written communication skills

  • Strong interpersonal and organizational skills

  • Excellent analytical skills

  • Willingness to learn new programming languages, statistical analysis tools or other relevant tools as needed


Preferred Qualifications:



  • Experience with data tools and services, including HPC, in a research library or academic/research setting

  • Demonstrated ability to initiate, plan, coordinate, implement, and assess complex programs, projects, and services.

  • Professional experience working with research data and/or in an academic library

  • Demonstrated knowledge of data management, curation, and preservation principles and practices

  • Demonstrated knowledge of open data, data repositories, and the data life cycle


Department Contact for Recruitment Inquiries:

Lora Leligdon, Head of Research Data Services


Department Contact Phone Number:

603-646-3845


Department Contact for Cover Letter and Title:

Lora Leligdon, Head of Research Data Services


Department Contact's Phone Number:

603-646-3845


Equal Opportunity Employer:

Dartmouth College is an equal opportunity/affirmative action employer with a strong commitment to diversity and inclusion. We prohibit discrimination on the basis of race, color, religion, sex, age, national origin, sexual orientation, gender identity or expression, disability, veteran status, marital status, or any other legally protected status. Applications by members of all underrepresented groups are encouraged.


Background Check:

Employment in this position is contingent upon consent to and successful completion of a pre-employment background check, which may include a criminal background check, reference checks, verification of work history, conduct review, and verification of any required academic credentials, licenses, and/or certifications, with results acceptable to Dartmouth College. A criminal conviction will not automatically disqualify an applicant from employment. Background check information will be used in a confidential, non-discriminatory manner consistent with state and federal law.


Is driving a vehicle (e.g. Dartmouth vehicle or off road vehicle, rental car, personal car) an essential function of this job?:

Not an essential function


Special Instructions to Applicants:

Dartmouth College has a Tobacco-Free Policy. Smoking and the use of tobacco-based products (including smokeless tobacco) are prohibited in all facilities, grounds, vehicles or other areas owned, operated or occupied by Dartmouth College with no exceptions. For details, please see our policy.
https://policies.dartmouth.edu/policy/tobacco-free-policy



Quick Link:


https://searchjobs.dartmouth.edu/postings/77026


Description:

Works with researchers, staff, and students to refine the collection and curation of corpus documents to ensure datasets are suitable for artificial intelligence and related computational techniques. Designs database architectures for storing documents and the vector databases that will hold document embeddings. While ensuring database scalability, reliability, and performance optimization, monitors the system's performance and optimizes queries to ensure quick retrieval times and high relevance of retrieved documents. Regularly updates the database with new entries and re-indexes as needed.


Percentage Of Time:

30%


Description:

Assists researchers, staff and students in the development and application of document preprocessing pipelines to clean and prepare text data for embedding. Automates transcription processing where necessary, including language detection, segmentation, and annotation.Collaborate with librarians to properly handle metadata and maintain data integrity.


Percentage Of Time:

20%


Description:

Utilizes machine learning models to generate embeddings from preprocessed text data. Indexes embeddings efficiently within the vector database for fast retrieval. Analyzes retrieval accuracy and optimizes the system by applying query transformations and result reranking techniques.


Percentage Of Time:

20%


Description:

Provides instruction, outreach, and consultations on advanced computing concepts for faculty, students, and staff to expand computational research skills (including data discovery, curation, management, storage, analysis, visualization, and preservation) as needed for curricular or research projects.


Percentage Of Time:

10%


Description:

Collaborates with Library Research Data colleagues and Information Technology & Consulting Colleagues to integrate databases effectively with campus AI infrastructure and large language models, and to fine-tune the models based on the data structure and requirements.


Percentage Of Time:

10%


Description:

Engages in focused professional development activities and serves on applicable Dartmouth committees and task forces, with an emphasis on data science techniques, generative artificial intelligence, and ethical applications of novel technologies. Recommends and facilitates improvements to existing programs and services, and participates in internal training and professional development for Dartmouth Library and related staff.


Percentage Of Time:

10%


--:

Demonstrates a commitment to diversity, inclusion, and cultural awareness through actions, interactions, and communications with others.


--:

Performs other duties as assigned.


More jobs in Hanover, New Hampshire


Dartmouth College

Dartmouth College

Dartmouth College
More jobs in Other


Hilton

Fox Hollow Post Acute

Kern River Transitional Care