See all roles

Data Annotation Engineer

Work from home Full-time role Hiring

Veryfi is a YC-funded Silicon Valley startup that uses AI to understand documents like receipts and invoices. As a Data Engineer at Veryfi, you'll contribute to the evolution of our training data infrastructure and the development of new features and projects. You'll gather, process, and analyze diverse datasets to generate high-quality training data for our machine-learning models. Furthermore, by delving deep into our system, you'll have the autonomy to identify challenges and opportunities, taking ownership of developing solutions to refine existing tools and algorithms. Key Responsibilities:

  • Gather, process, and analyze diverse datasets to generate training data that fuels the development of our ML projects.
  • Expand and optimize the training data pipelines to improve the speed and accuracy of our processes.
  • Collaborate with a cross-functional team to define requirements and prioritize development efforts.

Essential Skills:

  • Proficient in Python programming for data handling and processing, with experience in utilizing data science tools such as Pandas, NumPy, SciPy, and others.
  • Strong analytical thinking with a focus on delivering results.
  • Meticulous attention to detail, ensuring accuracy and precision in all data handling and processing tasks.
  • Enthusiastic about learning and adapting to new technologies and methodologies, particularly in the realm of Machine Learning (ML).
  • Innovation mindset, adept at challenging existing processes and driving positive change.

Preferred Qualifications:

  • Familiarity with regex development, software engineering principles, and Linux command line tools.
  • Experience with Natural Language Processing (NLP) techniques and libraries, including the use of Large - - -- - Language Models (LLMs) and supervised learning for document data extraction.
  • Effective organizational abilities, capable of managing projects independently from inception to completion.
  • Exceptional verbal and written communication skills, effectively communicating problems, proposed solutions, and results to stakeholders in a multicultural environment.

A Bachelor's degree in computer science, engineering, or a related field. Postgraduate studies are a plus but not required. Keywords: NLP, Patterns Detection, Data Labeling, Software Development, Data Engineering. Apply tot his job Apply To this Job

You might like

Software Engineer, iOS/Mobile - Electronic Flight Bag (EFB)

Work from home Full-time role

Customer Success Manager – Credit & Consumer Finance SaaS (Remote)

Work from home Full-time role

Performance Marketing Manager

Work from home Full-time role

Senior Value Consultant

Work from home Full-time role

Account Executive – SaaS Sales

Work from home Full-time role

Sales Executive – SaaS (USA - Remote)

Work from home Full-time role

Venture Deal Flow Scout

Work from home Full-time role

Founding SDR

Work from home Full-time role

Patient Scheduling Associate

Work from home Full-time role

Co-Founder / Founding Operator (Stealth Startup) – Menlo Park, California

Work from home Full-time role

Experienced Data Entry Clerk Wanted - Remote Work Opportunity with arenaflex

Work from home Full-time role

Experienced Customer Service Representative – Amazon Live Chat Support From Home Opportunity at arenaflex

Work from home Full-time role

Remote Member Relations Associate

Work from home Full-time role

Experienced Customer Service Representatives - Live Chat Support for arenaflex

Work from home Full-time role

Senior Internal Auditor, Operations

Work from home Full-time role

Senior Software Engineer

Work from home Full-time role

Experienced Data Entry Clerk/Data Entry IV – Sensitive Information Handling and Case Management

Work from home Full-time role

RN Surgical Oncology Weekend Only

Work from home Full-time role

Business Process Leader, Finance

Work from home Full-time role

Remote Junior Data Entry Clerk (Part-Time) - Launch Your Career with arenaflex

Work from home Full-time role