Data Engineer, OneDegree AI
薪資範圍:700,000 ~ 1,200,000 TWD / year
We are seeking a skilled and motivated individual to join our team as a Data Engineer. The successful candidate will assist in research, support model training, and optimize machine learning solutions. The role requires organizing relevant data, analyzing test results, and fine-tuning models. Additionally, the candidate will be responsible for analyzing and interpreting large datasets to support our B2B AI large language model (LLM) Verification solution.
OneDegree Tech Blog: https://medium.com/onedegree-tech-blog
-
How to apply
Please apply this position through 👉 https://grnh.se/3d78328e4us
It will help us process your applications faster
*Please apply by English CV, thank you.
-
Responsibilities
- Assist in Research and Support Model Training:
- Provide technical support during the model training phase, ensuring the process runs smoothly and efficiently.
- Conduct literature reviews and stay updated with the latest advancements in machine learning and AI.
- Evaluate and Optimize Machine Learning Solutions:
- Perform thorough evaluations of machine learning models to assess their performance and accuracy.
- Implement optimization techniques to enhance model efficiency and effectiveness.
- Organize Relevant Data Based on Requirements and Optimize Related Models:
- Collect and organize datasets according to project requirements, ensuring data quality and integrity.
- Continuously optimize models based on data insights and project needs.
- Analyze Test Results and Fine-Tune Models:
- Conduct detailed analyses of test results to identify patterns, anomalies, and areas for improvement.
- Conduct detailed analyses of test results to identify patterns, anomalies, and areas for improvement.
- Analyze and Interpret Large Datasets:
- Work with large volumes of data to extract meaningful insights that inform model development.
-
- Proficient in Data Warehousing, Data Preprocessing Concepts, Data Cleaning, Data Preparation, Tokenization, and Related Data.
- Skilled in tokenization and transforming raw data into structured formats suitable for machine learning models.
- Proficient in Python (including numpy, scipy, pandas) .
- Experience with SQL and Database Design (e.g., SQL/NoSQL/Vector DB).
- Understanding of database design principles, including both SQL and NoSQL databases.
- Familiarity with vector databases and their application in AI and machine learning contexts.
- Experience with OpenAI, Langchain, and RAG Architecture:
- Hands-on experience with OpenAI technologies and integrating them into machine learning workflows.
- Knowledge of Langchain and RAG (Retrieval-Augmented Generation) architecture, and their implementation in practical projects.
- Demonstrated ability to analyze large datasets, using statistical and machine learning techniques to derive insights.
- Strong Team Collaboration and Self-Learning Abilities:
- Proven ability to work effectively in a team environment, collaborating with colleagues to achieve common goals.
- Self-motivated with a strong desire to continuously learn and stay updated with the latest industry trends and technologies.
-
Plus
- Familiarity with Machine Learning Frameworks (e.g., Keras, TensorFlow, PyTorch) is a Plus:
- Experience using popular machine learning frameworks such as Keras, TensorFlow, or PyTorch.
- Experience using popular machine learning frameworks such as Keras, TensorFlow, or PyTorch.
- Familiarity with NLU/NLG/NLP Architectures (e.g., BERT, Transformers) is a Plus:
- Knowledge of Natural Language Understanding (NLU), Natural Language Generation (NLG), and Natural Language Processing (NLP) architectures.
- Understanding of Java programming language and its application in data engineering and machine learning projects.
-
公司地址:
台北市信義區四段460號7樓其他:
Phone interview: 1 hour meet with HROnsite Interview: 1-2 hours1-1.5 hours meet with our team and hiring managers0.5 hour meet with HR-2024-11-19