Software Engineer - AI
Apply NowCompany: Super Micro Computer, Inc
Location: San Jose, CA 95123
Description:
Job Req ID: 26294
About Supermicro:
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.
Job Summary:
Supermicro is seeking an experienced and exceptional Application Software Engineer to work on web-based applications for business process automation. This is a key role that will give you the opportunity to expand your existing knowledge in programming.
Essential Duties and Responsibilities:
Includes the following essential duties and responsibilities (other duties may also be assigned)
Integrating open-source LLMs (e.g., Llama 3.2 90B) with open-source vector databases, search indexing, and contextual query management
Design and implement Retrieval-Augmented Generation (RAG) pipelines, incorporating embedding generation, vector search, re-ranking, and contextual retrieval techniques
Optimize search and retrieval systems using Elasticsearch and vector databases
Develop and deploy an intelligent AI Agent to assist customers in selecting and purchasing the correct servers based on their unique requirements and use cases
Integrate AI Agents with backend databases, recommendation engines, and decision-making pipelines
Design workflows for task automation, contextual reasoning, and real-time recommendations
Design scalable web scraping pipelines using tools like Scrapy, Selenium, and BeautifulSoup to acquire structured and unstructured data
Process and clean scraped data to integrate it seamlessly into databases and knowledge retrieval systems
Design and manage relational databases (include., PostgreSQL, MySQL, MS SQL) for structured data storage and retrieval
Work with document-based databases (e.g., MongoDB) for handling unstructured data sources
Optimize database queries and structures to ensure efficient system performance
Design, test, and optimize prompts for large language models (LLMs) to improve response accuracy, context management, and task completion
Experiment with prompt tuning and contextual input adjustments to enhance LLM performance in specific use cases
Extract, clean, and preprocess data from various sources, including relational databases, document databases, PDFs, and images
Write code for parsing and processing non-text data formats
Develop Python-based web services using popular framework to enable backend APIs and real-time interactions
Create interactive dashboards for data visualization and system control using Streamlit
Collaborate with frontend developers to ensure seamless integration between APIs and user-facing interfaces
Deploy system components in Linux environments using Docker for scalability and portability
Optimize system performance for GPU-intensive tasks, ensuring efficient resource utilization
Identify common user queries, challenges, and areas for improvement
Test the system regularly from the user's perspective to validate its performance and accuracy.
Analyze user feedback and satisfaction, iterating on system design, prompts, and workflows to improve response quality and relevance
Collaborate with cross-functional teams to implement enhancements based on user behavior and feedback trends
Handle intricate, repetitive, or time-consuming tasks, such as dataset cleaning, normalization, and troubleshooting
Ensure data accuracy and reliability, understanding that these foundational tasks are critical for system success
Qualifications:
BS or above in Computer/Information Science or other relevant degree
Minimum 5 years of working experience in software development preferred
Programming Skills in C#, SQL, Java, JavaScript, AJAX
C# ASP.NET project experience is a plus
Salary Range
$147,000 - $168,000
The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.
EEO Statement
Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
About Supermicro:
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.
Job Summary:
Supermicro is seeking an experienced and exceptional Application Software Engineer to work on web-based applications for business process automation. This is a key role that will give you the opportunity to expand your existing knowledge in programming.
Essential Duties and Responsibilities:
Includes the following essential duties and responsibilities (other duties may also be assigned)
Integrating open-source LLMs (e.g., Llama 3.2 90B) with open-source vector databases, search indexing, and contextual query management
Design and implement Retrieval-Augmented Generation (RAG) pipelines, incorporating embedding generation, vector search, re-ranking, and contextual retrieval techniques
Optimize search and retrieval systems using Elasticsearch and vector databases
Develop and deploy an intelligent AI Agent to assist customers in selecting and purchasing the correct servers based on their unique requirements and use cases
Integrate AI Agents with backend databases, recommendation engines, and decision-making pipelines
Design workflows for task automation, contextual reasoning, and real-time recommendations
Design scalable web scraping pipelines using tools like Scrapy, Selenium, and BeautifulSoup to acquire structured and unstructured data
Process and clean scraped data to integrate it seamlessly into databases and knowledge retrieval systems
Design and manage relational databases (include., PostgreSQL, MySQL, MS SQL) for structured data storage and retrieval
Work with document-based databases (e.g., MongoDB) for handling unstructured data sources
Optimize database queries and structures to ensure efficient system performance
Design, test, and optimize prompts for large language models (LLMs) to improve response accuracy, context management, and task completion
Experiment with prompt tuning and contextual input adjustments to enhance LLM performance in specific use cases
Extract, clean, and preprocess data from various sources, including relational databases, document databases, PDFs, and images
Write code for parsing and processing non-text data formats
Develop Python-based web services using popular framework to enable backend APIs and real-time interactions
Create interactive dashboards for data visualization and system control using Streamlit
Collaborate with frontend developers to ensure seamless integration between APIs and user-facing interfaces
Deploy system components in Linux environments using Docker for scalability and portability
Optimize system performance for GPU-intensive tasks, ensuring efficient resource utilization
Identify common user queries, challenges, and areas for improvement
Test the system regularly from the user's perspective to validate its performance and accuracy.
Analyze user feedback and satisfaction, iterating on system design, prompts, and workflows to improve response quality and relevance
Collaborate with cross-functional teams to implement enhancements based on user behavior and feedback trends
Handle intricate, repetitive, or time-consuming tasks, such as dataset cleaning, normalization, and troubleshooting
Ensure data accuracy and reliability, understanding that these foundational tasks are critical for system success
Qualifications:
BS or above in Computer/Information Science or other relevant degree
Minimum 5 years of working experience in software development preferred
Programming Skills in C#, SQL, Java, JavaScript, AJAX
C# ASP.NET project experience is a plus
Salary Range
$147,000 - $168,000
The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.
EEO Statement
Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.