Current Dataset: Our current dataset is substantial, encompassing 32GB of uncompressed data in JSON format. This dataset is a rich collection of product information, including codes, names, and detailed descriptions. Plase see sample data provided
Primary Goals:
Database Development and Migration: The core of this project involves designing and constructing a scalable database using Google BigQuery. This includes migrating our existing dataset into this new environment, ensuring efficiency and data integrity.
User Interface Design: We are committed to creating an intuitive and interactive user interface. This interface will allow end-users to access, view, and modify the dataset as needed, facilitating a dynamic user experience.
Future Applications of the Dataset:
While the project focuses on database and UI development, it is essential for the engineering team to understand the broader context and future uses of this data. Please include in your proposal solutions for number 1 and 2, but it is not required for the main purpose of this project.
-E-commerce Integration: Post-project, the data will be used to enrich our Shopify store. By utilizing Shopify’s API, selected products from the dataset will be uploaded to our store.
-Large Language Model Training: The dataset will serve as a foundation for training a Large Language Model (LLM) focused on product recognition, driving forward our capabilities in AI and machine learning.
-Data Query and Utilization: We intend to leverage the dataset for internal systems, using queries to extract and populate relevant product information, thereby enhancing operational efficiency and decision-making.
Posted On: January 23, 2024 20:28 UTC
Category: Full Stack Development
Skills:AI Development, Web Application
Country: Chile
click to apply
Powered by WPeMatico
