Objective: This project is centered around the development of a data management system, tailored to handle extensive datasets with an emphasis on optimizing data for future applications in e-commerce and AI.

Current Dataset: Our current dataset is substantial, encompassing 32GB of uncompressed data in JSON format. This dataset is a rich collection of product information, including codes, names, and detailed descriptions. Plase see sample data provided

Primary Goals:

Database Development and Migration: The core of this project involves designing and constructing a scalable database using Google BigQuery. This includes migrating our existing dataset into this new environment, ensuring efficiency and data integrity.
User Interface Design: We are committed to creating an intuitive and interactive user interface. This interface will allow end-users to access, view, and modify the dataset as needed, facilitating a dynamic user experience.

Future Applications of the Dataset:
While the project focuses on database and UI development, it is essential for the engineering team to understand the broader context and future uses of this data. Please include in your proposal solutions for number 1 and 2, but it is not required for the main purpose of this project.

-E-commerce Integration: Post-project, the data will be used to enrich our Shopify store. By utilizing Shopify’s API, selected products from the dataset will be uploaded to our store.
-Large Language Model Training: The dataset will serve as a foundation for training a Large Language Model (LLM) focused on product recognition, driving forward our capabilities in AI and machine learning.
-Data Query and Utilization: We intend to leverage the dataset for internal systems, using queries to extract and populate relevant product information, thereby enhancing operational efficiency and decision-making.

Posted On: January 23, 2024 20:28 UTC
Category: Full Stack Development
Skills:AI Development, Web Application

Country: Chile

click to apply

Powered by WPeMatico