Google launches beta version of Cloud AI Platform Pipelines to reduce time in ML developments

  • Google Cloud announced Beta version of cloud AI Platform Pipelines to support easy to install secure execution environments for machine learning workflows.

  • The service is designed to deploy a robust, repeatable AI pipeline in the cloud along with other features such as monitoring, auditing, version tracking, and reproducibility.

  • The newly launched feature is going to be beneficial that will help companies reduce the time it takes to put a product into production.


When you're just prototyping a machine learning (ML) model in a notebook, it can seem fairly straightforward. But when you need to start paying attention to the other pieces required to make an ML workflow sustainable and scalable, things become more complex. A machine learning workflow can involve many steps with dependencies on each other, from data preparation and analysis to training to evaluation to deployment, and more. It’s hard to compose and track these processes in an ad-hoc manner—for example, in a set of notebooks or scripts—and things like auditing and reproducibility become increasingly problematic.
 

Recently, Google announced the beta launch of Cloud AI Platform Pipelines, a service designed to deploy robust, repeatable AI Pipelines along with monitoring, auditing, version tracking, and reproducibility in the cloud. Google’s pitching it as a way to deliver an “easy to install” secure execution environment for machine learning workflows, which could reduce the number of time enterprises spends bringing products to production.

“When you’re just prototyping a machine learning model in a notebook, it can seem fairly straightforward. But when you need to start paying attention to the other pieces required to make a [machine learning] workflow sustainable and scalable, things become more complex.”

Anusha Ramesh, Product Manager, Google



AI Platform Pipelines has two major parts.

1) The infrastructure for deploying and running structured AI workflows that are integrated with Google Cloud Platform services and

2) The pipeline tools for building, debugging and sharing pipelines and components.


The service runs on a Google Kubernetes cluster that is automatically created during the installation process and is accessible through the Cloud AI Platform dashboard. With AI Platform Pipelines, developers can specify pipelines using the Kubeflow Pipelines Software Development Kit (SDK) or by using the TFX SDK to customize the TensorFlow Extended (TFX) Pipeline template. This SDK compiles the pipeline and submits it to the Pipelines REST API server, which stores and plans to execute the pipeline.
 

Learn more: Many new security features, services added to Google Cloud
��

AI Pipeline uses the open source Argo workflow engine to run pipelines and has other microservices to record metadata, process component IO, and plan pipeline runs. The pipeline steps are performed as independent Pods in the cluster, and each component can utilize Google Cloud services, such as data streaming, AI platform training and forecasting, BigQuery, etc. At the same time, the pipeline can include the steps of performing graphics card and tensor processing unit calculations directly in the cluster, directly using functions such as automatic scaling and automatic node settings.
 

AI Platform Pipeline runs include automatic metadata tracking using ML Metadata, a library for recording and retrieving metadata related to machine learning developer and data scientist workflows. Automatic metadata tracks the artifacts used in each pipeline step, the pipeline parameters, and the links between input/output artifacts, as well as the pipeline steps that create and use them. In a news, Google starts beta evaluation of new AI developer tools to help developers more easily integrate AI capabilities in apps.
 

Also, AI Platform Pipelines supports pipeline versioning, which enables developers to upload multiple versions of the same pipeline and group them in the UI, as well as automatic artifact and lineage tracking. Native workpiece tracking can track models, data statistics, model evaluation indicators, and more. Lineage tracking shows history and versions of models, data, etc.

“A machine learning workflow can involve many steps with dependencies on each other, from data preparation and analysis to training, to evaluation, to deployment, and more. It’s hard to compose and track these processes in an ad-hoc manner — for example, in a set of notebooks or scripts — and things like auditing and reproducibility become increasingly problematic.”


Learn more: Google Cloud chooses DataStax for the database as a service offering

 

Google said that shortly, AI platform pipelines will gain multi-user isolation, which will allow everyone who accesses the pipeline cluster to control who can access their pipelines and other resources. Here is an article where it has explained why is storage on Kubernetes is hard. Other upcoming features include workload identification to support transparent access to Google Cloud Services; UI-based settings for back-end data storage outside the cluster, including metadata, server data, job history and metrics; simpler cluster upgrades; And more templates for authoring workflows.
 

What’s next?


Google Cloud has some new Pipelines features coming soon, including support for:

  • Multi-user isolation, so that each person accessing the Pipelines cluster can control who can access their pipelines and other resources

  • Workload identity, to support transparent access to GCP services

  • Easy, UI-based setup of off-cluster storage of backend data—including metadata, server data, job history, and metrics—for larger-scale deployments and so that it can persist after cluster shutdown

  • Easy cluster upgrades

  • More templates for authoring ML workflows

Spotlight

Other News
AI Tech

AI and Big Data Expo North America announces leading Speaker Lineup

TechEx Events | March 07, 2024

AI and Big Data Expo North America announces new speakers! SANTA CLARA, CALIFORNIA, UNITED STATES, February 26, 2024 /EINPresswire.com/ -- TheAI and Big Expo North America, the leading event for Enterprise AI, Machine Learning, Security, Ethical AI, Deep Learning, Data Ecosystems, and NLP, has announced a fresh cohort of distinguishedspeakersfor its upcoming conference at the Santa Clara Convention Center on June 5-6, 2024. Some of the top industry speakers set to take the stage are: - Sam Hamilton - Head of Data & AI – Visa - Dr Astha Purohit - Director - Product (Tech) Ops – Walmart - Noorddin Taj - Head of Architecture and Design of Intelligent Operations - BP - Temi Odesanya - Director - AI Governance Automation - Thomson Reuters - Katie Sanders - Assistant Vice President – Tech - Union Pacific Railroad - Prasanth Nandanuru – SVP - Wells Fargo - Rodney Brooks - Professor Emeritus - MIT These esteemed speakers bring a wealth of knowledge and expertise to an already impressive lineup, promising attendees a truly enlightening experience. In addition to the speakers, theAI and Big Data Expo North Americawill feature a series of presentations covering a diverse range of topics in AI and Big Data exploring the latest innovations, implementations and strategies across a range of industries. Attendees can expect to gain valuable insights and practical strategies from presentations such as: How Gen AI Positively Augments Workforce Capabilities Trends in Computer Vision: Applications, Datasets, and Models Getting to Production-Ready: Challenges and Best Practices for Deploying AI Ensuring Your AI is Responsible and Ethical Mitigating Bias and Promoting Fairness in AI Systems Security Challenges in the Era of Gen AI and Data Science AI for Good: Social Impact and Ethics Selling Data Democratization to Executives Spreading Data Insights across the Business Barriers to Overcome: People, Processes, and Technology Optimizing the Customer Experience with AI Using AI to Drive Growth in a Regulated Industry Building an MLOps Foundation for AI at Scale The Expo offers a platform for exploration and discovery, showcasing how cutting-edge technologies are reshaping a myriad of industries, including manufacturing, transport, supply chain, government, legal sectors, financial services, energy, utilities, insurance, healthcare, retail, and more. Attendees will have the chance to witness firsthand the transformative power of AI and Big Data across various sectors, gaining insights that are crucial for staying ahead in today's rapidly evolving technological landscape. Anticipating a turnout of over 7000 attendees and featuring 200 speakers across various tracks, AI and Big Data Expo North America offers a unique opportunity for CTO’s, CDO’s, CIO’s , Heads of IOT, AI /ML, IT Directors and tech enthusiasts to stay abreast of the latest trends and innovations in AI, Big Data and related technologies. Organized by TechEx Events, the conference will also feature six co-located events, including the IoT Tech Expo, Intelligent Automation Conference, Cyber Security & Cloud Congress, Digital Transformation Week, and Edge Computing Expo, ensuring a comprehensive exploration of the technological landscape. Attendees can choose from various ticket options, providing access to engaging sessions, the bustling expo floor, premium tracks featuring industry leaders, a VIP networking party, and a sophisticated networking app facilitating connections ahead of the event. Secure your ticket with a 25% discount on tickets, available until March 31st, 2024. Save up to $300 on your ticket and be part of the conversation shaping the future of AI and Big Data technologies. For more information and to secure your place at AI and Big Data Expo North America, please visit https://www.ai-expo.net/northamerica/. About AI and Big Data Expo North America: The AI and Big Data Expo North America is a leading event in the AI and Big Data landscape, serving as a nexus for professionals, industry experts, and enthusiasts to explore and navigate the ever-evolving technological frontier. Through its focus on education, networking, and collaboration, the Expo continues to be a beacon for those eager to stay at the forefront of technological innovation. “AI and Big Data Expo North Americais a part ofTechEx. For more information regardingTechExplease see onlinehere.”

Read More