# AI as a Superpower: LAION and the Role of Open Source in Artificial Intelligence

https://mlconference.ai/blog/ai-as-a-superpower-laion-and-the-role-of-open-source-in-artificial-intelligence/ (published Wed, 21 Jun 2023)

In early March of this year, we had the pleasure of talking with Christoph Schuhmann, co-founder of the open-source AI organization LAION. We spoke with him about the organization's founding, the datasets and models it has produced, and the future of open-source AI development.

**devmio: Hello, Christoph! Could you tell us what LAION is and what role you play there?**

 

**Christoph Schuhmann:** LAION stands for Large-Scale Artificial Intelligence Open Network. First and foremost, it’s simply a huge community of people who share the dream of open-source AI models, research, and datasets. That’s what connects us all. We have a [Discord server](https://discord.com/invite/xBPBXfcFHd) where anyone can come in and share a bit about the latest research in the field. You can also propose a new project and find people to work on it with you. And if you ask the mods, me, or other people, you might even get a channel for your project. That’s basically the core.

 

When we had such surprising success with our first dataset called [LAION-400M](https://laion.ai/blog/laion-400-open-dataset/), we set up a small non-profit association that doesn’t actually do anything. We have a bank account with a bit of money coming into it from a few companies that support us. That’s primarily Hugging Face, but also StabilityAI, although we’re mostly supported not by money but by cloud compute.

 

StabilityAI, for example, has a huge cluster with 4,000, or by now 5,600, GPUs, and there we, or members approved by the core team, can use preemptible GPUs, that is, whatever is idle and not being used at the moment.

 

**devmio: So we can just come to you and contribute? Propose our ideas and ask for help with our projects or help with ongoing projects?**

 

**Christoph Schuhmann:** Exactly! You can now come to our Discord server and say that you want to contribute to a project or help us with PR or whatever. You are most welcome!

 

**devmio: Is LAION based in Germany? And you are the chairman and co-founder?**

 

**Christoph Schuhmann:** Exactly. I am a physics and computer science teacher who has been regularly involved with machine learning, and I also have a background in reform-oriented education. Seven or eight years ago, I made a Kickstarter-funded documentary about schools where you can learn without grades or a curriculum. After that took off, I did tutorials on how to start such an independent school. So I knew how to set up a grassroots non-profit organization. I am not paid for my work at LAION.

 

## The Beginnings of LAION

 

**devmio: How did LAION come to life? How did you get to know the other members?**

 

**Christoph Schuhmann:** I actually started LAION after reading a lot about deep learning and machine learning and doing online courses in my spare time over the last five to six years. When the first version of DALL-E was published at the beginning of 2021, I was totally shocked by how good it was. At that time, however, many non-computer scientists didn’t find it that impressive.

 

I then asked on a few machine learning Discord servers what we would need to replicate something similar and make it open source. There was a well-known open-source programmer at the time called Philip Wang (his alias on GitHub is lucidrains) who is a legend in the community because, whenever a new paper comes out, he has the associated codebase implemented within a few days. He also built an implementation of the first version of DALL-E in PyTorch called [DALLE-pytorch](https://github.com/lucidrains/DALLE-pytorch). This model was then trained by a few people on Discord using small datasets, and that was the proof of concept.

 

But the data was missing, and I suggested going to [Common Crawl](https://commoncrawl.org/), a non-profit from Seattle that scrapes HTML code from the internet every two to three months and makes it available. A snapshot, so to speak, of the HTML code of every conceivable website, which comes to roughly 250 terabytes of compressed data. I then suggested downloading a gigabyte as a test and wrote a script that extracts image tags together with their alt tags and then uses the CLIP model to see how well they fit together.
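As a rough sketch of what such a filtering script can look like, the snippet below pulls image/alt-text pairs out of a page and scores them with a small, publicly available CLIP checkpoint; the checkpoint choice, the 0.3 threshold, and the example URL are illustrative assumptions, not LAION's actual pipeline.

```python
# Sketch: extract <img> tags with alt text from HTML and score image/text fit with CLIP.
# Assumes requests, beautifulsoup4, Pillow, torch, and transformers are installed.
import requests
import torch
from bs4 import BeautifulSoup
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def candidate_pairs(html: str):
    """Yield (image_url, alt_text) pairs found in the HTML (naively assumes absolute URLs)."""
    soup = BeautifulSoup(html, "html.parser")
    for img in soup.find_all("img"):
        url, alt = img.get("src"), (img.get("alt") or "").strip()
        if url and alt:
            yield url, alt

def clip_score(image: Image.Image, text: str) -> float:
    """Cosine similarity between the CLIP image and text embeddings."""
    inputs = processor(text=[text], images=image, return_tensors="pt",
                       padding=True, truncation=True)
    with torch.no_grad():
        out = model(**inputs)
    img_emb = out.image_embeds / out.image_embeds.norm(dim=-1, keepdim=True)
    txt_emb = out.text_embeds / out.text_embeds.norm(dim=-1, keepdim=True)
    return float((img_emb @ txt_emb.T).item())

html = requests.get("https://example.com", timeout=10).text   # hypothetical page
for url, alt in candidate_pairs(html):
    image = Image.open(requests.get(url, stream=True, timeout=10).raw)
    if clip_score(image, alt) > 0.3:   # illustrative threshold; keep only well-matched pairs
        print(url, "|", alt)
```

The real pipeline ran this kind of extraction and scoring over Common Crawl dumps in a massively distributed way, as Christoph describes next.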

 

Then two “machine learning nerds”, who were much better at it than I was at the time, implemented it efficiently but didn’t finish it. That was a shame, but they were developing the GPT open-source variant [GPT-J](https://huggingface.co/docs/transformers/model_doc/gptj) and therefore didn’t have the time.

 

Then in the spring of 2021, I sat down and just wrote a huge pile of spaghetti code in a Google Colab and asked around on Discord who wanted to help me with it. Someone got in touch who, it later turned out, was only 15 at the time. And he wrote a tracker, basically a server that manages lots of Colabs, each of which gets a small job, extracts a gigabyte, and then uploads the results. At that time, the first version was still using Google Drive.

 

## The Road to the LAION-400M Dataset

 

It was a complete disaster because Google Drive wasn’t suitable for it, but it was the easiest thing we could do quickly. Then I looked for some people on a Discord server, made some more accounts, and then we ended up with 50 Google Colabs working all the time.

 

But it worked, and then, within a few weeks, we had filtered 3 million image-text pairs, which at the time was more than Google’s [Conceptual Captions](https://ai.google.com/research/ConceptualCaptions/), a very well-known dataset of 2019. That little success got us so much attention on the Discord server that people just started supporting us and writing things like, “I have 50 little virtual machines here from my work, you could use them, I don’t need them right now,” or “I have another 3090 lying around here with me, I can share it with you.”

 

After three months, we had 413 million filtered image-text pairs. That was our LAION-400M dataset. At the time, it was by far the largest image-text dataset freely available, over 30 times larger than [Google’s Conceptual Caption 12M](https://github.com/google-research-datasets/conceptual-12m), with about 12 million pairs.

 

We then did a [blog post about our dataset](https://laion.ai/blog/laion-400-open-dataset/), and after less than an hour, I already had an email from the Hugging Face people wanting to support us. I had then posted on the Discord server that if we had $5,000, we could probably create a billion image-text pairs. Shortly after, someone already agreed to pay that: “If it’s so little, I’ll pay it.” At some point, it turned out that the person had his own startup in text-to-image generation, and later he became the chief engineer of Midjourney.

 

As you can see, it was simply a huge community, just 100 people who only knew each other from chat groups by their aliases. At some point, I suggested creating an association, with a bank account and so on. That's how LAION was founded.

 

## Even Bigger: LAION-5B and LAION-Aesthetics

 

We then also got some financial support from Hugging Face and started working on LAION-5B, a dataset containing five billion image-text pairs. By the end of 2021, we were done with just under 70 percent of it, and then we were approached by someone who wanted to create a start-up that was like OpenAI but really open source. He offered to support us with GPUs from AWS. This was someone who introduced himself as a former investment banker or hedge fund manager, which I didn't quite believe at first; after all, it was just some guy from Discord. But then the access credentials for the first pods arrived, and it turned out that the guy was Emad Mostaque, the founder of StabilityAI.

 

**devmio: What is the relationship between LAION and Stability AI?**

 

**Christoph Schuhmann:** Contrary to what some AI-art critics claim, we are not a satellite organisation of Stability AI. On the contrary, Stability AI came to us after the LAION-5B dataset was almost finished and wanted to support us unconditionally. They then did the same with LAION-Aesthetics.

 

**devmio: Could you explain what LAION-Aesthetics is?**

 

**Christoph Schuhmann:** I trained a model that uses the CLIP embeddings of the LAION images to estimate how pretty the images are on a scale of one to ten. It's a very small model, a multilayer perceptron running on a CPU. At some point, I ran the model over a couple of hundred thousand images, sorted them, and thought that the ones with the high scores looked really good. The next step was to run it on 2.3 billion CLIP embeddings.
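A minimal sketch of such a scorer is shown below; the embedding dimension, layer sizes, and training loop are assumptions for illustration, not the exact LAION-Aesthetics model.

```python
# Sketch: a small multilayer perceptron that maps CLIP image embeddings to an
# aesthetic score between 1 and 10. Placeholder data stands in for real human
# ratings; the architecture details are assumptions.
import torch
import torch.nn as nn

class AestheticScorer(nn.Module):
    def __init__(self, embed_dim: int = 768):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(embed_dim, 256), nn.ReLU(),
            nn.Linear(256, 64), nn.ReLU(),
            nn.Linear(64, 1),           # regression head: predicted score
        )

    def forward(self, clip_embeddings: torch.Tensor) -> torch.Tensor:
        return self.mlp(clip_embeddings).squeeze(-1)

model = AestheticScorer()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

# embeddings: (N, 768) CLIP image embeddings; ratings: (N,) human scores in [1, 10]
embeddings, ratings = torch.randn(1024, 768), torch.rand(1024) * 9 + 1  # placeholders

for epoch in range(10):
    optimizer.zero_grad()
    loss = loss_fn(model(embeddings), ratings)
    loss.backward()
    optimizer.step()

# Rank a new batch of images by predicted prettiness, highest first.
scores = model(torch.randn(5, 768)).detach()
print(scores.sort(descending=True).values)
```

Because the model is tiny and works on precomputed embeddings, scoring billions of images becomes comparatively cheap.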

 

## From LAION-Aesthetics to Stable Diffusion

 

**devmio: How did LAION-Aesthetics help with the development of Stable Diffusion?**

 

**Christoph Schuhmann:** I had already heard about Robin Rombach, who was still a student in Heidelberg at the time and had helped develop latent diffusion models at the CompVis Group. Emad Mostaque, the founder of StabilityAI, told me in May 2022 that he would like to support Robin Rombach with compute time, and that’s how I got in touch with Robin.

 

I then sent him the LAION-Aesthetics dataset. The dataset can be thought of as a huge Excel spreadsheet containing links to images and the associated alt text. In addition, each image is given scores, for example for whether it contains a watermark or smut. Robin and his team later trained the first prototype of Stable Diffusion on this. However, the model only got the name Stable Diffusion through Stability AI, which it then moved to.

 

LAION also got access to the Stability AI cluster. But we were also lucky enough to be able to use JUWELS, one of the largest European supercomputers, because one of our founding members, Jenia Jitsev, is a lab director for deep learning at the Jülich Supercomputing Centre. We then applied for compute time to train our own OpenCLIP models. And now we have the largest CLIP models available in open source.

 

## LAION’s OpenCLIP

 

**devmio: What exactly do CLIP models do? And what makes LAION’s OpenCLIP so special?**

 

**Christoph Schuhmann:** On the Stability AI cluster, a Ph.D. student from the University of Washington has trained a model called CLIP-ViT-G. This model can tell you how well an image matches a text, and it has managed to crack the 80 percent zero-shot accuracy mark on ImageNet. This means that we have now built a general-purpose AI model that is better than the best state-of-the-art models from five years ago that were built and trained specifically for this purpose.

 

These CLIP models are in turn used as text encoders, as “text building blocks” by Stable Diffusion and by many other models. CLIP models have an incredible number of applications. For example, they can be used for zero-shot image segmentation, zero-shot object detection with bounding boxes, zero-shot classification, or even for text-to-image generation.
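As an example of the zero-shot classification use case, here is a short sketch with the open_clip library; the model name and pretrained tag below follow the library's documentation, but treat them as assumptions and substitute whichever OpenCLIP checkpoint you have access to.

```python
# Sketch: zero-shot image classification with an OpenCLIP model.
# The ViT-B-32 / laion2b_s34b_b79k combination mirrors the open_clip docs;
# larger LAION checkpoints load the same way.
import torch
import open_clip
from PIL import Image

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k")
tokenizer = open_clip.get_tokenizer("ViT-B-32")

image = preprocess(Image.open("example.jpg")).unsqueeze(0)   # any local image
labels = ["a diagram", "a photo of a dog", "a photo of a cat"]
text = tokenizer(labels)

with torch.no_grad():
    img_feat = model.encode_image(image)
    txt_feat = model.encode_text(text)
    img_feat /= img_feat.norm(dim=-1, keepdim=True)
    txt_feat /= txt_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_feat @ txt_feat.T).softmax(dim=-1)[0]

for label, p in zip(labels, probs):
    print(f"{label}: {p:.2%}")
```

The same image and text encoders are what downstream systems such as Stable Diffusion reuse as their "text building blocks".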

 

We have trained and further developed these models. We now have a variant that not only trains these CLIP models but also generates captions through a text decoder. This model is called [CoCa](https://laion.ai/blog/coca/) and is quite close to the state of the art.

 

We have many such projects running at the same time, sometimes so many that I almost lose track of them. Currently, we cooperate with Mila, an institute of excellence from Montreal, and together we have access to the second largest supercomputer in the US, Summit. We have been given 6 million GPU hours there and are training all kinds of models.

 

**devmio: You have already talked a lot about Stable Diffusion, and Robin Rombach, the inventor, is a member of your team. Is Stable Diffusion managed by you, is that “your” model?**

 

**Christoph Schuhmann:** No, we don’t have anything to do with that for now. But we have made the development and training of Stable Diffusion easier with LAION-Aesthetics and LAION-5B.

 

## Open Source as a Superpower

 

**devmio: LAION is committed to making the latest developments in AI freely available. Why is open source so important in AI?**

 

**Christoph Schuhmann:** Let’s take the sentence: “AI should be open source so that it is available to the general public.” Now let’s take that sentence and replace “AI” with “superpowers”: “Superpowers should be open source and available to the public.” In this case, it becomes much more obvious what I’m actually getting at.

 

Imagine there were such a thing as superpowers, and only OpenAI, Microsoft, Google, maybe the Chinese and American governments, and five other companies had control over them and could decide what to do with them. Now, you could say that governments only ever want what's best for their citizens. That's debatable, of course, but let's assume that's the case. But does that also apply to Microsoft? Do they also have our best interests at heart, or does Microsoft simply want to sell its products?

 

If you have a very dark view of the world, you might say that there are a lot of bad people out there, and if everyone had superpowers now, there would certainly be 10, 20, or 30 percent of all people who would do really bad things. That’s why we have to control such things, for example through the state. But if you have a rather positive and optimistic view of the world, like me, for example, then you could say that most people are relatively nice. No angels, no do-gooders, but most people don’t want to actively do something bad, or destroy something, but simply live their lives. There are some people who are do-gooders and also people who have something bad in mind. But the latter are probably clearly in the minority.

 

If we assume that everyone has superpowers, then everyone would also have the opportunity to take action against destructive behaviour and limit its effects. In such a world, there would be a lot of positive things. Things like superpower art, superpower music, superpower computer games, and superpower productivity of companies that simply produce goods for the public. If you now ask yourself what kind of world you would like to live in and assume that you have a rather positive worldview, then you will probably decide that it would be good to make superpowers available to the general public as open source. And once you understand that, it’s very easy to understand that AI should also be open source.

 

AI is not the same as superpowers, of course, but in a world in which the internet plays an ever greater role, in which every child grows up with YouTube, in which AI is getting better and better, in which more and more autonomous systems are finding their way into our everyday lives, AI is incredibly important. Software and computerised things are a sort of superpower. And that is going to become much more pronounced, especially with ChatGPT. In three to four years, ChatGPT will be much better than it is today.

 

Now imagine if the whole world used technologies like ChatGPT and only OpenAI and Microsoft, Google and maybe two or three other big companies controlled those technologies. They can cut you off at any time, or tell you “Sorry, but I can’t do this task, it’s unethical in my opinion”, “I have to block you for an hour now”, or “Sorry, your request might be in competition with a Microsoft product, now I have to block you forever. Bye.”

 

**devmio: We had also spoken to other experts, for example, Pieter Buteneers and Christoph Henkelmann, who had similar concerns. But the question remains whether everyone should really have unrestricted access to such technologies, right?**

 

**Christoph Schuhmann:** A lot of criticism, not directed at LAION but at Stable Diffusion, goes in this direction. There is criticism that there are open-source models like Stable Diffusion that can be used to create harmful content, circumvent copyright, create fakes, and so on. Of course, it's wrong to violate copyright, and it's also wrong to create harmful content and fakes. But imagine if these technologies were only in the hands of Microsoft, Google, and a few more large research labs. They would keep improving behind closed doors, and at some point, you would be able to generate everything perfectly with them. And then they leak out, or there is a replica, and society is not prepared at all. Small and medium-sized university labs wouldn't be prepared at all to look at the source code and discover the problems.

 

We have something similar with LAION-5B. There are also some questionable images in the dataset that we were unable to filter. As a result, there is also a disclaimer that it is a research dataset that should be thoroughly filtered and examined before being used in production. You have to handle this set carefully and responsibly. But this also means that you can find things in the set that you would like to remove from the internet.

 

For example, there is an organisation of artists, [Have I Been Trained](https://haveibeentrained.com/), that provides a tool that artists can use to determine if their artwork is included in LAION-5B. This organisation has simply taken our open-source code and used it for their own purposes to organise the disappointed artists.

 

And that’s a great thing because now all those artists who have images on the internet that they don’t want there can find them and have them removed. And not only artists! For example, if I have a picture of myself on the internet that I don’t want there, I can find out through LAION-5B where it is being used. We don’t have the images stored in LAION-5B, we just have a table with the links, it’s just an index. But through that, you can find out which URL is linked to the image and then contact the owners of the site and have the image removed. By doing this, LAION generates transparency and gives security researchers an early opportunity to work with these technologies and figure out how to make them more secure. And that’s important because this technology is coming one way or another.
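As a sketch of what "just an index" means in practice, the snippet below filters one (hypothetical) shard of the LAION metadata with pandas; the file name and the column names `URL` and `TEXT` are assumptions about the published parquet layout.

```python
# Sketch: LAION releases are metadata tables, not images, so finding where a
# picture is hosted is a simple filter over the index. File and column names
# here are assumptions.
import pandas as pd

index = pd.read_parquet("laion_metadata_part_00000.parquet")   # one shard of the index

# Find all entries whose image URL points at a particular domain.
mine = index[index["URL"].str.contains("my-portfolio-site.example", na=False)]

for _, row in mine.iterrows():
    print(row["URL"], "->", row["TEXT"])
```

Services such as Have I Been Trained build a friendlier search interface on top of the same index.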

 

In probably a lot less than five years, you’re going to be able to generate pretty much anything in terms of images that you can describe in words, photo-realistically, so that a human being with the naked eye can’t tell whether it’s a photo or not.

 

## AI in Law, Politics, and Society

 

**devmio: Because you also mentioned copyright: The legal situation in Germany regarding AI, copyright, and other issues is probably not entirely clear. Are there sufficient mechanisms? Do you think that the new EU regulations that are coming will be sufficient while not hindering creativity and research?**

 

**Christoph Schuhmann:** I am not a lawyer, but we have good lawyers advising us. There is an EU-wide text and data mining exception to copyright. It allows non-profit institutions, such as universities, but also associations like ours whose focus is on research and who make their results publicly available, to download and analyse things that are openly available on the internet.

 

We are allowed to temporarily store the links, texts, whatever, and when we no longer need them for research, we have to delete them. This law explicitly allows data mining for research, and that is very good. I don’t think all the details of what’s going to happen in the future, especially with ChatGPT and other generative AIs for text and images, were anticipated in these laws. The people who made the law probably had more statistical analysis of the internet in mind and less training data for AIs.

 

I would like to see more clarity from legislators in the future. But I think that the current legal situation in Germany is very good, at least for non-profit organisations like LAION. I’m a bit worried that when the [EU AI Act](https://digital-strategy.ec.europa.eu/de/policies/european-approach-artificial-intelligence), which is being drafted, comes, something like general purpose AI, like ChatGPT, would be classified as high risk. If that were to be the case, it would mean that if you as an organisation operate or train a ChatGPT-like service, you would have to constantly account for everything meticulously and tick off a great many compliance rules, catalogues, and checklists.

 

Even if this is certainly well-intentioned, it would also severely restrict research and development, especially by open-source projects, associations, and grassroots movements, so that only big tech corporations would be able to comply with all the rules. Whether this will happen is unclear so far. I don't want high-risk applications like facial recognition to go unregulated either. And I don't want to be monitored all day.

 

But if any lawmakers are reading this: Politicians should keep in mind that it is very important to continue to enable open-source AI. It would be very good if we could continue to practice as we have been doing. Not only for LAION but for Europe. I am sure that quite a lot of companies and private people, maybe even state institutions can benefit from such models as CLIP or from the datasets that we are making.

 

And I believe that this can generate a lot of value for citizens and companies in the EU. So I would even go so far as to call for politicians and donors to maybe think about building something similar to a CERN for AI. With a billion euros, you could probably build a great open-source supercomputer that all companies and universities, in fact, anyone, could use to do AI research under two conditions: First, the whole thing has to be reviewed by some smart people, maybe experts and people from the open-source community. Second, all results, research papers, checkpoints of models, and datasets must be released under a fully open-source licence.

 

Because then a lot of companies that can’t afford a supercomputer at the moment could open source their research there and only keep the fine-tuning or anything that is really sensitive to the business model on the companies’ own computers. But all the other stuff happens openly. That would be great for a lot of companies, that would be great for a lot of medium and small universities, and that would also be great for groups like LAION.

 

_**Editor’s note**: After the interview, LAION started a petition for a CERN-like project. Read more on [LAION’s blog](https://laion.ai/blog/petition/)._

 

## AI for a Better World

 

**Christoph Schuhmann:** Another application for AI would be a project close to my heart: Imagine there is an open-source ChatGPT. You would then take, say, 100 teachers and have them answer questions from students about all sorts of subjects. For these questions, you could make really nice step-by-step explanations that really make sense. And then, you would collect data from the 100 teachers for the school material up to the tenth grade. That’s at least similar everywhere in the Western world, except, of course, history, politics, etc. But suppose you were to simply break down the subject matter from 100 countries, from 100 teachers, from the largest Western countries, and use that to fine-tune a ChatGPT model.

 

You need a model that has maybe 20 to 30 billion parameters, and you could use it to give access to first-class education to billions of children in the Third World who don’t have schools but have an old mobile phone and internet access. You don’t need high-tech future technology, you can do that with today’s technology. And these are big problems of the world that could be addressed with it.

 

Or another application: My mum is 83 years old; she can't handle a computer and is often lonely. Imagine if she had a Siri that she could have a sensible conversation with. Not as a substitute for human relationships, but as a supplement. How many lonely old people do you think would be happier if they could just ask what's going on in the world? Or "Remember when I told you that story, Siri? Back in my second marriage 30 years ago?" That would make a lot of people happier. And I think things like that can have a big effect with relatively little financial outlay.

 

**devmio: And what do you see next in AI development?**

 

**Christoph Schuhmann:** What I just talked about could happen in the next five years. Everything that happens after that, I can't really predict. It's going to be insane.

 

**devmio: Thank you very much for taking the time to talk to us!**

# Three Key Considerations When Implementing AI

https://mlconference.ai/blog/three-key-considerations/ (published Wed, 16 Nov 2022)

For some time now, artificial intelligence that allows an image to be generated from a text input has been more or less freely available. Well-known examples are OpenAI's DALL-E and Google's Imagen. Not too long ago, Stability.ai's DreamStudio.ai was released, which, unlike the other AIs, is completely open source.

In fields in which analytics, prediction, machine learning, and decision making are paramount, AI works best when categories can be strictly defined and there is an established ground truth, a definitive answer from which it can be modeled. Essentially, AI needs to be taught by examining a problem back to front; from there, it figures out which attributes are deterministic or descriptive and applies these learnings to new data sets. So, in a sense, AI is only as good as the education it receives.

## The application of AI in patent offices

This pattern can be seen in the application of AI to patent office examinations, patent classification, reporting, and other critical workflows. Within examinations, it is used to greatly accelerate, and increase the accuracy of, a necessary step in the patent approval process: prior art searches.

Prior art is any published evidence that an invention is already known. It can take numerous forms, from a mere description of an idea or formula to a centuries-old piece of technology or an existing product. When a new patent application is filed, patent office searchers and examiners spend much of their time searching documents and other assets around the world and evaluating the results to determine whether the target application encroaches on existing prior art.

The results of these searches determine whether an invention meets the patent protection criteria of novelty and non-obviousness. The former is the notion that an invention must be new, and therefore not known in the public domain prior to the application filing date, while the latter is the notion that an invention must be non-obvious and not a logical extension of a pre-existing invention that any skilled member of that field could feasibly surmise. Across millions of patents, a single instance of prior art can be used to reject an application or to send it back to the applicant for revision.

The process of searching for prior art is complicated, iterative, and time-consuming. For each search, examiners must devise a search strategy, select which databases to search, create the search parameters, perform the search, evaluate the results, and then if needed, modify and rerun the search.

According to an analysis of search activity conducted by the European Patent Office, a comprehensive patent application search draws on around 1.3 billion technical records in 179 databases, leading to about 600 million documents appearing in search results monthly. Another study by the Japan Patent Office estimated that its staff spent around 40% of their time conducting and reviewing prior art searches through traditional and rather labor-intensive tools.

The rapid growth in patent applications and the complexity of inventions, coupled with the staggering volume of materials to search, means that patent offices are always considering new ways to accelerate the application process to avoid long pendencies and, in a few cases, backlogs. Indeed, according to WIPO (the World Intellectual Property Organization), there were 5.7 million patent applications pending worldwide in 2019. To keep up with this flood of applications, patent offices hire more examiners and adopt technologies to improve productivity.

## The integration of AI solutions in Brazil’s patent office

In 2020, one such office experiencing a sizeable patent backlog was INPI Brazil. With around 150,000 applications pending and an average wait time of more than 10 years, their backlog was significantly impacting innovation in Latin America’s largest economy and thereby limiting investments.

A sizeable chunk of their backlog, around 15%, consisted of chemistry patents. Chemistry patents require searches of both text and chemical structures within patent and non-patent publications and include full text and structure queries, which make finding similarities and relevance between the application patent and existing art a far more demanding review process than other patent applications.

INPI partnered with CAS, which offered an AI solution that could analyze the complexities of chemistry prior art to solve this problem, streamline workflow processes, and tackle the backlog. In collaboration with INPI, a unique AI approach was created that accelerated the laborious task of discovering prior art: the solution’s search algorithms focused on multiple facets of patents to determine similarity between the target application and existing patent and non-patent publications and to refine results. An additional algorithm then created a relevance-ranked data set for examiners to review. The results of this solution were impressive, with up to a 50% reduction in examination times, reduced search times for over 75% of applications processed, and an overall reduction of 80% in the office’s patent backlog. CAS arrived at this tailored solution through constant refinement and by considering three factors.

## Three considerations when implementing AI

### 1. Quality data and human-curated data sets

While AI solutions can speed up prior art searches exponentially, AI alone is not a silver bullet and cannot replace patent examiners. It can, however, become a powerful tool that patent examiners use to enhance their workflows. The secret lies in possessing curated, highly structured content that can train an algorithm correctly, and then in utilizing experts to maximize its application. In this regard, CAS sees AI as just the latest technology layered on top of its continuously updated data sets to improve search and retrieval of information, supplemented with extensive subject matter expertise and services.

Two waves in publishing have made the careful curation of content even more necessary: digitization and globalization. Digitization is the process of converting physical materials, such as books, illustrations, objects, and analog recordings and photos, into digital form. Globalization, in turn, is the translation of these sorts of materials into other languages, as patents are territorial and must be filed in each country where protection is sought. These waves pose significant roadblocks to optimizing AI-powered prior art searches. Digitization often leads to transcription errors, mislabelled units, and overly complex patent language, while globalization leads to patents in dozens of languages. Each of these makes human curation a necessity for quality data that can be easily searched and retrieved.

Thankfully, CAS has a vast catalog of expertly human-curated data. In fact, CAS has been crowdsourcing data for over a century, gathering abstracts from public and private domains since 1907. This vast catalog has been normalized, prepared, and connected in a structured format, which improves the training of AI algorithms and increases the performance of prior art searches. By augmenting AI technology with human expertise for INPI, CAS scientists fed clean, structured data to the AI solution, improving its predictive accuracy.

### 2. Domain expertise

Another consideration is to leverage the know-how of domain experts to refine the AI solution throughout a project. The INPI project required CAS to provide a wide array of expertise from distributed algorithms and machine learning to data science, cheminformatics, patent searching, and high-performance computing.

The CAS IP search team was therefore able to support the examiners’ searches by validating algorithm results during development and performing highly complex searches to augment the office’s capabilities. With individual prior art searches often variable in scope, different search professionals are likely to design different strategies for a given search. Having a team of search experts available to analyze algorithm results made it possible to gain insights into how those algorithms could be fine-tuned to improve relevance.

### 3. Choosing the right algorithms

As has been established, completing a comprehensive prior art search is a painstaking process that requires the consideration of multiple facets of possible similarity. Therefore, choosing only one algorithm focusing on a single type of analysis, such as semantics, will prove insufficient to the task. For the INPI project, CAS chose to integrate four types of algorithms for text-based and substance-based analysis, including deep learning and term frequency-inverse document frequency. Using multiple algorithms allowed the AI to find semantic, syntactic, and substance similarities all in one multifaceted solution.

Traditional knowledge graphs were also added to analyze the connectedness between the vast amounts of data. The INPI Brazil project deployed one for chemistry and one for non-chemistry to determine ontological similarity and connectedness between documents using keywords, scientific topics, roles, and nomenclature.

The first-level algorithms evaluated semantics, such as title, abstract, and claims between patent and non-patent publications, and used a syntactic-driven algorithm that compared the prevalence of special terms in the target document to their uniqueness across all other documents to return an accurate set of similarity results.
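The syntactic comparison described here is, in spirit, classic TF-IDF weighting. A generic sketch of that idea follows (made-up snippet text, and not CAS's production system):

```python
# Sketch: rank candidate prior-art documents against a target application using
# TF-IDF vectors and cosine similarity. Purely illustrative; a real pipeline
# combines several such signals.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

target = "A battery electrode coated with a graphene-based composite material."
candidates = [
    "Electrode materials comprising graphene oxide composites for energy storage.",
    "A method for brewing coffee using a pressurized chamber.",
    "Carbon-based coatings for lithium-ion battery anodes.",
]

vectorizer = TfidfVectorizer(stop_words="english")
matrix = vectorizer.fit_transform([target] + candidates)        # row 0 = target
scores = cosine_similarity(matrix[0], matrix[1:]).ravel()

for doc, score in sorted(zip(candidates, scores), key=lambda pair: -pair[1]):
    print(f"{score:.3f}  {doc}")
```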

Then, at the second level, an algorithm for a patented ensemble learning process combined the results to produce an optimal predictive model, which was then used to generate relevance-ranked results based on search context and each algorithm’s strengths and limitations. The ensemble learning algorithm then analyzed the ranked results arriving at a single prioritized list of patent and non-patent publications that were most likely to conflict with the target patent for the examiners to review.
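As a stand-in for that combination step, the sketch below merges several ranked result lists with reciprocal rank fusion; the actual CAS ensemble is a patented, trained model, so this is only meant to illustrate the general idea of fusing rankings from different algorithms.

```python
# Sketch: fuse ranked lists from different similarity algorithms into one
# prioritized list using reciprocal rank fusion (RRF). Document IDs are made up.
from collections import defaultdict

def reciprocal_rank_fusion(ranked_lists, k: int = 60):
    """Each ranked list is a sequence of document IDs, best match first."""
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

semantic   = ["US123", "EP456", "WO789"]   # e.g. embedding similarity
syntactic  = ["EP456", "US123", "JP321"]   # e.g. TF-IDF similarity
structural = ["WO789", "EP456", "US123"]   # e.g. chemical-structure match

print(reciprocal_rank_fusion([semantic, syntactic, structural]))
```

Documents that score well under several independent signals bubble to the top, which is the behaviour an examiner-facing relevance ranking needs.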

## Worldwide applicability of tailored AI

When implemented correctly, as in the INPI project, AI can transform patent office workflows and remove tedious tasks to free up researchers’ and examiners’ time for value-add work. There is no one-size-fits-all solution for these complex workflows and undertakings. The key is close collaboration between the office and solution experts to ensure the approach is perfectly aligned with the office’s strategic objectives.

Global patent offices face fundamental challenges that put their operational sustainability at risk. By combining AI, human-curated data, and workflow transformation, CAS has established an extremely effective approach for improving patent office timeliness, patent quality, and efficiency to help accelerate innovation around the world.

# AI in Vaccine Development and Rollout

https://mlconference.ai/blog/ai-in-vaccine-development/ (published Tue, 04 Oct 2022)

Developing a vaccine is an expensive endeavor. Costs can amount to $500 million [1] from the research phase through to vaccine registration, and the failure rate is high. The situation is aggravated by the fact that viruses mutate, rendering vaccines less effective. And even when vaccines are already in production, it is still a challenge to manage administration and protect the most endangered population segments.

Fortunately, it’s possible to accelerate and improve the processes by involving AI in vaccine development and rollout.

This article describes use cases and tools that AI healthcare companies [2] and research teams built to facilitate vaccine design, speed up trials, predict mutations, prioritize patients, and address vaccine hesitancy.

## How AI contributes to vaccine development

Artificial intelligence can analyze massive datasets representing virus structure to pinpoint viable vaccine targets, predict virus mutation, assist in clinical trials, and help researchers organize and access a large volume of scientific publications.

### AI identifies vaccine targets

Vaccine development is a data-intensive process as one needs to understand the virus itself and how the immune system will react to it. Machine learning algorithms [3] can analyze large datasets to identify which targets (or epitopes) of a virus are most likely to provoke an immune response. After obtaining a list of targets, scientists design matching vaccines.

While determining vaccine targets, one needs to be very careful not to enlist any entities similar to the host proteins that inhabit human bodies, in order to avoid cross-reactions and undesirable side effects.

#### Protein-based AI vaccines

Machine learning algorithms can identify antigens from protein sequences and determine the most viable vaccine target. There are several research initiatives [4] that use AI models to fight COVID-19. One method employs AI to develop a vaccine that would contain both T-cell and B-cell epitopes (the parts of an antigen that the immune system can recognize). This study discovered 17 potential vaccine peptides that work with both types of immune cells.

In another example, a team of researchers from Baylor College of Medicine and Amity University in India built an AI-driven platform that facilitates vaccine target discovery [5]. Researchers used this software to develop a vaccine against Chagas disease. They identified eight main target proteins and top epitopes for each target, producing a multi-epitope vaccine. Since the emergence of the Delta coronavirus variant, the scientists have been collaborating with several pharmaceutical companies to design a new vaccine.

#### DNA and RNA-based AI vaccines

Such vaccines are supposed to mimic a partial genetic sequence of a virus. They encapsulate a part of the virus’s genetic code representing the targeted epitope in the form of RNA or DNA. When this code enters a human cell, it produces the epitope in question triggering an immune response. Given the ability of viruses to mutate, the vaccine needs to be based on a relatively stable genetic component to have a long-lasting effect. That’s where artificial intelligence becomes useful. AI algorithms can analyze enormous datasets containing genetically sequenced viruses to identify the more stable parts.
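One very simple way to make "more stable parts" concrete is to measure per-position conservation across aligned genome sequences: positions that rarely change across variants have low entropy. The toy sketch below uses made-up sequences; real analyses work on thousands of aligned genomes and far richer models.

```python
# Sketch: flag conserved (low-entropy) positions across aligned viral sequences
# as candidate regions for a stable vaccine target. Toy data only.
import math
from collections import Counter

aligned = [                      # pre-aligned sequences of equal length
    "ATGGCTACGT",
    "ATGGCAACGT",
    "ATGGCTACGA",
    "ATGACTACGT",
]

def column_entropy(column: str) -> float:
    """Shannon entropy of the characters observed at one alignment position."""
    counts = Counter(column)
    total = len(column)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

entropies = [column_entropy("".join(seq[i] for seq in aligned))
             for i in range(len(aligned[0]))]

stable = [i for i, h in enumerate(entropies) if h == 0.0]   # perfectly conserved positions
print("conserved positions:", stable)
```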

### AI facilitates preclinical testing and clinical trials

#### Preclinical testing

The goal of this phase is to evaluate a vaccine’s safety and efficacy prior to testing it on people in clinical trials. Preclinical testing is typically conducted on a suitable animal model, but over the past decade, regulatory agencies have been calling for the use of alternative methods when possible. AI can be one of these methods, as ML algorithms can predict compound toxicity.

Even if AI can’t fully replace preclinical testing, the technology can facilitate it by helping to set the proper dosage, anticipating some immune responses, and even selecting the best-suited animal model.

#### Clinical trials

Again, AI can’t make clinical trials virtual, but it can largely facilitate them. First, artificial intelligence can analyze the data obtained from preclinical testing and anticipate human immunogenic reactions.

Second, AI algorithms can help researchers find the best location for clinical trials. For example, the MIT School of Engineering built an ML-powered COVID-19 epidemiological model [6] that generates real-time insights about the pandemic and captures people’s behavior and health status (exposed, recovered, etc.). It can also predict how different governments would react to the challenge and what their policy choices would be. All of this enabled the model to predict when and where COVID would spike, pinpointing ideal locations for clinical trials.

This AI tool could make intelligent predictions on 120 countries in addition to all 50 US states.

Third, AI accelerates vaccine rollout as it helps select the right people for trials through electronic health record mining. Given that 86% of clinical trials can’t recruit [7] candidates within the expected time frame, any help is welcome.

### AI outsmarts virus mutation

There is a general concern about viruses being able to change and adapt to medication. In light of the pandemic, SARS-CoV-2 is mutating and people are afraid that the publicly available vaccines will not provide long-lasting protection.

The scientific community is trying to stay ahead, as researchers at USC Viterbi experiment with AI to predict and counter mutations [8]. This team has already used an AI-powered tool to determine potential vaccine targets using one B-cell and one T-cell epitope. Employing a wider dataset for AI-driven vaccine design will enable them to fight mutations more effectively, especially since the scientists claim their model can make accurate predictions using a set of 700,000 proteins.

Paul Bogdan, Associate Professor of Electrical and Computer Engineering at USC Viterbi, pointed out that their AI-based method could also vastly accelerate vaccine design, “This AI framework, applied to the specifics of this virus, can provide vaccine candidates within seconds and move them to clinical trials quickly to achieve preventive medical therapies without compromising safety.”

### AI organizes data and makes it available for researchers

Researchers keep adding new reports to the already enormous stack of literature on the novel coronavirus and other viruses for that matter. It is becoming increasingly challenging to sift through all these publications. And again, scientists turn to AI to extract valuable insights from these papers.

For example, the Allen Institute built a resource called CORD-19, [9] which offers scientific articles on COVID-19 in a machine-readable format. Other researchers can develop AI algorithms to access this platform and answer queries.

## How AI supports vaccine rollout

AI technology’s potential spans beyond vaccine development to distribution, tracking, administration, and counseling.

### Artificial intelligence prioritizes people for vaccination

Many hospitals prioritize patients solely based on their age and rush to vaccinate everyone in the 65+ age category without further differentiation. For a more informed approach to vaccine distribution, AI-powered algorithms can help medical facilities identify the most fragile population segments.

Sanford Health, a Dakota-based healthcare organization, deployed AI to identify people at risk of having poor outcomes from COVID-19 [10]. They ran an algorithm on their patients of the age of 65 and older to produce a prioritized list based on various health-related factors, such as obesity, kidney disease, heart disease, and diabetes among others.

### Artificial intelligence monitors vaccine distribution and tracking

Using artificial intelligence in vaccine distribution, handling, and storage can have many benefits. Cheryl Rodenfels, Healthcare Strategist at Nutanix, mentions some of them: [11] “Relying on the technology [AI] to manage distribution data eliminates human error and ensures that healthcare organizations are accurately tracking the vast amounts of data associated with the vaccine rollout.”

However, deploying AI at this level is difficult, as every manufacturer has its own procedures for vaccine storage and handling. There are no unified standards on, for example, how many vaccines a medical facility must store.

### AI eases vaccine hesitancy

The spread of misinformation and vaccine hesitancy presents another problem that AI can help address. AI-powered chatbots that combine knowledge of psychology, public health, and infectious diseases can offer counseling and answer some of the sensitive questions. A recent study conducted in France [12] shows that bots can make people feel more positive towards vaccines.

Johns Hopkins Bloomberg School of Public Health teamed with IBM to develop a chatbot named Vira (Vaccine Information Resource Assistant). [12] They trained the bot through conversations with healthcare workers. Now Vira is used by regular people, and it continues to improve and learn.

## Obstacles on the way to AI deployment

There is no doubt that AI can analyze large volumes of data much faster than humans. According to Dr. Kamal Rawal, Associate Professor at Amity University, who participated in building an AI-driven platform [13] for vaccine development, “The key innovation is using artificial intelligence to combine several hundred parameters to mine several thousand proteins and genes to reach to the right targets and design vaccine using these proteins.”

One interesting characteristic of AI is that it doesn’t make assumptions about what is right and wrong, so it can test the options that researchers tend to discard based on biased beliefs. However, there are things to consider when deploying AI in vaccine development and administration:

  • Black-box models [14] are powerful, but their results can’t be justified, and bias can sneak in unnoticed. It is advisable to use explainable AI to understand how algorithms arrive at their conclusions. However, this will compromise their predictive power, so there is a tradeoff to make (a minimal explainability sketch follows this list).
  • The performance of machine learning algorithms depends on the training dataset, and immunology models are being trained on significantly smaller datasets [15] than the ones available for other disciplines, such as voice recognition.
  • AI ethics is still a complex topic to approach. Using AI in vaccine development might grant it access to patient records, and the issue of privacy comes in. Another ethical concern arises when using AI in vaccine prioritization. Research shows [16] that race and ethnicity contribute to higher hospitalization risks in the case of COVID, but is it ethical to use such data?
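As a small illustration of the explainability point in the first bullet, the sketch below uses permutation importance, a simple model-agnostic technique, on synthetic data; it is not tied to any of the vaccine systems described in this article.

```python
# Sketch: peek into a "black box" classifier with permutation importance.
# Synthetic data; in practice the features would be clinical or molecular attributes.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(random_state=0).fit(X_train, y_train)
result = permutation_importance(model, X_test, y_test, n_repeats=20, random_state=0)

# Features whose shuffling hurts accuracy the most matter most to the model.
for idx in result.importances_mean.argsort()[::-1]:
    print(f"feature {idx}: importance {result.importances_mean[idx]:.3f}")
```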

Salesforce launched a Vaccine Cloud tool which is expected to help healthcare organizations manage vaccine administration. The company faced the same ethical concern. Here is what a Salesforce spokesperson told Healthcare IT News: “Our Principles for the Ethical Use of COVID-19 Vaccine Technology Solutions explicitly state that AI should not be used to predict personal characteristics or beliefs that would affect a person’s or group’s prioritization for access to vaccines, and we work closely with our partners and teams on this guidance.”

## On a final note

With its analytical power, AI still can’t foresee everything. As Oren Etzioni, CEO at the Allen Institute for Artificial Intelligence, said, [17] “The human body is so complex that our models cannot necessarily predict with reliability what this molecule or this vaccine will do for the body.” So, using AI can’t replace clinical trials and can’t make vaccine development entirely virtual and fully automated.

Still, artificial intelligence can analyze large volumes of data and detect patterns that escape the human eye. With all the applications mentioned above, AI can vastly accelerate vaccine development and help control vaccine rollouts.

## Links & Literature

[1] https://www.frontiersin.org/articles/10.3389/fimmu.2020.517290/full

[2] https://itrexgroup.com/services/ai-for-healthcare/

[3] https://jaxenter.com/basic-introduction-machine-learning-145140.html

[4] https://www.frontiersin.org/articles/10.3389/frai.2020.00065/full#B117

[5] https://www.bcm.edu/news/researchers-develop-ai-platform-to-boost-vaccine-development

[6] https://news.mit.edu/2021/behind-covid-19-vaccine-development-0518

[7] https://bioprocessintl.com/manufacturing/information-technology/in-silico-vaccine-design-the-role-of-artificial-intelligence-and-digital-health-part-1/

[8] https://news.usc.edu/181226/artificial-intelligence-ai-coronavirus-vaccines-mutations-usc-research/

[9] https://www.semanticscholar.org/cord19

[10] https://www.mprnews.org/story/2021/02/10/one-minn-health-care-provider-using-ai-to-pair-patients-with-covid19-shots

[11] https://www.techrepublic.com/article/how-ai-is-being-used-for-covid-19-vaccine-creation-and-distribution/

[12] https://psyarxiv.com/eb2gt/

[13] https://www.gavi.org/vaccineswork/are-chatbots-better-humans-fighting-vaccine-hesitancy

[14] https://www.bcm.edu/news/researchers-develop-ai-platform-to-boost-vaccine-development

[15] https://jaxenter.com/data-ai-models-172220.html

[16] https://www.brookings.edu/techstream/can-artificial-intelligence-help-us-design-vaccines/

[17] https://www.healthcareitnews.com/news/ai-has-advantages-covid-19-vaccine-rollout-potential-dangers-too

[18] https://spectrum.ieee.org/what-ai-can-and-cant-do-in-the-race-for-a-coronavirus-vaccine

# Speaker Testimonial: Christoph Henkelmann

https://mlconference.ai/blog/christoph-henkelmann-testimonial/ (published Wed, 03 Nov 2021)

What are the current ML trends? How can you stay ahead of the curve and enhance your ML skills? And what sets ML Conference apart from any other conference? Watch Christoph Henkelmann as he shares his ML knowledge in this latest video testimonial.

Christoph Henkelmann is a renowned speaker at the ML Conference. He has been a pioneer in the field of machine learning and is well known for his work. At the ML Conference, he continues to lead from the front and presents on various machine learning topics.

Christoph Henkelmann holds a degree in Computer Science from the University of Bonn. He currently works at DIVISIO, an AI company from Cologne, where he is CTO and co-founder. At DIVISIO, he combines practical knowledge from two decades of server and mobile development with proven AI and ML technology. In his free time he grows cacti, practices the piano, and plays video games. Watch what Christoph Henkelmann has to say about the unique conference experience at ML Conference and why it is a must-attend event for ML enthusiasts. Stay ahead of the trends and attend the ML Conference across the globe and online.

 

# 4 arguments to convince your boss

https://mlconference.ai/blog/convince-your-boss/ (published Tue, 10 Sep 2019)

You can't see the forest for the trees anymore and urgently need new inspiration? Then ML Conference is the place to be. Connect with like-minded people and widen your horizon while gaining deep insights and practical knowledge of the latest trends and technologies.

Sounds fantastic, but you don’t know how to convince your boss to send you to the conference? Don’t worry. We’ll take you by the hand, lead you through the dark forest to the clearing, and show you the four most important arguments with which you can convince your boss.

 


So you don’t have to deal with the wording, we have the ultimate text template for your boss.

## A template for the e-mail to your boss

Dear Mr / Mrs (…),

I would like to ask for your permission to participate in the ML Conference, which will take place in Berlin from December 9th – 11th.

The ML conference provides valuable know-how on machine learning as well as on topics such as Deep Learning, TensorFlow, Reinforcement Learning and Chatbots.

The highlights of the ML Conference Fall 2019 are:

    • 3 conference days
    • 1 power workshop day
    • 25+ sessions, workshops and keynotes
    • 25+ international experienced Machine Learning experts
    • Speakers from all over the world share their knowledge and newest insights.
    • Best practices and lessons learned on new trends and tools that provide ideas for daily work.
    • Opportunity to meet and network with the best in the industry.
    • Contents of the sessions are available for download.

All information about the conference and early bird prices can be found here.

If I may attend the conference, I would like to give my team a summary of it and share my experiences.

Best regards,

(your name)

 

# How UX can demystify AI: “We need more than just technical transparency”

https://mlconference.ai/blog/ai-ux-interview-ward-van-laer/ (published Fri, 19 Jul 2019)

Can UX demystify AI? Ward Van Laer answers this question in his session at the ML Conference 2019. We invited him for an interview and asked him how to solve the black box problem in machine learning by merely improving the user experience.

JAXenter: Machine learning is regarded by many as a kind of miracle: we train the machine with data until it can make decisions independently. How these decisions are made remains something of a mystery; because it is no longer comprehensible, we end up with the “black box problem”. Does it have to be that way?

Ward Van Laer: The black box problem is a perception created by what is, for most people, an unintelligible jumble of machine learning models. But the decisions the models make are always based on the data we feed them. Will we be able to design completely transparent models without having to compromise on the complexity of the problems to be solved? In my opinion, the real question is what kind of explainability we really need to demystify the black box perception.

JAXenter: In your talk at the ML Conference you show how to develop transparent machine learning models. How does that work?

Ward Van Laer: I will demonstrate that explainability can be interpreted in multiple ways. Depending on the perspective from which we look at an AI system, explainable AI can mean different things.

We can look at explainability in a technical way, which means we are looking through the eyes of machine learning engineers, for example. In this case, transparent AI can help to spot dataset biases. More importantly, this technical explainability is not interesting or understandable for an end-user. From this perspective, UX will play a crucial role in demystifying AI applications.

JAXenter: Why do you think transparency in ML is important?

Ward Van Laer: I believe we need more than just technical transparency, or as it is referred to at the moment, “explainable AI”. We need to pinpoint the properties that lie at the foundation of trustworthy AI, instead of focusing on full transparency.

JAXenter: Can you give an example of how a good UX changes the acceptance of AI solutions?

Ward Van Laer: In one of our projects in the health care industry we visualize links between classification results and the dataset, which helps physicians understand why certain decisions are made.
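As a generic illustration of that idea, not the actual health care project, the sketch below surfaces the training examples most similar to a new case next to the model's prediction, using synthetic data.

```python
# Sketch: show a prediction together with the most similar known training cases,
# so a domain expert can see what the decision resembles. Synthetic data only.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.neighbors import NearestNeighbors

X_train, y_train = make_classification(n_samples=300, n_features=6, random_state=1)
model = GradientBoostingClassifier().fit(X_train, y_train)
index = NearestNeighbors(n_neighbors=3).fit(X_train)

new_case = X_train[:1]                         # stand-in for a new patient record
prediction = model.predict(new_case)[0]
_, neighbor_ids = index.kneighbors(new_case)

print(f"prediction: class {prediction}")
print("most similar known cases:", neighbor_ids[0].tolist(),
      "with labels", y_train[neighbor_ids[0]].tolist())
```

Presenting familiar reference cases alongside a prediction is often more convincing to end users than raw feature weights.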

To get more insight into the possibilities, I can certainly encourage everyone to attend my talk at MLConference 2019 😉

JAXenter: What is the core message of your session that every participant should take home?

Ward Van Laer: Creating a well-working machine learning model is only half of the work. Developing a thought-through User Experience is the key to successful AI.

Please complete the following sentences:

The fascinating thing about Machine Learning for me is…

… that it will be able to help us solve many complex problems (e.g. health care).

Without Machine Learning, humanity could never…

… improve itself.

The biggest current challenge in machine learning is…

… making sure that an AI system is successful in the eyes of the user.

I advise everyone to get started with machine learning …

… to better understand what the real possibilities are.

Once the machines have taken power…

… hmm let’s hope we can explain how it happened! 😉

JAXenter: Thank you very much!

The post How UX can demystify AI: “We need more than just technical transparency” appeared first on ML Conference.

The Ethics of AI – dealing with difficult choices in a non-binary world https://mlconference.ai/blog/keynote-the-ethics-of-ai-dealing-with-difficult-choices-in-a-non-binary-world/ Tue, 09 Jul 2019 16:19:03 +0000 https://mlconference.ai/?p=11624 In the field of machine learning, many ethical questions are taking on new meaning: On what basis does artificial intelligence make decisions? How can we avoid the transfer of social prejudices to machine learning models? What responsibility do developers have for the results of their algorithms? In his keynote from the Machine Learning Conference 2019, Eric Reiss examines dark patterns in the ethics of machine learning and looks for a better answer than "My company won’t let me do that."

Eric Reiss started working with user experience (UX) long before the term was even known. Over the past 40 years, he has encountered many issues that have disturbed him – from creating purposely addictive programs, sites, and apps, to the current zeitgeist for various design trends at the expense of basic usability. He has seen research that is faked, ignored, or twisted by internal company politics and by the cognitive bias of the design team. And he has seen countless dark patterns that suppress accessibility and diversity by promoting false beliefs and false security.

Whenever we say, “That’s not my problem,” or, “My company won’t let me do that,” we are handing over our ethical responsibility to someone else – for better or for worse. Do innocent decisions evolve so that they promote racism or gender discrimination through inadvertent cognitive bias or unwitting apathy? Far too often they do.

We, as technologists, hold incredible power to shape the things to come. That’s why he shares his thoughts with you so you can use this power to truly build a better world for those who come after us!

The post The Ethics of AI – dealing with difficult choices in a non-binary world appeared first on ML Conference.

Keynote http://commodity.ai https://mlconference.ai/blog/keynote-http-commodity-ai/ Fri, 08 Feb 2019 14:30:39 +0000 https://mlconference.ai/?p=10569 How can AI be turned into a commodity – a cheap, easily available product that is used by everyone? Will it even be possible to turn AI into a commodity at all? Dr. Pieter Buteneers (Robovision) addresses these questions in this keynote from ML Conference 2018 in Berlin.

For consumers, commodities are great. Companies compete for the lowest possible price because every product is almost the same. For the companies, on the other hand, it is a cut-throat business. Margins are low and differentiating yourself is nearly impossible. Building materials, vegetables, cars and even smartphones have become (near) commodities. But will AI ever become a commodity? And what are the hurdles we need to overcome to (not) get there?
(By the way, the URL http://commodity.ai seems to be for sale for an outrageous amount of money.)

Experience Dr. Pieter Buteneers live at ML Conference 2019 in Munich with his workshop: Machine Learning 101 ++ using Python

The post Keynote http://commodity.ai appeared first on ML Conference.

AI as a smart service for everyone https://mlconference.ai/blog/ai-smart-services-everyone/ Fri, 16 Nov 2018 10:03:13 +0000 https://mlconference.ai/?p=10150 If you cannot or do not want to build an AI project from scratch, you have countless ready-made services to choose from. But what can you do if the ready-made services do not fit the project? Customizable AI and ML models in the cloud, which you can train with your own data, provide a remedy.

Artificial intelligence (AI) and machine learning (ML) inspire the imagination of many SaaS providers. Wouldn't it be nice if we could replace complicated input masks with an easy-to-use bot? Why do we still have to type in travel expense receipts when a photo, with a smart AI in the background, could do the job? In practice, teams trying to do this run into a lot of problems. First of all, in many cases there is a lack of relevant development experience. Ready-made AI services such as Microsoft's Cognitive Services promise a remedy: instead of laboriously developing everything from scratch, you get easily consumable web APIs with usage-based pricing. Is this the fast lane to the AI future for typical SaaS projects?

Ready-made AI has limited value

The first step to answering this question begins with the availability of data. Admittedly, there are AI services like Microsoft's Text Analytics and Computer Vision or Google's Cloud Vision API that are completely ready to use. For example, to recognize the language of a text with Text Analytics, you need neither training data nor an understanding of machine learning; if you can send a text to a web API, you're ready to go. For some applications, this may be enough as a starting point (e.g. assigning a support case to a team member who speaks the right language). In most cases, however, it isn't. AI and machine learning only deliver real added value when they are adapted to a specific application.
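
To illustrate how low the entry barrier is, detecting the language of a support ticket is little more than an HTTP POST against such a ready-made service. The following is a minimal sketch assuming a Text Analytics-style endpoint; the exact URL, API version, header and response field names are assumptions to verify against the service's current documentation.

```python
# Minimal sketch: detecting the language of a text with a ready-made
# text-analytics web API. Endpoint URL, API version, header and response
# field names are assumptions -- check the service's current documentation.
import requests

ENDPOINT = "https://westeurope.api.cognitive.microsoft.com/text/analytics/v2.0/languages"
SUBSCRIPTION_KEY = "<your-subscription-key>"  # placeholder

def detect_language(text: str) -> str:
    payload = {"documents": [{"id": "1", "text": text}]}
    response = requests.post(
        ENDPOINT,
        headers={"Ocp-Apim-Subscription-Key": SUBSCRIPTION_KEY},
        json=payload,
        timeout=10,
    )
    response.raise_for_status()
    # Return the top-ranked language detected for the single document sent.
    return response.json()["documents"][0]["detectedLanguages"][0]["name"]

print(detect_language("Mein Drucker druckt nur noch leere Seiten."))  # e.g. "German"
```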

Customizable AI services

If there is no ready-made AI service, that doesn't mean that you have to do everything from scratch with libraries like TensorFlow or Microsoft Cognitive Toolkit (CNTK). There's a middle way: customizable AI and ML models that you can train with your own data. Here are two examples from Microsoft's product portfolio:

  • With the Custom Vision Service (currently in preview), you can tag images according to your own logic. Instead of writing the algorithm by hand or creating a deep learning model from scratch, you provide training data in the form of correctly tagged images. These are used to train a base model provided by Microsoft. The result is an individualized model behind a web API that can be used to classify new images (prediction); a minimal call sketch follows after this list. With this service, it is even possible to export the trained model and run it locally.
  • The Language Understanding Service (LUIS) helps to process language. If a user formulates a request in natural language, it's not easy to recognize the user's intention (intent, e.g. navigating, ordering a product, booking a trip) and the parameters contained in the sentence (entities, e.g. travel destination, product name, date of the trip). However, this ability is indispensable when programming a bot, for example. LUIS solves exactly this problem: training data is provided in the form of sample sentences (utterances) with the correct assignment to intents and entities (Fig. 1). The trained model can be deployed with a few clicks, and the resulting web API can be used directly or linked to the Azure Bot Service to develop a bot.
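
As a rough illustration of how a trained Custom Vision model is consumed, the prediction call boils down to posting an image to the model's endpoint. The URL structure and header name below follow the pattern the service used at the time of writing, but treat them as assumptions and copy the prediction URL shown in your own project instead.

```python
# Minimal sketch: classifying a new image against a trained Custom Vision
# model. The URL structure and the "Prediction-Key" header are assumptions
# based on the service described above -- use the prediction URL shown in
# your own project.
import requests

PREDICTION_URL = "https://southcentralus.api.cognitive.microsoft.com/customvision/v2.0/Prediction/<project-id>/image"
PREDICTION_KEY = "<your-prediction-key>"  # placeholder

def classify_image(path: str):
    with open(path, "rb") as image_file:
        response = requests.post(
            PREDICTION_URL,
            headers={
                "Prediction-Key": PREDICTION_KEY,
                "Content-Type": "application/octet-stream",
            },
            data=image_file.read(),
            timeout=10,
        )
    response.raise_for_status()
    # Each prediction carries a tag name and a probability.
    return [(p["tagName"], p["probability"]) for p in response.json()["predictions"]]

for tag, probability in classify_image("receipt.jpg"):
    print(f"{tag}: {probability:.2%}")
```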

Data is worth its weight in gold

These two examples show how fundamentally different “programming” (semi-)finished AI services is from classical development with program libraries. Our role as developers is no longer to write an algorithm; we have to take care of the training data. This task is anything but trivial, because the quality of the resulting deep learning model depends on the quality of the training data. If too little data is available, or if the existing training data is incorrect (e.g. wrong tags), of poor quality (e.g. blurry images, photos that are too similar) or not representative (sample sentences no real user would ever use), the result is useless. In addition, training data alone is not enough: further data is required so that the models can be tested.
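
The point that training data alone is not enough can be made concrete with a simple hold-out split: part of the labelled data is never shown during training and is used only to check the finished model. A minimal sketch with scikit-learn, using a built-in toy dataset as a stand-in for your own tagged images or utterances:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Stand-in dataset; in the scenario above this would be your tagged images
# or labelled sample sentences.
X, y = load_iris(return_X_y=True)

# Keep 20% of the labelled data aside purely for testing; it must never be
# used during training, otherwise the quality check is meaningless.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = RandomForestClassifier(random_state=42).fit(X_train, y_train)
print(f"Accuracy on held-out test data: {model.score(X_test, y_test):.2%}")
```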

Data is the new gold in the world of AI and ML. Prefabricated AI services in the cloud don't change this – on the contrary. As a team that wants to enter this world, the first thing you have to ask yourself is how to get the data you need. It is this hurdle that makes market entry so difficult for start-ups. Established companies either have existing databases or can fall back on an existing community that can be motivated to test AI-based software components such as bots, give feedback and thus indirectly provide the necessary training data.

Iterative model development

Iterative model development is an important aspect here. Customizable AI services such as the ones mentioned above contain ready-made components for reviewing real operational data (for example, sentences that users have said to a bot, or images that have been uploaded for tagging). If classification errors are discovered, it is easy to add the real data, with the correct metadata, to the training set and thus improve the AI model step by step (Fig. 2).
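
In essence, the improvement process described here is a loop: collect real inputs, let a human correct the misclassified ones, add them to the training set, retrain, redeploy. A schematic sketch of that loop follows; the `train`, `predict` and `human_review` callables are hypothetical placeholders for whichever service or library is actually used.

```python
# Schematic sketch of the feedback loop described above. `train`, `predict`
# and `human_review` are hypothetical placeholders, not a specific vendor API.

def improve_model(training_set, production_inputs, train, predict, human_review):
    model = train(training_set)
    for item in production_inputs:
        prediction = predict(model, item)
        corrected_label = human_review(item, prediction)  # returns None if correct
        if corrected_label is not None:
            # A classification error was found: keep the corrected example so
            # the next training run learns from it.
            training_set.append((item, corrected_label))
    return train(training_set)  # retrain on the enlarged training set
```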

In order for the iterative approach to work in practice, mechanisms must be in place to make the versioning, testing, and deployment of models simple and robust. AI services are usually offered serverless: as a development team, you don't have to worry about operating or scaling servers. You can deploy a model with one click, distinguish between test and production environments, use built-in version management, export models to source control for archiving, and much more (Fig. 3). Such functions reduce the time required for administration and DevOps processes to a minimum.

APIs for meta programming

Another feature of AI services is important for SaaS providers: not only are all functions interactively available via a web UI, but exactly the same functions can also be automated via web APIs. When you develop a multi-tenant SaaS solution, you often can't lump all your end customers together: every customer has slightly different requirements. Data models differ, workflows are customer-specific, master data is naturally different for each customer, and so on. If, for example, you want to offer each SaaS end customer an individual bot, the model must differ from customer to customer in order to achieve a high-quality result. The training data is different, and in many cases the models also differ structurally.

Through the APIs of AI services, SaaS providers can practice meta-programming: you write a program that is not used by the end customer directly, but that creates another program – in this case, an AI model built with an AI service.
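
A hedged sketch of what such meta-programming can look like: a provisioning script that, for every new tenant, creates a dedicated model via the AI service's management API, uploads the tenant-specific training data, and triggers training. The endpoints below are hypothetical placeholders and do not refer to any particular vendor's API.

```python
# Hedged sketch of meta-programming against an AI service's management API.
# The endpoints (/projects, /examples, /train) are hypothetical placeholders
# and do not refer to any specific vendor.
import requests

BASE_URL = "https://api.example-ai-service.com/v1"  # hypothetical
API_KEY = "<management-key>"                         # placeholder
HEADERS = {"Authorization": f"Bearer {API_KEY}"}

def provision_tenant_bot(tenant_id: str, utterances: list) -> str:
    """Create and train a separate language-understanding model per tenant."""
    # 1. Create a dedicated project/model for this tenant.
    project = requests.post(
        f"{BASE_URL}/projects",
        headers=HEADERS,
        json={"name": f"bot-{tenant_id}"},
        timeout=10,
    ).json()

    # 2. Upload the tenant-specific training utterances with their intents.
    requests.post(
        f"{BASE_URL}/projects/{project['id']}/examples",
        headers=HEADERS,
        json=utterances,
        timeout=10,
    ).raise_for_status()

    # 3. Trigger training; the service trains and deploys the model.
    requests.post(
        f"{BASE_URL}/projects/{project['id']}/train",
        headers=HEADERS,
        timeout=10,
    ).raise_for_status()
    return project["id"]

provision_tenant_bot("acme", [{"text": "book a trip to Berlin", "intent": "BookTrip"}])
```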

Challenges

It all sounds very tempting, doesn't it? AI and ML can be used in any project without any problems, even without relevant prior knowledge and with only a limited budget. This statement is basically correct, but in detail there are some challenges to overcome. The first one has already been briefly mentioned above: you need a lot of good-quality training data. In the age of GDPR, this is not only a technical but also a legal hurdle. The second challenge is the risk of expecting more from the selected AI service than it can deliver. As already mentioned, modern AI services let you adapt the prefabricated models, but you cannot control every aspect; after all, reducing complexity is precisely the strength of these services. Compared to classic SaaS and PaaS cloud services, however, evaluating AI services is a lot more difficult.

Until now, you could simply compare feature lists. With AI services, this is no longer so easy. Suppose you want to develop a SaaS solution in which license plate recognition plays a role. Are Microsoft's Computer Vision services suitable for this? Can you build a good solution with them if you pre-process the images for training and live operation? Would Google's counterpart deliver better results? In my experience, these questions cannot be answered theoretically. You need to build prototypes or get help from people who have domain-specific experience with the selected AI services.

Conclusion

AI and ML projects are often adventures that consume vast amounts of money and resources. Ready-made AI services in the cloud that can be adapted to the respective domain offer a shortcut in many cases and reduce the project risk. However, anyone who believes that such projects are trivial will be disappointed. Dealing with the data, automating the accompanying DevOps processes, evaluating the available AI services from various vendors, and much more force a serious examination of the topic. Otherwise, you will quickly get a result, but one that offers no real added benefit from the user's point of view.

Rainer Stropek has been an entrepreneur in the IT industry for more than twenty years. During this time, he founded and managed several IT service companies; today he and his team at his company software architects develop the award-winning software time cockpit. Rainer holds degrees from the Technical College of MIS, Leonding (AT) and the University of Derby (UK). He is the author of several books and magazine articles on Microsoft .NET and C#. His technical focus is on C# and the .NET Framework, XAML, the Microsoft Azure platform, SQL Server, and web development. Rainer regularly appears as a speaker and trainer at renowned conferences in Europe and the USA. In 2010, Rainer was named by Microsoft as one of the first MVPs for the Windows Azure platform. He has also been a Microsoft Regional Director since 2015.

The post AI as a smart service for everyone appeared first on ML Conference.

Too many ideas, too little data – Overcome the cold start problem https://mlconference.ai/blog/many-ideas-little-data-overcome-cold-start-problem/ Mon, 12 Nov 2018 15:03:56 +0000 https://mlconference.ai/?p=10137 The cold start problem affects startups and established companies alike. Nonetheless, it also provides a great opportunity to collect new data with your customer’s problem in focus. How do you solve the cold start problem and arrive at a useful data pipeline? We talked to ML Conference speakers Markus Nutz and Thomas Pawlitzki about all this and more.

Data scientists and product owners have a lot of great ideas. But often the data needed to answer the given questions and build a solution around them is missing. We talked to ML Conference speakers Markus Nutz and Thomas Pawlitzki about how to build a data pipeline starting from “zero data”.

Find out how to solve the cold start problem!

JAXenter: Databases need maintenance, we know that. But over the years impenetrable data thickets have grown in many companies. In your session you talk about unraveling the chaos, but where do you start?  

Markus Nutz: Fortunately, Freeyou hasn’t been around for that long, so we’ve been able to keep track of everything so far. The answer is probably pretty boring: documentation. Documentation involves all parties, which means that the requirements of product owners, data scientists and data architects all have equal status. We are aware that data is our basis for differentiating ourselves from other insurers.

Thomas Pawlitzki: I have nothing more to add to this. Our own database is still controllable. The development team talks a lot about features and changes so that the individual team members are aware of database changes. You don’t have to explain anything to data gurus like Markus.

In the last few weeks, I have also looked at various frameworks that we can use to develop our API. Some of them already offer features for data migration. For example, you can store schema changes to relational databases as code, apply them, and also roll them back. Perhaps we will soon use such solutions to test the whole thing at an early stage.
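
As an editorial illustration (not the team's actual setup), this is roughly what a schema change stored as code looks like with an Alembic-style migration: the upgrade applies the change, the downgrade rolls it back. The table and column names are hypothetical examples.

```python
# Sketch of a schema change written as code (Alembic-style migration), as one
# possible way to version, apply and roll back database changes. Table and
# column names are hypothetical examples.
from alembic import op
import sqlalchemy as sa

revision = "20181112_add_preferred_language"  # placeholder revision id
down_revision = None

def upgrade():
    # Apply the change: add a nullable column for the customer's preferred language.
    op.add_column("customers", sa.Column("preferred_language", sa.String(8), nullable=True))

def downgrade():
    # Roll the change back.
    op.drop_column("customers", "preferred_language")
```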

JAXenter: How can we solve the “cold start problem”?

Markus Nutz: In general, keep your eyes open to see where and what kind of data is available. Statistics about traffic accidents, for example, are often available in minor parliamentary inquiries in the state parliament. This was quite surprising to me. Pictures for a first image classifier are available online. Customer inquiries arise all by themselves!

Thomas Pawlitzki: You should also consider when it makes sense to create your own model and when to use a “ready-made” one. For example, we also use an API for image recognition. These APIs are very easy to integrate and do a really good job on general problems. We’d rather put our energy into solving problems that general APIs can’t solve. We still have very little data here. Fortunately, Markus knows enough tricks to polish small datasets and still come up with usable models.

Markus Nutz: Data augmentation – e.g. changing images, inserting spelling mistakes into words, translating emails into English and back again, window slicing for time series data – these are all strategies that make the existing “few” data as efficient to use as possible! When it comes to models for images and text, transfer learning of course; we are particularly interested in TensorFlow Hub, a library from Google for reusable machine learning modules.
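
To make two of these tricks concrete, here is an editorial sketch (not the speakers' actual code) of injecting a spelling mistake into a text and window-slicing a time series into overlapping training examples:

```python
# Minimal sketches of two augmentation tricks mentioned above: injecting
# spelling mistakes into text and window-slicing a time series. Both are
# illustrative assumptions about one possible implementation.
import random

def add_typo(text: str, rng: random.Random = random.Random(42)) -> str:
    """Swap two adjacent characters to simulate a spelling mistake."""
    if len(text) < 2:
        return text
    i = rng.randrange(len(text) - 1)
    chars = list(text)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

def window_slices(series: list, window: int, step: int = 1) -> list:
    """Cut a time series into overlapping windows to multiply training examples."""
    return [series[i:i + window] for i in range(0, len(series) - window + 1, step)]

print(add_typo("insurance claim"))        # e.g. "insuracne claim"
print(window_slices([1, 2, 3, 4, 5], 3))  # [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
```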

In general, we also make sure to use models that suit our existing data and don’t require huge amounts of data to work well. Logistic regression or random forests are simply great!
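
An editorial sketch of that idea: on a small dataset, a plain logistic regression evaluated with cross-validation is often all you need. The built-in dataset below is only a stand-in for the kind of small, tabular data described here.

```python
# Minimal sketch: on small datasets, simple models plus cross-validation often
# do the job. The built-in dataset is a stand-in for your own small dataset.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
scores = cross_val_score(model, X, y, cv=5)  # 5-fold cross-validation
print(f"Mean accuracy: {scores.mean():.2%} (+/- {scores.std():.2%})")
```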

JAXenter: In connection with building a data pipeline, you speak of “zero data”. Please give us a concrete example.

Markus Nutz: Oh, that was described misleadingly, then. We chose “zero data” because data – now it’s getting trite – exists everywhere around us and is available to us. We can evaluate initial ideas with datasets from Kaggle or the relatively new Google Dataset Search, and cross-reference official statistics with OpenStreetMap data. The incredibly detailed data allows us, for example, to estimate the risk of vandalism for bicycles and cars at a given location, or to find a good route from bicycle dealer to bicycle dealer for our sales team. It’s a free lunch, so to speak.
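
As an illustration of how accessible OpenStreetMap data is, bicycle dealers in a city can be pulled with a single query against the public Overpass API. The query below is an editorial example; the area name and tags are assumptions to adapt to your own use case.

```python
# Hedged sketch: querying OpenStreetMap via the public Overpass API for
# bicycle shops in a city. The area name and tags are example assumptions;
# adjust them and mind the API's usage policy.
import requests

OVERPASS_URL = "https://overpass-api.de/api/interpreter"

query = """
[out:json][timeout:25];
area[name="Köln"]->.city;
node["shop"="bicycle"](area.city);
out;
"""

response = requests.post(OVERPASS_URL, data={"data": query}, timeout=60)
response.raise_for_status()

for element in response.json()["elements"]:
    name = element.get("tags", {}).get("name", "(unnamed)")
    print(name, element["lat"], element["lon"])
```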

Thomas Pawlitzki: Yes, that was really surprising and enjoyable when we ran into the theft problem in a workshop on our bike insurance. We had briefly considered how we could get access to a good database and whether we should approach the various police stations. However, a five-minute search on the net showed that (at least for the location we examined) a daily newspaper offers up-to-date data. We were surprised and of course very happy about that.

JAXenter: How do you maintain your data pipeline?

Markus Nutz: Phew! I’d like to have a good answer to that, but we don’t have a good recipe yet. I’d say: testing. What helps in any case is that we, as an organization, have a common understanding. Data is what enables us to offer a better product that can distinguish us from the market. That’s why we’re all very motivated to make this happen!

Thomas Pawlitzki:  Yes, sometimes we are a bit “casual” and there’s still room for improvement. Nevertheless, the whole thing works surprisingly well, probably due to the great commitment of all the team members.

Thank you very much!


Markus Nutz and Thomas Pawlitzki will deliver a talk at ML Conference in Berlin on Wednesday, December 5, about their experience with the cold start problem and building a data pipeline. Starting from “zero data”, how do they arrive at a data pipeline with open, found, and collected data? Their data pipeline enables building data products that help customers in their daily lives.

Carina Schipper has been an editor at Java Magazine, Business Technology and JAXenter since 2017. She studied German and European Ethnology at the Julius-Maximilians-University Würzburg.

The post Too many ideas, too little data – Overcome the cold start problem appeared first on ML Conference.
