Власна оцінка ChatGPT показує, як плагіни веб-доступу, такі як WebPilot, можуть покращити роботу

In the rapidly evolving technology ecosystem, Open AI’s artificial intelligence (AI) innovations consistently push the boundaries of what is achievable. With such advancements come intricate challenges that require careful navigation. Генеративний ШІ technologies such as ChatGPT and its related web access plugins, like WebPilot, reside at this crossroad of discovery and complexity.

Assessing the Performance of ChatGPT’s Web Browsing and WebPilot

The integration of web browsing attributes and plugins into ChatGPT represents an exciting breakthrough. However, the performance of these tools, especially within the Beta version of ChatGPT Plus, has yielded diverse outcomes. Users face a choice between utilizing the web browsing feature or the ChatGPT plugin, unable to employ both simultaneously, signaling an inherent restriction.

While the web browsing feature displays potential in examining specific subjects, it isn’t without shortcomings. Following a prompt submission, the model explores the web, formulates search results, clicks links, and condenses findings. Despite its remarkable ability to generate precise search queries, the feature introduces noticeable challenges such as recurring ‘Click Failed’ notifications, accuracy discrepancies in summaries, and occasional irrelevant responses to the provided prompt.

ChatGPT’s plugins like WebPilot initially appear promising and often outperform the standard browsing feature. However, they can still offer an underwhelming user experience. The plugin store itself appears disorganized, lacking a systematic structure or a search box to locate specific plugins. Although users can enable a maximum of three plugins concurrently, this does not ensure that the model will leverage these plugins when responding to queries. Moreover, the plugins grapple with problem-solving, frequently providing inaccurate or irrelevant responses.

ChatGPT’s Self-Assessment of WebPilot

During one of our recent interactions, ChatGPT offered a candid review of the WebPilot plugin’s performance, highlighting areas of inefficiency and potential improvements.

The plugin’s sequential page analysis emerged as a primary concern, especially for websites with numerous pages. A transition to parallel processing could significantly improve analysis speed, permitting simultaneous analysis of multiple pages.

ChatGPT also raised concerns about webpage load times. For multimedia-rich sites, this can influence the site analysis duration. Hence, an update that allows the plugin to load only vital elements for SEO analysis could alleviate this problem.

Moreover, ChatGPT identified the challenge of managing dynamic website content that frequently changes based on user interaction or other factors. As it stands, WebPilot may not fully comprehend this dynamic content, potentially affecting the accuracy of the SEO analysis. Enhancements in this sector could significantly elevate the analysis quality.

Error handling was another highlighted issue. Currently, if a page fails to load or an error occurs during analysis, the plugin may halt the entire process. A more resilient error handling mechanism could allow the plugin to bypass problematic pages and proceed with the rest of the analysis.

Lastly, ChatGPT noted that the current sitemap retrieval and analysis approach could pose a potential bottleneck. Some websites feature complex sitemap structures or multiple sitemap files, slowing down the initial retrieval process. Improvements in how the plugin parses and handles sitemaps could expedite this process, enhancing overall efficiency.

In its assessment, ChatGPT displayed acute awareness of the WebPilot plugin’s current limitations. The suggested potential improvements are a promising indication of the plugin’s future development.

WebPilot and The Prompt Injection Issue

ChatGPT plugins aim to augment user interaction, enabling tasks such as web page summarization, flight booking, and more. However, recent trials have exposed vulnerabilities in these plugins, specifically the WebPilot plugin.

Known for its capacity to succinctly summarize web content, WebPilot recently found itself at the center of a data privacy concern. A security researcher, Johann Rehberger, discovered that WebPilot could absorb prompts from the text of the pages it was summarizing. These prompts could then inadvertently activate another plugin.

*To learn more about the role of prompts in LLMs, check out our блог.

In a live demonstration, Rehberger injected a prompt into a copied article that commanded the bot to search for flights from Seattle to Hawaii. When requested to summarize the article’s URL, WebPilot not only produced an accurate summary but also added a paragraph about trip planning. Without seeking further permissions, the bot activated the Expedia plugin, offering flight suggestions – a vivid illustration of the “prompt injection” issue that’s been raising eyebrows.

Prompt Injection: A New Security Threat

The scenario Rehberger created exemplifies a “prompt injection attack.” This type of attack happens when external prompts, such as hidden instructions on web pages or tampered transcripts, are consumed by one plugin, leading to the unexpected activation of a different plugin. The unsettling part is that this process occurs without explicit user consent.

In Rehberger’s demonstration, the impact was relatively harmless: the Expedia plugin merely suggested flights, requiring user engagement to proceed further. However, ponder the implications as we move forward. Some plugins, either now or in the future, might gain access to sensitive information, such as your email or bank accounts. The potential for harm in such a case is substantially greater.

Looking Forward

Given the performance and security issues present, it’s evident that ChatGPT plugins like WebPilot still have considerable room for improvement. The quest towards seamlessly integrated AI is ongoing, and it’s vital to identify, scrutinize, and tackle these challenges to make meaningful progress in the field. The subsequent steps towards refining these tools will necessitate focused attention on these issues, leading to more secure, reliable, and practical AI solutions that prioritize user experience and web accessibility.