Feature Question (AI)

Even though Hazel is a powerful tool, all rules are based on rigid matches (text, name, properties). What is missing are context-based rules, e.g. via a connection to AI tools.
Is something like this planned on the roadmap?

Can you be more specific about what you expect out of such a feature? AI is thrown around a lot without any thought to whether it's solving your problem.

You're right - maybe I'll briefly describe the use case:

I have tried to automate my entire document management as much as possible.
This involves searching through static folders that always contain the same type of document, for example bank statements. These can be renamed and moved according to relatively simple rules. Further rules scan incoming documents (from email, the scanner, etc.).

Hazel rules can only recognize and rename around 40 to 50% of all incoming documents.
For the rest of the documents, however, it is not worth creating separate rules, as in this form they occur rarely or only once.
For those remaining documents, I imagine that the context of the document can be evaluated in a similar way to the analysis in ChatGPT (“Briefly summarize the content of the PDF” --> “This is an invoice from company xyz for ABC”). I could then instruct Hazel to save the document based on the feedback according to my existing naming logic:

In the (simplified) example: "Invoice xyz for ABC 2024-11.pdf" and then move it using the existing rules.
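
The naming step described above could be sketched in Python roughly as follows. This is only an illustration: the helper `build_filename` and its fields (`doc_type`, `sender`, `subject`, `period`) are hypothetical, and it assumes some earlier step has already reduced the AI summary to those fields.

```python
import re
from datetime import date


def build_filename(doc_type: str, sender: str, subject: str,
                   period: date, ext: str = ".pdf") -> str:
    """Assemble '<type> <sender> for <subject> <YYYY-MM><ext>'.

    The naming scheme is an assumption modeled on the example in the post.
    """
    def clean(part: str) -> str:
        # Replace path-like characters and collapse runs of whitespace.
        part = re.sub(r"[/:\\]", "-", part)
        return re.sub(r"\s+", " ", part).strip()

    stem = f"{clean(doc_type)} {clean(sender)} for {clean(subject)} {period:%Y-%m}"
    return stem + ext


print(build_filename("Invoice", "xyz", "ABC", date(2024, 11, 1)))
```

Keeping the name assembly separate from the AI call means the existing move rules only ever see names that follow the known pattern.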

Currently, I have to name these files by hand after checking each document individually. Afterwards my distribution rules take over again (I probably only have to file about 5% of all documents myself).

Was I able to get across what I meant?

BTW: I'm aware that this can already be done with scripts (e.g. AppleScript, Python, or shell scripts), so AI models or services can be integrated through them, for example by using APIs such as OpenAI's or other ML tools. What I mean is a more Hazel-integrated way.

Thanks for elaborating. What if ChatGPT is inconsistent with its summary or is just plain wrong? Are you willing to train the model yourself?

Mr_Noodle wrote: Thanks for elaborating. What if ChatGPT is inconsistent with its summary or is just plain wrong? Are you willing to train the model yourself?

Sure, that will happen. I would probably start by reviewing the filename before the final sorting. Not sure if I would train the LLM myself. Maybe it would be best to start with simple use cases. For all basic documents, ChatGPT returned satisfying results overall.

I think there are meaningful use cases which are far away from simple AI slogans.

I could think of general support in the application, like error detection and optimization, where the AI checks existing rules for conflicts, redundancies or inefficient processes and suggests improvements. Hazel could also analyze files and suggest rules based on user behavior or file patterns that can be adopted or adapted with a single click.

But the bigger value might be pre-configured options to connect to existing tools (GPT, Gemini, etc.), where the user simply configures the token used to access the service.

I see several pieces of information that would be valuable to get from AI analysis:

1. New "AI" date fields like creation date or due date (e.g. on invoices)
2. Sender / creator of the file
3. Category: e.g. “invoice”, “image”, “contract”, ideally from a pre-defined list (so if I have 20 categories defined and the document cannot be matched to one of them, it will be named "unknown" and I have to look at it)
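
The "pre-defined list with an unknown fallback" idea from item 3 could look something like this minimal Python sketch. The category set and the function name `normalize_category` are made up for illustration, and the raw label is assumed to come from whatever AI tool is in use.

```python
# Example list only; a real setup would hold the user's own categories.
ALLOWED_CATEGORIES = frozenset({"invoice", "image", "contract"})


def normalize_category(raw: str, allowed=ALLOWED_CATEGORIES) -> str:
    """Map a free-text label onto the allowed list, else 'unknown'."""
    # Tolerate stray whitespace, capitalization, quotes and a trailing period.
    candidate = raw.strip().lower().strip(".\"'")
    return candidate if candidate in allowed else "unknown"


print(normalize_category(" Invoice. "))
print(normalize_category("receipt"))
```

Anything the model returns outside the list ends up as "unknown", so those files stay visible for manual review instead of being filed under a label no rule knows about.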

I'm pretty sure there are many more possibilities.
The advantage would be that the current logic of Hazel could be retained but Hazel would become much more powerful.

Since I love Hazel, I just wanted to start the discussion. I think it would be great if Hazel could continue to develop and become even better known...

I think AI in its current state is better suited to smaller, more directed tasks. Extraction of various data fields is definitely an option. There are also issues with licensing and whether processing is done on device or uploaded to a server. Having the user plug in their AI of choice is an interesting idea though not sure if there's any sort of common API for that (haven't looked at it from that angle so don't really know). Right now, though, if you can script it, that would be the way to integrate it.

Any updates on this now that it's been a bit and AI has matured?

I'm looking into Apple's Foundation Models at the moment, which helps in terms of issues of deployment. Users won't have to deal with any account stuff nor do they need to download anything extra. Plus, it's all on-device so it satisfies any privacy concerns.

That said, there are issues with consistency and reliability. When it works, it's great but when it doesn't, it can fail pretty badly. I may include AI-based field extraction in the next release but it will require a good deal of testing and has a high chance of not making it to final release but we'll see. Keep an eye out in the beta forums.

Quick update: I've changed my approach slightly. I no longer want Hazel to trigger AI renaming. There are several reasons for this. The most important one is that I don't want to lose control when everything happens somewhere hidden.

So Hazel remains my main tool for juggling files and renaming defined elements. For the increasingly rare cases where I need to rename files, I've created a quick action that triggers a Python script for Gemini or Perplexity and renames the file based on a script-independent prompt. This works great and provides a meaningful name in 99.9% of cases.
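
For anyone wanting to try something similar, here is a rough Python sketch of the renaming half of such a quick action. It is not the script from the post above: `rename_with_suggestion` is a hypothetical helper, and the model call is deliberately left as a pluggable `suggest_name` function because it depends on the service and account you use.

```python
from pathlib import Path
from typing import Callable


def rename_with_suggestion(path: Path,
                           suggest_name: Callable[[Path], str]) -> Path:
    """Rename `path` to the suggested name, keeping its extension.

    `suggest_name` stands in for the AI call (hypothetical); it receives
    the file path and returns a name without extension.
    """
    stem = suggest_name(path).strip().replace("/", "-").replace(":", "-")
    if not stem:
        return path  # nothing usable came back; leave the file alone

    target = path.with_name(stem + path.suffix)
    counter = 2
    # Never overwrite an existing file; append a counter instead.
    while target.exists() and target != path:
        target = path.with_name(f"{stem} {counter}{path.suffix}")
        counter += 1

    if target != path:
        path.rename(target)
    return target
```

Because the rename only happens when you trigger the action on a file, you keep control over when the AI is involved, and Hazel's existing rules can pick up the renamed file afterwards.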

I would very much love for Hazel to add AI functionality so that I can describe in natural language what I want; Hazel then assembles the rule and I can edit or approve it.

I'm envisioning this working like Zapier's AI works. All I do in Zapier is type, for instance, "take new contact data from Google Sheets and create or update records in Hubspot".

I'd like to be able to tell Hazel "Whenever I put an image in the "RESIZE" folder, resize the image to a width of 800px and then move the image to the IMAGES folder".

Claude Code seems to be moving a lot closer to taking over what Hazel is doing.

Actually, in my last post I said "Claude Code" is moving closer to doing what Hazel does. But they just announced Claude Cowork, which does exactly what Hazel does, except for the automated running of rules, which I imagine they will add shortly.

So my question remains: will Hazel be integrating AI functionality so that I can leverage its power via natural language prompts?

At this moment no. For now, there are other products out there which you can use if that is what you want.

There are a ton of issues which make this more complicated than you think, and even if you can get it working for your situation, that does not mean it will work well as a general product.

Note that version 6.1 did have a small AI feature which no one gave feedback on when it was in beta and which ended up being nixed because it didn't work reliably enough. It used Apple's Foundation Models, which run on-device. That is a requirement because many use cases of Hazel involve processing sensitive data like SSNs and account numbers. Unfortunately, the on-device model is not up to snuff as of yet; even for the more limited use case, it had a pretty high failure rate.

Oh, I was just looking for this feature and didn't realize it was removed. I heard the foundation models improved a bit in the XX.4 releases. Any better?

I will be revisiting it over time so we'll see how it goes. I'm sure there will be some movement on this when WWDC comes around in a couple of months.