The Challenge: Uncategorized Expense Entries

In many organisations, uncategorized expense entries have become a persistent headache for Finance. Employees submit expenses with vague descriptions, missing cost centers, or simply select “Other” to get their claims through. Finance teams then spend days chasing missing information, decoding receipts, and manually assigning GL accounts and project codes. Month-end closes stretch out, and no one fully trusts the spend reports.

Traditional approaches—policy PDFs, training sessions, and manual review—are no longer enough. As transaction volumes grow across travel, procurement, and subscriptions, Finance can’t scale headcount just to read receipts. ERP and expense tools help enforce some rules, but they struggle with free-text descriptions, mixed-language entries, and messy receipt photos. Static rules-based engines break whenever vendors change formats or employees get creative with descriptions.

The business impact is significant. Misposted costs distort margins by product, customer, and project. Budget owners see spend data too late to act. Controllers lose days of productive time on low-value classification work instead of analysis. Poor expense visibility undermines expense control, hides policy violations, and slows decision-making—especially critical when cash and profitability are under pressure.

The good news: this problem is very solvable with today’s AI. By combining Gemini’s multimodal understanding (text + images) with your financial coding logic, you can dramatically reduce uncategorized entries and manual touchpoints. At Reruption, we’ve helped organisations build AI-first workflows around complex document and data processing. Below, we’ll walk through practical, finance-ready ways to apply Gemini to expense classification—without disrupting your core ERP landscape.

Need a sparring partner for this challenge?

Let's have a no-obligation chat and brainstorm together.

Innovators at these companies trust us:

Our Assessment

A strategic assessment of the challenge and high-level tips how to tackle it.

From Reruption’s work building AI-powered document and data workflows, we’ve seen that uncategorized expenses are almost never a tooling issue alone—they’re a process and design issue. Gemini is a powerful engine to interpret merchant names, texts, and receipt images, but its real value comes when you embed it into a clear expense control strategy, with the right guardrails around GL logic, cost centers, and approval flows.

Anchor Gemini in Your Finance Data Model, Not Just in the Expense Tool

Before you start sending receipts to Gemini, Finance needs a clear, documented data model: which GL accounts are allowed for which cost centers and projects, what typical spend patterns look like, and which categories are “high-risk” from a policy perspective. Without this backbone, even a strong AI model will produce inconsistent classifications.

Treat Gemini as an interpreter between messy real-world inputs and your structured chart of accounts. Define explicit mappings, priorities, and override rules that reflect how your controllers already think. This keeps AI outputs aligned with your accounting logic and reduces pushback from auditors and local finance teams.

Start with a Narrow, High-Volume Use Case

Rather than “AI for all expenses”, start with one narrow but high-volume area: for example travel expenses (flights, hotels, taxis) or software subscriptions. In these domains, merchant patterns and descriptions are relatively consistent, making it easier for Gemini to learn and for Finance to validate results.

This focus lets you set clear success metrics (e.g. “reduce uncategorized travel expenses from 25% to <5% in three months”) and gather feedback from a smaller group of employees and approvers. Once the pilot is stable and trusted, you extend the same patterns to other spend types.

Design for Human-in-the-Loop, Not Full Automation on Day One

For Finance leaders, the biggest risk is not that Gemini misclassifies a taxi ride—it’s losing control over the process. To manage adoption and risk, design a human-in-the-loop workflow first: Gemini proposes categories, cost centers, and policy flags; Finance (or managers) review and accept, edit, or reject.

This approach builds trust and gives you labelled feedback data to improve Gemini’s prompts and fine-tuning. Over time, you can set confidence thresholds—e.g. auto-accept when confidence > 0.9, route to review between 0.6–0.9, and block or escalate below 0.6—so automation grows where the model has proven reliable.

Prepare Your Team and Policies for AI-Augmented Expense Management

Introducing AI-driven expense classification changes who does what in the process. Controllers spend less time coding expenses and more time defining rules, testing samples, and monitoring anomalies. Employees get faster feedback on policy breaches. To avoid resistance, make these shifts explicit.

Update your expense policy with a short section on AI support: what Gemini does, how categories are suggested, what overrides are allowed, and how data is used. Train Finance staff on reading AI outputs, understanding confidence scores, and interpreting edge cases. The more comfortable they are, the faster they will lean into automation instead of bypassing it.

Build for Auditability, Traceability, and Compliance from Day One

For Finance, a powerful model is useless if it’s a black box. From the beginning, ensure that your Gemini integration stores the input (sanitised where needed), the suggested classification, the reasoning (where technically possible via prompts), the confidence score, and the final human decision.

This traceability gives auditors comfort and lets you run periodic back-testing: How often does Gemini match final postings? In which categories does it struggle? Combined with clear data protection and access controls, this helps you meet internal controls, compliance, and works council requirements while still reaping the efficiency gains.

Used deliberately, Gemini can turn uncategorized expense entries from a chronic irritant into a largely automated, auditable workflow. The key is not just calling an API, but aligning the model with your chart of accounts, policies, and people. At Reruption, we specialise in building these AI-first finance processes end-to-end—from proof-of-concept to production-grade integrations. If you want to see what Gemini could do on your real expense data, we’re ready to help you test it in a focused, low-risk way.

Need help implementing these ideas?

Feel free to reach out to us with no obligation.

Real-World Case Studies

From Energy to Shipping: Learn how companies successfully use Gemini.

BP

Energy

BP, a global energy leader in oil, gas, and renewables, grappled with high energy costs during peak periods across its extensive assets. Volatile grid demands and price spikes during high-consumption times strained operations, exacerbating inefficiencies in energy production and consumption. Integrating intermittent renewable sources added forecasting challenges, while traditional management failed to dynamically respond to real-time market signals, leading to substantial financial losses and grid instability risks . Compounding this, BP's diverse portfolio—from offshore platforms to data-heavy exploration—faced data silos and legacy systems ill-equipped for predictive analytics. Peak energy expenses not only eroded margins but hindered the transition to sustainable operations amid rising regulatory pressures for emissions reduction. The company needed a solution to shift loads intelligently and monetize flexibility in energy markets .

Lösung

To tackle these issues, BP acquired Open Energi in 2021, gaining access to its flagship Plato AI platform, which employs machine learning for predictive analytics and real-time optimization. Plato analyzes vast datasets from assets, weather, and grid signals to forecast peaks and automate demand response, shifting non-critical loads to off-peak times while participating in frequency response services . Integrated into BP's operations, the AI enables dynamic containment and flexibility markets, optimizing consumption without disrupting production. Combined with BP's internal AI for exploration and simulation, it provides end-to-end visibility, reducing reliance on fossil fuels during peaks and enhancing renewable integration . This acquisition marked a strategic pivot, blending Open Energi's demand-side expertise with BP's supply-side scale.

Ergebnisse

  • $10 million in annual energy savings
  • 80+ MW of energy assets under flexible management
  • Strongest oil exploration performance in years via AI
  • Material boost in electricity demand optimization
  • Reduced peak grid costs through dynamic response
  • Enhanced asset efficiency across oil, gas, renewables
Read case study →

Morgan Stanley

Banking

Financial advisors at Morgan Stanley struggled with rapid access to the firm's extensive proprietary research database, comprising over 350,000 documents spanning decades of institutional knowledge. Manual searches through this vast repository were time-intensive, often taking 30 minutes or more per query, hindering advisors' ability to deliver timely, personalized advice during client interactions . This bottleneck limited scalability in wealth management, where high-net-worth clients demand immediate, data-driven insights amid volatile markets. Additionally, the sheer volume of unstructured data—40 million words of research reports—made it challenging to synthesize relevant information quickly, risking suboptimal recommendations and reduced client satisfaction. Advisors needed a solution to democratize access to this 'goldmine' of intelligence without extensive training or technical expertise .

Lösung

Morgan Stanley partnered with OpenAI to develop AI @ Morgan Stanley Debrief, a GPT-4-powered generative AI chatbot tailored for wealth management advisors. The tool uses retrieval-augmented generation (RAG) to securely query the firm's proprietary research database, providing instant, context-aware responses grounded in verified sources . Implemented as a conversational assistant, Debrief allows advisors to ask natural-language questions like 'What are the risks of investing in AI stocks?' and receive synthesized answers with citations, eliminating manual digging. Rigorous AI evaluations and human oversight ensure accuracy, with custom fine-tuning to align with Morgan Stanley's institutional knowledge . This approach overcame data silos and enabled seamless integration into advisors' workflows.

Ergebnisse

  • 98% adoption rate among wealth management advisors
  • Access for nearly 50% of Morgan Stanley's total employees
  • Queries answered in seconds vs. 30+ minutes manually
  • Over 350,000 proprietary research documents indexed
  • 60% employee access at peers like JPMorgan for comparison
  • Significant productivity gains reported by CAO
Read case study →

Unilever

Human Resources

Unilever, a consumer goods giant handling 1.8 million job applications annually, struggled with a manual recruitment process that was extremely time-consuming and inefficient . Traditional methods took up to four months to fill positions, overburdening recruiters and delaying talent acquisition across its global operations . The process also risked unconscious biases in CV screening and interviews, limiting workforce diversity and potentially overlooking qualified candidates from underrepresented groups . High volumes made it impossible to assess every applicant thoroughly, leading to high costs estimated at millions annually and inconsistent hiring quality . Unilever needed a scalable, fair system to streamline early-stage screening while maintaining psychometric rigor.

Lösung

Unilever adopted an AI-powered recruitment funnel partnering with Pymetrics for neuroscience-based gamified assessments that measure cognitive, emotional, and behavioral traits via ML algorithms trained on diverse global data . This was followed by AI-analyzed video interviews using computer vision and NLP to evaluate body language, facial expressions, tone of voice, and word choice objectively . Applications were anonymized to minimize bias, with AI shortlisting top 10-20% of candidates for human review, integrating psychometric ML models for personality profiling . The system was piloted in high-volume entry-level roles before global rollout .

Ergebnisse

  • Time-to-hire: 90% reduction (4 months to 4 weeks)
  • Recruiter time saved: 50,000 hours
  • Annual cost savings: £1 million
  • Diversity hires increase: 16% (incl. neuro-atypical candidates)
  • Candidates shortlisted for humans: 90% reduction
  • Applications processed: 1.8 million/year
Read case study →

AstraZeneca

Healthcare

In the highly regulated pharmaceutical industry, AstraZeneca faced immense pressure to accelerate drug discovery and clinical trials, which traditionally take 10-15 years and cost billions, with low success rates of under 10%. Data silos, stringent compliance requirements (e.g., FDA regulations), and manual knowledge work hindered efficiency across R&D and business units. Researchers struggled with analyzing vast datasets from 3D imaging, literature reviews, and protocol drafting, leading to delays in bringing therapies to patients. Scaling AI was complicated by data privacy concerns, integration into legacy systems, and ensuring AI outputs were reliable in a high-stakes environment. Without rapid adoption, AstraZeneca risked falling behind competitors leveraging AI for faster innovation toward 2030 ambitions of novel medicines.

Lösung

AstraZeneca launched an enterprise-wide generative AI strategy, deploying ChatGPT Enterprise customized for pharma workflows. This included AI assistants for 3D molecular imaging analysis, automated clinical trial protocol drafting, and knowledge synthesis from scientific literature. They partnered with OpenAI for secure, scalable LLMs and invested in training: ~12,000 employees across R&D and functions completed GenAI programs by mid-2025. Infrastructure upgrades, like AMD Instinct MI300X GPUs, optimized model training. Governance frameworks ensured compliance, with human-in-loop validation for critical tasks. Rollout phased from pilots in 2023-2024 to full scaling in 2025, focusing on R&D acceleration via GenAI for molecule design and real-world evidence analysis.

Ergebnisse

  • ~12,000 employees trained on generative AI by mid-2025
  • 85-93% of staff reported productivity gains
  • 80% of medical writers found AI protocol drafts useful
  • Significant reduction in life sciences model training time via MI300X GPUs
  • High AI maturity ranking per IMD Index (top global)
  • GenAI enabling faster trial design and dose selection
Read case study →

Cleveland Clinic

Healthcare

At Cleveland Clinic, one of the largest academic medical centers, physicians grappled with a heavy documentation burden, spending up to 2 hours per day on electronic health record (EHR) notes, which detracted from patient care time. This issue was compounded by the challenge of timely sepsis identification, a condition responsible for nearly 350,000 U.S. deaths annually, where subtle early symptoms often evade traditional monitoring, leading to delayed antibiotics and 20-30% mortality rates in severe cases. Sepsis detection relied on manual vital sign checks and clinician judgment, frequently missing signals 6-12 hours before onset. Integrating unstructured data like clinical notes was manual and inconsistent, exacerbating risks in high-volume ICUs.

Lösung

Cleveland Clinic piloted Bayesian Health’s AI platform, a predictive analytics tool that processes structured and unstructured data (vitals, labs, notes) via machine learning to forecast sepsis risk up to 12 hours early, generating real-time EHR alerts for clinicians. The system uses advanced NLP to mine clinical documentation for subtle indicators. Complementing this, the Clinic explored ambient AI solutions like speech-to-text systems (e.g., similar to Nuance DAX or Abridge), which passively listen to doctor-patient conversations, apply NLP for transcription and summarization, auto-populating EHR notes to cut documentation time by 50% or more. These were integrated into workflows to address both prediction and admin burdens.

Ergebnisse

  • 12 hours earlier sepsis prediction
  • 32% increase in early detection rate
  • 87% sensitivity and specificity in AI models
  • 50% reduction in physician documentation time
  • 17% fewer false positives vs. physician alone
  • Expanded to full rollout post-pilot (Sep 2025)
Read case study →

Best Practices

Successful implementations follow proven patterns. Have a look at our tactical advice to get started.

Use Multimodal Inputs: Combine Merchant, Text, and Receipt Images

Gemini’s strength for Finance is its ability to interpret multiple input types together. For each expense line, send merchant name, transaction amount, currency, employee free-text description, and a receipt image (where available). This helps Gemini distinguish, for example, between a hotel’s restaurant and room charges, or between personal and business items on the same receipt.

Structure your API payload to keep financial metadata separate from raw text, so prompts can explicitly reference them. Where your expense tool already extracts some OCR data from receipts, pass both the original image and the OCR text so Gemini can correct or enrich it.

Example Gemini prompt template (conceptual):
You are an expense classification assistant for the Finance department.
You receive: merchant, amount, date, employee description, and a receipt image.

Tasks:
1. Propose the most likely expense category (GL account) from this list: [...].
2. Propose cost center and project, if clearly inferable.
3. Flag potential policy violations (e.g. weekend spend, first-class travel).
4. Return a JSON object with fields: category_code, cost_center, project,
   policy_flags[], confidence_score, reasoning.

Now classify the following expense:
MERCHANT: {{merchant}}
AMOUNT: {{amount}} {{currency}}
DATE: {{date}}
DESCRIPTION: {{description}}
RECEIPT_IMAGE: <binary or URL>

Standardise Outputs with JSON Schemas and Confidence Thresholds

For reliable downstream posting, enforce a strict JSON schema for Gemini responses. Define required fields (e.g. category_code, confidence_score), permissible values (e.g. list of GL accounts), and default behaviours when the model is unsure.

On the consuming side (your middleware or expense tool), implement confidence thresholds. For example, if confidence_score >= 0.9, auto-apply the classification; if 0.7–0.9, route to a controller’s review queue with Gemini’s reasoning pre-filled; if < 0.7, keep the line as “uncategorized” but attach the suggestion for faster manual handling.

Expected Gemini response structure:
{
  "category_code": "6130_TRAVEL_HOTEL",
  "cost_center": "CC_102_MKT_DE",
  "project": "PRJ_4567_CAMPAIGN_Q4",
  "policy_flags": ["WEEKEND", "NO_APPROVAL_FOUND"],
  "confidence_score": 0.92,
  "reasoning": "Merchant is a hotel chain, date matches conference…"
}

Integrate Gemini into Existing Expense and ERP Workflows

Instead of building a separate tool, embed Gemini classification into the workflows your employees and controllers already use. Typical patterns include a middleware service between the expense app and ERP, or an API extension inside the expense management system.

Implementation sequence could look like this: (1) Employee submits an expense as usual; (2) Webhook triggers a Gemini API call with all inputs; (3) Middleware writes back suggested category, cost center, and flags into custom fields; (4) Approver or controller sees the pre-filled suggestions and can accept or adjust; (5) Final data posts to the ERP. This minimises change management and speeds up adoption.

Continuously Retrain Prompts with Feedback from Finance

Set up a simple loop where Finance feedback directly improves Gemini’s performance. Whenever a controller overrides a category or cost center, log both Gemini’s suggestion and the final choice. Regularly sample these overrides and update your prompt instructions (e.g. “for <merchant> in Germany, prefer cost center X unless description mentions Y”).

Even without model fine-tuning, iterative prompt refinement can move accuracy from “roughly helpful” to “good enough to automate most cases”. Schedule monthly prompt reviews with Finance and your technical team to adjust rules, add new merchant patterns, and refine policy checks.

Prompt refinement snippet:
Previous errors: Taxi rides for client visits were coded as "office travel".
Update instruction:
- If description contains words like "client", "customer", "meeting" and
  merchant type is taxi/ride-sharing, classify under
  6145_CLIENT_VISIT_TRAVEL instead of 6140_INTERNAL_TRAVEL.

Use Gemini for Policy Violation Detection and Anomaly Flags

Beyond basic categorisation, use Gemini to flag policy violations and anomalies in real time. For example, detect first-class or business-class fares from the ticket image, weekend or holiday expenses, or duplicate receipts submitted across different reports.

Design prompts that explicitly ask Gemini to reason about context: time of day, location vs. employee office, unusual amounts relative to typical spend for that merchant. Route flagged items into a specific queue with clear labels so controllers can quickly decide whether to approve, reject, or request more information.

Example policy check snippet:
In addition to classification, check for:
- Travel class (economy vs business/first) from the ticket or receipt.
- Weekend or public holiday dates.
- Multiple similar receipts in a short time window.
Return a "policy_flags" array with reasons, e.g. [
  "BUSINESS_CLASS_FLIGHT",
  "WEEKEND_EXPENSE"
]

Define KPIs and Dashboards to Track Impact

To prove value, track a small set of expense automation KPIs before and after Gemini deployment. Common metrics include: percentage of uncategorized lines, average time from submission to posting, manual touch rate per expense, and reclassification rate after monthly close.

Visualise these KPIs in your existing BI or ERP reporting, ideally by department and country. This helps you see where the model performs well, where additional training is needed, and where process issues (not AI) are causing delays. Use these insights to prioritise future improvements and justify further investment in AI-enabled Finance workflows.

Implemented pragmatically, Finance teams typically see 30–60% fewer uncategorized entries within the first 2–3 months, a 20–40% reduction in manual expense coding effort, and materially faster visibility into spend by cost center and project—enough to make a noticeable difference in month-end closing and budget steering.

Need implementation expertise now?

Let's talk about your ideas!

Frequently Asked Questions

Gemini combines text and image understanding to interpret what an expense really represents. It reads merchant names, employee descriptions, transaction amounts, and even receipt photos, then maps them to your predefined expense categories, cost centers, and projects.

Instead of employees guessing a category or choosing “Other”, Gemini proposes a concrete GL account and coding suggestion in real time. Finance can review and adjust these suggestions, and over time you can automate high-confidence cases, dramatically reducing the number of uncategorized lines that reach controllers.

You typically need three capabilities: (1) a Finance lead who understands your chart of accounts and expense policies, (2) a technical owner who can work with APIs or your expense tool’s integration layer, and (3) someone to handle security, data protection, and access control.

You do not need a large data science team to get started. A small cross-functional squad—Finance, IT, and one engineer—can set up a first Gemini-based classification pilot in a few weeks, especially if you use existing middleware or iPaaS tools as the integration backbone.

For a focused use case (e.g. travel expenses), organisations usually see tangible results within 4–8 weeks. In the first weeks, you’ll configure prompts, wire up the API, and run Gemini in suggestion-only mode alongside your current process.

Once Finance is comfortable with the quality of suggestions, you can start auto-applying high-confidence classifications. At that point, you should see a clear reduction in uncategorized entries and manual coding time. Expanding to additional spend categories is typically faster, because you can reuse the same technical foundation and only adapt the financial logic.

Cost has two components: (1) Gemini usage costs (API calls, which are typically low per transaction) and (2) your one-time integration and change effort. For most mid-sized and larger organisations, the main driver is internal or partner implementation time, not the model usage cost.

ROI comes from reduced manual classification work, faster month-end closing, fewer reclassifications, and better visibility into spend (enabling concrete savings actions in travel, procurement, and subscriptions). In practice, even a modest reduction of a few FTE-days per month plus avoided mispostings and better spend steering usually outweighs the ongoing cost of running Gemini.

Reruption supports you end-to-end, from idea to working solution. With our AI PoC offering (9,900€), we first validate that Gemini can reliably classify your real expense data: we define the use case, build a prototype that connects to sample receipts and ledger data, and measure accuracy, speed, and cost per run.

Beyond the PoC, our Co-Preneur approach means we embed with your Finance and IT teams, design the target workflow, implement the Gemini integration, and help set up KPIs, monitoring, and governance. We don’t stop at slides—we build and ship the actual automation inside your P&L so that uncategorized expenses become the exception, not the norm.

Contact Us!

0/10 min.

Contact Directly

Your Contact

Philipp M. W. Hoffmann

Founder & Partner

Address

Reruption GmbH

Falkertstraße 2

70176 Stuttgart

Social Media