preview v9.0.0

Resiliency against failures in RabbitMQ

Problem: The module used to run into errors when RabbitMQ was temporarily unavailable.

Solution: The module is now resilient against RabbitMQ failures. It handles temporary RabbitMQ downtime (due to updates or failures) and returns to a healthy state once RabbitMQ is available again.

Equipment Management

MDM Equipment reconnects to listen for incoming messages as soon as RabbitMQ becomes available, with a delay of 10 to 30 seconds.

  • The lost connection to RabbitMQ is logged when trying to send an event.

  • The service stays alive and ready to accept requests: all read requests succeed, and all write requests fail with HTTP status code 500.

    • Sending a message is retried 3 times. If RabbitMQ is still unavailable, the message is lost (not sent). The data changes are already committed and remain saved in the database.

  • The re-established connection to RabbitMQ is logged with the first event that is sent after RabbitMQ comes back online.
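
The write behavior listed above can be illustrated with a minimal sketch. It is not the actual service code: the names `save_and_notify`, `BrokerUnavailable`, and `EventNotSent` are hypothetical, a plain list stands in for the database, and "retried 3 times" is read here as three send attempts in total, which is an assumption.

```python
import logging

logger = logging.getLogger("mdm.equipment.sketch")  # hypothetical logger name


class BrokerUnavailable(Exception):
    """Hypothetical error: the message broker cannot be reached."""


class EventNotSent(Exception):
    """Hypothetical error: all send attempts failed (would map to a 500 response)."""


def save_and_notify(database, publish, entity, attempts=3):
    """Commit the change, then try to publish its event up to `attempts` times."""
    database.append(entity)  # stands in for the committed database write
    for attempt in range(1, attempts + 1):
        try:
            publish(entity)
            return
        except BrokerUnavailable:
            # Assumption: each failed attempt is logged as a lost connection.
            logger.warning("RabbitMQ unavailable, attempt %d of %d", attempt, attempts)
    # The event is dropped, but the committed change is not undone.
    raise EventNotSent("event lost after %d attempts" % attempts)
```

In this sketch the caller sees an error although the entity is already stored, which mirrors the combination of a failed write request and committed data described above.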

Process

Any connection error or messaging failure while writing data (creating, updating, or deleting an entity) results in an exception, which in turn rolls back the transaction. No data is modified in this case. The failed request is not retried automatically: the user must send another request to retry. Further requests will succeed as soon as messaging works again (e.g., RabbitMQ is back online).
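
The rollback behavior can be sketched as follows. This is an illustration only, not the service implementation: it uses an in-memory SQLite database from the Python standard library, and the `process` table, `MessagingError`, and `create_process` are hypothetical names.

```python
import sqlite3


class MessagingError(Exception):
    """Hypothetical error: the event could not be handed to the broker."""


def create_process(conn, name, publish):
    """Insert a row and publish its event inside one transaction."""
    # `with conn` commits if the block succeeds and rolls back if it raises.
    with conn:
        conn.execute("INSERT INTO process (name) VALUES (?)", (name,))
        publish({"event": "created", "name": name})  # may raise MessagingError


def count_processes(conn):
    """Return the number of stored rows."""
    return conn.execute("SELECT COUNT(*) FROM process").fetchone()[0]


def open_database():
    """Create the in-memory database with the hypothetical table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE process (name TEXT)")
    return conn
```

If `publish` raises, the exception propagates to the caller and the insert is undone, so nothing is stored and no retry happens unless the caller issues the request again.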

Material

Any connection error or messaging failure while writing data (creating, updating, or deleting an entity) results in an exception, which in turn rolls back the transaction. No data is modified in this case. The failed request is not retried automatically: the user must send another request to retry. Further requests will succeed as soon as messaging works again (e.g., RabbitMQ is back online).
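
Because a failed write request is not retried by the service, a client that wants to retry has to do so itself. A minimal client-side sketch, assuming a hypothetical callable `send` that issues one write request and returns True on success; the attempt count and delay are illustrative values, not documented settings.

```python
import time


def retry_write(send, max_attempts=5, delay_seconds=10.0, sleep=time.sleep):
    """Call `send` until it reports success or `max_attempts` is reached.

    `send` is a hypothetical callable that issues one write request and
    returns True on success and False on failure.
    """
    for attempt in range(1, max_attempts + 1):
        if send():
            return attempt  # number of attempts that were needed
        if attempt < max_attempts:
            sleep(delay_seconds)  # give the broker time to come back
    raise RuntimeError("write still failing after %d attempts" % max_attempts)
```

Since a failed request leaves no data behind, repeating the same request in this way does not create duplicates from earlier failed attempts.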


© Robert Bosch Manufacturing Solutions GmbH 2023-2025, all rights reserved
