Background

Guizhou Power Grid Information Centre, as a digital transformation backbone for the Southern Power Grid Company, faces immense pressure in maintaining its extensive network and systems. With the shift towards cloud integration and microservice transformation, the centre has prioritised the development of a robust IT infrastructure. This includes transitioning to a multi-cloud environment to enhance flexibility and resource utilisation across different cloud platforms, such as Huawei Cloud and VMware.

Challenges:

  • Siloed Systems: The presence of numerous siloed or “stovepipe systems” created disjointed operational efforts, making unified management difficult.
  • Complex Configuration Management: Disorganised configuration management hindered effective operational troubleshooting and maintenance, particularly as systems became more integrated into various cloud services.
  • Microservice Maintenance Difficulties: The microservice architecture, while flexible, introduced complexities in maintenance and problem localisation, exacerbated by the multi-cloud setup.

Results:

  • Centralised Data Aggregation: By centralising the collection and management of operational data from various IT systems, the platform has streamlined data accessibility and analysis across cloud environments.
  • Operational Data Platform: A dedicated platform for processing vast amounts of operational and log data has been established, enhancing the responsiveness and efficiency of the IT operations.
  • Scenario-Based Operation Management: Detailed analysis of resource and PaaS component health, along with predictive algorithms for capacity and critical metrics, has improved the proactive management of IT resources.
  • Centralised Monitoring: Implementing a centralised monitoring system allows for a unified view of all IT resource health across the multi-cloud environment, enabling better decision-making and quicker response to potential issues.