Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

0800 2000
StartEndWhat System/Service is affectedWhat is happening?What will be affected?Contact PersonStatus







2022-09-08 -31 180008002022-09-01 0700NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares will be unavailable during maintenance.  Users will be unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing will be unavailable.08 2000GraniteGranite Bi-annual MaintenanceAny ingest or retrievel to/from the Archivebdickin2@illinois.edu  slack-id: briandi
set@ncsa.illinois.edu help@ncsa.illinois.edu

Status
colourBlue
titleSCHEDULED

2022-09-08 08002022-09-08 2000GraniteDelta, HOLL-I, Radiant, DES, vForgeTaiga Granite Bi-annual Planned Maintenance.Taiga mounts will be unavailable during maintenance. No data can be read or written Any ingest or retrievel to/from Taiga during the ArchivePM window.Christopher Hellerbdickin2@illinois.edu  slack-id: briandi
set@ncsa.illinois.edu 

Status
colourBlue
titleSCHEDULED

2022-09-08


0700

2022-09-08


1900

vForge / license serversQuarterly Planned Maintenanceall nodes and services will be unavailablehelp@ncsa.illinois.eduDelta, HOLL-I, Radiant, DES, vForgeTaiga Bi-annual Planned Maintenance.Taiga mounts will be unavailable during maintenance. No data can be read or written to/from Taiga during the PM window.Christopher Heller 
Status

Status

colourBlue
titleSCHEDULED

-09-08


0700

0800

-09-08


1900
vForge / license serversQuarterly Planned Maintenanceall nodes and services will be unavailable

2000

ICCICC Quarterly MaintenanceICC Cluster nodes only

help@campuscluster

help@ncsa

.illinois.edu

Status
colourBlue
titleSCHEDULED


0800


2000

ICCICC Quarterly MaintenanceICC Cluster nodes only

help@campuscluster.illinois.edu

Status
colourBlue
titleSCHEDULED

Previous Outages or Maintenance


Previous Outages or Maintenance

StartEndStartEndWhat System/Service was affected?What happened?What was affected?

Contact Person

Status
2022-08-31 18002022-09-01 0700NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares were be unavailable during maintenance.  Users were unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing was unavailable.help@ncsa.illinois.edu

Status
colourGreen
titlecomplete

1730

1830

JiraJira service will be restartedJira will not be availablehelp@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

08-24-22 183008-26-22 0800Granite Tape ArchiveFS crash and lockupA few files that were transferred into the archive shortly before the crash needed to be re-transferred.

bdickin2@illinois.edu  slack-id: briandi
set@ncsa.illinois.edu 


Status
colourGreen
titleCOMPLETE

2022-08-17
1200
n/aAll LSST hosts at NCSAServers will be shutoff and retired.All LSST servers and services at NCSA.lsst-admin@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-17 07002022-08-17 1320NightingaleQuarterly Planned MaintenanceAll Nightingale servers and services will be unavailable (other than the ngale-bastion* nodes)help@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-16 0700

2022-08-17 1305

HOLL-IQuarterly Planned Maintenance

All HOLL-I servers and services will be unavailable

2022-08-16 1505 - HOLL-I cluster return to service, but CS-2 remains offline for further work; CS-2 expected return to service by 2022-08-17 1000

2022-08-17 1305 - HOLL-I CS-2 is returned to service

help@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-09 20002022-08-09 2300Office Networks on 2nd FloorCode updates on office network switches.Office ports will be offline as switches reboot. help+neteng@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-10 20002022-08-10 2300Office Networks on 3rd FloorCode updates on office network switches.Office ports will be offline as switches reboot. help+neteng@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-11 20002022-08-11 2300Office Networks on 4th FloorCode updates on office network switches.Office ports will be offline as switches reboot. help+neteng@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-08-03
0900
2022-08-03
1000
NPCF Center-wide management firewallsSecondary firewall will be upgradedNo impact to services is anticipated.  Traffic will flow normally through the primary firewall as the secondary is upgraded.help+security@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-07-27
0940
2022-07-28
15:36
ACHE, Nightingale Several accounts have had their Covered Entity status revokedAffected users/accounts will not be able to access resources that requires Covered Entity enrollment 

help+hippa@ncsa.illinois.edu

Status
colourBlue
titleResolved

2022-07-27
0900
2022-07-27
1000
NPCF Center-wide management firewallsPrimary firewall will be upgradedNo impact to services is anticipated.  Traffic will be failed over to the secondary firewall, the primary will be updated, and then traffic will be moved back to the primary.help+security@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

0900

0915

JiraAdditional LDAP group will be added for exclusion to sync with LDAP users.In theory, nothing.help@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

0800

  2000

ICCICC Quarterly MaintenanceAll ICC services

help@campuscluster.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-07-19 07002022-07-19 0900RadiantVictoria UpdateMinimally disruptive, brief interruptions to OpenStack services, such as the Horizon dashboardradiant-admin@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-07-14
2345
2022-07-14
2359
WikiThe service will be restarted in order to increase the login timeout.Wiki will be unavailable for about 5 mins while it restarts.

Andrew Loftus 
help@ncsa.illinois.edu

Status
colourGreen
titlecomplete

2022-07-08
1700
2022-07-11
0800
LSST hosts in NCSA 3003Due to a full building power outage at NCSA on Sunday, 10 July, some LSST servers will be unavailable over the weekend. Servers will be shutdown at COB on Friday and returned to service on Monday morning.lsst-dbb-fts1
lsst-dbb-rucio
lsst-demo
lsst-dm-monitor
lsst-int-monitor
lsst-mon-dev
lsst-pup
lsst-test5
lsst-xfer
l1-cl-arctl
l1-cl-fault
l1-cl-header
nts-ccamfwdr1
nts-acamfwdr2
nts-acamfwdr1
lsst-admin@ncsa.illinois.edu

Status
colourGreen
titlecomplete

2022-07-11 08:302-22-07-11 9:30All ITSM (CMDB) VMsAll ITSM VMs are currently down. Ticket has been created to get them brought back up.Production CMDB service (openDCIM) is not availablekimber7@illinois.edu

Status
colourBlue
titleRESOLVED

2022-07-10 0700

2022-07-10 1430

NCSA building powerBuilding power feed work for multiple campus BuildingsAVL, LSST, ISL and Software standard services were down from Friday afternoon until Monday morning.Daniel Lapine 

Status
colourGreen
titleCOMPLETE

2022-07-8

1600

2022-07-11

0900

cerberus2 and cerberus4Campus is doing work on a common feed that affects multiple buildings, include the NCSA Building. Work is scheduled from 0700-1700, but may finish earlyVM hosts running these 2 bastions will be down for the weekend due to the scheduled power work at NCSAhelp+security@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-07-06 17302022-07-06 2030Wiki (wiki.ncsa.illinois.edu)Confluence and MySQL upgradeswiki will be down during the upgrade

Status
colourGreen
titleCOMPLETE

2022-07-05 18002022-07-05 2130NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares were unavailable during maintenance.  Users were unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing was unavailable.help@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-07-05 1800N/AiForgeend of serviceiForge was removed from service. Operations have moved to the new vForge virtual cluster.help+industry@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

2022-06-28 07002022-06-28 1900RadiantUnexpected complications during Radiant Maintenance

Minimally disruptive, brief interruptions to OpenStack services, such as the Horizon dashboard

Longer than expected outages of controller service. Instances that had floating IPs had no networking connectivity. Horizon dashboard and API was down (cannot launch new instances, etc).

radiant-admin@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

06-11-22 140006-14-2022 1630Granite Tape ArchiveFS was locked up due to a bug alert setting;Ingest or retrieval of data from the clusterbdickin2@illinois.edu  slack-id: briandi

Status
colourGreen
titleResolved

2022-06-02 18002022-06-07 1830NCSA Wiki ServcieDue to a critical security vulnerability announced  by Atlassian we have been forced to restrict access to the NCSA Wiki to NCSA internal networks. This restriction will remain in place until Atlassian is able to provide a patch or mitigation for the vulnerability.No remote access is allowed to the NCSA Wiki. Use the NCSA VPN for remote access. More information about using the VPN can be found here: https://users.ncsa.illinois.edu/clausen/NCSA_VPN_instructions_202206.pdfhelp@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-06-22 14302022-06-22
1900

NCSA LDAP1

replica is down

LDAP1 database server is failed. The IAM team is investigating.Only servers using ldap1 and should use ldap2tbouvet@illinois.edu

Status
colourGreen
titleRESOLVED

2022-06-22 14302022-06-22
1600
NCSA LDAP central replicas (ldap2-3) and any services that rely on them.LDAP database servers are failed. The IAM team is investigating.Any service, such as the internal web server and Jira and Confluence servers, that rely on LDAP for user identification data may be affected.help@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

1700

1830

Confluence (Wiki)Patching to address a security flawConfluence will not be accessiblehelp+service@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-06-02 06002022-06-02 0615NCSA GitLabGitLab was updated to latest versionAll GitLab services were unavailable for a few minutes.help+service@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

1700

1900

JiraUpgradeJira will not be availablehelp+service@illinois.edu

Status
colourGreen
titleCOMPLETE

2022-06-01 09002022-06-01 1015Facility UPSReplace two batteries,All system with UPS feed, the UPS will stay online supporting loads but at reduced capacity and no outage expected.rantissi@illinois.edu

Status
colourGreen
titleCOMPLETE

2022-05-25 22302022-05-26 16:15Delta

3 HSN switches were experiencing problems

switches were updated and reconfigured

  • Slurm scheduler was paused to prevent new jobs from starting
  • Taiga remained unmounted
  • various nodes had no connectivity to the HSN
  • most services were experiencing some amount of degradation
help@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-05-25 18002022-05-25 
2230
Taiga - CenterWide FSPartial outage. Some projects asked to temporary unmount /taigadeltaChristopher Heller

Status
colourGreen
titleCOMPLETE

2022-05-18 07002022-05-18 1400NightingaleNightingale Planned MaintenanceAll Nightingale Serviceshelp@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-05-12 17002022-05-12 1800Jira & WikiChange to puppet configsDowntime expected on each system for 1 to 5 minuteshelp+service@ncsa.illinois.edu

Status
colourGreen
titleCOMPLETE

2022-05-10 0700

2022-05-10 1900

iForge / vForge / license serversQuarterly Planned Maintenanceall nodes and services will be unavailablehelp@ncsa.illinois.edu

Status
colourGreen
titlecomplete

2022-05-10 08002022-05-10 0815cilogon.orgUpdate to OA4MP v5.2.6Improvements in the back-end servicehelp@cilogon.org

Status
colourGreen
titlecomplete

2022-05-09 18002022-05-09 2130NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares were unavailable during maintenance.  Users were unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing was unavailable.help+service@ncsa.illinois.edu

Status
colourGreen
titlecomplete

2022-05-04 10002022-05-04 1015IDDS Accounting ServicesPlanned Maintenance All IDDS services (APIs, acctd, etc)help+idds@ncsa.illinois.edu, tolbert@illinois.edu

Status
colourGreen
titleComplete

2022-05-04 06002022-05-04 0622NCSA GitLabGitLab was updated to latest versionAll GitLab services were unavailable for a few minutes.help+service@ncsa.illinois.edu

Status
colourGreen
titleComplete

2022-04-19 12:002022-04-19 12:01RadiantRestarted the AMQP service to put in some performance changesNew instance or virtual network changes that were submitted during the five-second restart may have failedradiant-admin@ncsa.illinois.edu

Status
colourGreen
titleComplete

2022 04-16 06002022 04-16 0630CILogonSeveral cilogon.org services will be updatedhttps://cilogon.org , https://crl.cilogon.org , https://demo.cilogon.org , ldaps://ldap.cilogon.orghelp@cilogon.org

Status
colourGreen
titleComplete

2022-04-14 2100

0915

JiraNew tickets cannot be created due to the user license limit being reachedCreation of new tickets.https://www.ncsa.illinois.edu/expertise/user-services/user-support/

Status
colourGreen
titleResolved

2022 04-14

0800

2022 04-14 0830Wifi, VoIP, CCTV and FS networks at NCSA.Tech services will be replacing their building router at NCSA.  They expect a 10 mins outage.  Services may see a temporary interruption as cables are being changed.help+neteng@ncsa.illinois.edu

Status
colourBlue
titleSCHEDULED

2022 04-09 0600

2022 04-09 0700Internet2 / ESnet WAN connections.
During a few minute outage, some of our WAN circuits will be migrated.  Traffic will be automatically re-routed. help+neteng@ncsa.illinois.edu

Status
colourBlue
titleSCHEDULED

2022-03-17 09002022-04-12
1030
jiraldap auths have been sporadically failing.  This service is being monitored to determine a root cause.Jira logins breakhelp+service@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

2022-04-12 09002022-04-12 0930vsphere.ncsa.illinois.eduvcenter security updates are being installed vm management interface will be unavailable for 15 mins.help@ncsa.illinois.edu

Status
colourGreen
titleComplete

2022-04-07 19002022-04-07 1950NCSA VPNSoftware Upgrades / SSL CertificateThe appliances hosting the NCSA VPN were patched and receive an updated SSL certificate. Users will experience a brief disconnect as load is failed over between the appliances.neteng@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

2022-04-06 22002022-04-07 0000Some office ports on the second floor. Once of the switches on the second floor is experiencing a software problem and is currently down.  Code updates are being applied.One of the six switches on the second floor is down.  Users who are connected to this port, might not receive link.help+neteng@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

2022-04-06 15302022-04-07 0630All systems which mount/utilize TaigaA bug involving the multirail functionality caused constant reboots with one of the metadata servers. This resulted in cluster de-stabilization and loss of function.All lustre/NFS mountpoints to Taiga, Globus to Taiga.help@ncsa.illinois.edu

Status
colourGreen
titleRESOLVED

2022-04-04 09302022-04-04 1000NCSA LDAPInstantiation of Delta resource OU branch in the NCSA LDAP database with replication testing.No impacts to properly configured systems or searches is expected.help@ncsa.illinois.edu

Status
colourGreen
titleComplete

2022-04-01 06002022-04-01 0700NCSA GitLabGitLab was updated to latest versionAll GitLab services was unavailable for a few minutes.help+service@ncsa.illinois.edu

Status
colourGreen
titleComplete

...