...
START | END | What System/Service is affected | What is happening? | What will be affected? | Contact Person | Status |
---|---|---|---|---|---|---|
2019-10-16 | ??? | DNS1 IPv6 reachability | Due to switch issues, the DNS1 IPv6 address is not reachable. The DNS2 IPv6 address remains online. | | help+neteng@ncsa.illinois.edu | |
2019-12-17 06:00 | 2019-12-17 08:30 | JIRA | JIRA Upgrade from 7.6 to 8.5 | All JIRA users | swrights@illinois.edu | In Progress |
Upcoming Scheduled Maintenance
...
Start | End | What System/Service was affected? | What happened? | What was affected? | Contact Person | Status |
---|---|---|---|---|---|---|
2019-12-17 06:00 | 2019-12-17 10:25 | JIRA | JIRA Upgrade from 7.6 to 8.5 | All JIRA users | swrights@illinois.edu | |
2019-12-12 08:00 | 2019-12-12 14:00 | LSST | Monthly Maintenance: | ALL LSST systems will be updated, including: | lsst-admin@ncsa.illinois.edu | |
2019-12-12 10:02 | 2019-12-12 10:08 | internal.ncsa.illinois.edu | System memory was exhausted and the OOM killer started killing HTTPS connections. | Savanna tools were unavailable | help+its@ncsa.illinois.edu | Memory resources for the server were doubled and service was brought back online. |
2019-12-10 13:45 | 2019-12-10 16:55 | Internet2 Connectivity | Internet2 Engineers isolated the issue to a malformed route update coming from an external peer to one of its nodes in Ashburn, VA. As this update was propagated throughout the Internet2 Network, it triggered a bug on the Internet2 routers and caused all internal BGP sessions of each router to rapidly flap, thus causing instability across the footprint. Engineers mitigated the issue by placing a filter on the specific peer to reject the malformed packet. The Major Incident has been resolved at this point. | Many different external resources, data transfers, sessions, etc. to various destinations. | help+neteng@ncsa.illinois.edu | Connectivity has stabilized. Please report any issues should they arise. |
2019-12-2 | 2019-12-2 afternoon | Wireless network | Tech Services reports authentication issues affecting Wi-Fi and VPN. Engineers are working on the problem. Tech Services Issue Description. | NCSAnet and IllinoisNet wireless are non-functional at the moment. The NCSA wired network remains available. IllinoisNet_guest is also functional. | help+neteng@ncsa.illinois.edu | Troubleshooting in progress |
2019-11-14 18:00 | 2019-11-14 19:00 | Exit-West Router | Software Upgrades | This should not impact users. All traffic will re-route via the other router. | help+neteng@ncsa.illinois.edu | |
2019-11-14 5:00 AM | 2019-11-14 3:30 PM | Nearline Endpoint | Issue with one storage library | Some Globus transfers were stalled for the period of the outage | bw+storage@ncsa.illinois.edu | |
Nov 7 10:00 | Nov 7 14:00 | ICCP. All login nodes will be down. | Reroute some IB cables between Core switches and compute nodes. Change the topology on the Subnet Manager. | The scheduler will be paused. No user access to login nodes. All running jobs will be killed. | help@campuscluster.illinois.edu | |
2019-11-05 07:00 | 2019-11-05 16:53 | iForge | Quarterly Maintenance | All systems will be unavailable during the maintenance | iforge-admin@ncsa.illinois.edu | |
2019-10-1 | 2019-11-1 | NCSA Windows Domain Controllers | ITS migrated all Windows systems to the Campus Domain. The existing NCSA Windows Domain has been decommissioned and shut down. | NCSA Windows Systems | help+its@ncsa.illinois.edu | |
2019-10-23 8 a.m. | 2019-10-23 12:00 p.m. | Core-West | Code upgrades will be performed on the Core-West network switch. | This should not impact users. All traffic will flow through the redundant Core. | neteng+help@ncsa.illinois.edu | |
2019-10-22 06:12 | 2019-10-22 07:18 | Jira and Wiki | During reboots for system patches the wiki and Jira got stuck in a state that was not providing data to the users. | Only web access to these tools was impacted. | help+its@ncsa.illinois.edu | |
2019-10-16 08:00 | 2019-10-16 20:30 | ICC system wide | Quarterly maintenance | All services on ICC | help@campuscluster.illinois.edu | |
2019-10-16 8 a.m. | 2019-10-16 12:00 p.m. | Core-East | Code upgrades will be performed on the Core-East network switch. | This should not impact users. All traffic will flow through the redundant Core. | neteng+help@ncsa.illinois.edu | |
2019-10-15 11:45am | 2019-10-15 11:56AM | npcf-exit-east | BGP peering flapped over I2 AL2S circuit | Traffic got re-routed but some WAN services were impacted as reported by users. | help+neteng@ncsa.illinois.edu | |
2019-10-10 07:00 | 2019-10-10 07:30 | mysql.ncsa.illinois.edu | Some table repairs broke replication; this maintenance will update the replicas with newer databases so the service will work as expected again. | Wiki, JIRA, and some web sites will stop working. Email forwarding to user accounts at NCSA will be delayed during the outage. | lindsey@ncsa.illinois.edu | |
2019-10-01 | 2019-10-03 | NCSA-Print & Building Printers | Some printers are having issues connecting to the NCSA Print Server. After updating drivers on the print server, public printers are working as expected. | Printing | help+its@ncsa.illinois.edu | |
2019-10-03 6AM | 2019-10-03 7:45AM | Jira and Wiki | During reboots for system patches the wiki and Jira got stuck in a state that was not providing data to the users. | Only web access to these tools was impacted. | help+its@ncsa.illinois.edu | |
2019-10-01 7AM | 2019-10-01 8:30PM | Blue Waters | Scheduled NGA workload testing | Scheduler testing for NGA workload | David King | |
2019-10-01 10AM | 2019-10-01 12:04PM | Blue Waters | EPO 4 racks lost XDP (cooling). Cray warm swapped the racks back into the system successfully. | Scheduler; some compute nodes were missing and Gemini was rerouted | | |
2019-10-01 07:00 | 2019-10-01 07:30 | mysql.ncsa.illinois.edu | MySQL servers needed to be synchronized to convert the server in NPCF back to a replicated host. | Wiki, JIRA, and some web sites stopped working. Email forwarding to user accounts at NCSA was delayed during the outage. | lindsey@ncsa.illinois.edu | |
...