You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2040 Next »

status.ncsa.illinois.edu


Watch this page in the wiki to subscribe to automatic updates to this status page.

Please do not refer to any NCSA Industry Partners on this page. Please use the iforge nomenclature for all of the *forge infrastructure.

To see older events, see Archive of NCSA Status Home

Report a problem 

Current Status  

StartEndWhat System/Service is affectedWhat is happening?What will be affected?Contact PersonStatus

2022-07-10

0700

2022-07-10

1700

Complete NCSA Building Power outageCampus is doing work on a common feed that affects multiple buildings, include the NCSA Building. Work is scheduled from 0700-1700, but may finish earlyAll NCSA central services are expected to stay online as they are hosted from NPCF. AVL, LSST, ISL and Software services will be down from Friday afternoon until Monday morning. 

lapine@illinos.edu


Local services are being shutdown now
Power off in 15 minutes


Upcoming Scheduled Maintenance

Listed below in chronological order.

StartEndWhat System/Service is affectedWhat is happening?What will be affected?Contact PersonStatus

2022-07-10

0700

2022-07-10

1700

Complete NCSA Building Power outage

(some networking and emergency services on generator)

Campus is doing work on a common feed that affects multiple buildings, include the NCSA Building. Work is scheduled from 0700-1700, but may finish earlyAll NCSA central services are expected to stay online as they are hosted from NPCF. AVL, LSST, ISL and Software services will be down from Friday afternoon until Monday morning. Once power and cooling have been restored, we'll update the status page and make announcements on slack

lapine@illinios.edu


SCHEDULED

2022-07-8

1600

2022-07-11

0900

cerberus2 and cerberus4Campus is doing work on a common feed that affects multiple buildings, include the NCSA Building. Work is scheduled from 0700-1700, but may finish earlyVM hosts running these 2 bastions will be down for the weekend due to the scheduled power work at NCSAhelp+security@ncsa.illinois.edu

SCHEDULED

2022-07-08
1700
2022-07-11
0800
LSST hosts in NCSA 3003Due to a full building power outage at NCSA on Sunday, 10 July, some LSST servers will be unavailable over the weekend. Servers will be shutdown at COB on Friday and returned to service on Monday morning.lsst-dbb-fts1
lsst-dbb-rucio
lsst-demo
lsst-dm-monitor
lsst-int-monitor
lsst-mon-dev
lsst-pup
lsst-test5
lsst-xfer
l1-cl-arctl
l1-cl-fault
l1-cl-header
nts-ccamfwdr1
nts-acamfwdr2
nts-acamfwdr1
lsst-admin@ncsa.illinois.edu

PLANNED

2022-07-20 08002022-07-20 2000ICCICC Quarterly MaintenanceAll ICC services

help@campuscluster.illinois.edu

SCHEDULED

2022-10-19 08002022-10-19 2000ICCICC Quarterly MaintenanceICC Cluster nodes only

help@campuscluster.illinois.edu

SCHEDULED


Previous Outages or Maintenance

StartEndWhat System/Service was affected?What happened?What was affected?

Contact Person

Status
2022-07-06 17302022-07-06 2030Wiki (wiki.ncsa.illinois.edu)Confluence and MySQL upgradeswiki will be down during the upgrade

COMPLETE

2022-07-05 18002022-07-05 2130NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares were unavailable during maintenance.  Users were unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing was unavailable.help@ncsa.illinois.edu

COMPLETE

2022-07-05 1800N/AiForgeend of serviceiForge was removed from service. Operations have moved to the new vForge virtual cluster.help+industry@ncsa.illinois.edu

RESOLVED

2022-06-28 07002022-06-28 1900RadiantUnexpected complications during Radiant Maintenance

Minimally disruptive, brief interruptions to OpenStack services, such as the Horizon dashboard

Longer than expected outages of controller service. Instances that had floating IPs had no networking connectivity. Horizon dashboard and API was down (cannot launch new instances, etc).

radiant-admin@ncsa.illinois.edu

RESOLVED

06-11-22 140006-14-2022 1630Granite Tape ArchiveFS was locked up due to a bug alert setting;Ingest or retrieval of data from the clusterbdickin2@illinois.edu  slack-id: briandi

RESOLVED

2022-06-02 18002022-06-07 1830NCSA Wiki ServcieDue to a critical security vulnerability announced  by Atlassian we have been forced to restrict access to the NCSA Wiki to NCSA internal networks. This restriction will remain in place until Atlassian is able to provide a patch or mitigation for the vulnerability.No remote access is allowed to the NCSA Wiki. Use the NCSA VPN for remote access. More information about using the VPN can be found here: https://users.ncsa.illinois.edu/clausen/NCSA_VPN_instructions_202206.pdfhelp@ncsa.illinois.edu

COMPLETE

2022-06-22 14302022-06-22
1900

NCSA LDAP1

replica is down

LDAP1 database server is failed. The IAM team is investigating.Only servers using ldap1 and should use ldap2tbouvet@illinois.edu

RESOLVED

2022-06-22 14302022-06-22
1600
NCSA LDAP central replicas (ldap2-3) and any services that rely on them.LDAP database servers are failed. The IAM team is investigating.Any service, such as the internal web server and Jira and Confluence servers, that rely on LDAP for user identification data may be affected.help@ncsa.illinois.edu

RESOLVED

1700

1830

Confluence (Wiki)Patching to address a security flawConfluence will not be accessiblehelp+service@ncsa.illinois.edu

COMPLETE

2022-06-02 06002022-06-02 0615NCSA GitLabGitLab was updated to latest versionAll GitLab services were unavailable for a few minutes.help+service@ncsa.illinois.edu

COMPLETE

1700

1900

JiraUpgradeJira will not be availablehelp+service@illinois.edu

COMPLETE

2022-06-01 09002022-06-01 1015Facility UPSReplace two batteries,All system with UPS feed, the UPS will stay online supporting loads but at reduced capacity and no outage expected.rantissi@illinois.edu

COMPLETE

2022-05-25 22302022-05-26 16:15Delta

3 HSN switches were experiencing problems

switches were updated and reconfigured

  • Slurm scheduler was paused to prevent new jobs from starting
  • Taiga remained unmounted
  • various nodes had no connectivity to the HSN
  • most services were experiencing some amount of degradation
help@ncsa.illinois.edu

COMPLETE

2022-05-25 18002022-05-25 
2230
Taiga - CenterWide FSPartial outage. Some projects asked to temporary unmount /taigadeltaChristopher Heller

COMPLETE

2022-05-18 07002022-05-18 1400NightingaleNightingale Planned MaintenanceAll Nightingale Serviceshelp@ncsa.illinois.edu

COMPLETE

2022-05-12 17002022-05-12 1800Jira & WikiChange to puppet configsDowntime expected on each system for 1 to 5 minuteshelp+service@ncsa.illinois.edu

COMPLETE

2022-05-10 0700

2022-05-10 1900

iForge / vForge / license serversQuarterly Planned Maintenanceall nodes and services will be unavailablehelp@ncsa.illinois.edu

COMPLETE

2022-05-10 08002022-05-10 0815cilogon.orgUpdate to OA4MP v5.2.6Improvements in the back-end servicehelp@cilogon.org

COMPLETE

2022-05-09 18002022-05-09 2130NCSA File & Print ServersScheduled Windows Server MaintenanceFile & Print Shares were unavailable during maintenance.  Users were unable to access shares on Fileserver (e.g. home, busnoff, hr, etc.), and printing was unavailable.help+service@ncsa.illinois.edu

COMPLETE

2022-05-04 10002022-05-04 1015IDDS Accounting ServicesPlanned Maintenance All IDDS services (APIs, acctd, etc)help+idds@ncsa.illinois.edu, tolbert@illinois.edu

COMPLETE

2022-05-04 06002022-05-04 0622NCSA GitLabGitLab was updated to latest versionAll GitLab services were unavailable for a few minutes.help+service@ncsa.illinois.edu

COMPLETE

2022-04-19 12:002022-04-19 12:01RadiantRestarted the AMQP service to put in some performance changesNew instance or virtual network changes that were submitted during the five-second restart may have failedradiant-admin@ncsa.illinois.edu

COMPLETE

2022 04-16 06002022 04-16 0630CILogonSeveral cilogon.org services will be updatedhttps://cilogon.org , https://crl.cilogon.org , https://demo.cilogon.org , ldaps://ldap.cilogon.orghelp@cilogon.org

COMPLETE

2022-04-14 2100

0915

JiraNew tickets cannot be created due to the user license limit being reachedCreation of new tickets.https://www.ncsa.illinois.edu/expertise/user-services/user-support/

RESOLVED

2022 04-14

0800

2022 04-14 0830Wifi, VoIP, CCTV and FS networks at NCSA.Tech services will be replacing their building router at NCSA.  They expect a 10 mins outage.  Services may see a temporary interruption as cables are being changed.help+neteng@ncsa.illinois.edu

SCHEDULED

2022 04-09 0600

2022 04-09 0700Internet2 / ESnet WAN connections.
During a few minute outage, some of our WAN circuits will be migrated.  Traffic will be automatically re-routed. help+neteng@ncsa.illinois.edu

SCHEDULED

2022-03-17 09002022-04-12
1030
jiraldap auths have been sporadically failing.  This service is being monitored to determine a root cause.Jira logins breakhelp+service@ncsa.illinois.edu

RESOLVED

2022-04-12 09002022-04-12 0930vsphere.ncsa.illinois.eduvcenter security updates are being installed vm management interface will be unavailable for 15 mins.help@ncsa.illinois.edu

COMPLETE

2022-04-07 19002022-04-07 1950NCSA VPNSoftware Upgrades / SSL CertificateThe appliances hosting the NCSA VPN were patched and receive an updated SSL certificate. Users will experience a brief disconnect as load is failed over between the appliances.neteng@ncsa.illinois.edu

RESOLVED

2022-04-06 22002022-04-07 0000Some office ports on the second floor. Once of the switches on the second floor is experiencing a software problem and is currently down.  Code updates are being applied.One of the six switches on the second floor is down.  Users who are connected to this port, might not receive link.help+neteng@ncsa.illinois.edu

RESOLVED

2022-04-06 15302022-04-07 0630All systems which mount/utilize TaigaA bug involving the multirail functionality caused constant reboots with one of the metadata servers. This resulted in cluster de-stabilization and loss of function.All lustre/NFS mountpoints to Taiga, Globus to Taiga.help@ncsa.illinois.edu

RESOLVED

2022-04-04 09302022-04-04 1000NCSA LDAPInstantiation of Delta resource OU branch in the NCSA LDAP database with replication testing.No impacts to properly configured systems or searches is expected.help@ncsa.illinois.edu

COMPLETE

2022-04-01 06002022-04-01 0700NCSA GitLabGitLab was updated to latest versionAll GitLab services was unavailable for a few minutes.help+service@ncsa.illinois.edu

COMPLETE

Legend:

IN PROGRESS

RESOLVED

SCHEDULED

MONITORING


  • No labels