Document revision date: 19 July 1999 | |
Previous | Contents | Index |
You can define a Galaxy system so that one or more instances are not members of the Galaxy sharing community. These are known as independent instances, and they are visible to the GCU.
These independent instances can still participate in CPU reassignment.
They cannot utilize shared memory or related services.
11.2.2 Isolated Instances
It is possible for an instance to not be clustered, have no proxy
account established, and not have DECnet capability. These are known as
isolated instances. They are visible to the GCU, and
you can reassign CPUs to them. The only way to reassign resources from
an isolated instance is from the console of the isolated instance.
11.2.3 Required PROXY Access
When the GCU needs to execute a management action, it always attempts to use the SYSMAN utility first. SYSMAN requires that the involved instances be in the same cluster. If this is not the case, the GCU will next attempt to use DECnet task-to-task communications. For this to work, the involved instances must each have an Ethernet device, DECnet capability, and suitable proxy access on the target instance.
For example, consider a two-instance configuration that is not clustered. If instance 0 were running the GCU and the user attempts to reassign a CPU from instance 1 to instance 0, the actual reassignment command must be executed on instance 1. To do this, the GCU's action procedures in the file SYS$MANAGER:GCU$ACTIONS.COM will attempt to establish a DECnet task-to-task connection to the SYSTEM account on instance 1. This requires that instance 1 has granted proxy access to the SYSTEM account of instance 0. Using the established connection, the action procedure on instance 0 will pass its parameters to the equivalent action procedure on instance 1, which now treats the operation as a local operation.
The GCU action procedures assume that they will be used by the system manager. Thus, in the action procedure file SYS$MANAGER:GCU$ACTIONS.COM, the SYSTEM account is used. To grant access to the opposite instances SYSTEM account, the proxy must be set up on instance 1.
To establish proxy access:
$ SET DEFAULT SYS$SYSTEM $ RUN AUTHORIZE |
UAF> CREATE/PROXY UAF> ADD/PROXY instance::SYSTEM SYSTEM UAF> EXIT |
Replace instance with the name of the instance to which you are granting access. Perform these steps for each of the instances you want to manage from the instance on which you run the GCU. For example, in a typical two-instance Galaxy, if you run the GCU only on instance 0, then you need to add proxy access only on instance 1 for instance 0. If you intend to run the GCU on instance 1 also, then you need to add proxy access on instance 0 for instance 1. In three-instance Galaxy systems, you may need to add proxy access for each combination of instances you want to control. For this reason, a good rule of thumb is to always run the GCU from instance 0.
You are not required to use the SYSTEM account. To change the account, you need to edit SYS$MANAGER:GCU$ACTIONS.COM on each involved instance. Locate the line that establishes the task-to-task connection, and replace the SYSTEM account name with one of your choosing.
Note that the selected account must have OPER, SYSPRV, and CMKRNL
privileges. You also need to add the necessary proxy access to your
instances for this account.
11.3 Galaxy Configuration Models
The GCU is a fully programmable display engine. It uses a set of rules to learn the desired characteristics and interactive behaviors of the system components. Using this specialized configuration knowledge, the GCU assembles models that represent the relationships among system components. The GCU obtains information about the current system structure by parsing a configuration structure built by the console firmware. This structure, called the Galaxy Configuration File, is stored in memory and is updated as needed by firmware and by OpenVMS executive routines to ensure that it accurately reflects the current system configuration and state.
The GCU converts and extends the binary representation of the configuration file into a simple ASCII representation, which it can store in a file as an offline model. The GCU can later reload an offline model and alter the system configuration to match the model. Whether you are viewing the active model or an offline model, you are always free to save the current configuration as an offline Galaxy Configuration Model (.GCM) file.
To make an offline model drive the current system configuration, the model must be loaded and engaged. To engage a model, click the Engage button. The GCU will scan the current configuration file, compare it against the model, and create a list of any management actions that are required to engage the model. The GCU presents this list to you for final confirmation. If you approve, the GCU will execute the actions, and the model will become engaged to reflect the current system configuration and state.
When you disengage a model, the GCU immediately marks the CPUs and
instances as offline. You can then freely arrange the model however you
like, and either save the model, or reengage the model. In typical
practice, you are likely to have a small number of models that have
proved to be useful for your business operations. These can be engaged
by a system manager or a suitably privileged user, or through DCL
command procedures.
11.3.1 Active Model
The GCU maintains a single active model. This model is always derived
from the in-memory configuration file. The configuration file can be
from a Galaxy console or from a file-based, single-instance Galaxy on
any Alpha system. Regardless of its source, console callbacks maintain
the integrity of the file. The GCU utilizes Galaxy event services to
determine when a configuration change has occurred. When a change
occurs, the GCU parses the configuration file and updates its active
model to reflect the current system. The active model is not saved to a
file unless you choose to save it as an offline model. Typically, the
active model becomes the basis for creating additional models. When
creating models, it is generally best to do so online so that you are
sure your offline models can engage when they are needed.
11.3.2 Offline Models
The GCU can load any number of offline Galaxy configuration models and freely switch among them, assuming they were created for the specific system hardware. The model representation is a simple ASCII data definition format.
You should never need to edit a model file in its ASCII form. The GCU
models and ruleset adhere to a simple proprietary language known as the
Galaxy Configuration Language (GCL). This language continues to evolve
as needed to represent new Galaxy innovations. Beware of this fact if
you decide to explore the model and ruleset files directly. If you
accidentally corrupt a model, you can always generate another. If you
corrupt the ruleset, you may need to download another from the OpenVMS
Galaxy website.
11.3.2.1 Example: Creating an Offline Model
To create an offline Galaxy configuration model:
To engage an offline model:
The Galaxy Configuration File contains a considerable amount of configuration data and can grow quite large for complex Galaxy configurations. If the GCU displayed all the information it has about the system, the display would become unreasonably complex. To avoid this problem, the GCU provides Galaxy charts. Charts are simply a set of masks that control the visibility of the various components, devices, and interconnections. The entire component hierarchy is present, but only the components specified by the selected chart are visible. Selecting a different chart alters the visibility of component subsets.
By default, the GCU provides five preconfigured charts. Each is
designed to show a specific component relationship. Some GCU command
operations can be performed only within specific charts. For example,
you cannot reassign CPUs from within the Physical Structure chart. The
Physical Structure chart does not show the Galaxy instance components,
thus you would have no target to drag and drop a CPU on. Because you
can modify the charts the GCU does not restrict its menus and command
operations to specific chart selections. In some cases, the GCU
displays an informational message to help you select an appropriate
chart.
11.4.1 Component Identification and Display Properties
Each component has a unique identifier. This identifier can be a simple sequential number, such as with CPU IDs, a physical backplane slot number, as with I/O adapters, or a physical address, as with memory devices. Each component type is also assigned a shape and color by the GCU. Where possible, the GCU further distinguishes each component using supplementary information it gathers from the running system.
The display properties of each component are assigned within the Galaxy Configuration Ruleset (SYS$MANAGER:GALAXY.GCR). You should not edit this file, except to customize certain display properties, such as window color or display text style.
The text that gets displayed about each component is also customizable. Each component type has a set of statements in the ruleset that determine its appearance, data content, and interaction.
One useful feature is the ability to select which text is displayed in
each component type on the screen. The device declaration in the
ruleset allows you to specify the text and parameters, which make up
the display text statement. A subset of this display text is displayed
whenever the zoom scale factor does not allow the full text to be
displayed. This subset is known as the mnemonic. The mnemonic can be
altered to include any text and parameters.
11.4.2 Physical Structure Chart
The Physical Structure chart describes the physical hardware in the system. The large rectangular component at the top, or root, of the chart represents the physical system cabinet itself. Typically, below the root, you will find physical components such as modules, slots, arrays, adapters, and so on. The type of components presented and the depth of the component hierarchy is directly dependent on the level of support provided by the console firmware for each hardware platform. If you are viewing a single-instance Galaxy on any Alpha system, then only a small subset of components can be displayed. As a general rule, the console firmware presents components only down to the level of configurable devices, typically to the first-level I/O adapter or slightly beyond. It is not a goal of the GCU or of the Galaxy console firmware to map every device, but rather those that are of interest to Galaxy configuration management.
The Physical Structure chart is useful for viewing the entire collection of components in the system; however, it does not display any logical partitioning of the components.
In the Physical Structure chart you can:
The topmost component in the Physical Structure chart is known as the hardware root (HW_Root). Every Galaxy system has a single hardware root. It is useful to think of this as the physical floorplan of the machine. If a physical device has no specific lower place in the component hierarchy, it will appear as a child of the hardware root. A component that is a child can be assigned to other devices in the hierarchy when the machine is partitioned or logically defined.
Clicking the root instance of any chart will perform an auto-layout operation if the Auto Layout mode is set. |
Choose Ownership Overlay from the Windows menu to display the initial owner relationships for the various components. These relationships indicate the instance that will own the component after a power cycle. Once a system has been booted, migratable components may change owners dynamically. To alter the initial ownership, the console environment variables must be changed.
The ownership overlay has no effect on the Physical Structure chart or
the Failover Target chart.
11.4.3 Logical Structure Chart
The Logical Structure chart displays Galaxy communities and instances and is the best illustration of the relationships that form the Galaxy. Below these components are the various devices they currently own. Ownership is an important distinction between the Logical Structure chart and Physical Structure chart. In a Galaxy, resources that can be partitioned or dynamically reconfigured have two distinct "owners".
The owner describes where the device will turn up after a system power up. This value is determined by the console firmware during bus-probing procedures and through interpretation of the Galaxy environment variables. The owner values are stored in console nonvolatile memory so that they can be restored after a power cycle.
The current_owner describes the owner of a device at a particular moment in time. For example, a CPU is free to reassign among instances. As it does, its current_owner value is modified, but its owner value remains whatever it was set to by the lp_cpu_mask# environment variables.
The Logical Structure chart illustrates the current_owner
relationships. To view the nonvolatile owner relationships, select
Ownership Overlay from the Window menu.
11.4.3.1 Software Root
The topmost component in the Logical Structure chart is known as the software root (SW_Root). Every Galaxy system has a single software root. If a physical device has no specific owner, it will appear as a child of the software root. A component that has a child can be assigned to other devices in the hierarchy when the machine is logically defined.
Clicking the root instance of any chart will perform an auto layout operation if the Auto Layout mode is set. |
You can configure Galaxy partitions without assigning all devices to a partition, or you can define but not initialize one or more partitions. In either case, some hardware may be unassigned when the system boots.
The console firmware handles unassigned resources in the following manner:
Devices that remain unassigned after the system boots will appear
assigned to the software root component and may not be accessible.
11.4.3.3 Community Resources
Resources such as shared memory can be accessed by all instances within
a sharing community. Therefore, for shared memory, the community itself
is considered the owner.
11.4.3.4 Instance Resources
Resources that are currently or permanently owned by a specific
instance are displayed as children of the instance component.
11.4.4 Memory Assignment Chart
The Memory Assignment chart illustrates the partitioning and assignment of memory fragments among the Galaxy instances. This chart displays both hardware components (arrays, controllers, and so on) and software components (memory fragments).
Current Galaxy firmware and operating system software does not support
dynamic reconfiguration of memory. Therefore, the Memory Assignment
chart reflects the way the memory address space has been partitioned by
the console among the Galaxy instances. This information can be useful
for debugging system applications or for studying possible
configuration changes.
11.4.4.1 Console Fragments
The console requires one or more small fragments of memory. Typically,
a console allocates approximately 2MB of memory in the low address
range of each partition. This varies by hardware platform and firmware
revision. Additionally, some consoles allocate a small fragment in high
address space for each partition to store memory bitmaps. The console
firmware may need to create additional fragments to enforce proper
memory alignment.
11.4.4.2 Private Fragments
Each Galaxy instance is required to have at least 64MB of private
memory (includes the console fragments) to boot OpenVMS. This memory
can consist of a single fragment, or the console firmware may need to
create additional private fragments to enforce proper memory alignment.
11.4.4.3 Shared Memory Fragments
To create an OpenVMS Galaxy, a minimum of 8MB of shared memory must be
allocated. This means the minimum memory requirement for an OpenVMS
Galaxy is actually 72MB (64MB for a single instance, and 8MB for shared
memory).
11.4.5 CPU Assignment Chart
The CPU Assignment chart displays the minimal number of components
required to reassign CPUs among the Galaxy instances. This chart can be
useful for working with very large Galaxy configurations.
11.4.5.1 Primary CPU
Each primary CPU is displayed as an oval rather than a hexagon. This is
a reminder that primary CPUs cannot be reassigned or stopped. If you
attempt to drag and drop a primary CPU, the GCU displays an error
message in its status bar and does not allow the operation to occur.
11.4.5.2 Secondary CPUs
Secondary CPUs are displayed as hexagons. Secondary CPUs can be
reassigned among instances in either the Logical Structure chart or the
CPU Assignment chart. Simply drag and drop the CPU on the desired
instance. If you drop a CPU on the same instance that currently owns
it, the CPU will be stopped and restarted.
11.4.5.3 Fast Path and Affinitized CPUs
If you reassign a CPU that has a Fast Path device currently affinitized to the CPU, the affinity device will move to another CPU and the CPU reassignment will suceed. If a CPU has current process affinity assignment, the CPU cannot be reassigned.
For more information about using OpenVMS Fast Path features, see the
OpenVMS I/O User's Reference Manual.
11.4.5.4 Lost CPUs
You can reassign secondary CPUs to instances that are not yet booted (partitions).
Similarly, you can reassign a CPU to an instance that is not configured as a member of the Galaxy sharing community. In this case, you can push the CPU away from its current owner instance, but you cannot get it back unless you log in to the independent instance (a separate security domain) and reassign the CPU back to the current owner.
Regardless of whether an instance is part of the Galaxy sharing
community or is an independent instance, it will still be present in
the Galaxy configuration file; therefore, the GCU will still be able to
display it.
11.4.6 IOP Assignment Chart
The IOP Assignment chart displays the current relationship between I/O
modules and the Galaxy instances. Note that, depending on what type of
hardware platform is being used, a single-instance Galaxy on any Alpha
system may not show any I/O modules in this display.
11.4.7 Failover Target Chart
The Failover Target chart shows how each processor will automatically fail over to other instances in the event of a shutdown or failure. Additionally, this chart illustrates the state of each CPU's autostart flag.
For each instance, a set of failover objects are shown, representing the full set of potential CPUs. By default, no failover relationships are established and all autostart flags are set.
To establish automatic failover of specific CPUs, drag and drop the desired failover object to the instance you want the associated CPU to target. To set failover relationships for all CPUs owned by an instance, drag and drop the instance object on top of the instance you want the CPUs to target.
To clear individual failover targets, drag and drop a failover object back to its owner instance. To clear all failover relationships, right-click on the instance object to display the Parameters & Commands dialog box, click on the Commands button, click the "Clear ALL failover targets?", button and then click OK.
By default, whenever a failover operation occurs, the CPUs will automatically start once they arrive in the target instance. You can control this autostart function using the autostart commands found in the Parameters & Commands dialog box for each failover object, or each instance object. The Failover Target chart displays the state of the autostart flag by displaying the failover objects in green if autostart is set, and red if autostart is clear.
Please note the following restrictions in the current implementation of failover and autostart management:
Previous | Next | Contents | Index |
privacy and legal statement | ||
6512PRO_006.HTML |