Quantcast
Channel: Intel Communities : Discussion List - Servers
Viewing all 3923 articles
Browse latest View live

Intel RAID Controller RS3DC080 disabled in Device Manager after reboot of Windows Server 2016 Server

$
0
0

Every time I need to restart the two Windows Server 2016 Datacenter boxes I have the RS3DC080 card installed on, the driver is disabled under Device Manager -> Storage controllers.  I have the latest 6.713.5.0 drivers for Windows 10 x64 installed per the instructions.  Any help would be appreciated as it takes a few moments to remember that disabling and re-enabling the device solves the issue (but requires manual intervention).

 

Thanks.


S2600CP very loud fan

$
0
0

I have Intel Server Board S2600CP in Intel Server Chassis P4000S with 5 fan coolers. Server two year worked correctly. But, one weak ago, 5 fan coolers started working too fast and too load. Installed one processor - Xeon E5-2603, and her temperature about 30 C.

I upgrade BIOS on last firmware: BIOS - 02.06.0005, ME - 02.01.07.328, BMC - 01.27.9958, FRUSDR - 1.11. During upgrade i set Chassis serial number, Chassis part number, System serial number and System part number correctly.

In BIOS settings set parameters: Set Throttling Mode - [Auto], Altitude -[300m or less], Set Fan Profile - [Acoustic]. For tests i set various parameters in BIOS, but 5 fan coolers worked very fast and very load all time.

Ask me please, how to diagnostic and correctly configure my server.

 

Thanks.

S5520UR - Pwr Unit Status error 0x0061

$
0
0

Good day to all.

 

When the server is turned on, the information on the monitor is not transmitted, the periphery does not work.
There is a polling of the hard drives, after that the server gradually increases the fan speed to the maximum.


In RMM3 error:

Pwr Unit Status | reports the power unit is powered off or being powered down, reports there has been a soft power control failure, reports the power unit has suffered a failure | 0x0061.

 

In logs:

22908/18/2017 13:29:16Pwr Unit StatusPower Unitreports the power unit has suffered a failure - Asserted
22808/18/2017 13:29:10Pwr Unit StatusPower Unitreports the power unit is powered off or being powered down - Asserted
22708/18/2017 13:29:05ButtonButton / Switchreports the power button has been pressed - Asserted
22608/18/2017 13:29:03ButtonButton / Switchreports the power button has been pressed - Asserted
22508/18/2017 13:28:55Pwr Unit StatusPower Unitreports there has been a soft power control failure - Asserted
22408/18/2017 13:28:50Pwr Unit StatusPower Unitreports the power unit's AC is lost - Deasserted

 

 

After turning off the button, sound signals are heard, to decode the error of the power problem.


Resetting the BIOS does not help.
Power supplies were replaced from another server - there is no result.
Changed the board into which the power supplies are inserted to the same from the production server - there is no result.

 

A month before, there was an error in the logs:

BIOS Evt SensorSystem Eventreports Timestamp Clock Sync. Event is one of two expected events from BIOS on every power on. - Asserted

After resrart the server boots up normally.

 

What can I do to restore the server?

Powering the Intel S2600TPR in a 3rd party chassis

$
0
0

I'm looking into powering the S2600TPR motherboard in a 3rd party chassis. I noticed that there are two main power connectors and an auxiliary power connector as well. I was wondering if there was an adapter to connect power to this motherboard using an ATX power supply? If so what is the part number? If there is no power adapter cable how can I go about powering it?

2 S5500BC Server Boards Both Producing DIMM B1 Uncorrectable ECC Errors

$
0
0

Hi,

 

I'm building a Windows Server 2012 R2 system using some existing server parts that we have available. I started with one S5500BC board and added the required hardware, which is an LSI 9260-8i card (this card was already in a number of old servers we have with the same board and worked fine) and I've added an Intel I350-T4 network card and a 2 port USB 3 card. I have done the exact same upgrades / additions to 2 other servers based on the same S5500BC boards and Server 2012 R2 and neither of those other 2 servers have any issues.

 

So with this build, the server lost power 2 nights in a row, windows just logging the standard 'kernel power' error, which is as if the plug was pulled (but it wasn't). After the second night of this I installed the Intel ASC and it told me there was an uncorrectable ECC issue in dim slot B1. I powered down the server and moved the RAM about and it still reported the same error in the same slot. I did this a couple of times to be sure and it always reported DIMM slot B1 as the problem, regardless of what stick was in it. I took this to mean that the motherboard itself was faulty so I replaced the whole board with another S5500BC we have spare. This is a working board that also (like the first one) has never given any reason to believe there are any problems with it.

 

After putting the second board in, I ran the ASC again and the same B1 error was present. I thought the may be the ASC calling up the old logs so I uninstalled and reinstalled it, and the memory error was gone. I checked it after around 12 hours of the server running and this remained the same. However a couple of nights ago the server again 'lost power' or just shut down ungracefully. I check the ASC again, and the same B1 error is back! On a completely different board.

 

What could be going on here? I quite urgently need to get this server stable. I have tried to update the firmwares using the S5500BC_BIOS63_BMC61_FRUSDR22_ME112 package, I ran the one boot windows flash utility (v9.7 build 21) pointing to the extracted location using the following command:

 

flashupdt -u C:\TempPath

 

But I get the following:

 

Update file configuration: XXX S5500BC,1.0

*ERROR* BMC responded with incompatible values

 

Could anyone please help? The only thing that is different about this server to the other 2 that are stable is that this one has 8 drives instead of 6. But it had 8 drives anyway during it's previous installation and there were no issues... ??

 

Thanks.

PECI over DMI Interface Error

$
0
0

Hi All,

 

Does anybody have any information about this error?

 

We have 3x Brand new S2600WTTR Intel Servers and all 3 are producing this error every few weeks:

 

SPS FW Health reports SPS Health event type FW status. PECI over DMI interface error. Recovery via CPU Host reset or platform reset. DMI timeout of PECI request.

 

This is causing a hard reset on each of the systems.

 

If anyone has any more info on what would be causing this it would be greatly appreciated

Intel Xeon Phi 7120P

$
0
0

Ihave installed software stack on centos 7 for intel xeon phi 7120p , everything are succed , micctrl -s gives output mic0 ready ( online ) , my question is how to run this coreprocessor with the other 2 intel xeon e5-2603 v3 in the host server  , so that ican manage phi as main processor .??

Cooling issues and fans blowing hard after firmware updates on S2600CP Board

$
0
0

Hi,

 

I wanted to update the firmwares on a S2600CP that I've just started supporting. I previously installed Windows Server 2012 R2 on it and configured it as a Hyper-V Host and moved some VMs on to it and that was all working fine for a couple of weeks, including everything reporting as OK in the ASC.

 

So I tried to go to the latest firmwares (02.06.0006) via the EFI Shell and it failed the BMC update, at the end it said it could not leave Firmware Update mode and go back to Operation Mode.

 

I was on very old firmwares to begin with so I stepped through around 6 firmware bundles up to the very latest. When I booted into Windows I noticed the fans did not die down. I went into the ASC and could see the information attached in the images.

 

I tried downgrading back to the firmware before, which is what it's still currently on (02.06.0005) but this has not changed the situation.

 

Could anyone please help? Thanks.


S2600CW2R Board not booting up although the server is switching ON

$
0
0

I trust you are all well.

I faced an issue suddenly with a S2600CW2R server where the system stopped showing BIOS boot and even WINDOWS boot after about one hour from shutting it down. Also, the keyboard is not anymore being detected on boot.

The server was rendering a high resolution image then after completion I have shut down the system and unplugged the power cords for security reason. I always do this.

Then after an hour I turned it ON to continue my work and I got the issue described above.

 

Any idea what steps to follow please?

 

NOTE: I tried all the below with no results!

1- Discharge the power.

2- Reset Graphic cards.

3- Removed all Graphic cards and tried to boot on the on-board VGA.

4- Used one Power Supply instead of 2.

5- Swapped Power Supplies.

6- Used another Keyboard.

7- BOOT up without a Keyboard and mouse.

 

I preferred not to go through any FIRMWARE or BIOS upgrade before receiving your feedback.

 

Regards.

Nidal.

Intel® Server System R2208GZ4GC Fans Running High/Fast

$
0
0

Why are the fans running high?  No lights on by the fans are on and the power supplies lights are green. 

 

This was running fine last week.  Tried update the firmware/BIOS to 02060006 and problem remains.

 

The logs indicate various issues of the PS1 & PS2 not having enough voltage then they are fine; CPU #2 is not connected; Fans are not redundant.   What is the real issue here?

S2600WTTR Power failure if USB Disk connected

$
0
0

Hi,

we have several new Servers with the S2600WTTR Mainboard (System BIOS - R01.01.0021 and R01.01.0022).

When we connect a USB 3.0 Disk (WD Elements 2TB), the server beeps and restarts.

 

Beep Code: 1-5-4-2

Associated Sensor: Power fault

Reason for Beep: DC power unexpectedly lost (power good dropout) -Power unit sensors report power unit failure offset...

 

In the BMC Server Health log i have this entries:

 

807/21/2017 16:11:06Pwr Unit StatusPower Unitreports the power unit has suffered a failure - On-board 5V and 3V3 Power failure, - Deasserted
22707/21/2017 16:11:05Pwr Unit StatusPower Unitreports the power unit is powered off or being powered down - Deasserted
22607/21/2017 16:11:00Pwr Unit StatusPower Unitreports the power unit is powered off or being powered down - Asserted

 

Update:

I have this problem on Servers with Vmware installed / also on Systems with Windows - Do i need to check some Bios Settings ?

If the Server stands on BIOS Screen and you connect the USB Disk, also a power fault occure and the server restarts...

 

 

Any ideas ? any help is appreciated! Thanks in advance!

After that, I'm trying to install Windows 2008 R2, but the initial level is loading from the installation disk, and then drastically goes into reboot.

$
0
0

Good afternoon Dear experts!

 

I had the following problem, and I can not figure out what it is related to.

 

I use the S5500BC motherboard, to which the RAID controller RS2BL080 is connected in the quantity of 1 pc.

 

This RAID controller worked on SATA HDD in the amount of 6 pcs.

 

After 1200 days of HDD operation, the OS with RAID ceased to run.

I checked the SATA HDD and they are all working.

 

I created a new RAID configuration of 1 or 0.

 

After that I succeeded, where it is written that VD0 is created and online.

 

After that, I'm trying to install Windows 2008 R2, but the initial level is loading from the installation disk, and then drastically goes into reboot.

 

What is the reason for this?

 

The BIOS has RAID boot enabled.

 

Can you help with this issue?

 

 

_____________________________________

 

 

 

Добрый день Уважаемые специалисты!

У меня возникла следующая проблема, и я не могу разобраться с чем она связана.

 

Я использую материнскую плату S5500BC, к которой подключен RAID-контроллер RS2BL080 в количестве 1 шт.

 

Данный RAID контроллер работал на SATA HDD в количестве 6 шт.

 

Спустя 1200 дней работы HDD, прекратился запускаться ОС с RAID.

 

Я проверил SATA HDD и они все рабочии.

 

Я создал новый RAID конфигурацию 1 или 0.

 

После чего у меня все успешно получилось, где пишется что VD0 создан и online.

 

После этого я пытаюсь произвести установку ОС Windows 2008 R2, но начальном уровне происходит загрузка с установочного диска, а потом резко уходит в перезагрузку (reboot).

 

С чем это связано?

 

В BIOS установлен загрузка RAID.

 

Можете помочь в данном вопросе?

У меня возникла следующая проблема, и я не могу разобраться с чем она связана.

 

Я использую материнускую плату S5500BC, к которой подключен RAID-контроллер RS2BL080 в количествет 1 шт.

 

Данный RAID контроллер работал на SATA HDD в количестве 6 шт.

 

Спустя 1200 дней работы HDD, прекратился запускаться ОС с RAID.

 

Я проверил SATA HDD и они все рабочие.

 

Я создал новый RAID конфигурацию 1 или 0.

 

После чего у меня все успешно получилось, где пишеться что VD0 создан и online.

 

После этого я пытаюсь произвести установку ОС Windows 2008 R2, но начальном уровне происходит загрузка с установочного диска, а потом резко уходит в перезагрузку (reboot).

 

С чем это связано?

 

В BIOS установлен загрузка RAID.

 

Можете помочь в данном вопросе?

Chassis P4304XXMUXX with S2600CW2R is suddenly shutting down

$
0
0

Dear Intel Support,

My system has following configuration: Mainboard S2600CW2R, Chassis P4304XXMUXX with two PSUs FXX750PCRPS, two CPUs Xeon E5-2637V4, Memory 8 modules each 16GB, one SSD DC S3520 Series 480GB, one HDD Seagate 3.5" 8TB ST8000NM0045 and Graphic Card ASUS GTX 1080 Ti.

OS: Windows 10 Pro. All firmware was initially updated to the newest version.

System works properly, passes all tests, OS boots and runs OK.

When starts 3D modelling software Bentley Context Capture and Graphic Card is in full load, system is suddenly shutting down with beep code 1-5-4-2.

PSUs are in FULLY_REDUNDANT state.

Looks like insufficient PSU resources.

"S2600CW_H56450_P4304XXMXXX_power budget_tool R1.1" tells that there is enough PSU resources: System Totals: 647W; Power Supply Limits: 750W.

Could you please advise me what the possible cause of system shut down and which actions to be taken to resolve situation?

Maybe to set PSUs in NON_REDUNDANT state to gain available power resources?

But I've found none information in a lot of manuals available how can I make it.

Thanks in advance.

S1200SPLR/P4000XXSFDR high speed fan throttle up for no apparent reason

$
0
0

Has anyone come a cross this issue or a similar issue.

 

Here is what we are seeing. For no reason at all that we can see the fans while throttle up to high speed and remain "screaming" until the server is restarted. The problem is random but does happen on a regular basis. The system has all the latest firmware, it is running 32GB DDR4 2133 RAM E3-1230V6 (similar problems with 1230V5). L1 support tells me it is a memory issue because the memory isn't Intel validated. The RAM in use is Crucial and is guaranteed compatible. I am sure many of us use Crucial RAM and excellent results. Intel insists that non validated memory is causing the problem yet cannot provide me with a source to get this "validated memory".  The Intel support guy is really trying his best but so far, results are mediocre.

 

Any input will be helpful.

Intel Chipset Update Blue Screen ThinkServer TD350 Xeon E5-2600 MB

$
0
0

I am in the process of setting up a ThinkServer TD350 with a Intel Xeon E52650V3 2.3Ghz Processor, and a TD-350/E5-2600 Motherboard, 16GB of 2R x 4 PC4 1700R RDIMM RAM.

 

The server is running Windows Server 2016.  BIOS and TDM (Thinkserver Deployment Manager) were both updated prior to installing the O/S and before attempting the chipset update.

 

From the device manager it shows "Intel(R) C610 series/X99 chipset SMBus Controller 8D22 Version 10.1.2.19  Dated 1/26/2016"

 

According to Lenovo's suggested downloads I am to install the Intel DD_INTEL_CHIPSET_10.1.2.77_Public_Win_x86x64.   

 

Installing this update caused the server to go Blue Screen with the error code: WHEA UNCORRECTABLE ERROR, for which the server gathered information and rebooted.

 

Lenovo suggested I download and run a (Microsoft Partnered) utility named Winthrust by Avangate/Solvusoft to clean up the registry, unused dll files etc.  The program was to have 25 free uses.  When I tried to apply the removal of said problems, it popped up a message that the free trial was expired and I had to buy the product, which I did for $30.00.  I then found out from a Malwarebytes scan that this "utility" has a red flag warning that it is malware/spyware.  I promptly removed the product. In a search I found numerous mentions that the Winthrust software utility and Avangate/Solvusoft was marketing a spyware product.

 

I have also run the Intel driver update utility.   The Intel utility displayed that there was a suspended driver installation and that it should be removed.  I allowed the Intel driver utility to clean things up.   The Intel utility then indicated that there were NO driver updates needed.

 

I again tried to install the chipset update and the system went blue screen again with the same error.  After the system rebooted I again ran the Intel driver update utility in which it again indicated that there was a suspended driver installation.

 

I now believe that the employees at Lenovo who have answered my questions about this problem do not have the correct information.  Advising customers to download known malware/spyware for use indicates that Lenovo support may not have their customer's best interests in mind.

 

 

 


PECI over DMI Interface Error

$
0
0

Hi All,

 

Does anybody have any information about this error?

 

We have 3x Brand new S2600WTTR Intel Servers and all 3 are producing this error every few weeks:

 

SPS FW Health reports SPS Health event type FW status. PECI over DMI interface error. Recovery via CPU Host reset or platform reset. DMI timeout of PECI request.

 

This is causing a hard reset on each of the systems.

 

If anyone has any more info on what would be causing this it would be greatly appreciated

what does correctable ecc asserted explicily mean?

$
0
0

I have "correctable ecc asserted" warning in the bmc of my server. This event probably lead to the server status light turned amber and blink. I wonder is this event mean only one bit error occurred in the dimm or the number of error occurred in that dimm exceeded the threshold? If it is the first case, I think it is ok, and won't lead to any server health problem. I hope someone can help me with this!

S2600CP2 defective DIMM slots?

$
0
0

I have a s2600cp2, after a hdd upgrade i seem to have lost half my RAM. 

DIMM A1, B1 and DIMM C1-2 DIMM D1-2 indicate failure by 3 long beeps then pause then 3 long beeps again before boot and the LEDs on the corresponding DIMM slots light up.

I have sticks in A1, B1, C1, D1, E1, F1, G1, H1

 

I have cross-swapped the sticks so the fault follows the Slots not the sticks. I have tried letting the system sit with no power for about 1-2 hours, also tried removing the Cmos battery for about 15 mins.

 

Hoping anyone can help indicate if the board needs to be replaced or have any other tips to fix the problem.

Problems with Purley + ESRT2 + UEFI + Linux

$
0
0

We are developing a product on a Wolf Pass Intel server, S2600WF, using Linux.

We would like to use ESRT2 built-in RAID1, which is only available under UEFI boots.

Note that under UEFI boots there are no boot-up RAID config options, such as Ctl-I.

We have booted our Linux in UEFI, and it loads megaraid_sas.ko and creates/dev/megaraid_sas_ioctl_node.

But then we run CmdTool264 to configure the RAID, and it does not see any adapters nor drives.

I believe the Intel provided binary megasr driver is only required for RAID5, so we have not loaded it.

How do you configure and admin the RAID in this case?

Thank you for any advice.

S5520UR - Pwr Unit Status error 0x0061

$
0
0

Good day to all.

 

When the server is turned on, the information on the monitor is not transmitted, the periphery does not work.
There is a polling of the hard drives, after that the server gradually increases the fan speed to the maximum.


In RMM3 error:

Pwr Unit Status | reports the power unit is powered off or being powered down, reports there has been a soft power control failure, reports the power unit has suffered a failure | 0x0061.

 

In logs:

22908/18/2017 13:29:16Pwr Unit StatusPower Unitreports the power unit has suffered a failure - Asserted
22808/18/2017 13:29:10Pwr Unit StatusPower Unitreports the power unit is powered off or being powered down - Asserted
22708/18/2017 13:29:05ButtonButton / Switchreports the power button has been pressed - Asserted
22608/18/2017 13:29:03ButtonButton / Switchreports the power button has been pressed - Asserted
22508/18/2017 13:28:55Pwr Unit StatusPower Unitreports there has been a soft power control failure - Asserted
22408/18/2017 13:28:50Pwr Unit StatusPower Unitreports the power unit's AC is lost - Deasserted

 

 

After turning off the button, sound signals are heard, to decode the error of the power problem.


Resetting the BIOS does not help.
Power supplies were replaced from another server - there is no result.
Changed the board into which the power supplies are inserted to the same from the production server - there is no result.

 

A month before, there was an error in the logs:

BIOS Evt SensorSystem Eventreports Timestamp Clock Sync. Event is one of two expected events from BIOS on every power on. - Asserted

After resrart the server boots up normally.

 

What can I do to restore the server?

Viewing all 3923 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>