Re: SQL Server interview questions (5) # Database
z*y
1
An incident that happened just today; sharing it here for discussion.
At 3:00 AM EST today, the DBA team got an alert that one of our main applications
had failed in London. It was 8:00 AM in London, which is peak hour. The DBA in
London is not familiar with clustering, so they asked headquarters for help.
Now it's 7:10 AM in the office. As the DBA, you need to get it solved ASAP. Your
director, CFO, CIO, and CTO are all standing behind you, watching you type.
The database was in a two-node A/A (active/active) clustering environment. Before
the incident, each node had one SQL Server instance

m*0
2
This is a production environment, so you should fix the problem first and
look for answers later, maybe in off-hours.
On a side note: if you don't have enough money, don't use a cluster, and don't
try to save money by running A/A. Clustering is for companies that have enough
money to do it with A/P (active/passive).
1) Fail back to that server if you can. Although it's running on the 2nd node,
performance is bad, and not only because of memory: the instances will fight
for CPU and, most importantly, for I/O, which is

[Quoting z***y's post:]
: An incident that happened just today; sharing it here for discussion.
: At 3:00 AM EST today, the DBA team got an alert that one of our main applications
: had failed in London. It was 8:00 AM in London, which is peak hour. The DBA in
: London is not familiar with clustering, so they asked headquarters for help.
: Now it's 7:10 AM in the office. As the DBA, you need to get it solved ASAP. Your
: director, CFO, CIO, and CTO are all standing behind you, watching you type.
: The database was in a two-node A/A (active/active) clustering environment. Before
: the incident, each node had one SQL Server instance

S*k
3
Hahahaha
Here is my thought:
Try to bring the failed node back.
Try to make more VM available on the working node.
If you cannot solve it in 30 minutes, say sorry to the people standing around and
suggest buying a new server as soon as possible.
Close the door and kick the ass of whoever set up the A/A cluster.

z*y
4
Good one!
Are there any configuration settings you would need to change down the road? Say
it's Q4 and we have run out of money or resources to do A/P...

[Quoting m***0's post:]
: This is a production environment, so you should fix the problem first and
: look for answers later, maybe in off-hours.
: On a side note: if you don't have enough money, don't use a cluster, and don't
: try to save money by running A/A. Clustering is for companies that have enough
: money to do it with A/P (active/passive).
: 1) Fail back to that server if you can. Although it's running on the 2nd node,
: performance is bad, and not only because of memory: the instances will fight
: for CPU and, most importantly, for I/O, which is

m*0
5
If you don't have the money now, then buy more memory, or at least set up a WS
server and have log shipping running. These two things won't cost you that
much, and they give you more backup options.

[Quoting z***y's post:]
: Good one!
: Are there any configuration settings you would need to change down the road? Say
: it's Q4 and we have run out of money or resources to do A/P...

z*y
6
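Good!
In real work you have to learn to make trade-offs according to the situation.
Best practice is one thing; valid practice is another. In practice, everyone
saves wherever they can. The bigger the company, the stingier it is. By Q4
every department's budget is nearly spent. No money left!
Someone above mentioned reboot, then fail back. That is the key point.
Going forward:
First, analyze why the failover happened.
Second, adjust SQL Server's max memory settings.
Third, budget for more memory or new hardware. Sometimes memory costs more than
the server itself. For example, on an HP DL6xx, 24 GB of RAM costs 20k. If
necessary, pull in a few PMs to help you make a case.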
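
For the second point, here is a rough sketch in Python of how the cap might be worked out. The 24 GB node size and the 4 GB reserved for the OS are assumptions for illustration only, not figures from this incident, and the helper names are made up. The idea is that in A/A both instances can land on one node after a failover, so each instance's cap is sized so that both still fit. The generated T-SQL uses the standard sp_configure option 'max server memory (MB)'.

```python
from typing import List


def max_server_memory_mb(node_ram_mb: int,
                         instances_on_node: int,
                         os_reserve_mb: int = 4096) -> int:
    """Per-instance memory cap so that every instance that may share
    one node after a failover fits in that node's RAM.

    os_reserve_mb is an assumed amount left for the OS and other
    processes; tune it for the real server.
    """
    if instances_on_node < 1:
        raise ValueError("instances_on_node must be at least 1")
    usable_mb = node_ram_mb - os_reserve_mb
    if usable_mb <= 0:
        raise ValueError("OS reserve leaves no memory for SQL Server")
    return usable_mb // instances_on_node


def sp_configure_statements(cap_mb: int) -> List[str]:
    """T-SQL statements to run on each instance to apply the cap."""
    return [
        "EXEC sys.sp_configure N'show advanced options', 1;",
        "RECONFIGURE;",
        f"EXEC sys.sp_configure N'max server memory (MB)', {cap_mb};",
        "RECONFIGURE;",
    ]


if __name__ == "__main__":
    # Assumed: a 24 GB node that may have to host both instances.
    cap = max_server_memory_mb(node_ram_mb=24 * 1024, instances_on_node=2)
    print("\n".join(sp_configure_statements(cap)))
```

Sizing for the worst case wastes some memory while each instance sits on its own node, but a failover then no longer starves either instance. That is the trade-off to weigh against buying more RAM.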
mjd's answer was very complete. Just my personal view.

p*l
7
Good solution.
I usually say, "You get what you pay for."

[Quoting m***0's post:]
: This is a production environment, so you should fix the problem first and
: look for answers later, maybe in off-hours.
: On a side note: if you don't have enough money, don't use a cluster, and don't
: try to save money by running A/A. Clustering is for companies that have enough
: money to do it with A/P (active/passive).
: 1) Fail back to that server if you can. Although it's running on the 2nd node,
: performance is bad, and not only because of memory: the instances will fight
: for CPU and, most importantly, for I/O, which is

w*e
8
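Our company's A/A cluster is doing just fine, no problems at all...
Last week, two instances ran on one node without any problem...
It was actually this Monday, when they were running on separate nodes, that
something went wrong. Then we rebooted and failed back, and it was fine again...
I think it's case by case.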

[Quoting m***0's post:]
: This is a production environment, so you should fix the problem first and
: look for answers later, maybe in off-hours.
: On a side note: if you don't have enough money, don't use a cluster, and don't
: try to save money by running A/A. Clustering is for companies that have enough
: money to do it with A/P (active/passive).
: 1) Fail back to that server if you can. Although it's running on the 2nd node,
: performance is bad, and not only because of memory: the instances will fight
: for CPU and, most importantly, for I/O, which is
