mysql - mysqld out of memory with plenty (?) of memory -
i have busy server running percona 5.6.34-79.1-1.xenial on ubuntu 16.04. works great, however, every few weeks, mysqld shot out-of-memory killer , i'm not able find out why.
root@master02:~# grep out /var/log/syslog apr 6 13:37:03 master02 kernel: [17420955.874564] out of memory: kill process 36138 (mysqld) score 659 or sacrifice child so killed at 13:37:03.
however, 2 seconds before, using around 110 gb ram (the system has ~160 gb ram), 55 gb free:
root@master02:~# cat /root/logs/free/free-2017-04-06-13\:36\:01.log total used free shared buff/cache available mem: 165050752 109560372 593508 189240 54896872 54434632 swap: 0 0 0 root@master02:~# cat /root/logs/free/free-2017-04-06-13\:37\:01.log total used free shared buff/cache available mem: 165050752 109582416 602704 189624 54865632 54412072 swap: 0 0 0 root@master02:~# cat /root/logs/free/free-2017-04-06-13\:38\:01.log total used free shared buff/cache available mem: 165050752 17982728 92226488 189200 54841536 146007904 swap: 0 0 0 my.cnf has "innodb-buffer-pool-size = 130g" set. don't find mysqld allocated 50 gb in 2 secs , killed (though may wrong of course).
here dmesg showing full oom - memory allocation issue tunable? i'd appreciate hints here.
[17420955.874279] mysqld invoked oom-killer: gfp_mask=0x26000c0, order=2, oom_score_adj=0 [17420955.874282] mysqld cpuset=/ mems_allowed=0-1 [17420955.874287] cpu: 4 pid: 36138 comm: mysqld not tainted 4.4.0-59-generic #80-ubuntu [17420955.874288] hardware name: xen hvm domu, bios 4.2.amazon 11/11/2016 [17420955.874290] 0000000000000286 00000000dba8c85b ffff88142f4bbaf0 ffffffff813f7583 [17420955.874292] ffff88142f4bbcc8 ffff8827ac8db800 ffff88142f4bbb60 ffffffff8120ad5e [17420955.874295] ffffffff81cd2dc7 0000000000000000 ffffffff81e67760 0000000000000206 [17420955.874297] call trace: [17420955.874304] [<ffffffff813f7583>] dump_stack+0x63/0x90 [17420955.874307] [<ffffffff8120ad5e>] dump_header+0x5a/0x1c5 [17420955.874311] [<ffffffff81192722>] oom_kill_process+0x202/0x3c0 [17420955.874312] [<ffffffff81192b49>] out_of_memory+0x219/0x460 [17420955.874315] [<ffffffff81198abd>] __alloc_pages_slowpath.constprop.88+0x8fd/0xa70 [17420955.874317] [<ffffffff81198eb6>] __alloc_pages_nodemask+0x286/0x2a0 [17420955.874319] [<ffffffff81198f6b>] alloc_kmem_pages_node+0x4b/0xc0 [17420955.874323] [<ffffffff8107ea5e>] copy_process+0x1be/0x1b70 [17420955.874326] [<ffffffff811c164d>] ? handle_mm_fault+0xcbd/0x1820 [17420955.874328] [<ffffffff810805a0>] _do_fork+0x80/0x360 [17420955.874329] [<ffffffff81080929>] sys_clone+0x19/0x20 [17420955.874333] [<ffffffff818384f2>] entry_syscall_64_fastpath+0x16/0x71 [17420955.874343] mem-info: [17420955.874354] active_anon:27160197 inactive_anon:28926 isolated_anon:0 active_file:5497699 inactive_file:7563747 isolated_file:0 unevictable:914 dirty:2486 writeback:0 unstable:0 slab_reclaimable:556865 slab_unreclaimable:45056 mapped:20876 shmem:47414 pagetables:71927 bounce:0 free:154548 free_pcp:64 free_cma:0 [17420955.874357] node 0 dma free:15904kb min:8kb low:8kb high:12kb active_anon:0kb inactive_anon:0kb active_file:0kb inactive_file:0kb unevictable:0kb isolated(anon):0kb isolated(file):0kb present:15988kb managed:15904kb mlocked:0kb dirty:0kb writeback:0kb mapped:0kb shmem:0kb slab_reclaimable:0kb slab_unreclaimable:0kb kernel_stack:0kb pagetables:0kb unstable:0kb bounce:0kb free_pcp:0kb local_pcp:0kb free_cma:0kb writeback_tmp:0kb pages_scanned:0 all_unreclaimable? yes [17420955.874361] lowmem_reserve[]: 0 3706 80497 80497 80497 [17420955.874370] node 0 dma32 free:311248kb min:2072kb low:2588kb high:3108kb active_anon:3411180kb inactive_anon:1348kb active_file:4kb inactive_file:8kb unevictable:280kb isolated(anon):0kb isolated(file):0kb present:3915776kb managed:3835152kb mlocked:280kb dirty:0kb writeback:0kb mapped:524kb shmem:2308kb slab_reclaimable:65844kb slab_unreclaimable:14292kb kernel_stack:1760kb pagetables:13784kb unstable:0kb bounce:0kb free_pcp:0kb local_pcp:0kb free_cma:0kb writeback_tmp:0kb pages_scanned:0 all_unreclaimable? no [17420955.874373] lowmem_reserve[]: 0 0 76791 76791 76791 [17420955.874376] node 0 normal free:143060kb min:42940kb low:53672kb high:64408kb active_anon:62267420kb inactive_anon:31360kb active_file:7207656kb inactive_file:7550284kb unevictable:3296kb isolated(anon):0kb isolated(file):0kb present:79953920kb managed:78634400kb mlocked:3296kb dirty:3116kb writeback:0kb mapped:18940kb shmem:49312kb slab_reclaimable:838024kb slab_unreclaimable:78600kb kernel_stack:51840kb pagetables:159192kb unstable:0kb bounce:0kb free_pcp:136kb local_pcp:0kb free_cma:0kb writeback_tmp:0kb pages_scanned:76 all_unreclaimable? no [17420955.874380] lowmem_reserve[]: 0 0 0 0 0 [17420955.874383] node 1 normal free:147980kb min:45088kb low:56360kb high:67632kb active_anon:42962188kb inactive_anon:82996kb active_file:14783136kb inactive_file:22704696kb unevictable:80kb isolated(anon):0kb isolated(file):0kb present:83886080kb managed:82565296kb mlocked:80kb dirty:6828kb writeback:0kb mapped:64040kb shmem:138036kb slab_reclaimable:1323592kb slab_unreclaimable:87332kb kernel_stack:48624kb pagetables:114732kb unstable:0kb bounce:0kb free_pcp:120kb local_pcp:0kb free_cma:0kb writeback_tmp:0kb pages_scanned:0 all_unreclaimable? no [17420955.874387] lowmem_reserve[]: 0 0 0 0 0 [17420955.874390] node 0 dma: 0*4kb 0*8kb 0*16kb 1*32kb (u) 2*64kb (u) 1*128kb (u) 1*256kb (u) 0*512kb 1*1024kb (u) 1*2048kb (m) 3*4096kb (m) = 15904kb [17420955.874398] node 0 dma32: 13480*4kb (ume) 6994*8kb (ume) 2710*16kb (ume) 714*32kb (umeh) 354*64kb (umeh) 225*128kb (umeh) 107*256kb (umeh) 52*512kb (ume) 29*1024kb (umeh) 0*2048kb 0*4096kb = 311248kb [17420955.874408] node 0 normal: 29953*4kb (umeh) 2985*8kb (umeh) 1*16kb (h) 0*32kb 2*64kb (h) 2*128kb (h) 2*256kb (h) 0*512kb 1*1024kb (h) 0*2048kb 0*4096kb = 145628kb [17420955.874416] node 1 normal: 36438*4kb (ume) 412*8kb (umeh) 2*16kb (h) 2*32kb (h) 0*64kb 1*128kb (h) 1*256kb (h) 2*512kb (h) 0*1024kb 0*2048kb 0*4096kb = 150552kb [17420955.874425] node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kb [17420955.874427] node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kb [17420955.874428] node 1 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kb [17420955.874429] node 1 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kb [17420955.874430] 13109543 total pagecache pages [17420955.874431] 0 pages in swap cache [17420955.874440] swap cache stats: add 0, delete 0, find 0/0 [17420955.874441] free swap = 0kb [17420955.874442] total swap = 0kb [17420955.874443] 41942941 pages ram [17420955.874444] 0 pages highmem/movableonly [17420955.874444] 680253 pages reserved [17420955.874445] 0 pages cma reserved [17420955.874446] 0 pages hwpoisoned [17420955.874447] [ pid ] uid tgid total_vm rss nr_ptes nr_pmds swapents oom_score_adj name [17420955.874457] [ 677] 0 677 38915 23334 80 3 0 0 systemd-journal [17420955.874461] [ 698] 0 698 25742 47 17 3 0 0 lvmetad [17420955.874469] [ 729] 0 729 11545 1219 23 3 0 -1000 systemd-udevd [17420955.874472] [ 1112] 109 1112 25081 492 19 3 0 0 systemd-timesyn [17420955.874474] [ 1333] 0 1333 4030 637 12 3 0 0 dhclient [17420955.874475] [ 1541] 0 1541 6932 547 19 3 0 0 cron [17420955.874477] [ 1552] 0 1552 365572 5403 79 7 0 0 snapd [17420955.874479] [ 1559] 102 1559 10725 667 26 3 0 -900 dbus-daemon [17420955.874480] [ 1570] 0 1570 77473 728 21 3 0 0 lxcfs [17420955.874482] [ 1576] 101 1576 64099 1008 27 3 0 0 rsyslogd [17420955.874484] [ 1578] 106 1578 1884 403 9 3 0 0 vnstatd [17420955.874486] [ 1580] 0 1580 1100 322 8 3 0 0 acpid [17420955.874488] [ 1582] 0 1582 68674 1100 37 3 0 0 accounts-daemon [17420955.874489] [ 1584] 0 1584 6511 378 18 3 0 0 atd [17420955.874491] [ 1593] 0 1593 16380 700 36 3 0 -1000 sshd [17420955.874493] [ 1595] 0 1595 7157 635 18 3 0 0 systemd-logind [17420955.874495] [ 1599] 0 1599 7470 139 19 3 0 0 cgmanager [17420955.874497] [ 1618] 0 1618 3344 116 11 3 0 0 mdadm [17420955.874498] [ 1622] 0 1622 1306 441 8 3 0 0 iscsid [17420955.874500] [ 1623] 0 1623 1431 916 8 3 0 -17 iscsid [17420955.874511] [ 1717] 0 1717 3619 388 12 3 0 0 agetty [17420955.874513] [ 1718] 0 1718 3665 343 12 3 0 0 agetty [17420955.874515] [ 1731] 0 1731 5025 653 15 3 0 0 irqbalance [17420955.874517] [ 1744] 0 1744 69272 719 39 3 0 0 polkitd [17420955.874519] [ 2484] 1001 2484 11312 218 27 3 0 0 systemd [17420955.874520] [ 2488] 1001 2488 15805 475 34 3 0 0 (sd-pam) [17420955.874523] [ 6289] 0 6289 7718 1364 19 3 0 0 tmux [17420955.874531] [ 6290] 0 6290 5381 924 15 3 0 0 bash [17420955.874533] [ 6306] 0 6306 2158 395 10 3 0 0 mysqld_safe [17420955.874537] [36138] 107 36138 43942177 27121187 70971 157 0 0 mysqld [17420955.874539] [78610] 108 78610 5992 577 16 3 0 0 nrpe [17420955.874542] [19441] 0 19441 24876 1740 54 3 0 0 sshd [17420955.874543] [19447] 1008 19447 11312 1147 27 3 0 0 systemd [17420955.874555] [19449] 1008 19449 15817 487 34 3 0 0 (sd-pam) [17420955.874557] [19575] 1008 19575 24876 834 51 3 0 0 sshd [17420955.874558] [19576] 0 19576 14970 933 34 3 0 0 sudo [17420955.874560] [19577] 0 19577 5388 1343 16 3 0 0 bash [17420955.874561] [26883] 0 26883 8430 5113 22 5 0 0 mysqld_exporter [17420955.874564] out of memory: kill process 36138 (mysqld) score 659 or sacrifice child [17420955.890336] killed process 36138 (mysqld) total-vm:175768708kb, anon-rss:108470860kb, file-rss:13888kb here my.cnf file:
[mysql] # client # port = 3306 socket = /var/run/mysqld/mysqld.sock [mysqld] # general # user = mysql default-storage-engine = innodb socket = /var/run/mysqld/mysqld.sock pid-file = /var/run/mysqld/mysqld.pid # myisam # key-buffer-size = 32m myisam-recover = force,backup # safety # max-allowed-packet = 16m max-connect-errors = 1000000 # data storage # datadir = /var/lib/mysql/ # binary logging # log-bin = /var/lib/mysql-binlogs/mysql-bin expire-logs-days = 7 sync-binlog = 1 binlog-format = mixed # replication # server-id = 1 auto_increment_offset = 1 # total number of master servers auto_increment_increment = 2 log-slave-updates = 1 relay-log = /var/lib/mysql-binlogs/relay-bin slave-net-timeout = 60 # caches , limits # tmp-table-size = 32m max-heap-table-size = 32m query-cache-type = 0 query-cache-size = 0 thread-cache-size = 50 open-files-limit = 65535 table-definition-cache = 4096 table-open-cache = 4096 # innodb # innodb-flush-method = o_direct innodb-log-files-in-group = 2 innodb-log-file-size = 512m innodb-flush-log-at-trx-commit = 2 innodb-file-per-table = 1 innodb-buffer-pool-size = 125g innodb_large_prefix = 1 innodb_file_format = barracuda # logging # log-error = /var/log/mysql/mysql-error.log log-queries-not-using-indexes = 0 slow-query-log = 0 slow-query-log-file = /var/log/mysql/mysql-slow.log # other setup # character_set_server = utf8mb4 collation-server = utf8mb4_unicode_ci init-connect = 'set names utf8mb4' skip-name-resolve = 1 max_connections = 12288 wait_timeout = 120 connect_timeout = 30 interactive_timeout = 120
i'll assume using along php , apache. find causing issue in 1 of application's log files.
note time of failure in mysql log , in other log files during time period , should find clues lead answer.
i should note oom chooses service shut down when system runs out of memory , though shut down may not have been service created issue.
Comments
Post a Comment