kernel_arpi

Author	SHA1	Message	Date
Greg Kroah-Hartman	a8b5dc3032	Merge 5.15.17 into android13-5.15 Changes in 5.15.17 KVM: x86/mmu: Fix write-protection of PTs mapped by the TDP MMU KVM: VMX: switch blocked_vcpu_on_cpu_lock to raw spinlock HID: Ignore battery for Elan touchscreen on HP Envy X360 15t-dr100 HID: uhid: Fix worker destroying device without any protection HID: wacom: Reset expected and received contact counts at the same time HID: wacom: Ignore the confidence flag when a touch is removed HID: wacom: Avoid using stale array indicies to read contact count ALSA: core: Fix SSID quirk lookup for subvendor=0 f2fs: fix to do sanity check on inode type during garbage collection f2fs: fix to do sanity check in is_alive() f2fs: avoid EINVAL by SBI_NEED_FSCK when pinning a file nfc: llcp: fix NULL error pointer dereference on sendmsg() after failed bind() mtd: rawnand: gpmi: Add ERR007117 protection for nfc_apply_timings mtd: rawnand: gpmi: Remove explicit default gpmi clock setting for i.MX6 mtd: Fixed breaking list in __mtd_del_partition. mtd: rawnand: davinci: Don't calculate ECC when reading page mtd: rawnand: davinci: Avoid duplicated page read mtd: rawnand: davinci: Rewrite function description mtd: rawnand: Export nand_read_page_hwecc_oob_first() mtd: rawnand: ingenic: JZ4740 needs 'oob_first' read page function riscv: Get rid of MAXPHYSMEM configs RISC-V: Use common riscv_cpuid_to_hartid_mask() for both SMP=y and SMP=n riscv: try to allocate crashkern region from 32bit addressible memory riscv: Don't use va_pa_offset on kdump riscv: use hart id instead of cpu id on machine_kexec riscv: mm: fix wrong phys_ram_base value for RV64 x86/gpu: Reserve stolen memory for first integrated Intel GPU tools/nolibc: x86-64: Fix startup code bug crypto: x86/aesni - don't require alignment of data tools/nolibc: i386: fix initial stack alignment tools/nolibc: fix incorrect truncation of exit code rtc: cmos: take rtc_lock while reading from CMOS net: phy: marvell: add Marvell specific PHY loopback ksmbd: uninitialized variable in create_socket() ksmbd: fix guest connection failure with nautilus ksmbd: add support for smb2 max credit parameter ksmbd: move credit charge deduction under processing request ksmbd: limits exceeding the maximum allowable outstanding requests ksmbd: add reserved room in ipc request/response media: cec: fix a deadlock situation media: ov8865: Disable only enabled regulators on error path media: v4l2-ioctl.c: readbuffers depends on V4L2_CAP_READWRITE media: flexcop-usb: fix control-message timeouts media: mceusb: fix control-message timeouts media: em28xx: fix control-message timeouts media: cpia2: fix control-message timeouts media: s2255: fix control-message timeouts media: dib0700: fix undefined behavior in tuner shutdown media: redrat3: fix control-message timeouts media: pvrusb2: fix control-message timeouts media: stk1160: fix control-message timeouts media: cec-pin: fix interrupt en/disable handling can: softing_cs: softingcs_probe(): fix memleak on registration failure mei: hbm: fix client dma reply status iio: adc: ti-adc081c: Partial revert of removal of ACPI IDs iio: trigger: Fix a scheduling whilst atomic issue seen on tsc2046 lkdtm: Fix content of section containing lkdtm_rodata_do_nothing() bus: mhi: pci_generic: Graceful shutdown on freeze bus: mhi: core: Fix reading wake_capable channel configuration bus: mhi: core: Fix race while handling SYS_ERR at power up cxl/pmem: Fix reference counting for delayed work arm64: errata: Fix exec handling in erratum `1418040` workaround ARM: dts: at91: update alternate function of signal PD20 iommu/io-pgtable-arm-v7s: Add error handle for page table allocation failure gpu: host1x: Add back arm_iommu_detach_device() drm/tegra: Add back arm_iommu_detach_device() virtio/virtio_mem: handle a possible NULL as a memcpy parameter dma_fence_array: Fix PENDING_ERROR leak in dma_fence_array_signaled() PCI: Add function 1 DMA alias quirk for Marvell 88SE9125 SATA controller mm_zone: add function to check if managed dma zone exists dma/pool: create dma atomic pool only if dma zone has managed pages mm/page_alloc.c: do not warn allocation failure on zone DMA if no managed pages ath11k: add string type to search board data in board-2.bin for WCN6855 shmem: fix a race between shmem_unused_huge_shrink and shmem_evict_inode drm/ttm: Put BO in its memory manager's lru list Bluetooth: L2CAP: Fix not initializing sk_peer_pid drm/bridge: display-connector: fix an uninitialized pointer in probe() drm: fix null-ptr-deref in drm_dev_init_release() drm/panel: kingdisplay-kd097d04: Delete panel on attach() failure drm/panel: innolux-p079zca: Delete panel on attach() failure drm/rockchip: dsi: Fix unbalanced clock on probe error drm/rockchip: dsi: Hold pm-runtime across bind/unbind drm/rockchip: dsi: Disable PLL clock on bind error drm/rockchip: dsi: Reconfigure hardware on resume() Bluetooth: virtio_bt: fix memory leak in virtbt_rx_handle() Bluetooth: cmtp: fix possible panic when cmtp_init_sockets() fails clk: bcm-2835: Pick the closest clock rate clk: bcm-2835: Remove rounding up the dividers drm/vc4: hdmi: Set a default HSM rate drm/vc4: hdmi: Move the HSM clock enable to runtime_pm drm/vc4: hdmi: Make sure the controller is powered in detect drm/vc4: hdmi: Make sure the controller is powered up during bind drm/vc4: hdmi: Rework the pre_crtc_configure error handling drm/vc4: crtc: Make sure the HDMI controller is powered when disabling wcn36xx: ensure pairing of init_scan/finish_scan and start_scan/end_scan wcn36xx: Indicate beacon not connection loss on MISSED_BEACON_IND drm/vc4: hdmi: Enable the scrambler on reconnection libbpf: Free up resources used by inner map definition wcn36xx: Fix DMA channel enable/disable cycle wcn36xx: Release DMA channel descriptor allocations wcn36xx: Put DXE block into reset before freeing memory wcn36xx: populate band before determining rate on RX wcn36xx: fix RX BD rate mapping for 5GHz legacy rates ath11k: Send PPDU_STATS_CFG with proper pdev mask to firmware bpftool: Fix memory leak in prog_dump() mtd: hyperbus: rpc-if: Check return value of rpcif_sw_init() media: videobuf2: Fix the size printk format media: atomisp: add missing media_device_cleanup() in atomisp_unregister_entities() media: atomisp: fix punit_ddr_dvfs_enable() argument for mrfld_power up case media: atomisp: fix inverted logic in buffers_needed() media: atomisp: do not use err var when checking port validity for ISP2400 media: atomisp: fix inverted error check for ia_css_mipi_is_source_port_valid() media: atomisp: fix ifdefs in sh_css.c media: atomisp: add NULL check for asd obtained from atomisp_video_pipe media: atomisp: fix enum formats logic media: atomisp: fix uninitialized bug in gmin_get_pmic_id_and_addr() media: aspeed: fix mode-detect always time out at 2nd run media: em28xx: fix memory leak in em28xx_init_dev media: aspeed: Update signal status immediately to ensure sane hw state arm64: dts: amlogic: meson-g12: Fix GPU operating point table node name arm64: dts: amlogic: Fix SPI NOR flash node name for ODROID N2/N2+ arm64: dts: meson-gxbb-wetek: fix HDMI in early boot arm64: dts: meson-gxbb-wetek: fix missing GPIO binding fs: dlm: don't call kernel_getpeername() in error_report() memory: renesas-rpc-if: Return error in case devm_ioremap_resource() fails Bluetooth: stop proccessing malicious adv data ath11k: Fix ETSI regd with weather radar overlap ath11k: clear the keys properly via DISABLE_KEY ath11k: reset RSN/WPA present state for open BSS spi: hisi-kunpeng: Fix the debugfs directory name incorrect tee: fix put order in teedev_close_context() fs: dlm: fix build with CONFIG_IPV6 disabled drm/dp: Don't read back backlight mode in drm_edp_backlight_enable() drm/vboxvideo: fix a NULL vs IS_ERR() check arm64: dts: renesas: cat875: Add rx/tx delays media: dmxdev: fix UAF when dvb_register_device() fails crypto: atmel-aes - Reestablish the correct tfm context at dequeue crypto: qce - fix uaf on qce_aead_register_one crypto: qce - fix uaf on qce_ahash_register_one crypto: qce - fix uaf on qce_skcipher_register_one arm64: dts: qcom: sc7280: Fix incorrect clock name mtd: hyperbus: rpc-if: fix bug in rpcif_hb_remove cpufreq: qcom-cpufreq-hw: Update offline CPUs per-cpu thermal pressure cpufreq: qcom-hw: Fix probable nested interrupt handling ARM: dts: stm32: fix dtbs_check warning on ili9341 dts binding on stm32f429 disco libbpf: Fix potential misaligned memory access in btf_ext__new() libbpf: Fix glob_syms memory leak in bpf_linker libbpf: Fix using invalidated memory in bpf_linker crypto: qat - remove unnecessary collision prevention step in PFVF crypto: qat - make pfvf send message direction agnostic crypto: qat - fix undetected PFVF timeout in ACK loop ath11k: Use host CE parameters for CE interrupts configuration arm64: dts: ti: k3-j721e: correct cache-sets info tty: serial: atmel: Check return code of dmaengine_submit() tty: serial: atmel: Call dma_async_issue_pending() mfd: atmel-flexcom: Remove #ifdef CONFIG_PM_SLEEP mfd: atmel-flexcom: Use .resume_noirq bfq: Do not let waker requests skip proper accounting libbpf: Silence uninitialized warning/error in btf_dump_dump_type_data media: i2c: imx274: fix s_frame_interval runtime resume not requested media: i2c: Re-order runtime pm initialisation media: i2c: ov8865: Fix lockdep error media: rcar-csi2: Correct the selection of hsfreqrange media: imx-pxp: Initialize the spinlock prior to using it media: si470x-i2c: fix possible memory leak in si470x_i2c_probe() media: mtk-vcodec: call v4l2_m2m_ctx_release first when file is released media: hantro: Hook up RK3399 JPEG encoder output media: coda: fix CODA960 JPEG encoder buffer overflow media: venus: correct low power frequency calculation for encoder media: venus: core: Fix a potential NULL pointer dereference in an error handling path media: venus: core: Fix a resource leak in the error handling path of 'venus_probe()' net: stmmac: Add platform level debug register dump feature thermal/drivers/imx: Implement runtime PM support igc: AF_XDP zero-copy metadata adjust breaks SKBs on XDP_PASS netfilter: bridge: add support for pppoe filtering powerpc: Avoid discarding flags in system_call_exception() arm64: dts: qcom: msm8916: fix MMC controller aliases drm/vmwgfx: Remove the deprecated lower mem limit drm/vmwgfx: Fail to initialize on broken configs cgroup: Trace event cgroup id fields should be u64 ACPI: EC: Rework flushing of EC work while suspended to idle thermal/drivers/imx8mm: Enable ADC when enabling monitor drm/amdgpu: Fix a NULL pointer dereference in amdgpu_connector_lcd_native_mode() drm/radeon/radeon_kms: Fix a NULL pointer dereference in radeon_driver_open_kms() libbpf: Clean gen_loader's attach kind. crypto: caam - save caam memory to support crypto engine retry mechanism. arm64: dts: ti: k3-am642: Fix the L2 cache sets arm64: dts: ti: k3-j7200: Fix the L2 cache sets arm64: dts: ti: k3-j721e: Fix the L2 cache sets arm64: dts: ti: k3-j7200: Correct the d-cache-sets info tty: serial: uartlite: allow 64 bit address serial: amba-pl011: do not request memory region twice mtd: core: provide unique name for nvmem device floppy: Fix hang in watchdog when disk is ejected staging: rtl8192e: return error code from rtllib_softmac_init() staging: rtl8192e: rtllib_module: fix error handle case in alloc_rtllib() Bluetooth: btmtksdio: fix resume failure bpf: Fix the test_task_vma selftest to support output shorter than 1 kB sched/fair: Fix detection of per-CPU kthreads waking a task sched/fair: Fix per-CPU kthread and wakee stacking for asym CPU capacity bpf: Adjust BTF log size limit. bpf: Disallow BPF_LOG_KERNEL log level for bpf(BPF_BTF_LOAD) bpf: Remove config check to enable bpf support for branch records arm64: clear_page() shouldn't use DC ZVA when DCZID_EL0.DZP == 1 arm64: mte: DC {GVA,GZVA} shouldn't be used when DCZID_EL0.DZP == 1 samples/bpf: Install libbpf headers when building samples/bpf: Clean up samples/bpf build failes samples: bpf: Fix xdp_sample_user.o linking with Clang samples: bpf: Fix 'unknown warning group' build warning on Clang media: dib8000: Fix a memleak in dib8000_init() media: saa7146: mxb: Fix a NULL pointer dereference in mxb_attach() media: si2157: Fix "warm" tuner state detection wireless: iwlwifi: Fix a double free in iwl_txq_dyn_alloc_dma sched/rt: Try to restart rt period timer when rt runtime exceeded ath10k: Fix the MTU size on QCA9377 SDIO Bluetooth: refactor set_exp_feature with a feature table Bluetooth: MGMT: Use hci_dev_test_and_{set,clear}_flag Bluetooth: btusb: Handle download_firmware failure cases drm/amd/display: Fix bug in debugfs crc_win_update entry drm/amd/display: Fix out of bounds access on DNC31 stream encoder regs drm/msm/gpu: Don't allow zero fence_id drm/msm/dp: displayPort driver need algorithm rational rcu/exp: Mark current CPU as exp-QS in IPI loop second pass wcn36xx: Fix max channels retrieval drm/msm/dsi: fix initialization in the bonded DSI case mwifiex: Fix possible ABBA deadlock xfrm: fix a small bug in xfrm_sa_len() x86/uaccess: Move variable into switch case statement selftests: clone3: clone3: add case CLONE3_ARGS_NO_TEST selftests: harness: avoid false negatives if test has no ASSERTs crypto: stm32/cryp - fix CTR counter carry crypto: stm32/cryp - fix xts and race condition in crypto_engine requests crypto: stm32/cryp - check early input data crypto: stm32/cryp - fix double pm exit crypto: stm32/cryp - fix lrw chaining mode crypto: stm32/cryp - fix bugs and crash in tests crypto: stm32 - Revert broken pm_runtime_resume_and_get changes crypto: hisilicon/qm - fix incorrect return value of hisi_qm_resume() ath11k: Fix deleting uninitialized kernel timer during fragment cache flush spi: Fix incorrect cs_setup delay handling ARM: dts: gemini: NAS4220-B: fis-index-block with 128 KiB sectors perf/arm-cmn: Fix CPU hotplug unregistration media: dw2102: Fix use after free media: msi001: fix possible null-ptr-deref in msi001_probe() media: coda/imx-vdoa: Handle dma_set_coherent_mask error codes ath11k: Fix a NULL pointer dereference in ath11k_mac_op_hw_scan() net: dsa: hellcreek: Fix insertion of static FDB entries net: dsa: hellcreek: Add STP forwarding rule net: dsa: hellcreek: Allow PTP P2P measurements on blocked ports net: dsa: hellcreek: Add missing PTP via UDP rules arm64: dts: qcom: c630: Fix soundcard setup arm64: dts: qcom: ipq6018: Fix gpio-ranges property drm/msm/dpu: fix safe status debugfs file drm/bridge: ti-sn65dsi86: Set max register for regmap gpu: host1x: select CONFIG_DMA_SHARED_BUFFER drm/tegra: gr2d: Explicitly control module reset drm/tegra: vic: Fix DMA API misuse media: hantro: Fix probe func error path xfrm: interface with if_id 0 should return error xfrm: state and policy should fail if XFRMA_IF_ID 0 ARM: 9159/1: decompressor: Avoid UNPREDICTABLE NOP encoding usb: ftdi-elan: fix memory leak on device disconnect arm64: dts: marvell: cn9130: add GPIO and SPI aliases arm64: dts: marvell: cn9130: enable CP0 GPIO controllers ARM: dts: armada-38x: Add generic compatible to UART nodes mt76: mt7921: drop offload_flags overwritten wilc1000: fix double free error in probe() rtw88: add quirk to disable pci caps on HP 250 G7 Notebook PC rtw88: Disable PCIe ASPM while doing NAPI poll on 8821CE iwlwifi: mvm: fix 32-bit build in FTM iwlwifi: mvm: test roc running status bits before removing the sta iwlwifi: mvm: perform 6GHz passive scan after suspend iwlwifi: mvm: set protected flag only for NDP ranging mmc: meson-mx-sdhc: add IRQ check mmc: meson-mx-sdio: add IRQ check block: fix error unwinding in device_add_disk selinux: fix potential memleak in selinux_add_opt() um: fix ndelay/udelay defines um: rename set_signals() to um_set_signals() um: virt-pci: Fix 32-bit compile lib/logic_iomem: Fix 32-bit build lib/logic_iomem: Fix operation on 32-bit um: virtio_uml: Fix time-travel external time propagation Bluetooth: L2CAP: Fix using wrong mode bpftool: Enable line buffering for stdout backlight: qcom-wled: Validate enabled string indices in DT backlight: qcom-wled: Pass number of elements to read to read_u32_array backlight: qcom-wled: Fix off-by-one maximum with default num_strings backlight: qcom-wled: Override default length with qcom,enabled-strings backlight: qcom-wled: Use cpu_to_le16 macro to perform conversion backlight: qcom-wled: Respect enabled-strings in set_brightness software node: fix wrong node passed to find nargs_prop Bluetooth: hci_qca: Stop IBS timer during BT OFF x86/boot/compressed: Move CLANG_FLAGS to beginning of KBUILD_CFLAGS crypto: octeontx2 - prevent underflow in get_cores_bmap() regulator: qcom-labibb: OCP interrupts are not a failure while disabled hwmon: (mr75203) fix wrong power-up delay value x86/mce/inject: Avoid out-of-bounds write when setting flags io_uring: remove double poll on poll update serial: 8250_bcm7271: Propagate error codes from brcmuart_probe() ACPI: scan: Create platform device for BCM4752 and LNV4752 ACPI nodes pcmcia: rsrc_nonstatic: Fix a NULL pointer dereference in __nonstatic_find_io_region() pcmcia: rsrc_nonstatic: Fix a NULL pointer dereference in nonstatic_find_mem_region() power: reset: mt6397: Check for null res pointer net/xfrm: IPsec tunnel mode fix inner_ipproto setting in sec_path net: ethernet: mtk_eth_soc: fix return values and refactor MDIO ops net: dsa: fix incorrect function pointer check for MRP ring roles netfilter: ipt_CLUSTERIP: fix refcount leak in clusterip_tg_check() bpf, sockmap: Fix return codes from tcp_bpf_recvmsg_parser() bpf, sockmap: Fix double bpf_prog_put on error case in map_link bpf: Don't promote bogus looking registers after null check. bpf: Fix verifier support for validation of async callbacks bpf: Fix SO_RCVBUF/SO_SNDBUF handling in _bpf_setsockopt(). netfilter: nft_payload: do not update layer 4 checksum when mangling fragments netfilter: nft_set_pipapo: allocate pcpu scratch maps on clone net: fix SOF_TIMESTAMPING_BIND_PHC to work with multiple sockets ppp: ensure minimum packet size in ppp_write() rocker: fix a sleeping in atomic bug staging: greybus: audio: Check null pointer fsl/fman: Check for null pointer after calling devm_ioremap Bluetooth: hci_bcm: Check for error irq Bluetooth: hci_qca: Fix NULL vs IS_ERR_OR_NULL check in qca_serdev_probe net/smc: Reset conn->lgr when link group registration fails usb: dwc3: qcom: Fix NULL vs IS_ERR checking in dwc3_qcom_probe usb: dwc2: do not gate off the hardware if it does not support clock gating usb: dwc2: gadget: initialize max_speed from params usb: gadget: u_audio: Subdevice 0 for capture ctls HID: hid-uclogic-params: Invalid parameter check in uclogic_params_init HID: hid-uclogic-params: Invalid parameter check in uclogic_params_get_str_desc HID: hid-uclogic-params: Invalid parameter check in uclogic_params_huion_init HID: hid-uclogic-params: Invalid parameter check in uclogic_params_frame_init_v1_buttonpad debugfs: lockdown: Allow reading debugfs files that are not world readable drivers/firmware: Add missing platform_device_put() in sysfb_create_simplefb serial: liteuart: fix MODULE_ALIAS serial: stm32: move tx dma terminate DMA to shutdown x86, sched: Fix undefined reference to init_freq_invariance_cppc() build error net/mlx5e: Fix page DMA map/unmap attributes net/mlx5e: Fix wrong usage of fib_info_nh when routes with nexthop objects are used net/mlx5e: Don't block routes with nexthop objects in SW Revert "net/mlx5e: Block offload of outer header csum for UDP tunnels" Revert "net/mlx5e: Block offload of outer header csum for GRE tunnel" net/mlx5e: Fix matching on modified inner ip_ecn bits net/mlx5: Fix access to sf_dev_table on allocation failure net/mlx5e: Sync VXLAN udp ports during uplink representor profile change net/mlx5: Set command entry semaphore up once got index free lib/mpi: Add the return value check of kcalloc() Bluetooth: L2CAP: uninitialized variables in l2cap_sock_setsockopt() mptcp: fix per socket endpoint accounting mptcp: fix opt size when sending DSS + MP_FAIL mptcp: fix a DSS option writing error spi: spi-meson-spifc: Add missing pm_runtime_disable() in meson_spifc_probe octeontx2-af: Increment ptp refcount before use ax25: uninitialized variable in ax25_setsockopt() netrom: fix api breakage in nr_setsockopt() regmap: Call regmap_debugfs_exit() prior to _init() net: mscc: ocelot: fix incorrect balancing with down LAG ports can: mcp251xfd: add missing newline to printed strings tpm: add request_locality before write TPM_INT_ENABLE tpm_tis: Fix an error handling path in 'tpm_tis_core_init()' can: softing: softing_startstop(): fix set but not used variable warning can: xilinx_can: xcan_probe(): check for error irq can: rcar_canfd: rcar_canfd_channel_probe(): make sure we free CAN network device pcmcia: fix setting of kthread task states net/sched: flow_dissector: Fix matching on zone id for invalid conns net: openvswitch: Fix matching zone id for invalid conns arriving from tc net: openvswitch: Fix ct_state nat flags for conns arriving from tc iwlwifi: mvm: Use div_s64 instead of do_div in iwl_mvm_ftm_rtt_smoothing() bnxt_en: Refactor coredump functions bnxt_en: move coredump functions into dedicated file bnxt_en: use firmware provided max timeout for messages net: mcs7830: handle usb read errors properly ext4: avoid trim error on fs with small groups ASoC: Intel: sof_sdw: fix jack detection on HP Spectre x360 convertible ALSA: jack: Add missing rwsem around snd_ctl_remove() calls ALSA: PCM: Add missing rwsem around snd_ctl_remove() calls ALSA: hda: Add missing rwsem around snd_ctl_remove() calls ALSA: hda: Fix potential deadlock at codec unbinding RDMA/bnxt_re: Scan the whole bitmap when checking if "disabling RCFW with pending cmd-bit" RDMA/hns: Validate the pkey index scsi: pm80xx: Update WARN_ON check in pm8001_mpi_build_cmd() clk: renesas: rzg2l: Check return value of pm_genpd_init() clk: renesas: rzg2l: propagate return value of_genpd_add_provider_simple() clk: imx8mn: Fix imx8mn_clko1_sels powerpc/prom_init: Fix improper check of prom_getprop() ASoC: uniphier: drop selecting non-existing SND_SOC_UNIPHIER_AIO_DMA ASoC: codecs: wcd938x: add SND_SOC_WCD938_SDW to codec list instead RDMA/rtrs-clt: Fix the initial value of min_latency ALSA: hda: Make proper use of timecounter dt-bindings: thermal: Fix definition of cooling-maps contribution property powerpc/perf: Fix PMU callbacks to clear pending PMI before resetting an overflown PMC powerpc/modules: Don't WARN on first module allocation attempt powerpc/32s: Fix shift-out-of-bounds in KASAN init clocksource: Avoid accidental unstable marking of clocksources ALSA: oss: fix compile error when OSS_DEBUG is enabled ALSA: usb-audio: Drop superfluous '0' in Presonus Studio 1810c's ID misc: at25: Make driver OF independent again char/mwave: Adjust io port register size binder: fix handling of error during copy binder: avoid potential data leakage when copying txn openrisc: Add clone3 ABI wrapper iommu: Extend mutex lock scope in iommu_probe_device() iommu/io-pgtable-arm: Fix table descriptor paddr formatting scsi: core: Fix scsi_device_max_queue_depth() scsi: ufs: Fix race conditions related to driver data RDMA/qedr: Fix reporting max_{send/recv}_wr attrs PCI/MSI: Fix pci_irq_vector()/pci_irq_get_affinity() powerpc/powermac: Add additional missing lockdep_register_key() iommu/arm-smmu-qcom: Fix TTBR0 read RDMA/core: Let ib_find_gid() continue search even after empty entry RDMA/cma: Let cma_resolve_ib_dev() continue search even after empty entry ASoC: rt5663: Handle device_property_read_u32_array error codes of: unittest: fix warning on PowerPC frame size warning of: unittest: 64 bit dma address test requires arch support clk: stm32: Fix ltdc's clock turn off by clk_disable_unused() after system enter shell mips: add SYS_HAS_CPU_MIPS64_R5 config for MIPS Release 5 support mips: fix Kconfig reference to PHYS_ADDR_T_64BIT dmaengine: pxa/mmp: stop referencing config->slave_id iommu/amd: Restore GA log/tail pointer on host resume iommu/amd: X2apic mode: re-enable after resume iommu/amd: X2apic mode: setup the INTX registers on mask/unmask iommu/amd: X2apic mode: mask/unmask interrupts on suspend/resume iommu/amd: Remove useless irq affinity notifier ASoC: Intel: catpt: Test dmaengine_submit() result before moving on iommu/iova: Fix race between FQ timeout and teardown ASoC: mediatek: mt8195: correct default value of: fdt: Aggregate the processing of "linux,usable-memory-range" efi: apply memblock cap after memblock_add() scsi: block: pm: Always set request queue runtime active in blk_post_runtime_resume() phy: uniphier-usb3ss: fix unintended writing zeros to PHY register ASoC: mediatek: Check for error clk pointer powerpc/64s: Mask NIP before checking against SRR0 powerpc/64s: Use EMIT_WARN_ENTRY for SRR debug warnings phy: cadence: Sierra: Fix to get correct parent for mux clocks ASoC: samsung: idma: Check of ioremap return value misc: lattice-ecp3-config: Fix task hung when firmware load failed ASoC: mediatek: mt8195: correct pcmif BE dai control flow arm64: tegra: Remove non existent Tegra194 reset mips: lantiq: add support for clk_set_parent() mips: bcm63xx: add support for clk_set_parent() powerpc/xive: Add missing null check after calling kmalloc ASoC: fsl_mqs: fix MODULE_ALIAS ALSA: hda/cs8409: Increase delay during jack detection ALSA: hda/cs8409: Fix Jack detection after resume RDMA/cxgb4: Set queue pair state when being queried clk: qcom: gcc-sc7280: Mark gcc_cfg_noc_lpass_clk always enabled ASoC: imx-card: Need special setting for ak4497 on i.MX8MQ ASoC: imx-card: Fix mclk calculation issue for akcodec ASoC: imx-card: improve the sound quality for low rate ASoC: fsl_asrc: refine the check of available clock divider clk: bm1880: remove kfrees on static allocations of: base: Fix phandle argument length mismatch error message of/fdt: Don't worry about non-memory region overlap for no-map MIPS: boot/compressed/: add __ashldi3 to target for ZSTD compression MIPS: compressed: Fix build with ZSTD compression mailbox: fix gce_num of mt8192 driver data ARM: dts: omap3-n900: Fix lp5523 for multi color leds: lp55xx: initialise output direction from dts Bluetooth: Fix debugfs entry leak in hci_register_dev() Bluetooth: Fix memory leak of hci device drm/panel: Delete panel on mipi_dsi_attach() failure Bluetooth: Fix removing adv when processing cmd complete fs: dlm: filter user dlm messages for kernel locks drm/lima: fix warning when CONFIG_DEBUG_SG=y & CONFIG_DMA_API_DEBUG=y selftests/bpf: Fix memory leaks in btf_type_c_dump() helper selftests/bpf: Destroy XDP link correctly selftests/bpf: Fix bpf_object leak in skb_ctx selftest ar5523: Fix null-ptr-deref with unexpected WDCMSG_TARGET_START reply drm/bridge: dw-hdmi: handle ELD when DRM_BRIDGE_ATTACH_NO_CONNECTOR drm/nouveau/pmu/gm200-: avoid touching PMU outside of DEVINIT/PREOS/ACR media: atomisp: fix try_fmt logic media: atomisp: set per-device's default mode media: atomisp-ov2680: Fix ov2680_set_fmt() clobbering the exposure media: atomisp: check before deference asd variable ARM: shmobile: rcar-gen2: Add missing of_node_put() batman-adv: allow netlink usage in unprivileged containers media: atomisp: handle errors at sh_css_create_isp_params() ath11k: Fix crash caused by uninitialized TX ring usb: dwc3: meson-g12a: fix shared reset control use USB: ehci_brcm_hub_control: Improve port index sanitizing usb: gadget: f_fs: Use stream_open() for endpoint files psi: Fix PSI_MEM_FULL state when tasks are in memstall and doing reclaim drm: panel-orientation-quirks: Add quirk for the Lenovo Yoga Book X91F/L HID: magicmouse: Report battery level over USB HID: apple: Do not reset quirks when the Fn key is not found media: b2c2: Add missing check in flexcop_pci_isr: libbpf: Accommodate DWARF/compiler bug with duplicated structs ethernet: renesas: Use div64_ul instead of do_div EDAC/synopsys: Use the quirk for version instead of ddr version arm64: dts: qcom: sm8350: Shorten camera-thermal-bottom name soc: imx: gpcv2: Synchronously suspend MIX domains ARM: imx: rename DEBUG_IMX21_IMX27_UART to DEBUG_IMX27_UART drm/amd/display: check top_pipe_to_program pointer drm/amdgpu/display: set vblank_disable_immediate for DC soc: ti: pruss: fix referenced node in error message mlxsw: pci: Add shutdown method in PCI driver drm/amd/display: add else to avoid double destroy clk_mgr drm/bridge: megachips: Ensure both bridges are probed before registration mxser: keep only !tty test in ISR tty: serial: imx: disable UCR4_OREN in .stop_rx() instead of .shutdown() gpiolib: acpi: Do not set the IRQ type if the IRQ is already in use HSI: core: Fix return freed object in hsi_new_client crypto: jitter - consider 32 LSB for APT mwifiex: Fix skb_over_panic in mwifiex_usb_recv() rsi: Fix use-after-free in rsi_rx_done_handler() rsi: Fix out-of-bounds read in rsi_read_pkt() ath11k: Avoid NULL ptr access during mgmt tx cleanup media: venus: avoid calling core_clk_setrate() concurrently during concurrent video sessions regulator: da9121: Prevent current limit change when enabled drm/vmwgfx: Release ttm memory if probe fails drm/vmwgfx: Introduce a new placement for MOB page tables ACPI / x86: Drop PWM2 device on Lenovo Yoga Book from always present table ACPI: Change acpi_device_always_present() into acpi_device_override_status() ACPI / x86: Allow specifying acpi_device_override_status() quirks by path ACPI / x86: Add not-present quirk for the PCI0.SDHB.BRC1 device on the GPD win arm64: dts: ti: j7200-main: Fix 'dtbs_check' serdes_ln_ctrl node arm64: dts: ti: j721e-main: Fix 'dtbs_check' in serdes_ln_ctrl node usb: uhci: add aspeed ast2600 uhci support floppy: Add max size check for user space request x86/mm: Flush global TLB when switching to trampoline page-table drm: rcar-du: Fix CRTC timings when CMM is used media: uvcvideo: Increase UVC_CTRL_CONTROL_TIMEOUT to 5 seconds. media: rcar-vin: Update format alignment constraints media: saa7146: hexium_orion: Fix a NULL pointer dereference in hexium_attach() media: atomisp: fix "variable dereferenced before check 'asd'" media: m920x: don't use stack on USB reads thunderbolt: Runtime PM activate both ends of the device link arm64: dts: renesas: Fix thermal bindings iwlwifi: mvm: synchronize with FW after multicast commands iwlwifi: mvm: avoid clearing a just saved session protection id rcutorture: Avoid soft lockup during cpu stall ath11k: avoid deadlock by change ieee80211_queue_work for regd_update_work ath10k: Fix tx hanging net-sysfs: update the queue counts in the unregistration path net: phy: prefer 1000baseT over 1000baseKX gpio: aspeed: Convert aspeed_gpio.lock to raw_spinlock gpio: aspeed-sgpio: Convert aspeed_sgpio.lock to raw_spinlock selftests/ftrace: make kprobe profile testcase description unique ath11k: Avoid false DEADLOCK warning reported by lockdep ARM: dts: qcom: sdx55: fix IPA interconnect definitions x86/mce: Allow instrumentation during task work queueing x86/mce: Mark mce_panic() noinstr x86/mce: Mark mce_end() noinstr x86/mce: Mark mce_read_aux() noinstr net: bonding: debug: avoid printing debug logs when bond is not notifying peers kunit: Don't crash if no parameters are generated bpf: Do not WARN in bpf_warn_invalid_xdp_action() drm/amdkfd: Fix error handling in svm_range_add HID: quirks: Allow inverting the absolute X/Y values HID: i2c-hid-of: Expose the touchscreen-inverted properties media: igorplugusb: receiver overflow should be reported media: rockchip: rkisp1: use device name for debugfs subdir name media: saa7146: hexium_gemini: Fix a NULL pointer dereference in hexium_attach() mmc: tmio: reinit card irqs in reset routine mmc: core: Fixup storing of OCR for MMC_QUIRK_NONSTD_SDIO drm/amd/amdgpu: fix psp tmr bo pin count leak in SRIOV drm/amd/amdgpu: fix gmc bo pin count leak in SRIOV audit: ensure userspace is penalized the same as the kernel when under pressure arm64: dts: ls1028a-qds: move rtc node to the correct i2c bus arm64: tegra: Adjust length of CCPLEX cluster MMIO region crypto: ccp - Move SEV_INIT retry for corrupted data crypto: hisilicon/hpre - fix memory leak in hpre_curve25519_src_init() PM: runtime: Add safety net to supplier device release cpufreq: Fix initialization of min and max frequency QoS requests usb: hub: Add delay for SuperSpeed hub resume to let links transit to U0 mt76: mt7615: fix possible deadlock while mt7615_register_ext_phy() mt76: do not pass the received frame with decryption error mt76: mt7615: improve wmm index allocation ath9k_htc: fix NULL pointer dereference at ath9k_htc_rxep() ath9k_htc: fix NULL pointer dereference at ath9k_htc_tx_get_packet() ath9k: Fix out-of-bound memcpy in ath9k_hif_usb_rx_stream rtw88: 8822c: update rx settings to prevent potential hw deadlock PM: AVS: qcom-cpr: Use div64_ul instead of do_div iwlwifi: fix leaks/bad data after failed firmware load iwlwifi: remove module loading failure message iwlwifi: mvm: Fix calculation of frame length iwlwifi: mvm: fix AUX ROC removal iwlwifi: pcie: make sure prph_info is set when treating wakeup IRQ mmc: sdhci-pci-gli: GL9755: Support for CD/WP inversion on OF platforms block: check minor range in device_add_disk() um: registers: Rename function names to avoid conflicts and build problems ath11k: Fix napi related hang Bluetooth: btintel: Add missing quirks and msft ext for legacy bootloader Bluetooth: vhci: Set HCI_QUIRK_VALID_LE_STATES xfrm: rate limit SA mapping change message to user space drm/etnaviv: consider completed fence seqno in hang check jffs2: GC deadlock reading a page that is used in jffs2_write_begin() ACPICA: actypes.h: Expand the ACPI_ACCESS_ definitions ACPICA: Utilities: Avoid deleting the same object twice in a row ACPICA: Executer: Fix the REFCLASS_REFOF case in acpi_ex_opcode_1A_0T_1R() ACPICA: Fix wrong interpretation of PCC address ACPICA: Hardware: Do not flush CPU cache when entering S4 and S5 mmc: mtk-sd: Use readl_poll_timeout instead of open-coded polling drm/amdgpu: fixup bad vram size on gmc v8 amdgpu/pm: Make sysfs pm attributes as read-only for VFs ACPI: battery: Add the ThinkPad "Not Charging" quirk ACPI: CPPC: Check present CPUs for determining _CPC is valid btrfs: remove BUG_ON() in find_parent_nodes() btrfs: remove BUG_ON(!eie) in find_parent_nodes net: mdio: Demote probed message to debug print mac80211: allow non-standard VHT MCS-10/11 dm btree: add a defensive bounds check to insert_at() dm space map common: add bounds check to sm_ll_lookup_bitmap() bpf/selftests: Fix namespace mount setup in tc_redirect mlxsw: pci: Avoid flow control for EMAD packets net: phy: marvell: configure RGMII delays for 88E1118 net: gemini: allow any RGMII interface mode regulator: qcom_smd: Align probe function with rpmh-regulator serial: pl010: Drop CR register reset on set_termios serial: pl011: Drop CR register reset on set_termios serial: core: Keep mctrl register state and cached copy in sync random: do not throw away excess input to crng_fast_load net/mlx5: Update log_max_qp value to FW max capability net/mlx5e: Unblock setting vid 0 for VF in case PF isn't eswitch manager parisc: Avoid calling faulthandler_disabled() twice can: flexcan: allow to change quirks at runtime can: flexcan: rename RX modes can: flexcan: add more quirks to describe RX path capabilities x86/kbuild: Enable CONFIG_KALLSYMS_ALL=y in the defconfigs powerpc/6xx: add missing of_node_put powerpc/powernv: add missing of_node_put powerpc/cell: add missing of_node_put powerpc/btext: add missing of_node_put powerpc/watchdog: Fix missed watchdog reset due to memory ordering race ASoC: imx-hdmi: add put_device() after of_find_device_by_node() i2c: i801: Don't silently correct invalid transfer size powerpc/smp: Move setup_profiling_timer() under CONFIG_PROFILING i2c: mpc: Correct I2C reset procedure clk: meson: gxbb: Fix the SDM_EN bit for MPLL0 on GXBB powerpc/powermac: Add missing lockdep_register_key() KVM: PPC: Book3S: Suppress warnings when allocating too big memory slots KVM: PPC: Book3S: Suppress failed alloc warning in H_COPY_TOFROM_GUEST w1: Misuse of get_user()/put_user() reported by sparse nvmem: core: set size for sysfs bin file dm: fix alloc_dax error handling in alloc_dev interconnect: qcom: rpm: Prevent integer overflow in rate scsi: ufs: Fix a kernel crash during shutdown scsi: lpfc: Fix leaked lpfc_dmabuf mbox allocations with NPIV scsi: lpfc: Trigger SLI4 firmware dump before doing driver cleanup ALSA: seq: Set upper limit of processed events MIPS: Loongson64: Use three arguments for slti powerpc/40x: Map 32Mbytes of memory at startup selftests/powerpc/spectre_v2: Return skip code when miss_percent is high powerpc: handle kdump appropriately with crash_kexec_post_notifiers option powerpc/fadump: Fix inaccurate CPU state info in vmcore generated with panic udf: Fix error handling in udf_new_inode() MIPS: OCTEON: add put_device() after of_find_device_by_node() irqchip/gic-v4: Disable redistributors' view of the VPE table at boot time i2c: designware-pci: Fix to change data types of hcnt and lcnt parameters selftests/powerpc: Add a test of sigreturning to the kernel MIPS: Octeon: Fix build errors using clang scsi: sr: Don't use GFP_DMA scsi: mpi3mr: Fixes around reply request queues ASoC: mediatek: mt8192-mt6359: fix device_node leak phy: phy-mtk-tphy: add support efuse setting ASoC: mediatek: mt8173: fix device_node leak ASoC: mediatek: mt8183: fix device_node leak habanalabs: skip read fw errors if dynamic descriptor invalid phy: mediatek: Fix missing check in mtk_mipi_tx_probe mailbox: change mailbox-mpfs compatible string seg6: export get_srh() for ICMP handling icmp: ICMPV6: Examine invoking packet for Segment Route Headers. udp6: Use Segment Routing Header for dest address if present rpmsg: core: Clean up resources on announce_create failure. ifcvf/vDPA: fix misuse virtio-net device config size for blk dev crypto: omap-aes - Fix broken pm_runtime_and_get() usage crypto: stm32/crc32 - Fix kernel BUG triggered in probe() crypto: caam - replace this_cpu_ptr with raw_cpu_ptr ubifs: Error path in ubifs_remount_rw() seems to wrongly free write buffers tpm: fix potential NULL pointer access in tpm_del_char_device tpm: fix NPE on probe for missing device mfd: tps65910: Set PWR_OFF bit during driver probe spi: uniphier: Fix a bug that doesn't point to private data correctly xen/gntdev: fix unmap notification order md: Move alloc/free acct bioset in to personality HID: magicmouse: Fix an error handling path in magicmouse_probe() fuse: Pass correct lend value to filemap_write_and_wait_range() serial: Fix incorrect rs485 polarity on uart open cputime, cpuacct: Include guest time in user time in cpuacct.stat sched/cpuacct: Fix user/system in shown cpuacct.usage* tracing/kprobes: 'nmissed' not showed correctly for kretprobe tracing: Have syscall trace events use trace_event_buffer_lock_reserve() remoteproc: imx_rproc: Fix a resource leak in the remove function iwlwifi: mvm: Increase the scan timeout guard to 30 seconds s390/mm: fix 2KB pgtable release race device property: Fix fwnode_graph_devcon_match() fwnode leak drm/tegra: submit: Add missing pm_runtime_mark_last_busy() drm/etnaviv: limit submit sizes drm/amd/display: Fix the uninitialized variable in enable_stream_features() drm/nouveau/kms/nv04: use vzalloc for nv04_display drm/bridge: analogix_dp: Make PSR-exit block less parisc: Fix lpa and lpa_user defines powerpc/64s/radix: Fix huge vmap false positive scsi: lpfc: Fix lpfc_force_rscn ndlp kref imbalance drm/amdgpu: don't do resets on APUs which don't support it drm/i915/display/ehl: Update voltage swing table PCI: xgene: Fix IB window setup PCI: pciehp: Use down_read/write_nested(reset_lock) to fix lockdep errors PCI: pci-bridge-emul: Make expansion ROM Base Address register read-only PCI: pci-bridge-emul: Properly mark reserved PCIe bits in PCI config space PCI: pci-bridge-emul: Fix definitions of reserved bits PCI: pci-bridge-emul: Correctly set PCIe capabilities PCI: pci-bridge-emul: Set PCI_STATUS_CAP_LIST for PCIe device xfrm: fix policy lookup for ipv6 gre packets xfrm: fix dflt policy check when there is no policy configured btrfs: fix deadlock between quota enable and other quota operations btrfs: check the root node for uptodate before returning it btrfs: respect the max size in the header when activating swap file ext4: make sure to reset inode lockdep class when quota enabling fails ext4: make sure quota gets properly shutdown on error ext4: fix a possible ABBA deadlock due to busy PA ext4: initialize err_blk before calling __ext4_get_inode_loc ext4: fix fast commit may miss tracking range for FALLOC_FL_ZERO_RANGE ext4: set csum seed in tmp inode while migrating to extents ext4: Fix BUG_ON in ext4_bread when write quota data ext4: use ext4_ext_remove_space() for fast commit replay delete range ext4: fast commit may miss tracking unwritten range during ftruncate ext4: destroy ext4_fc_dentry_cachep kmemcache on module removal ext4: fix null-ptr-deref in '__ext4_journal_ensure_credits' ext4: fix an use-after-free issue about data=journal writeback mode ext4: don't use the orphan list when migrating an inode tracing/osnoise: Properly unhook events if start_per_cpu_kthreads() fails ath11k: qmi: avoid error messages when dma allocation fails drm/radeon: fix error handling in radeon_driver_open_kms of: base: Improve argument length mismatch error firmware: Update Kconfig help text for Google firmware can: mcp251xfd: mcp251xfd_tef_obj_read(): fix typo in error message media: rcar-csi2: Optimize the selection PHTW register drm/vc4: hdmi: Make sure the device is powered with CEC media: correct MEDIA_TEST_SUPPORT help text Documentation: coresight: Fix documentation issue Documentation: dmaengine: Correctly describe dmatest with channel unset Documentation: ACPI: Fix data node reference documentation Documentation, arch: Remove leftovers from raw device Documentation, arch: Remove leftovers from CIFS_WEAK_PW_HASH Documentation: refer to config RANDOMIZE_BASE for kernel address-space randomization Documentation: fix firewire.rst ABI file path error Bluetooth: btusb: Return error code when getting patch status failed net: usb: Correct reset handling of smsc95xx Bluetooth: hci_sync: Fix not setting adv set duration scsi: core: Show SCMD_LAST in text form scsi: ufs: ufs-mediatek: Fix error checking in ufs_mtk_init_va09_pwr_ctrl() RDMA/cma: Remove open coding of overflow checking for private_data_len dmaengine: uniphier-xdmac: Fix type of address variables dmaengine: idxd: fix wq settings post wq disable RDMA/hns: Modify the mapping attribute of doorbell to device RDMA/rxe: Fix a typo in opcode name dmaengine: stm32-mdma: fix STM32_MDMA_CTBR_TSEL_MASK Revert "net/mlx5: Add retry mechanism to the command entry index allocation" powerpc/cell: Fix clang -Wimplicit-fallthrough warning powerpc/fsl/dts: Enable WA for erratum A-009885 on fman3l MDIO buses block: fix async_depth sysfs interface for mq-deadline block: Fix fsync always failed if once failed drm/vc4: crtc: Drop feed_txp from state drm/vc4: Fix non-blocking commit getting stuck forever drm/vc4: crtc: Copy assigned channel to the CRTC bpftool: Remove inclusion of utilities.mak from Makefiles bpftool: Fix indent in option lists in the documentation xdp: check prog type before updating BPF link bpf: Fix mount source show for bpffs bpf: Mark PTR_TO_FUNC register initially with zero offset perf evsel: Override attr->sample_period for non-libpfm4 events ipv4: update fib_info_cnt under spinlock protection ipv4: avoid quadratic behavior in netns dismantle mlx5: Don't accidentally set RTO_ONLINK before mlx5e_route_lookup_ipv4_get() net/fsl: xgmac_mdio: Add workaround for erratum A-009885 net/fsl: xgmac_mdio: Fix incorrect iounmap when removing module parisc: pdc_stable: Fix memory leak in pdcs_register_pathentries riscv: dts: microchip: mpfs: Drop empty chosen node drm/vmwgfx: Remove explicit transparent hugepages support drm/vmwgfx: Remove unused compile options f2fs: fix remove page failed in invalidate compress pages f2fs: fix to avoid panic in is_alive() if metadata is inconsistent f2fs: compress: fix potential deadlock of compress file f2fs: fix to reserve space for IO align feature f2fs: fix to check available space of CP area correctly in update_ckpt_flags() crypto: octeontx2 - uninitialized variable in kvf_limits_store() af_unix: annote lockless accesses to unix_tot_inflight & gc_in_progress clk: Emit a stern warning with writable debugfs enabled clk: si5341: Fix clock HW provider cleanup pinctrl/rockchip: fix gpio device creation gpio: mpc8xxx: Fix IRQ check in mpc8xxx_probe gpio: idt3243x: Fix IRQ check in idt_gpio_probe net/smc: Fix hung_task when removing SMC-R devices net: axienet: increase reset timeout net: axienet: Wait for PhyRstCmplt after core reset net: axienet: reset core on initialization prior to MDIO access net: axienet: add missing memory barriers net: axienet: limit minimum TX ring size net: axienet: Fix TX ring slot available check net: axienet: fix number of TX ring slots for available check net: axienet: fix for TX busy handling net: axienet: increase default TX ring size to 128 bitops: protect find_first_{,zero}_bit properly um: gitignore: Add kernel/capflags.c HID: vivaldi: fix handling devices not using numbered reports rtc: pxa: fix null pointer dereference vdpa/mlx5: Fix wrong configuration of virtio_version_1_0 virtio_ring: mark ring unused on error taskstats: Cleanup the use of task->exit_code inet: frags: annotate races around fqdir->dead and fqdir->high_thresh netns: add schedule point in ops_exit_list() iwlwifi: fix Bz NMI behaviour xfrm: Don't accidentally set RTO_ONLINK in decode_session4() vdpa/mlx5: Restore cur_num_vqs in case of failure in change_num_qps() gre: Don't accidentally set RTO_ONLINK in gre_fill_metadata_dst() libcxgb: Don't accidentally set RTO_ONLINK in cxgb_find_route() perf script: Fix hex dump character output dmaengine: at_xdmac: Don't start transactions at tx_submit level dmaengine: at_xdmac: Start transfer for cyclic channels in issue_pending dmaengine: at_xdmac: Print debug message after realeasing the lock dmaengine: at_xdmac: Fix concurrency over xfers_list dmaengine: at_xdmac: Fix lld view setting dmaengine: at_xdmac: Fix at_xdmac_lld struct definition perf tools: Drop requirement for libstdc++.so for libopencsd check perf probe: Fix ppc64 'perf probe add events failed' case devlink: Remove misleading internal_flags from health reporter dump arm64: dts: qcom: msm8996: drop not documented adreno properties net: fix sock_timestamping_bind_phc() to release device net: bonding: fix bond_xmit_broadcast return value error bug net: ipa: fix atomic update in ipa_endpoint_replenish() net_sched: restore "mpu xxx" handling net: mscc: ocelot: don't let phylink re-enable TX PAUSE on the NPI port bcmgenet: add WOL IRQ check net: wwan: Fix MRU mismatch issue which may lead to data connection lost net: ethernet: mtk_eth_soc: fix error checking in mtk_mac_config() net: ocelot: Fix the call to switchdev_bridge_port_offload net: sfp: fix high power modules without diagnostic monitoring net: cpsw: avoid alignment faults by taking NET_IP_ALIGN into account net: phy: micrel: use kszphy_suspend()/kszphy_resume for irq aware devices net: mscc: ocelot: fix using match before it is set dt-bindings: display: meson-dw-hdmi: add missing sound-name-prefix property dt-bindings: display: meson-vpu: Add missing amlogic,canvas property dt-bindings: watchdog: Require samsung,syscon-phandle for Exynos7 sch_api: Don't skip qdisc attach on ingress scripts/dtc: dtx_diff: remove broken example from help text lib82596: Fix IRQ check in sni_82596_probe mm/hmm.c: allow VM_MIXEDMAP to work with hmm_range_fault bonding: Fix extraction of ports from the packet headers lib/test_meminit: destroy cache in kmem_cache_alloc_bulk() test scripts: sphinx-pre-install: add required ctex dependency scripts: sphinx-pre-install: Fix ctex support on Debian Linux 5.15.17 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I6ddef7c3463bfc127b34c39ebcf5d286d3117931	2022-01-31 12:35:09 +01:00
Li Hua	378723bd01	sched/rt: Try to restart rt period timer when rt runtime exceeded [ Upstream commit 9b58e976b3b391c0cf02e038d53dd0478ed3013c ] When rt_runtime is modified from -1 to a valid control value, it may cause the task to be throttled all the time. Operations like the following will trigger the bug. E.g: 1. echo -1 > /proc/sys/kernel/sched_rt_runtime_us 2. Run a FIFO task named A that executes while(1) 3. echo 950000 > /proc/sys/kernel/sched_rt_runtime_us When rt_runtime is -1, The rt period timer will not be activated when task A enqueued. And then the task will be throttled after setting rt_runtime to 950,000. The task will always be throttled because the rt period timer is not activated. Fixes: `d0b27fa778` ("sched: rt-group: synchonised bandwidth period") Reported-by: Hulk Robot <hulkci@huawei.com> Signed-off-by: Li Hua <hucool.lihua@huawei.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20211203033618.11895-1-hucool.lihua@huawei.com Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-01-27 11:03:30 +01:00
Rick Yiu	f0a317610a	ANDROID: sched: Export symbol for vendor RT hook funcion Export task_may_not_preempt. Bug: 174030348 Change-Id: I71b50f876306811f008414096043b883dc43b4d5 Signed-off-by: Rick Yiu <rickyiu@google.com> Signed-off-by: Will McVicker <willmcvicker@google.com> Signed-off-by: Shaleen Agrawal <quic_shalagra@quicinc.com>	2021-12-06 15:14:20 -08:00
Greg Kroah-Hartman	966869fb2a	Merge 5.15.5 into android13-5.15 Changes in 5.15.5 arm64: zynqmp: Do not duplicate flash partition label property arm64: zynqmp: Fix serial compatible string clk: sunxi-ng: Unregister clocks/resets when unbinding ARM: dts: sunxi: Fix OPPs node name arm64: dts: allwinner: h5: Fix GPU thermal zone node name arm64: dts: allwinner: a100: Fix thermal zone node name staging: wfx: ensure IRQ is ready before enabling it ARM: dts: BCM5301X: Fix nodes names ARM: dts: BCM5301X: Fix MDIO mux binding ARM: dts: NSP: Fix mpcore, mmc node names arm64: dts: broadcom: bcm4908: Move reboot syscon out of bus scsi: pm80xx: Fix memory leak during rmmod scsi: lpfc: Fix list_add() corruption in lpfc_drain_txq() ASoC: mediatek: mt8195: Add missing of_node_put() arm64: dts: rockchip: Disable CDN DP on Pinebook Pro arm64: dts: hisilicon: fix arm,sp805 compatible string RDMA/bnxt_re: Check if the vlan is valid before reporting bus: ti-sysc: Add quirk handling for reinit on context lost bus: ti-sysc: Use context lost quirk for otg usb: musb: tusb6010: check return value after calling platform_get_resource() usb: typec: tipd: Remove WARN_ON in tps6598x_block_read ARM: dts: ux500: Skomer regulator fixes staging: rtl8723bs: remove possible deadlock when disconnect (v2) staging: rtl8723bs: remove a second possible deadlock staging: rtl8723bs: remove a third possible deadlock ARM: BCM53016: Specify switch ports for Meraki MR32 arm64: dts: qcom: msm8998: Fix CPU/L2 idle state latency and residency arm64: dts: qcom: ipq6018: Fix qcom,controlled-remotely property arm64: dts: qcom: ipq8074: Fix qcom,controlled-remotely property arm64: dts: qcom: sdm845: Fix qcom,controlled-remotely property arm64: dts: freescale: fix arm,sp805 compatible string arm64: dts: ls1012a: Add serial alias for ls1012a-rdb RDMA/rxe: Separate HW and SW l/rkeys ASoC: SOF: Intel: hda-dai: fix potential locking issue scsi: core: Fix scsi_mode_sense() buffer length handling ALSA: usb-audio: disable implicit feedback sync for Behringer UFX1204 and UFX1604 clk: imx: imx6ul: Move csi_sel mux to correct base register ASoC: es8316: Use IRQF_NO_AUTOEN when requesting the IRQ ASoC: rt5651: Use IRQF_NO_AUTOEN when requesting the IRQ ASoC: nau8824: Add DMI quirk mechanism for active-high jack-detect scsi: advansys: Fix kernel pointer leak scsi: smartpqi: Add controller handshake during kdump arm64: dts: imx8mm-kontron: Fix reset delays for ethernet PHY ALSA: intel-dsp-config: add quirk for APL/GLK/TGL devices based on ES8336 codec ASoC: Intel: soc-acpi: add missing quirk for TGL SDCA single amp ASoC: Intel: sof_sdw: add missing quirk for Dell SKU 0A45 firmware_loader: fix pre-allocated buf built-in firmware use HID: multitouch: disable sticky fingers for UPERFECT Y ALSA: usb-audio: Add support for the Pioneer DJM 750MK2 Mixer/Soundcard ARM: dts: omap: fix gpmc,mux-add-data type usb: host: ohci-tmio: check return value after calling platform_get_resource() ASoC: rt5682: fix a little pop while playback ARM: dts: ls1021a: move thermal-zones node out of soc/ ARM: dts: ls1021a-tsn: use generic "jedec,spi-nor" compatible for flash ALSA: ISA: not for M68K iommu/vt-d: Do not falsely log intel_iommu is unsupported kernel option tty: tty_buffer: Fix the softlockup issue in flush_to_ldisc MIPS: sni: Fix the build scsi: scsi_debug: Fix out-of-bound read in resp_readcap16() scsi: scsi_debug: Fix out-of-bound read in resp_report_tgtpgs() scsi: target: Fix ordered tag handling scsi: target: Fix alua_tg_pt_gps_count tracking iio: imu: st_lsm6dsx: Avoid potential array overflow in st_lsm6dsx_set_odr() RDMA/core: Use kvzalloc when allocating the struct ib_port scsi: lpfc: Fix use-after-free in lpfc_unreg_rpi() routine scsi: lpfc: Fix link down processing to address NULL pointer dereference scsi: lpfc: Allow fabric node recovery if recovery is in progress before devloss memory: tegra20-emc: Add runtime dependency on devfreq governor module powerpc/5200: dts: fix memory node unit name ARM: dts: qcom: fix memory and mdio nodes naming for RB3011 arm64: dts: qcom: Fix node name of rpm-msg-ram device nodes ALSA: gus: fix null pointer dereference on pointer block ALSA: usb-audio: fix null pointer dereference on pointer cs_desc clk: at91: sama7g5: remove prescaler part of master clock iommu/dart: Initialize DART_STREAMS_ENABLE powerpc/dcr: Use cmplwi instead of 3-argument cmpli powerpc/8xx: Fix Oops with STRICT_KERNEL_RWX without DEBUG_RODATA_TEST sh: check return code of request_irq maple: fix wrong return value of maple_bus_init(). f2fs: fix up f2fs_lookup tracepoints f2fs: fix to use WHINT_MODE f2fs: fix wrong condition to trigger background checkpoint correctly sh: fix kconfig unmet dependency warning for FRAME_POINTER sh: math-emu: drop unused functions sh: define __BIG_ENDIAN for math-emu f2fs: compress: disallow disabling compress on non-empty compressed file f2fs: fix incorrect return value in f2fs_sanity_check_ckpt() clk: ingenic: Fix bugs with divided dividers clk/ast2600: Fix soc revision for AHB clk: qcom: gcc-msm8996: Drop (again) gcc_aggre1_pnoc_ahb_clk KVM: arm64: Fix host stage-2 finalization mips: BCM63XX: ensure that CPU_SUPPORTS_32BIT_KERNEL is set MIPS: boot/compressed/: add __bswapdi2() to target for ZSTD decompression sched/core: Mitigate race cpus_share_cache()/update_top_cache_domain() sched/fair: Prevent dead task groups from regaining cfs_rq's perf/x86/vlbr: Add c->flags to vlbr event constraints blkcg: Remove extra blkcg_bio_issue_init tracing/histogram: Do not copy the fixed-size char array field over the field size perf bpf: Avoid memory leak from perf_env__insert_btf() perf bench futex: Fix memory leak of perf_cpu_map__new() perf tests: Remove bash construct from record+zstd_comp_decomp.sh drm/nouveau: hdmigv100.c: fix corrupted HDMI Vendor InfoFrame bpf: Fix inner map state pruning regression. samples/bpf: Fix summary per-sec stats in xdp_sample_user samples/bpf: Fix incorrect use of strlen in xdp_redirect_cpu selftests: net: switch to socat in the GSO GRE test net/ipa: ipa_resource: Fix wrong for loop range tcp: Fix uninitialized access in skb frags array for Rx 0cp. tracing: Add length protection to histogram string copies nl80211: fix radio statistics in survey dump mac80211: fix monitor_sdata RCU/locking assertions net: ipa: HOLB register sometimes must be written twice net: ipa: disable HOLB drop when updating timer selftests: gpio: fix gpio compiling error net: bnx2x: fix variable dereferenced before check bnxt_en: reject indirect blk offload when hw-tc-offload is off tipc: only accept encrypted MSG_CRYPTO msgs sock: fix /proc/net/sockstat underflow in sk_clone_lock() net/smc: Make sure the link_id is unique NFSD: Fix exposure in nfsd4_decode_bitmap() iavf: Fix return of set the new channel count iavf: check for null in iavf_fix_features iavf: free q_vectors before queues in iavf_disable_vf iavf: don't clear a lock we don't hold iavf: Fix failure to exit out from last all-multicast mode iavf: prevent accidental free of filter structure iavf: validate pointers iavf: Fix for the false positive ASQ/ARQ errors while issuing VF reset iavf: Fix for setting queues to 0 iavf: Restore VLAN filters after link down bpf: Fix toctou on read-only map's constant scalar tracking MIPS: generic/yamon-dt: fix uninitialized variable error mips: bcm63xx: add support for clk_get_parent() mips: lantiq: add support for clk_get_parent() gpio: rockchip: needs GENERIC_IRQ_CHIP to fix build errors platform/x86: hp_accel: Fix an error handling path in 'lis3lv02d_probe()' platform/x86: think-lmi: Abort probe on analyze failure udp: Validate checksum in udp_read_sock() btrfs: make 1-bit bit-fields of scrub_page unsigned int RDMA/core: Set send and receive CQ before forwarding to the driver net/mlx5e: kTLS, Fix crash in RX resync flow net/mlx5e: Wait for concurrent flow deletion during neigh/fib events net/mlx5: E-Switch, Fix resetting of encap mode when entering switchdev net/mlx5e: nullify cq->dbg pointer in mlx5_debug_cq_remove() net/mlx5: Update error handler for UCTX and UMEM net/mlx5: E-Switch, rebuild lag only when needed net/mlx5e: CT, Fix multiple allocations and memleak of mod acts net/mlx5: Lag, update tracker when state change event received net/mlx5: E-Switch, return error if encap isn't supported scsi: ufs: core: Improve SCSI abort handling scsi: core: sysfs: Fix hang when device state is set via sysfs scsi: ufs: core: Fix task management completion timeout race scsi: ufs: core: Fix another task management completion race net: mvmdio: fix compilation warning net: sched: act_mirred: drop dst for the direction from egress to ingress net: dpaa2-eth: fix use-after-free in dpaa2_eth_remove net: virtio_net_hdr_to_skb: count transport header in UFO i40e: Fix correct max_pkt_size on VF RX queue i40e: Fix NULL ptr dereference on VSI filter sync i40e: Fix changing previously set num_queue_pairs for PFs i40e: Fix ping is lost after configuring ADq on VF RDMA/mlx4: Do not fail the registration on port stats i40e: Fix warning message and call stack during rmmod i40e driver i40e: Fix creation of first queue by omitting it if is not power of two i40e: Fix display error code in dmesg NFC: reorganize the functions in nci_request NFC: reorder the logic in nfc_{un,}register_device NFC: add NCI_UNREG flag to eliminate the race e100: fix device suspend/resume ptp: ocp: Fix a couple NULL vs IS_ERR() checks tools build: Fix removal of feature-sync-compare-and-swap feature detection riscv: fix building external modules KVM: PPC: Book3S HV: Use GLOBAL_TOC for kvmppc_h_set_dabr/xdabr() powerpc: clean vdso32 and vdso64 directories powerpc/pseries: rename numa_dist_table to form2_distances powerpc/pseries: Fix numa FORM2 parsing fallback code pinctrl: qcom: sdm845: Enable dual edge errata pinctrl: qcom: sm8350: Correct UFS and SDC offsets perf/x86/intel/uncore: Fix filter_tid mask for CHA events on Skylake Server perf/x86/intel/uncore: Fix IIO event constraints for Skylake Server perf/x86/intel/uncore: Fix IIO event constraints for Snowridge s390/kexec: fix return code handling blk-cgroup: fix missing put device in error path from blkg_conf_pref() dmaengine: remove debugfs #ifdef tun: fix bonding active backup with arp monitoring Revert "mark pstore-blk as broken" pstore/blk: Use "%lu" to format unsigned long hexagon: export raw I/O routines for modules hexagon: clean up timer-regs.h tipc: check for null after calling kmemdup ipc: WARN if trying to remove ipc object which is absent shm: extend forced shm destroy to support objects from several IPC nses mm: kmemleak: slob: respect SLAB_NOLEAKTRACE flag hugetlb, userfaultfd: fix reservation restore on userfaultfd error kmap_local: don't assume kmap PTEs are linear arrays in memory mm/damon/dbgfs: use '__GFP_NOWARN' for user-specified size buffer allocation mm/damon/dbgfs: fix missed use of damon_dbgfs_lock x86/boot: Pull up cmdline preparation and early param parsing x86/sgx: Fix free page accounting x86/hyperv: Fix NULL deref in set_hv_tscchange_cb() if Hyper-V setup fails KVM: x86: Assume a 64-bit hypercall for guests with protected state KVM: x86: Fix uninitialized eoi_exit_bitmap usage in vcpu_load_eoi_exitmap() KVM: x86/mmu: include EFER.LMA in extended mmu role KVM: x86/xen: Fix get_attr of KVM_XEN_ATTR_TYPE_SHARED_INFO powerpc/signal32: Fix sigset_t copy powerpc/xive: Change IRQ domain to a tree domain powerpc/8xx: Fix pinned TLBs with CONFIG_STRICT_KERNEL_RWX Revert "drm/i915/tgl/dsi: Gate the ddi clocks after pll mapping" Revert "parisc: Reduce sigreturn trampoline to 3 instructions" ata: libata: improve ata_read_log_page() error message ata: libata: add missing ata_identify_page_supported() calls scsi: qla2xxx: Fix mailbox direction flags in qla2xxx_get_adapter_id() pinctrl: ralink: include 'ralink_regs.h' in 'pinctrl-mt7620.c' s390/setup: avoid reserving memory above identity mapping s390/boot: simplify and fix kernel memory layout setup s390/vdso: filter out -mstack-guard and -mstack-size s390/kexec: fix memory leak of ipl report buffer s390/dump: fix copying to user-space of swapped kdump oldmem block: Check ADMIN before NICE for IOPRIO_CLASS_RT fbdev: Prevent probing generic drivers if a FB is already registered KVM: SEV: Disallow COPY_ENC_CONTEXT_FROM if target has created vCPUs KVM: nVMX: don't use vcpu->arch.efer when checking host state on nested state load drm/cma-helper: Release non-coherent memory with dma_free_noncoherent() printk: restore flushing of NMI buffers on remote CPUs after NMI backtraces udf: Fix crash after seekdir spi: fix use-after-free of the add_lock mutex net: stmmac: socfpga: add runtime suspend/resume callback for stratix10 platform Drivers: hv: balloon: Use VMBUS_RING_SIZE() wrapper for dm_ring_size btrfs: fix memory ordering between normal and ordered work functions fs: handle circular mappings correctly net: stmmac: Fix signed/unsigned wreckage parisc/sticon: fix reverse colors cfg80211: call cfg80211_stop_ap when switch from P2P_GO type mac80211: fix radiotap header generation mac80211: drop check for DONT_REORDER in __ieee80211_select_queue drm/amd/display: Update swizzle mode enums drm/amd/display: Limit max DSC target bpp for specific monitors drm/i915/guc: Fix outstanding G2H accounting drm/i915/guc: Don't enable scheduling on a banned context, guc_id invalid, not registered drm/i915/guc: Workaround reset G2H is received after schedule done G2H drm/i915/guc: Don't drop ce->guc_active.lock when unwinding context drm/i915/guc: Unwind context requests in reverse order drm/udl: fix control-message timeout drm/prime: Fix use after free in mmap with drm_gem_ttm_mmap drm/nouveau: Add a dedicated mutex for the clients list drm/nouveau: use drm_dev_unplug() during device removal drm/nouveau: clean up all clients on device removal drm/i915/dp: Ensure sink rate values are always valid drm/i915/dp: Ensure max link params are always valid drm/i915: Fix type1 DVI DP dual mode adapter heuristic for modern platforms drm/amdgpu: fix set scaling mode Full/Full aspect/Center not works on vga and dvi connectors drm/amd/pm: avoid duplicate powergate/ungate setting signal: Implement force_fatal_sig exit/syscall_user_dispatch: Send ordinary signals on failure signal/powerpc: On swapcontext failure force SIGSEGV signal/s390: Use force_sigsegv in default_trap_handler signal/sparc32: Exit with a fatal signal when try_to_clear_window_buffer fails signal/sparc32: In setup_rt_frame and setup_fram use force_fatal_sig signal/vm86_32: Properly send SIGSEGV when the vm86 state cannot be saved. signal/x86: In emulate_vsyscall force a signal instead of calling do_exit signal: Replace force_sigsegv(SIGSEGV) with force_fatal_sig(SIGSEGV) signal: Don't always set SA_IMMUTABLE for forced signals signal: Replace force_fatal_sig with force_exit_sig when in doubt hugetlbfs: flush TLBs correctly after huge_pmd_unshare RDMA/netlink: Add __maybe_unused to static inline in C file bpf: Forbid bpf_ktime_get_coarse_ns and bpf_timer_* in tracing progs selinux: fix NULL-pointer dereference when hashtab allocation fails ASoC: DAPM: Cover regression by kctl change notification fix ASoC: rsnd: fixup DMAEngine API usb: max-3421: Use driver data instead of maintaining a list of bound devices ice: Fix VF true promiscuous mode ice: Delete always true check of PF pointer fs: export an inode_update_time helper btrfs: update device path inode time instead of bd_inode net: add and use skb_unclone_keeptruesize() helper x86/Kconfig: Fix an unused variable error in dell-smm-hwmon ALSA: hda: hdac_ext_stream: fix potential locking issues ALSA: hda: hdac_stream: fix potential locking issue in snd_hdac_stream_assign() Linux 5.15.5 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: If86a02ba2cf9af765d9838ada3b9a2cbcea9a08d	2021-11-25 10:40:10 +01:00
Mathias Krause	512e21c150	sched/fair: Prevent dead task groups from regaining cfs_rq's [ Upstream commit b027789e5e50494c2325cc70c8642e7fd6059479 ] Kevin is reporting crashes which point to a use-after-free of a cfs_rq in update_blocked_averages(). Initial debugging revealed that we've live cfs_rq's (on_list=1) in an about to be kfree()'d task group in free_fair_sched_group(). However, it was unclear how that can happen. His kernel config happened to lead to a layout of struct sched_entity that put the 'my_q' member directly into the middle of the object which makes it incidentally overlap with SLUB's freelist pointer. That, in combination with SLAB_FREELIST_HARDENED's freelist pointer mangling, leads to a reliable access violation in form of a #GP which made the UAF fail fast. Michal seems to have run into the same issue[1]. He already correctly diagnosed that commit `a7b359fc6a` ("sched/fair: Correctly insert cfs_rq's to list on unthrottle") is causing the preconditions for the UAF to happen by re-adding cfs_rq's also to task groups that have no more running tasks, i.e. also to dead ones. His analysis, however, misses the real root cause and it cannot be seen from the crash backtrace only, as the real offender is tg_unthrottle_up() getting called via sched_cfs_period_timer() via the timer interrupt at an inconvenient time. When unregister_fair_sched_group() unlinks all cfs_rq's from the dying task group, it doesn't protect itself from getting interrupted. If the timer interrupt triggers while we iterate over all CPUs or after unregister_fair_sched_group() has finished but prior to unlinking the task group, sched_cfs_period_timer() will execute and walk the list of task groups, trying to unthrottle cfs_rq's, i.e. re-add them to the dying task group. These will later -- in free_fair_sched_group() -- be kfree()'ed while still being linked, leading to the fireworks Kevin and Michal are seeing. To fix this race, ensure the dying task group gets unlinked first. However, simply switching the order of unregistering and unlinking the task group isn't sufficient, as concurrent RCU walkers might still see it, as can be seen below: CPU1: CPU2: : timer IRQ: : do_sched_cfs_period_timer(): : : : distribute_cfs_runtime(): : rcu_read_lock(); : : : unthrottle_cfs_rq(): sched_offline_group(): : : walk_tg_tree_from(…,tg_unthrottle_up,…): list_del_rcu(&tg->list); : (1) : list_for_each_entry_rcu(child, &parent->children, siblings) : : (2) list_del_rcu(&tg->siblings); : : tg_unthrottle_up(): unregister_fair_sched_group(): struct cfs_rq *cfs_rq = tg->cfs_rq[cpu_of(rq)]; : : list_del_leaf_cfs_rq(tg->cfs_rq[cpu]); : : : : if (!cfs_rq_is_decayed(cfs_rq) \|\| cfs_rq->nr_running) (3) : list_add_leaf_cfs_rq(cfs_rq); : : : : : : : : : : (4) : rcu_read_unlock(); CPU 2 walks the task group list in parallel to sched_offline_group(), specifically, it'll read the soon to be unlinked task group entry at (1). Unlinking it on CPU 1 at (2) therefore won't prevent CPU 2 from still passing it on to tg_unthrottle_up(). CPU 1 now tries to unlink all cfs_rq's via list_del_leaf_cfs_rq() in unregister_fair_sched_group(). Meanwhile CPU 2 will re-add some of these at (3), which is the cause of the UAF later on. To prevent this additional race from happening, we need to wait until walk_tg_tree_from() has finished traversing the task groups, i.e. after the RCU read critical section ends in (4). Afterwards we're safe to call unregister_fair_sched_group(), as each new walk won't see the dying task group any more. On top of that, we need to wait yet another RCU grace period after unregister_fair_sched_group() to ensure print_cfs_stats(), which might run concurrently, always sees valid objects, i.e. not already free'd ones. This patch survives Michal's reproducer[2] for 8h+ now, which used to trigger within minutes before. [1] https://lore.kernel.org/lkml/20211011172236.11223-1-mkoutny@suse.com/ [2] https://lore.kernel.org/lkml/20211102160228.GA57072@blackbody.suse.cz/ Fixes: `a7b359fc6a` ("sched/fair: Correctly insert cfs_rq's to list on unthrottle") [peterz: shuffle code around a bit] Reported-by: Kevin Tanguy <kevin.tanguy@corp.ovh.com> Signed-off-by: Mathias Krause <minipli@grsecurity.net> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2021-11-25 09:48:32 +01:00
Stephen Dickey	81a0b58261	ANDROID: sched: add hook to rto_next_cpu Restricted vendor hook to modify the cpu selected in rto_next_cpu, which is needed for the implementation of CPU Pause. Bug: 205164003 Change-Id: I0dc675e54f7f116d538840262fbb0ba6d28246f4 Signed-off-by: Stephen Dickey <quic_dickey@quicinc.com>	2021-11-09 00:42:30 +00:00
John Dias	027f8bd863	Revert "Revert "ANDROID: sched: avoid migrating when softint on tgt cpu should be short"" This reverts commit `4196c1dafc`, as the merge conflicts have been resolved. Bug: 31752786 Bug: 168521633 Change-Id: I6cb3fc698d567e03c67e2c4373ce75cc71cdfe9c Signed-off-by: John Dias <joaodias@google.com> [elavila: Amend commit text for AOSP, port to mainline] Signed-off-by: J. Avila <elavila@google.com> [ashayj@codeaurora.org: update usage of __IRQ_STAT and minor conflicts] Signed-off-by: Ashay Jaiswal <ashayj@codeaurora.org> Signed-off-by: Shaleen Agrawal <shalagra@codeaurora.org>	2021-10-15 16:15:20 +00:00
Pavankumar Kondeti	1085eff98a	ANDROID: sched: Add restrict vendor hooks for balance_rt() Add rvh called android_rvh_sched_balance_rt to influence balance_rt() from vendor modules. Bug: 178572414 Change-Id: I555c8ebcf5a3a5d8e3ab881ab9aa507f325285c2 Signed-off-by: Pavankumar Kondeti <pkondeti@codeaurora.org>	2021-10-12 15:36:44 -07:00
Lee Jones	7889eed917	Merge `54a728dc5e` ("Merge tag 'sched-core-2021-06-28' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip") into android-mainline A little step towards 5.14-rc1 Signed-off-by: Lee Jones <lee.jones@linaro.org> Change-Id: I2573a6df9f4e7b67194327ac6db6082a574d2809	2021-07-09 10:55:21 +01:00
Vincent Donnefort	fecfcbc288	sched/rt: Fix RT utilization tracking during policy change RT keeps track of the utilization on a per-rq basis with the structure avg_rt. This utilization is updated during task_tick_rt(), put_prev_task_rt() and set_next_task_rt(). However, when the current running task changes its policy, set_next_task_rt() which would usually take care of updating the utilization when the rq starts running RT tasks, will not see a such change, leaving the avg_rt structure outdated. When that very same task will be dequeued later, put_prev_task_rt() will then update the utilization, based on a wrong last_update_time, leading to a huge spike in the RT utilization signal. The signal would eventually recover from this issue after few ms. Even if no RT tasks are run, avg_rt is also updated in __update_blocked_others(). But as the CPU capacity depends partly on the avg_rt, this issue has nonetheless a significant impact on the scheduler. Fix this issue by ensuring a load update when a running task changes its policy to RT. Fixes: `371bf427` ("sched/rt: Add rt_rq utilization tracking") Signed-off-by: Vincent Donnefort <vincent.donnefort@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Vincent Guittot <vincent.guittot@linaro.org> Link: https://lore.kernel.org/r/1624271872-211872-2-git-send-email-vincent.donnefort@arm.com	2021-06-22 16:41:59 +02:00
Peter Zijlstra	21f56ffe44	sched: Introduce sched_class::pick_task() Because sched_class::pick_next_task() also implies sched_class::set_next_task() (and possibly put_prev_task() and newidle_balance) it is not state invariant. This makes it unsuitable for remote task selection. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> [Vineeth: folded fixes] Signed-off-by: Vineeth Remanan Pillai <viremana@linux.microsoft.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Tested-by: Don Hiatt <dhiatt@digitalocean.com> Tested-by: Hongyu Ning <hongyu.ning@linux.intel.com> Tested-by: Vincent Guittot <vincent.guittot@linaro.org> Link: https://lkml.kernel.org/r/20210422123308.437092775@infradead.org	2021-05-12 11:43:28 +02:00
Peter Zijlstra	5cb9eaa3d2	sched: Wrap rq::lock access In preparation of playing games with rq->lock, abstract the thing using an accessor. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Tested-by: Don Hiatt <dhiatt@digitalocean.com> Tested-by: Hongyu Ning <hongyu.ning@linux.intel.com> Tested-by: Vincent Guittot <vincent.guittot@linaro.org> Link: https://lkml.kernel.org/r/20210422123308.136465446@infradead.org	2021-05-12 11:43:26 +02:00
Lee Jones	4797acfb9c	Merge `16b3d0cf5b` Merge tag 'sched-core-2021-04-28' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip into android-mainline A little step en route to v5.13-rc1 Signed-off-by: Lee Jones <lee.jones@linaro.org> Change-Id: Ic2fb8aa220023572c96907aebce0a675333ef29f	2021-05-10 10:28:52 +01:00
Ingo Molnar	3b03706fa6	sched: Fix various typos Fix ~42 single-word typos in scheduler code comments. We have accumulated a few fun ones over the years. :-) Signed-off-by: Ingo Molnar <mingo@kernel.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Mike Galbraith <efault@gmx.de> Cc: Juri Lelli <juri.lelli@redhat.com> Cc: Vincent Guittot <vincent.guittot@linaro.org> Cc: Dietmar Eggemann <dietmar.eggemann@arm.com> Cc: Steven Rostedt <rostedt@goodmis.org> Cc: Ben Segall <bsegall@google.com> Cc: Mel Gorman <mgorman@suse.de> Cc: linux-kernel@vger.kernel.org	2021-03-22 00:11:52 +01:00
Greg Kroah-Hartman	1e5781383b	Merge `657bd90c93` ("Merge tag 'sched-core-2021-02-17' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip") into android-mainline Steps on the way to 5.12-rc1 Resolves conflicts in: kernel/sched/cpufreq_schedutil.c Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I06d90f919467f3e7e8970aaedbb872a10eb699ff	2021-03-03 15:29:15 +01:00
Hui Su	65bcf072e2	sched: Use task_current() instead of 'rq->curr == p' Use the task_current() function where appropriate. No functional change. Signed-off-by: Hui Su <sh_def@163.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Link: https://lkml.kernel.org/r/20201030173223.GA52339@rlk	2021-01-14 11:20:11 +01:00
Greg Kroah-Hartman	4196c1dafc	Revert "ANDROID: sched: avoid migrating when softint on tgt cpu should be short" This reverts commit `8d19443b0b` as the softirq code is rewritten in 5.11-rc1 and massive merge conflicts are happening. If this change is still needed, please work with upstream to get the patches accepted so they can then come into this tree automatically. Bug: 31752786 Bug: 168521633 Cc: John Dias <joaodias@google.com> Cc: J. Avila <elavila@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I6875407b586f505c2045e4cf40682831b4fceac1	2020-12-18 14:43:05 +01:00
Quentin Perret	0dd08d5801	Merge `adb35e8dc9` ("Merge tag 'sched-core-2020-12-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip") into android-mainline Now that CPU pause is gone, this is a lot more manageable. The remaining conflicts were caused mostly by vendor hooks and Android-specific tweaks to the EAS topology code, but easily fixable by hand. Signed-off-by: Quentin Perret <qperret@google.com> Change-Id: I3665a2d78cb0b8eca6ba5110e90dc7f72030805e	2020-12-16 09:09:52 +00:00
Satya Durga Srinivasu Prabhala	ebce8ec6bd	ANDROID: sched: rt: rearrange invocation of find_lowest_rq() vendor hook Right now, invocation of find_lowest_rq() vendor hook is made before error checks and also, cpupri_find() isn't exported either. It would be appropriate to move invocation of find_lowest_rq() vendor hook after error checks are done & calling cpupri_find(). Bug: 173559623 Change-Id: I298dffd39be0451b0b154930ace4e16763c6e78d Signed-off-by: Satya Durga Srinivasu Prabhala <satyap@codeaurora.org>	2020-11-20 09:46:29 +00:00
John Dias	8d19443b0b	ANDROID: sched: avoid migrating when softint on tgt cpu should be short The scheduling change to avoid putting RT threads on cores that are handling softint's was catching cases where there was no reason to believe the softint would take a long time, resulting in unnecessary migration overhead. This patch reduces the migration to cases where the core has a softint that is actually likely to take a long time, as opposed to the RCU, SCHED, and TIMER softints that are rather quick. Bug: 31752786 Bug: 168521633 Change-Id: Ib4e179f1e15c736b2fdba31070494e357e9fbbe2 Signed-off-by: John Dias <joaodias@google.com> [elavila: Amend commit text for AOSP, port to mainline] Signed-off-by: J. Avila <elavila@google.com>	2020-11-10 19:07:11 +00:00
John Dias	3adfd8e344	ANDROID: sched: avoid placing RT threads on cores handling softirqs In certain audio use cases, scheduling RT threads on cores that are handling softirqs can lead to glitches. Prevent this behavior. Bug: 31501544 Bug: 168521633 Change-Id: I99dd7aaa12c11270b28dbabea484bcc8fb8ba0c1 Signed-off-by: John Dias <joaodias@google.com> [elavila: Port to mainline, amend commit text] Signed-off-by: J. Avila <elavila@google.com>	2020-11-10 19:07:11 +00:00
Valentin Schneider	3aef1551e9	sched: Remove select_task_rq()'s sd_flag parameter Only select_task_rq_fair() uses that parameter to do an actual domain search, other classes only care about what kind of wakeup is happening (fork, exec, or "regular") and thus just translate the flag into a wakeup type. WF_TTWU and WF_EXEC have just been added, use these along with WF_FORK to encode the wakeup types we care about. For select_task_rq_fair(), we can simply use the shiny new WF_flag : SD_flag mapping. Signed-off-by: Valentin Schneider <valentin.schneider@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20201102184514.2733-3-valentin.schneider@arm.com	2020-11-10 18:39:06 +01:00
Peter Zijlstra	12fa97c64d	Merge branch 'sched/migrate-disable'	2020-11-10 18:39:04 +01:00
Peter Zijlstra	a7c81556ec	sched: Fix migrate_disable() vs rt/dl balancing In order to minimize the interference of migrate_disable() on lower priority tasks, which can be deprived of runtime due to being stuck below a higher priority task. Teach the RT/DL balancers to push away these higher priority tasks when a lower priority task gets selected to run on a freshly demoted CPU (pull). This adds migration interference to the higher priority task, but restores bandwidth to system that would otherwise be irrevocably lost. Without this it would be possible to have all tasks on the system stuck on a single CPU, each task preempted in a migrate_disable() section with a single high priority task running. This way we can still approximate running the M highest priority tasks on the system. Migrating the top task away is (ofcourse) still subject to migrate_disable() too, which means the lower task is subject to an interference equivalent to the worst case migrate_disable() section. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Daniel Bristot de Oliveira <bristot@redhat.com> Link: https://lkml.kernel.org/r/20201023102347.499155098@infradead.org	2020-11-10 18:39:01 +01:00
Peter Zijlstra	95158a89dd	sched,rt: Use the full cpumask for balancing We want migrate_disable() tasks to get PULLs in order for them to PUSH away the higher priority task. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Valentin Schneider <valentin.schneider@arm.com> Reviewed-by: Daniel Bristot de Oliveira <bristot@redhat.com> Link: https://lkml.kernel.org/r/20201023102347.310519774@infradead.org	2020-11-10 18:39:00 +01:00
Peter Zijlstra	14e292f8d4	sched,rt: Use cpumask_any_distribute() Replace a bunch of cpumask_any() instances with cpumask_any*_distribute(), by injecting this little bit of random in cpu selection, we reduce the chance two competing balance operations working off the same lowest_mask pick the same CPU. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Valentin Schneider <valentin.schneider@arm.com> Reviewed-by: Daniel Bristot de Oliveira <bristot@redhat.com> Link: https://lkml.kernel.org/r/20201023102347.190759694@infradead.org	2020-11-10 18:39:00 +01:00
Peter Zijlstra	120455c514	sched: Fix hotplug vs CPU bandwidth control Since we now migrate tasks away before DYING, we should also move bandwidth unthrottle, otherwise we can gain tasks from unthrottle after we expect all tasks to be gone already. Also; it looks like the RT balancers don't respect cpu_active() and instead rely on rq->online in part, complete this. This too requires we do set_rq_offline() earlier to match the cpu_active() semantics. (The bigger patch is to convert RT to cpu_active() entirely) Since set_rq_online() is called from sched_cpu_activate(), place set_rq_offline() in sched_cpu_deactivate(). Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Valentin Schneider <valentin.schneider@arm.com> Reviewed-by: Daniel Bristot de Oliveira <bristot@redhat.com> Link: https://lkml.kernel.org/r/20201023102346.639538965@infradead.org	2020-11-10 18:38:59 +01:00
Sai Harshini Nimmala	bf3d991a7d	ANDROID: sched: Add trace hook for rt throttle dump Create a trace hook when RT tasks are throttled. This allows vendors to debug long RT runs. Bug: 172264047 Change-Id: I534959f8e8d714463aac2f9f1c5627d2e735f543 Signed-off-by: Sai Harshini Nimmala <snimmala@codeaurora.org>	2020-11-05 19:50:27 +00:00
Peter Zijlstra	43c31ac0e6	sched: Remove relyance on STRUCT_ALIGNMENT Florian reported that all of kernel/sched/ is rebuild when CONFIG_BLK_DEV_INITRD is changed, which, while not a bug is unexpected. This is due to us including vmlinux.lds.h. Jakub explained that the problem is that we put the alignment requirement on the type instead of on a variable. Type alignment is a minimum, the compiler is free to pick any larger alignment for a specific instance of the type (eg. the variable). So force the type alignment on all individual variable definitions and remove the undesired dependency on vmlinux.lds.h. Fixes: `85c2ce9104` ("sched, vmlinux.lds: Increase STRUCT_ALIGNMENT to 64 bytes for GCC-4.9") Reported-by: Florian Fainelli <f.fainelli@gmail.com> Suggested-by: Jakub Jelinek <jakub@redhat.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>	2020-10-29 11:00:32 +01:00
Peter Zijlstra	934fc3314b	sched/cpupri: Remap CPUPRI_NORMAL to MAX_RT_PRIO-1 This makes the mapping continuous and frees up 100 for other usage. Prev mapping: p->rt_priority p->prio newpri cpupri -1 -1 (CPUPRI_INVALID) 100 0 (CPUPRI_NORMAL) 1 98 98 1 ... 49 50 50 49 50 49 49 50 ... 99 0 0 99 New mapping: p->rt_priority p->prio newpri cpupri -1 -1 (CPUPRI_INVALID) 99 0 (CPUPRI_NORMAL) 1 98 98 1 ... 49 50 50 49 50 49 49 50 ... 99 0 0 99 Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Dietmar Eggemann <dietmar.eggemann@arm.com>	2020-10-29 11:00:30 +01:00
Greg Kroah-Hartman	67d3ed5765	Merge 'v5.10-rc1' into android-mainline Linux 5.10-rc1 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: Iace3fc84a00d3023c75caa086a266de17dc1847c	2020-10-29 06:32:38 +01:00
Joe Perches	33def8498f	treewide: Convert macro and uses of __section(foo) to __section("foo") Use a more generic form for __section that requires quotes to avoid complications with clang and gcc differences. Remove the quote operator # from compiler_attributes.h __section macro. Convert all unquoted __section(foo) uses to quoted __section("foo"). Also convert __attribute__((section("foo"))) uses to __section("foo") even if the __attribute__ has multiple list entry forms. Conversion done using the script at: https://lore.kernel.org/lkml/75393e5ddc272dc7403de74d645e6c6e0f4e70eb.camel@perches.com/2-convert_section.pl Signed-off-by: Joe Perches <joe@perches.com> Reviewed-by: Nick Desaulniers <ndesaulniers@gooogle.com> Reviewed-by: Miguel Ojeda <ojeda@kernel.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2020-10-25 14:51:49 -07:00
Greg Kroah-Hartman	00d6a8a7ee	Merge `e4cbce4d13` ("Merge tag 'sched-core-2020-08-03' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip") into android-mainline Baby steps for 5.9-rc1 Resolves some kernel/sched/ merge issues. Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I88cf5411ac7251f9795d9c50cb18b0df5bf0bcd6	2020-08-07 14:17:39 +02:00
Park Bumgyu	16330b3560	ANDROID: Add vendor hooks to the scheduler Add vendor hooks for vendor-specific scheduling. android_rvh_select_task_rq_rt: To perform vendor-specific RT task placement. android_rvh_select_fallback_rq: To restrict cpu usage. android_rvh_scheduler_tick: To collect periodic scheduling information and to schedule tasks. android_rvh_enqueue_tas/android_rvh_dequeue_task: For vendor to be aware of the task schedule in/out. android_rvh_can_migrate_task: To limit task migration based on vendor requirements. android_rvh_find_lowest_rq: To find the lowest rq for RT task with vendor-specific way. Bug: 155241766 Change-Id: I926458b0a911d564e5932e200125b12406c2deee Signed-off-by: Park Bumgyu <bumgyu.park@samsung.com>	2020-07-17 14:38:05 +00:00
Steven Rostedt (VMware)	a87e749e8f	sched: Remove struct sched_class::next field Now that the sched_class descriptors are defined in order via the linker script vmlinux.lds.h, there's no reason to have a "next" pointer to the previous priroity structure. The order of the sturctures can be aligned as an array, and used to index and find the next sched_class descriptor. Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20191219214558.845353593@goodmis.org	2020-06-25 13:45:44 +02:00
Steven Rostedt (VMware)	590d697963	sched: Force the address order of each sched class descriptor In order to make a micro optimization in pick_next_task(), the order of the sched class descriptor address must be in the same order as their priority to each other. That is: &idle_sched_class < &fair_sched_class < &rt_sched_class < &dl_sched_class < &stop_sched_class In order to guarantee this order of the sched class descriptors, add each one into their own data section and force the order in the linker script. Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lore.kernel.org/r/157675913272.349305.8936736338884044103.stgit@localhost.localdomain	2020-06-25 13:45:43 +02:00
Linus Torvalds	cb8e59cc87	Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next Pull networking updates from David Miller: 1) Allow setting bluetooth L2CAP modes via socket option, from Luiz Augusto von Dentz. 2) Add GSO partial support to igc, from Sasha Neftin. 3) Several cleanups and improvements to r8169 from Heiner Kallweit. 4) Add IF_OPER_TESTING link state and use it when ethtool triggers a device self-test. From Andrew Lunn. 5) Start moving away from custom driver versions, use the globally defined kernel version instead, from Leon Romanovsky. 6) Support GRO vis gro_cells in DSA layer, from Alexander Lobakin. 7) Allow hard IRQ deferral during NAPI, from Eric Dumazet. 8) Add sriov and vf support to hinic, from Luo bin. 9) Support Media Redundancy Protocol (MRP) in the bridging code, from Horatiu Vultur. 10) Support netmap in the nft_nat code, from Pablo Neira Ayuso. 11) Allow UDPv6 encapsulation of ESP in the ipsec code, from Sabrina Dubroca. Also add ipv6 support for espintcp. 12) Lots of ReST conversions of the networking documentation, from Mauro Carvalho Chehab. 13) Support configuration of ethtool rxnfc flows in bcmgenet driver, from Doug Berger. 14) Allow to dump cgroup id and filter by it in inet_diag code, from Dmitry Yakunin. 15) Add infrastructure to export netlink attribute policies to userspace, from Johannes Berg. 16) Several optimizations to sch_fq scheduler, from Eric Dumazet. 17) Fallback to the default qdisc if qdisc init fails because otherwise a packet scheduler init failure will make a device inoperative. From Jesper Dangaard Brouer. 18) Several RISCV bpf jit optimizations, from Luke Nelson. 19) Correct the return type of the ->ndo_start_xmit() method in several drivers, it's netdev_tx_t but many drivers were using 'int'. From Yunjian Wang. 20) Add an ethtool interface for PHY master/slave config, from Oleksij Rempel. 21) Add BPF iterators, from Yonghang Song. 22) Add cable test infrastructure, including ethool interfaces, from Andrew Lunn. Marvell PHY driver is the first to support this facility. 23) Remove zero-length arrays all over, from Gustavo A. R. Silva. 24) Calculate and maintain an explicit frame size in XDP, from Jesper Dangaard Brouer. 25) Add CAP_BPF, from Alexei Starovoitov. 26) Support terse dumps in the packet scheduler, from Vlad Buslov. 27) Support XDP_TX bulking in dpaa2 driver, from Ioana Ciornei. 28) Add devm_register_netdev(), from Bartosz Golaszewski. 29) Minimize qdisc resets, from Cong Wang. 30) Get rid of kernel_getsockopt and kernel_setsockopt in order to eliminate set_fs/get_fs calls. From Christoph Hellwig. * git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next: (2517 commits) selftests: net: ip_defrag: ignore EPERM net_failover: fixed rollback in net_failover_open() Revert "tipc: Fix potential tipc_aead refcnt leak in tipc_crypto_rcv" Revert "tipc: Fix potential tipc_node refcnt leak in tipc_rcv" vmxnet3: allow rx flow hash ops only when rss is enabled hinic: add set_channels ethtool_ops support selftests/bpf: Add a default $(CXX) value tools/bpf: Don't use $(COMPILE.c) bpf, selftests: Use bpf_probe_read_kernel s390/bpf: Use bcr 0,%0 as tail call nop filler s390/bpf: Maintain 8-byte stack alignment selftests/bpf: Fix verifier test selftests/bpf: Fix sample_cnt shared between two threads bpf, selftests: Adapt cls_redirect to call csum_level helper bpf: Add csum_level helper for fixing up csum levels bpf: Fix up bpf_skb_adjust_room helper's skb csum setting sfc: add missing annotation for efx_ef10_try_update_nic_stats_vf() crypto/chtls: IPv6 support for inline TLS Crypto/chcr: Fixes a coccinile check error Crypto/chcr: Fixes compilations warnings ...	2020-06-03 16:27:18 -07:00
Huaixin Chang	d505b8af58	sched: Defend cfs and rt bandwidth quota against overflow When users write some huge number into cpu.cfs_quota_us or cpu.rt_runtime_us, overflow might happen during to_ratio() shifts of schedulable checks. to_ratio() could be altered to avoid unnecessary internal overflow, but min_cfs_quota_period is less than 1 << BW_SHIFT, so a cutoff would still be needed. Set a cap MAX_BW for cfs_quota_us and rt_runtime_us to prevent overflow. Signed-off-by: Huaixin Chang <changhuaixin@linux.alibaba.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Ben Segall <bsegall@google.com> Link: https://lkml.kernel.org/r/20200425105248.60093-1-changhuaixin@linux.alibaba.com	2020-05-19 20:34:14 +02:00
Christoph Hellwig	32927393dc	sysctl: pass kernel pointers to ->proc_handler Instead of having all the sysctl handlers deal with user pointers, which is rather hairy in terms of the BPF interaction, copy the input to and from userspace in common code. This also means that the strings are always NUL-terminated by the common code, making the API a little bit safer. As most handler just pass through the data to one of the common handlers a lot of the changes are mechnical. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Andrey Ignatov <rdna@fb.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2020-04-27 02:07:40 -04:00
Qais Yousef	d94a9df490	sched/rt: Remove unnecessary push for unfit tasks In task_woken_rt() and switched_to_rto() we try trigger push-pull if the task is unfit. But the logic is found lacking because if the task was the only one running on the CPU, then rt_rq is not in overloaded state and won't trigger a push. The necessity of this logic was under a debate as well, a summary of the discussion can be found in the following thread: https://lore.kernel.org/lkml/20200226160247.iqvdakiqbakk2llz@e107158-lin.cambridge.arm.com/ Remove the logic for now until a better approach is agreed upon. Signed-off-by: Qais Yousef <qais.yousef@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Fixes: `804d402fb6` ("sched/rt: Make RT capacity-aware") Link: https://lkml.kernel.org/r/20200302132721.8353-6-qais.yousef@arm.com	2020-03-06 12:57:29 +01:00
Qais Yousef	98ca645f82	sched/rt: Allow pulling unfitting task When implemented RT Capacity Awareness; the logic was done such that if a task was running on a fitting CPU, then it was sticky and we would try our best to keep it there. But as Steve suggested, to adhere to the strict priority rules of RT class; allow pulling an RT task to unfitting CPU to ensure it gets a chance to run ASAP. LINK: https://lore.kernel.org/lkml/20200203111451.0d1da58f@oasis.local.home/ Suggested-by: Steven Rostedt <rostedt@goodmis.org> Signed-off-by: Qais Yousef <qais.yousef@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Fixes: `804d402fb6` ("sched/rt: Make RT capacity-aware") Link: https://lkml.kernel.org/r/20200302132721.8353-5-qais.yousef@arm.com	2020-03-06 12:57:28 +01:00
Qais Yousef	a1bd02e1f2	sched/rt: Optimize cpupri_find() on non-heterogenous systems By introducing a new cpupri_find_fitness() function that takes the fitness_fn as an argument and only called when asym_system static key is enabled. cpupri_find() is now a wrapper function that calls cpupri_find_fitness() passing NULL as a fitness_fn, hence disabling the logic that handles fitness by default. LINK: https://lore.kernel.org/lkml/c0772fca-0a4b-c88d-fdf2-5715fcf8447b@arm.com/ Reported-by: Dietmar Eggemann <dietmar.eggemann@arm.com> Signed-off-by: Qais Yousef <qais.yousef@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Fixes: `804d402fb6` ("sched/rt: Make RT capacity-aware") Link: https://lkml.kernel.org/r/20200302132721.8353-4-qais.yousef@arm.com	2020-03-06 12:57:27 +01:00
Qais Yousef	b28bc1e002	sched/rt: Re-instate old behavior in select_task_rq_rt() When RT Capacity Aware support was added, the logic in select_task_rq_rt was modified to force a search for a fitting CPU if the task currently doesn't run on one. But if the search failed, and the search was only triggered to fulfill the fitness request; we could end up selecting a new CPU unnecessarily. Fix this and re-instate the original behavior by ensuring we bail out in that case. This behavior change only affected asymmetric systems that are using util_clamp to implement capacity aware. None asymmetric systems weren't affected. LINK: https://lore.kernel.org/lkml/20200218041620.GD28029@codeaurora.org/ Reported-by: Pavan Kondeti <pkondeti@codeaurora.org> Signed-off-by: Qais Yousef <qais.yousef@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Fixes: `804d402fb6` ("sched/rt: Make RT capacity-aware") Link: https://lkml.kernel.org/r/20200302132721.8353-3-qais.yousef@arm.com	2020-03-06 12:57:27 +01:00
Konstantin Khlebnikov	b4fb015eef	sched/rt: Optimize checking group RT scheduler constraints Group RT scheduler contains protection against setting zero runtime for cgroup with RT tasks. Right now function tg_set_rt_bandwidth() iterates over all CPU cgroups and calls tg_has_rt_tasks() for any cgroup which runtime is zero (not only for changed one). Default RT runtime is zero, thus tg_has_rt_tasks() will is called for almost at CPU cgroups. This protection already is slightly racy: runtime limit could be changed between cpu_cgroup_can_attach() and cpu_cgroup_attach() because changing cgroup attribute does not lock cgroup_mutex while attach does not lock rt_constraints_mutex. Changing task scheduler class also races with changing rt runtime: check in __sched_setscheduler() isn't protected. Function tg_has_rt_tasks() iterates over all threads in the system. This gives NR_CGROUPS * NR_TASKS operations under single tasklist_lock locked for read tg_set_rt_bandwidth(). Any concurrent attempt of locking tasklist_lock for write (for example fork) will stuck with disabled irqs. This patch makes two optimizations: 1) Remove locking tasklist_lock and iterate only tasks in cgroup 2) Call tg_has_rt_tasks() iff rt runtime changes from non-zero to zero All changed code is under CONFIG_RT_GROUP_SCHED. Testcase: # mkdir /sys/fs/cgroup/cpu/test{1..10000} # echo 0 \| tee /sys/fs/cgroup/cpu/test*/cpu.rt_runtime_us At the same time without patch fork time will be >100ms: # perf trace -e clone --duration 100 stress-ng --fork 1 Also remote ping will show timings >100ms caused by irq latency. Signed-off-by: Konstantin Khlebnikov <khlebnikov@yandex-team.ru> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Link: https://lkml.kernel.org/r/157996383820.4651.11292439232549211693.stgit@buzz	2020-01-28 21:37:09 +01:00
Qais Yousef	804d402fb6	sched/rt: Make RT capacity-aware Capacity Awareness refers to the fact that on heterogeneous systems (like Arm big.LITTLE), the capacity of the CPUs is not uniform, hence when placing tasks we need to be aware of this difference of CPU capacities. In such scenarios we want to ensure that the selected CPU has enough capacity to meet the requirement of the running task. Enough capacity means here that capacity_orig_of(cpu) >= task.requirement. The definition of task.requirement is dependent on the scheduling class. For CFS, utilization is used to select a CPU that has >= capacity value than the cfs_task.util. capacity_orig_of(cpu) >= cfs_task.util DL isn't capacity aware at the moment but can make use of the bandwidth reservation to implement that in a similar manner CFS uses utilization. The following patchset implements that: https://lore.kernel.org/lkml/20190506044836.2914-1-luca.abeni@santannapisa.it/ capacity_orig_of(cpu)/SCHED_CAPACITY >= dl_deadline/dl_runtime For RT we don't have a per task utilization signal and we lack any information in general about what performance requirement the RT task needs. But with the introduction of uclamp, RT tasks can now control that by setting uclamp_min to guarantee a minimum performance point. ATM the uclamp value are only used for frequency selection; but on heterogeneous systems this is not enough and we need to ensure that the capacity of the CPU is >= uclamp_min. Which is what implemented here. capacity_orig_of(cpu) >= rt_task.uclamp_min Note that by default uclamp.min is 1024, which means that RT tasks will always be biased towards the big CPUs, which make for a better more predictable behavior for the default case. Must stress that the bias acts as a hint rather than a definite placement strategy. For example, if all big cores are busy executing other RT tasks we can't guarantee that a new RT task will be placed there. On non-heterogeneous systems the original behavior of RT should be retained. Similarly if uclamp is not selected in the config. [ mingo: Minor edits to comments. ] Signed-off-by: Qais Yousef <qais.yousef@arm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Dietmar Eggemann <dietmar.eggemann@arm.com> Reviewed-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Link: https://lkml.kernel.org/r/20191009104611.15363-1-qais.yousef@arm.com Signed-off-by: Ingo Molnar <mingo@kernel.org>	2019-12-25 10:42:10 +01:00
Peter Zijlstra	a0e813f26e	sched/core: Further clarify sched_class::set_next_task() It turns out there really is something special to the first set_next_task() invocation. In specific the 'change' pattern really should not cause balance callbacks. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: bsegall@google.com Cc: dietmar.eggemann@arm.com Cc: juri.lelli@redhat.com Cc: ktkhai@virtuozzo.com Cc: mgorman@suse.de Cc: qais.yousef@arm.com Cc: qperret@google.com Cc: rostedt@goodmis.org Cc: valentin.schneider@arm.com Cc: vincent.guittot@linaro.org Fixes: `f95d4eaee6` ("sched/{rt,deadline}: Fix set_next_task vs pick_next_task") Link: https://lkml.kernel.org/r/20191108131909.775434698@infradead.org Signed-off-by: Ingo Molnar <mingo@kernel.org>	2019-11-11 08:35:21 +01:00
Peter Zijlstra	98c2f700ed	sched/core: Simplify sched_class::pick_next_task() Now that the indirect class call never uses the last two arguments of pick_next_task(), remove them. Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: bsegall@google.com Cc: dietmar.eggemann@arm.com Cc: juri.lelli@redhat.com Cc: ktkhai@virtuozzo.com Cc: mgorman@suse.de Cc: qais.yousef@arm.com Cc: qperret@google.com Cc: rostedt@goodmis.org Cc: valentin.schneider@arm.com Cc: vincent.guittot@linaro.org Link: https://lkml.kernel.org/r/20191108131909.660595546@infradead.org Signed-off-by: Ingo Molnar <mingo@kernel.org>	2019-11-11 08:35:20 +01:00
Peter Zijlstra	6e2df0581f	sched: Fix pick_next_task() vs 'change' pattern race Commit `67692435c4` ("sched: Rework pick_next_task() slow-path") inadvertly introduced a race because it changed a previously unexplored dependency between dropping the rq->lock and sched_class::put_prev_task(). The comments about dropping rq->lock, in for example newidle_balance(), only mentions the task being current and ->on_cpu being set. But when we look at the 'change' pattern (in for example sched_setnuma()): queued = task_on_rq_queued(p); /* p->on_rq == TASK_ON_RQ_QUEUED / running = task_current(rq, p); / rq->curr == p / if (queued) dequeue_task(...); if (running) put_prev_task(...); / change task properties */ if (queued) enqueue_task(...); if (running) set_next_task(...); It becomes obvious that if we do this after put_prev_task() has already been called on @p, things go sideways. This is exactly what the commit in question allows to happen when it does: prev->sched_class->put_prev_task(rq, prev, rf); if (!rq->nr_running) newidle_balance(rq, rf); The newidle_balance() call will drop rq->lock after we've called put_prev_task() and that allows the above 'change' pattern to interleave and mess up the state. Furthermore, it turns out we lost the RT-pull when we put the last DL task. Fix both problems by extracting the balancing from put_prev_task() and doing a multi-class balance() pass before put_prev_task(). Fixes: `67692435c4` ("sched: Rework pick_next_task() slow-path") Reported-by: Quentin Perret <qperret@google.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Tested-by: Quentin Perret <qperret@google.com> Tested-by: Valentin Schneider <valentin.schneider@arm.com>	2019-11-08 22:34:14 +01:00
Linus Torvalds	7f2444d38f	Merge branch 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull core timer updates from Thomas Gleixner: "Timers and timekeeping updates: - A large overhaul of the posix CPU timer code which is a preparation for moving the CPU timer expiry out into task work so it can be properly accounted on the task/process. An update to the bogus permission checks will come later during the merge window as feedback was not complete before heading of for travel. - Switch the timerqueue code to use cached rbtrees and get rid of the homebrewn caching of the leftmost node. - Consolidate hrtimer_init() + hrtimer_init_sleeper() calls into a single function - Implement the separation of hrtimers to be forced to expire in hard interrupt context even when PREEMPT_RT is enabled and mark the affected timers accordingly. - Implement a mechanism for hrtimers and the timer wheel to protect RT against priority inversion and live lock issues when a (hr)timer which should be canceled is currently executing the callback. Instead of infinitely spinning, the task which tries to cancel the timer blocks on a per cpu base expiry lock which is held and released by the (hr)timer expiry code. - Enable the Hyper-V TSC page based sched_clock for Hyper-V guests resulting in faster access to timekeeping functions. - Updates to various clocksource/clockevent drivers and their device tree bindings. - The usual small improvements all over the place" * 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (101 commits) posix-cpu-timers: Fix permission check regression posix-cpu-timers: Always clear head pointer on dequeue hrtimer: Add a missing bracket and hide `migration_base' on !SMP posix-cpu-timers: Make expiry_active check actually work correctly posix-timers: Unbreak CONFIG_POSIX_TIMERS=n build tick: Mark sched_timer to expire in hard interrupt context hrtimer: Add kernel doc annotation for HRTIMER_MODE_HARD x86/hyperv: Hide pv_ops access for CONFIG_PARAVIRT=n posix-cpu-timers: Utilize timerqueue for storage posix-cpu-timers: Move state tracking to struct posix_cputimers posix-cpu-timers: Deduplicate rlimit handling posix-cpu-timers: Remove pointless comparisons posix-cpu-timers: Get rid of 64bit divisions posix-cpu-timers: Consolidate timer expiry further posix-cpu-timers: Get rid of zero checks rlimit: Rewrite non-sensical RLIMIT_CPU comment posix-cpu-timers: Respect INFINITY for hard RTTIME limit posix-cpu-timers: Switch thread group sampling to array posix-cpu-timers: Restructure expiry array posix-cpu-timers: Remove cputime_expires ...	2019-09-17 12:35:15 -07:00
Thomas Gleixner	3a245c0f11	posix-cpu-timers: Move expiry cache into struct posix_cputimers The expiry cache belongs into the posix_cputimers container where the other cpu timers information is. Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Reviewed-by: Frederic Weisbecker <frederic@kernel.org> Link: https://lkml.kernel.org/r/20190821192921.014444012@linutronix.de	2019-08-28 11:50:35 +02:00

1 2 3 4

196 Commits