Systems and methods are described herein for determining a level of user engagement based on user interactions. A media guidance application is configured to retrieve a first record of a first plurality of user inputs, including input type, received during consumption of a media asset. A second time interval, prior to the first time interval, is selected by shifting the first start time and the first end time. A second record of a second plurality of user inputs received during the second time interval, including input type, is retrieved. A first media asset consumed during the first time interval and a second media asset consumed during the second time interval are determined, and the level of user engagement for the first media asset is determined based on the first record and the second record.