MultiAgentDecisionProcess
AgentFullyObservable represents an agent that receives the true state, the joint observation and also the reward signal.
#include <AgentFullyObservable.h>
Public Member Functions
| virtual Index | Act (Index sI, Index joI, double reward)=0 |
| | Return an individual action based on state, last joint observation and reward. |
| | AgentFullyObservable (const PlanningUnitDecPOMDPDiscrete *pu, Index id) |
| | (default) Constructor. |
| | AgentFullyObservable (const AgentFullyObservable &a) |
| | Copy constructor. |
| | ~AgentFullyObservable () |
| | Destructor. |
Public Member Functions inherited from AgentDecPOMDPDiscrete
| | AgentDecPOMDPDiscrete (const PlanningUnitDecPOMDPDiscrete *pu, Index id) |
| | (default) Constructor. |
| | AgentDecPOMDPDiscrete (const AgentDecPOMDPDiscrete &a) |
| | Copy constructor. |
| const PlanningUnitDecPOMDPDiscrete * | GetPU () const |
Public Member Functions inherited from SimulationAgent
| virtual Index | GetIndex () const |
| | Retrieves the index of this agent. |
| virtual bool | GetVerbose () const |
| | If true, the agent will report more. |
| void | Print () const |
| | Print out some information about this agent. |
| virtual void | ResetEpisode ()=0 |
| | Will be called before an episode, to reinitialize the agent. |
| virtual void | SetIndex (Index id) |
| | Sets the index of this agent. |
| virtual void | SetVerbose (bool verbose) |
| | Set whether this agent should be verbose. |
| | SimulationAgent (Index id, bool verbose=false) |
| | (default) Constructor. |
| virtual std::string | SoftPrint () const |
| | Return some information about this agent. |
| virtual | ~SimulationAgent () |
| | Destructor. |
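| AgentFullyObservable (const PlanningUnitDecPOMDPDiscrete *pu, Index id) | inline |
(default) Constructor.
A derived class should first call the AgentDecPOMDPDiscrete constructor. Therefore, we do not need to pass along any arguments anymore here.
| AgentFullyObservable (const AgentFullyObservable &a) | inline |
Copy constructor.
| ~AgentFullyObservable () | inline |
Destructor.
| virtual Index Act (Index sI, Index joI, double reward) | pure virtual |
Return an individual action based on state, last joint observation and reward.
Implemented in AgentQLearner, AgentMDP, AgentRandom, and AgentOnlinePlanningMDP.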
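The pure virtual Act and ResetEpisode members mean that a concrete agent has to supply both. The following is a minimal, self-contained sketch of what such a subclass could look like. It does not use the toolbox headers: `FullyObservableAgentSketch` and `TablePolicyAgent` are hypothetical stand-ins, `Index` is assumed here to be an unsigned integer type, and the `PlanningUnitDecPOMDPDiscrete` pointer taken by the real constructor is left out so that the example compiles on its own.

```cpp
#include <cassert>
#include <vector>

// Assumption: stand-in for the toolbox's Index type.
typedef unsigned int Index;

// Hypothetical interface that mirrors the documented pure virtual members.
// The real AgentFullyObservable constructor also takes a planning unit
// pointer, which is omitted in this sketch.
class FullyObservableAgentSketch {
public:
    explicit FullyObservableAgentSketch(Index id) : _m_id(id) {}
    virtual ~FullyObservableAgentSketch() {}

    // Return an individual action based on state, last joint observation
    // and reward.
    virtual Index Act(Index sI, Index joI, double reward) = 0;

    // Called before an episode to reinitialize the agent.
    virtual void ResetEpisode() = 0;

    Index GetIndex() const { return _m_id; }

private:
    Index _m_id;
};

// Hypothetical agent: looks up its action in a fixed state-to-action table
// and sums the rewards it is handed during an episode.
class TablePolicyAgent : public FullyObservableAgentSketch {
public:
    TablePolicyAgent(Index id, const std::vector<Index>& actionPerState)
        : FullyObservableAgentSketch(id),
          _m_policy(actionPerState),
          _m_return(0.0)
    {
        assert(!_m_policy.empty());
    }

    Index Act(Index sI, Index joI, double reward) override {
        (void)joI;                      // the true state is known, so the
                                        // joint observation is not used here
        _m_return += reward;            // bookkeeping of the reward signal
        assert(sI < _m_policy.size());  // state index must be in the table
        return _m_policy[sI];
    }

    void ResetEpisode() override { _m_return = 0.0; }

    double GetReturn() const { return _m_return; }

private:
    std::vector<Index> _m_policy;  // action index for each state index
    double _m_return;              // sum of rewards since the last reset
};
```

A simulation loop would typically call ResetEpisode once before each episode and then call Act once per time step, passing the current state index, the last joint observation index and the last reward. An agent that learns, rather than following a fixed table, would use the reward argument to update its estimates instead of only summing it.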